python anaconda jupyter jupyter-lab

Scheduling Jupyter Notebooks to run one after the other


I have 9 Jupyter notebooks in Anaconda that parse large txt files, validate their data, and write validation output to 9 different Excel workbooks for end users. Now that my algorithm is complete, I need to take all these notebooks into production. They need to run in a particular order, one after the other, without the user having to click the Run button anywhere (they have no knowledge of Anaconda or Jupyter).
My question is: is there an option for designing a workflow in Anaconda itself that will enable this? Or is there any other external option to make these notebooks run?

I tried using JupyterLab, but to no avail.


Solution

  • Hello, have you tried automating the execution of your Jupyter notebooks through Papermill? Papermill is a Python library that lets you parameterize and execute Jupyter notebooks. To run the notebooks in a specific order, you can either use a workflow management tool or write a script that uses Papermill to execute each notebook one after the other. You can also pass input parameters and output paths between the notebooks as needed. Regards, Juliano
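
The sequential script described in the answer could look roughly like the sketch below. It assumes Papermill is installed (`pip install papermill`). The helper names `run_in_order` and `papermill_execute`, the `_executed` file suffix, and the notebook names in the usage note are hypothetical and not part of the original answer. The only Papermill call used is `execute_notebook(input_path, output_path, parameters=...)`.

```python
from pathlib import Path
from typing import Callable, Dict, Iterable, List, Optional


def papermill_execute(input_path: str, output_path: str, parameters: Dict) -> None:
    """Run a single notebook with Papermill and save the executed copy."""
    # Imported lazily so this module can be loaded even where Papermill
    # is not installed (for example, when testing with a stand-in executor).
    import papermill as pm

    pm.execute_notebook(input_path, output_path, parameters=parameters)


def run_in_order(
    notebooks: Iterable[str],
    output_dir: str,
    parameters: Optional[Dict] = None,
    execute: Callable[[str, str, Dict], None] = papermill_execute,
) -> List[str]:
    """Execute notebooks one after the other, in the order given.

    Hypothetical helper: if a notebook raises an error, the exception
    propagates and the remaining notebooks are not run.
    Returns the paths of the executed notebook copies.
    """
    out_dir = Path(output_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    executed = []
    for nb in notebooks:
        # Write each result next to the others, e.g. "01_parse_executed.ipynb".
        out_path = out_dir / f"{Path(nb).stem}_executed.ipynb"
        execute(str(nb), str(out_path), dict(parameters or {}))
        executed.append(str(out_path))
    return executed
```

With this in a file, a call such as `run_in_order(["01_parse.ipynb", "02_validate.ipynb"], "executed")` (placeholder notebook names) would run the notebooks in list order. The end user then only needs something that starts the script, such as a scheduled task or a small launcher file, rather than opening Jupyter.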