
What is the difference between an Oozie workflow, coordinator, and bundle?


What is the difference between an Oozie workflow, a coordinator, and a bundle?

An Oozie workflow defines a sequence of actions, and we need to invoke it manually every time we want it to run, whereas the same workflow can be scheduled through a coordinator. Is this understanding correct?

Then what does a bundle add on top of that?

I guess it is used to schedule a set of coordinators. Then why can't one coordinator be used to schedule another coordinator, the way one workflow can contain a sub-workflow?


Solution

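  • Workflow:

    A workflow is a set of actions arranged in a control-dependency graph (a DAG). The actions usually run one after another, but decision, fork, and join nodes allow branching and parallel paths. It is written in XML, and the actions can be MapReduce, Hive, Pig, etc.

  • Coordinator:

    A coordinator is a job that triggers actions (commonly workflow jobs) when a set of conditions is met. The conditions can be a time frequency, an external event such as the availability of input data, or both.

  • Bundle:

    A bundle is a higher-level Oozie abstraction that batches a set of coordinator jobs so that they can be started, suspended, resumed, and killed together. We can also specify a kick-off time for the bundle job to start.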
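
To make the three layers concrete, here is a minimal sketch of one definition file per layer. All names (example-wf, hive-step, load_data.hql, example-coord, coord-a, coord-b), the dates, and the properties ${jobTracker}, ${nameNode}, ${workflowAppUri}, ${coordAppPathA}, and ${coordAppPathB} are hypothetical placeholders that would be supplied in job.properties. The schema versions shown are assumptions and may need adjusting to your Oozie release.

A workflow.xml with a single Hive action:

```xml
<workflow-app name="example-wf" xmlns="uri:oozie:workflow:0.5">
    <start to="hive-step"/>
    <!-- One action node; on success go to "end", on failure go to "fail" -->
    <action name="hive-step">
        <hive xmlns="uri:oozie:hive-action:0.5">
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <script>load_data.hql</script>
        </hive>
        <ok to="end"/>
        <error to="fail"/>
    </action>
    <kill name="fail">
        <message>Action failed: [${wf:errorMessage(wf:lastErrorNode())}]</message>
    </kill>
    <end name="end"/>
</workflow-app>
```

A coordinator.xml that runs that workflow once a day:

```xml
<coordinator-app name="example-coord" frequency="${coord:days(1)}"
                 start="2015-01-01T00:00Z" end="2015-12-31T00:00Z"
                 timezone="UTC" xmlns="uri:oozie:coordinator:0.4">
    <action>
        <!-- The coordinator action points at the HDFS directory holding workflow.xml -->
        <workflow>
            <app-path>${workflowAppUri}</app-path>
        </workflow>
    </action>
</coordinator-app>
```

A bundle.xml that groups two coordinators and gives them a common kick-off time:

```xml
<bundle-app name="example-bundle" xmlns="uri:oozie:bundle:0.2">
    <controls>
        <!-- Time at which the bundle submits its coordinators -->
        <kick-off-time>2015-01-01T00:00Z</kick-off-time>
    </controls>
    <coordinator name="coord-a">
        <app-path>${coordAppPathA}</app-path>
    </coordinator>
    <coordinator name="coord-b">
        <app-path>${coordAppPathB}</app-path>
    </coordinator>
</bundle-app>
```

Note the nesting: a bundle references coordinators, a coordinator references a workflow, and only the workflow contains actions. Each layer is submitted with a job.properties file that points at the corresponding application path.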