i am a newbie to Airflow. i have some .jar jobs generated with Talend Open Studio for Big Data, and i want to schedule and manage those with Airflow my question is , does Airflow support .jar file or generated by TOS as DAG ? and if it does how ? or is there any alternative to run .jar on Airlow ?
im using Airflow v1.10.3 the jobs are mainly to extract and process data from a mongodb database then update the database with the new processed data.
Thanks !
Airflow will happily run .jar
files. There is a few examples kicking about for you to have a look at.
Running a standard .jar
file: run_jar.py
Running a "built" Talend jobl loan_application_data.py
Obviously with both these examples the .jar
or Talend file(s) will need to be on the server Airflow is executing on (as well as Java).