google-app-engine cron google-compute-engine apache-beam dataflow

How to schedule Dataflow Job by running Google Compute Engine cron job

In the Dataflow FAQ, it is listed that running custom (cron) job processes on Compute Engine is a way to schedule dataflow pipelines. I am confused about how exactly that should be done: how to start the dataflow job on compute engine and start a cron job.

Thank you!

Solution

You can use the Google Cloud Scheduler to execute your Dataflow Job. On Cloud Scheduler you have targets, these could be HTTP/S endpoints, Pub/Sub topics, App Engine applications, you can use your Dataflow template as target. Review this external article to see an example: Schedule Your Dataflow Batch Jobs With Cloud Scheduler or if you want to add more services to the interacion: Scheduling Dataflow Pipeline using Cloud Run, PubSub and Cloud Scheduler.

Viewing deployed files app engine standard?
Hosting multiple customer websites on Google App Engine with Different custom domains pointing to different services
Failing to generate all the header and implementation classed for iOS
Enable Memcached on App Engine runtime PHP 7.2
Deploying an Angular app on Google Cloud with minimal costs
Permissions error fetching application - Gcloud app deploy
How to Mock a Google API Library with Python 3.7 for Unit Testing
GAE List tasks that start with a name?
How to install Google Cloud SDK app-engine-python on Debian 12?
Google App Engine showing only index.php file
GAE on java 21 throws NullPointerException at CookieCache.parseCookies()
FTP to Google Storage
Google Cloud logging, Python3.8 standard environment, group request related logs by trace id
Mac terminal ERROR 2002 (HY000): Can't connect to local MySQL server through socket '/tmp/mysql.sock' (2)
Access google cloud storage bucket from other project using python
How to include third party Python libraries in Google App Engine?
Scraping Python advice needed
How to perform web scraping to find specific linked pages in Java on Google App Engine?
Connecting to Endpoint API on Android Client
Deploying to GCP App Engine from terminal produces contradictory error messages
Issue when trying to deploy dotnet 8 web app to gcloud app engine flex
The resource 'projects/<my project>' was not found" error when trying to get list of running instances
Angular 17 web app deploy to Google App Engine not working
App Engine says "Your runtime version for ruby33 is past End of Support", but it isn't
Error during `gcloud app deploy` for GAE app: "Failed to create cloud build: invalid bucket"
Python admin only decorators
Google App Engine deployment fails because of failing readiness check
What is the difference between Google App Engine and Google Compute Engine?
gcloud app deploy eror The "vpcaccess.connectors.use" permission is required
Can't connect to localhost from Chrome extension