Search code examples
pythonamazon-web-servicesaws-glueaws-glue-data-catalog

Is it essential to have a connection in a aws glue job?


I have my data tables in the Glue Metadata catalog. I need to use this data in my glue job's python shell script. When I create the glue job it gives me the option to select the connection type at last. Is it essential to add a connection? If the tables are in glue catalog what would be the connection type?


Solution

  • If the tables are in the Glue Catalog you don't need any connections. As long as your data sources / data sinks are Glue / S3 you don't need a connection apart from a VPC S3 Endpoint.

    If you want to connect to let's say Redshift or a MySQL database you would need a connection.