amazon-web-services, apache-spark, aws-glue, amazon-emr

How to access existing Glue catalog from EMR?


I have created a Glue catalog in my account. It contains one database and one table. [Screenshot: Glue catalog in the AWS console]

I followed this guide from AWS and created my EMR cluster. However, when I run spark-shell and try to access the Glue catalog, the database from the Glue catalog does not show up on the EMR cluster. [Screenshot: terminal running spark-shell]

What am I missing?


Solution

  • This turned out to be a non-issue. I was trying to launch an EMR cluster in US-East-1, and for some reason the cluster never finished provisioning, even though the underlying EC2 instances were provisioned and in a running state. I was able to SSH into those EC2 instances and run spark-shell on them too.

    I then launched an EMR cluster in US-East-2, which provisioned completely, and I was able to connect to the Glue catalog successfully.
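
    For anyone hitting the same symptom on a cluster that did provision fully, it may be worth double-checking the catalog setting itself. The sketch below is not taken from the question; it assumes the cluster is meant to use the Glue Data Catalog as the Spark SQL metastore via the `spark-hive-site` classification, and the function name `glue_catalog_configurations` is just an illustrative helper. It only builds the configuration JSON and does not call AWS.

```python
import json

# Factory class that points the Hive metastore client at the Glue Data Catalog.
GLUE_FACTORY = (
    "com.amazonaws.glue.catalog.metastore.AWSGlueDataCatalogHiveClientFactory"
)


def glue_catalog_configurations():
    """Return an EMR configurations list enabling the Glue catalog for Spark SQL.

    Hypothetical helper for illustration; the structure follows the EMR
    "Classification" / "Properties" configuration format.
    """
    return [
        {
            "Classification": "spark-hive-site",
            "Properties": {
                "hive.metastore.client.factory.class": GLUE_FACTORY,
            },
        }
    ]


if __name__ == "__main__":
    # Print the JSON so it can be saved to a file and supplied at cluster creation.
    print(json.dumps(glue_catalog_configurations(), indent=2))
```

    The printed JSON can be saved to a file (say `glue-config.json`, a made-up name) and passed to `aws emr create-cluster` with `--configurations file://glue-config.json`. Once the cluster reports as fully provisioned, running `spark.sql("show databases").show()` in spark-shell should list the Glue database if the setting took effect.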