Search code examples
amazon-web-servicesamazon-s3aws-glue

AWS Glue job to unzip a file from S3 and write it back to S3


I'm very new to AWS Glue, and I want to use AWS Glue to unzip a huge file present in a S3 bucket, and write the contents back to S3.

I couldn't find anything while trying to google this requirement.

My questions are:

  1. How to add a zip file as data source to AWS Glue?
  2. How to write it back to same S3 location?

I am using AWS Glue Studio. Any help will be highly appreciated.


Solution

  • I couldn't find anything while trying to google this requirement.

    You couldn't find anything about this, because this is not what Glue does. Glue can read gzip (not zip) files natively. If you have zip, then you have to convert all the files yourself in S3. Glue will not do it.

    To convert the files, you can download them, re-pack, and re-upload in gzip format, or any other format that Glue supports.