I'm making an app that cleans-etc-search a CSV that is updated daily on a website. I was using an EC2 to download the file using python pandas.read_csv(url)
to an EBS, but now I want to make the app serverless.
I want to automate the download from 'https://sam.gov/api/prod/fileextractservices/v1/api/download/Contract%20Opportunities/datagov/ContractOpportunitiesFullCSV.csv?privacy=Public' and upload it to S3 serverless. I'm not sure if is possible to do it serverless. Is there a better way to do it?
The file size is about 500 MB.
A lambda is exactly what you would want to use for this kind of scenario. Do the following: