Search code examples
Spark Dataframe - Merge Nested Columns into one...

dataframe apache-spark aws-glue

ParamValidationError: Parameter validation failed: Bucket name must match the regex...

python amazon-web-services amazon-s3 aws-lambda aws-glue

Event based trigger of AWS Glue Crawler after a file is uploaded into a S3 Bucket?...

amazon-web-services amazon-s3 aws-glue

Distinct performance in Redshift...

sql amazon-redshift query-optimization aws-glue amazon-redshift-spectrum

AWS Lake Formation: Grant permission for one role to ALL databases...

amazon-web-services aws-glue amazon-athena aws-lake-formation

How to exclude either files or folder paths on S3 within an AWS Glue job when reading an Athena tabl...

amazon-web-services amazon-s3 aws-glue amazon-athena apache-hudi

Getting the Job Run Datetime in a Scheduled Glue Job...

amazon-web-services aws-glue

why is the session name appended to the role name when trying to connect to AWS Glue?...

amazon-web-services boto3 amazon-iam aws-glue aws-glue-connection

Update Trigger AWS Glue Trigger Using boto3...

python amazon-web-services boto3 aws-glue

How to define the AWS Athena s3 output location using terraform when using aws_glue_catalog_database...

terraform aws-glue terraform-provider-aws amazon-athena aws-glue-data-catalog

Debugging Glue Crawler EOFException...

aws-glue

AWS Glue vs EMR Serverless...

amazon-web-services amazon-emr aws-glue emr-serverless

AWS Glue schema registry AVRO multiple events of specific record type in same topic...

apache-kafka spring-kafka avro aws-glue

AWS Glue Access denied for crawler with administrator policy attached...

amazon-s3 aws-glue

AWS Athena Returning Zero Records from Tables Created from GLUE Crawler database using parquet from ...

amazon-s3 parquet aws-glue amazon-athena

AWS Glue: get job_id from within the script using pyspark...

amazon-web-services aws-glue

How to Trigger Glue ETL Pyspark job through S3 Events or AWS Lambda?...

amazon-web-services amazon-s3 aws-lambda aws-glue

AWS Glue enableUpdateCatalog not creating new partitions after successful job run...

amazon-web-services aws-glue aws-glue-data-catalog

AWS Glue Job Cloudformation - Values Set in Cloudformation Not Sticking...

python amazon-web-services apache-spark aws-cloudformation aws-glue

How to create a text file with windows line ending (CRLF) using pyspark?...

python apache-spark pyspark aws-glue

Not able to remove ( ) in pyspark...

pyspark aws-glue regexp-replace

Partitioning by date on Glue: 1 date column vs 3 columns (year/month/day)?...

amazon-web-services amazon-redshift aws-glue aws-glue-data-catalog amazon-redshift-spectrum

How can we update existing partition data in aws glue table without running crawler?...

amazon-s3 aws-glue amazon-athena aws-glue-data-catalog

AWS GLUE Pyspark job delete S3 folder unexpectly...

amazon-s3 pyspark aws-glue aws-glue-workflow

How to set a specific compression value in aws glue? If possible, can the compression level and part...

amazon-web-services pyspark aws-glue aws-glue-spark aws-glue-workflow

pg8000 get inserted id into dataframe...

pyspark apache-spark-sql aws-glue

AWS Glue Job parallel running got error "Rate exceeded" ThrottlingException Status Code: 4...

amazon-web-services aws-glue

Not able to replace / with - using pyspark regexp_replace...

python pyspark aws-glue

When do I use a glue job or a Sagemaker Processing job for an etl?...

amazon-web-services aws-glue amazon-sagemaker

Tables not found in Spark SQL after migrating from EMR to AWS Glue...

apache-spark amazon-emr aws-glue
