Search code examples
EMR Serverless is allocating half of the memory to the executors than what we actually define in spar...


apache-spark, amazon-emr, emr-serverless

apache-beam installation issue on AWS EMR-EC2 cluster...


apache-spark, pyspark, apache-beam, amazon-emr, spark-submit

Amazon EMR 7.2 does not support Ganglia?...


amazon-web-services, amazon-emr

Flink Job Execution Fails with `NoClassDefFoundError` on AWS EMR with Python...


apache-flink, amazon-emr, flink-streaming, pyflink

spark-submit using --py-files option could not find path to modules...


amazon-web-services, apache-spark, amazon-s3, pyspark, amazon-emr

Persistent and transient EMR equivalent clusters in Azure and HDInsight...


azure, amazon-emr, azure-hdinsight

What is the difference between AWS Glue ETL Job and AWS EMR?...


amazon-web-services, amazon-s3, etl, amazon-emr, aws-glue

Apache Sedona on EMR version > 6.9.0: JavaPackage object is not callable...


apache-spark, pyspark, amazon-emr, apache-sedona

PySpark error in EMR writing Parquet files to S3...


python, apache-spark, amazon-s3, pyspark, amazon-emr

Access credential for EMR Jupyter Notebook...


amazon-emr, jupyterhub

Apache Crunch Job On AWS EMR using Oozie...


hadoop, mapreduce, amazon-emr, oozie, apache-crunch

How to enable "Use for Hive table metadata" in "AWS Glue Data Catalog settings" ...


amazon-web-services, terraform, aws-glue, amazon-emr, trino

Start token not found error while using JsonSerDe...


amazon-web-services, hive, emr, amazon-emr

Access data on EMR directory from EMR Studio: Workspaces (Notebooks)...


python, amazon-s3, import, amazon-emr

Add Bootstrap Actions while creating EMR cluster from AWS Step Functions...


amazon-emr, aws-step-functions

Use pyspark shell or Zeppelin with Docker for EMR...


docker, apache-spark, pyspark, amazon-emr, apache-zeppelin

EMR Pyspark does not see computed columns when running select statements...


pyspark, amazon-emr

Spark recommends listing Spark and Hadoop dependencies as provided in the docs, is this strictly req...


apache-spark, hadoop, hbase, amazon-emr

EMR Spark Job Step can't find mysql connector...


amazon-web-services, apache-spark, pyspark, airflow, amazon-emr

Syntax error when using Nessie commands with DBT but not using Spark...


apache-spark, amazon-emr, thrift, dbt, nessie

df.show returning java.lang.ClassNotFoundException: org.postgresql.Driver...


postgresql, jdbc, pyspark, amazon-rds, amazon-emr

400 Bad Request error when trying to write to S3 from an EMR 7.0.0 cluster...


apache-spark, amazon-s3, apache-spark-sql, amazon-emr

AWS S3 - SdkInterruptedException...


java, amazon-web-services, apache-spark, amazon-s3, amazon-emr

Ensuring File Size Limit is Adhered to When Batch Processing Downloads in PySpark on EMR...


python, apache-spark, pyspark, amazon-emr

Is high availability really not possible with AWS EMR instance fleets?...


amazon-emr

Apache Iceberg tables not working with AWS Glue in AWS EMR...


amazon-web-services, apache-spark, aws-glue, amazon-emr, apache-iceberg

Dealing with a large gzipped file in Spark...


apache-spark, gzip, amazon-emr

When using Iceberg with EMR 7.0.0 with S3 I got awssdk SdkClientException: Timeout waiting for conne...


amazon-web-services, amazon-s3, pyspark, amazon-emr, apache-iceberg

pyspark & iceberg: `update *` not working in `merge into`?...


apache-spark, pyspark, apache-spark-sql, amazon-emr, apache-iceberg

Python version running on EMR 6.8...


pyspark, amazon-emr
