Search code examples
Spark worker nodes unable to access file on master node...


scalaapache-sparkamazon-emr

Read More
mrjob in emr is running only 1 MRStep out of 3 MRSteps and cluster is shutting down...


pythonamazon-web-servicesamazon-emrmrjob

Read More
PySpark textFile replace text...


apache-sparkpysparkamazon-emr

Read More
Getting GeoSpark error with upload_jars function...


pythonamazon-web-servicesapache-sparkamazon-emrgeospark

Read More
Getting import error while executing statements via livy sessions with EMR...


apache-sparkjarhadoop-yarnamazon-emrlivy

Read More
How do you add partitions to a partitioned table in Presto running in Amazon EMR?...


hiveamazon-emrparquetprestohadoop-partitioning

Read More
getting error while trying to read athena table in spark...


apache-sparkpysparkamazon-emramazon-sagemaker

Read More
Presto-Glue-EMR integration: presto-cli giving NullPointerException...


amazon-emraws-glueprestotrino

Read More
Presto SocketTimeoutException...


javajdbcamazon-emrprestotrino

Read More
Error consuming records caused by SdkInterruptedException when inserting into Hudi Table...


amazon-web-servicesamazon-emrapache-hudi

Read More
How is deprecated.legacy-timestamp supposed to work in Presto 0.220?...


amazon-emrprestotrino

Read More
Access denied - EMR Presto - File Based Authorization...


hiveamazon-emrprestotrino

Read More
Does using Parquet on S3 with EMR/Spark save bandwidth when using subset of columns?...


apache-sparkamazon-s3amazon-emr

Read More
How to *safely* install a python private package from github in an AWS EMR bootstrap script...


amazon-web-servicesgithubpipamazon-emr

Read More
spark-submit on AWS EMR runs but fails on accessing S3...


apache-sparkamazon-emr

Read More
Setting spark.driver.maxResultSize in EMR notebook jupyter...


apache-sparkjupyter-notebookamazon-emrspark-notebook

Read More
Keep Hive session open EMR...


bashhivehiveqlamazon-emr

Read More
How do I connect Spark to JDBC driver in Zeppelin?...


apache-sparkpysparkamazon-emrapache-zeppelin

Read More
Amazon EMR pip install in bootstrap actions runs OK but has no effect...


pythonamazon-web-servicespipamazon-emr

Read More
Hive query shows few reducers killed but query is still running. Will the output be proper?...


hadoophivemapreduceamazon-emrapache-tez

Read More
spark job timing out when trying to save as table on aws emr...


amazon-web-servicesapache-sparkpysparkamazon-emr

Read More
How to properly check resource usage of AWS EMR cluster(master and cores) from notebook...


amazon-web-servicesapache-sparkpysparkamazon-emr

Read More
Where to find node logs in AWS EMR cluster?...


amazon-web-servicesapache-sparkhadoop-yarnamazon-emr

Read More
AWS EMR pyspark notebook fails with `Failed to run command /usr/bin/virtualenv (...)`...


amazon-web-servicesapache-sparkpysparkjupyter-notebookamazon-emr

Read More
Using MySQL instance on AWS EMR cluster...


mysqlamazon-emr

Read More
How to set Hadoop fs.s3a.acl.default on AWS EMR?...


scalaapache-sparkhadoopamazon-s3amazon-emr

Read More
AWS EMR Bootstrap action "aws s3 cp ..." to download 11GB file failing due to [Errno 28] N...


amazon-web-servicesamazon-emr

Read More
Checkpoint s3p flink on EMR...


amazon-s3apache-flinkamazon-emr

Read More
Flink on AWS EMR Task Nodes...


apache-flinkamazon-emrflink-streaming

Read More
Is there a way to get Step Functions input values into EMR step Args...


amazon-emraws-step-functions

Read More
BackNext