Search code examples
Java Spark collect() javaRdd fails with Memory errors (EMR cluster)...

javaapache-sparkrddemramazon-emr

Read More
Running Spark app on EMR is slow...

apache-sparkjava-8mapreduceemramazon-emr

Read More
Converting JSON to Parquet in Amazon EMR...

apache-sparkemr

Read More
Run cron task on AWS EMR master node...

amazon-web-servicescronemramazon-emr

Read More
Restart hiveserver2 on emr...

amazon-web-serviceshiveemr

Read More
spark-sql: How to get the progress bar (with stages and tasks)?...

amazon-web-servicesapache-sparkapache-spark-sqlhadoop-yarnemr

Read More
EMR/boto - How to get cluster id and step id using boto?...

botoemramazon-emr

Read More
Run a Unix shell command on every EMR / Yarn node...

hadoophadoop-yarnemr

Read More
Spark on EMR w/multiple JDBC jars...

apache-sparkjdbcsbtemr

Read More
EMR Spark job using less executors than nodes in the cluster...

apache-sparkemr

Read More
Launch EMR and kill it after running one JOB (automatically)...

amazon-web-servicesamazon-ec2emr

Read More
How to access local files on EMR using java jar?...

amazon-web-servicesamazon-s3emramazon-emr

Read More
PrestoDB EMR Server refused connection...

amazon-web-serviceshiveemrpresto

Read More
How to get cluster information to call REST API (from the driver)?...

apache-sparkhadoop-yarnemramazon-cloudwatch

Read More
s3-dist-cp fails with OutOfMemoryException when I upgrade from EMR 5.7 to EMR 5.8...

amazon-s3emramazon-emr

Read More
Airflow - Task Instance in EMR operator...

pythonemrairflow

Read More
External dependency for spark job...

pysparkhadoop-yarnemr

Read More
Why spark environment parameters is not consistent with executors?...

amazon-web-servicesapache-sparkemr

Read More
Does AWS EMR has resource manager in HA?...

amazon-web-serviceshadoop-yarnemr

Read More
How to connect EMR Cluster to EC2 server...

amazon-web-servicesamazon-ec2emramazon-emr

Read More
AWS Spark EMR Numpy Import Error...

amazon-web-servicesnumpypysparkemr

Read More
How to use HadoopJarStepConfig.StepProperties?...

amazon-web-serviceshadoopemramazon-emr

Read More
Increasing network IO on EC2...

network-programmingamazon-ec2emr

Read More
How much memory is allocated for cached RDDs?...

hadoopapache-sparkcachingmemoryemr

Read More
AWS EMR Zeppelin is missing MYSQL interpreter...

amazon-web-servicesemramazon-emrapache-zeppelin

Read More
Disable multipart upload on EMR...

amazon-web-servicesfile-uploadamazon-s3emramazon-emr

Read More
AWS EMR Spark save to S3 is very slow...

amazon-s3apache-sparkemr

Read More
nginx reverse proxy treats request urls differently...

emrprestonginx-reverse-proxy

Read More
Unable to set Environment Variables in Spark Application...

javaapache-sparkenvironment-variablesemr

Read More
Load props file in EMR Spark Application...

apache-sparkemramazon-data-pipeline

Read More
BackNext