Search code examples
Hadoop streaming job using Mxnet failing in AWS Emr...

hadoop emr hadoop-streaming amazon-data-pipeline mxnet

EMR Spark working in a java main, but not in a java function...

java amazon-web-services apache-spark emr amazon-emr

Get a yarn configuration from commandline...

hadoop hadoop-yarn hadoop2 emr elastic-map-reduce

Is it Possible to Pass Additional Info to the EMR Cluster using Terraform?...

amazon-web-services emr terraform

Parquet Warning Filling up Logs in Hive MapReduce on Amazon EMR...

hive hadoop-yarn emr parquet tez

Why Spark on AWS EMR doesn't load class from application fat jar?...

apache-spark classpath emr amazon-emr

Manual installation of Sqoop 1.4 on AWS EMR 5.5.0...

amazon-web-services hive sqoop emr

Oozie sample on EMR...

oozie emr

AWS EMR Spark Step args bug...

amazon-web-services apache-spark emr amazon-emr

In Spark, How to convert multiple dataframes into an avro?...

apache-spark pyspark avro emr spark-avro

I am getting the below error while trying to execute spark submit using oozie on emr...

apache-spark oozie emr spark-submit

Transform JSON into Parquet using EMR/Spark...

java hadoop apache-spark avro emr

Using data present in S3 inside EMR mappers...

amazon-s3 emr amazon-emr

EMR Hue: CUSTOM server authentication not supported. Valid are ['NONE', 'KERBEROS', ...

hadoop apache-spark hive emr hue

Cannot use a "." in a Hive table column name...

hadoop hive hiveql emr

Unable to restart Hue in EMR...

hadoop emr hue

Efficient way to work in s3 gzip file from EMR Spark...

apache-spark pyspark emr amazon-emr apache-spark-sql

Unhealthy node on the cluster...

hdfs apache-spark-sql hadoop-yarn emr amazon-emr

AWS' EMR vs EC2 pricing confusion...

amazon-web-services apache-spark amazon-ec2 cluster-computing emr

How to properly provide credentials for spark-redshift in EMR instances?...

amazon-web-services apache-spark amazon-redshift emr aws-sdk

Using Spark to get names of all columns that have a value over some threshold...

python apache-spark pyspark emr

AWS-SDK alignment errors with Spark 2.1.0 on AWS EMR?...

amazon-web-services apache-spark emr amazon-emr

Why does Spark job fail while executing multiple Hive scripts using spark-sql in parallel?...

hql apache-spark-sql emr

Number of executors and cores...

amazon-web-services apache-spark emr

Is there a way to group my DynamoDB export tasks on one EMR cluster?...

amazon-web-services backup amazon-dynamodb emr amazon-data-pipeline

Stop hadoop/EMR/AWS creating S3 paths with _$folder$ extensions...

hadoop amazon-web-services amazon-s3 apache-spark emr

Multiple MySQL catalogs on EMR/PrestoDB...

mysql emr presto

Spark Container & Executor OOMs during `reduceByKey`...

apache-spark memory-management pyspark emr

Spin down after last spark-submit completion (or failure) + xxx time...

apache-spark timestamp emr

pyspark.sql.utils.AnalysisException: u'Path does not exist...

hadoop apache-spark pyspark emr apache-spark-sql
