Search code examples
Hadoop YARN: How to force a Node to be Marked "LOST" instead of "SHUTDOWN"?...


hadoop hadoop-yarn amazon-emr

How to spin up all nodes in my EMR cluster before running my spark job...


amazon-web-services apache-spark amazon-emr

How to efficiently aggregate data in billions of individual records in AWS?...


amazon-web-services amazon-redshift analytics amazon-emr amazon-athena

How to check for Jupyter active notebooks through command line...


amazon-web-services jupyter-notebook jupyter amazon-emr

How to read app.properties file from Java Spark application...


java apache-spark configuration amazon-emr

Exporting Spark DataFrame to S3...


scala csv apache-spark amazon-s3 amazon-emr

AWS EMR - Get master node ip from java code...


java aws-sdk emr amazon-emr

Will this method force parallelization of "for" loops in spark?...


apache-spark for-loop pyspark parallel-processing amazon-emr

I can't install spacy model in EMR PySpark notebook...


python amazon-web-services pyspark amazon-emr spacy

Spark Exit Status 134. What does it mean...


apache-spark pyspark amazon-emr

Running GeoMesa HBase on AWS S3, how do I ingest / export remotely...


amazon-emr geomesa

How to read stderr logs from AWS logs...


apache-spark debugging logging pyspark amazon-emr

ExitCodeException exitCode=13 when running PySpark via EMR console...


amazon-web-services apache-spark pyspark hadoop-yarn amazon-emr

Spark worker nodes unable to access file on master node...


scala apache-spark amazon-emr

mrjob in emr is running only 1 MRStep out of 3 MRSteps and cluster is shutting down...


python amazon-web-services amazon-emr mrjob

PySpark textFile replace text...


apache-spark pyspark amazon-emr

Getting GeoSpark error with upload_jars function...


python amazon-web-services apache-spark amazon-emr geospark

Getting import error while executing statements via livy sessions with EMR...


apache-spark jar hadoop-yarn amazon-emr livy

How do you add partitions to a partitioned table in Presto running in Amazon EMR?...


hive amazon-emr parquet presto hadoop-partitioning

getting error while trying to read athena table in spark...


apache-spark pyspark amazon-emr amazon-sagemaker

Presto-Glue-EMR integration: presto-cli giving NullPointerException...


amazon-emr aws-glue presto trino

Presto SocketTimeoutException...


java jdbc amazon-emr presto trino

Error consuming records caused by SdkInterruptedException when inserting into Hudi Table...


amazon-web-services amazon-emr apache-hudi

How is deprecated.legacy-timestamp supposed to work in Presto 0.220?...


amazon-emr presto trino

Access denied - EMR Presto - File Based Authorization...


hive amazon-emr presto trino

Does using Parquet on S3 with EMR/Spark save bandwidth when using subset of columns?...


apache-spark amazon-s3 amazon-emr

How to *safely* install a python private package from github in an AWS EMR bootstrap script...


amazon-web-services github pip amazon-emr

spark-submit on AWS EMR runs but fails on accessing S3...


apache-spark amazon-emr

Setting spark.driver.maxResultSize in EMR notebook jupyter...


apache-spark jupyter-notebook amazon-emr spark-notebook

Keep Hive session open EMR...


bash hive hiveql amazon-emr
