Search code examples
Module error caused from AWS EMR by running PySpark code in Apache Livy via lambda function...


amazon-web-services pyspark aws-lambda amazon-emr livy

Identifying changes in large amounts of data using pyspark...


pyspark bigdata data-mining amazon-emr

Get EMR cluster to kill its EC2 instances on termination...


amazon-web-services amazon-ec2 boto3 amazon-emr

Where is s3-dist-cp of EMR 6.2.0?...


amazon-s3 amazon-emr

How to redo a failed step in AWS EMR...


amazon-emr

Beam on EMR throws a java.util.ServiceConfigurationError...


apache-flink apache-beam amazon-emr

AWS step function does not add next step to EMR cluster when current step fails...


amazon-web-services amazon-emr aws-step-functions

Can't use python variable in jinja template with Airflow...


python amazon-web-services airflow amazon-emr mwaa

Amazon Redshift table to external table in S3 every hour...


amazon-web-services amazon-s3 amazon-redshift amazon-emr

"No module named 'pandas'" error occurs when using pyspark pandas_udf with AWS EM...


python-3.x apache-spark pyspark amazon-emr apache-zeppelin

AWS EMR driver jars...


amazon-web-services apache-spark pyspark amazon-emr

Find the YARN ApplicationID of the current Spark job from the DRIVER node?...


python apache-spark amazon-emr

How to install R language version 4 in AWS EMR - Amazon Linux 2...


r amazon-web-services amazon-emr amazon-linux

How to use Spark-Submit to run a scala file present on EMR cluster's master node?...


scala apache-spark amazon-emr spark-submit

Not able to connect to Snowflake from EMR Cluster using Pyspark using airflow emr operator...


amazon-web-services apache-spark pyspark airflow amazon-emr

Why does AWS EMR require 2 different security groups for Master and Core/Task nodes?...


amazon-web-services amazon-emr aws-security-group

For Loop keeps restarting in EMR (pyspark)...


apache-spark pyspark memory-leaks nested-loops amazon-emr

Hadoop YARN: How to force a Node to be Marked "LOST" instead of "SHUTDOWN"?...


hadoop hadoop-yarn amazon-emr

How to spin up all nodes in my EMR cluster before running my spark job...


amazon-web-services apache-spark amazon-emr

How to efficiently aggregate data in billions of individual records in AWS?...


amazon-web-services amazon-redshift analytics amazon-emr amazon-athena

How to check for Jupyter active notebooks through command line...


amazon-web-services jupyter-notebook jupyter amazon-emr

How to read app.properties file from Java Spark application...


java apache-spark configuration amazon-emr

Exporting Spark DataFrame to S3...


scala csv apache-spark amazon-s3 amazon-emr

AWS EMR - Get master node IP from Java code...


java aws-sdk emr amazon-emr

Will this method force parallelization of "for" loops in spark?...


apache-spark for-loop pyspark parallel-processing amazon-emr

I can't install spacy model in EMR PySpark notebook...


python amazon-web-services pyspark amazon-emr spacy

Spark Exit Status 134. What does it mean...


apache-spark pyspark amazon-emr

Running GeoMesa HBase on AWS S3, how do I ingest / export remotely...


amazon-emr geomesa

How to read stderr logs from AWS logs...


apache-spark debugging logging pyspark amazon-emr

ExitCodeException exitCode=13 when running PySpark via EMR console...


amazon-web-services apache-spark pyspark hadoop-yarn amazon-emr
