Search code examples
Amazon EMR MapReduce streaming program terminated with errors...


python-2.7 mapreduce bigdata amazon-emr hadoop-streaming

Using Json Input Variables In Airflow EMR Operator...


amazon-emr airflow

What is the option for Presto to map multiple rows to a single file in S3?...


amazon-s3 amazon-emr presto

How to check EMR spot instance price history with boto...


python-3.x amazon-web-services boto3 amazon-emr

How to submit job over dispatcher http api (tcp/8081) on AWS EMR?...


amazon-web-services apache-flink apache-beam amazon-emr

I am trying to import a CSV file into HDFS. I am getting an error that states: -cp: Not enough argume...


amazon-web-services hdfs hadoop2 amazon-emr hue

Deploying Bokeh server on EMR...


python bokeh amazon-emr

Determine where spark program is failing?...


apache-spark pyspark hadoop-yarn amazon-emr

How to set User-Agent (prefix) for every upload request to S3 from Amazon EMR application...


hadoop amazon-emr

Can't get Apache Airflow to write to S3 using EMR Operators...


amazon-s3 airflow amazon-emr

EMR fails to run python 3.x...


python amazon-web-services amazon-emr

EMR pyspark notebook Spark progress widget gone...


python amazon-web-services pyspark jupyter-notebook amazon-emr

Is it possible to run Spark UDF functions (mainly Python) under Docker?...


python docker apache-spark pyspark amazon-emr

How to install Cloudera Impala on EMR?...


hadoop hive cloudera amazon-emr impala

Credential problems when using both S3 and Redshift...


amazon-s3 pyspark amazon-redshift apache-spark-sql amazon-emr

Install ExtJS in Oozie EMR...


hadoop extjs amazon-emr oozie

What is Taskmanager, Task, Slots, Parallelism, CPU cores in Flink?...


apache-flink amazon-emr flink-streaming taskmanager

Spark on EMR - Downloading Different Jar Files...


amazon-web-services apache-spark jar amazon-emr

Nutch FetchData job is too slow...


hadoop mapreduce web-crawler amazon-emr nutch

How do I run a local Python script on a remote Spark cluster?...


python amazon-web-services amazon-ec2 pyspark amazon-emr

How to set Jupyter notebook to Python3 instead of Python2.7 in AWS EMR...


python-3.x amazon-web-services jupyter-notebook amazon-emr

Connection timed out exception with spark-redshift on EMR...


apache-spark apache-spark-sql amazon-emr databricks

Bootstrap Action after HBase Installation...


hbase amazon-emr geomesa

Bootstrap Failure when trying to install Spark on EMR...


amazon-web-services apache-spark hadoop amazon-emr

How can I install a module to a specific Jupyter kernel without use of ! or terminal?...


python jupyter-notebook amazon-emr

Pyspark UDF unable to use large dictionary...


python dictionary pyspark user-defined-functions amazon-emr

How can I add multiple columns to an existing DataFrame in PySpark on AWS EMR?...


python dataframe pyspark apache-spark-sql amazon-emr

EMR creation task and core nodes not able to specify as "Max on demand" for spot pricing...


amazon-web-services terraform amazon-emr terraform-provider-aws

EMR JupyterHub: S3 persistence of notebooks not working...


amazon-s3 jupyter-notebook amazon-emr

Cross Apply Equivalent in Hive?...


hive hiveql amazon-emr
