Search code examples
Pass system property to spark-submit and read file from classpath or custom path...


java scala apache-spark apache-spark-2.0 spark-submit

Spark DataFrame: find and set the main root for child...


apache-spark apache-spark-sql apache-spark-dataset apache-spark-2.0

Exception in thread "broadcast-exchange-0" java.lang.OutOfMemoryError: Not enough memory t...


java apache-spark apache-spark-sql apache-spark-2.0

Spark parquet partitioning : Large number of files...


apache-spark apache-spark-sql rdd apache-spark-2.0 bigdata

Maximum JDK version supported for apache spark 2.4.5.1...


apache-spark apache-spark-2.0

Where is table data stored in Spark?...


apache-spark apache-spark-sql apache-spark-2.0

Why does SparkSQL require two literal escape backslashes in the SQL query?...


scala apache-spark apache-spark-sql apache-spark-2.0

How to create encoder for custom Java objects?...


java apache-spark apache-spark-2.0

how to read CSV file with apache-spark spring boot...


java spring-boot apache-spark apache-spark-sql apache-spark-2.0

Apache Spark Dataframe - Load data from nth line of a CSV file...


apache-spark apache-spark-sql apache-spark-2.0

spark off heap memory config and tungsten...


apache-spark apache-spark-sql apache-spark-2.0 off-heap

What are the various join types in Spark?...


scala apache-spark apache-spark-sql apache-spark-2.0

How to tail yarn logs?...


apache-spark hadoop hadoop-yarn tail apache-spark-2.0

Issues in fetching data from cassandra using spark cassandra connector...


java apache-spark cassandra spark-cassandra-connector apache-spark-2.0

Extracting entries from multiple primary keys in one query...


scala apache-spark cassandra spark-cassandra-connector apache-spark-2.0

Two big files join as one to many relationship in Java Spark...


apache-spark apache-spark-sql apache-spark-dataset apache-spark-2.0

Spark step on EMR just hangs as "Running" after done writing to S3...


amazon-web-services apache-spark amazon-s3 pyspark apache-spark-2.0

java.lang.ClassCastException: org.apache.hadoop.conf.Configuration cannot be cast to org.apache.hado...


apache-spark hadoop-yarn cloudera apache-spark-2.0 scala-2.11

Timeout Exception in Apache-Spark during program Execution...


scala apache-spark spark-graphx apache-spark-2.0

How to replace a particular value in a Pyspark Dataframe column with another value?...


python apache-spark pyspark apache-spark-sql apache-spark-2.0

How to delete old data that was created by Spark Structured Streaming?...


apache-spark apache-spark-sql spark-structured-streaming apache-spark-2.0

Why does spark not recognize my "dataframe boolean expression"?...


python apache-spark pyspark apache-spark-2.0

How to count the number of occurrences of a key in pyspark dataframe (2.1.0)...


python apache-spark pyspark apache-spark-2.0

Splitting a specific PySpark df column and create another DF...


python-3.x pyspark apache-spark-2.0

How to select rows that are not present in another dataframe with pyspark 2.1.0?...


python dataframe pyspark apache-spark-2.0

Apache Spark join with dynamic re-partitioning...


scala apache-spark apache-spark-sql apache-spark-dataset apache-spark-2.0

How to transform this dataset to the following dataset...


scala apache-spark apache-spark-sql apache-spark-dataset apache-spark-2.0

Caching in spark before diverging the flow...


apache-spark apache-spark-sql apache-spark-2.0

How to pivot streaming dataset?...


apache-spark spark-structured-streaming apache-spark-2.0

Spark fails to start in local mode when disconnected [Possible bug in handling IPv6 in Spark??]...


macos shell apache-spark apache-spark-2.0
