Spark AQE Post-Shuffle partitions coalesce don't work as expected, and even make data skew in so...


apache-spark apache-spark-sql spark-kafka-integration spark3

Spark ignores Iceberg Nessie catalog...


java apache-spark apache-iceberg spark3 nessie

Convert date to ISO week date in Spark...


apache-spark date pyspark apache-spark-sql spark3

Pyspark.ml - Error when loading model and Pipeline...


apache-spark pyspark spark3

Why are spark3 dynamic partitions slow to write to hive...


apache-spark apache-spark-sql hive bigdata spark3

Start of the week on Monday in Spark...


apache-spark pyspark apache-spark-sql dayofweek spark3

java.lang.NullPointerException while reading specific sheet from xlsx using org.zuinnote.spark.offic...


apache-poi spark3 spark-excel

Adaptive Query Execution and Shuffle Partitions...


apache-spark pyspark apache-spark-sql spark3

Spark 3 KryoSerializer issue - Unable to find class: org.apache.spark.util.collection.OpenHashMap...


scala apache-spark apache-spark-mllib kryo spark3

Why would finding an aggregate of a partition column in Spark 3 take very long time?...


apache-spark apache-spark-sql spark3 catalyst-optimizer

Spark: DF.as[Type] fails to compile...


scala apache-spark apache-spark-sql scala-3 spark3

Spark can't connect to DB with built-in connection providers...


scala apache-spark scala-2.13 spark3 apache-spark-3.0

Date from week date format: 2022-W02-1 (ISO 8601)...


apache-spark date pyspark apache-spark-sql spark3

Spark 3.0 is much slower to read json files than Spark 2.4...


scala apache-spark java-11 spark3

Create a lookup column in pyspark...


pyspark apache-spark-sql window-functions spark3

Scala: Parse timestamp using spark 3.1.2...


scala apache-spark parsing timestamp spark3

How to save spark dataset in encrypted format?...


java apache-spark hadoop encryption spark3

SPARK 3 - Populate value with value from previous rows (lookup)...


apache-spark pyspark apache-spark-sql spark3

AnalysisException when loading a PipelineModel with Spark 3...


python apache-spark machine-learning spark3

Does Apache Spark 3 support GPU usage for Spark RDDs?...


apache-spark gpu rdd rapids spark3

spark struct streaming writeStream output no data but no error...


pyspark apache-kafka spark-structured-streaming spark-kafka-integration spark3

org.apache.spark.shuffle.FetchFailedException: Connection from server1/xxx.xxx.x.xxx:7337 closed...


apache-spark spark-streaming hadoop-yarn shuffle spark3

Spark 3.0 streaming metrics in Prometheus...


apache-spark prometheus spark-structured-streaming spark3

Spark 3.0.1 tasks are failing when using zstd compression codec...


apache-spark spark3 zstd

Does Spark 3.0.1 support custom Aggregators on window functions?...


java apache-spark spark3

How to access Spark DataFrame data in GPU from ML Libraries such as PyTorch or Tensorflow...


tensorflow apache-spark pytorch rapids spark3

How to create a map column to count occurrences without udaf...


scala apache-spark spark3

find set of keys in Scala map where values overlap...


scala apache-spark maps spark3

How to port UDAF to Aggregator?...


scala apache-spark spark3

Pyspark SelectExp() not working for first() and last()...


pyspark apache-spark-sql spark3
