Search code examples
Yarn UI "Application Memory MB" almost doubled from spark-submit configuration memory sett...


apache-sparkhadoop-yarnresourcemanager

Read More
Write to Iceberg/Glue table from local PySpark session...


apache-sparkpysparkaws-glueapache-iceberg

Read More
How to read several JSON files with different column count into one Dataframe in Spark...


jsonapache-sparkpyspark

Read More
PySpark: Parsing JSON files where column names are defined once in the header...


azureapache-sparkpysparkazure-synapse

Read More
Extract from List of JSON...


sqlapache-sparkapache-iceberg

Read More
Not able to get Array size in Apache Iceberg with Spark 3.2.0 or before...


pythonsqlapache-sparkapache-iceberg

Read More
Spark DataFrame not persisting to the ADLS Gen2 container...


pythonazureapache-sparkpyspark

Read More
pyspark & iceberg: `update *` not working in `merge into`?...


apache-sparkpysparkapache-spark-sqlamazon-emrapache-iceberg

Read More
How to flatten a struct in a Spark dataframe?...


javaapache-sparkpysparkapache-spark-sql

Read More
spark mlib: requirement failed: Index 0 follows 0 and is not strictly increasing...


apache-sparkapache-spark-mllibapache-spark-ml

Read More
generate list field in json format data...


apache-sparkpyspark

Read More
Spark: get number of cluster cores programmatically...


javaapache-sparkdatasethadoop-yarncpu-cores

Read More
Spark ElasticSearch query missing fields in dataframe...


scalaapache-sparkelasticsearch

Read More
Recasting column types with a function and a dictionary in pyspark...


pythonfunctionloopsapache-sparkpyspark

Read More
spark write to iceberg table without repartition...


pythonapache-sparkpysparkapache-iceberg

Read More
Printing secret value in Databricks...


amazon-web-servicesapache-sparkpysparkdatabricksazure-databricks

Read More
Spark : need confirmation on approach in capturing first and last date : on dataset...


sqlapache-sparkpysparkapache-spark-sql

Read More
PySpark spark.executor.pyspark.memory introduced errors...


apache-sparkpyspark

Read More
Stop the Application in Driver when Worker is failed...


javaapache-sparkuser-defined-functions

Read More
Retrieving the default configurations...


apache-sparkpyspark

Read More
SPARK SQL Equivalent of Qualify + Row_number statements...


sqlapache-sparkapache-spark-sqlwindow-functionsrow-number

Read More
How To Migrate Spark Scala Azure Mounting Code into Pyspark Code...


apache-sparkpysparkazure-databricks

Read More
Error in publishing data to pubSub from dataProc Spark job: No functional channel service provider f...


scalaapache-sparkgoogle-cloud-pubsubgoogle-cloud-dataproc

Read More
How to get week of month in Spark 3.0+?...


apache-sparkdatetimepysparkapache-spark-sqlapache-spark-3.0

Read More
How to find median and quantiles using Spark...


pythonapache-sparkmedianrddpyspark

Read More
Hive table partition by year month and day query...


apache-sparkhive

Read More
Rowencoder.apply and rowencoder.encoderfor methods in spark catalyst package...


apache-sparkapache-spark-encoders

Read More
Spark Aggregator with Array as Input...


scalaapache-sparkaggregate-functionsuser-defined-functions

Read More
DataBricks: Any way to reset the Generated IDENTITY column?...


apache-sparkdatabricksdelta-lake

Read More
Iceberg table snapshots not expired...


apache-sparkhivegoogle-cloud-dataprocapache-iceberg

Read More
BackNext