Search code examples
Spark Scala: How to Replace a Field in Deeply Nested DataFrame...


scalaapache-sparkapache-spark-sqlapache-spark-dataset

Read More
Should cache and checkpoint be used together on DataSets? If so, how does this work under the hood?...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
why filtering using sql expression is better compared to a function applied in DataSet in Spark...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
How do I efficiently map keys from one dataset based on values from other dataset...


scalaapache-sparkapache-spark-sqlapache-spark-datasetdata-transform

Read More
Process each row to get date...


scalaapache-sparkapache-spark-sqlrddapache-spark-dataset

Read More
Spark Predicate Push Down, Filtering and Partition Pruning for Azure Data Lake...


azureapache-sparkapache-spark-sqlazure-data-lakeapache-spark-dataset

Read More
How to use the spark stats?...


scalaapache-sparkstatisticsapache-spark-datasetquantile

Read More
Mongo Spark Java Connector Group by...


spring-bootapache-sparkapache-spark-sqlapache-spark-dataset

Read More
Convert each value of Java spark Dataset into a row using explode()...


javaapache-sparkapache-spark-dataset

Read More
how to get the number of partitions in a dataset?...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
Why is "Unable to find encoder for type stored in a Dataset" when creating a dataset of cu...


scalaapache-sparkapache-spark-datasetapache-spark-encoders

Read More
Modifying var type of data set in Spark using its own method...


scalaapache-sparkapache-spark-dataset

Read More
Aggregation of multiple columns in spark Java...


javaapache-sparkgroup-byaggregateapache-spark-dataset

Read More
Handling repetetive data in Spark dataframes/datasets...


scalaapache-sparkapache-spark-sqlapache-spark-dataset

Read More
Searching and updating a Spark Dataset column with values from another Dataset...


javaapache-sparkapache-spark-datasetapache-spark-2.0

Read More
How to create a Dataset from a csv which doesn't have a header and has more than 150 columns usi...


scalacsvapache-sparkapache-spark-dataset

Read More
Spark excel: reading excel file with multi line header throw an exception: Method threw 'scala.M...


javaapache-sparkapache-spark-datasetspark-excel

Read More
What is the difference between Spark DataSet and RDD...


apache-sparkrddapache-spark-dataset

Read More
Why Spark Dataset.select replacing the column values...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
Cannot up cast `ordId` from string to int as it may truncate...


scalaapache-sparkapache-spark-dataset

Read More
Spark Datasets - strong typing...


apache-sparkdatasetapache-spark-dataset

Read More
GroupBy + custom aggregation on Dataset with Case class / Trait in the Key...


scalaapache-sparkserializationapache-spark-dataset

Read More
how to handle this in spark...


apache-sparkapache-spark-sqlspark-streamingapache-spark-dataset

Read More
Get certain percentage data on each values of a column using spark java...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
Transforming all new rows into new column in Spark with Scala...


scalaapache-sparkapache-spark-sqlapache-spark-dataset

Read More
Efficiently running a "for" loop in Apache spark so that execution is parallel...


pythonapache-sparkbigdataapache-spark-datasetapache-spark-2.0

Read More
Scala case class with escaped field name throws error during Spark Catalyst code generation...


scalaapache-sparkhiveapache-spark-sqlapache-spark-dataset

Read More
Union spark Datasets in loop...


javaapache-sparkapache-spark-dataset

Read More
Create Spark dataset with parts of other dataset...


scalaapache-sparkapache-spark-sqlflatmapapache-spark-dataset

Read More
Spark DataFrame RangePartitioner...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
BackNext