Search code examples
Spark - Sort DStream by Key and limit to 5 values...


apache-sparkpysparkspark-streamingrdd

Read More
set spark.streaming.kafka.maxRatePerPartition for createDirectStream...


apache-sparkspark-streaming

Read More
How many RDDs does DStream generate for a batch interval?...


apache-sparkspark-streaming

Read More
How to convert spark streaming output into dataframe or storing in table...


scalaapache-sparkapache-spark-sqlspark-streaming

Read More
Number of threads per core in Spark...


apache-sparkspark-streaming

Read More
How to read parquet files using `ssc.fileStream()`? What are the types passed to `ssc.fileStream()`?...


scalahadoopapache-sparkspark-streaminghadoop2

Read More
Pyspark Obtain Substring from Filename and Store as New Column...


apache-sparkpysparkapache-spark-sqlspark-streaming

Read More
How can I read a dataframe using spark streaming with it's schema that I specify...


scalaapache-sparkapache-spark-sqlspark-streaming

Read More
Indexing multiple documents...


solrspark-streamingsolrj

Read More
Spark throws Not Serializable Exception inside a foreachRDD operation...


scalaapache-kafkaspark-streaming

Read More
How to use Spark Streaming to read a stream and find the IP over a time Window?...


pythonpysparkspark-streaming

Read More
How good is the parallelism of Stream Processing Systems?...


parallel-processingspark-streamingapache-stormflink-streaming

Read More
Spark Structured Streaming with Kafka doesn't honor startingOffset="earliest"...


apache-sparkspark-streamingspark-structured-streamingspark-streaming-kafka

Read More
Broadcasting TypeSafe Config throws exception User class threw exception: java.io.UTFDataFormatExcep...


scalaapache-sparkspark-streaming

Read More
SparkContext.setLogLevel("DEBUG") doesn't works in Cluster...


apache-sparklog4jpysparkspark-streamingrdd

Read More
DStreams: Variable created within foreachRDD and then modified inside foreachPartition is reset once...


apache-sparkspark-streamingspark-streaming-kafka

Read More
Why spark streaming is running on previous topics records?...


apache-kafkaspark-streaming

Read More
Unable to read kafka using spark sql...


javamavenapache-sparkapache-kafkaspark-streaming

Read More
How to pass constant value on map function in spark scala...


scalaapache-sparkspark-streaming

Read More
Is Sleep statement allowed in spark Streaming...


spark-streaming

Read More
pyspark textFileStreaming can not detect txt file while textFile works...


apache-sparkpysparkspark-streaming

Read More
How to get data from multiple topics in one place for processing?...


scalaapache-kafkaspark-streaming

Read More
Is there away to share/access the hdfs among developers?...


apache-sparkhadoophiveapache-kafkaspark-streaming

Read More
Can count() be called twice in Spark?...


apache-sparkapache-spark-sqlspark-streaming

Read More
Spark structured streaming exactly once - Not achieved - Duplicated events...


apache-sparkapache-kafkaspark-streamingspark-structured-streamingspark-streaming-kafka

Read More
spark-submit error Caused by: java.lang.ClassNotFoundException: kafka.DefaultSource...


scalaapache-sparkspark-streamingspark-structured-streamingspark-streaming-kafka

Read More
Spark Streaming write to Kafka with delay - after x minutes...


scalaapache-sparkapache-kafkaspark-streamingamazon-kinesis

Read More
Why is the "topics" argument of KafkaUtils.createStream() a Map rather then array?...


javaapache-sparkapache-kafkaspark-streaming

Read More
What does this mean "AssertionError: assertion failed: unsafe symbol Unstable"?...


apache-sparkapache-spark-sqlspark-streaming

Read More
java.util.NoSuchElementException: Columns not found in table abc.company_vals: companyId, companyNam...


apache-sparkapache-spark-sqlspark-streamingdatastax

Read More
BackNext