Tags: scala, apache-kafka, spark-streaming, confluent-platform

Is Spark aware of new partitions that get added in Kafka?


We recently had an issue where some of the Kafka partitions were lost and the job continued without failing. In the meantime, new Kafka partitions were added. Because our Spark Streaming job was not restarted, it received no data from the new partitions until we noticed a discrepancy in the counts. After restarting the job, everything was fine. So my question is: doesn't the spark-kafka streaming API periodically check whether new partitions were added? Is there a special setting to enable that?
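For context, the job in question is a DStream-based consumer. A minimal sketch of that kind of setup, assuming the spark-streaming-kafka-0-10 integration (the topic name, group id, and broker address are placeholders):

```scala
import org.apache.kafka.common.serialization.StringDeserializer
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.kafka010.KafkaUtils
import org.apache.spark.streaming.kafka010.LocationStrategies.PreferConsistent
import org.apache.spark.streaming.kafka010.ConsumerStrategies.Subscribe

object KafkaCountsJob {
  def main(args: Array[String]): Unit = {
    val kafkaParams = Map[String, Object](
      "bootstrap.servers"  -> "broker1:9092",          // placeholder broker
      "key.deserializer"   -> classOf[StringDeserializer],
      "value.deserializer" -> classOf[StringDeserializer],
      "group.id"           -> "my-consumer-group",     // placeholder group id
      "auto.offset.reset"  -> "latest",
      "enable.auto.commit" -> (false: java.lang.Boolean)
    )

    val ssc = new StreamingContext(
      new SparkConf().setAppName("kafka-counts"), Seconds(30))

    // Direct stream subscribed to a single topic; the partitions read each
    // batch are decided by the driver-side consumer.
    val stream = KafkaUtils.createDirectStream[String, String](
      ssc, PreferConsistent,
      Subscribe[String, String](Set("my-topic"), kafkaParams))

    // Count records per batch; this is where the count discrepancy showed up.
    stream.foreachRDD(rdd => println(s"records in batch: ${rdd.count()}"))

    ssc.start()
    ssc.awaitTermination()
  }
}
```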


Solution

  • AFAIK, Spark's Kafka Consumer will not automatically rebalance its consumer group when new topics/partitions are added.

    That's one of the benefits that gets listed when comparing Spark Streaming with Kafka Streams, in that Kafka Streams will rebalance its consumers automatically as topics and partitions change (see the Structured Streaming sketch below).
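One possible mitigation, beyond restarting the job: Spark's Structured Streaming Kafka source re-checks topic metadata on each micro-batch and, per the Spark documentation, picks up newly discovered partitions without a restart. A minimal sketch, with placeholder broker address and topic name; whether this fits depends on the Spark version in use:

```scala
import org.apache.spark.sql.SparkSession

object KafkaStructuredCounts {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("kafka-structured-counts")
      .getOrCreate()

    // Structured Streaming's Kafka source tracks the topic's partition list,
    // so partitions added to "my-topic" are read without restarting the query.
    val df = spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "broker1:9092") // placeholder broker
      .option("subscribe", "my-topic")                   // placeholder topic
      .option("startingOffsets", "latest")
      .load()

    // Print the record values to the console for a quick sanity check.
    val query = df.selectExpr("CAST(value AS STRING) AS value")
      .writeStream
      .format("console")
      .start()

    query.awaitTermination()
  }
}
```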