apache-kafka apache-kafka-connect aws-msk

Is there a way to dump Amazon MSK Topic to S3 directly?


I plan to use Amazon MSK and want to dump consumer logs to S3, but I don't see any option for this. Do I need to write my own consumer, or is there a way to send Amazon MSK consumer output to S3 directly?


Solution

  • Kafka Connect is generally the best (easiest, most scalable, portable, and resilient) way to move data between Kafka and downstream (or upstream) systems such as S3. Learn more about Kafka Connect here and in this talk here.

    MSK Connect can run Kafka Connect workloads for your MSK cluster on AWS.

    Another option is to run your own Kafka Connect worker (connected to MSK) and use the S3 sink connector (tutorial).
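
For the self-managed worker route, a rough sketch of what registering the S3 sink connector might look like is below. It assumes the Confluent S3 sink connector plugin is installed on the worker and that the worker's REST API is reachable (port 8083 is the usual default). The topic, bucket, region, connector name pattern, and the two helper functions are made up for illustration; adjust the format class and flush size to your data.

```python
import json
import urllib.request


def build_s3_sink_config(topic: str, bucket: str, region: str, flush_size: int = 1000) -> dict:
    """Build a connector definition for the Confluent S3 sink (illustrative values)."""
    return {
        # Hypothetical naming scheme; any unique connector name works.
        "name": f"s3-sink-{topic}",
        "config": {
            "connector.class": "io.confluent.connect.s3.S3SinkConnector",
            "tasks.max": "1",
            "topics": topic,
            "s3.bucket.name": bucket,
            "s3.region": region,
            "storage.class": "io.confluent.connect.s3.storage.S3Storage",
            # Assumes JSON records; other format classes exist (e.g. Avro).
            "format.class": "io.confluent.connect.s3.format.json.JsonFormat",
            # Number of records written per S3 object before it is committed.
            "flush.size": str(flush_size),
        },
    }


def register_connector(connect_url: str, definition: dict) -> int:
    """POST the definition to a Kafka Connect worker's REST API and return the HTTP status."""
    request = urllib.request.Request(
        f"{connect_url.rstrip('/')}/connectors",
        data=json.dumps(definition).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status
```

With a worker running, something like `register_connector("http://localhost:8083", build_s3_sink_config("consumer-logs", "my-bucket", "us-east-1"))` would submit it. With MSK Connect there is no worker REST endpoint to call; the same key/value pairs under `config` would instead go into the connector configuration when you create the connector in AWS.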