hadoop, apache-apex, bigdata

Does Apache Apex rely on HDFS or does it have its own file system?


I understand that Apache Apex runs on Hadoop and YARN. Does it use HDFS for persistence and replication to protect against data loss, or does it have its own file system?


Solution

  • Apache Apex does not have its own file system. Streaming applications written with Apex use HDFS by default for checkpointing, persistence, and saving application-specific data.