HDFS IO error (hadoop 2.8) with Flume

I get the error below when I try to stream data into Hadoop through Flume.

I have created links in flume/lib that point to the .jar files in hadoop/share/hadoop/

I double-checked the URLs and I think they are all correct. I am posting this to get some more eyes on it and some feedback.

      2017-07-20 10:53:18,959 (SinkRunner-PollingRunner-DefaultSinkProcessor) [WARN -org.apache.flume.sink.hdfs.HDFSEventSink.process] HDFS IO error No FileSystem for scheme: hdfs
        at org.apache.hadoop.fs.FileSystem.getFileSystemClass(
        at org.apache.hadoop.fs.FileSystem.createFileSystem(
        at org.apache.hadoop.fs.FileSystem.access$200(
        at org.apache.hadoop.fs.FileSystem$Cache.getInternal(
        at org.apache.hadoop.fs.FileSystem$Cache.get(
        at org.apache.hadoop.fs.FileSystem.get(
        at org.apache.hadoop.fs.Path.getFileSystem(
        at org.apache.flume.sink.hdfs.BucketWriter$
        at org.apache.flume.sink.hdfs.BucketWriter$
        at org.apache.flume.sink.hdfs.BucketWriter$9$
        at org.apache.flume.auth.SimpleAuthenticator.execute(
        at org.apache.flume.sink.hdfs.BucketWriter$
        at java.util.concurrent.ThreadPoolExecutor.runWorker(
        at java.util.concurrent.ThreadPoolExecutor$

Here is the Flume sink config:

agent1.sinks.PurePathSink.type = hdfs
agent1.sinks.PurePathSink.hdfs.path = hdfs:// 
agent1.sinks.PurePathSink.hdfs.fileType = DataStream
agent1.sinks.PurePathSink.hdfs.filePrefix = export
agent1.sinks.PurePathSink.hdfs.fileSuffix = .txt
agent1.sinks.PurePathSink.hdfs.rollInterval = 120
agent1.sinks.PurePathSink.hdfs.rollSize = 131072

core-site.xml - Hadoop 2.8


        <description>A base for other temporary directories</description>

        <description>The FileSystem for file: uris.</description>

        <description>The FileSystem for hdfs: uris.</description>


  • In my case, I found that explicitly declaring the paths solved the issue. It had to do with which JAR it was picking up.

    Thanks @V.Bravo for your reply. I am not using a distribution; I am standing up a cluster of my own.

    • Moses
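
Since the answer above comes down to which JAR gets picked up, here is a rough sketch of how one might check the linked JARs before restarting the agent. It is not from the original posts. It assumes that Hadoop 2.x finds `FileSystem` implementations through a `META-INF/services/org.apache.hadoop.fs.FileSystem` entry inside the JARs (or through an explicit `fs.hdfs.impl` setting), and that the class behind `hdfs://` is `org.apache.hadoop.hdfs.DistributedFileSystem`. The function name `scan_jars` and the directory passed on the command line are placeholders of mine.

```python
import os
import sys
import zipfile

# Assumed names: the service file Hadoop reads to discover FileSystem
# implementations, and the class that normally handles hdfs:// URIs.
SERVICE_FILE = "META-INF/services/org.apache.hadoop.fs.FileSystem"
HDFS_CLASS = "org/apache/hadoop/hdfs/DistributedFileSystem.class"


def scan_jars(lib_dir):
    """Return {jar_name: [findings]} for jars that matter to the hdfs scheme."""
    report = {}
    for name in sorted(os.listdir(lib_dir)):
        if not name.endswith(".jar"):
            continue
        path = os.path.join(lib_dir, name)
        # isfile() follows symlinks, so a dangling link is reported here.
        if not os.path.isfile(path):
            report[name] = ["broken link or not a file"]
            continue
        try:
            with zipfile.ZipFile(path) as jar:
                entries = set(jar.namelist())
                findings = []
                if HDFS_CLASS in entries:
                    findings.append("contains DistributedFileSystem")
                if SERVICE_FILE in entries:
                    text = jar.read(SERVICE_FILE).decode("utf-8")
                    # Keep non-empty lines that are not comments.
                    impls = [ln.strip() for ln in text.splitlines()
                             if ln.strip() and not ln.strip().startswith("#")]
                    findings.append("registers: " + ", ".join(impls))
        except zipfile.BadZipFile:
            findings = ["not a readable jar"]
        if findings:
            report[name] = findings
    return report


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python scan_jars.py /path/to/flume/lib
    for jar_name, findings in scan_jars(sys.argv[1]).items():
        print(jar_name + ": " + "; ".join(findings))
```

If no JAR in the directory reports `DistributedFileSystem`, or a link turns out to be broken, that would be consistent with the "No FileSystem for scheme: hdfs" warning, though other causes are possible.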