Search code examples
hadoopmapreducehadoop-partitioning

Files through map function in map reduce


Is it possible to somehow pass a set of files through each map function. The requirement will be to process each file in parallel for different-2 operations. I am completely new to map reduce and i am using JAVA as my programming language.


Solution

  • If you want to get the same file as input to all mappers, for read only access, yes. You can add your files from your main (Driver) class to what is called Distributed Cache. More details can be found here.