Tags: java, hadoop, hdfs, distributed-computing

Read HDFS file splits


With HDFS's Java API, it's straightforward to read a file sequentially, one block at a time. Here's a simple example.
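
A minimal sketch of that sequential read, assuming fs is an already-open FileSystem handle (the path placeholder is illustrative):

    import org.apache.hadoop.fs.FSDataInputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    Path file = new Path("<path-to-file>");
    long blockSize = fs.getFileStatus(file).getBlockSize();
    byte[] buffer = new byte[(int) blockSize];

    // Read front to back, up to one block-sized chunk at a time.
    try (FSDataInputStream in = fs.open(file)) {
      int read;
      while ((read = in.read(buffer)) > 0) {
        // process `read` bytes of buffer...
      }
    }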

I want to be able to read the file one block at a time, using something like HDFS's FileSplits. The end goal is to read the file in parallel from multiple machines, each machine reading its own range of blocks. Given an HDFS Path, how can I get the FileSplits or blocks?

MapReduce and other processing frameworks are not involved; this is strictly a file-system-level operation.


Solution

  • This is how you get the block locations of a file in HDFS:

      import org.apache.hadoop.conf.Configuration;
      import org.apache.hadoop.fs.BlockLocation;
      import org.apache.hadoop.fs.FileStatus;
      import org.apache.hadoop.fs.FileSystem;
      import org.apache.hadoop.fs.Path;

      FileSystem fs = FileSystem.get(new Configuration());
      Path dataset = new Path(fs.getHomeDirectory(), "<path-to-file>");
      FileStatus datasetFile = fs.getFileStatus(dataset);

      // Ask the NameNode for all blocks, from offset 0 to the file's full length.
      BlockLocation[] myBlocks = fs.getFileBlockLocations(datasetFile, 0, datasetFile.getLen());
      for (BlockLocation b : myBlocks) {
        System.out.println("Length " + b.getLength());
        // Each block is replicated; getHosts() names the DataNodes holding a replica.
        for (String host : b.getHosts()) {
          System.out.println("host " + host);
        }
      }
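
    Each BlockLocation also reports the block's byte offset and length, so once a machine knows which blocks it owns it can seek straight to them. Here's a minimal sketch under the same setup; isAssignedToThisMachine is a hypothetical placeholder for whatever partitioning logic you use:

      import org.apache.hadoop.fs.FSDataInputStream;

      try (FSDataInputStream in = fs.open(dataset)) {
        for (BlockLocation b : myBlocks) {
          if (!isAssignedToThisMachine(b)) continue; // hypothetical partitioning hook
          byte[] chunk = new byte[(int) b.getLength()];
          in.seek(b.getOffset());  // jump to this block's first byte
          in.readFully(chunk);     // read exactly one block's worth of data
          // process chunk...
        }
      }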