Search code examples
hbase

can HBase region span node server


  1. a Region is composed of rows, a row is composed of column families
  2. every column family can stay in different machine
  3. a Region do not span multiple machines

point 1 and 2 seems contract to point 3, anything wrong?


Solution

  • All your three statements are actually correct, the difference is just in where column families are stored. Even though a row is composed of one or more column families, those column families are not actually stored together. So one column family will be written across multiple regions.

    This massively speeds up scans - HBase only needs to read the column family you're scanning, and not the data associated with other column families.