hadoop cloudera software-distribution mapr biginsights

What is the meaning of "Hadoop distribution"?


I am new to Hadoop. I recently read about the basics of Apache Hadoop, Pig, Hive, and HBase. Then I came across the term "Hadoop distribution", with Cloudera, MapR, and Hortonworks given as examples. So what is the relation between Apache Hadoop (and its ecosystem) and a "Hadoop distribution"?

Is it like the Java Virtual Machine specification (a document) versus the Oracle JVM and IBM JVM (working implementations of that document)? But the zips we get from Apache already contain the implemented logic.

So I am a bit confused.


Solution

  • According to "Distributions and Commercial Support", a number of companies provide products that include Apache Hadoop, a derivative work thereof, commercial support, and/or tools and utilities related to Hadoop.

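    Some companies release or sell products that include the official Apache Hadoop release files and/or their own and other useful tools. Other companies or organizations release products that include artifacts built from modified or extended versions of the Apache Hadoop source tree. Such derivative works are not supported by the Apache team: all support issues must be directed to the suppliers themselves. So, unlike the JVM case, a distribution is not a separate implementation of a specification; it is a vendor's packaging of the Apache code itself (sometimes modified), bundled with additional tools and support.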