Search code examples
hadoopclouderahortonworks-data-platform

How to check the hadoop distribution used in my cluster?


How can I know whether my cluster has been setup using Hortonworks,Cloudera or normal installation of hadoop components?

Also how can I know the port number of various services?


Solution

  • It is difficult to identify hadoop distribution from port number, since Apache, Hortonworks, Cloudera distros uses different port numbers

    Other options are to check for cluster management service agents (Cloudera Manager - agent start up script - /etc/init.d/cloudera-scm-agent , Hortonworks - Ambari agent start up script - /etc/init.d/ambari-agent, Vanilla Apache hadoop will not have any agents in the server

    Another option is to check hadoop classpath, below command can be used to get the classpath.

    `hadoop classpath`
    

    Most of hadoop distributions include distro name in the classpath, If classpath doesn't contains any of below keywords, distribution/setup will be Apache/Normal installation.

    hdp - (Hortonworks) 
    cdh - (Cloudera)