Search code examples
hadoophadoop-yarn

Why am I unable to connect to yarn?


I'm trying to connect to yarn by doing yarn application -list. But I cannot because it says:

<date> <time> INFO client.RMProxy: Connecting to ResourceManager at /0.0.0.0:8032
<date> <time> INFO ipc.Client: Retrying connecting to server: 0.0.0.0/0.0.0.0:8032. Already tried 0 time(s): retyr policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime= 1000 MILLISECONDS)
<date> <time> INFO ipc.Client: Retrying connecting to server: 0.0.0.0/0.0.0.0:8032. Already tried 1 time(s): retry policy is RetryUpToMaximumCount
<date> <time> INFO ipc.Client: Retrying connecting to server: 0.0.0.0/0.0.0.0:8032. Already tried 2 time(s): retry policy is RetryUpToMaximumCount

I have a file under /etc/hadoop/conf.empty/yarn-site.xml, which I assume is related to this in some way. I have a file at /etc/hadoop/conf.empty/ called yarn-env.sh. I tried running this file, but it didn't change anything.

Am I doing something wrong? Or maybe something is not correctly configured? How do I start yarn?


Solution

  • yarn-site.xml is for configuring YARN daemons ResourceManager, NodeManager and ApplicationMaster. The properties relating to these services go in here. And the environment settings for YARN can be modified with yarn-env.sh.

    Start YARN services, (From the path of yarn-site.xml file posted, the installation does not appear to be done using tarballs. So the startup scripts might not be available)

    On ResourceManager host

    sudo service hadoop-yarn-resourcemanager start
    

    And on each NodeManager host

    sudo service hadoop-yarn-nodemanager start
    

    Note: Make sure the preliminary configuration properties are set for both HDFS and YARN and the HDFS daemons Namenode and Datanode are started and running.

    Additionally, Configure the mapreduce to use yarn in mapred-site.xml

    <property>
     <name>mapreduce.framework.name</name>
     <value>yarn</value>
    </property>