Search code examples
hadooppermissionshdfssuperuser

Difference between Superuser and supergroup in Hadoop


What is supergroup and superuser in Hadoop/HDFS?


Solution

  • Superuser

    Based on the Hadoop official documentation:

    The super-user is the user with the same identity as the name node process itself. Loosely, if you started the name node, then you are the super-user. The super-user can do anything in that permissions checks never fail for the super-user.

    Supergroup

    Supergroup is the group of superusers. This group is used to ensure that the Hadoop Client has superuser access. It can be configured using dfs.permissions.superusergroup property in the core-site.xml file.


    References