Authorization in Hadoop

Authorization restricts access to resources. Hadoop provides authorization for both HDFS and all Hadoop services. This section looks at how to enable authorization in Hadoop so that shared resources are protected against unauthorized access.

Authorization in HDFS

The HDFS authorization model closely follows the POSIX model. In POSIX, each resource (a file or a directory) is associated with an owner and a group, and the same is true in HDFS. Permissions are set separately for each of the following identities:

  • The owner of the resource
  • The users who belong to the group associated with the resource
  • All other users within the system
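
To make the three identity classes concrete, here is a minimal sketch of how a POSIX-style check picks which set of permissions applies to a request. It is a simplified model for illustration, not Hadoop's implementation: the `Resource` class, the `permitted` function, and the example user and group names are all hypothetical, and the octal read/write/execute encoding is the usual POSIX convention rather than something taken from the text above.

```python
from dataclasses import dataclass

# POSIX-style permission bits (assumed encoding for this sketch).
READ, WRITE, EXECUTE = 4, 2, 1


@dataclass(frozen=True)
class Resource:
    """Hypothetical stand-in for a file or directory."""
    owner: str
    group: str
    mode: int  # octal mode, e.g. 0o640 = owner rw-, group r--, other ---


def permitted(res: Resource, user: str, user_groups: set, wanted: int) -> bool:
    """Return True if `user` holds every permission bit in `wanted`."""
    if user == res.owner:
        bits = (res.mode >> 6) & 0o7   # owner class
    elif res.group in user_groups:
        bits = (res.mode >> 3) & 0o7   # group class
    else:
        bits = res.mode & 0o7          # all other users
    return (bits & wanted) == wanted


# Usage: a resource owned by "alice", group "analysts", mode 640.
report = Resource(owner="alice", group="analysts", mode=0o640)
print(permitted(report, "alice", {"analysts"}, READ | WRITE))
print(permitted(report, "bob", {"analysts"}, WRITE))
print(permitted(report, "carol", set(), READ))
```

Exactly one class is consulted per request: the owner is checked against the owner permissions only, even when the group or other permissions are more generous.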

There are two permission levels, ...
