I have a Kerberized CDH cluster, where there are some daily oozie workflows running. All of them use shell
, impala-shell
, hive
and sqoop
to ingest data to Hive tables (lets call these tables SensitiveTables
)
Now, I want to create 2 new BI users to use the cluster and experiment with some other ingested data.
The requirement is that these new BI users:
SensitiveTables
spark-submit
jobs to the clusterApart from setting-up Apache Sentry (which is the recommended way to go), is there any chance to meet those requirements using file-permissions or ACL and Service Level Authorization ?
So far, I managed (via hadoop fs -chmod o-rwx /user/hive/warehouse/sensitive
) to restrict access to SensitiveTables
via Hive (which uses user impersonation), but failed to do so via Impala (which submits all jobs to the cluster as user impala
). Is there anything else I should try?
Thank you,
Gee
After a lot of research and based on the assumptions I described, the answer is NO. Furthermore, the metastore can not be protected this way.