Access Control Layer on Top of Pig Using XACML

1067 Words3 Pages

Apache Hadoop is an open-source software framework for storage and large scale processing of data-sets on clusters of commodity hardware. Hadoop, an Apache top-level project is built and used by a global community of contributors and users. Rather than relying on hardware to deliver high-availability, the library is designed to detect and handle failures at the application layer itself. It delivers a highly-available service on top of a cluster of computers, each of which may be prone to failures.
A small Hadoop cluster has a single master and multiple worker nodes. The master node consists of a JobTracker, TaskTracker, NameNode and DataNode. A slave or worker node acts as both a DataNode and TaskTracker, though it is possible to have data-only worker nodes and compute-only worker nodes. These are normally used only in nonstandard applications. Hadoop requires Java Runtime Environment (JRE) 1.6 or higher. The standard start-up and shutdown scripts require ssh to be set up between nodes in the cluster.
The Apache Hadoop framework is composed of the modules Hadoop Common which contains libraries and utilities for other Hadoop modules, Hadoop MapReduce is a programming model for large scale data processing, Hadoop Distributed File System (HDFS) is a distributed file-system which stores data that provides very high aggregate bandwidth across the cluster and Hadoop YARN, a resource-management platform that manages computer resources in clusters and uses them for scheduling of user applications.
The Hadoop distributed file system is a distributed, scalable, and portable file-system written in Java for the Hadoop framework. Each node in a Hadoop instance has a single namenode; a cluster of datanodes form the HDFS cluster as shown in Figu...

... middle of paper ...

....
[12]
More information on the Apache
Pig
can be found at http://hortonworks.com/hadoop/pig/ on the Web.
[13] More information on the XML and Security: introduction to XACML-
Access
Control
Policies
in
XML
can be found https://community.emc.com/docs/DOC-7314 at and http://dimacs.rutgers.edu/Workshops/Commerce/slides/crampton.pdf on the
Web.
[14]More information on the XACML
Policy
language can be found at http://wso2.com/library/articles/2011/10/understanding-xacml-policy-language-xacml- extended-assertion-markup-langue-part-1/ on the Web.
[15] More information on the Authorization and Authentication In Hadoop can be found at http://blog.cloudera.com/blog/2012/03/authorization-and-authentication-in-hadoop/ on the Web.
[16]
More information on the A
HDFS
Architecture can be found http://hadoop.apache.org/docs/r1.2.1/hdfs_design.html#NameNode+and+DataNodes on the Web.

More about Access Control Layer on Top of Pig Using XACML

Open Document