Google File System


Google File System (GFS) was developed at Google to meet its large-scale data processing needs. The Hadoop Distributed File System (HDFS) was developed largely at Yahoo! Inc. and is maintained as an open-source project by the Apache Software Foundation. Hadoop was modeled on Google's GFS and MapReduce, with HDFS as the counterpart of GFS. As the volume of internet data grew rapidly, Google needed a way to store it and built a distributed file system, GFS; HDFS was later developed to serve a broader range of clients. Both run on commodity hardware, so component failures are common. To keep the system reliable, data is replicated across multiple nodes; by default, each piece of data is stored as three replicas. These file systems are designed to hold millions of files, many of them very large. Data is read far more often than it is written. Both large streaming reads and small random reads are supported.

Working of GFS: GFS consists of a master node and chunk servers, which are accessed by multiple clients. The client asks the master for the location of a chunk by sending the file name and the index of the chunk it needs. Master stores the nam...
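The lookup described above can be illustrated with a small Python sketch. It is a toy model rather than Google's implementation: the class ToyMaster, the function chunk_index_for, the server names, and the round-robin placement are all hypothetical, and the 64 MB chunk size is the figure reported for GFS. A real master also hands back a chunk handle, which is omitted here for brevity.

```python
from dataclasses import dataclass, field

CHUNK_SIZE = 64 * 1024 * 1024  # 64 MB, the chunk size reported for GFS
REPLICATION = 3                # default replica count mentioned in the text


@dataclass
class ToyMaster:
    """Hypothetical in-memory master: maps (file name, chunk index) to replicas."""
    servers: list
    table: dict = field(default_factory=dict)

    def add_chunk(self, filename, chunk_index):
        # Place replicas on REPLICATION distinct servers, chosen round-robin.
        # Assumes at least REPLICATION servers are available.
        start = len(self.table) % len(self.servers)
        replicas = [self.servers[(start + i) % len(self.servers)]
                    for i in range(REPLICATION)]
        self.table[(filename, chunk_index)] = replicas
        return replicas

    def lookup(self, filename, chunk_index):
        # The master returns only locations; data is read from the chunk servers.
        return self.table[(filename, chunk_index)]


def chunk_index_for(offset):
    """Client side: translate a byte offset into a chunk index."""
    return offset // CHUNK_SIZE


master = ToyMaster(servers=["cs1", "cs2", "cs3", "cs4"])
master.add_chunk("/logs/web.log", 0)
master.add_chunk("/logs/web.log", 1)

# The client wants the byte at offset 100 MB, which falls in the second chunk.
index = chunk_index_for(100 * 1024 * 1024)
locations = master.lookup("/logs/web.log", index)
print(index, locations)
```

The point of the design is that the master handles only this small metadata request; the client then reads the chunk directly from one of the returned chunk servers, so file data never flows through the master.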
