Hadoop Overview Hadoop File System was developed using distributed file system design. It is run on commodity hardware. Unlike other distributed systems, HDFS is highly fault tolerant and designed using low-cost hardware. HDFS holds very large amount of data and provides easier access. To store such huge data, the files are stored across multiple machines. These files are stored in redundant fashion to rescue the system from possible data losses in case of failure. HDFS also makes applications available to parallel processing. The important features of HDFS are HDFS is mainly suited for distributed storage and processing of the data. Streaming access to file system data. HDFS follows master-slave architecture Namenode and Datanode help users to easily check the heartbeat of the nodes in the cluster Namenode: Manages the file system namespace. Regulates client’s access to files. It also executes file system operations such as renaming, closing, and opening f