What is HDFS?
Hadoop Distributed File System is a distributed file system and mainly storage system of Hadoop. HDFS is based on write-once and read many-times pattern. Unlike other distributed systems, HDFS is highly fault tolerant and designed using low-cost commodity hardware.
HDFS holds very large amount of data and provides easier access. To store such huge data, the files are stored across multiple machines. These files are stored in redundant fashion to avoid possible data losses in case of failure. HDFS also makes applications available for parallel processing.
Features Of HDFS
- It is suitable for the distributed storage and processing.
- It is good for large files
- It is not suitable for lots of small files, random access and low latency access.
- Streaming access to file system data.
- HDFS provides file permissions and authentication.
- It is highly fault tolerant