Table 2.
Hadoop component | Functions |
---|---|
(1) HDFS | Storage and replication |
(2) MapReduce | Distributed processing and fault tolerance |
(3) HBASE | Fast read/write access |
(4) HCatalog | Metadata |
(5) Pig | Scripting |
(6) Hive | SQL |
(7) Oozie | Workflow and scheduling |
(8) ZooKeeper | Coordination |
(9) Kafka | Messaging and data integration |
(10) Mahout | Machine learning |