What is Hadoop?
Hadoop ,originally developed by yahoo based on a paper published on google file system, is primarily
a distributed file system which provides fast access to the file and the data within. However, at
present it refers to a whole set of software that help in taking the full advantage of the
possibilities offered by Hadoop.
It includes frameworks like Pig and Hive, database implementations like HBase , management tools like
Ambari and Zookeeper as well as several use case specific libraries and tools.