Skip to content

Hadoop #
Find similar titles

You are seeing an old version of the page. Go to latest version

Apache Hadoop is a set of algorithms (an Open source software framework written in Java) for distributed storage and distributed processing of very large data sets (Big Data) on computer clusters built from commodity hardware.

The core of Apache Hadoop consists of a storage part (Hadoop Distributed File System (HDFS)) and a processing part (MapReduce).