Apache Hadoop is an open-source software framework for reliable, scalable distributed processing of large data sets across clusters of computers using simple programming models.
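
The best-known of these programming models is MapReduce, in which a job is expressed as a map step that emits key-value pairs and a reduce step that combines all values sharing a key. The following is a minimal sketch of that idea in plain Python, run in a single process. It does not use any Hadoop API, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, chosen only for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group emitted values by key, the step a framework performs between map and reduce."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence (illustrative, single process only)."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data big clusters", "data nodes"]))
```

In a real cluster, the map and reduce steps would run in parallel on many machines, with the framework handling data distribution, grouping, and recovery from failures; the sketch only shows the shape of the logic a programmer writes.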