Hadoop is a great open source project from Apache. Using MapReduce, it splits a resource-intensive job across multiple machines. It follows a master/slave design: a tracker on the master node divides the job into tasks and assigns them to the slave nodes. It uses the Hadoop Distributed File System (HDFS) to distribute the data across those nodes, which then process it in parallel. Yahoo is the biggest user of Hadoop and also its largest contributor. Other users include Facebook, Amazon, Last.fm, IBM, The New York Times, Veoh, and Joost.
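To make the divide, assign, and process-in-parallel idea concrete, here is a toy word count in plain Python. It is only a sketch of the MapReduce pattern and does not use Hadoop's real API: every function name (split_input, map_task, shuffle, reduce_task, run_job) is hypothetical, a thread pool stands in for the slave nodes, and HDFS is not modeled at all.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def split_input(lines, num_splits):
    """Divide the input into roughly equal chunks, one per worker."""
    return [lines[i::num_splits] for i in range(num_splits)]


def map_task(chunk):
    """Map phase: emit a (word, 1) pair for every word in the chunk."""
    return [(word.lower(), 1) for line in chunk for word in line.split()]


def shuffle(mapped_outputs):
    """Group all emitted values by key across every map output."""
    grouped = defaultdict(list)
    for pairs in mapped_outputs:
        for key, value in pairs:
            grouped[key].append(value)
    return grouped


def reduce_task(key, values):
    """Reduce phase: combine all values collected for one key."""
    return key, sum(values)


def run_job(lines, num_workers=3):
    """Play the master's role: split the job, assign tasks, merge results."""
    chunks = split_input(lines, num_workers)
    # Threads stand in for slave nodes here (an assumption of this sketch).
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        mapped = list(pool.map(map_task, chunks))
    grouped = shuffle(mapped)
    return dict(reduce_task(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["hadoop splits the job", "the master assigns the job"]
    print(run_job(sample))
```

In a real cluster the chunks would be blocks of a file stored in HDFS, and the map and reduce tasks would run on separate machines rather than threads, but the flow of splitting, mapping, grouping by key, and reducing is the same in spirit.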