Hadoop!

The MapReduce and distributed filesytem parts of Nutch (inspired by projects from Google) have been split into a new project, called Hadoop.

In a previous blog[2]
I wrote about Nutch's MapReduce implementation, for distributed processing of massive data sets. This, and the closely related Nutch Distributed File System (renamed Hadoop Distributed File System), have now been moved[3] into a standalone project called Hadoop[4].

According to Doug Cutting, who created Hadoop (as well as Lucene and Nutch), the name comes from:

The name my kid gave a stuffed yellow elephant. Short, relatively easy to spell and pronounce, meaningless, and not used elsewhere: those are my naming criteria. Kids are good at generating such. Googol is a kid's term.