Hadoop - A Primer
Wednesday, October 15, 2008 at 3:38AM
peterb_2008 in storage cluster web web20 hpc sun microsystems blueprin
Hadoop is a distributed computing platform written in Java. It incorporates features similar to those of the
Google File System and of MapReduce to process vast amounts of data
"Hadoop is a Free Java software framework that supports data intensive distributed applications running on large clusters of commodity computers. It enables applications to easily scale out to thousands of nodes and petabytes of data" (Wikipedia)
* What platform does Hadoop run on?
* Java 1.5.x or higher, preferably from Sun
* Linux
* Windows for development
* Solaris
Article originally appeared on (http://highscalability.com/).
See website for complete article licensing information.