Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.
The following page has been changed by SameerParanjpye:
http://wiki.apache.org/lucene-hadoop/GettingStartedWithHadoop
------------------------------------------------------------------------------
= Downloading and installing Hadoop =
Hadoop can be downloaded from [http://www.apache.org/dyn/closer.cgi/lucene/hadoop/ here].
You may also download a nightly build from [http://cvs.apache.org/dist/lucene/hadoop/nightly/
here] or check out the code from [http://lucene.apache.org/hadoop/version_control.html subversion]
and build it with [http://ant.apache.org Ant]. Select a directory to install Hadoop under
(let's call it <installdir>) and untar the tarball in that directory. This will create
a directory called hadoop-<version> under <installdir>. All scripts and tools
needed to run Hadoop are present in the directory hadoop-<version>/bin. This directory
will subsequently be referred to as "hadoop/bin" in this document.
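As a concrete sketch of the install step (the directory /usr/local and the version 0.12.0 below are illustrative assumptions, not requirements):
{{{
# Illustrative sketch: /usr/local stands in for <installdir> and 0.12.0
# for <version>; substitute your own install directory and release.
cd /usr/local
tar xzf hadoop-0.12.0.tar.gz

# The scripts referred to as "hadoop/bin" now live here:
ls /usr/local/hadoop-0.12.0/bin
}}}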
= Starting Hadoop using Hadoop scripts =
This section explains how to set up a Hadoop cluster running Hadoop DFS and Hadoop MapReduce.
The startup scripts are in hadoop/bin. The slaves file in hadoop/conf lists all the slave nodes
that will join the DFS and MapReduce cluster; edit this file to add nodes to your cluster.
The slaves file only needs to be edited on the machines you plan to run the JobTracker and
NameNode on. If you want to run a single-node cluster, you do not need to edit the slaves
file at all.
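For example, the slaves file for a three-node cluster contains nothing but the slave host names, one per line (the names below are illustrative):
{{{
slave01.example.com
slave02.example.com
slave03.example.com
}}}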
Next, edit the file hadoop-env.sh in the hadoop/conf directory and make sure JAVA_HOME is
set correctly. The other environment variables can be changed to suit your requirements.
HADOOP_HOME is determined automatically from the location of the hadoop scripts you run,
so it does not need to be set here.
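For example, the relevant line in hadoop/conf/hadoop-env.sh might look like this (the JDK path is an illustrative assumption; point it at your own Java installation):
{{{
# In hadoop/conf/hadoop-env.sh: the root of the Java installation.
# The path below is illustrative.
export JAVA_HOME=/usr/java/jdk1.5.0
}}}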