Running a MapReduce Job

Running a MapReduce Job (Nov 2015)

Test on:

  • Ubuntu 14.04.3 x64
  • Hadoop 2.7.1 (Pseudo-Distributed Mode)

I will use one of the examples that come with Hadoop package.

1. Preparation
2. Pi
3. WordCount

3.1 Download example input data
3.2 Copy local example data to HDFS
3.3 Run the MapReduce job
3.4 Retrieve the job result from HDFS

1. Preparation

change directory to $HADOOP_INSTALL

Continue reading