An Bootstrap implementation for apache hadoop or apache mahout projects. You can easily start your own hadoop project with this, without configuration.
- JDK (>= 7)
- maven (>= 3.0)
- hadoop (>= 0.23.203)
To run unit tests on MapReduce code, you must install mrunit on your local maven repository first. To install it, execute following commands:
git clone git://github.com/apache/mrunit.git
git checkout [tag-name] # for example, 'release-1.0.0-hadoop1'
mvn install
mvn clean package
hadoop fs -copyFromLocal testtxt.seq /testtxt.seq # copy included file onto your hdfs
hadoop jar target/hadoop-quickstart-1.0-job.jar com.github.dongjinleekr.hadoop.quickstart.WordCountJob -e 10 -i /testtxt.seq -o /counted.out
You can check the output with following command (requires mahout):
mahout seqdumper -i /counted.out/part-r-00000
Copyright (C) 2013 Dongjin Lee. [email protected]
Licensed under the Apache License, Version 2.0