hiped2

Source code that accompanies the book "Hadoop in Practice, Second Edition".

  • Owner: alexholmes/hiped2
  • Platform:
  • License:: Apache License 2.0
  • Category::
  • Topic:
  • Like:
    0
      Compare:

Github stars Tracking Chart

Source code for Hadoop in Practice, Second Edition

This project contains the source code that accompanies the book
Hadoop in Practice, Second Edition.

License

Apache version 2.0 (for more details look at the license).

Usage

Tarball

The easiest way to start working with the examples is to download a tarball distribution of this project.
Doing so will mean that running your first example is just three steps away:

  1. Go to the releases and download the most recent tarball.

  2. Extract the contents ot the tarball.

     $ tar -xzvf hip-<version>-package.tar.gz
    
  3. The examples in the book all use the hip script located in bin/hip to
    execute the examples. While it's not required, it's recommended that you
    add hip-<version>/bin to your
    path so that you can simply execute hip and execute the examples in the
    book by directly copy-pasting the commands.

  4. Run the "hello world" example, which is

$ cd hip-<version>

# create two input files in HDFS
$ hadoop fs -mkdir -p hip1/input
$ echo "cat sat mat", hadoop fs -put - hip1/input/1.txt
$ echo "dog lay mat", hadoop fs -put - hip1/input/2.txt

# run the inverted index example
$ ./hip hip.ch1.InvertedIndexJob --input hip1/input --output hip1/output

# examine the results in HDFS
$ hadoop fs -cat hip1/output/part*

Done! The tarball also includes the sources and JavaDocs.

Building your own distribution

Here you're going to checkout the trunk and then use Maven to run a build.

  1. Checkout the code.

     $ git clone git@github.com:alexholmes/hiped2.git
    
  2. Build the code and distribution tarball.

$ cd hiped2
$ mvn clean
$ mvn validate
$ mvn package

The JAR's and tarball will be under the target directory. Now you can follow the instructions in the
"Tarball" section above to explode the tarball and run an example.

What's next?

At this point check out the book for more examples and how you can execute them. Or if you find any issues then
please go to the issues and open a new issue.

Main metrics

Overview
Name With Owneralexholmes/hiped2
Primary LanguageJava
Program languageShell (Language Count: 5)
Platform
License:Apache License 2.0
所有者活动
Created At2014-01-26 03:24:17
Pushed At2014-09-10 15:30:24
Last Commit At2014-09-10 08:27:14
Release Count9
Last Release Namev2.0.8 (Posted on )
First Release Namev2.0.0 (Posted on )
用户参与
Stargazers Count80
Watchers Count21
Fork Count73
Commits Count35
Has Issues Enabled
Issues Count5
Issue Open Count4
Pull Requests Count0
Pull Requests Open Count0
Pull Requests Close Count0
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private