hadoop-book

Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White

Hadoop Book Example Code

This repository contains the example code for Hadoop: The Definitive Guide, Fourth Edition
by Tom White (O'Reilly, 2014).

Code for the First, Second, and Third Editions is also available.

Note that the chapter names and numbering have changed between editions; see
Chapter Numbers By Edition.

Building and Running

To build the code, first install Maven and Java. Then type:

% mvn package -DskipTests

This will do a full build and create example JAR files in the top-level directory (e.g.
hadoop-examples.jar).
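
Before starting the build, it can help to confirm that both prerequisites are on the PATH. The following is a minimal sketch, assuming a POSIX shell; `check_tool` is a hypothetical helper name used for illustration and is not part of this repository.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository): prints "ok" if the
# named command is found on PATH, and "missing" otherwise.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "ok"
    else
        echo "missing"
    fi
}

# Check the two prerequisites named above before running the Maven build.
for tool in mvn java; do
    echo "$tool: $(check_tool "$tool")"
done
```

If either tool is reported as missing, install it before running the build command.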

To run the examples from a particular chapter, first install the component
needed for the chapter (e.g. Hadoop, Pig, or Hive), then run the command lines shown
in the chapter.

Sample datasets are provided in the input directory, but the full weather dataset
is not included because of size restrictions. You can find information about how to obtain
the full weather dataset on the book's website at
[http://www.hadoopbook.com/](http://www.hadoopbook.com/).

Hadoop Component Versions

This edition of the book works with Hadoop 2. It has not been tested extensively with
Hadoop 1, although most of it should work.

For the precise versions of each component that the code has been tested with, see
book/pom.xml.

Copyright (C) 2014 Tom White
