Apache Pig

Apache Pig 的镜像。Pig 是一种用于处理超大文件的数据流编程环境。「Mirror of Apache Pig. Pig is a dataflow programming environment for processing very large files.」

  • Owner: apache/pig
  • Platform:
  • License:: Apache License 2.0
  • Category::
  • Topic:
  • Like:
    0
      Compare:

Github stars Tracking Chart

Apache Pig

Pig is a dataflow programming environment for processing very large files. Pig's
language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph where each node represents an operation that transforms data.
Operations are of two flavors: (1) relational-algebra style operations such as
join, filter, project; (2) functional-programming style operators such as map,
reduce.

Pig compiles these dataflow programs into (sequences of) map-reduce or Apache Tez
jobs and executes them using Hadoop. It is also possible to execute Pig Latin
programs in a "local" mode (without Hadoop cluster), in which case all
processing takes place in a single local JVM.

General Info

For the latest information about Pig, please visit our website at:

http://pig.apache.org/

and our wiki, at:

http://wiki.apache.org/pig/

Getting Started

  1. To learn about Pig, try http://wiki.apache.org/pig/PigTutorial
  2. To build and run Pig, try http://wiki.apache.org/pig/BuildPig and
    http://wiki.apache.org/pig/RunPig
  3. To check out the function library, try http://wiki.apache.org/pig/PiggyBank

Contributing to the Project

We welcome all contributions. For the details, please, visit
https://cwiki.apache.org/confluence/display/PIG/HowToContribute

Overview

Name With Ownerapache/pig
Primary LanguageJava
Program languageShell (Language Count: 14)
Platform
License:Apache License 2.0
Release Count59
Last Release Namerelease-0.17.0 (Posted on 2017-06-14 11:08:58)
First Release NamePIG-1045 (Posted on 2010-10-06 20:21:57)
Created At2009-05-21 01:10:43
Pushed At2023-07-15 20:46:37
Last Commit At
Stargazers Count667
Watchers Count80
Fork Count450
Commits Count3.7k
Has Issues Enabled
Issues Count0
Issue Open Count0
Pull Requests Count0
Pull Requests Open Count10
Pull Requests Close Count30
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private
To the top