Apache Pig

Apache Pig 的镜像。Pig 是一种用于处理超大文件的数据流编程环境。「Mirror of Apache Pig. Pig is a dataflow programming environment for processing very large files.」

Github星跟蹤圖

Apache Pig

Pig is a dataflow programming environment for processing very large files. Pig's
language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph where each node represents an operation that transforms data.
Operations are of two flavors: (1) relational-algebra style operations such as
join, filter, project; (2) functional-programming style operators such as map,
reduce.

Pig compiles these dataflow programs into (sequences of) map-reduce or Apache Tez
jobs and executes them using Hadoop. It is also possible to execute Pig Latin
programs in a "local" mode (without Hadoop cluster), in which case all
processing takes place in a single local JVM.

General Info

For the latest information about Pig, please visit our website at:

http://pig.apache.org/

and our wiki, at:

http://wiki.apache.org/pig/

Getting Started

  1. To learn about Pig, try http://wiki.apache.org/pig/PigTutorial
  2. To build and run Pig, try http://wiki.apache.org/pig/BuildPig and
    http://wiki.apache.org/pig/RunPig
  3. To check out the function library, try http://wiki.apache.org/pig/PiggyBank

Contributing to the Project

We welcome all contributions. For the details, please, visit
https://cwiki.apache.org/confluence/display/PIG/HowToContribute

主要指標

概覽
名稱與所有者apache/pig
主編程語言Java
編程語言Shell (語言數: 14)
平台
許可證Apache License 2.0
所有者活动
創建於2009-05-21 09:10:43
推送於2025-09-15 19:23:44
最后一次提交
發布數59
最新版本名稱release-0.17.0 (發布於 2017-06-14 19:08:58)
第一版名稱PIG-1045 (發布於 2010-10-07 04:21:57)
用户参与
星數686
關注者數74
派生數447
提交數3.8k
已啟用問題?
問題數0
打開的問題數0
拉請求數0
打開的拉請求數9
關閉的拉請求數31
项目设置
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?