Apache Pig

Apache Pig 的镜像。Pig 是一种用于处理超大文件的数据流编程环境。「Mirror of Apache Pig. Pig is a dataflow programming environment for processing very large files.」

  • 所有者: apache/pig
  • 平台:
  • 許可證: Apache License 2.0
  • 分類:
  • 主題:
  • 喜歡:
    0
      比較:

Github星跟蹤圖

Apache Pig

Pig is a dataflow programming environment for processing very large files. Pig's
language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph where each node represents an operation that transforms data.
Operations are of two flavors: (1) relational-algebra style operations such as
join, filter, project; (2) functional-programming style operators such as map,
reduce.

Pig compiles these dataflow programs into (sequences of) map-reduce or Apache Tez
jobs and executes them using Hadoop. It is also possible to execute Pig Latin
programs in a "local" mode (without Hadoop cluster), in which case all
processing takes place in a single local JVM.

General Info

For the latest information about Pig, please visit our website at:

http://pig.apache.org/

and our wiki, at:

http://wiki.apache.org/pig/

Getting Started

  1. To learn about Pig, try http://wiki.apache.org/pig/PigTutorial
  2. To build and run Pig, try http://wiki.apache.org/pig/BuildPig and
    http://wiki.apache.org/pig/RunPig
  3. To check out the function library, try http://wiki.apache.org/pig/PiggyBank

Contributing to the Project

We welcome all contributions. For the details, please, visit
https://cwiki.apache.org/confluence/display/PIG/HowToContribute

概覽

名稱與所有者apache/pig
主編程語言Java
編程語言Shell (語言數: 14)
平台
許可證Apache License 2.0
發布數59
最新版本名稱release-0.17.0 (發布於 2017-06-14 11:08:58)
第一版名稱PIG-1045 (發布於 2010-10-06 20:21:57)
創建於2009-05-21 01:10:43
推送於2023-07-15 20:46:37
最后一次提交
星數667
關注者數80
派生數450
提交數3.7k
已啟用問題?
問題數0
打開的問題數0
拉請求數0
打開的拉請求數10
關閉的拉請求數30
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?
去到頂部