Apache Pig

Apache Pig 的镜像。Pig 是一种用于处理超大文件的数据流编程环境。「Mirror of Apache Pig. Pig is a dataflow programming environment for processing very large files.」

  • 所有者: apache/pig
  • 平台:
  • 许可证: Apache License 2.0
  • 分类:
  • 主题:
  • 喜欢:
    0
      比较:

Github星跟踪图

Apache Pig

Pig is a dataflow programming environment for processing very large files. Pig's
language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph where each node represents an operation that transforms data.
Operations are of two flavors: (1) relational-algebra style operations such as
join, filter, project; (2) functional-programming style operators such as map,
reduce.

Pig compiles these dataflow programs into (sequences of) map-reduce or Apache Tez
jobs and executes them using Hadoop. It is also possible to execute Pig Latin
programs in a "local" mode (without Hadoop cluster), in which case all
processing takes place in a single local JVM.

General Info

For the latest information about Pig, please visit our website at:

http://pig.apache.org/

and our wiki, at:

http://wiki.apache.org/pig/

Getting Started

  1. To learn about Pig, try http://wiki.apache.org/pig/PigTutorial
  2. To build and run Pig, try http://wiki.apache.org/pig/BuildPig and
    http://wiki.apache.org/pig/RunPig
  3. To check out the function library, try http://wiki.apache.org/pig/PiggyBank

Contributing to the Project

We welcome all contributions. For the details, please, visit
https://cwiki.apache.org/confluence/display/PIG/HowToContribute

概览

名称与所有者apache/pig
主编程语言Java
编程语言Shell (语言数: 14)
平台
许可证Apache License 2.0
发布数59
最新版本名称release-0.17.0 (发布于 2017-06-14 11:08:58)
第一版名称PIG-1045 (发布于 2010-10-06 20:21:57)
创建于2009-05-21 01:10:43
推送于2023-07-15 20:46:37
最后一次提交
星数667
关注者数80
派生数450
提交数3.7k
已启用问题?
问题数0
打开的问题数0
拉请求数0
打开的拉请求数10
关闭的拉请求数30
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?
去到顶部