simplesparkapp

Simple Spark Application

  • 所有者: sryza/simplesparkapp
  • 平台:
  • 許可證: Apache License 2.0
  • 分類:
  • 主題:
  • 喜歡:
    0
      比較:

Github星跟蹤圖

Simple Spark Application

A simple Spark application that counts the occurrence of each word in a corpus and then counts the
occurrence of each character in the most popular words. Includes the same program implemented in
Java and Scala.

To make a jar:

mvn package

To run from a gateway node in a CDH5 cluster:

spark-submit --class com.cloudera.sparkwordcount.SparkWordCount --master local \
  target/sparkwordcount-0.0.1-SNAPSHOT.jar <input file> 2

This will run the application in a single local process. If the cluster is running a Spark standalone
cluster manager, you can replace "--master local" with "--master spark://<master host>:<master port>".

If the cluster is running YARN, you can replace "--master local" with "--master yarn".

主要指標

概覽
名稱與所有者sryza/simplesparkapp
主編程語言Java
編程語言Scala (語言數: 2)
平台
許可證Apache License 2.0
所有者活动
創建於2014-04-03 21:07:09
推送於2023-12-17 16:45:24
最后一次提交2014-08-12 10:15:39
發布數0
用户参与
星數76
關注者數12
派生數94
提交數13
已啟用問題?
問題數3
打開的問題數3
拉請求數2
打開的拉請求數2
關閉的拉請求數0
项目设置
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?