simplesparkapp

Simple Spark Application

A simple Spark application that counts the occurrences of each word in a corpus and then counts the
occurrences of each character in the most popular words. The same program is implemented in both
Java and Scala.
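
The two counting stages can be illustrated with a minimal plain-Java sketch that uses only java.util streams instead of Spark, so it runs without a cluster. It is not the repository's code. The class name WordCharCountSketch, both method names, the "count >= threshold" definition of a popular word, and the choice to count each qualifying word's characters once are all assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Function;
import java.util.stream.Collectors;

// Hypothetical illustration class; not part of the simplesparkapp sources.
class WordCharCountSketch {

    // Stage 1: split each line on spaces and count how often each word appears.
    static Map<String, Long> countWords(List<String> lines) {
        return lines.stream()
                .flatMap(line -> Arrays.stream(line.split(" ")))
                .filter(word -> !word.isEmpty())
                .collect(Collectors.groupingBy(
                        Function.identity(), TreeMap::new, Collectors.counting()));
    }

    // Stage 2: keep words seen at least `threshold` times (assumed definition of
    // "popular"), then count characters across those words, one pass per distinct word.
    static Map<Character, Long> countCharsInPopularWords(
            Map<String, Long> wordCounts, long threshold) {
        return wordCounts.entrySet().stream()
                .filter(entry -> entry.getValue() >= threshold)
                .flatMap(entry -> entry.getKey().chars().mapToObj(c -> (char) c))
                .collect(Collectors.groupingBy(
                        Function.identity(), TreeMap::new, Collectors.counting()));
    }

    public static void main(String[] args) {
        List<String> lines = List.of("the cat and the hat", "the hat and a bat");
        Map<String, Long> wordCounts = countWords(lines);
        Map<Character, Long> charCounts = countCharsInPopularWords(wordCounts, 2);
        System.out.println(wordCounts);
        System.out.println(charCounts);
    }
}
```

In a Spark version, the same two stages would typically be expressed as transformations on a distributed dataset rather than on an in-memory list; the sketch only shows the shape of the computation.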

To make a jar:

mvn package

To run from a gateway node in a CDH5 cluster:

spark-submit --class com.cloudera.sparkwordcount.SparkWordCount --master local \
  target/sparkwordcount-0.0.1-SNAPSHOT.jar <input file> 2

This will run the application in a single local process. If the cluster is running a Spark standalone
cluster manager, you can replace "--master local" with "--master spark://<master host>:<master port>".

If the cluster is running YARN, you can replace "--master local" with "--master yarn".

Key metrics

Overview
Name and owner: sryza/simplesparkapp
Primary language: Java
Languages: Scala (language count: 2)
License: Apache License 2.0
Owner activity
Created: 2014-04-04 05:07:09
Pushed: 2023-12-18 00:45:24
Last commit: 2014-08-13 01:15:39
Releases: 0
User engagement
Stars: 76
Watchers: 12
Forks: 91
Commits: 13
Issues: 3
Open issues: 3
Pull requests: 2
Open pull requests: 2
Closed pull requests: 0