simplesparkapp

Simple Spark Application

  • Owner: sryza/simplesparkapp
  • License: Apache License 2.0

A simple Spark application that counts the occurrences of each word in a corpus and then counts the
occurrences of each character in the most popular words. The same program is implemented in both
Java and Scala.
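
The two counting stages described above can be sketched without Spark. The following is a plain-Java illustration using standard collections, not the application's actual Spark code. It assumes that "most popular" means words occurring at least `threshold` times and that characters are counted once per distinct popular word. The class and method names (`WordCharCountSketch`, `wordCounts`, `charCounts`) are hypothetical.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Function;
import java.util.stream.Collectors;

// Plain-Java illustration of the two counting stages; this is NOT Spark code.
class WordCharCountSketch {

    // Stage 1: count how many times each whitespace-separated word occurs.
    static Map<String, Long> wordCounts(List<String> lines) {
        return lines.stream()
                .flatMap(line -> Arrays.stream(line.trim().split("\\s+")))
                .filter(word -> !word.isEmpty())
                .collect(Collectors.groupingBy(
                        Function.identity(), TreeMap::new, Collectors.counting()));
    }

    // Stage 2: keep words seen at least `threshold` times (assumed meaning of
    // "most popular"), then count each character across those distinct words.
    static Map<Character, Long> charCounts(List<String> lines, int threshold) {
        return wordCounts(lines).entrySet().stream()
                .filter(entry -> entry.getValue() >= threshold)
                .flatMap(entry -> entry.getKey().chars()
                        .mapToObj(c -> Character.valueOf((char) c)))
                .collect(Collectors.groupingBy(
                        Function.identity(), TreeMap::new, Collectors.counting()));
    }

    public static void main(String[] args) {
        List<String> corpus = List.of("to be or not to be", "to see or not to see");
        System.out.println(wordCounts(corpus));
        System.out.println(charCounts(corpus, 2));
    }
}
```

In a Spark version, each stage would typically be a distributed transformation over the corpus rather than an in-memory stream, but the shape of the computation (count words, filter by popularity, count characters) stays the same.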

To make a jar:

    mvn package

To run from a gateway node in a CDH5 cluster:

    spark-submit --class com.cloudera.sparkwordcount.SparkWordCount --master local \
      target/sparkwordcount-0.0.1-SNAPSHOT.jar <input file> 2

This runs the application in a single local process. If the cluster is running the Spark standalone
cluster manager, you can replace "--master local" with "--master spark://<master host>:<master port>".

If the cluster is running YARN, you can replace "--master local" with "--master yarn".

Main metrics

Overview
Name With Owner: sryza/simplesparkapp
Primary Language: Java
Program language: Scala (Language Count: 2)
Platform
License: Apache License 2.0
Owner Activity
Created At: 2014-04-03 21:07:09
Pushed At: 2023-12-17 16:45:24
Last Commit At: 2014-08-12 10:15:39
Release Count: 0
User Engagement
Stargazers Count: 76
Watchers Count: 12
Fork Count: 94
Commits Count: 13
Has Issues Enabled
Issues Count: 3
Issue Open Count: 3
Pull Requests Count: 2
Pull Requests Open Count: 2
Pull Requests Close Count: 0
Project Settings
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private