Spark NLP Workshop

在 Apache Spark 上使用 John Snow Labs 的 NLP 的公共可运行示例。「Public runnable examples of using John Snow Labs' NLP for Apache Spark.」

Github stars Tracking Chart

Spark NLP Workshop

Showcasing notebooks and codes of how to use Spark NLP in Python and Scala.

Table of contents

Python Setup

$ java -version
# should be Java 8 (Oracle or OpenJDK)
$ python3 -m venv .sparknlp-env
$ source .sparknlp-env/bin/activate
# spark-nlp by default is based on pyspark 3.x
$ pip install pyspark==3.1.2
$ pip install spark-nlp

Colab setup

# This is only to setup PySpark and Spark NLP on Colab
!wget http://setup.johnsnowlabs.com/colab.sh -O - | bash

Main repository

https://github.com/JohnSnowLabs/spark-nlp

Project's website

Take a look at our official spark-nlp page: http://nlp.johnsnowlabs.com/ for user documentation and examples

Slack community channel

Join Slack

Contributing

If you find any example that is no longer working, please create an issue.

License

Apache Licence 2.0

Main metrics

Overview
Name With OwnerJohnSnowLabs/spark-nlp-workshop
Primary LanguageJupyter Notebook
Program languageJupyter Notebook (Language Count: 7)
PlatformLinux, Mac, Windows
License:Apache License 2.0
所有者活动
Created At2018-08-20 18:06:12
Pushed At2025-06-12 13:22:52
Last Commit At2025-06-12 19:07:52
Release Count7
Last Release Name3.0.3 (Posted on )
First Release Name1.8.3 (Posted on )
用户参与
Stargazers Count1.1k
Watchers Count57
Fork Count611
Commits Count2.9k
Has Issues Enabled
Issues Count78
Issue Open Count1
Pull Requests Count1266
Pull Requests Open Count9
Pull Requests Close Count133
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private