nlp-tutorial

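nlp-tutorial is a tutorial for people studying NLP (Natural Language Processing) with PyTorch. Most of the models are implemented in fewer than 100 lines of code (excluding comments and blank lines).

[08-14-2020] The old TensorFlow v1 code has been moved to the archive folder. To keep the code readable for beginners, only PyTorch 1.0 or higher is supported.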

Name and owner	graykode/nlp-tutorial
Primary language	Jupyter Notebook
Languages	Python (language count: 2)
Platforms	Linux, Mac, Windows
License	MIT License

Created	2019-01-09 11:44:20
Last pushed	2024-02-21 13:49:10
Last commit	2021-07-25 14:52:13
Releases	0

Stars	14.7k
Watchers	286
Forks	4k
Commits	78
Issues	54
Open issues	32
Pull requests	16
Open pull requests	6
Closed pull requests	9

Curriculum - (Example Purpose)

1. Basic Embedding Model

1-1. NNLM(Neural Network Language Model) - Predict Next Word
- Paper - A Neural Probabilistic Language Model(2003)
- Colab - NNLM.ipynb
1-2. Word2Vec(Skip-gram) - Embedding Words and Show Graph
- Paper - Distributed Representations of Words and Phrases
  and their Compositionality(2013)
- Colab - Word2Vec.ipynb
1-3. FastText(Application Level) - Sentence Classification
- Paper - Bag of Tricks for Efficient Text Classification(2016)
- Colab - FastText.ipynb
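
The Skip-gram entry in 1-2 trains on (center word, context word) pairs taken from a sliding window over the text. As a rough illustration only, and not the notebook's actual code, the sketch below builds such pairs in plain Python. The function name `skipgram_pairs`, the sample sentence, and the window size of 1 are all assumptions made for this example.

```python
def skipgram_pairs(tokens, window=1):
    """Return (center, context) pairs for each token and its neighbors within `window`."""
    pairs = []
    for i, center in enumerate(tokens):
        # Clamp the window to the sentence boundaries.
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # a word is never its own context
                pairs.append((center, tokens[j]))
    return pairs


# Hypothetical toy sentence, chosen only for illustration.
sentence = "i like natural language processing".split()
pairs = skipgram_pairs(sentence, window=1)
print(pairs[:3])
```

In a PyTorch model, each word would typically be mapped to an integer index first, with the center word as the input and the context word as the classification target.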

2. CNN(Convolutional Neural Network)

2-1. TextCNN - Binary Sentiment Classification
- Paper - Convolutional Neural Networks for Sentence Classification(2014)
- TextCNN.ipynb

3. RNN(Recurrent Neural Network)

3-1. TextRNN - Predict Next Step
- Paper - Finding Structure in Time(1990)
- Colab - TextRNN.ipynb
3-2. TextLSTM - Autocomplete
- Paper - LONG SHORT-TERM MEMORY(1997)
- Colab - TextLSTM.ipynb
3-3. Bi-LSTM - Predict Next Word in Long Sentence
- Colab - Bi_LSTM.ipynb

4. Attention Mechanism

4-1. Seq2Seq - Change Word
- Paper - Learning Phrase Representations using RNN Encoder–Decoder
  for Statistical Machine Translation(2014)
- Colab - Seq2Seq.ipynb
4-2. Seq2Seq with Attention - Translate
- Paper - Neural Machine Translation by Jointly Learning to Align and Translate(2014)
- Colab - Seq2Seq(Attention).ipynb
4-3. Bi-LSTM with Attention - Binary Sentiment Classification
- Colab - Bi_LSTM(Attention).ipynb

5. Model based on Transformer

5-1. The Transformer - Translate
- Paper - Attention Is All You Need(2017)
- Colab - Transformer.ipynb, Transformer(Greedy_decoder).ipynb
5-2. BERT - Classify Next Sentence & Predict Masked Tokens
- Paper - BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding(2018)
- Colab - BERT.ipynb
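
The Transformer in 5-1 is built around scaled dot-product attention, softmax(Q K^T / sqrt(d_k)) V, as defined in Attention Is All You Need. The following is a dependency-free sketch in plain Python rather than PyTorch, so it should not be read as the notebook's implementation. The function names and the toy matrices are assumptions chosen for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(Q, K, V):
    """Compute softmax(Q K^T / sqrt(d_k)) V for lists-of-lists Q, K, V.

    Q has shape (n_q, d_k), K has shape (n_k, d_k), V has shape (n_k, d_v).
    Returns (output, weights) with shapes (n_q, d_v) and (n_q, n_k).
    """
    d_k = len(K[0])
    output, weights = [], []
    for q in Q:
        # Similarity of this query with every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in K]
        w = softmax(scores)
        weights.append(w)
        # Weighted sum of the value vectors, one output column at a time.
        output.append([sum(wj * v[c] for wj, v in zip(w, V)) for c in range(len(V[0]))])
    return output, weights


# Hypothetical toy inputs: one query that matches the first key more closely.
Q = [[1.0, 0.0]]
K = [[1.0, 0.0], [0.0, 1.0]]
V = [[10.0, 0.0], [0.0, 10.0]]
out, attn = scaled_dot_product_attention(Q, K, V)
print(attn)
print(out)
```

Each row of the returned weights sums to 1, and the query places more weight on the key it aligns with. A tensor-based version would replace the Python loops with batched matrix multiplications.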

Dependencies

Python 3.5+
PyTorch 1.0.0+

Author

Tae Hwan Jung(Jeff Jung) @graykode
Author Email : nlkey2022@gmail.com
Acknowledgements to mojitok for the NLP Research Internship.
