nlp_course

YSDA 自然语言处理课程。(YSDA course in Natural Language Processing.)

Github stars Tracking Chart

YSDA Natural Language Processing course Binder

  • This is the 2019 version. For previous year' course materials, go to this branch
  • Lecture and seminar materials for each week are in ./week* folders
  • YSDA homework deadlines will be listed in Anytask (read more).
  • Any technical issues, ideas, bugs in course materials, contribution ideas - add an issue
  • Installing libraries and troubleshooting: this thread.

Syllabus

  • week01 Embeddings

    • Lecture: Word embeddings. Distributional semantics, LSA, Word2Vec, GloVe. Why and when we need them.
    • Seminar: Playing with word and sentence embeddings.
  • week02 Text classification

    • Lecture: Text classification. Classical approaches for text representation: BOW, TF-IDF. Neural approaches: embeddings, convolutions, RNNs
    • Seminar: Salary prediction with convolutional neural networks; explaining network predictions.
  • week03 Language Models

    • Lecture: Language models: N-gram and neural approaches; visualizing trained models
    • Seminar: Generating ArXiv papers with language models
  • week04 Seq2seq/Attention

    • Lecture: Seq2seq: encoder-decoder framework. Attention: Bahdanau model. Self-attention, Transformer. Analysis of attention heads in Transformer.
    • Seminar: Machine translation of hotel and hostel descriptions
  • week05 Expectation-Maximization

    • Lecture: Expectation-Maximization and Hidden Markov Models
    • Seminar: Implementing expectation maximization
  • week06 Machine Translation

    • Lecture: Word Alignment Models, Noisy Channel, Machine Translation.
    • Seminar: Introduction to word alignment assignment.

Contributors & course staff

Course materials and teaching performed by

Main metrics

Overview
Name With Owneryandexdataschool/nlp_course
Primary LanguageJupyter Notebook
Program languageDockerfile (Language Count: 7)
PlatformWeb browsers
License:MIT License
所有者活动
Created At2018-09-08 09:28:05
Pushed At2024-12-25 17:38:37
Last Commit At2019-10-10 20:08:16
Release Count0
用户参与
Stargazers Count10.1k
Watchers Count360
Fork Count2.6k
Commits Count825
Has Issues Enabled
Issues Count46
Issue Open Count1
Pull Requests Count88
Pull Requests Open Count6
Pull Requests Close Count23
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private