nlp_course

YSDA 自然语言处理课程。(YSDA course in Natural Language Processing.)

  • 所有者: yandexdataschool/nlp_course
  • 平台: Web browsers
  • 許可證: MIT License
  • 分類:
  • 主題:
  • 喜歡:
    0
      比較:

Github星跟蹤圖

YSDA Natural Language Processing course Binder

  • This is the 2019 version. For previous year' course materials, go to this branch
  • Lecture and seminar materials for each week are in ./week* folders
  • YSDA homework deadlines will be listed in Anytask (read more).
  • Any technical issues, ideas, bugs in course materials, contribution ideas - add an issue
  • Installing libraries and troubleshooting: this thread.

Syllabus

  • week01 Embeddings

    • Lecture: Word embeddings. Distributional semantics, LSA, Word2Vec, GloVe. Why and when we need them.
    • Seminar: Playing with word and sentence embeddings.
  • week02 Text classification

    • Lecture: Text classification. Classical approaches for text representation: BOW, TF-IDF. Neural approaches: embeddings, convolutions, RNNs
    • Seminar: Salary prediction with convolutional neural networks; explaining network predictions.
  • week03 Language Models

    • Lecture: Language models: N-gram and neural approaches; visualizing trained models
    • Seminar: Generating ArXiv papers with language models
  • week04 Seq2seq/Attention

    • Lecture: Seq2seq: encoder-decoder framework. Attention: Bahdanau model. Self-attention, Transformer. Analysis of attention heads in Transformer.
    • Seminar: Machine translation of hotel and hostel descriptions
  • week05 Expectation-Maximization

    • Lecture: Expectation-Maximization and Hidden Markov Models
    • Seminar: Implementing expectation maximization
  • week06 Machine Translation

    • Lecture: Word Alignment Models, Noisy Channel, Machine Translation.
    • Seminar: Introduction to word alignment assignment.

Contributors & course staff

Course materials and teaching performed by

主要指標

概覽
名稱與所有者yandexdataschool/nlp_course
主編程語言Jupyter Notebook
編程語言Dockerfile (語言數: 7)
平台Web browsers
許可證MIT License
所有者活动
創建於2018-09-08 09:28:05
推送於2024-12-25 17:38:37
最后一次提交2019-10-10 20:08:16
發布數0
用户参与
星數10.1k
關注者數360
派生數2.6k
提交數825
已啟用問題?
問題數46
打開的問題數1
拉請求數88
打開的拉請求數6
關閉的拉請求數23
项目设置
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?