Giter Site home page Giter Site logo

tidesq / nlp_course Goto Github PK

View Code? Open in Web Editor NEW

This project forked from yandexdataschool/nlp_course

0.0 2.0 0.0 92.99 MB

YSDA course in Natural Language Processing

License: MIT License

Dockerfile 0.75% Shell 0.02% Python 11.88% Jupyter Notebook 80.06% HTML 7.30%

nlp_course's Introduction

YSDA Natural Language Processing course

  • Lecture and seminar materials for each week are in ./week* folders
  • Create cloud jupyter session from this repo - Binder
  • YSDA homework deadlines are listed in Anytask course page.
  • Any technical issues, ideas, bugs in course materials, contribution ideas - add an issue
  • Installing libraries and troubleshooting: this thread.

Syllabus

  • week01 Embeddings

    • Lecture: Word embeddings. Distributional semantics, LSA, Word2Vec, GloVe. Why and when we need them.
    • Seminar: Playing with word and sentence embeddings.
  • week02 Text classification

    • Lecture: Text classification. Classical approaches for text representation: BOW, TF-IDF. Neural approaches: embeddings, convolutions, RNNs
    • Seminar: Salary prediction with convolutional neural networks; explaining network predictions.
  • week03 Language Models

    • Lecture: Language models: N-gram and neural approaches; visualizing trained models
    • Seminar: Generating ArXiv papers with language models
  • week04 Seq2seq/Attention

    • Lecture: Seq2seq: encoder-decoder framework. Attention: Bahdanau model. Self-attention, Transformer. Pointer networks. Attention for analysis.
    • Seminar: Machine translation of hotel and hostel descriptions
  • week05 Structured Learning

    • Lecture: Structured Learning: structured perceptron, structured prediction, dynamic oracles, RL basics.
    • Seminar: POS tagging
  • week06 Expectation-Maximization

    • Lecture: Expectation-Maximization and Word Alignment Models
    • Seminar: Implementing expectation maximizaiton
  • week07 Machine translation

    • Lecture: Machine Translation: a review of the key ideas from PBMT, the application specific ideas that have developed in NMT over the past 3 years and some of the open problems in this area.
    • Seminar: presentations by students
  • week08 Transfer learning and Multi-task learning

    • Lecture: What and why does a network learn: "model" is never just "model"! Transfer learning in NLP. Multi-task learning in NLP. How to understand, what kind of information the model representations contain.
    • Seminar: Improving named entity recognition by learning jointly with other tasks
  • week09 Domain Adaptation

    • Lecture: General theory. Instance weighting. Proxy-labels methods. Feature matching methods. Distillation-like methods.
    • Seminar: Adapting general machine translation model to a specific domain.
  • week10 Dialogue Systems

    • Lecture: Task-oriented vs general conversation systems. Overview of a framework for task-oriented systems. General conversation: retrieval and generative approaches. Generative models for general conversation. Retrieval-based models for general conversation.
    • Seminar: Simple retrieval-based question answering
  • week11 Generative models

    • Lecture: generative models recap, generative adversarial networks, variational autoencoders and why should you care about them.
    • Seminar: semi-supervised dictionary learning with adversarial networks
  • week12-13 - TBA

Contributors & course staff

Course materials and teaching performed by

nlp_course's People

Contributors

justheuristic avatar drt7 avatar kovarsky avatar vprov avatar sashamn avatar femoiseev avatar yura52 avatar tenich avatar mryab avatar shakhrayv avatar lena-voita avatar

Watchers

tidesq avatar James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.