Introduction to Natural Language Processing (NLP)

Educational resources to get started with Natural Language Processing in Python.

By Sebastian Castro, 2020

For more background, check out the following resources:


Getting Started

Install conda, then create and activate a conda environment:

conda create --name intro-nlp --file conda-requirements.txt
conda activate intro-nlp

The version of HuggingFace Transformers available through conda is quite outdated, so install it with pip instead. Before doing so, make sure your conda environment is active:

conda activate intro-nlp
pip install transformers
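
To confirm that pip installed the package into the active environment, a small check like the following may help. This is only a sketch using the Python standard library; the helper name `installed_version` is made up for illustration and is not part of this repository.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Run this with the intro-nlp environment active.
    print("transformers:", installed_version("transformers") or "not installed")
```

If this prints "not installed", the most likely cause is that pip ran outside the conda environment.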

Examples

Rule-Based Processing

Basic text processing and sentence parsing using a grammar.

Refer to the Rule-Based Processing README for more information.
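
As a rough illustration of what parsing with a grammar means, here is a minimal sketch in plain Python. The toy lexicon, the three grammar rules, and the function names are all assumptions made for this example; the repository's own examples may use a different grammar and a dedicated NLP library.

```python
import re

# Hypothetical toy lexicon mapping words to part-of-speech tags.
LEXICON = {
    "the": "Det", "a": "Det",
    "dog": "N", "cat": "N", "ball": "N",
    "chased": "V", "saw": "V", "sleeps": "V",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def parse_np(tags, i):
    """NP -> Det N. Return the index after the phrase, or None on failure."""
    if i + 1 < len(tags) and tags[i] == "Det" and tags[i + 1] == "N":
        return i + 2
    return None


def parse_vp(tags, i):
    """VP -> V NP | V. Return the index after the phrase, or None on failure."""
    if i < len(tags) and tags[i] == "V":
        end = parse_np(tags, i + 1)
        return end if end is not None else i + 1
    return None


def is_sentence(text):
    """S -> NP VP. True if the whole text is covered by the toy grammar."""
    tags = [LEXICON.get(token) for token in tokenize(text)]
    if None in tags:  # unknown word
        return False
    i = parse_np(tags, 0)
    if i is None:
        return False
    return parse_vp(tags, i) == len(tags)


if __name__ == "__main__":
    for sentence in ["The dog chased a cat.", "The cat sleeps.", "Chased the dog."]:
        print(sentence, "->", is_sentence(sentence))
```

Hand-written rules like these are transparent and need no training data, but every new word or construction has to be added by hand.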

Traditional Statistical Methods

The "old school" of NLP, including features such as bag-of-words and machine learning classifiers that do not use neural networks, such as Naive Bayes and Support Vector Machines (SVM).

Refer to the Traditional Machine Learning README for more information.
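
To make bag-of-words and Naive Bayes concrete, the following is a minimal sketch written from scratch with the standard library. The tiny training set and the class name `NaiveBayes` are invented for illustration, and it assumes a multinomial model with add-one (Laplace) smoothing. The repository's examples may rely on an existing machine learning library instead.

```python
import math
import re
from collections import Counter, defaultdict


def bag_of_words(text):
    """Count word occurrences, ignoring word order and punctuation."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (illustrative only)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(bag_of_words(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        features = bag_of_words(text)
        scores = {}
        for label, n_docs in self.class_counts.items():
            total_words = sum(self.word_counts[label].values())
            score = math.log(n_docs / total_docs)  # log prior
            for word, count in features.items():
                if word not in self.vocab:
                    continue  # skip words never seen in training
                p = (self.word_counts[label][word] + 1) / (total_words + len(self.vocab))
                score += count * math.log(p)  # log likelihood
            scores[label] = score
        return max(scores, key=scores.get)


if __name__ == "__main__":
    # Made-up toy data, far too small for a real classifier.
    texts = [
        "great movie, loved it",
        "what a wonderful film",
        "terrible plot and bad acting",
        "boring, bad and awful",
    ]
    labels = ["pos", "pos", "neg", "neg"]
    model = NaiveBayes().fit(texts, labels)
    print(model.predict("a wonderful, great film"))
    print(model.predict("bad boring plot"))
```

Working in log space avoids numerical underflow when many small probabilities are multiplied together.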

Modern Statistical Methods using Deep Learning

Here we will see how neural networks have revolutionized NLP, using techniques like word embeddings, which replace sparse vocabulary-sized word vectors with dense, low-dimensional ones, and recurrent neural networks built from units such as Gated Recurrent Units (GRU) and Long Short-Term Memory (LSTM) units.
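
The difference between one-hot vectors and word embeddings can be sketched in a few lines. The five-word vocabulary and the 2-D embedding values below are hand-picked assumptions for illustration. Real embeddings are learned from data and typically have tens to hundreds of dimensions.

```python
import math

VOCAB = ["king", "queen", "apple", "banana", "car"]
WORD_TO_ID = {word: i for i, word in enumerate(VOCAB)}

# Hypothetical hand-picked embeddings; a trained model would learn these values.
EMBEDDINGS = [
    [0.9, 0.1],  # king
    [0.8, 0.2],  # queen
    [0.1, 0.9],  # apple
    [0.2, 0.8],  # banana
    [0.5, 0.5],  # car
]


def one_hot(word):
    """Sparse vector with one dimension per vocabulary word."""
    vec = [0.0] * len(VOCAB)
    vec[WORD_TO_ID[word]] = 1.0
    return vec


def embed(word):
    """Dense vector looked up from the embedding table."""
    return EMBEDDINGS[WORD_TO_ID[word]]


def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    print("one-hot size:", len(one_hot("king")), "embedding size:", len(embed("king")))
    print("one-hot king/queen:", cosine(one_hot("king"), one_hot("queen")))
    print("embedded king/queen:", cosine(embed("king"), embed("queen")))
    print("embedded king/apple:", cosine(embed("king"), embed("apple")))
```

One-hot vectors of different words are always orthogonal, so they carry no notion of similarity, whereas embedding vectors can place related words close together.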

Finally, we will look at state-of-the-art deep learning models for NLP such as Transformers, which replace recurrence with attention mechanisms and so avoid the disadvantages of recurrent neural networks, such as strictly sequential processing.

Refer to the Deep Learning README for more information.
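
The core operation behind Transformers, scaled dot-product attention, can be sketched without any deep learning library. The code below is a simplified single-head version on plain Python lists, with no learned projection matrices, masking, or batching. The function names and input values are assumptions for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V.

    Returns (outputs, weights), one row per query.
    """
    d = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights


if __name__ == "__main__":
    # Made-up 2-D vectors for a three-token sequence (self-attention: Q = K = V).
    x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    outputs, weights = attention(x, x, x)
    for row in weights:
        print([round(w, 3) for w in row])
```

Each output row depends on all positions at once, so the sequence does not have to be processed one step at a time as in a recurrent network.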


Featured Software Tools
