Giter Site home page Giter Site logo

bigdata_ai_nlp_tutorial's Introduction

Austin Big Data AI, NLP Tutorial Part I

Date: Thursday, July 9, 2020, 7:00 PM to 8:00 PM CDT

This tutorial is a gentle introduction to Natural Language Processing (NLP) for those who have not used NLP methods and techniques before or have some minimal experience with them.

Part I

We will start with a brief introduction to NLP concepts and terminology and then, we will walk you through some of the text preprocessing steps to prepare the provided dataset for further NLP modeling. If time permits, we will also show you a complete implementation of a sentiment analysis model.

Setup Instructions: Please Read Before the Tutorial!

Please do the download and install of Jupyter Notebook, as well as the packages listed in the environment.yml file as described below before coming to the tutorial! We will NOT have time to setup or handle technical problems that might come up due to downloads during the tutorial due to the very limited amount of time we will have.

We'll be using Jupyter Notebook and NLTK, as well as some other open source libraries.

Installation Steps

Please note: if you are familiar with creating development environments on your machine, it is generally recommended to install packages in a virtual environment to avoid modifying system state.

There is an environment.yml file included in the github repo that will create a conda environment with all dependecies for the tutorial. This is the easiest way to get all you need to run the jupyter notebooks included with the tutorial.

  • You will need to clone the repository for this tutorial from github using your command line:

    git clone 
    [email protected]:ftarlaci/bigdata_ai_nlp_tutorial.git
    
  • If you do not have anaconda installed, you can download it here: download Anaconda

  • Once you have installed Anaconda, in a shell, navigate to the cloned directory for the tutorial, and run the following line to create an environment with all requirements needed:

    conda env create -f environment.yml
    
  • Activate the environment and open the notebook (a browser window should open):

    conda activate nlp_tutorial
    jupyter notebook
    

This command will launch the Jupyter Notebook in your browser.

Testing correct installation

If you have installed all the required packages, please open and run the test_installations.ipynb. If you don't get any errors, that means you have installed the required libraries correctly and will be able to run the notebooks during the tutorial.

Part II

We will update this repository and this README file for Part II of the tutorial with further instructions. Please revisit this repo before coming to the Part II next week.

bigdata_ai_nlp_tutorial's People

Contributors

ppwadhwa avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.