Giter Site home page Giter Site logo

yunuserez / a-step-by-step-guide-to-bert Goto Github PK

View Code? Open in Web Editor NEW

This project forked from spartanlasergun/a-step-by-step-guide-to-bert

0.0 0.0 0.0 617 KB

A step by step guide to using BERT for sentiment classification

License: GNU General Public License v3.0

Jupyter Notebook 100.00%

a-step-by-step-guide-to-bert's Introduction

A Step by Step Guide To BERT

This repository contains a step by step guide to using the BERT model for text classification. I have choosen to use a dataset of tweets for positive and negative sentiment classification, as this has been one of the areas that newcomers to machine learning are more interested in.

This guide has been created with the intention of making machine learning a bit more accessible to the beginner user - though, the complexity of machine learning inevitably makes it better suited as a resource for users that are already familiar with statistics and python. Still, I have kept the guide simple enough that most beginner programmers will still be able to make a lot of headway with understanding and using the BERT model. My recommendation is that the person using this guide should read through it very slowly. Attempt to understand the code in each cell along with the explanations given before moving forward.

It is very difficult to avoid touching on advanced topics in statistics and taking other concepts for granted. In particular I have avoided going into detail on explanations of logistic regression which would be extremely complex even for the advanced user.

The google collaboratory notebook - 'A_Step_by_Step_Guide_To_BERT.ipynb' - gives a complete step by step walkthrough of:

  • loading data
  • doing the preprocessing
  • intializing the bert model
  • generating the embeddings
  • and, performing text classification using logistic regression

The dataset used for the walktrough is contained in the file 'data.csv'.


Other Resources

If you wish to explore machine learning in a more in depth manner, you should consult the following resources:

  • Getting Started with Goole BERT - by Sudharsan Ravichandiran
  • Transformers for natural language processing - by Denis Rothman
  • What is ChatGPT doing...and why does it work - by Stephen Wolfram

If you are a more advanced user and you want to understand logistic regression itself, you should consult:

  • Logistic Regression: A Self Learning Text - by David Kleinbaum & Mitchel Klein

a-step-by-step-guide-to-bert's People

Contributors

spartanlasergun avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.