This project forked from evanmartua34/twitter-covid19-indonesia-sentiment-analysis---lexicon-based

This repository does three main things: Twitter data scraping, data analysis, and sentiment analysis and dataset generation.

Jupyter Notebook 98.76% Python 1.24%

Twitter-COVID19-Indonesia-Sentiment-Analysis---Lexicon-Based

LEXICON BASED Twitter Bahasa Indonesia Sentiment Analysis

This work builds on various sources and is used to explore data, generate a dataset, and run the analysis.

Lexicon-based sentiment analysis has some flaws: it takes the sentiment of each word in isolation without considering its context, and the resulting sentiment score depends heavily on the word weights in the lexicon itself. However, when doing analysis from scratch without pre-labelled data, manual sentiment labelling is expensive and complicated for a non-specialist. The lexicon method comes in handy for sentiment analysis in that scenario. Keep in mind that this method is a minimal approach intended for learning.
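As a rough illustration of the idea and of its main flaw, here is a minimal sketch of lexicon scoring. The toy lexicon, its weights, and the names `score_text` and `label` are assumptions made for this example; they are not taken from the repository's lexicon or notebooks.

```python
# Toy lexicon: word -> weight. The entries and weights below are
# illustrative assumptions, not values from the repository's lexicon.
TOY_LEXICON = {
    "baik": 3,     # "good"
    "sembuh": 4,   # "recovered"
    "takut": -3,   # "afraid"
    "buruk": -4,   # "bad"
}


def score_text(text, lexicon):
    """Sum the weights of all tokens that appear in the lexicon."""
    tokens = text.lower().split()
    return sum(lexicon.get(token, 0) for token in tokens)


def label(score):
    """Map a numeric score to a coarse sentiment label."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


# "tidak baik" means "not good", but "tidak" carries no weight here,
# so the word-by-word sum still comes out positive.
print(label(score_text("tidak baik", TOY_LEXICON)))
```

Because each word is scored on its own, the negation is ignored and the phrase is labelled positive. The result also changes as soon as the weights in the lexicon change, which is the dependence on word weighting described above.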

How to use:

  1. Data Scraping and Dataset Generation: run sentiment_Dataset_Generation.py. The dataset will be available in data/data_extraction. The keywords used here relate to the corona pandemic in Indonesia; feel free to modify them. Don't forget to insert your own Twitter credentials.

  2. Sources modification: open modify_sources.ipynb to modify the lexicon, stop words, and slang words.

  3. Sentiment and Data Analysis: open Analysis.ipynb to run the sentiment and data analysis.
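To give a feel for why the slang words and stop words matter, here is a minimal sketch of how such lists might be applied to a tweet before scoring. It is an assumption about the general technique, not a copy of what the notebooks do; the `SLANG` and `STOP_WORDS` entries and the `preprocess` function are hypothetical.

```python
import re

# Hypothetical sample entries; the real lists are the ones maintained
# through modify_sources.ipynb.
SLANG = {"gk": "tidak", "bgt": "banget", "yg": "yang"}
STOP_WORDS = {"yang", "di", "dan"}


def preprocess(tweet, slang, stop_words):
    """Clean a tweet and return a list of normalized tokens."""
    text = tweet.lower()
    text = re.sub(r"https?://\S+", " ", text)  # drop URLs
    text = re.sub(r"[@#]\w+", " ", text)       # drop mentions and hashtags
    text = re.sub(r"[^a-z\s]", " ", text)      # keep letters and whitespace
    # Replace slang with its standard form, then drop stop words.
    tokens = [slang.get(token, token) for token in text.split()]
    return [token for token in tokens if token not in stop_words]


print(preprocess("Vaksin yg baik bgt! https://t.co/abc @user #covid19",
                 SLANG, STOP_WORDS))
```

Slang is normalized before stop words are removed so that a slang form such as "yg" is first mapped to "yang" and then filtered out. Extending either dictionary changes which tokens reach the lexicon lookup.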

Hope this is useful.

Regards,

Evan Martua

References: https://github.com/louisowen6/NLP_bahasa_resources
https://github.com/fajri91/InSet
https://github.com/abhimantramb/elang/blob/master/word2vec/utils/swear-words.txt
https://devtrik.com/python/steeming-bahasa-indonesia-python-sastrawi/
https://towardsdatascience.com/extracting-twitter-data-pre-processing-and-sentiment-analysis-using-python-3-0-7192bd8b47cf
