Giter Site home page Giter Site logo

chetanambi / innoplexus-online-hiring-hackathon-sentiment-analysis Goto Github PK

View Code? Open in Web Editor NEW
11.0 4.0 6.0 153 KB

Analytics Vidhya is India’s largest and one of the world’s leading data science community and knowledge portal. It is a passionate community for Analytics / Data Science Professionals and aims at bringing together influencers and learners to augment knowledge. This platform allows people to know more about analytics from its articles, discussion forum, learning paths, meetups, webinars, training, etc. We get more than 1.5 Million monthly visits and have 100,000+ active registered users. We also help professionals & amateurs to sharpen their skillsets by providing a platform to participate in Hackathons.

Home Page: https://datahack.analyticsvidhya.com/contest/innoplexus-online-hiring-hackathon/

License: Apache License 2.0

Jupyter Notebook 100.00%
analyticsvidhya

innoplexus-online-hiring-hackathon-sentiment-analysis's Introduction

Innoplexus-Online-Hiring-Hackathon-Sentiment-Analysis

image

Problem Statement

Sentiment Analysis for drugs/medicines Nowadays the narrative of a brand is not only built and controlled by the company that owns the brand. For this reason, companies are constantly looking out across Blogs, Forums, and other social media platforms, etc for checking the sentiment for their various products and also competitor products to learn how their brand resonates in the market. This kind of analysis helps them as part of their post-launch market research. This is relevant for a lot of industries including pharma and their drugs.

The challenge is that the language used in this type of content is not strictly grammatically correct. Some use sarcasm. Others cover several topics with different sentiments in one post. Other users post comments and reply and thereby indicating his/her sentiment around the topic.

Sentiment can be clubbed into 3 major buckets - Positive, Negative and Neutral Sentiments.

You are provided with data containing samples of text. This text can contain one or more drug mentions. Each row contains a unique combination of the text and the drug mention. Note that the same text can also have different sentiment for a different drug.

Given the text and drug name, the task is to predict the sentiment for texts contained in the test dataset. Given below is an example of text from the dataset:

Example:

Stelara is still fairly new to Crohn's treatment. This is why you might not get a lot of replies. I've done some research, but most of the "time to work" answers are from Psoriasis boards. For Psoriasis, it seems to be about 4-12 weeks to reach a strong therapeutic level. The good news is, Stelara seems to be getting rave reviews from Crohn's patients. It seems to be the best med to come along since Remicade. I hope you have good success with it. My daughter was diagnosed Feb. 19/07, (13 yrs. old at the time of diagnosis), with Crohn's of the Terminal Illium. Has used Prednisone and Pentasa. Started Imuran (02/09), had an abdominal abscess (12/08). 2cm of Stricture. Started ​Remicade in Feb. 2014, along with 100mgs. of Imuran.

For Stelara the above text is ​positive​ while for Remicade the above text is ​negative​.

Data Description

train.csv

Contains the labelled texts with sentiment values for a given drug

test.csv

test.csv contains texts with drug names for which the participants are expected to predict the correct sentiment

sample_submission.csv

sample_submission.csv contains the submission format for the predictions against the test set. NA single csv needs to be submitted as a solution. The submission file must contain only 2 columns unique_hash, sentiment

Evaluation Metric

The metric used for evaluating the performance of the classification model would be macro F1-Score.

Public and Private Split

The texts in the test data are further randomly divided into Public (40%) and Private (60%) data. Your initial responses will be checked and scored on the Public data. The final rankings would be based on your private score which will be published once the competition is over.

Public Leaderboard: 74 (Score: 0.4850616849)

Private Leaderboard: 29 (Score: 0.5230949840)

innoplexus-online-hiring-hackathon-sentiment-analysis's People

Contributors

chetanambi avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.