
alfonmedela / to_vaccinate_or_not_to_vaccinate


#ZindiWeekendz hackathon 5th-place solution (NLP) 🔊



To Vaccinate or Not to Vaccinate: It’s not a Question challenge!

#ZindiWeekendz hackathon #top5 solution

How to replicate the solution

Libraries needed:

  • NumPy
  • pandas
  • PyTorch (torch)
  • fastai
  • Transformers

In case you are not able to run it, I have included a requirements.txt listing every dependency installed in my conda environment. You can check the exact versions there.

Running the algorithm

Repeat steps 1 and 2 for every split (X = 0, 1, 2, 3, 4):

  1. Run split_X/main_train.py
  2. Run split_X/submission.py

Once all submission files are generated, run final_submission.py.
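The steps above can be chained in a small driver script. The following is only a sketch: it assumes the directories are literally named split_0 to split_4, that each script is meant to be launched from its own directory, and the helper names build_commands and run_all are hypothetical and not part of the repository.

```python
import subprocess
import sys
from pathlib import Path

SPLITS = range(5)  # splits 0..4, as listed in the steps above


def build_commands(repo_root="."):
    """Return the script paths in the order the steps above describe."""
    root = Path(repo_root)
    scripts = []
    for split in SPLITS:
        # Assumption: "split_X" expands to split_0, split_1, ... on disk.
        split_dir = root / f"split_{split}"
        scripts.append(split_dir / "main_train.py")  # step 1
        scripts.append(split_dir / "submission.py")  # step 2
    scripts.append(root / "final_submission.py")     # final combination step
    return scripts


def run_all(repo_root="."):
    """Run every script in order and stop at the first failure."""
    for script in build_commands(repo_root):
        # Assumption: each script expects its own directory as working dir.
        subprocess.run(
            [sys.executable, script.name],
            cwd=script.parent,
            check=True,
        )
```

Calling run_all() from the repository root would then train and predict for each split before combining the results. Training five models takes a while, so printing build_commands() first is a cheap way to check the paths.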

Method

I used a RoBERTa model with a final single-neuron layer that predicts a value between -1 and 1. The network is trained with MSE loss and evaluated with RMSE, since RMSE is the objective metric of the competition. I used the AdamW optimizer and trained in two stages: first with everything frozen except the final regression layer, and then with the whole network unfrozen.
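To make the relation between the training loss and the competition metric concrete, here is a minimal pure-Python sketch. It is not taken from the repository; the function names and the example values are illustrative assumptions.

```python
import math


def mse(preds, targets):
    """Mean squared error: the quantity minimised during training."""
    if not preds or len(preds) != len(targets):
        raise ValueError("preds and targets must be non-empty and equal in length")
    return sum((p - t) ** 2 for p, t in zip(preds, targets)) / len(preds)


def rmse(preds, targets):
    """Root mean squared error: the competition metric."""
    return math.sqrt(mse(preds, targets))


# Illustrative values only (not competition data), all within [-1, 1].
targets = [1.0, -1.0, 0.0]
preds = [0.5, -1.0, 0.0]

loss = mse(preds, targets)     # 0.25 / 3
metric = rmse(preds, targets)  # square root of the loss
```

Because the square root is monotonic, lowering the MSE during training also lowers the RMSE, which is why the simpler loss can be optimised directly.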

As the dataset is really small, with only 10,000 training samples, each split produced different results on the public leaderboard. Therefore, I decided to stack 4 models trained on different splits. Additionally, I trained a fifth model on a last split after dropping the samples with 0.33 agreement. This way, the network learned from samples with higher-confidence labels.
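The two ideas in this paragraph can be sketched as follows. This is a hedged illustration, not the repository's code: it assumes that "stacking" means a plain row-wise average of the per-split predictions (final_submission.py may combine them differently), that each sample carries a numeric "agreement" field, and that 0.5 is a reasonable cut-off for separating the lowest agreement level from the rest. All function and field names are hypothetical.

```python
def drop_low_agreement(rows, min_agreement=0.5):
    """Keep only samples whose label agreement reaches min_agreement.

    Assumption: the lowest agreement level is about one third, so a
    cut-off of 0.5 removes exactly those samples.
    """
    return [row for row in rows if row["agreement"] >= min_agreement]


def average_predictions(per_model_preds):
    """Average predictions row by row across models.

    Assumption: every model predicts the same test rows in the same order.
    """
    if not per_model_preds:
        raise ValueError("need predictions from at least one model")
    n_rows = len(per_model_preds[0])
    if any(len(preds) != n_rows for preds in per_model_preds):
        raise ValueError("all models must predict the same number of rows")
    n_models = len(per_model_preds)
    return [sum(column) / n_models for column in zip(*per_model_preds)]


# Illustrative data only.
train_rows = [
    {"text": "a", "label": 1.0, "agreement": 1.0},
    {"text": "b", "label": 0.0, "agreement": 0.333333},
    {"text": "c", "label": -1.0, "agreement": 0.666667},
]
confident_rows = drop_low_agreement(train_rows)

blended = average_predictions([[1.0, -1.0], [0.0, 0.0]])
```

Averaging smooths out the split-to-split variance described above, at the cost of training one model per split.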
