Giter Site home page Giter Site logo

text_analysis_covid_vaccine_hesitancy's Introduction

Text Analysis for Covid-19 Vaccine Hesitancy

This notebook explores and analyses the text dataset of people's intent expressions around COVID-19 vaccine hesitancy and brings insight into people's main concerns about COVID-19 vaccines. The recommendations are given to help public health officials to develop targeted communication and education strategies that address concerns and promote vaccine acceptance.

💡 Interesting Findings

Top 10 Questions

Top10Qs

The top 10 common questions has 90-100 intent expressions/sentences and they are related to several concerns or queries such as

  • effectiveness of the vaccine
  • cost of the vaccine
  • mistrust of vaccines which are from China or Russia
  • lack of knowledge in Omicron variant
  • safety meatures after getting the vaccine
  • safety risks of the vaccine (including booster shot)
  • doses of the vaccine

Note: Each question could have multiple intent expressions for example the question 'How effective is the vaccine against the Omicron variant?', the intent expressions include 'Is it worth getting the vaccine because it will not help with the new omciron variant of the virus?', 'What if the vaccine doesn't work with new omricon variant of the virus?', and all the other expressions.   

Key Words

Word_Cloud

As the word cloud shows, some of the most frequent words include side, effect, booster, shoot, variant, safe, effective, people, test, long, cause, concern, children, trust, dose, dangerous, omicron and so on. With further exploration using the technique of bigrams, some keys are discovered to appear together more often such as side effect, omicron variant and booster shot.

By diving deeper into the key words or bigrams identified, it is found that side effect and children/kids have the highest number of expressions/sentences and common questions as displayed in the charts below. According to the chart, around 1000 expressions in total are related to side effect, children/kids or trust

Sen_Qs    

Most Concerned Topics - Side Effects, Kids, Trust

Side Effects

Side_Effects

Among the expressions related to side effects,

  • 38% mentions the worry about the severe side effects (such as death) or adverse reactions
  • 22% questions the side effects of the second shot or booster shot
  • 12% expresses the concern about the side effects in children or women
  • 8% doubts the underreporting of side effects
  • the remaining reflects the lack of knowledge in the vaccine such as its side effect, the commonality of side effects and the relationship between vaccine effectiveness and its side effects.    

Kids

Kids

Based on the pie chart, 11% of expressions shows the unwillingness to get their children vaccinated from the parents' perspective. Some main concerns concentrate on the safety issue of the vaccine, whether the vaccine is compulsory in school, impact of the vaccine on children (such as side effects, missing school), and variances between children and adults in terms of vaccine doses and effectiveness.    

Trust

Trust

The chart above indicates several factors can cause distrust of the vaccine including:

  • the origin of the vaccine such as China or Russia (33%)
  • the companies producing the vaccines (21%)
  • the government (20%)
  • other reasons(26%)    

📝 Recommendations

Based on the findings, a few suggestions are provided to the public health officials or the government as below:

  • Be transparent about the side effects of different vaccines (such as third shot, booster shot) and how they impact different groups (such as children, women etc.);
  • Provide learning resouces online or in the community to help people gain a better understanding of vaccines and things relevant to vaccines, for example how long the vaccine is effective, what are the safety measures after vaccination, what are the differences between different type of shot and so on;
  • Build stronger trust between the public and the government by reporting the side effects of vaccines with integrity, enhancing the regulations and inspection on the quality of vaccines and other effective measures to rebuild the trust and improve the vaccine acceptance.

🛠 Techniques and Tools Used

  • Data visualisation packages including Matplotlib and Seaborn
  • NLP package NLTK
  • Word count
  • Word cloud
  • Bigrams and trigrams
  • Text data cleaning including lowercasing, removing punctuations, removing non-alphabatic and non-number, removing and customizing stop words, lemmatization.

ℹ️ Data Source

The dataset used in the notebook come from the Vaccination Information Resource Assistant (VIRA) Coversation Corpus. Reference for the dataset: Gretz, S., Toledo, A., Friedman, R., Lahav, D., Weeks, R., Sedoc, J., Sangha, P., Katz, Y., & Slonim, N. (2022). Benchmark Data and Evaluation Framework for Intent Discovery Around COVID-19 Vaccine Hesitancy. ArXiv. /abs/2205.11966

text_analysis_covid_vaccine_hesitancy's People

Contributors

amy-panda avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.