Giter Site home page Giter Site logo

code-for-africa-twitter-data-analysis's Introduction

Code-for-Africa-Twitter-data-analysis

Outline of Tasks

Data Collection

Write a script or use a tool from Github to collect public tweets using the hashtag #TigrayGenocide on Twitter for the period 13 April 2021 - 15 April 2021, and share the dataset as a zipped CSV file.

Data Analysis

Data Analysis You can find the data in data folder above The data is a sample of tweets collected during a previous investigation.

Exercises;

For a forensic investigation focusing on social network analysis, defining clusters or groups in a dataset is essential in understanding the formation of communities involved in shaping a particular conversation. An example would be identifying a group of accounts on Twitter that work together to push a particular topic or narrative.

From the data provided above:

  • What subgroups can you identify based on the volume of tweets in each category and the number of unique accounts for each?

  • Who is the most prominent author for each category type?

Establishing who started the conversation on social media, the amplification point and how the conversation evolved over time is key in determining the key perpetrators spearheading a particular disinformation narrative.

Using the data provided, Identify :

  • The account name and account handle of the author who posted the first tweet.

  • Identify the date that had the highest volume of tweets (amplification point).

"I am here for a purpose and that purpose is to grow into a mountain, not to shrink to a grain of sand. - Mandino #quote via @roxanamjones"

This is a tweet found within the dataset. For this exercise :

  • Identify the author of the tweet;

  • Create a subset with all tweets from the same author;

Using a plotting library / tool display the daily number of tweets from ;

  • The top 2 categories within the datasets and rank them by volume of tweets.

  • The 3rd and 4th categories and rank them by volume of tweets.

Account Profiling

  • What are some of the characteristics you would consider when identifying fake or suspicious accounts on twitter?

  • Using the data provided, identify one account that you suspect might be automated and flag some of the bot-like traits you identified.

code-for-africa-twitter-data-analysis's People

Contributors

getuchala20 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.