Giter Site home page Giter Site logo

heleene / trackedtogether Goto Github PK

View Code? Open in Web Editor NEW

This project forked from decorrespondent/trackedtogether

0.0 0.0 0.0 459 KB

Data and code from the Track(ed) Together Project by De Correspondent

License: MIT License

Jupyter Notebook 100.00%

trackedtogether's Introduction

picture alt

Track(ed) Together

This repository contains code and data from the Track(ed) Together Project, a project from the online magazine De Correspondent that ran between May 1st 2020 and May 1st 2021. The purpose was to establish an overview of surveillance measures taken to combat the covid-19 pandemic and to get insights into the scope, use and effectiveness of those measures. The project was meant to last for a year on the assumption that the crisis would be over by then. That wasn't the case.

It would be great if other people and organization built further on our dataset. Therefore we share our data and code with you. Feel free to use the data and code as you see fit, but please attribute the data collection to us. All data have been frequently manualy updated and verified. Data sources are provided as well.

If you want to get a sense of the type of stories or research you can do with this data, please take a look at some of our stories at The Correspondent or, in Dutch, at De Correspondent.

About the data

The data was manualy collected from public sources between May 1st 2020 en May 1st 2021 by several journalists (Lais Martens, Morgan Meaker and Dimitri Tokmetzis). Databasemanagement was done by Heleen Emanuel. All data have been verified. In the end, we collected information on approximately 650 surveillance measures worldwide. Here you can find a list of collected fields.

The data are available in several formats:

  1. CSV dumps
  2. A cleaned up csv file (with some data omitted)
  3. Or the whole MongoDB set

About the MongoDB

In the root you'll find two files:

  • mongo_schema: this file contains the MongoDB shell command that creates the validation JSON schema for the surveillance collection.
  • sample_insert: this file contains an example of a MongoDB shell command that creates one document in the surveillance collection.

There's also a _dumps directory which contains three folders:

  • mongo: this folder contains a dump of the db that can be imported with the mongorestore command.
  • private: this folder contains raw csv dumps of the surveillance collection with all fields..
  • mongo: this folder contains csv dumps of the surveillance collection with some fields and object fields split up over multiple columns.

To create dumps use these commands:

mongo

mongodump --db=surveillance --collection=measures --out=_dumps/mongo

private

mongoexport --db=surveillance --collection=measures --type=csv --fieldFile=_dumps/_private_fields.txt --out=_dumps/private/$(date +%s).csv

public

mongoexport --db=surveillance --collection=measures --type=csv --fieldFile=_dumps/_public_fields.txt --out=_dumps/public/$(date +%s).csv

Notebooks

We've added two notebooks with some code for your convenience. Of course the code could be made more pythonic, suggestions are welcome.

  • Measure data contains code to parse several datasets with covid-19 measures from several reliable sources.
  • CoronaMelder contains code snippets for analyzing several statistics concerning the Dutch contact tracing app CoronaMelder.

Some thoughts on possible research and data collection

There is plenty of interesting research possible, like:

  • Analysis of companies providing surveillance technologies.
  • Comparisson of tracked together data with other measures as described in:
  • Once the crisis has abated, a more in-depth assessment of efficacy of the measures.

If you have any questions, please feel free to reach out to us.

License: MIT

trackedtogether's People

Contributors

dtokmetzis avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.