Giter Site home page Giter Site logo

datatogether / dataset_registries Goto Github PK

View Code? Open in Web Editor NEW
3.0 17.0 0.0 8 KB

Tracking the design and implementation of the metadata registries we will use to track rescued datasets

License: Creative Commons Attribution Share Alike 4.0 International

registries discussion metaverse

dataset_registries's Introduction

Data Together

Data Together empowers people to create a decentralized civic layer for the web, leveraging community, trust, and shared interest to steward data they care about.

Find out about who we are, what we do, and how to get involved at https://datatogether.org/)!

Organizational structure

We maintain pretty light governance but commit to an annual in-person meeting and quarterly calls:

Quarterly Calls

Quarterly calls are held four times annually, for everyone, but especially Data Together partners to sync up on ongoing projects, what is going on in their organizations, and more.

๐Ÿ“… Once per quarter
โ–ถ๏ธ Call Playlist: youtube.com/playlist?list=PLtsP3g9LafVul1gCctMYGm9sz5FUWr5bu

Working Openly

We have developed guidelines for working as an open project, these are all contained in this repo:

License

Data Together Documentation Materials are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

dataset_registries's People

Contributors

titaniumbones avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

dataset_registries's Issues

Still current?

Mostly empty and 8 months old. Do we want to invest in this repo?

Compare the Metadata that Different Groups have about the Datasets They have Rescued

In order to design a simple, practical, first-pass metadata format for tracking these datasets.

Information we will want the registries to keep track of:

  • Which datasets have been downloaded
  • Where they were downloaded from
  • Who downloaded them
  • Where they are available -- http links, ipfs links, dat links, etc
  • If the download was vetted for authenticity. If so, then how it was vetted.
    • for example, thus far EDGI has only posted a fraction of the downloaded datasets on datarefuge.org because that CKAN instance only contains stuff that has been vetted using a documented process. We want the registries to show everything, including the stuff that was not vetted, but we want to be able to distinguish between vetted stuff and un-vetted stuff.

Note: a lot of datasets have been downloaded multiple times by different people. We need to represent "these are both versions of the same dataset" without losing info about where they are and who downloaded them.

Add README and Templates

Make sure this repo has the following files:

  • Readme README.md
    • Repo Badges for: Github Project, Slack, License
    • 1-3 sentence description of repository contents
    • Getting Involved section
    • Development section
  • License -- LICENSE
  • Contributing Guidelines (minimal and pointing to org-wide) .github/CONTRIBUTING.md
  • Issue Template -- .github/ISSUE_TEMPLATE.md skipping for now
  • GitHub Description from 1-3 sentence readme blurb

This issue forms part of a project-wide meta-issue

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.