Giter Site home page Giter Site logo

justdata's Introduction

JUST Data Annotation Tool

JUST Logo

This repository contains a Python implementation of the JUST Data Annotation tool which allows the user to load and annotate datasets using a set of predefined tags as well as user-generated tags and attach use case specific files.

JUST Data Annotation focuses on developing a digital workflow fostering gender equality and inclusivity in data collection, handling and management, particularly within the context of industrial data and AI applications. JUST โ€“ Judicious, Unbiased, Safe and Transparent โ€“ is not intended as a metric, but aims to set guidelines for data literacy about bias, sensitive data, the context of data collection and data provenance. Our methodology includes:

  • Creating standards for data annotation that prioritise fairness, safety, and transparency.
  • Engaging a wide range of participants, including underrepresented groups, in the development and testing phases to ensure diverse perspectives are considered.
  • Prototyping new standards and formulating practical guidelines for businesses to implement JUST data principles in their operations.
  • Establishing testbed environments to trial, iterate and refine the JUST data annotation processes, ensuring they are effective and applicable across various industry sectors.

The fronted was built using streamlit to create a scriptable web app. The backend was built using PostgreSQL, JSON, pandas, langchain, openai and other popular Python libraries.

Installation

Clone the repository:

git clone https://github.com/IndustryCommons/justdata.git

Install the necessary requirements, run:

pip install -r requirements.txt

Usage

To run on a local server, execute:

streamlit run justdata.py --server.enableCORS false --server.enableXsrfProtection false

The app has been deployed on the Streamlit Community Cloud here.

Get in touch

If you need more information or would like to contribute to JUST Data Annotation, please contact us at [email protected]

justdata's People

Contributors

iemanuilov avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.