Giter Site home page Giter Site logo

steffenbrinckmann / metadata_extractors_registry Goto Github PK

View Code? Open in Web Editor NEW

This project forked from marda-alliance/metadata_extractors_registry

0.0 0.0 0.0 150 KB

A place to develop and discuss the MaRDA Extractors WG registry.

Home Page: https://marda-registry.fly.dev/

License: MIT License

Python 42.29% CSS 14.71% xBase 0.58% HTML 39.12% Dockerfile 3.02% Procfile 0.29%

metadata_extractors_registry's Introduction

MaRDA Metadata Extractors: Registry

Documentation Github status

A place to develop and discuss the MaRDA Extractors WG registry. The idea is to collect various file formats used in materials science and chemistry, describe them with metadata, and provide links to software projects that can parse them.

By providing this data in a web API, it hoped that users can discover new extractors more easily and metadata standards can be developed for the output of extractors to enable schemas to proliferate throughout the field.

The state of the main branch is deployed to https://marda-registry.fly.dev/, with API docs (and built-in client) accessible at https://marda-registry.fly.dev/redoc.

Contributing

You are welcome to contribute file type and extractor entries to this registry, by opening a pull request. Please see the contributing guidelines for detailed steps. After submitting a pull request, this data will be validated and added to the deployed database once it is merged.

Development

Clone repository with submodules and install deps in a fresh Python virtualenv:

git clone [email protected]:marda-alliance/marda_extractors_registry --recurse-submodules
pip install -r requirements.txt

Use invoke and the tasks in tasks.py to generate pydantic models for all schemas defined in the schema repo:

invoke regenerate-models

From the repository root directory, launch the server with uvicorn:

uvicorn marda_registry.app:app

then navigate to http://localhost:5000 to test.

Deployment

The registry app can be easily deployed via the given Dockerfile. After cloning the repository (with submodules, following the instructions above), the image can be built for a given schema version by running

docker build . -t marda-registry

and then launched with

docker run -p 8080 --env PORT=8080 marda-registry

or equivalent command.

Registry Maintainers

metadata_extractors_registry's People

Contributors

ml-evs avatar pre-commit-ci[bot] avatar peterkraus avatar edan-bainglass avatar jdbocarsly avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.