Giter Site home page Giter Site logo

xorsuyash / autotune Goto Github PK

View Code? Open in Web Editor NEW

This project forked from bharatsahaiyak/autotune

0.0 0.0 0.0 1.17 MB

A comprehensive toolkit for seamless data generation and fine-tuning of NLP models, all conveniently packed into a single block.

License: MIT License

JavaScript 23.73% Python 56.10% CSS 20.07% Dockerfile 0.10%

autotune's Introduction

Coverage Status

AutoTuneNLP

A comprehensive toolkit for seamless data generation and fine-tuning of NLP models, all conveniently packed into a single block.

Setup

Clone the repo and cd to project root.

Environment

This projects works with Python 3.10

Create the virtual environment

python3.10 -m venv venv

Activate the virtual environment

  • For Linux and MacOS
source venv/bin/activate
  • For Windows
.\venv\Scripts\activate

Install all dependancies

pip install poetry
poetry install

Local Development

  1. Start your docker engine.

  2. Copy the sample env and populate the fields

cp sample.env .env
  1. Start the redis and postgres containers
docker compose up -d redis postgres

NOTE: activate the virtual environment before starting the django server and celery worker

  1. Start the django server in a terminal window.
python manage.py runserver port
  1. Start celery worker using gevent pool in another terminal window.
celery -A autotune worker --loglevel=info -P gevent
  • If you are running on windows, the above command won't work since celery is not supported on windows, but you can use the below command for testing (caveat: it's capabilities are lost).
celery -A autotune worker --loglevel=info  --pool=solo

Contributing

Interested in contributing to AutoTune? We'd love your help! Check out our issues section for areas where you can contribute. Please see our contribution guide for more details on how to get involved.

Typical Workflow

  1. User is shown a login page to login using their Google account. (The account has to be of 'samagragovernance.in' domain). The user is then shown a settings page where they are nudged to update the API keys for OpenAI and HuggingFace. The user is also shown a list of all the repos they have access to on HuggingFace. The user can select the repo they want to work on and the settings are saved. The settings tab allows the user to view and update the settings.
  2. User gives a prompt and samples are 5 generated automatically.
  3. User can view the 5 generated samples and select the ones they like and dislike.
  4. The prompt is updated with those examples and the process is repeated until the user is satisfied with the 5 samples.
  5. Once the user is satified, user provides the number of samples they want and the data is generated. The process is async and progress is shown to the user. The progress is tracked every 2 seconds.
  6. The data is generated and shown to the user, they give a go ahead and the dataset is pushed to huggingface. The dataset tab allows the user to view all the datasets that they have deployed until now on huggingface.
  7. A link is shared with the user so that they can view the data on huggingface.
  8. There is a tab called train, which allows user to use the dataset created earlier to train a model by filling a form. The process is async and progress is shown to the user. The progress is tracked every 2 seconds.
  9. Once trained the user is allowed to view the results of the model on the test set. The user can also view the results of the model on the validation set. The models tab allows the user to view all the models that they have deployed until now.
  10. The user is then nudged to deploy the model to huggingface. Once confirmed, the user is asked to provide a name for the model and the model is pushed to huggingface.
  11. A link is shared with the user so that they can view the model on huggingface and a curl is shared so that they can use the model for inference.
  12. The history tab allows the user to view all the tasks that they have performed until now.

License

AutoTune is made available under the MIT License. See the LICENSE file for more info.

Thank you for considering AutoTune for your machine learning and dataset creation needs. We're excited to see the innovative solutions you'll build with our platform!

autotune's People

Contributors

sooraj1002 avatar chakshugautam avatar ksgr5566 avatar singhalkarun avatar kdwevedi avatar takshpanchal avatar gautamr-samagra avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.