Giter Site home page Giter Site logo

umass-rescue / mldatasettemplate Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 0.0 48 KB

Template for hosting a dataset and remotely training models on the data

License: MIT License

Shell 4.98% Dockerfile 0.83% Python 94.19%
dataset docker fastapi python redis tensorflow2

mldatasettemplate's People

Contributors

codyrichter avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

mldatasettemplate's Issues

Create logs of worker console output

When a worker trains a model, it creates a lot of logs with important information and potentially important error messages. Log all outputs to a unique file tied to the training job instead of stdout on the worker.

Allow for dynamic loading of loss functions

Right now there is no way for a user to specify a loss function. Unlike optimizers which can be specified with a string loss functions must be loaded directly from a python object. Determine a safe and usable way for researchers to specify their own loss functions on upload

Constant Image Sizing on Load

Currently, datasets can contain images of any size. On load the images should be modified to be the same size as to work with all models.

Enable Worker Scaling

Similar to this Server Issue which also handles Docker worker scaling.

Right now, there is a single Docker worker which handles all training tasks. This is not optimal, as the single worker is hard-coded into the docker-compose.yml file and will not handle large request loads. Figure out the best way to dynamically scale the number of workers available.

Enable GPU For Training

Currently, the Docker container used for the microservice only has access to the CPU and memory. Enabling the container to interact with the system GPU can lead to large speed ups in training tasks.

Handle Automatic Train/Test Split

Right now the server only supports a train/validation split with the entire dataset, and no test set is generated when the dataset is loaded.

Create a new environment variable in .env for train/test split, and then use this new test set to provide additional statistics on model training.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.