
slrbl / human-in-the-loop-machine-learning-tool-tornado


Tornado is an open source Human-in-the-loop machine learning tool. It helps you label your dataset on the fly while training your model through a simple web user interface. It supports all data types: structured, text and image.

License: GNU Affero General Public License v3.0

Ruby 40.74% HTML 37.26% Python 13.93% Dockerfile 0.83% JavaScript 0.90% CoffeeScript 0.45% SCSS 5.89%
active-learning active-learning-module activelearning artificial-intelligence automl data-annotation data-annotation-machine-learning data-annotation-tools dataannotations human-in-the-loop-machine-learning interactive-machine-learning keras machine-learning machine-learning-api machinelearning natural-language-processing python ruby-on-rails sklearn

human-in-the-loop-machine-learning-tool-tornado's People

Contributors

slrbl, szperajacyzolw

human-in-the-loop-machine-learning-tool-tornado's Issues

Elastic Search: [ERROR][o.e.g.GatewayMetaState ] [swoqVS5] failed to read local state, exiting...

I managed to get the page pulled up at 0.0.0.0:3000, but when I try to upload the Kaggle autism screening for toddlers data set, I run into the same issue as Issue #2.

website_issue

Based on the error messages below, this is definitely a problem with Elasticsearch. The application then reports that Elasticsearch has exited with code 1, which causes Kibana to report repeatedly that there is no live connection at http://elasticsearch:9200.

elastic_search_error

I saw that Issue #2 is similar to mine, but I am unsure how to follow its steps since the repository now has a very different structure. I tried downloading Elasticsearch and putting it into the directory manually, but that didn't seem to work. Thanks in advance.

Gem::LoadError sqlite3

When I execute the rake db:migrate command I get:

Amrapalis-MBP:tornado-active-learning amrapaliz$ rake db:migrate
rake aborted!
Gem::LoadError: Specified 'sqlite3' for database adapter, but the gem is not loaded. Add gem 'sqlite3' to your Gemfile (and ensure its version is at the minimum required by ActiveRecord).

The Gemfile already has sqlite3. Which version should we add?

Can this repo provide a docker-compose example?

To promote this application, it is important that it be easy to start up.

The README gives this command:
sh ./launch_tornado.sh

It did not work in my environment (Docker image ubuntu:16.04) and failed with:
./launch_tornado.sh: 5: ./launch_tornado.sh: bundle: not found

I can of course search for a fix, such as this one:
https://stackoverflow.com/questions/3914694/bundle-command-not-found
which requires installing Ruby and gems.

But rather than continuing with the setup, my first trial of Tornado stopped here. I am wondering whether the application could offer an easier way to get started, which might bring more users to Tornado.

Thanks!

NLP Problems

I would like to label some text strings with a binary label. My data is a CSV file with two columns, one with the string of text, and another with human labels (for ground-truth purposes). The text fields are pretty long, with some reaching 4,000 characters. The file has 6,281 rows. Whenever I try to upload the CSV, I get the following error:

image

I figured it might have been an encoding problem, so I changed all string encoding in the file to UTF-8 and uploaded that version instead. Whenever I upload the UTF-8 version it hangs on "processing" for a long time, and opening the file reveals the image below. This is the first row of the data, truncated at 83 characters, and repeating 12 times. However, this particular string only appears in the data set twice.

image

I tried shortening the data set to only 50 rows and got the same behavior as above. Then I tried shortening the text strings themselves to 50 characters, because I figured the string length might be the issue. Uploading the full file with all fields truncated at 50 characters results in the behavior below:

image

Then I tried shortening the text even more, to 10 characters, and found that it managed to upload the file! However, it is still stuck in "processing". I also discovered that the upload with only 10 characters works for both the original data and the UTF-8 encoded data, but longer text strings will throw the error shown in the first image in this issue.

image

My question is, is there a way to do NLP with the long text strings? Is there a limit on how long the text strings can be? Thanks in advance.
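Before uploading, it can help to confirm what the file actually contains. The following is a minimal sketch using only the Python standard library; it is not part of Tornado, and the function names `summarize_text_column` and `is_valid_utf8` are hypothetical. It counts the rows in a CSV stream, reports the longest value in one column, and checks whether raw bytes decode as UTF-8.

```python
import csv
import io


def summarize_text_column(stream, column: int = 0) -> dict:
    """Count rows and report the longest value in one column of a CSV stream."""
    rows = 0
    longest = 0
    for record in csv.reader(stream):
        if len(record) <= column:
            continue  # skip blank lines or rows that lack the column
        rows += 1
        longest = max(longest, len(record[column]))
    return {"rows": rows, "longest": longest}


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    # In-memory sample standing in for the real file (hypothetical data).
    sample = io.StringIO('text,label\n"long, quoted text",1\nshort,0\n')
    print(summarize_text_column(sample))
```

The header line is counted as a row. For a file on disk, open it with `newline=""` and an explicit `encoding` before passing it to `summarize_text_column`, so that quoted fields containing line breaks are parsed as single values.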

Adding Datasets

After I logged in, I clicked on "Add dataset". I used the CSV file from https://www.kaggle.com/fabdelja/autism-screening-for-toddlers/version/1.

  1. I get this error "Failed to open TCP connection to 127.0.0.1:9200 (Connection refused - connect(2) for "127.0.0.1" port 9200)": https://drive.google.com/open?id=1mX5rUT5iEv3LnzhPvKFqlNG0uaJbGPWM.

  2. When I click back I do see a list of the same dataset and the multiple attempts to upload: https://drive.google.com/open?id=1P6TAKUcXUh9UBG4tuf5dxWQ_HR_OSvZU.

  3. When I click on one of those datasets, I get an error "Something went wrong..", see https://drive.google.com/open?id=1lbyflIit6MA4lHuBh-Zq_FYtRYbTyjT5.
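A "Connection refused" error on 127.0.0.1:9200 means nothing was accepting TCP connections on that port at the time of the upload. The sketch below is one way to check this independently of the web application. It uses only the Python standard library, and `port_is_open` is a hypothetical helper name, not a Tornado or Elasticsearch API.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False


if __name__ == "__main__":
    host, port = "127.0.0.1", 9200  # address taken from the error message above
    state = "reachable" if port_is_open(host, port) else "not reachable"
    print(f"{host}:{port} is {state}")
```

If the port is not reachable, the dataset upload cannot succeed until the search service is running and listening on that address.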

No such file or directory/env: ruby

When running docker-compose up, I get the error "No such file or directory/env: ruby". I have attached a screenshot of the error I am getting. Because of this error, 0.0.0.0:3000 doesn't have anything on it. Is there something I need to do with ruby to get this up and running? Thanks in advance.

image
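One possible cause, which the thread does not confirm, is Windows (CRLF) line endings in a script whose first line is "#!/usr/bin/env ruby". In that case env looks for a program named "ruby" followed by a carriage return, and the carriage return makes the terminal overwrite the start of the message, which would produce text like the one quoted. The sketch below checks a script's first line for that condition; the helper names are hypothetical.

```python
def shebang_has_crlf(first_line: bytes) -> bool:
    """Return True if the line is a shebang that ends in a carriage return."""
    return first_line.startswith(b"#!") and first_line.rstrip(b"\n").endswith(b"\r")


def file_has_crlf_shebang(path: str) -> bool:
    """Read the first line of a file in binary mode and test it."""
    with open(path, "rb") as handle:
        return shebang_has_crlf(handle.readline())


if __name__ == "__main__":
    print(shebang_has_crlf(b"#!/usr/bin/env ruby\r\n"))
    print(shebang_has_crlf(b"#!/usr/bin/env ruby\n"))
```

If a script is flagged, converting its line endings to LF (and configuring Git not to convert them on checkout) is the usual remedy for this class of error.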
