Giter Site home page Giter Site logo

tooltagging's Introduction

Installation

  • Link download

  • Open the containing folder, and run in the terminal: java -jar PreprocessSentence.jar image

image


1. Input file

  • File sentence: containing text data needed to tag
  • File label: containing labels

2. File input format

File data (example file: sentences.csv)

Sentence Status/Comment
I want to buy a new car has intent

File label (example file: label.txt)

Label Abbreviation
Object obj
Action act

3. Flow

  1. Choose label file (example file: label.txt)
  2. Choose file data(example file: sentences.csv)
  3. Click Add/Remove Label button to change the label (the content of original label file will be rewrited)
  4. Start tagging
Button Action
undo back to the previous state text area (only 1 step back)
restore back to the inital state of text area
back back a row in table (previous sentence)
next next a row in table (next sentence)
un/consider mark or unmark "consider" status for the current data row
status - show and edit status/comment to explain more.
- All comments must be written in only 1 row, do not enter in the text box when typing, it will break the structure of csv file
rm label - All label in the selected text will be removed.
- For example: you select <prc>5000 dong </prc>, then click rm label. <prc>5000 dong </prc> -> 5000 dong
remove Remove the current row data (Be careful when deciding to remove a row because It cannot be restored)

4. Result

  • The result file after tagging will be generated and saved in file tagged_*.csv in the same folder with the tool

  • You must open this "tagged_*.csv" file to continue in the next time. If you try to open the original file, the tagged file will be regenerated and rewrited.

  • For example: sentences.csv is the original file, and tagged_sentences.csv is the tagged file. You must open tagged_sentences.csv in the next time to continue to tag without losing all tagged data from the previous time.
    (If you try to open sentences.csv as the input file sentence, the tagged_sentences.csv file will be regenerated and replace the existed tagged_sentences.csv file)

tooltagging's People

Contributors

hongnguyenhuu96 avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.