Giter Site home page Giter Site logo

mar_test's Introduction

What is MAR?

MAR stands for machine assisted reading, it is a tool to support primary study selection in systematic literature review.

Latest Versions:

Cite as:

@misc{zhe yu_2016, 
      title={ai-se/MAR: MAR v1.0.2}, 
      DOI={10.5281/zenodo.203136}, 
      publisher={Zenodo}, 
      author={Zhe Yu}, 
      year={2016}, 
      month={Dec}}

Setting up MAR

  1. Setting up Python:
  • We use anaconda by continuum.io (see Why?)
    • We won't need the entire distribution. Download a Python 2.7 version & install a minimal version of anaconda.
  • Make sure you select add to PATH during install.
  • Next, navigate to MAR, and run setup.bat. This will install all the dependencies needed to run the tool.
  • If the above does not work well. Remember you only need a Python 2.7 and three packages listed in requirements.txt installed.
  1. Running script:
  • Navigate to MAR > src and run index.py.
  • If all is well, you'll be greeted by this:
  1. The Interface:

Use MAR

  1. Get data ready:
  • Put your candidate list (a csv file) in MAR > workspace > data.
  • The candidate list can be as the same format as the example file MAR > workspace > data > Hall.csv or a csv file exported from IEEExplore.
  1. Load the data:
  • Click Choose File button to select your csv file in MAR > workspace > data. Wait a few seconds for the first time. Once the data is successfully loaded, you will see the following:
  1. Begin reviewing studies:
  • choose from Relevant, Irrelevant, or Undetermined for each study and hit Submit.
  • hit Next when you want a to review more.
  • statistics are displayed as Documents Coded: x/y (z), where x is the number of relevant studies retrieved, y is the number of studies reviewed, and z is the total number of candidate studies.
  • when x is greater than or equal to 1, an SVM model will be trained after hitting Next.
  • rather than Random sampling, you can now select certain or uncertain for reviewing studies. certain returns the studies that the model thinks are most possible to be relevant while uncertain returns the studies that model is least confident to classify.
  • keep reviewing studies until you think most relevant ones have been retrieved.
  1. Auto review:
  • If your data contains true label, like Hall.csv does, another button called Auto Review will be enabled. By clicking it, it automatically labels all your current studies (depending on the selection Random, certain or uncertain).
  1. Plot the curve:
  • Click Plot button will plot a Relevant studies retrieved vs. Studies reviewed curve.
  • Check Auto Plot so that every time you hit next, a curve will be automatically generated.
  • You can also find the figure in MAR > src > static > image.
  1. Export csv:
  • Click Export button will generate a csv file with your coding in MAR > workspace > coded.
  1. Restart:
  • Click Restart button will give you a fresh start and loose all your previous effort on the current data.
  1. Remember to click Next button:
  • User data will be saved when and only when you hit Next button, so please don't forget to hit it before you want to stop reviewing.

Version Logs

Dec 5, 2016. v1.0.0 The very first, basic version is released.

Dec 6, 2016. v1.0.1 Add one feature under testing, which can predict when the review process should stop.

Dec 14, 2016. v1.0.2:

  • Exported csv file: timestamp added, auto-sorted.
  • "How to Read Less: Better Machine Assisted Reading Methods for Systematic Literature Reviews" is submitted to IST journal and uploaded to arxiv.

mar_test's People

Watchers

Tim Menzies avatar James Cloos avatar George Mathew avatar Wei Fu avatar Di (Jack) Chen avatar paper2code - bot avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.