Giter Site home page Giter Site logo

gerbrichferdinands / asreview-thesis-visualization Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 0.0 221 KB

A visualiztaion library to create the plots for my thesis project asreview-thesis.

License: Apache License 2.0

Jupyter Notebook 0.29% Python 99.71%
asreview

asreview-thesis-visualization's Introduction

ASReview-visualization

Deploy and releaseBuild status

This is a plotting/visualization supplemental package for the ASReview software. It is a fast way to create a visual impression of the ASReview with different dataset, models and model parameters.

Installation

The easiest way to install the visualization package is to use the command line:

pip install asreview-visualization

After installation of the visualization package, asreview should automatically detect it. Test this by:

asreview --help

It should list the 'plot' modus.

Basic usage

State files that were created with the same ASReview settings can be put together/averaged by putting them in the same directory. State files with different settings/datasets should be put in different directories to compare them.

As an example consider the following directory structure, where we have two datasets, called ace and ptsd, each of which have 8 runs:

├── ace
│   ├── results_0.h5
│   ├── results_1.h5
│   ├── results_2.h5
│   ├── results_3.h5
│   ├── results_4.h5
│   ├── results_5.h5
│   ├── results_6.h5
│   └── results_7.h5
└── ptsd
    ├── results_0.h5
    ├── results_1.h5
    ├── results_2.h5
    ├── results_3.h5
    ├── results_4.h5
    ├── results_5.h5
    ├── results_6.h5
    └── results_7.h5

Then we can plot the results by:

asreview plot ace ptsd

By default, the values shown are expressed as percentages of the total number of papers. Use the -a or --absolute-values flags to have them expressed in absolute numbers:

asreview plot ace ptsd --absolute-values

Plot types

There are currently four plot types implemented: inclusion, discovery, limit, progression. They can be individually selected with the -t or --type switch. Multiple plots can be made by using , as a separator:

asreview plot ace ptsd --type 'inclusions,discovery'

Inclusion

This figure shows the number/percentage of included papers found as a function of the number/percentage of papers reviewed. Initial included/excluded papers are subtracted so that the line always starts at (0,0).

The quicker the line goes to a 100%, the better the performance.

alt text

Discovery

This figure shows the distribution of the number of papers that have to be read before discovering each inclusion. Not every paper is equally hard to find.

The closer to the left, the better.

alt text

Limit

This figure shows how many papers need to be read with a given criterion. A criterion is expressed as "after reading y % of the papers, at most an average of z included papers have been not been seen by the reviewer, if he is using max sampling.". Here, y is shown on the y-axis, while three values of z are plotted as three different lines with the same color. The three values for z are 0.1, 0.5 and 2.0.

The quicker the lines touch the black (y=x) line, the better.

alt text

Progression

This figure shows the average inclusion rate as a function of time, number of papers read. The more concentrated on the left, the better. The thick line is the average of individual runs (thin lines). The visualization package will automatically detect which are directories and which are files. The curve is smoothed out by using a Gaussian smoothing algorithm.

alt text

API

To make use of the more advanced features, you can also use the visualization package as a library. The advantage is that you can make more reproducible plots where text, etc. is in the place you want it. Examples can be found in module asreviewcontrib.visualization.quick. Those are the scripts that are used for the command line interface.

with Plot.from_paths(["PATH_1", "PATH_2"]) as plot:
	inc_plot = plot.new("inclusion")
	inc_plot.set_grid()
	inc_plot.set_xlim(0, 30)
	inc_plot.set_ylim(0, 101)
	inc_plot.set_legend()
	inc_plot.show()
	inc_plot.save("SOME_FILE.png")

Of course fill in PATH_1 and PATH_2 as the files you would like to plot.

If the customization is not sufficient, you can also directly manipulate the self.ax and self.fig attributes of the plotting class.

asreview-thesis-visualization's People

Contributors

gerbrichferdinands avatar j535d165 avatar qubixes avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.