Giter Site home page Giter Site logo

ria's Introduction

Docker image for Review Graph Mining

GPLv3 Dockerhub MicroBadger

Logo

This is a docker image for Whalebrew. This image includes all packages provided by the Review Graph Mining Project so that you don't need installing any python packages to analyze review data, but you need to install Whalebrew instead ๐Ÿณ.

Installation

$ whalebrew install rgmining/rgmining

The above command installs rgmining command, which has two subcommands, dataset and analyze. Those two sub commands are aliases of same name command provided in Scripts for Analyzing Review Graphs package.

dataset

Subcommand dataset provides a set of functions to inspect a dataset. Those functions are divided to two groups, analyzing reviewer information and analyzing product information.

Analyzing reviewer information

To analyze reviewer information of a dataset, dataset command provides the following subcommands:

  • retrieve: output the ID of reviewers who review at least one of the given products,
  • active: output the ID of reviewers who review at least threshold items,
  • reviewer_size: output the number of reviews of each reviewer who reviews target products,
  • filter: output reviews posted by reviewers whose IDs match the given set of IDs.

Analyzing product information

To analyze product information of a dataset, dataset command provides the following subcommands:

  • average: output average rating scores of each product,
  • distinct: output distinct product IDs,
  • popular: output ID of products of which the member of reviews >= threshold.
  • filter: output reviews posted to products of which IDs match the given set of IDs.
  • variance: output variances of reviews for each product.

Basic usage

The basic usage of this command is

$ rgmining dataset <dataset-specifier> <dataset-parameters> reviewer <subcommand>

or

$ rgmining dataset <dataset-specifier> <dataset-parameters> product <subcommand>

where the dataset-specifier is a name of the dataset to be analyzed. It is depended on which libraries you have installed and rgmining dataset -h returns a list of available dataset names.

dataset-parameters are optional arguments specified with --dataset-param flag. The --dataset-param flag takes a string which connecting key and value with a single =. The --dataset-param flag can be given multi-times. You can find what kinds of parameter keys are defined in the dataset you want to use from documents of function load defined in the dataset.

For example, dataset file means loading a dataset from a file, of which each line contains a review in the JSON format. To load such dataset, use file as the dataset-specifier and give the file path as a dataset-parameter with file key, i.e. --dataset-param file="path/to/file".

See document site for more information about each subcommand.

analyze command

analyze command loads a dataset and run a method to find anomalous reviewers and compute a rating summary of each product.

The basic usage of this command is

$ rgmining analyze <dataset-specifier> <dataset-parameters> <method-specifier> <method-parameters>

The dataset-specifier and datasset-parameters are the same parameters described in the dataset command explanation.

The method-specifier is a name of installed method. You can see available method names by rgmining analyze -h.

method-parameters are optional arguments specified with --method-param flag. The --method-param flag takes a string which connecting key and value with a single =, and can be given multi-times.

You can find what kinds of parameter keys are defined in the method you want to run from documents of the constructor of the review graph object defined in the method.

For example, Fraud Eagle takes one parameter epsilon and you can give a value by --method-param epsilon=0.25.

See document site for more information.

License

This software is released under The GNU General Public License Version 3, see COPYING for more detail.

ria's People

Contributors

jkawamoto avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.