Giter Site home page Giter Site logo

cafa_evaluation's Introduction

CAFA Predictions Evaluation

Test data are available in FigShare: https://figshare.com/articles/dataset/cafa_test_data_tgz/16713046

This repository is a collection of code for evaluating CAFA predictions

See also the walkthrough example

To use the code, the steps are:

  1. Obtain or generate propagation Pandas DataFrame files based on the relevant CAFA challenge obo file. These files are pickled DataFrames, one per relevant GO ontology namespace that are used for propagating both benchmark and prediction annotation data. To generate these files yourself, see generate_propagation_map_dataframe.py. This file can be called directly from the shell with a yaml configuration file. See obo_parser_config.yml as an example of the expected configuration.

  2. Obtain or generate weighted DAG files (one per GO ontology namespace). These are pickled Pandas DataFrame files containing the Information Content data for the relevant DAG nodes. To generate your own weighted DAG files, see generate_information_content_matrices.py. This file uses the same configuration yaml as step 1.

  3. Parse raw benchmark data into json-formatted per-species, per-ontology (CCO, MFO, BPO, etc) files. This step uses raw_benchmark_parser.py. This step requires a yaml configuration file as well. See parser_config.yml for an example of the necessary configuration keys and values

  4. Parse raw prediction data into json-fomatted files on a per-species and per-ontology basis. This step uses raw_prediction_parser.py in conjunction with the same configuration file from step #3.

  5. Evaluate the prediction data on a per-species, per-ontology basis against the benchmark data using evaluate_species_prediction.py

  6. Generate cross-species evaluation metrics using evaluate_cross_species.py

cafa_evaluation's People

Contributors

idoerg avatar szarecor avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.