Giter Site home page Giter Site logo

tox21-cluster-translation's Introduction

Tox21 Cluster Interpretation with NCATS Biomedical Data Translator

This repository contains the source code required to query translator for connections between genes and chemical compounds. The specific genes and chemical compounds were generated from an analysis of the Tox21 bioassay data. Tox21 chemical compounds were clustered based on their bioassay profiles. The compounds were annotated to show which gene products they have activity against. An enrichment analysis was applied to identify the key gene targets within each cluster. Then candidate compound-gene pairs were generated by connecting the enriched genes in a cluster to the other compounds within that cluster. To priotitize comound-gene pairs as hypothesese for follow up, we searched the translator for connections.

Learn more about the NCATS Biomedical Data Translator:

Setup

We provide a translator-query.yml file that defines the conda environment needed for this analysis.

conda env create -f translator-query.yml

A local python package, translatorpy, is provided for interfacing with translator. Install this package using pip:

pip install -e . 

Running

Getting the data

THe primary script for running the analysis is compound_target_candidate_search.py

python compound_target_candidate_search.py

This script directly queries translator and pipes the results into the Knowledge Graph Exchange format to enable easier downstream analysis.

Depending on the number of cores on your machines, this could take days. I would not suggest using more than 64 cores at once to avoid hammering the translator ARS system.

Piping to neo4j

The translator querying script will leave a series of knowledge graph exchange formatted tsv files. Those files should be tidied up into one file for nodes and one file for edges.

python clean_kgx_files.py

Then the consolidate knowledge graph files can be loaded into neo4j. You should have an active neo4j database running on your local machine. On my laptop, I use the neo4j desktop version. On my cloud instances I like to use the neo4j docker image.

bash kgx-neo4j.sh

tox21-cluster-translation's People

Contributors

jsanjak avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.