Giter Site home page Giter Site logo

lileiting / sharepathway Goto Github PK

View Code? Open in Web Editor NEW

This project forked from guipengli/sharepathway

0.0 3.0 1.0 1.24 MB

a python package for KEGG pathway enrichment analysis with multiple gene lists.

License: GNU General Public License v3.0

Python 100.00%

sharepathway's Introduction

SharePathway

SharePathway is a python package for KEGG pathway enrichment analysis with multiple gene lists.

There have been dozens of tools or web servers for enrichment analysis using a list of candidate genes from some kinds of high throughput experiments,such as Exome-Seq and RNA-Seq. But the reality is that we usually get multiple gene lists, each from one sample or patient. We can do enrichment analysis for each sample then check which pathway or module is enriched. This strategy is simple and commonly used in cancer study. But we may lose some important driver genes.

SharePathway is motivated at providing users a simple and easy-to-use tool for enrichment analysis on multiple lists of genes simultaneously, which may help gain insight into the underlying biological background of these lists of genes.

Installation

This version is for both python2 and python3. The first step is to install Python. Python is available from the Python project page . The next step is install sharepathway.

Install from PyPi using pip, a package manager for Python:

$ pip install sharepathway

Or, you can download the source code at Github or at PyPi for SharePathway, and then run:

$ python setup.py install

Usage

Assume you have put all the path of your gene list files in one summary file genelists.txt (one path per line) in the directory ~/data/. Go into this directory,open python and run the scripts below. The result will be saved in the result.html file:

import sharepathway as sp
filein="genelists.txt"
fileout="result"
sp.Run(fi=filein,fo=fileout,species='hsa',r=0.1)

The default value of species is 'hsa', represents human species. The ratio r is the min threshold. The default value is 0.01. Entrez Gene ID is supported. The result will be output to a html file.

Output Description

  • Summary

    This part summarizes the input data.

  • Details

    This part list the ranked pathways and related information as shown below.

    Column

    Description

    Pathway

    Pathway name and hyperlink to modified KEGG map

    Genes

    KEGG ID of the genes in the pathway

    pCount

    Total number of genes in the pathway

    Count

    The number of recognized genes from user input

    Ratio

    The percentage of lists that containing genes in the pathway

    Pvalue

    The combined p valude from Fisher's Method

    EASE

    EASE score defined by DAVID, from merged gene list

    FET

    P value of the Fisher Exact Test, from merged gene list

    Samples

    The number of genes in the pathway in each list

Test data

See the gene list files and genelists.txt file in data/. This is just toy data.

Contact

Author: Guipeng Li

Email: guipeng.lee(AT)gmail.com

sharepathway's People

Contributors

guipengli avatar lileiting avatar

Watchers

James Cloos avatar  avatar  avatar

Forkers

gabrile4bick

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.