raphael-group / spruce

SPRUCE: Somatic Phylogeny Reconstruction using Combinatorial Enumeration

License: Other

SPRUCE

SPRUCE (Somatic Phylogeny Reconstruction using Combinatorial Enumeration) is an algorithm for inferring the clonal evolution of single-nucleotide and copy-number variants given multi-sample bulk tumor sequencing data.

Support

For support using SPRUCE, please visit the SPRUCE Google Group.

Dependencies

SPRUCE is written in C++. In addition to a recent C++ compiler with C++11 support, it has the following dependencies:

CMake (>= 2.8)
Boost (>= 1.38)
LEMON graph library (>= 1.3)

Graphviz is required to visualize the resulting DOT files, but is not required for compilation.
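
As an illustration of the visualization step, the following is a minimal sketch of rendering a DOT file to PDF with Graphviz's `dot` command. The helper name `render_dot` and the output naming scheme are assumptions made for this example and are not part of SPRUCE.

```shell
# Hypothetical helper (not part of SPRUCE): render a DOT file to a PDF
# placed next to it, using Graphviz's `dot` command-line tool.
render_dot() {
  dot_file="$1"
  # Assumed naming scheme: solution.dot -> solution.pdf
  pdf_file="${dot_file%.dot}.pdf"

  # Refuse to continue if the input file does not exist.
  if [ ! -f "$dot_file" ]; then
    echo "no such DOT file: $dot_file" >&2
    return 1
  fi

  # Graphviz is optional for compilation, so check for it at run time.
  if ! command -v dot >/dev/null 2>&1; then
    echo "Graphviz 'dot' not found on PATH" >&2
    return 1
  fi

  dot -Tpdf "$dot_file" -o "$pdf_file"
}
```

For a DOT file named, say, solution.dot (a placeholder name), calling `render_dot solution.dot` would write solution.pdf to the same directory.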

Compilation instructions

To compile SPRUCE, execute the following commands from the root of the repository:

mkdir build
cd build
cmake ..
make

If CMake fails to detect LEMON, rerun it with LIBLEMON_ROOT set to the path of your LEMON installation, for example:

cmake -DLIBLEMON_ROOT=~/lemon ..

The compilation results in the following files in the build directory:

EXECUTABLE	DESCRIPTION
`cliques`	Enumerates cliques of the compatibility graph (given a size and a filter)
`enumerate`	Enumerates perfect phylogeny trees
`merge`	Merges multiple solution files into one
`rank`	Sorts solution trees by the fraction of common edges (solution with rank 0 is the most representative tree)
`visualize`	Visualizes one solution or the entire solution space
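
To confirm that the build produced all five executables in the table above, a check along the following lines may help. This is a minimal sketch: the function name `check_spruce_build` is hypothetical, and it assumes the executables sit directly in the build directory, as described above.

```shell
# Hypothetical helper (not part of SPRUCE): report which of the executables
# listed in the table above are missing from a build directory.
check_spruce_build() {
  # Default to ./build, the directory created in the compilation instructions.
  build_dir="${1:-build}"
  status=0
  for exe in cliques enumerate merge rank visualize; do
    if [ ! -x "$build_dir/$exe" ]; then
      echo "missing: $build_dir/$exe" >&2
      status=1
    fi
  done
  # Return 0 only if every executable was found.
  return "$status"
}
```

Running `check_spruce_build build` from the root of the repository prints one line per missing executable and returns a non-zero status if any are absent. A missing executable is one possible cause of "No such file or directory" errors when the example scripts are run.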

For example usage, see result/run_A22.sh and the corresponding instructions. For a description of the input file format, see data/README.md.

For instructions on how to visualize the set of enumerated trees see src/visualization/README.md.

spruce's Issues

Documentation of command line parameters

I'm interested in running SPRUCE but I'm having difficulties understanding the command line params. I got stuck at the first step already, when running cliques.

$ ./build/cliques --help
Usage:
  ./build/cliques [--help|-h|-help]
     [--version] [-f str] [-l int] [-s int] [-v int] input
Where:
  input
     Input file
  --help|-h|-help
     Print a short help message
  --version
     Show version number
  -f str
     Filter (default : "")
  -l int
     Clique limit (default : -1 (unlimited))
  -s int
     Maximal clique size (default: -1 (maximum))
  -v int
     Verbosity level (default: 1)

What does the -f parameter actually do?

Is there any user manual that I may have overlooked?

Any help is highly appreciated.
Thanks,
Harry

executing spruce

Hello,
I am trying to use SPRUCE for one of our heterogeneity projects and am having difficulty setting up the input data sets. I am working with exome data (relatively high depth, > 150x in all samples). I am not sure how to determine the lower and upper bounds on the VAF and the mu values for my data. Unfortunately, I only have access to the RECOMB paper, which does not seem to mention anything about VAF and copy-number states. Any help is much appreciated.

Thanks
Arun

crash running example

When I run the following command line from the example run_A22.sh:

cliques -s 26 -f 70,1 ../../data/real/A22.tsv

There is a crash at compatibilitygraph.cpp:113 because _cliques.size() is 0.

Problem in running SPRUCE

When I run ./run_sims.sh, I get the following errors:
Running sims_r15_m5_n5_c1000...
./run_sims.sh: line 39: ../../build//enumerate: No such file or directory
./run_sims.sh: line 40: ../../build//rank: No such file or directory
Running sims_r15_m5_n5_c10000...
./run_sims.sh: line 39: ../../build//enumerate: No such file or directory
./run_sims.sh: line 40: ../../build//rank: No such file or directory
.
.
Traceback (most recent call last):
  File "../process_noisy.py", line 65, in <module>
    concordance = float(string.split()[0])
ValueError: could not convert string to float: ./run_sims.sh:

Segmentation fault (cliques)

I ran my own dataset as follows:
cliques -s 26 -v 10 -f 70,1 ESCA.spruce.txt > ESCA.70_1.cliques
but got a "Segmentation fault" error.

ESCA.spruce.txt

Improved Documentation About Input Files

Like #1 I also can't understand how to create the input files. Let's assume that I've downloaded a few VCFs of TCGA Cancer Samples from Genomic Data Commons. How can I easily analyse them with Spruce? The user experience is an important factor in how popular a method becomes and how often it will be cited. Spruce could improve.
