holtjma / msbwt Goto Github PK
View Code? Open in Web Editor NEWA python toolkit for building and querying multi-string BWTs
License: MIT License
A python toolkit for building and querying multi-string BWTs
License: MIT License
Hi Matt,
I'm interested in FMLRC and want to use it to correct my PacBio reads. I have ~50 X Illumina data (~125 Gb in total) and I'm building msbwt for Illumina reads. However, it seems that it takes too long to do it. Here is some information in the log file:
[2016-10-31 16:12:16] INFO: Formatting sequences for merging...
[2016-11-04 22:27:21] INFO: Beginning MSBWT construction...
[2016-11-05 00:20:34] INFO: Processing groups of size 256...
[2016-11-06 18:33:50] INFO: Processing groups of size 512...
[2016-11-09 19:07:49] INFO: Processing groups of size 1024...
[2016-11-13 12:56:12] INFO: Processing groups of size 2048...
I wonder what does the "size" mean and when will it finish?
Thank you for your time.
Yuncong Geng
I always get this error when trying to import a MSBWT using the Python API.
I have a small FASTQ file, 400 long sequencing reads. I've tried creating the BWT using both the msbwt package and by converting from the output of ropebwt2. Querying using the command-line also works so I'm wondering what if I'm missing anything. The package has been installed with Python 2.7.
I can provide the data needed to reproduce the above error if necessary.
gunzip -c reads.sorted.txt.gz | tr NT TN | ropebwt2 -LR | tr NT TN | msbwt convert /path/to/output/msbwt
This command does not produce any output file or folder when I try it. Is the tr NT TN command use twice correct?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.