Comments (3)
Hello Fang,
Thanks for your interest in our program! I think the taxonomy parsing issue
is due to the frequently updated NCBI records, which may contain format
errors, or new features that our program cannot recognize. Therefore
deploying a local database is recommended. If you use databaser.py to
generate the local database, as of Oct 2015, the disk space usage is 15.4
GB for BLAST, 12.9 GB for DIAMOND and I don't know how much for RAPsearch2
(but should be larger than DIAMOND, according to my experience). Besides
there is a 200 MB taxdump and a 500 MB GI-to-TaxID dictionary. I hope they
are not too much for your disk quote.
Attached please find a development version of searcher.pl, which should run
DIAMOND and RAPSearch2 way faster than the older version. Have fun.
Let me think about your other question and get back to you soon.
Best,
Qiyun
On Mon, Jan 11, 2016 at 8:11 PM, yoyohashao [email protected]
wrote:
Hello, I was just wondering if it's necessary to perform HGT search on a
Candidate phylum In my opinion, I thought HGT search should be based on the
taxa with clear phylogenetic boundaries and nice representatives The
genomes I am dealing with are a Candidate phylum 'Tectomicrobia' (yet in
NCBI taxonomy, it's still a genus in deltaproteobacteria, taxid 93171) All
available genomes are from metagenomes (lots of hypothetical proteins,
70-90% completeness) Do you think it's viable to find HGT events in these
genomes?Also, HGTector seems to be unable to recognize the taxid:
Enter the TaxID of v42, or press Enter if you don't know:91371
Warning: Invalid TaxID:It's 2016 now and the taxaset was based on 2002 version
My another question is about the databaserpy, I haven't tested this script
due to some disk space issue I want to know if it will generate the
database compatible with RAPsearch/DIAMOND and the peak space usage of
databaserpyThanks
Kind,Fang
—
Reply to this email directly or view it on GitHub
https://github.com/DittmarLab/HGTector/issues/1.
from hgtector.
Thanks for the reply Qiyun. Can you re-upload the new searcher.pl? I didn't see it in the attachment. Also, if I download a master fasta, it can be reformatted by blastall/rapsearch/diamond right? I am fan of Rapsearch. But I wasn't be able to figure out how to build taxdump. So tonight I will try to download the master fasta then try rapsearch.
from hgtector.
Hello Fang,
I just noticed that you are emailing thru the Github, which may not allow
attachment. May I contact you via direct email? Mine is [email protected].
You are right. Once you get the master fasta file, you can reformat it into
Rapsearch format.
Best,
Qiyun
On Thu, Jan 14, 2016 at 2:04 AM, yoyohashao [email protected]
wrote:
Thanks for the reply Qiyun. Can you re-upload the new searcher.pl? I
didn't see it in the attachment. Also, if I download a master fasta, it can
be reformatted by blastall/rapsearch/diamond right? I am fan of Rapsearch.
But I wasn't be able to figure out how to build taxdump. So tonight I will
try to download the master fasta then try rapsearch.—
Reply to this email directly or view it on GitHub
https://github.com/DittmarLab/HGTector/issues/1#issuecomment-171593242.
from hgtector.
Related Issues (20)
- HGT detetction in fish HOT 1
- Error while building database with custom GenBank IDs list HOT 1
- plant genomes making database error HOT 9
- Why not add the parameter for threads used for GridSearchCV? HOT 1
- database --default parameters
- ImportError: dlopen: cannot load any more object with static TLS
- ValueError: Invalid bandwidth:0.3.
- High number of HGTs for one species compared to other species
- Can't download pre-compiled database from dropbox HOT 5
- TaxID 426428 is not found in taxonomy database HOT 1
- about database build! HOT 7
- Database issue HOT 16
- Use precompied blast databases HOT 1
- ValueError: Invalid BLAST database: hgtdb/blast/db. HOT 7
- Problempre-compiled database from dropbox HOT 2
- Database creation died with pandas error HOT 5
- Database error HOT 5
- DIAMOND failed with error code 139 HOT 9
- Cannot download data from NCBI refseq HOT 3
- Build Reference Database Issue: EOFError HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hgtector.