Comments (5)
Yes, all sequences should be in the same file and should have unique IDs: alignments are reported with respect to the IDs in FASTA files. SibeliaZ is oblivious to the fact that sequences may come from different genomes and will try to find alignments within each sequence as well as between sequences. So far a contig is treated like a chromosome.
Thanks for your questions, I will update the README to reflect these features, it is quite important.
from sibeliaz.
Thanks for the explanation. To generate an output similar to cactus (after conversion to hal) would I need to parse the MAF file to remove self alignments? For example, by adding a species ID to fasta headers?
Cheers,
Tom.
from sibeliaz.
Yes, you will have to do it manually.
from sibeliaz.
Hi,
Sorry if this is a daft question, but I am struggling to make the sibeliaz maf file compatible with downstream tools. msa_view from phast and also the hal tools expect the maf file to be organised relative to a reference species (set by the first id in the alignment with the format "species.chromosome"). I think this is a standard convention for maf?
Is it possible to make sibeliaz output alignments relative to a reference species or failing that, do you know any tools that can read in the sibelia output and set a reference species?
I am testing on 3 species with ~400 mb genomes, with one species assembled into chromosomes and the others with thousands of contigs.
Thanks,
Tom.
from sibeliaz.
Can you show an example of a MAF alignment organised relative to a reference so I can adjust output of SibeliaZ?
from sibeliaz.
Related Issues (20)
- bash: line 11, 12 and 13: No such file or directory HOT 9
- free -w erro
- How would it deal with phased genomes?
- SibeliaZ/spoa/vendor/bioparser/include/bioparser/parser.hpp:124:19: error: 'gzFile_s' was not declared in this scope
- Installation issues HOT 2
- please include ##sequence-region pragmas in blocks_coords output HOT 4
- more detailed description for output HOT 1
- Conda installation doesn't include maf_to_gfa1.py HOT 2
- support gzip'd input fasta HOT 1
- is it suitable for the very big genome? HOT 2
- Apply the tool to virus HOT 4
- Q on coverage, multi-genome alignment, and maf2synteny HOT 4
- (support oneTBB) fatal error: tbb/mutex.h: No such file or directory HOT 3
- Question about block_coords.gff output
- Unexpected output HOT 1
- PackagesNotFoundError: The following packages are not available from current channels: - sibeliaz HOT 4
- sibeliaZ outputs HOT 2
- # running issue
- Getting alignments after running with "-n"
- question about LCB determination
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sibeliaz.