Comments (8)
The convention is to call samtools index
with the bam filename as the only argument:
samtools index SRR1055723.hisat2.sorted.bam
This will create a file SRR1055723.hisat2.sorted.bam.bai
. SplAdder will automatically use this index when called with SRR1055723.hisat2.sorted.bam
.
from spladder.
Hi, Thanks for trying out SplAdder. I made annotation parsing a bit more robust and pushed a fix to the development branch. Let me know whether this works for you.
Cheers, Andre
from spladder.
Hi, Andre. Thanks for your response. The annotation parsing problem is solved although I got another error message as below.
$ python /Users/jhhuang/IIS_jhh/GitHub/spladder/python/spladder.py -a Drosophila_melanogaster/ensembl/Drosophila_melanogaster.BDGP6.84.chr.gff3 -b HISAT2_Stringtie_Ballgown/SRR1055723.hisat2.sorted.bam -o spladder_AS/ -T n
Augmenting splice graphs.
Generating splice graph ...
...done.
Loading introns from file ...
Traceback (most recent call last):
File "/Users/jhhuang/IIS_jhh/GitHub/spladder/python/spladder.py", line 323, in
spladder()
File "/Users/jhhuang/IIS_jhh/GitHub/spladder/python/spladder.py", line 224, in spladder
spladder_core(CFG)
File "/Users/jhhuang/IIS_jhh/GitHub/spladder/python/modules/core/spladdercore.py", line 21, in spladder_core
genes = gen_graphs(genes, CFG)
File "/Users/jhhuang/IIS_jhh/GitHub/spladder/python/modules/core/gen_graphs.py", line 83, in gen_graphs
introns = get_intron_list(genes, CFG)
File "/Users/jhhuang/IIS_jhh/GitHub/spladder/python/modules/reads.py", line 431, in get_intron_list
[intron_list_tmp] = add_reads_from_bam(gg, CFG['bam_fnames'], ['intron_list'], CFG['read_filter'], CFG['var_aware'], CFG['primary_only'], CFG['ignore_mismatch_tag'])
File "/Users/jhhuang/IIS_jhh/GitHub/spladder/python/modules/reads.py", line 155, in add_reads_from_bam
(introns, spliced_coverage) = get_all_data(blocks[b], filenames, mapped=False, filter=filter, var_aware=var_aware, primary_only=primary_only, no_mm=no_mm)
File "/Users/jhhuang/IIS_jhh/GitHub/spladder/python/modules/reads.py", line 314, in get_all_data
(coverage_tmp, introns_tmp) = get_reads(fname, contig_name, block.start, block.stop, strand, filter, mapped, spliced, var_aware, collapse, primary_only, no_mm)
File "/Users/jhhuang/IIS_jhh/GitHub/spladder/python/modules/reads.py", line 40, in get_reads
for read in infile.fetch(chr_name, start, stop, until_eof=True):
File "pysam/calignmentfile.pyx", line 878, in pysam.calignmentfile.AlignmentFile.fetch (pysam/calignmentfile.c:11079)
File "pysam/calignmentfile.pyx", line 1660, in pysam.calignmentfile.IteratorRowRegion.init (pysam/calignmentfile.c:18725)
ValueError: no index available for iteration
from spladder.
Are your BAM files indexed?
from spladder.
Cool! I generated the index file with samtools and Spladder works! Thank you very much.
from spladder.
Hi, Thanks for trying out SplAdder. I made annotation parsing a bit more robust and pushed a fix to the development branch. Let me know whether this works for you.
Cheers, Andre
Hello,
I am trying to use SplAdder and I have the same error. I tried replacing the files gene.py, spladder.py, init.py and settings.py as suggested above but it didn't solve it.
Traceback (most recent call last):
File "./bin/spladder/python/spladder.py", line 323, in
spladder()
File "./bin/spladder/python/spladder.py", line 145, in spladder
(genes, CFG) = init.init_genes_gff3(CFG['anno_fname'], CFG, CFG['anno_fname'] + '.pickle')
File "/data1/paola/bin/spladder/python/modules/init.py", line 262, in init_genes_gff3
t_idx = genes[gene_id].transcripts.index(trans_id)
KeyError: 'rna13'
I am using the annotation file GCF_000001735.4_TAIR10.1_genomic.gff from ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/001/735/GCF_000001735.4_TAIR10.1
Any help will be appreciated
from spladder.
Dear CarinaCornejo,
It looks like SplAdder has difficulties with the category primary_transcript
in your file. I will have a look, but will only be able to fix it early next week. A possible workaround for you might be to just remove all the lines containing primary_transcript
, e.g. like so:
grep -v primary_transcript GCF_000001735.4_TAIR10.1_genomic.gff > GCF_000001735.4_TAIR10.1_genomic.filtered.gff
Cheers,
Andre
from spladder.
Dear Andre,
thanks for the suggestion but I am getting the same error with the filtered file.
from spladder.
Related Issues (20)
- Testing mode fails despite using the default merging strategy HOT 1
- Spladder accuracy for samples with high multimappers HOT 2
- is_annotated column interpretation in splAdder versions 3.x HOT 2
- llvmlite.binding' has no attribute 'get_host_cpu_name HOT 3
- at which stage can i select the confidence threshold?
- Index Error during build step HOT 1
- ValueError: NaN, inf or invalid value detected in endog, estimation infeasible HOT 2
- viz mode issue HOT 1
- build mode issue HOT 4
- Test Mode ValueError HOT 1
- Use on large cohorts HOT 6
- TypeError: object of type 'Gene' has no len() HOT 2
- TypeError: expected str, bytes or os.PathLike object, not coo_matrix HOT 4
- RuntimeWarning: invalid value encountered in subtract HOT 2
- qmode collect step for large cohorts HOT 4
- test run mode - TypeError: Indexing elements must be in increasing order HOT 4
- About Image output HOT 2
- gff on gtf HOT 1
- viz issue
- Naming of skipped exons in mutually exclusive events is based on exon length not genomic coordinates
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from spladder.