Comments (6)
update: I see others on this forum have experienced similar errors and everything traces back to kerneltree. I have installed this now as a local module (rather than relying on the local HPC install) and the correction step now works correctly on both the test dataset and my own.
I'm going to finish going through the rest of the FLAIR pipeline and if everything looks good then I will close this thread.
Thanks for your help and for making this great tool!
from flair.
Yes, FLAIR would consider two isoforms distinct as long as there is a splice site, transcription start site, or transcription end site that is different. There should not be any duplicate isoforms after the flair-collapse step.
You may want to note that isoforms that contain the exact same splice junctions and differ only by the TSS/TES will have the same isoform base name with the addition of a number at the end to give them unique names (e.g. ENST00000323963.9, ENST00000323963.9-1, ENST00000323963.9-2...). You can control the TSS/TES for a particular splice junction chain that FLAIR finds using --max_ends
, -w
, -i
, and/or -n
arguments for flair-collapse. Hope this helps.
-Alison
from flair.
from flair.
And just as an update. I have the same problem using the test datasets mentioned on the Github page. The problem persists regardless of whether I supply a gtf or junctions file
from flair.
Hi @dandepledge,
Since you are using such a small dataset, the messages for your "missing" steps are probably just taking so little time that you don't actually see them come up in your terminal window.
As for the hanging, I would double check that all of the requirements for correct are installed. This includes kerneltree.
Last, i'm trying to hunt down the annotation file for the organism you are using. Are you using NC_001348 reference sequence? If so, is there a publicly available annotation you are able to link or share? Thanks.
Thanks~
-CMS
from flair.
Hi - you are absolutely correct about the missing messages. I can see them when I run larger datasets or if I pipe the processing information to an outfile.
I have resolved the hanging issue by installing the latest version of flair but now I am getting a different error...
File "/gpfs/home/depled01/flair/bin/ssCorrect.py", line 416, in
main()
File "/gpfs/home/depled01/flair/bin/ssCorrect.py", line 398, in main
with open(os.path.join(tempDir, "%s_inconsistent.bed" % chrom),'rb') as fd:
IOError: [Errno 2] No such file or directory: '/gpfs/scratch/depled01/FLAIR/test/tmp_7d39fe4a-163f-4978-af31-122cb4ca2eb4/chrX_inconsistent.bed'
Correction command did not exit with success status
This error is generated when running the correct command on the test dataset but I also get the same error when I run on my own dataset.
To answer your other question - you have the correct reference sequence but the only public annotation is the one provided on genbank and this is missing lots of important detail (such as splicing) which is why I am trying to use FLAIR (transcript isoform discovery and then quantification)
from flair.
Related Issues (20)
- accept BED9 as well as BED6 for splice junctions
- can't keep intermediate files when calling correct from flair HOT 1
- flair collapse creates invalid isoforms names that cause GTF conversion to fail
- `'run_id' is not defined` error while running `collapse-range` HOT 2
- diffsplice_fishers_exact generates bogus error if input file is missing HOT 1
- diff_iso_usage does not include header in output files
- merging of transcript ids and gene ids is frustrating to use HOT 2
- diff_iso_usage documentation does not clear describe what it does
- The results of my test_output are not consisitent with the oferred test_expected. Why? HOT 1
- flair correct crash with AttributeError: 'bool' object has no attribute 'find_overlap' HOT 1
- ssUtils.py gtfToSSBed crashes if gtf doesn't have transcript_id HOT 1
- flair correct doesn't exit non-zero when an invalid argument is supplied HOT 3
- Improve documentation of algorithm is readthedocs
- Make FLAIR source & packaging structure not weird
- Translate predictProductivity output to protein HOT 4
- Transcript read number
- Question regarding sequencing depth and collapse step HOT 1
- Issues implementing FLAIR2 HOT 1
- Unexpected entries in gtf file after flair collapse HOT 1
- FLAIR could improve alignments by passing minimap2 splice junctions HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from flair.