Comments (3)
Then how TPMCalculator identify gene copies? Here my GTF record only the first row's feature is "gene" others are "transcript"
And I can't see anything special of last 2 transcript from attributes.
$ grep "ENSG00000235538" gencode.v35.annotation.gtf | awk -v FS="\t" '$3!="exon" {print $9}'
gene_id "ENSG00000235538.3"; gene_type "lncRNA"; gene_name "AL078602.1"; level 2; tag "ncRNA_host"; havana_gene "OTTHUMG00000195978.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000665613.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-201"; level 2; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000518669.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000671100.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-202"; level 2; tag "basic"; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000522294.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000657614.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-203"; level 2; tag "basic"; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000507375.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000665405.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-204"; level 2; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000505809.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000669147.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-205"; level 2; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000506093.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000657157.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-206"; level 2; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000512883.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000667749.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-207"; level 2; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000506923.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000664207.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-208"; level 2; tag "basic"; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000509534.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000659903.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-209"; level 2; tag "basic"; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000508554.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000666400.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-210"; level 2; tag "basic"; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000521262.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000452944.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-211"; level 2; transcript_support_level "5"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000043020.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000669856.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-212"; level 2; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000512193.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000657138.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-213"; level 2; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000513259.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000659063.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-214"; level 2; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000527573.1";
gene_id "ENSG00000235538.3"; transcript_id "ENST00000654484.1"; gene_type "lncRNA"; gene_name "AL078602.1"; transcript_type "lncRNA"; transcript_name "AL078602.1-215"; level 2; tag "basic"; tag "TAGENE"; havana_gene "OTTHUMG00000195978.1"; havana_transcript "OTTHUMT00000527575.1";
from tpmcalculator.
Hi,
This is the normal way TPMCalculator quantify RNA-Seq abundance on copies for a same gene. If you look at the output, the third column is the starting coordinate of the gene. In your example, each gene copy starts in a different position. TPMCalculator uses #1, #2, #3 ... to identify the copies.
from tpmcalculator.
We identify the copies using the genomic coordinates. If the transcripts of a same gene are in different genomic region and they don't overlap we mark that as a copy of the same gene
from tpmcalculator.
Related Issues (20)
- Gene number in input GTF differs from the TPMCalculator output HOT 3
- symbol lookup error HOT 9
- Sets the name of the output HOT 2
- Add a new option to use a directory as output destination HOT 1
- For the paired end reads, is it recommended to use option -p or should i go with default without it? HOT 1
- Output file description HOT 2
- /usr/bin/ld: cannot find -lbamtools HOT 2
- Compilation error: collect2: error: ld returned 1 exit status HOT 2
- Is the read counting strand-specific? HOT 2
- Key ID for gene name was not found on GTF line HOT 3
- No TPM values, no reads processed HOT 9
- Chromosome with name: ENST.... does not exist HOT 10
- Output files desctiption HOT 1
- Installation without docker HOT 1
- Meaning of "UniqueReads" in genes.out file? HOT 2
- Possible to use gff3 in TPMCalculator v 0.4? HOT 1
- Build problems on Ubuntu and MacOSX HOT 1
- Help me please HOT 1
- After installing version 0.0.4, -version still prints 0.0.3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tpmcalculator.