Comments (2)
Hi! Thanks for using FeGenie. Sorry for the delayed response.
With regard to the FeGenie-heatmap-data.csv output file, the values in that file indicate the number of genes (from each iron category) identified in each genome, normalized to the number of predicted ORFs in each genome. This number is then multiplied by the inflation factor, which is, by default, 1000. You can change this by setting (for example) -inflation 100, and this will essentially turn the values into percentages. Does that make sense?
We chose 1000 as the default inflation factor because we found that, especially in large metagenome assemblies, the number of genes for each iron gene category, divided by the total number of ORFs in each metagenome, results in very small numbers. So, multiplying by 1000 should make it easier to read.
I just added an additional option to FeGenie that will allow you to forgo normalization (-norm n), and create a FeGenie-heatmap-data.csv with the raw gene counts for each iron gene category.
Let me know if any of this doesn't make sense or if you have any other questions or issues!
Arkadiy
from fegenie.
Thank you so much for taking your time out to give a detailed explanation. I'm very clear now!
from fegenie.
Related Issues (20)
- Error when cluster ORFs HOT 1
- relative abundace? HOT 1
- Issue with Installation HOT 4
- Unable to run FeGenie.py HOT 2
- Tagged release and license file HOT 17
- Regarding --all_results option HOT 2
- Error "hmmsearch: not found ... local variable 'hmmout' referenced before assignment" while running FeGenie HOT 5
- DIAMOND Verification: local variable 'idxDict' referenced before assignment HOT 5
- Why the FeGenie installed by conda and the FeGenie installed manually work very differently? HOT 8
- Error - ValueError: could not convert string to float: 'EMPTY' HOT 38
- Heatmap has same values regardless of what inflation factor I use HOT 2
- Error in hclust(d = dist(x = fegenie.scaled)) HOT 1
- false negatives? HOT 3
- Permission denied when moving files HOT 6
- Rewrite using a workflow manager (snakemake, nextflow)
- geneSummary file HOT 3
- Renamed contigs? HOT 4
- error depth = open("%s/%s.depth" % (outDirectory, cell)) | HELP PLS. HOT 1
- About iron-sulfur proteins HOT 1
- Use of ORFs HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fegenie.