Comments (10)
The issue seems to be that that key simply say which of the sub-populations had the highest frequency. And then
gnomADg_AF_popmax
specify the frequency for that sub group.
I see, but then let's fix it so it grabs the value only if it's a number! I'll open a PR!
from scout.
nfe as non-finnish europeans? And why does it take the place of a number? I'm going to look to their repo 🤔
from scout.
nfe as non-finnish europeans? And why does it take the place of a number? I'm going to look to their repo 🤔
I think "nfe" refers to a subgroup within gnomad (afr is another I have seen - african). You are probably right - non-finnish europeans. The issue seems to be that that key simply say which of the sub-populations had the highest frequency. And then gnomADg_AF_popmax
specify the frequency for that sub group.
from scout.
I was wondering which values gave error for instance in the demo app, so I've added a warning message for debugging and I saw this:
![image](https://private-user-images.githubusercontent.com/28093618/351239793-aaaa46ac-f5b1-4edf-b0c7-0d0a59d6cd21.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjM3NTAwNTcsIm5iZiI6MTcyMzc0OTc1NywicGF0aCI6Ii8yODA5MzYxOC8zNTEyMzk3OTMtYWFhYTQ2YWMtZjViMS00ZWRmLWIwYzctMGQwYTU5ZDZjZDIxLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA4MTUlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwODE1VDE5MjIzN1omWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTYyNmUwY2EwOTlmY2FjYjc3ZTdmZTQxODE1MDY2MzFmMzc3NGQ2NWEzYzE5MDI2YTlkNDQxOWZiN2JmYTM0YjQmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.M-yEI-cRwh29G9BkBeLDXTrdCEV31kdXCU1kB103t6A)
Perhaps we should try to collect numbers present within parenthesis
from scout.
Perhaps we should try to collect numbers present within parenthesis
Hmm. I guess it depends on where that deleterious(0)
value is produced. Often better to error than ending up with unexpected and unwanted values, if not part of the "standard format". Do you know how your Gnomad values are inserted into the VCF? I don't think we have these values (i.e. deleterious(0)
) in our data.
In our case we (long before I started working at CMD) have prepared the Gnomad data in an indexed file. This is inserted directly on matching variants using VEP's -custom
flag.
1 10108 rs62651026 CAACCCT C 46514.3 RF AF=0.00100908;nhomalt=0;popmax=afr;AF_popmax=0.0035461
1 10109 rs376007522 AACCCT A 89837.3 RF AF=0.0641892;nhomalt=0;popmax=amr;AF_popmax=0.1
from scout.
I think the keys "popmax" and "AF_popmax" are selected by us here. Would be nice to be able to continue using them for a bit longer though.
Soon we will transition to Gnomad 4 where they call things "grpmax" though instead of "popmax". That is probably something you will see / already have seen as well.
from scout.
Do you know how your Gnomad values are inserted into the VCF? I don't think we have these values (i.e.
deleterious(0)
) in our data.
I have no idea, it's very old data and perhaps current cases don't have that format any more. You see those error warnings only with a recent scout version because of this relatively new change in transcript parsing. I'm still not sure it's a bug, but all those error message are annoying for sure..
from scout.
Soon we will transition to Gnomad 4 where they call things "grpmax" though instead of "popmax". That is probably something you will see / already have seen as well.
No problem, we can add the keys!
from scout.
I have no idea, it's very old data and perhaps current cases don't have that format any more. You see those error warnings only with a recent scout version because of this relatively new #4456 in transcript parsing. I'm still not sure it's a bug, but all those error message are annoying for sure..
OK, I'll have to look a bit more that the code to understand what is going on I think.
It seems to me that Scout is picking the wrong value to try getting the max population frequency, which means we end up with no value there. If so, just skipping them would not fix the issue.
I'll check a bit closer. Will be back ...
from scout.
Aha, or I see, it is perhaps just trying to add the group name in addition to the frequencies. Then the change makes sense. I'll test the PR!
from scout.
Related Issues (20)
- HGNC symbols not present in database's genes collection:{'MAP3K20'} HOT 2
- NOt possible to DELETE a comment HOT 7
- Missing OMIM entries on variant page, which are present in hover HOT 21
- dbVar track unreachable? HOT 3
- Implement oauth for igv.js HOT 1
- IGH@ gene symbol is not aliased to the IGH gene when loading variants HOT 1
- Sort selectable IGV tracks alphabetically
- Audit buttons for pages without filters
- Make IGV tracks's items clickable HOT 1
- Nonsensical key/value in demo instance variant HOT 7
- MEI loading is slow in tests
- Height offsets for merged RNA tracks with IGV 3.0.1
- Occasional crashes when testing IGV junction tracks
- IGV RNA junction track variant locus zoom not working
- UpdateChart.js HOT 1
- Add CoLoRSdb and gnomAD keys HOT 9
- Dashboard stats for case tags
- Links to ExAC are no longer valid
- New HPO release 2024-08-13
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from scout.