Comments (12)
Hi @matentzn
I checked the latest indexer run and these seem to be the OBO ontologies we still have a problem with:
from ols4.
I manually checked most of these. I personally would suggest to restrict OLS to only active ontologies in OBO:
All of these ones you listed here (or most of them, didn't check all) are obsolete, or inactive. OBO Foundry does not recommend the use of non-active ontologies (i.e. they are hidden on https://obofoundry.org/)
from ols4.
@udp, can you confirm that RO is not having issues anymore, please?
from ols4.
For that 6.5 GB JSON file, json2csv took 3 minutes and generated 765 MB of CSV
This seems a suspiciously large difference, but I tried gzipping them to see how much ACTUAL data there was and not just repetition:
- The 6.5 GB JSON file compressed to 322 MB.
- The 765 MB of CSV compressed to 285 MB
Those numbers are firmly in the same ballpark so I think no data has been lost, so all of obo foundry is actually pretty tiny depending on how you represent it.
I also tried gzipping ALL of the OLS “downloads” folder from noah, so that’s all the OWL files from OBO and OLS’s ontologies, which also includes lots of obsolete stuff I didn’t index above. That compressed to 886 MB. So all of the data in OLS is actually only 886 MB when compressed!
from ols4.
For
doid
cto
cvdo
mfmo
ons
ro
upheno
mamo
vario
can you list the import URLs that are not rdfxml? I may be able to fix these with a bit of a sledge hammer.
ogi OBOFoundry/OBOFoundry.github.io#1942
ero (inactive on obo, URL of ontology redirects to website) OBOFoundry/OBOFoundry.github.io#1942
rnao Resolves: http://purl.obolibrary.org/obo/rnao.owl
from ols4.
@matentzn they were manually checked by me too to make the table. I didn't realise they were obsolete/inactive. However they will be completely absent (= 404) from OLS when we ship OLS4 if we do not load them. Will this an issue?
from ols4.
In general I am happy with not loading inactive ontologies. However, even if an ontology is inactive, it can still be used and we can not drop its availability - particularly when there seems to be no other alternative. I think MAMO is a good example of this and is used in EBI by the BioModels team.
A way around this is to not load inactive OBO ontologies. In a case like MAMO we can add it to the EBI OLS config with the URL pointing to the file system.
from ols4.
@henrietteharmse I think your suggestion is the way to go.
Maybe be a bit more conservative for now and only exclude obsolete ontologies from OBO to start with. If you supply me with a list @udp with the remaining (non obsolete, breaking ones), I can maybe reach out to the groups and use OLS inclusions to up their game a bit and fix their ontology.
from ols4.
@matentzn We currently have an issue with RO. Though the core file is RDF/XML:
https://raw.githubusercontent.com/oborel/obo-relations/master/ro.owl
it imports this file: https://raw.githubusercontent.com/oborel/obo-relations/master/chemical.owl which is in functional syntax.
Issue opened here: oborel/obo-relations#673
from ols4.
This will be solved soon by @anitacaron, the solution is already there, we just need time to review and implement it.
from ols4.
For the Human Disease Ontology (doid
), would loading the doid-merged.owl file (http://purl.obolibrary.org/obo/doid/doid-merged.owl), which has all imports loaded in, fix this issue?
@lschriml, fyi.
from ols4.
The doid file that isn't RDF/XML was our ext.owl file (in OFN). We recently switched it to RDF/XML because other people were experiencing parsing issues (DiseaseOntology/HumanDiseaseOntology#1112).
from ols4.
Related Issues (20)
- SRAO now has a new stable url / purl HOT 1
- Filter out results from Snomed?
- Tree not visible for chebi term HOT 2
- OLS4 terms resource returns 404 for specific term HOT 2
- Search with a dbxref does not work as in OLS3 HOT 4
- OLS4 does not provice baseUris for GO HOT 1
- Update NCIT HOT 1
- Provide documentation on API responses
- Pagination not working for search HOT 3
- Return annotations in search endpoint via `fieldList` HOT 3
- Fillers missing on qualified restrictions
- OLS4 case sensitivity HOT 1
- In some cases SubClass relations are duplicated
- Incorrect forward to OLS4 from https://www.ebi.ac.uk/ols/index HOT 1
- Add support for requests of the form https://www.ebi.ac.uk/ols4/?termId=
- Urgent: 'start' parameter returned incorrect HOT 1
- Links to Installation and API Docs return 404 when using an old OLS link (redirect issue?) HOT 2
- Subset/in_subset is null for VBO terms HOT 2
- EFO out of date HOT 1
- terms?short_form=... not working for obsolete terms HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ols4.