pepkit / geopephub
Automatic uploader of GEO metadata to PEPhub.
Home Page: https://pephub.databio.org/geo
License: BSD 2-Clause "Simplified" License
pepdbagent now uses the new SQLAlchemy (version >=2.0.0), while sqlmodel still depends on the old SQLAlchemy (<=1.4.0). This leads to dependency conflicts, and the uploader does not work.
Two solutions:
When projects were uploaded to PEPhub from GEO, the upload and update dates were set to the date of upload to PEPhub. Change them to the GEO dates.
Add a .env file that metageo_pephub will use to run commands.
We need to add a new feature that performs a routine database dump. This is doable, but it is not currently set up. The good thing is that, since we automatically index and parse GEO daily as new records are added, you can always come back later and get a new dump to rerun on a more up-to-date version.
Tasks before first stable version:
Currently, there is no way to check or upload GSE projects for a specified start and end date. Add a function that receives a start period and an end period and 1) checks whether this period is already in the queue, 2) adds it to the queue, and 3) adds the projects to PEPhub.
The GitHub Action fails while downloading GSE178610:
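The three steps requested here can be sketched as follows. This is only an illustration under stated assumptions: `PeriodQueue`, `process_period`, and the `upload` callback are hypothetical names, and the queue is kept in memory, whereas the real queue presumably lives in the database.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Callable, List, Tuple

Period = Tuple[date, date]


@dataclass
class PeriodQueue:
    """Hypothetical in-memory stand-in for the upload queue."""

    periods: List[Period] = field(default_factory=list)

    def contains(self, start: date, end: date) -> bool:
        # Step 1: check whether this exact period is already queued.
        return (start, end) in self.periods

    def add(self, start: date, end: date) -> bool:
        # Step 2: add the period; return False if it was already queued.
        if start > end:
            raise ValueError("start date must not be after end date")
        if self.contains(start, end):
            return False
        self.periods.append((start, end))
        return True


def process_period(
    queue: PeriodQueue,
    start: date,
    end: date,
    upload: Callable[[date, date], None],
) -> bool:
    """Queue the period and upload its projects; skip periods seen before."""
    if not queue.add(start, end):
        return False
    # Step 3: `upload` is a placeholder for the code that fetches the GSE
    # projects of this period and adds them to PEPhub.
    upload(start, end)
    return True
```

Passing the upload step as a callback keeps the queue logic testable without a database or network access.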
Trying GSE178610 (not a file) as accession...
Skipped 0 accessions. Starting now.
Processing accession 1 of 1: 'GSE178610'
/home/runner/work/_temp/4dfe2662-0b6d-4b4d-84ff-aad23b8dfa34.sh: line 1: 1810 Killed python geo_pipeline_script.py --namespace geo_recent --host *** --db *** --user *** -***
Error: Process completed with exit code 137.
I am not sure why this error is happening. My guess is that geofetch is creating too large an annotation dict or list.
Do you have any ideas on how to handle this error and, if it occurs, how to skip this accession? @nleroy917 @nsheff
Currently, many files fail to upload. We have to save these logs and upload those files manually later. We should add this step to our script.
Add functionality that handles a closed connection.
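Exit code 137 means the process received SIGKILL (128 + 9). On CI runners this is often the out-of-memory killer, though the log above does not confirm that. A kill of this kind cannot be caught with `try`/`except` inside the same process, so one possible way to skip such an accession is to run each accession in its own child process and inspect the return code. The sketch below assumes a hypothetical `build_cmd` function that returns the command line for one accession; it is not the existing pipeline code.

```python
import subprocess
from typing import Callable, List, Sequence, Tuple


def process_accessions(
    accessions: Sequence[str],
    build_cmd: Callable[[str], List[str]],
) -> List[Tuple[str, int]]:
    """Run one child process per accession and collect the failures.

    Returns (accession, return_code) pairs for every child that did not
    exit with code 0. On POSIX, a child killed by SIGKILL is reported by
    subprocess with return code -9 (a shell would show it as 137).
    """
    failed: List[Tuple[str, int]] = []
    for accession in accessions:
        # `build_cmd` is a hypothetical helper supplied by the caller.
        completed = subprocess.run(build_cmd(accession))
        if completed.returncode != 0:
            # The parent survives, records the failure, and moves on.
            failed.append((accession, completed.returncode))
    return failed
```

The cost of this design is one interpreter start per accession; the benefit is that a killed child only loses that one accession instead of the whole run.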
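One way to make the manual step repeatable is to append every failure to a log file and read the failed accessions back for a later retry run. The sketch below is a minimal illustration, assuming a JSON Lines file; the function names and the file layout are hypothetical, not part of the existing script.

```python
import json
from pathlib import Path
from typing import List, Union

PathLike = Union[str, Path]


def record_failure(log_path: PathLike, accession: str, message: str) -> None:
    """Append one failed accession and its error message to the log."""
    entry = {"accession": accession, "error": message}
    with Path(log_path).open("a", encoding="utf-8") as handle:
        handle.write(json.dumps(entry) + "\n")


def load_failed_accessions(log_path: PathLike) -> List[str]:
    """Return the logged accessions in first-seen order, without duplicates."""
    path = Path(log_path)
    if not path.exists():
        return []
    accessions: List[str] = []
    with path.open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            accession = json.loads(line)["accession"]
            if accession not in accessions:
                accessions.append(accession)
    return accessions
```

A retry job could then call `load_failed_accessions` and feed the result back into the normal upload path.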
Errors:
Error whiled downloading: GSE155854
Error message: connection already closed
https://github.com/pepkit/geo_pephub/actions/runs/3427197775/jobs/5709905888#step:5:2228
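A common way to handle a closed connection is to reconnect and retry the failed operation a limited number of times. The sketch below shows that pattern in a driver-independent form. The `connect` and `operation` callables are hypothetical placeholders, and matching on the text "connection already closed" is an assumption based on the error message above; the source does not say which database driver raises it.

```python
import time
from typing import Any, Callable, TypeVar

T = TypeVar("T")


def _looks_closed(exc: BaseException) -> bool:
    # Assumption: the driver reports a dropped connection with this text.
    return "connection already closed" in str(exc)


def with_reconnect(
    operation: Callable[[Any], T],
    connect: Callable[[], Any],
    retries: int = 3,
    delay: float = 0.0,
    is_closed_error: Callable[[BaseException], bool] = _looks_closed,
) -> T:
    """Run `operation(conn)`, reconnecting when the connection was closed."""
    if retries < 0:
        raise ValueError("retries must be >= 0")
    conn = connect()
    attempt = 0
    while True:
        try:
            return operation(conn)
        except Exception as exc:
            # Re-raise unrelated errors, or give up once retries are used.
            if attempt >= retries or not is_closed_error(exc):
                raise
            attempt += 1
            time.sleep(delay)
            conn = connect()  # open a fresh connection before retrying
```

Errors that do not look like a closed connection are re-raised immediately, so genuine data problems are not hidden by the retry loop.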
Fetch metadata from GEO with bed files and run this process every day, or week.