Giter Site home page Giter Site logo

automated_annotation's People

Contributors

almasaeed2010 avatar

Watchers

 avatar  avatar  avatar  avatar

automated_annotation's Issues

Blast databases need to be limited

We recently got an emailed report from this module stating that organisms that don't have AGL or DB:swissprot:display are a problem. I think this is happening because we didn't clear the cache since we fixes this issue statonlab/hardwoods_site#505

I'll clear the cache and try again.

Send data to Staton server

The generated fasta files can go to the Staton server and we can remotely execute IPS/diamond

  • look into auto ssh commands

Split file by number of organisms

We currently have a problem where the output FASTA file contains way too many features. Annotating these features take a very long time. So far, for our HWG site, it has taken over a month!

The agreed upon move is to allow the admin to limit the number organsims that the fasta files contain. Any method to pick the top N is fine (ie order by organism_id should work).

I am not getting emails every month

We are supposed to get an email report every month detailing if there are organisms with missing annotations. Although the settings form has been configured correctly, I am still not getting an email.

My cron entry:

0 2 1 * * cd /var/www/html & drush annotations-check;

multiple analyses vs recycling analysis over time

in #7 it came up that this module recycles the same analysis over and over for annotations.

It does this for practical reasons: its a challenge to "clean up" after the old annotation set and to ensure we dont end up with multiple annotations.

That said I posit that its more correct to create multiple analyses. Each analysis refers to a specific run of a program: thats why the date run column is a primary column, and why in #7 we can't tell the last time the feature was annotated.

Creating multiple analyses would mean archiving the old analysis and deleting the corresponding annotations it loaded, and doing it in a way that doesnt mean we've lost the annotations for end users, which is a real challenge when we're taking about feature_cvterms. its pretty striaghtforward i think for feature_analysis annotations.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.