Giter Site home page Giter Site logo

bifrost's People

Contributors

azure-pipelines[bot] avatar githubz0r avatar kimleeng avatar kriskiil avatar martinbaste avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

bifrost's Issues

Add the functionality of 'tags' into bifrost

We want to be able to create a new item in samples called 'tags' which can be a multifunctional filtering tool. Tags would be saved as a list of strings. Membership in a tag can be used both for pipelines (components) in it's requirement check as well as filtering on the report side (ie only show samples belonging to the tag with this project name. Tags could also be generated from components, for example the QC stamper could assign a tag onto the sample stating it's QC is good.

This item is expected to generate multiple work items.

Change data plots for QC to show a pregenerated density plot

Data plots are currently pulling information on many more samples than need be. To improve performance we can generate a density plot for each value based off of our existing data and load those in and then we only need to pull samples from the run, I believe we're currently pulling a set number of samples

Fix setup to install indexes for DB and species install.

Make it so the setup folder (being changed to mongoDB_setup) installs the DB Indexes.

Right now the species DB entry is also located there. Thinking that this should be installed via the component into the DB and not by the setup.

Add a interface for managing species and there mappings

Manage species via the gui instead of direct db adjustments. Now the only thing in the species DB should be a internal species name and series of names that can map to refer to it (i.e. S. aureus -> Staphylococcus Aureus)

Update tests with new bifrostlib functionality

So right now Im working on dockerizing and automated testing for min_read_check and I'm trying to figure out how to prepopulate the system for analysis and in my mind I should be using the run_launcher component but can't do docker in a docker easily (perhaps a docker-compose solution is the right way?). I think what I want to do down the road is have bifrost lib updated so that the requests going back and forth for testing are to the api calls or the library that'll process the apis.

Update documentation to reflect schema2.0

Documentation was out of date so update it all, and create a power point while I'm at it to have a master set of slides for presentations. Potentially write up a paper for bioRxiv to push as well.

reorder conda environment channels

I suspect the bug cited in the install script is due to the conda channel order.

#Currently a bug with samtools installation, need to redo it from this source

channels:
- defaults
- bioconda
- conda-forge

I encountered a library issue before and it was because bioconda defers non-bio dependencies to the conda-forge channel. They depend on the following channel order:

channels:
  - bioconda
  - conda-forge
  - defaults

https://bioconda.github.io/user/install.html#set-up-channels

Update bifrostlib for samplecomponents to reflect schema2.0 and beyond, right now its more hardcoded for snakemake purposes

Bifrostlib is part way through updating. Was thinking that each main object needs a class and that it should be updated accordingly. Also when the schema validation goes in ideally the entry can be checked against multiple schemas which may mean 2 versions of the same function. Sample, Category, Run, are mostly done converting (to the current form but needs json validation) while things like Components, SampleComponents need to be redone.

Ability to create a collection of runs through the UI

Through the UI we want to be able to create a list of samples then group them into a collection that can be worked with. This will create a "run" object for them and can be loaded for the user through the GUI interface.

Generate requirements for a validation set

Want to create a validation set which can be tested for new components or lab changes. Ideally samples representative of what we do at SSI and that we can periodically run on sequencing.

Add species interface for managing species and alias's

Right now species are required to have a true term which is stored in the database. A table can be also provided for lookup names to match to these terms. Ensure there's an interface for managing this but remember to keep in mind that components are bound to the true terms.

Duplicate of #45

Establish test data set which can be attached via mongo atlas

Looking to set up easier testing with a small data set including localized development but sharable DB. Figured best bet for this is with mongo atlas so trying that out and getting it working. Also made a dataset available on ENA (PRJEB39131) to run this with randomized S. aureus and E. coli

Look into showing contigs/coverage in a more intuitive way

The contigs in the QC report are represented by numbers currently. I think there's a smart way to show this more as images. With contig lengths sorted by size and heigh to show coverage, coloring could also be done to show species for the contig. This could visually show contamination more clearly as well as plasmids or pcr products (use a log scale for coverage)

Run components on samples to fill out DB when server isn't busy

The idea here is to query our local server and see how big the queue is (might not even need to do this) and submit jobs that can fill out the queue. These jobs should be generated automatically by the system. For example a api request (or query) of samples that have not run the latest components on them. Then this list is submitted to the server when it's not busy to automatically update runs.

Adjust components to have a unique name based on name, version, db_date

Adjust components to have a unique name based on name, version, db_date which would replace in the class section of objects any references to the _id. Part of this is so if the component is installed at two different institutes they're treated as the same and not as unique due to different _id's

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.