Giter Site home page Giter Site logo

raimondilab / precogx Goto Github PK

View Code? Open in Web Editor NEW
6.0 2.0 2.0 2.3 GB

A predictor of GPCR couplings with G-proteins/B-arrs using Transformers

Home Page: https://precogx.bioinfolab.sns.it/

License: GNU General Public License v3.0

Python 2.75% HTML 1.22% JavaScript 86.32% CSS 9.11% R 0.50% SCSS 0.06% GLSL 0.03% Shell 0.02%
bioinformatics transformers gpcr machine-learning logistic-regression esm protein-protein-interaction flask artficial-intelligence datatables

precogx's People

Contributors

actions-user avatar gurdeep330 avatar mr-fcharles avatar nataliarosa9 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

precogx's Issues

Input check

Hi everybody @raimondifranc @nataliarosa9 @maticmarin

I remember that @nataliarosa9 had shown us in the previous call input checks that she created (eg: only alphabets in FASTA etc). So here I am making a list of other checks that we have to make in the input.

We all can edit (add more items) this checkbox-list and request @nataliarosa9 to include them also:

  • Input accepted only in FASTA formatted or precog format (eg: DRD1/F61P)
  • If the input is FASTA, it cannot be more than 1024 residues long (correct @maticmarin?)
  • Use cannot input more than 15 sequences in one go (counting both WT and MUT), right, @raimondifranc?

Features

  • Add PCA panel
  • Dropdown option in PCA panel to select G-protein/labels
  • Update precogx script to include point mutations
  • Update precogx webApp to include point mutations
  • Automate GPCR seq SQL DB
  • First GPCR automate on page load
  • Show Bonds
  • Add result URL

Run PRECOGx on all GPCRs and store info in SQL DB

Hi Gurdeep,

yes, I think that 25 sequences is already pretty generous .... we should
also provide a refreshing page with the URL of the output, and possibly the
option to provide an email, in case the calculations will take a while.
We should probably also run the prediction for all the GPCR sequences at
least of the human GPCRome (and possibly from swissprot) and store this
information in a database (SQLite or MySQL), so to speed up the analysis
in those cases where a certain sequence was already analyzed or for the
mutation analysis case (the WT embeddings predictions should already be
there).

Talk to you on Thursday.
Best,
Francesco

On Tue, Dec 7, 2021 at 11:52 AM Gurdeep Singh @.***>
wrote:

Hi everybody @raimondifranc https://github.com/raimondifranc
@nataliarosa9 https://github.com/nataliarosa9 @maticmarin
https://github.com/maticmarin

I remember that @nataliarosa9 https://github.com/nataliarosa9 had shown
us in the previous call input checks that she created (eg: only alphabets
in FASTA etc). So here I am making a list of other checks that we have to
make in the input.

We all can edit (add more items) this checkbox-list and request
@nataliarosa9 https://github.com/nataliarosa9 to include them also:

โ€”
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
#14, or unsubscribe
https://github.com/notifications/unsubscribe-auth/ACHY44PHFU6E6PYBPKZSHWTUPXRPLANCNFSM5JQ5ZCJQ
.
Triage notifications on the go with GitHub Mobile for iOS
https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675
or Android
https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

Originally posted by @raimondifranc in #14 (comment)

Structure panel features 07.12.2021

  • Radius proportional to number of contacts the positions makes
  • Change bond colors to red/blue to highlight enrichment/depletion
    - [ ] Use SIFT's XML files to map from UniProt to PDB residues
  • Include B-arr structures as well

Slider range

Hi @nataliarosa9 , could I ask you to please fix the aesthetics of the slider? I am bad at HTML/CSS stuff

  • Add HTML/CSS to the slider
  • Insert label "Log-Odds score >"
  • Move it to the right-end of the panel

Thanks

GPCR embeddings

/data/Users/marin/transformers/gpcr
This is the 33rd layer

PCA values for all GPCRs

Hi @maticmarin, I think you will have to re-run the train script in my directory to save the PCA matrix of all the GPCRs (at least class A) against all the best models (G-proteins). Please note that I removed > from your extract.sh as I am already saving the NumPy format.

Thanks!

PCA panel 07.12.2021

  • Dropdown to show IUPHAR/Shedding/ebBRET-based coloring
  • Dropdown to show either all 33 layers or the best one (G-protein specific)
  • Always show GPCRome in the background (use grey or reduce opacity if the given GPCR is not in the selected label)
  • Show also WT on PCA when showing for MUT
  • Show GPCR name on hover
  • Disable IUPHAR for some G-proteins like GoA and Barrs (instead show from STRING)
  • Include B-arr too

Results panel 07.12.2021

  • Include IUPHAR/Shedding/ebBRET assay info also
  • Show check-boxes
  • Show download option
  • Include B-arrs too

Contacts file

Hi @maticmarin, when you update the contacts file specific_position_score.txt, please remember to add Barrs too. Also, could you provide Superfamily numbering? (see here)

Thanks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.