Water Network-Augmented Two-State Model

ECIFGraph::HM-Holo-Apo Scoring Function

Water network rearrangement during the transition from the ligand-unbound (apo) state to the ligand-bound (holo) state has been shown to significantly affect protein-ligand binding interactions. Despite this, current machine learning-based scoring functions do not account for these effects. Our study addresses this gap by incorporating water network rearrangement information into deep learning-based scoring functions for protein-ligand binding prediction.
Here, we construct a representation that incorporates water network information for both the ligand-unbound and ligand-bound states. Specifically, we integrate extended connectivity interaction features into the graph representation and employ a graph transformer operator to extract features from both states. The resulting water network-augmented two-state model, designated ECIFGraph::HM-Holo-Apo, shows satisfactory performance in the scoring, ranking, docking, and (reverse) screening power tests on the CASF-2016 benchmark, and superior performance in larger-scale docking-based virtual screening tests on the DEKOIS2.0 dataset. Our study indicates that a water network-augmented two-state model can effectively enhance the robustness of machine learning-based scoring functions.

CONTENTS

Usage

0. Setup dependencies
conda env create -f environment-data.yml
conda env create -f environment-train.yml

1. Generate hydration sites in apo and holo states by HydraMap analysis:
sh gen_hydramap.sh apo
sh gen_hydramap.sh holo
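
The two HydraMap runs above can also be driven from Python so that a failure in the apo run stops the pipeline before the holo run. This is a minimal sketch, assuming `gen_hydramap.sh` sits in the current working directory and takes the state name as its only argument, as in the commands above. The function name `run_hydramap` and its `runner` parameter are hypothetical and not part of this repository.

```python
import subprocess

STATES = ("apo", "holo")  # the two states processed in step 1


def run_hydramap(states=STATES, runner=subprocess.run):
    """Run gen_hydramap.sh once per state and return the commands issued.

    `runner` is a hypothetical hook (default: subprocess.run) so the
    command sequence can be inspected without launching HydraMap.
    """
    commands = []
    for state in states:
        cmd = ["sh", "gen_hydramap.sh", state]
        # check=True raises CalledProcessError on a non-zero exit code,
        # so a failed apo run stops before the holo run starts.
        runner(cmd, check=True)
        commands.append(cmd)
    return commands
```

Calling `run_hydramap()` with no arguments executes both shell commands in order; passing a stub as `runner` only records them.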

2. Generate dataset for graph representation:
conda activate data
python convert_data.py --input_list=data/pdb_list/pdb_list_ --output_file=pdbbind_coor2 --use_new_data --groundtruth_dir=data/pdbbind/ --dataset_file=Graph_tmp --apo --thread_num=32
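
Before starting the conversion it may help to confirm that the input folders are in place. The sketch below checks only for the two folders listed in the Folders section of this README; it makes no assumption about the files inside them. The helper name `missing_inputs` is hypothetical.

```python
from pathlib import Path

# Input folders named in the Folders section of this README.
REQUIRED_DIRS = ("data/pdb_list", "data/pdbbind")


def missing_inputs(root="."):
    """Return the required input folders that do not exist under `root`."""
    root = Path(root)
    return [d for d in REQUIRED_DIRS if not (root / d).is_dir()]
```

An empty list from `missing_inputs()` means both folders were found relative to the current directory.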

3. Train the water network-augmented two-state model:
conda activate train
python train-gpu.py --d_graph_layer 256 --n_graph_layer 4 --dropout_rate 0.3 --data_path Graph_tmp/pdbbind_coor2 --output ./output.txt --pred_output ./prediction.txt --batch_size 64 --processed processed --graph_pooling sum --apo

4. Run the trained model in test mode:
python train-gpu.py --d_graph_layer 256 --n_graph_layer 4 --dropout_rate 0.3 --data_path Graph_tmp/pdbbind_coor2 --pred_output ./prediction.txt --processed processed-score --graph_pooling sum --apo --pre_model model/ECIFGraph-Holo-Apo.pt --test_mode
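
The training and test commands share most of their flags, and the model hyperparameters presumably need to match between the two. One way to keep them in sync is to build both command lines from a single list. This is a sketch only: the helper names `train_command` and `score_command` are hypothetical, and it assumes `train-gpu.py` does not depend on the order of its flags, which this README does not state.

```python
import shlex

# Flags that appear in both the step 3 and step 4 commands above.
SHARED = [
    "--d_graph_layer", "256",
    "--n_graph_layer", "4",
    "--dropout_rate", "0.3",
    "--data_path", "Graph_tmp/pdbbind_coor2",
    "--pred_output", "./prediction.txt",
    "--graph_pooling", "sum",
    "--apo",
]


def train_command():
    """Argument list for step 3 (training)."""
    return ["python", "train-gpu.py", *SHARED,
            "--output", "./output.txt",
            "--batch_size", "64",
            "--processed", "processed"]


def score_command(model="model/ECIFGraph-Holo-Apo.pt"):
    """Argument list for step 4 (scoring with a trained model)."""
    return ["python", "train-gpu.py", *SHARED,
            "--processed", "processed-score",
            "--pre_model", model,
            "--test_mode"]


if __name__ == "__main__":
    # Print copy-pasteable command lines (shlex.join needs Python 3.8+).
    print(shlex.join(train_command()))
    print(shlex.join(score_command()))
```

Either list can be passed directly to `subprocess.run` from the `train` environment.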

Folders

data/pdb_list: PDBID list file and experimental binding affinity file
data/pdbbind: Protein & ligand structure files

Notes:

1. Protein PDB files are assumed to contain coordinates for all heavy atoms
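
A quick partial check of this assumption is to look for residues that lack backbone heavy atoms. The sketch below reads standard fixed-column `ATOM` records and reports residues missing any of N, CA, C, or O. It does not verify side-chain atoms, so passing it does not guarantee that all heavy atoms are present. The function name `residues_missing_backbone` is hypothetical and not part of this repository.

```python
BACKBONE = {"N", "CA", "C", "O"}  # backbone heavy atoms of an amino acid


def residues_missing_backbone(pdb_lines):
    """Map (chain, residue number, residue name) to missing backbone atoms.

    Only ATOM records are inspected; HETATM records (ligands, waters)
    and all other record types are skipped.
    """
    seen = {}
    for line in pdb_lines:
        if not line.startswith("ATOM"):
            continue
        atom = line[12:16].strip()  # atom name, columns 13-16
        # chain ID (col 22), residue number + insertion code (cols 23-27),
        # residue name (cols 18-20)
        res_id = (line[21], line[22:27].strip(), line[17:20].strip())
        seen.setdefault(res_id, set()).add(atom)
    return {rid: sorted(BACKBONE - atoms)
            for rid, atoms in seen.items() if BACKBONE - atoms}
```

For a file on disk, `residues_missing_backbone(open(path))` returns an empty dict when every residue has a complete backbone.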
