Comments (6)
In this study we also used Open Babel to convert .pdbqt to .sdf, and molecules that failed conversion were simply skipped. Another strategy is to record only the 3D coordinates of the docking poses and then update the coordinates of the input molecules (.sdf or .mol2) with the newly generated ones.
I hope these suggestions will help you!
from rtmscore.
That strategy sounds really cool! We have run several examples in which we replaced the coordinates of the reference molecule with the newly generated ones, but we are still unable to pick out the pose that best matches the reference, which should have received the highest score. Our setup is described as follows:
We first use rdkit.Chem.RemoveHs() to remove hydrogen atoms from both the generated molecule and the reference molecule (because the former has no hydrogen atoms written in its .sdf file, whereas the latter does), then update the coordinates in the reference file according to those in the generated one, and finally use rdkit.Chem.AddHs(addCoords=True) to add the hydrogen atoms back. However, the results are not very satisfying. Is the last step necessary? And is our approach in line with your suggestion?
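The heavy-atom bookkeeping in this setup can be sketched without any toolkit. This is only a minimal illustration, not RTMScore or RDKit code: the function `transfer_heavy_atom_coords` and the `(element, xyz)` tuples are hypothetical, and the sketch assumes that both files list the heavy atoms in the same order, which is worth checking after a .pdbqt conversion.

```python
from typing import List, Tuple

# Hypothetical minimal atom record: (element symbol, (x, y, z))
Atom = Tuple[str, Tuple[float, float, float]]


def transfer_heavy_atom_coords(reference: List[Atom], pose: List[Atom]) -> List[Atom]:
    """Return the heavy atoms of `reference` carrying the coordinates of `pose`.

    Assumption: both lists describe the same molecule with heavy atoms in the
    same order. Hydrogens are dropped, since their old positions no longer fit
    the new pose and would have to be rebuilt afterwards.
    """
    ref_heavy = [atom for atom in reference if atom[0] != "H"]
    pose_heavy = [atom for atom in pose if atom[0] != "H"]
    if len(ref_heavy) != len(pose_heavy):
        raise ValueError("heavy-atom counts differ between reference and pose")

    updated = []
    for i, ((ref_el, _), (pose_el, xyz)) in enumerate(zip(ref_heavy, pose_heavy)):
        # A mismatch here means the atom order changed during conversion.
        if ref_el != pose_el:
            raise ValueError(f"element mismatch at heavy atom {i}: {ref_el} vs {pose_el}")
        updated.append((ref_el, xyz))
    return updated


# Toy data: the reference has an explicit hydrogen, the docked pose does not.
reference = [("C", (0.0, 0.0, 0.0)), ("H", (0.0, 0.0, 1.0)), ("O", (1.2, 0.0, 0.0))]
pose = [("C", (5.0, 5.0, 5.0)), ("O", (6.2, 5.0, 5.0))]
updated = transfer_heavy_atom_coords(reference, pose)
print(updated)
```

The element check is the point of the sketch: if the atom order differs between the two files, copying coordinates by index silently produces a distorted molecule, which would then be scored poorly.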
As I understand it, you have successfully rescored the molecules with RTMScore but could not obtain satisfactory results using RTMScore alone for rescoring. It should be noted that our method exhibits excellent docking and screening powers rather than scoring and ranking powers. Additionally, the performance is evaluated in terms of overall statistics, and it is common for our method to perform poorly on some targets.
Sorry, I did not explain the background clearly, so you probably misunderstood our purpose...
We feed a protein-ligand pair (e.g. 1t7j_protein.pdb and 1t7j_ligand.mol2) into AutoDock Vina and get different binding poses (ligand_out_1t7j.sdf, converted from .pdbqt) for this one pair only. In this scenario RTMScore is supposed to give the highest score to the binding pose that best matches the real one among all the output poses, but we failed to achieve that.
I would appreciate it if you could spare some time to help us check what led to our failure. The attachment is the example mentioned above (1t7j), and the problem is that the highest-ranked pose (the 331st in ligand_out_1t7j.sdf) is obviously worse than the 15th one (we visualized them in PyMOL, by the way).
1t7j.zip
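A check of this kind can also be made numerically rather than by eye. The sketch below is an assumption-laden illustration, not part of RTMScore: `rmsd` and `check_top_pose` are hypothetical helpers, coordinates are plain tuples of heavy-atom positions in matching order, no alignment or symmetry correction is applied (docked poses and the crystal ligand share the protein's frame), and the 2.0 Å cutoff is a commonly used near-native threshold taken here as an assumption.

```python
import math
from typing import Dict, List, Sequence, Tuple

Coords = Sequence[Tuple[float, float, float]]


def rmsd(a: Coords, b: Coords) -> float:
    """In-place RMSD between two equally ordered coordinate lists."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum((p - q) ** 2 for u, v in zip(a, b) for p, q in zip(u, v))
    return math.sqrt(squared / len(a))


def check_top_pose(scores: List[float], poses: List[Coords], native: Coords,
                   cutoff: float = 2.0) -> Dict[str, object]:
    """Compare the top-scored pose with the pose closest to the native ligand.

    Assumes a higher score means a better pose.
    """
    rmsds = [rmsd(pose, native) for pose in poses]
    top = max(range(len(scores)), key=scores.__getitem__)
    closest = min(range(len(rmsds)), key=rmsds.__getitem__)
    return {
        "top_scored": top,            # index of the pose with the highest score
        "closest": closest,           # index of the pose with the lowest RMSD
        "top_rmsd": rmsds[top],       # RMSD of the top-scored pose
        "success": rmsds[top] <= cutoff,
    }


# Toy data: pose 0 is near-native, but pose 1 receives the higher score.
native = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
poses = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.5)],
    [(3.0, 0.0, 0.0), (4.0, 0.0, 0.0)],
]
scores = [10.0, 50.0]
report = check_top_pose(scores, poses, native)
print(report)
```

When `top_scored` and `closest` disagree by a wide margin, as in the toy data, the input preparation is worth checking before concluding that the scoring function itself has failed.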
The following are the results generated with the command "python rtmscore.py -p 1t7j_protein.pdb -l ligand_out_1t7j.sdf -m ../trained_models/rtmscore_model1.pth -o xxxqq -c 10.0 -rl 1t7j_ligand.mol2 -gen_pocket", based solely on your files, and here we can successfully identify the near-native poses. Could something be wrong in your setup?
Right, we finally found that using RDKit to read the molecule files with sanitization disabled leads to this unexpected behavior... Thanks to your patient replies, everything is OK now.
I am very grateful for your help!