Giter Site home page Giter Site logo

parse_pdb_header's Introduction

Descriptions of the parse_PDB_header

A Python 3 script for parsing the header information of PDB files.

The main purpose of this script is to extract some of the header information, such as Experiment Method, Resolution, R value, R_free value, mean B-factor, then calculate the grades of Resolution and R_free value based on the grading of FirstGlance in Jmol. Finally, saving above mentioned information as a .csv file (Pandas DataFrame).

Note: In some PDB files (due to the complex of the protein's structure and the limitations of the experimental detections), the R value have different data, even in the same PDB file, or shown as NULL. Similarly for the R_free value. In addition, the mean B-factor some times reported as NULL

For examples:

In 1BRT.pdb,

REMARK   3  FIT TO DATA USED IN REFINEMENT.                                     
REMARK   3   CROSS-VALIDATION METHOD          : THROUGHOUT                      
REMARK   3   FREE R VALUE TEST SET SELECTION  : RANDOM                          
REMARK   3   R VALUE     (WORKING + TEST SET) : 0.140                           
REMARK   3   R VALUE            (WORKING SET) : 0.147                         
REMARK   3   FREE R VALUE                     : 0.164                           
REMARK   3   FREE R VALUE TEST SET SIZE   (%) : 5.000                           
REMARK   3   FREE R VALUE TEST SET COUNT      : 2283                            

In 1GPD.pbd

REMARK   3  FIT TO DATA USED IN REFINEMENT.                                     
REMARK   3   CROSS-VALIDATION METHOD          : NULL                            
REMARK   3   FREE R VALUE TEST SET SELECTION  : NULL                            
REMARK   3   R VALUE            (WORKING SET) : NULL                            
REMARK   3   FREE R VALUE                     : NULL                           
REMARK   3   FREE R VALUE TEST SET SIZE   (%) : NULL   
REMARK   3   FREE R VALUE TEST SET COUNT      : NULL                            
REMARK   3   ESTIMATED ERROR OF FREE R VALUE  : NULL     
...
REMARK   3  B VALUES.                                                           
REMARK   3   FROM WILSON PLOT           (A**2) : NULL                           
REMARK   3   MEAN B VALUE      (OVERALL, A**2) : NULL                         
REMARK   3   OVERALL ANISOTROPIC B VALUE.                                       
REMARK   3    B11 (A**2) : NULL                                                 
REMARK   3    B22 (A**2) : NULL                                                 
REMARK   3    B33 (A**2) : NULL                                                 
REMARK   3    B12 (A**2) : NULL                                                 
REMARK   3    B13 (A**2) : NULL                                                 
REMARK   3    B23 (A**2) : NULL                                                 

Usage

For example

python parse_PDB_header.py

Then the program will ask you to input the directory that contains the PDB files.

>>> Please type the directory contains PDB files:

If you are already in that directory, you only need to type ./ as input.

Acknowledgements

I thank Wayne for discussion about the calc_R_free_grade() and deal_round() functions. I also would like to thank Zachary Ware for the detailed of the Decimal() function, which published on 2015-08-08 09:36.

parse_pdb_header's People

Contributors

lepingkyxk avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.