Previous Article |
Table of Contents
| Next Article
BIOLOGICAL SCIENCES / BIOPHYSICS
On the origin and highly likely completeness of single-domain protein structures



*Center of Excellence in Bioinformatics, University at Buffalo, State University of New York, 901 Washington Street, Buffalo, NY 14203; and
Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02138
Edited by Harold A. Scheraga, Cornell University, Ithaca, NY, and approved December 30, 2005 (received for review October 27, 2005)
The size and origin of the protein fold universe is of fundamental and practical importance. Analyzing randomly generated, compact sticky homopolypeptide conformations constructed in generic simplified and all-atom protein models, all have similar folds in the library of solved structures, the Protein Data Bank, and conversely, all compact, single-domain protein structures in the Protein Data Bank have structural analogues in the compact model set. Thus, both sets are highly likely complete, with the protein fold universe arising from compact conformations of hydrogen-bonded, secondary structures. Because side chains are represented by their C
atoms, these results also suggest that the observed protein folds are insensitive to the details of side-chain packing. Sequence specificity enters both in fine-tuning the structure and thermodynamically stabilizing a given fold with respect to the set of alternatives. Scanning the models against a three-dimensional active-site library, close geometric matches are frequently found. Thus, the presence of active-site-like geometries also seems to be a consequence of the packing of compact, secondary structural elements. These results have significant implications for the evolution of protein structure and function.
evolution | Protein Data Bank | protein folding | protein structure prediction
Conflict of interest statement: No conflicts declared.
This paper was submitted directly (Track II) to the PNAS office.
To whom correspondence should be sent at the present address: Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 250 14th Street NW, Atlanta, GA 30318. E-mail: skolnick{at}gatech.edu
© 2006 by The National Academy of Sciences of the USA
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg What's this?
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
L. Wroblewska, A. Jagielska, and J. Skolnick Development of a Physics-Based Force Field for the Scoring and Refinement of Protein Models Biophys. J., April 15, 2008; 94(8): 3227 - 3240. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhou and J. Skolnick Ab Initio Protein Structure Prediction Using Chunk-TASSER Biophys. J., September 1, 2007; 93(5): 1510 - 1518. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-X. Zhou and S. Qin Interaction-site prediction for protein complexes: a critical assessment Bioinformatics, September 1, 2007; 23(17): 2203 - 2209. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Meyerguz, J. Kleinberg, and R. Elber From the Cover: The network of sequence flow between protein structures PNAS, July 10, 2007; 104(28): 11627 - 11632. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Zhong, J. M. Moix, S. Quirk, and R. Hernandez Dihedral-Angle Information Entropy as a Gauge of Secondary Structure Propensity Biophys. J., December 1, 2006; 91(11): 4014 - 4023. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. D. Rose, P. J. Fleming, J. R. Banavar, and A. Maritan A backbone-based theory of protein folding PNAS, November 7, 2006; 103(45): 16623 - 16633. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Fleming, H. Gong, and G. D. Rose Secondary structure determines protein topology Protein Sci., August 1, 2006; 15(8): 1829 - 1834. [Abstract] [Full Text] [PDF] |
||||