A sequence-compatible amount of native burial information is sufficient for determining the structure of small globular proteins

  1. Antonio F. Pereira de Araujoa,b and
  2. José N. Onuchicb,1
  1. aLaboratório de Biologia Teórica, Departamento de Biologia Celular, Universidade de Brasília, DF 70910-900 Brasília, Brazil; and
  2. bCenter for Theoretical Biological Physics, University of California at San Diego, La Jolla, CA 92093
  1. Contributed by José N. Onuchic, September 25, 2009 (received for review August 31, 2009)

Abstract

Protein tertiary structures are known to be encoded in amino acid sequences, but the problem of structure prediction from sequence continues to be a challenge. With this question in mind, recent simulations have shown that atomic burials, as expressed by atom distances to the molecular geometrical center, are sufficiently informative for determining native conformations of small globular proteins. Here we use a simple computational experiment to estimate the amount of this required burial information and find it to be surprisingly small, actually comparable with the stringent limit imposed by sequence statistics. Atomic burials appear to satisfy, therefore, minimal requirements for a putative dominating property in the folding code because they provide an amount of information sufficiently large for structural determination but, at the same time, sufficiently small to be encodable in sequences. In a simple analogy with human communication, atomic burials could correspond to the actual “language” encoded in the amino acid “script” from which the complexity of native conformations is recovered during the folding process.

Footnotes

  • 1To whom correspondence should be addressed. E-mail: jonuchic{at}ucsd.edu
  • Author contributions: A.F.P.d.A. and J.N.O. designed research, performed research, analyzed data, and wrote the paper.

  • The authors declare no conflict of interest.

« Previous | Next Article »Table of Contents