Recombinatoric exploration of novel folded structures: A heteropolymer-based model of protein evolutionary landscapes

  1. Yan Cui*,,
  2. Wing Hung Wong*,,,
  3. Erich Bornberg-Bauer§, and
  4. Hue Sun Chan,
  1. *Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115; Dana-Farber Cancer Institute, Boston, MA 02115; Department of Statistics, Faculty of Arts and Sciences, Harvard University, Cambridge, MA 02138; §Bioinformatics Group, School of Biological Sciences, University of Manchester, Manchester M13 9PT, United Kingdom; and Departments of Biochemistry and Medical Genetics and Microbiology, Faculty of Medicine, University of Toronto, Toronto, ON, Canada M5S 1A8
  1. Edited by Peter G. Wolynes, University of California at San Diego, La Jolla, CA, and approved November 21, 2001 (received for review May 15, 2001)

Abstract

The role of recombination in evolution is compared with that of point mutations (substitutions) in the context of a simple, polymer physics-based model mapping between sequence (genotype) and conformational (phenotype) spaces. Crossovers and point mutations of lattice chains with a hydrophobic polar code are investigated. Sequences encoding for a single ground-state conformation are considered viable and used as model proteins. Point mutations lead to diffusive walks on the evolutionary landscape, whereas crossovers can “tunnel” through barriers of diminished fitness. The degree to which crossovers allow for more efficient sequence and structural exploration depends on the relative rates of point mutations versus that of crossovers and the dispersion in fitness that characterizes the ruggedness of the evolutionary landscape. The probability that a crossover between a pair of viable sequences results in viable sequences is an order of magnitude higher than random, implying that a sequence's overall propensity to encode uniquely is embodied partially in local signals. Consistent with this observation, certain hydrophobicity patterns are significantly more favored than others among fragments (i.e., subsequences) of sequences that encode uniquely, and examples reminiscent of autonomous folding units in real proteins are found. The number of structures explored by both crossovers and point mutations is always substantially larger than that via point mutations alone, but the corresponding numbers of sequences explored can be comparable when the evolutionary landscape is rugged. Efficient structural exploration requires intermediate nonextreme ratios between point-mutation and crossover rates.

Footnotes

  • To whom reprint requests should be addressed at: Department of Biochemistry, University of Toronto, Medical Sciences Building, 5th Floor, 1 King's College Circle, Toronto, ON, Canada M5S 1A8. E-mail: chan{at}arrhenius.med.toronto.edu.

  • This paper was submitted directly (Track II) to the PNAS office.

  • Abbreviations:
    2D,
    two-dimensional;
    H,
    hydrophobic;
    P,
    polar
« Previous | Next Article »Table of Contents