Previous Article |
Table of Contents
| Next Article
PHYSICS / EVOLUTION
Evolvability is a selectable trait
Department of Bioengineering and Department of Physics and Astronomy, Rice University, Houston, TX 77005-1892
Communicated by David Chandler, University of California, Berkeley, CA, June 30, 2004 (received for review April 19, 2004)
| Abstract |
|---|
|
|
|---|
Darwin was obsessed with variation. His books, considered as an ensemble, devote much more attention to variation than to natural selection, because he knew that no satisfactory theory of evolutionary change could be constructed until the causes of variation and the empirical rule of its form and amount had been elucidated (1).
Whether the propensity to evolve, or evolvability (24), can be an object of Darwinian natural selection is a topic of interest. Causality would suggest not because of the apparently anticipatory nature of evolvability (5, 6). Many within the field of evolutionary biology are uncomfortable with the concept that evolvability is a selectable trait. A growing body of experimental data, however, would be explained if evolvability were a selectable trait (715).
Higher organisms cannot evolve, or adapt, by germ-line mutation to an environmental change within their own lifetime. Does this mean that lineages and individuals cannot be under selection for evolvability? Although viability is the selection criterion, the genotype that determines the viability arises in a mutated, evolved way from that of the previous generation as a result of base substitution, recombination, transposition, and horizontal gene transfer. These mutational processes are the driving forces of evolution, and their rates fundamentally determine evolvability. The perspective we offer here is that the evolvability of an organism is defined by the rates of genetic change, that genetic change is not always deleterious, and that these rates of genetic change are not fixed and are under selective pressure. That is, the mechanisms that define the rates of change are encoded in the genotype, and so they are selectable. An analogy with thermodynamics illuminates the issue: How is free energy minimized in a physical system of particles despite the difficulty in defining the entropy of a given configuration of the particles? An ensemble of particle configurations allows the definition of free energy and the approach to thermodynamic equilibrium just as a population of evolving organisms allows the definition of and selection for evolvability.
Within the framework of point mutation, base substitution, and recombination, correlations of adaptation with function have been observed. It is known that immunoglobins have evolved such that the mutation rates in complementary determining regions, in which mutation is most likely to generate useful variants, are much higher than those in framework regions (14, 16). Recent data point to a role for DNA polymerases in regulating the somatic hypermutation rate of immunoglobin genes (13, 17). Similarly, codon usage within the influenza hemagglutinin protein seems to be biased to favor more rapid antigenic drift (14). Furthermore, in HIV-1 protease, the probability of mutation is not randomly distributed within the structure but rather concentrated at sites that alter the geometry of the protein-binding domain, conferring significant propensity for antigenic drift (18). Such behavior is not mere curiosity and has widespread implications for drug design and the evolution of drug resistance (19). Stressful conditions may generally provoke activation of error-prone polymerases, triggering a large increase in adaptive rates (20). Not only point mutation but also recombination are widely appreciated to confer increased evolvability (9, 21, 22). Recombination among the hemagglutinin and neuraminidase proteins, for example, is believed to be a significant mechanism leading to the emergence of new virulent strains of influenza (23). Computational and theoretical studies have also shown the persistence under selection of evolvability-enhancing moves in the context of point mutation and recombination evolutionary dynamics (2429).
The selective forces that lead to the evolution and maintenance of mechanisms for rearrangement, deletion, transfer, and transposition of genetic material, inasmuch as they lead to even greater evolution than point mutation and recombination alone, are of great interest. For example, the development of antibiotic resistance in bacteria has evolved mainly through the swapping of DNA pieces between the evolving bacteria (8, 30). Similarly, the evolution of Escherichia coli from Salmonella is thought to have occurred exclusively from DNA swapping (31). It has been proposed that the success of bacteria as a group stems from a capacity to acquire genes from a large and diverse range of species (32). It would seem, then, that large genetic moves are pervasive and crucial to evolutionary dynamics (6, 8, 1012, 15, 30, 31, 3339). Concomitantly, evolvability is enhanced by these larger moves, as shown experimentally for the case of DNA shuffling (32, 4044). A key question is whether selection for evolvability fosters the husbandry of these moves.
We address here, from a theoretical point of view, selection of evolvability in the presence of large-scale genetic moves. Although the use of the term evolvability has only recently come into vogue in the scientific community, investigations into the evolution of adaptation go back several decades (4547). Prominent from a theoretical perspective are works in population genetics (48, 49) and game theory (5052). Despite the insights that these studies give as to the origin and maintenance of evolvability, evolution of and selection for evolvability remains a contested issue primarily because of the causality principle (5, 6). We show here that evolvability is selected for, notwithstanding the constraints imposed by causality, when a system is subject to a constant, random environmental change. This selection for evolvability occurs even when viability as a function of genotype is an extremely complex function, with exponentially many optima, and when the evolving system is unable to reach the global optimum of viability in any one instance of the environment. We demonstrate our results by using computer simulations of protein molecular evolution that incorporate selection in a varying environment. The genotype of a protein molecule is mapped to a complex phenotype by using a generalized NK model in which all assumptions and relevant parameters are known. The selective pressure for evolvability is shown to be greater for larger rates of environmental change. Interestingly, a generalized susceptibility of the system correlates with the fluctuations in the environment, albeit not as a result of generalized linear response theory (53). The addition of selection for evolvability as a phenomenological law to the toolbox of evolutionary theory allows for the explanation of a large body of experimental results.
| The Generalized Block NK Model |
|---|
|
|
|---|
We model the molecular evolution of protein systems by using a generalization of the NK (5557) and block NK (58) models that has been used previously to study protein molecular evolution strategies (54) and the immune-system response to vaccination and disease (59). The model includes a population of sequences, upon which selection acts and in which occur genetic mutations. The mutational hierarchy includes both point mutations and large-scale swapping moves, akin to transposition or translocation events. Although the model does not include recombination, such inclusion is not expected to change the results because swapping can be viewed as a powerful form of recombination (54). For example, linkage effects are mitigated even more rapidly by swapping in our model than they would be by recombination. The selection for greater swapping rates in more rapidly changing environments observed in our model parallels results found in studies of the evolution of sex, for which adaptation and variation in a heterogeneous environment is well researched (60).
In the generalized block NK model, each individual evolving protein sequence has an energy that is determined by secondary structural subdomain energies, Usd, subdomainsubdomain interaction energies, Usdsd, and chemical binding energies, Uc. This energy is used as the selection criteria in our studies and is given by
![]() | [1] |
Within our generalized block NK model, each protein molecule is composed of M = 10 secondary structural subdomains of N = 10 aa in length. We consider five chemically distinct amino acid classes (negative, positive, polar, hydrophobic, and other) and L = 5 different types of subdomains (helices, strands, loops, turns, and others). We therefore have L different subdomain energy functions of the NK form
![]() | [2] |
where aj is the amino acid type of the jth amino acid in the subdomain, and
i is the type of the ith subdomain. As in previous studies, we consider the case in which the range of the interactions within a subdomain is specified by K = 4 (54, 59). Here 
i is a quenched Gaussian random number with zero mean and a variance of unity, and it is different for each value of its argument for each of the L subdomain types,
i. The interaction energy between secondary subdomain structures is given by
![]() | [3] |
where we consider D = 6 interactions between secondary structures (54, 59). The zero-mean, unit-variance Gaussian
and the interacting amino acids, j1,..., jK, are selected at random for each interaction (i, j, k). In our model, P = 5 aa contribute directly to a binding event, as in a typical pharmacophore, where the chemical binding energy of each amino acid is given by
![]() | [4] |
where the zero-mean, unit-variance Gaussian
i and the contributing amino acid, i, are chosen at random.
| System Evolution and Environmental Change |
|---|
|
|
|---|
After pool swapping and point mutations, selection occurs, and the 20% lowest-energy protein sequences are kept and amplified to form the population of 1,000 proteins for the next round of selection. The parameters pswap(i) and nmut(i) are allowed to take a log-Gaussian random walk for each protein sequence. This process is repeated for Ngen rounds of selection, after which an environmental change is imposed on the system with a severity that is characterized by the parameter p (59). The parameter p is the probability of (i) changing the type of each of the 10 subdomains in the protein sequences,
i in Eq. 2, (ii) changing the amino acids and energies that are involved in subdomainsubdomain interactions, jk and
(k)ij in Eq. 3, and (iii) changing the amino acids and energies that are involved in the chemical binding, i and
i in Eq. 4. We repeat the process for a total of 100 environmental changes and average our results over 1,000 instances of the ensemble. For each system studied, a steady state in nmut, pswap, and the average energies at the beginning,
U
start, and end,
U
end, of the dynamics in a single instance of the environment is reached after <80 environmental changes in all cases. We average the data over the last 20 environmental changes. We study how the frequency of environmental change, 1/Ngen, and the severity of environmental change, p, affect the evolvability of the protein sequences. A schematic diagram showing the molecular evolution of our protein system can be seen in Fig. 1.
|
| Selection for Evolvability |
|---|
|
|
|---|
0, do large-scale changes tend to be disfavored. The role of base substitution in our evolving system is more complex. For more severe environmental changes and for higher frequencies of environmental change, the system depends more on DNA swapping than on point mutation to produce low-energy proteins. In these cases, because the protein must make large changes to its sequence to adapt to the environmental change, selection results in high values of pswap, with base substitution having only a small effect on the energy of the protein. For less severe environmental changes and lower frequencies of environmental change, base substitution is sufficient to achieve the small modifications in protein sequence that are required for adaptation to the environmental change. Thus, we observe the higher dependence on nmut and lower dependence on pswap for small p. In addition, as 1/Ngen
0, nmut
0, because mutations tend to be deleterious in stable systems with no environmental fluctuations.
|
![]() | [5] |
The overwhelming contribution to D0 comes from the swapping step, because the swapping move far more dramatically changes the sequence. The short-time diffusion rate selected for reflects, as a function of environmental change, a balance between staying within a favorable basin of attraction, or niche, and adaptation to a newly created, superior niche. As Fig. 2 shows, greater environmental change favors greater local diffusion, as indicated by the monotonic increase of pswap with p.
It is useful to regard base substitution as a means of fine tuning the protein sequences, whereas DNA swapping can be considered a source of more substantial evolutionary change. This hierarchy within the space of evolutionary moves becomes more apparent when studying the difference between starting and ending protein sequences within one environment as a function of p, pswap, and nmut. The distance between protein sequences is characterized by the Hamming distance between the respective amino acid sequences. For a given p, the Hamming distance decreases only slightly as the frequency of environmental change, 1/Ngen, increases, but it has a very strong dependence on the severity of the environmental change, p, as shown in Fig. 3a. The sensitivity of the Hamming distance also shows markedly different behavior to pswap and nmut, as shown in Fig. 3b. For state points with fixed nmut, 1/Ngen, and p, the Hamming distance strongly depends on the value of pswap. However, for state points with fixed pswap, 1/Ngen, and p, the Hamming distance displays little or no variation with nmut. The Hamming distance is a long-time measure of the evolvability of the system. The long-time diffusion coefficient can be defined as the square of the Hamming distance multiplied by the frequency of environmental change. As Fig. 3a implies, the long-time evolvability, as measured by the long-time diffusion coefficient, increases with both the severity and frequency of environmental change.
|
As we have seen, evolvability is quantifiable at any point in time through measurement of diversity and the local mutation rates. For this reason, causality does not prevent selection for evolvability. Because evolvability is an observable property, it can be selected for.
| Susceptibility |
|---|
|
|
|---|
U
start, and end,
U
end, of the dynamics within a single instance of the environment. This quantity is shown as a function of the severity, p, and frequency, 1/Ngen, of environmental change. It is apparent that at low frequencies of environmental change, populations with greater diversity and variation, which are more evolvable, have slightly lower values of
U
end. There is also a clear increasing trend in
U
start as a function of p, which is a feature of the generalized NK model. Considering the ending energy of a protein molecule within one instance of the environment to be roughly the sum of n Gaussian terms from the generalized NK model,
![]() | [6] |
The starting energy of this protein molecule after an environmental change is given by
![]() | [7] |
where
![]() | [8] |
and where
are random Gaussian variables with zero mean (
= 0), whereas xi are evolved variables that are better than random and typically negative. Thus, the average starting energy of this protein molecule is
![]() | [9] |
Thus, averaging over the values in the new environment
![]() | [10] |
or, averaging over many environmental changes
![]() | [11] |
|

U
/Ngen = (
U
end
U
start)/Ngen. In Fig. 4b we plot the susceptibility of our system as a function of the severity of environmental change, p. For a fixed frequency of environmental change, the susceptibility is a linear function of the severity of environmental change, as in Eq. 11. This simple analysis captures the essence of the dynamics that occurs in the correlated, generalized NK model. Fig. 4c shows that the probability distribution of the susceptibility is Gaussian in shape. Note also that the variance of the susceptibility increases with p in Fig. 4c, and thus the linearity of the susceptibility in Fig. 4b is not simply the result of a generalized fluctuationdissipation theorem. | Implications for Evolution |
|---|
|
|
|---|
Given our results, we propose that it is not mere chance that highly evolvable species tend to be found in rapidly changing environments or that an environmental crisis can trigger an increase in the rate of the evolution of a species. Indeed, selection for evolvability allows for the explanation of many data: the existence of somatic hypermutation in the immune system (13, 14, 16, 17), the evolution of drug resistance in species of bacteria (8, 30), and the occurrence and success of transpositional events in bacterial evolution (10, 31, 36). A recently studied example from mammals is the San Nicolas Island fox, which is a highly endangered species and the most monomorphic sexually reproducing animal known. This species, however, is found to have high levels of genetic variation within the major histocompatibility complex loci (62) that allows for increased pathogen resistance.
We believe that our results are of relevance to the field of vaccine and drug design. Currently, the design of new vaccines and drugs is largely based on the assumption that pathogens evolve by local space searching in response to therapeutic and immune selection. However, it is clear that we must anticipate the evolutionary potential of large DNA swapping events in the development of viruses, parasites, bacteria, and cancers if we are to engineer effective methods of treating them. How evolvability correlates with treatment strategy, and how to drive pathogens into regions of low evolvability where they are eradicated most easily, is of importance to efforts for vaccine and drug engineering.
Specific pathogenic examples of evolvability include the emergence of new influenza strains by a novel hemagglutinin neuraminidase recombination, followed by antigenic drift to a highly infectious strain (23); emergence of many new HIV strains with the spread of the disease from its site of origin in Africa (63, 64); and the increased emergence of new infectious diseases associated with modern, post-World War II travel (65). Additionally, a recent study of the dynamics of HIV-1 recombination suggests that HIV-1 may have evolved high recombination rates to foster rapid diversification and further its survival (66).
Note that evolvability is not simply the observation that new strains occur; rather, it is the underlying probability with which new strains are created by genetic modification. These new strains may proliferate and be observed, or they may fail and not be observed to an appreciable extent. Fundamental study of evolvability, then, requires an appreciation of these underlying rates of genetic change. These underlying rates, such as polymerase error rates, recombination rates, and transposition rates, are what selection for increased evolvability may modulate (67). These underlying rates of change are inheritable and can be altered by mutation. Study of these rates of genetic change, deconvoluted from observed rates of evolution, which are these rates multiplied by a probability of survival, is of fundamental interest.
It is intriguing that we find that at low frequencies of environmental change, populations that are subject to more severe environmental changes can produce lower-energy individuals than populations that are not subject to environmental changes (Fig. 4a). Thus, under some conditions, adaptability can provide global benefits. This finding can be contrasted to the more customary expectation that specialists are better than generalists (68). In experimental studies of Chlamydomonas, generalists that were evolved in alternating light and dark conditions were found to be better than their ancestors in both light and dark conditions but less good than specialists that had evolved exclusively in one of the environmental conditions (69). Studies of the evolution of E. coli at constant and alternating temperatures produced similar results (70, 71). The nature of the environmental change in these studies is not completely random as in our model. In addition, the number of rounds of selected evolution under each environmental condition is perhaps better defined within our model. These experiments do point to possible tests of our theory. For a species that is capable of DNA swapping evolutionary moves, a systematic study of competency as a function of the frequency of a random environmental change would be of interest. We predict that under some conditions, certain frequencies of environmental change will produce better individuals, after a given number of rounds of evolution and selection, than would be produced by evolution in a constant environment. Different severities of environmental change could also be imposed by altering the change in environmental variables between samples, such as temperature, food concentrations, light conditions, and exposure to disease. With regard to susceptibility, we would expect the rate of change of viability within an environment to be higher in systems with more frequent and harsher environmental changes because of greater evolvability.
| Summary |
|---|
|
|
|---|
Many observations within evolutionary biology, heretofore considered evolutionary happenstance or accidents, are explained by selection for evolvability. For example, the vertebrate immune system shows that the variable environment of antigens has provided selective pressure for the use of adaptable codons and low-fidelity polymerases during somatic hypermutation. A similar driving force for biased codon usage as a result of productively high mutation rates is observed in the hemagglutinin protein of influenza A. Selection for evolvability explains the prevalence of transposons among bacteria and recombination among higher organisms. We suggest that therapeutics also confer selective pressure on the evolvability of pathogens, and that this driving force for antigenic drift should be considered in drug- and vaccine-design efforts.
| Acknowledgements |
|---|
| Footnotes |
|---|
* To whom correspondence should be addressed at: Department of Bioengineering and Department of Physics and Astronomy, MS 142, Rice University, 6100 Main Street, Houston, TX 77005-1892. E-mail: mwdeem{at}chinook.rice.edu.
© 2004 by The National Academy of Sciences of the USA
| References |
|---|
|
|
|---|
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
E. Munoz and M. W. Deem Amino acid alphabet size in protein evolution experiments: better to search a small library thoroughly or a large library sparsely? Protein Eng. Des. Sel., May 1, 2008; 21(5): 311 - 317. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kashtan, E. Noor, and U. Alon Varying environments can speed up evolution PNAS, August 21, 2007; 104(34): 13711 - 13716. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Lynch Colloquium Papers: The frailty of adaptive hypotheses for the origins of organismal complexity PNAS, May 15, 2007; 104(suppl_1): 8597 - 8604. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. T. Cirz, M. B. Jones, N. A. Gingles, T. D. Minogue, B. Jarrahi, S. N. Peterson, and F. E. Romesberg Complete and SOS-Mediated Response of Staphylococcus aureus to the Antibiotic Ciprofloxacin J. Bacteriol., January 15, 2007; 189(2): 531 - 539. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. L. O'Loughlin, W. M. Patrick, and I. Matsumura Natural history as a predictor of protein evolvability Protein Eng. Des. Sel., October 1, 2006; 19(10): 439 - 442. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Bloom, S. T. Labthavikul, C. R. Otey, and F. H. Arnold Protein stability promotes evolvability PNAS, April 11, 2006; 103(15): 5869 - 5874. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Grande-Perez, E. Lazaro, P. Lowenstein, E. Domingo, and S. C. Manrubia Suppression of viral infectivity through lethal defection PNAS, March 22, 2005; 102(12): 4448 - 4452. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||