Previous Article |
Table of Contents
| Next Article
SOCIAL SCIENCES / BIOLOGICAL SCIENCES / ANTHROPOLOGY / EVOLUTION
Distinct genomic signatures of adaptation in pre- and postnatal environments during human evolution

,
,





,
,¶,

Center for Molecular Medicine and Genetics and
Department of Anatomy and Cell Biology, Wayne State University School of Medicine, Detroit, MI 48201; ¶Perinatology Research Branch, National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892; ||Department of Computer Science, Wayne State University, Detroit, MI 48201; 
Department of Anthropology, The George Washington University, Washington, DC 20052; and 
Department of Obstetrics and Gynecology, Wayne State University School of Medicine and Hutzel Women's Hospital, Detroit, MI 48201
Contributed by Morris Goodman, December 31, 2007 (received for review December 3, 2007)
| Abstract |
|---|
|
|
|---|
fetal | human disease | mitochondria | placenta | thyroid
An additional important dimension to consider is the environment in which selection occurs. Placental mammals exist in two primary environments during their lifetimes that may select for different adaptive changes: the prenatal and postnatal. Thus the prenatal environment, with its maternal fostering of fetal development, could favor a different course of adaptive evolution than postnatal environments, when individuals face a much broader range of external challenges. Furthermore, to shape complex morphologies or physiologies, the targets of natural selection may have been not just individual genes but, rather, networks of genes comprising specific biological pathways.
In light of these considerations, the present study was directed at identifying the positively selected genetic changes that may have helped to shape key, distinctive (and in some cases unique) phenotypic features of humans. We first identified genes that have been the targets of positive selection during human evolutionary history, broadly defined to encompass the lineage that descended from the last common ancestor (LCA) of primates and rodents to present day humans (Fig. 1A). After exclusion of the genes that were also targets of positive selection in rodent descent, the remaining genes were referred to as human ancestry-specific (HAS). We next identified highly expressed HAS genes from human tissues representing pre- and postnatal stages of development (11): fetal brain, adult brain, fetal liver, adult liver, fetal lung, adult lung, fetal thyroid, adult thyroid, placenta, adult uterus, and adult heart (Fig. 1B). For the purpose of this study, postnatal is defined as the age of the individual tissue samples, which were exclusively adult.
|
| Results and Discussion |
|---|
|
|
|---|
|
The top FACs comprise distinctive sets of annotations and indicate that different biological pathways became the targets of positive selection during successive periods of humankind's evolutionary history. For example, aerobic energy production pathways were particularly strong targets of selection on the ape stem, as evidenced by that period's annotations of mitochondrion and oxidative phosphorylation for HAS genes. Similarly, immune system functions appear to have been strong targets of selection on the primate stem, as evidenced by annotations of cytokine activity and immune response for HAS genes. Although some annotations do appear more than once during different periods of human evolutionary history (e.g., transmembrane, glycoprotein) only a small number of genes underlying these annotations are shared between the different periods (SI Table 7). This suggests that the adaptive changes that occurred during earlier periods set the stage for selection of different sets of adaptive changes during the later evolutionary periods, even within the same biological pathway. Additional analyses in which all HAS genes are clustered according to only disease-related annotations support the finding that earlier periods of human evolutionary history also play a role in "maladaptive" human phenotypes, particularly with respect to autoimmune and longevity-related disease (SI Results).
Highly Expressed HAS Genes. The top FACs for all highly expressed genes from each of the 11 tissues, and the subset of those that are also adaptively evolving, are presented in Table 2. Among the 1,240 HAS genes, 287 appeared at least once on the lists of genes highly expressed in the 11 studied tissues. In all four adult-stage tissues with a fetal counterpart, and in the uterus/placenta comparison, the annotations comprising the top clusters of highly expressed HAS genes are overrepresented to a greater extent (as indicated by enrichment scores) than in the prenatal-stage tissue. Furthermore, when considered in aggregate, enrichment scores from the top three clusters of highly expressed HAS genes from all five adult tissues are significantly different from the enrichment scores determined for the top three clusters in the five predominantly fetal tissues (Mann–Whitney U test = 45; P = 0.006, two-tailed test). When heart is added to the adult-stage tissues, the significant difference from fetal tissues remains (Mann–Whitney U test = 52; P = 0.003). These greater adult enrichment scores are in contrast to the more variable pattern of enrichment scores obtained for all highly expressed genes for which certain tissues (e.g., thyroid, lung) show higher enrichment in the fetal stage and others (e.g., whole brain, liver) show higher enrichment in the adult stage. Notably, enrichment scores determined for the top three clusters of all highly expressed genes in the adult vs. fetal tissues are not significantly different (Mann–Whitney U test = 100; P = 0.619, two-tailed test).
|
Pre- vs. Postnatal Environments.
It has been suggested that genes involved in determining species-specific morphologies are more likely to be expressed during developmental rather than adult stages (18) and that the species-specific morphologies are more evident later rather than earlier during development (19, 20). Although results presented here do not include a developmental time series of expression data, the pattern of fetal vs. adult-expressed HAS genes may be interpreted to conform to this general pattern in which the positively selected genes that shape species-specific morphologies are more likely to be highly expressed in fetal rather than in adult tissues: Only the prenatal and gestation-related tissues show 50% or more of their adaptive signatures on the human terminal lineage (Fig. 2). Indeed, among the 127 genes that are highly expressed in fetal tissues, only two show a signature of selection on both the human and chimpanzee terminals (and on no other, earlier period of human ancestry): smoothelin and v-crk sarcoma virus ct10 oncogene homolog (avian). Furthermore, only one of these—smoothelin—appears in one of the top three clusters identified among fetally expressed HAS genes (fetal lung). This observation is particularly noteworthy in that, in terms of mean dN calculated from the three human ancestry periods for each tissue with fetal and adult stages, the fetal tissues show lower values in all cases but the lung, which had similar mean dN values in its adult and fetal stages (0.058 vs. 0.059; other data not shown). Despite this overall lower level of nonsynonymous changes, HAS genes highly expressed in fetal tissues show a more pronounced pattern of human terminal-specific change than do their adult counterparts. When considering the total number of adaptive signatures that occur among the top three FACs by HAS branch, this difference in fetal vs. adult tissues is statistically significant (
2 = 7.52, df = 2, P = 0.02). As apparent from examples discussed below, the top FACs for adult tissues are usually implicated in physiological pathways, whereas the top FACs for fetal tissues can be implicated in developmental pathways. This interpretation of our results is in line with the proposal that investigations into the evolution of physiological and morphological characters are best treated separately (18).
|
In contrast, the most enriched cluster determined from adult whole-brain tissue includes annotations of mitochondrion, oxidative phosphorylation, and ion transporter activity and comprises 16 genes (SI Table 13), 10 of which show the signature of positive selection on the ape stem lineage. Given the documented accelerated evolution in electron transport chain genes in anthropoid primates (27) and the relatively higher expression of ETC-related genes in both chimpanzee and human brains (28), these results suggest that energy metabolism pathways supporting adult neuronal function have been modified during portions of the ancestral lineage shared with chimpanzees. In support of this, the proportion of metabolic-supporting glial cells relative to neurons in the neocortex increases in anthropoid primates as overall brain size enlarges, with humans and other apes displaying the highest glia–neuron ratios (29).
Because the heart exhibits a mass-specific metabolic rate that is more than twice that of the brain (30), we also examined HAS gene expression in adult heart tissue. Although aerobic energy metabolism-related clusters also appear in these analyses, the FACs so obtained appear to be largely subsets of the most enriched FACs obtained when considering all highly expressed genes in that tissue (Table 2). Furthermore, whereas brain and body sizes vary widely among primates (31), the heart is an extremely conserved organ whose four-chambered anatomical structure is present in all mammals and whose function is essential to life. Nevertheless, the heart could have benefited from adaptive changes in this pathway if, for example, the adaptive evolution favored more efficient energy production accompanied by less free radical production, which would, in turn, favor longer life spans.
Adaptive Evolution in Thyroid-Expressed HAS Genes. The thyroid produces hormones that are known to play a major role in regulating metabolic rate and the growth rate and function of many bodily systems. In particular, thyroid hormones play a significant role in early brain development (32–35). Despite low levels of triiodothyronine (T3) in the fetus, from weeks 13 to 20—when the mother is the only source of thyroxine (from which T3 is converted)—levels of this hormone increase in fetal cerebral cortical tissue to levels higher than in adults (34). The provision of this important hormone by the mother to the fetus may thus help account for the much greater number of HAS genes identified in adult vs. fetal thyroid, providing a particularly striking example of the differing signatures of selection depending on the developmental stage of the tissue. Consistent with this finding, the topmost cluster from adult thyroid predominates in membrane- and transmembrane-related annotations, suggesting that the genes so annotated could play a role in trafficking thyroid hormones and their related products.
Although genes directly related to thyroid hormone generation did not meet the criterion of our HAS list, oxioreductase and other aerobic energy metabolism annotations comprised the second-most-enriched FAC of HAS genes in the adult thyroid tissue. This finding is consistent with the view (36–38) that thyroid hormones are regulators of mitochondrial function, because they are known to increase oxygen consumption (39, 40) and stimulate cytochrome c oxidase activity (37). In addition, a direct mitochondrial T3 pathway has been implicated in cellular differentiation (41). This dual involvement in energy production and development has led to the proposal that thyroid hormone's regulation of mitochondrial activity may represent a major link between metabolism and development (38), a proposal consistent with the findings presented here.
Adaptive Evolution in Placentally Expressed HAS Genes. Although the placenta shows >50% of its adaptive signatures on the human terminal lineage in two of its top three FACs, results from the topmost FAC indicate that hormonal pathways that function in pregnancy have been particularly strong targets of adaptive evolution during the 19 or so million years before the LCA of humans and chimpanzees. Five of the six genes in this topmost cluster have annotations of hormone and/or hormone activity. Four of these genes show adaptive evolution on the ape stem lineage: MUC1, CCK, GH2, and CGA. MUC1 is expressed in both villous and extravillous trophoblasts (42, 43) and has been implicated in implantation (44). This gene is also highly expressed in the amnion (45). Humans have interstitial implantation compared with the superficial form with fewer invasive trophoblasts seen in Old World monkeys (46). CCK has been shown to regulate leptin levels (47), and leptin expression levels have been shown to change in the obstetrical syndromes of preeclampsia and intrauterine growth retardation (48). GH2 also increases its expression in patients with preeclampsia (49). Interestingly, preeclamptic-like symptoms have not been described in species other than apes.
CGA encodes the
polypeptide in heterodimeric glycoprotein hormones including chorionic gonadotropin (CG), luteinizing hormone, follicle stimulating hormone, and thyroid stimulating hormone. Although these hormones share an identical
polypeptide, their β polypeptides are unique and confer biological specificity. CG is produced by placental trophoblast cells and stimulates production of ovarian steroids essential for pregnancy maintenance. Additional members of the CGβ gene family, predicted to function in implantation and placental development, have recently been identified as originating in the common ancestor of humans and African great apes (50). Taken together, the identification of HAS genes involved in hormonal pathways active during gestation supports the proposal that the evolution of the endocrine system in recent and more distant periods of human evolutionary history is related to adaptations in pregnancy and development.
Concluding Remarks. This study presents a framework for understanding the genetic underpinnings of humankind. FACs of HAS genes by period of descent confirm the deep genetic roots of human distinctiveness and emphasize the importance of considering how the adaptive changes in earlier evolutionary periods set the stage for adaptive changes in later evolutionary periods. In addition, analyses of highly expressed HAS genes from different developmental stages demonstrate that the prenatal environment selects for different HAS genes than the postnatal environment. Indeed, the prenatal environment appears to have favored the adaptive evolution of genes involved in human-specific developmental and morphological changes during the last 6 million years, whereas the postnatal environment has favored the adaptive evolution of genes involved in key physiological processes, the adaptations having been established during earlier periods of human ancestry. A potential caveat when considering our results is that, particularly among fetal tissues, expression data were obtained from samples that included heterogeneous mixtures of ages, gestational stages, and sex, and may thus represent gene expression patterns from a variety of developmental stages. However, that we were able to detect strong signals of tissue-specific adaptations despite the heterogeneous nature of the sampled tissues' origins speaks to the advantage of focusing on functional analyses of only highly expressed HAS genes.
Many clues regarding additional important components in the development of human phenotypic characters are hinted at by our results. For example, the identification of glycoprotein and glycosylation sites among the annotations in the topmost FACs in both the total HAS and human terminal lists suggests a genetic basis for changes in carbohydrate biochemistry and is consistent with the well documented human- and other species-specific differences in glycans and glycan-binding proteins (51, 52). Furthermore, in our tissue-based lists of positively selected genes, the annotations glycoprotein, glycosaminoglycan binding, carbohydrate binding, and glycosylation site appear in the FACs of 10 of the 11 studied tissues.
An additional example comes from annotations comprising genes of the protocadherin (PCDH) gene family. The identification of two PCDH genes in the most enriched cluster obtained from highly expressed HAS genes in fetal brain tissue is noteworthy in light of recent reports that there are primate-specific cis-antisense transcripts for PCDH genes that are expressed in fetal and adult human brain (53). Together, these findings point to a key biological pathway that has undergone both protein coding and regulatory evolution in human ancestry. Our study thus offers insight into the links between genotype and phenotype and underscores the importance of protein-encoding loci in the process of phenotypic change.
| Materials and Methods |
|---|
|
|
|---|
Identification of Highly Expressed Genes. Genes expressed at three times or greater than the median of the 79 analyzed tissues were obtained from Human GNF1H, gcRMA gene expression data indexed by SymAtlas (11) for: placenta, adult uterus (uterus or uterus corpus), whole adult brain, fetal brain, adult liver, fetal liver, adult thyroid, fetal thyroid, adult lung, fetal lung, and adult heart. Each tissue-based list was intersected with the HAS list to identify the set of RefSeq IDs that was both highly expressed and positively selected in each of the 11 tissues.
Identification of Functional Annotation Clusters.
To identify major biological themes underlying this large assortment of HAS genes, functional analyses were conducted by using DAVID (Database for Annotation, Visualization and Integrated Discovery) (12). DAVID is a publicly available, National Institute of Allergy and Infectious Diseases-administered bioinformatics resource that provides a comprehensive set of functional annotation tools designed to assist investigators in understanding the biological meaning behind large sets of genes. Several DAVID tools identify enriched or overrepresented biological annotations within a given set of genes, usually determined in relation to the proportion of similar annotations for an entire genome. For the purpose of this study, results were obtained with the FAC tool, which clusters similar annotations based on the cooccurrence of particular gene sets. The tool also calculates an associated enrichment score for each cluster based on the geometric mean of the P values determined for each of its component annotations. The
similarity threshold was set to 0.7 to identify annotation sets that shared strong agreement among their underlying genes. Other options were set to their default values.
FAC results of the HAS list, and the subset of HAS genes that were highly expressed in each tissue, were obtained in April and May 2007. All text, tables, and figures report results in terms of DAVID genes. P values from DAVID analyses referred to in the text are reported in terms of the modified Fisher-exact P value implemented by the DAVID tool (12). Cluster titles indicated in Tables 1 and 2 were obtained by selecting the overrepresented annotation that conveyed the broadest biological meaning from among the top three overrepresented annotations found in the cluster.
| ACKNOWLEDGMENTS. |
|---|
|
|
|---|
| Footnotes |
|---|
To whom correspondence may be addressed. E-mail: mgoodwayne{at}aol.com or dwildman{at}med.wayne.eduFreely available online through the PNAS open access option.
Author contributions: M.U., M.G., and D.E.W. designed research; M.U., M.I., J.C.O., and D.E.W. performed research; G.L. contributed new reagents/analytic tools; M.U., M.G., O.E., R.R., J.C.O., C.C.S., L.I.G., and D.E.W. analyzed data; and M.U., M.G., O.E., C.C.S., L.I.G., and D.E.W. wrote the paper.
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/cgi/content/full/0712400105/DC1.
© 2008 by The National Academy of Sciences of the USA
| References |
|---|
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||