New Research In
Physical Sciences
Social Sciences
Featured Portals
Articles by Topic
Biological Sciences
Featured Portals
Articles by Topic
- Agricultural Sciences
- Anthropology
- Applied Biological Sciences
- Biochemistry
- Biophysics and Computational Biology
- Cell Biology
- Developmental Biology
- Ecology
- Environmental Sciences
- Evolution
- Genetics
- Immunology and Inflammation
- Medical Sciences
- Microbiology
- Neuroscience
- Pharmacology
- Physiology
- Plant Biology
- Population Biology
- Psychological and Cognitive Sciences
- Sustainability Science
- Systems Biology
Ancient X chromosomes reveal contrasting sex bias in Neolithic and Bronze Age Eurasian migrations
Edited by Wolfgang Haak, Max Planck Institute for the Science of Human History, Jena, Germany, and accepted by Editorial Board Member Richard G. Klein January 12, 2017 (received for review October 2, 2016)
This article has a reply. Please see:

Significance
Studies of differing female and male demographic histories on the basis of ancient genomes can provide insight into the social structures and cultural interactions during major events in human prehistory. We consider the sex-specific demography of two of the largest migrations in recent European prehistory. Using genome-wide ancient genetic data from multiple Eurasian populations spanning the last 10,000 years, we find no evidence of sex-biased migrations from Anatolia, despite the shift to patrilocality associated with the spread of farming. In contrast, we infer a massive male-biased migration from the steppe during the late Neolithic and Bronze Age. The contrasting patterns of sex-specific migration during these two migrations suggest that different sociocultural processes drove the two events.
Abstract
Dramatic events in human prehistory, such as the spread of agriculture to Europe from Anatolia and the late Neolithic/Bronze Age migration from the Pontic-Caspian Steppe, can be investigated using patterns of genetic variation among the people who lived in those times. In particular, studies of differing female and male demographic histories on the basis of ancient genomes can provide information about complexities of social structures and cultural interactions in prehistoric populations. We use a mechanistic admixture model to compare the sex-specifically–inherited X chromosome with the autosomes in 20 early Neolithic and 16 late Neolithic/Bronze Age human remains. Contrary to previous hypotheses suggested by the patrilocality of many agricultural populations, we find no evidence of sex-biased admixture during the migration that spread farming across Europe during the early Neolithic. For later migrations from the Pontic Steppe during the late Neolithic/Bronze Age, however, we estimate a dramatic male bias, with approximately five to 14 migrating males for every migrating female. We find evidence of ongoing, primarily male, migration from the steppe to central Europe over a period of multiple generations, with a level of sex bias that excludes a pulse migration during a single generation. The contrasting patterns of sex-specific migration during these two migrations suggest a view of differing cultural histories in which the Neolithic transition was driven by mass migration of both males and females in roughly equal numbers, perhaps whole families, whereas the later Bronze Age migration and cultural shift were instead driven by male migration, potentially connected to new technology and conquest.
Genetic data suggest that modern European ancestry represents a mosaic of ancestral contributions from multiple waves of prehistoric migration events. Recent studies of genomic variation in prehistoric human remains have demonstrated that two mass migration events are particularly important to understanding European prehistory: the Neolithic spread of agriculture from Anatolia starting ∼9,000 y ago and migration from the Pontic-Caspian Steppe ∼5,000 y ago (1⇓⇓⇓⇓⇓–7). These migrations are coincident with large social, cultural, and linguistic changes, and each has been inferred to have replaced more than half of the contemporaneous gene pool of resident central Europeans.
During such events, males and females often experience different demographic histories owing to cultural factors, such as norms regarding inheritance and the residence locations of families in relation to parental residence, social hierarchy, sex-biased admixture, and inbreeding avoidance (8⇓⇓⇓–12). Empirical evidence suggests that sex-specific differences in migration and admixture have shaped patterns of human genomic variation worldwide, with notable examples occurring in Africa, Austronesia, Central Asia, and the Americas (13⇓⇓–16). These sex-specific behaviors leave signatures in the patterns of variation in genetic material that is differentially inherited between males and females in a population. Therefore, contrasting patterns of genetic variation for differentially inherited genetic material can be informative about past sociocultural and demographic events (8⇓⇓⇓–12, 17).
Analyses of the maternally inherited mitochondrial DNA (mtDNA) and the paternally inherited Y chromosome have lent differential support to the hypothesis that the Neolithic spread of agriculture from Anatolia occurred through a large population migration rather than a spread of technology (18⇓⇓⇓–22). In general, studies of Y-chromosomal data more than mtDNA have supported Anatolian migration. This pattern of results has been interpreted as evidence for male-biased migration of the population that introduced farming (18, 20, 21). The hypothesis of male-biased migration of farming populations is consistent with ethnographic studies showing a higher frequency of patrilocality in farming than in hunter-gatherer (HG) populations, because an inheritance model through the paternal lineage would favor the persistence of farming-associated Y chromosomes as the source population would have greater flexibility in female mates. Isotopic studies from Neolithic European archeological sites suggest more female than male migration on a local scale, supporting the shift to patrilocality in the region (10, 23). However, genetic evidence has been mixed; both Near Eastern-related mitochondrial and Y-chromosome haplotypes have been observed in European populations, which could indicate comparable male and female migration during the Neolithic spread of agriculture. For example, Haak et al. (22) find that mitochondrial haplotype N1a, associated with Near Eastern farmers, occurs at about ∼25% frequency in Neolithic central Europeans. Later migrations from the steppe, which were previously not accounted for, may have obscured signal and interpretation (22).
Based on archeological data, as well as ancient and modern Y chromosome data, the later migration from the Pontic-Caspian Steppe has also been hypothesized to be male-biased (5, 24⇓⇓⇓⇓–29). In particular, multiple large-scale studies of modern Y-chromosome data infer a rapid growth of R1a and R1b haplotypes ∼5,000 y ago (27⇓–29). Similarly, Haak et al. (5) provide evidence that R1a and R1b were rare in central Europe before ∼4,500 y ago, but common soon thereafter. They also observe multiple R1b haplotypes in ancient Yamnaya individuals from the steppe. Populations in the Pontic-Caspian Steppe region, such as the Yamnaya or Pit Grave culture, are thought to have strong male-biased hierarchy, as inferred by overrepresentation of male burials, male deities, and kinship terms (26, 30). The region is a putative origin for the domesticated horse in Europe, and the culture is known for its use of horse-driven wagons, a potential male-biased mechanism of dispersal into central Europe (30).
Recent analytical advances in the understanding of admixture on the autosomes and the sex-specifically–inherited X chromosome and technological advances that have generated genome-wide data from many ancient samples now make it possible to consider the contrasting male and female genetic histories of prehistoric Europe. We test the hypotheses that migrations from Anatolia during the Neolithic transition and from the Pontic Steppe during the late Neolithic/Bronze Age period were male biased.
Results
Fig. 1 provides a schematic of the population admixture events that have previously been inferred (1⇓⇓⇓⇓⇓–7). Previous studies have inferred the relationship between the various ancient populations shown in the figure, but they did not consider a population history model. We compare genetic differentiation of the autosomes and the X chromosome between the migrating and admixed populations for each migration event: Anatolian farmers (AF) to early Neolithic central Europeans (CE) and Pontic Steppe pastoralists (SP) to late Neolithic and Bronze Age central Europeans (BA). We compute the statistic Q (31, 32), which is an estimator of the ratio of effective population size of the X chromosome with the ratio of effective population size of the autosomes based on the
Schematic of the admixture history of central European farmers during the Neolithic and Bronze Age. First, a migration from Anatolia occurred during the Neolithic transition, and, second, a late Neolithic/Bronze Age migration occurred from the Pontic-Caspian Steppe to central Europe. In both cases, the migrating population mixed with the contemporaneous local population upon entering central Europe.
Comparisons of
To infer sex-specific admixture rates and compare potential migration models, we estimated ancestry proportions on the X chromosome and autosomes separately, with a model-based clustering algorithm (33), using the ancient genomes as proxies for the ancient source groups in our population model and using supervised clustering (Materials and Methods, Fig. 2, and SI Appendix, Figs. S1 and S2 and Tables S1–S7). For an admixture process with equally many males and females contributing, the ratio of mean X-chromosomal admixture to mean autosomal admixture is expected to be 1. An admixture process with more contributing males leads to a reduction of the migrating population’s ancestry on the X chromosome compared with the autosomes.
Comparisons of estimated X and autosomal ancestry on the basis of model-based supervised clustering. (A) Early/middle Neolithic Europeans (CE). (B) Late Neolithic/Bronze Age Europeans (BA). Individuals are ordered by X-chromosomal ancestry, with corresponding autosomal ancestry for the same individual shown below. Clustering results by individual are presented in SI Appendix, Table S1. (C) Histograms of the ratio of the mean across individuals of X-chromosomal ancestry to the mean across individuals of autosomal ancestry for 100 autosomal resampled estimates using random sets of SNPs equal in size to the set of X-chromosomal SNPs for the corresponding population (Materials and Methods). Colors for all panels correspond to ancestry groups given in Fig. 1.
Neolithic Migration.
For the Neolithic transition, we estimated the ratio of mean AF ancestry on the X chromosome (across individuals) to the mean on the autosomes as
We additionally considered the fraction of individuals in the admixed population with higher X-chromosomal than autosomal ancestry. This measure is indicative of sex bias, with less emphasis on the exact values of the ancestry proportions. Excluding three individuals with 100% ancestry estimated to be from Anatolian-related populations on both the X and autosomes, nine of 17 individuals have higher X than autosomal ancestry (P = 0.500, binomial test).
Notably, the four middle Neolithic individuals (SI Appendix, Table S1) have higher HG ancestry than earlier CE individuals, consistent with a previously described resurgence of HG ancestry during the middle Neolithic (5, 34). The ratio of the mean X to the mean autosomal ancestry for this group of four samples is
We find no statistical support for differences in X and autosomal ancestry; however, we cannot exclude low levels of sex-specific mating between early farmers and hunter-gatherers. Therefore, we evaluated the magnitude of differences in male and female contributions that would be consistent with observed X-to-autosomal ancestry ratios. We determined this range of sex bias values by simulating ancestry under a mechanistic admixture model, including genetic drift and sampling at specified sample sizes (17, 35, 36) (Materials and Methods and Fig. 3A). Even for a small admixed population, the largest bias consistent with the observed X and autosomal ancestries is less than 1.2 males for every female, with a median over 1,000 simulations of 1.07.
Estimated levels of sex bias during the Neolithic transition and Pontic Steppe migration. (A) Neolithic transition. The range of sex bias, measured as the ratio of males to females from a source population, that is consistent with the observed ratio of X and autosomal ancestries (Materials and Methods). Total contributions from the source population, the fraction of admixed individuals with a parent from that source population, are specified based on autosomal ancestry as 0.913 from AF and 0.087 from HG. Lines indicate that the observed ratios of X to autosomal ancestry in our dataset were present in the middle 50% (black) or middle 80% (gray) of 1,000 simulated admixed populations for specified CE population sizes. (B) Pontic Steppe migration. Under a model of constant admixture over time, the fraction of the total contribution of genetic material originating from males for each source population: CE and SP. Contributions are estimated from the migration parameter sets that have the smallest 0.1% Euclidean distance between observed and model-calculated ancestries. (C) Schematic of sex-specific migrations during the early Neolithic and later Neolithic/Bronze Age. Female contributions in are shown in red, and male contributions are shown in blue. Parameters are estimated under a single pulse migration model from Anatolia and under a constant migration model for the Pontic Steppe migration. The total contribution of each population is the average of female and male contributions from that source.
Based on the slightly larger X than autosomal ancestry observed for HG ancestry, under the simulation framework, we estimate a median of 1.91 females for every male from the HG population to the early CE population. The signal of female bias in contributions from HG to CE populations might be caused by a male-biased inheritance structure in the new farming population; that is, it is possible that the migration from Anatolia involved substantial contributions from both men and women, but once in central Europe, a shift to patrilocality might have made absorption of local HG females easier than absorption of HG males. However, the absolute difference between estimated male and female contributions is small (∼0.06). Correspondingly, differences in the numbers of female and male migrants would be small or are potentially a result of sampling.
Considering these analyses together, we find no statistical support for a male-biased migration from Anatolia. Only a small range of possible sex bias is consistent with the data; however, owing to the small total contribution from the HG population, we see female-biased contributions from the HG to CE populations (Fig. 3A).
Pontic-Caspian Steppe Migration.
We next considered female and male migration histories during the late Neolithic/Bronze Age migration from the Pontic-Caspian Steppe (Fig. 1). In contrast to the CE population during the early Neolithic expansion from Anatolia, we find a strikingly lower distribution of SP ancestry on the X chromosome than the autosomes (in accordance with
To interpret the values of sex-specific admixture that can produce the observed ratio of X to autosomal SP ancestry of about 0.6, we considered four models for the admixture process. The first model is a single admixture event, in which an SP population quickly mixes with central European farmers, with no further migration from either population to the admixed BA population. Under this model, however, the level of sex bias is too high to have been produced by a single admixture event; no solution for the female and male migration rates exists within the possible admixture contribution range from 0 to 1 (Materials and Methods). In other words, in a pulse migration and admixture scenario in a single generation, even a male-only migration event is not extreme enough to generate the observed X-to-autosome bias in the data. Ongoing male migration from the steppe over multiple generations is therefore required to explain observed patterns of X and autosomal ancestry.
We therefore considered a model of constant contributions over time from the SP population and early Neolithic farmers (CE). We follow the method of Goldberg and Rosenberg (17), comparing expected X and autosomal ancestry (equations 5, 17, and 18 of ref. 17 and equation 30 of ref. 35) with observed ancestry in our data over a grid of possible parameter values. We present results from the 0.1% of parameter sets closest to observed data using a Euclidean distance between model-based and observed population mean ancestries on the X and autosomes (Materials and Methods). Other cutoffs (0.5%, 1%, and 5%) produced similar trends.
SI Appendix, Fig. S4 plots the range of sex-specific contributions from the SP and CE populations that produce estimates close to the estimates observed in the BA population. Males from the steppe and central European females show substantial ongoing migration, with continuing admixture rates of almost 1/2; that is, almost half of the male parents in each generation of BA individuals are new migrants from the SP population. Females from the steppe and early Neolithic European males, however, are estimated to have contributed negligibly to the BA population. Fig. 3B plots the proportional contribution of males from each source population, with a median of about 94% of SP ancestry in the BA population coming from male SP migrants and all local CE ancestry originating in CE females. This result corresponds to ∼14 male migrants for every female migrant from the steppe contributing to the ancestry of the BA population. Considering the smallest 0.5%, 1% , and 5% of Euclidean distances instead, this ratio is about 8.5, 7.5, and 5.1, respectively, males per female migrating from the steppe. These estimates are similar to estimates from modern Y-chromosome data, suggesting a reduction in the male effective population size by more than fivefold about 5,000 y ago (28).
The signature of X-chromosomal to autosomal ancestry is driven by the last few generations of admixture. Testing other models of time-dependent admixture, with the contributions from one or both of the source populations increasing or decreasing over time, we find that the data fit model-based estimates approximately equally well when the admixture contributions at the last few generations are similar to the admixture contributions estimated from a constant admixture model (Materials and Methods).
The signal of a large male bias holds when analyzing late Neolithic Corded Ware individuals and later Bronze Age Unetice individuals separately, with mean X-to-autosomal ancestry ratios in the two groups of 0.716 and 0.474, respectively. Ancestry and sex bias do differ between the groups, with a larger male bias and lower SP ancestry for the later Unetice, although the trend is not statistically significant (SI Appendix, Fig. S1B). Individuals from Bell Beaker archeological sites, a culture that overlapped with Corded Ware and Unetice but occurred over a wider geographic scale, show levels of X and autosomal ancestry suggestive of overall ancestry contributions and levels of sex bias that are similar to Corded Ware and Unetice, with mean X and autosomal ancestry of 0.28 and 0.56, respectively (SI Appendix, Table S7).
The signal of male-biased contributions from SP to BA over time is consistent with an admixture scenario in which a massive male-biased migration from the steppe initially looks to local European farmer females for wives, and with a paternal mode of inheritance, the BA population disproportionately absorbs females from local “unadmixed” farmers. Admixture from the steppe population continues over time, although mainly men migrate, perhaps expanding using the male-dominated modes of horses and wagons (24, 30).
Discussion
Overall, the model-based ancestry results show remarkable similarity to our original comparisons of relative genetic drift on the X versus autosomes using a measure of genetic differentiation,
Owing to the large ancestry contribution and lack of sex-biased admixture, the massive cultural change that accompanied the shift to agriculture is consistent with a large-scale migration of an entire subset of a population, perhaps families, and a slower rate of spread. Minimal differences in sex-specific migration and the high overall AF ancestry in CE individuals support this scenario. Given the probable patrilocality of the migrating AF population (10, 23), this result suggests that the residence and descent rules were not determining factors in long-distance, sex-specific migration. The lack of sex bias is not, in fact, inconsistent with previous indications of sex bias during the Neolithic based on mtDNA diversity. Earlier work focused on measures of diversity rather than ancestry, which estimate the effective population size rather than admixture. Therefore, earlier single-locus studies are likely seeing the signal of patrilocality rather than the migration process from Anatolia (20).
In contrast, our results, combined with the archeological evidence, suggest that the rapid migration from the Pontic Steppe was strongly male-biased, potentially via newly domesticated horses in multiple waves (24, 25, 30). Such differences in sex-specific migration patterns are suggestive of fundamentally different types of interactions between invading and local populations during the two migration events. Our results demonstrate the power of joint X chromosome and autosome analyses for inferring important processes in human prehistory.
Materials and Methods
Genetic Samples and Populations.
We analyzed published (6) ancient samples that have been genotyped for a set of 1,240,000 SNPs, including 49,711 on the X chromosome. Under notation from a study by Mathieson et al. (6), for the early Neolithic migration from Anatolia, we considered individuals from the CEM population label for “selection label 2”; for the late Neolithic/Bronze Age migrations from the Pontic Steppe, we considered individuals with the “archeological culture” label “central_LNBA.” These subsets of the data geographically restrict analyses to central Europeans, decreasing potential variation from spatial variation within Europe. We further classify individuals within each group using cultural, temporal, and geographic information; archeological labels follow the labels used by Mathieson et al. (6) and Lazaridis et al. (37) (SI Appendix, Table S1).
Additional genomic filtering and analyses were done in PLINK v1.90 (38). We removed the pseudoautosomal region of the X chromosome, and removed one SNP from each pair with a correlation greater than 0.4 using the command “indep-pairwise 200 25 0.4” following previous ancient DNA studies (4, 6). We considered admixed individuals with at least 1,000 SNPs on the X chromosome. SI Appendix, Tables S1 and S2 show the admixed individuals and their population classifications. More information on the samples is available in the study by Mathieson et al. (6).
Sex-Biased Genetic Differentiation.
As a first line of evidence for the sex-specific relationships between the two sets of migrating and admixed populations, AF-CE and SP-BA, we compared genetic differentiation on the X versus autosomes,
Values of Q are suggestive, because deviations from 3/4 can also be produced by population histories with population size or migration changes even in the absence of sex bias (40, 41). Additionally, Q lacks a clear framework for quantitative interpretation. Therefore, we used a mechanistic admixture model comparing ancestry on the X chromosome and autosomes to infer sex-specific admixture rates and compare potential migration models.
Estimating Ancestry Components.
Evidence of admixture and migration events in the population history of central Europeans, as well as current best proxy populations for their sources, has been extensively presented in other studies (1⇓⇓⇓⇓–6). Therefore, we assumed these migration events occurred, and used the best samples/populations currently available as representatives of relatives of the admixed populations. A schematic of the migration events is shown in Fig. 1, with estimated ancestry components shown in Fig. 2. Results by individual are presented in SI Appendix, Tables S2, S4, and S5.
We estimated ancestry components of the two admixed populations, CE and BA. For the early Neolithic transition to agriculture, we assumed the number of ancestry components, K, was 2 with AF and HG source populations. For the later migration from the steppe, we assumed three ancestry components (K = 3) with contributions from the SP represented by the Yamnaya Samara population, as well as contributions from AF and HG populations.
Multiple methods exist to infer individual ancestry proportions. We considered two of the most common clustering methods: Admixture (33) v1.3, which is a maximum likelihood method, and Structure (42) v2.3, which is a Bayesian algorithm. Both methods rely on a similar underlying model, with different estimation techniques. In each program, we tested supervised and unsupervised clustering and compared individual ancestry estimates and population-level summary statistic estimates between methods. Results are summarized in SI Appendix, Tables S4 and S5, for the autosomes and X chromosome, respectively. We describe ancestry estimation methods and compare results from both supervised and unsupervised clustering in Admixture and Structure in SI Appendix. Based on these analyses, we use ancestry estimated in Admixture from supervised clustering for main text analyses.
Assessing Ancestry Estimation.
To compare the X chromosome with the autosomes, we estimated autosomal ancestry on 100 sets of SNPs resampled from the autosomes. For each of the two migration events, we resampled autosomal ancestry to match the number of SNPs used in X-chromosomal analyses. For the Neolithic transition, the number of SNPs was 3,763. For the steppe migration, the number of SNPs was 4,605. We also down-sampled X-chromosomal SNPs for BA individuals 10 times to 3,763 to compare estimates of ancestry between the two migration events. Ancestry estimates based on the down-sampled data were within 5% of the original full data.
To test if the ancestry estimates are stable over the choice of individuals in the source populations of Mathieson et al. (6), we tested multiple subsets of source population individuals (SI Appendix, Table S3): (i) all individuals from the study of Mathieson et al. (6) for the respective categories, using the original population descriptions of Anatolians, Western + Scandinavian HG populations, and Yamnaya Samara, and (ii) the subset of individuals whose genetic population assignment matches their known cultural association in an average of 10 independent unsupervised admixture runs for both the X and the autosomes.
For the first event, the Neolithic transition, the estimated ancestry components are roughly constant with varying choice of individuals. For the migration from the steppe, however, we see a range of values for estimated ancestries over different seeds, suggesting variation in the likelihood surface. The qualitative results are consistent through all analyses. The population means for the X chromosome and autosomes range from 0.27 to 0.44 and from 0.54 to 0.73, respectively. The ratio of X to autosomal ancestry for a given seed varies between 0.38 and 0.61. Although the magnitude of ancestry estimates varies, the signal of substantial sex bias based on the ratio of X to autosomal ancestry is seen for all scenarios.
For all analyses, we used ancestry estimated from the mean per individual of X-chromosomal estimates over the 10 seeds. Autosomal ancestry is estimated as the median of 100 estimates from resampled SNP sets, which is in the lower range of autosomal estimates. For the steppe migration, this estimate leads to a ratio of mean X-chromosomal to mean autosomal ancestry of 0.59, which is on the conservative (closer to 1) end of the range of estimates.
We also considered the impact of source population on inference of ancestry by estimating ancestry proportions of alternate Near Eastern farming reference individuals as reported by Lazaridis et al. (37) (SI Appendix, Table S6). We include Natufian and Neolithic Levantine individuals, although these groups are unlikely the source for the Neolithic migration into Europe (7, 43). For admixed CE populations, the mean X ancestry is 0.97 and the mean autosomal ancestry is 0.91, leading to an X-to-autosomal ancestry ratio of 1.06. This ratio of X to autosomal ancestry is slightly elevated compared with the original 0.989, but it leads to the same conclusion of minimal to no sex bias from the Near East into central Europe. Notably, the elevated ratio is driven by higher X-chromosomal ancestry (0.97 vs. original 0.90), which may be an effect of the reduction in the number of X-chromosome SNPs to 1,988.
Ancestry Over Time.
Within each admixed population, samples span thousands of years; therefore, we considered temporal heterogeneity in ancestry by fitting linear models of ancestry over time in MATLAB (fitlm) (SI Appendix, Fig. S1). We fit X and autosomal ancestry separately for each population. Sampling age is defined as the midpoint of the two-sigma calibrated date range given by Mathieson et al. (6). For individual ancestry y and sampling age x, for the Neolithic migration, we have
Statistical Significance of X and Autosomal Differences.
We tested for statistical significance of the difference between the population means of X and autosomal ancestry within the admixed Neolithic population using the Wilcoxon signed-rank test. We did 100 comparisons of the distribution of ancestry on the X chromosome within the population with the distribution of autosomal ancestry estimated using each resample of M SNPs, where M is the number of X-chromosomal SNPs for the associated population.
The Wilcoxon signed-rank test is a nonparametric paired difference test. For a statistically significant difference in the within-population distribution of X and autosomal ancestry, one would expect an excess of small P values. Rather, for the Neolithic transition, comparing X and autosomal AF-related ancestry, the P value distribution over the 100 calculations is approximately uniform (SI Appendix, Fig. S3). Similarly, comparing X and autosomal ancestry when autosomal ancestry is estimated from all SNPs together (M = 331,515), the Wilcoxon signed-rank test is not significant (P = 0.493). In contrast, for the later migration from the Pontic Steppe, P = 0.002 comparing the distribution of ancestry on the X chromosome with the distribution of ancestry estimated for the autosomes with all SNPs together (M = 375,243), and we see an excess of small P values for the comparisons with 100 resampled autosomal estimates (SI Appendix, Fig. S3).
Simulations to Estimate Range of Sex Bias During Neolithic Transition.
For a constant admixed population of size
At
Admixture Models for Migration from the Steppe.
We used recursive expressions for X and autosomal ancestry as a function of sex-specific admixture rates to interpret observed ancestry (17, 36). We considered four general models of admixture over time: (i) single admixture event with no further migration, (ii) constant migration over time, (iii) increasing migration from SP over time, and (iv) decreasing migration from SP over time.
First, we considered a single pulse admixture event, analogous to the admixture event used for the early Neolithic migration from Anatolia. For mean autosomal ancestry within the admixed population of 0.618 and mean X-chromosomal ancestry of 0.366, under the model of a single admixture event with no further migration, we used equations 22 and 23 of ref. 17 to write X and autosomal ancestries as a function of sex-specific contribution parameters. We have
We next considered constant admixture over time. Assuming
Because the number of males in both admixed populations is small, mean sample ancestry estimates may not be representative of the population mean. Therefore, we followed equation 5 of ref. 17, calculating a pooled female and male Euclidean distance between model-based ancestry calculations and observed ancestry estimates. Fig. 2 presents results based on the smallest 0.1% of Euclidean distances on the grid, with estimated sex bias values for other cutoffs in the text.
These scenarios, however, are few of many that are possible, and further work is needed to describe the spatiotemporal variation in admixture during these migrations. Although spatial and temporal resolution will refine admixture models, the signals of sex-specific admixture during the prehistory of central Europe will persist. Similarly, other processes may also differentially affect the X chromosome and autosomes, including recombination, mutation, and selection, but these forces are unlikely to have a large impact on the chromosome-wide, ancestry-based summary statistics we base analyses on over the short time scales considered.
Variance in Ancestry.
Our analyses focus on comparisons of mean X-chromosomal and autosomal ancestry. However, the variance can also be informative about the admixture history (35, 36). The variance in ancestry with the admixed Neolithic individuals is quite low (0.013 for the X chromosome and 0.005 for the autosomes), with a higher variance in the admixed BA population (0.102 for the X chromosome and 0.039 for the autosomes). Larger X than autosomal variance is expected owing to the difference in the number of chromosomes inherited per generation. The higher variance in ancestry across individuals associated with the Pontic Steppe migration is consistent with recent or ongoing migration within the past few generations, particularly because sex bias would decrease the variance (36). Additionally, with recent or ongoing male-biased migration, one would expect lower steppe ancestry on X chromosomes in admixed males than in admixed females, because females receive an X chromosome from their fathers. The mean X-chromosomal ancestry of BA males is roughly half the mean X-chromosomal ancestry of BA females, although the difference is not statistically significant with only four individuals. Although consistent with inferences from mean ancestry components, strong conclusions cannot be drawn from the variance or differences in male and female ancestry, given the current sample sizes.
Acknowledgments
We thank Jonathan Kang for bioinformatics support with the HapMap data. We acknowledge support from National Science Foundation (NSF) Graduate Research and Achievement Rewards for College Scientists fellowships (to A.G.), a Wenner–Gren Foundation fellowship (to T.G.), NSF Grant BCS 1515127 (to N.A.R.), European Research Council Grant 311413 and Göran Gustafsson Foundation Prize (to M.J.), as well as an NSF-Swedish Research Council Graduate Research Opportunities Worldwide award (to A.G. to visit Uppsala University).
Footnotes
- ↵1To whom correspondence may be addressed. Email: mattias.jakobsson{at}ebc.uu.se or agoldb{at}stanford.edu.
Author contributions: A.G. and M.J. conceived the project; M.J. headed the project; A.G., T.G., N.A.R., and M.J. designed research; A.G., T.G., and M.J. performed research; A.G., T.G., and M.J. analyzed data; and A.G., T.G., N.A.R., and M.J. wrote the paper.
The authors declare no conflict of interest.
This article is a PNAS Direct Submission. W.H. is a Guest Editor invited by the Editorial Board.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1616392114/-/DCSupplemental.
References
- ↵.
- Ammerman AJ,
- Cavalli-Sforza LL
- ↵.
- Skoglund P, et al.
- ↵
- ↵
- ↵
- ↵
- ↵.
- Günther T,
- Jakobsson M
- ↵
- ↵
- ↵.
- Haak W, et al.
- ↵
- ↵
- ↵.
- Tishkoff SA, et al.
- ↵
- ↵.
- Cox MP,
- Karafet TM,
- Lansing JS,
- Sudoyo H,
- Hammer MF
- ↵
- ↵.
- Goldberg A,
- Rosenberg NA
- ↵.
- Bentley RA,
- Layton RH,
- Tehrani J
- ↵
- ↵.
- Rasteiro R,
- Chikhi L
- ↵.
- Szécsényi-Nagy A, et al.
- ↵.
- Haak W, et al.
- ↵.
- Bentley RA, et al.
- ↵.
- Gimbutas M
- ↵
- ↵.
- Kristiansen K,
- Larsson TB
- ↵.
- Batini C, et al.
- ↵.
- Karmin M, et al.
- ↵
- ↵.
- Anthony DW
- ↵
- ↵.
- Waldman YY, et al.
- ↵.
- Alexander DH,
- Novembre J,
- Lange K
- ↵.
- Bollongino R, et al.
- ↵.
- Verdu P,
- Rosenberg NA
- ↵.
- Goldberg A,
- Verdu P,
- Rosenberg NA
- ↵.
- Lazaridis I, et al.
- ↵
- ↵
- ↵
- ↵
- ↵.
- Pritchard JK,
- Stephens M,
- Donnelly P
- ↵.
- Kılınç GM, et al.
Citation Manager Formats
More Articles of This Classification
Biological Sciences
Genetics
Social Sciences
Related Content
Cited by...
- Integrative studies of cultural evolution: crossing disciplinary boundaries to produce new insights
- Four millennia of Iberian biomolecular prehistory illustrate the impact of prehistoric migrations at the far end of Eurasia
- Sackler Colloquium on Extension of Biology Through Culture: Cultural evolutionary theory: How culture evolves and why it matters
- A composite window into human history
- Reply to Lazaridis and Reich: Robust model-based inference of male-biased admixture during Bronze Age migration from the Pontic-Caspian Steppe
- Failure to replicate a genetic signal for sex bias in the steppe migration into central Europe

















