Skip to main content
  • Submit
  • About
    • Editorial Board
    • PNAS Staff
    • FAQ
    • Rights and Permissions
    • Site Map
  • Contact
  • Journal Club
  • Subscribe
    • Subscription Rates
    • Subscriptions FAQ
    • Open Access
    • Recommend PNAS to Your Librarian
  • Log in
  • My Cart

Main menu

  • Home
  • Articles
    • Current
    • Latest Articles
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • Archive
  • Front Matter
  • News
    • For the Press
    • Highlights from Latest Articles
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Purpose and Scope
    • Editorial and Journal Policies
    • Submission Procedures
    • For Reviewers
    • Author FAQ
  • Submit
  • About
    • Editorial Board
    • PNAS Staff
    • FAQ
    • Rights and Permissions
    • Site Map
  • Contact
  • Journal Club
  • Subscribe
    • Subscription Rates
    • Subscriptions FAQ
    • Open Access
    • Recommend PNAS to Your Librarian

User menu

  • Log in
  • My Cart

Search

  • Advanced search
Home
Home

Advanced Search

  • Home
  • Articles
    • Current
    • Latest Articles
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • Archive
  • Front Matter
  • News
    • For the Press
    • Highlights from Latest Articles
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Purpose and Scope
    • Editorial and Journal Policies
    • Submission Procedures
    • For Reviewers
    • Author FAQ

New Research In

Physical Sciences

Featured Portals

  • Physics
  • Chemistry
  • Sustainability Science

Articles by Topic

  • Applied Mathematics
  • Applied Physical Sciences
  • Astronomy
  • Computer Sciences
  • Earth, Atmospheric, and Planetary Sciences
  • Engineering
  • Environmental Sciences
  • Mathematics
  • Statistics

Social Sciences

Featured Portals

  • Anthropology
  • Sustainability Science

Articles by Topic

  • Economic Sciences
  • Environmental Sciences
  • Political Sciences
  • Psychological and Cognitive Sciences
  • Social Sciences

Biological Sciences

Featured Portals

  • Sustainability Science

Articles by Topic

  • Agricultural Sciences
  • Anthropology
  • Applied Biological Sciences
  • Biochemistry
  • Biophysics and Computational Biology
  • Cell Biology
  • Developmental Biology
  • Ecology
  • Environmental Sciences
  • Evolution
  • Genetics
  • Immunology and Inflammation
  • Medical Sciences
  • Microbiology
  • Neuroscience
  • Pharmacology
  • Physiology
  • Plant Biology
  • Population Biology
  • Psychological and Cognitive Sciences
  • Sustainability Science
  • Systems Biology

Y-chromosomal evidence of a pastoralist migration through Tanzania to southern Africa

Brenna M. Henn, Christopher Gignoux, Alice A. Lin, Peter J. Oefner, Peidong Shen, Rosaria Scozzari, Fulvio Cruciani, Sarah A. Tishkoff, Joanna L. Mountain, and Peter A. Underhill
PNAS August 5, 2008 105 (31) 10693-10698; https://doi.org/10.1073/pnas.0801184105
Brenna M. Henn
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Christopher Gignoux
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alice A. Lin
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peter J. Oefner
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peidong Shen
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rosaria Scozzari
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fulvio Cruciani
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sarah A. Tishkoff
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joanna L. Mountain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peter A. Underhill
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  1. Edited by Dolores R. Piperno, Smithsonian Institution, Washington, DC and Balboa, Panama, and approved May 23, 2008 (received for review February 5, 2008)

View Abstract
  • Article
  • Figures & SI
  • Info & Metrics
  • PDF
Loading

Abstract

Although geneticists have extensively debated the mode by which agriculture diffused from the Near East to Europe, they have not directly examined similar agropastoral diffusions in Africa. It is unclear, for example, whether early instances of sheep, cows, pottery, and other traits of the pastoralist package were transmitted to southern Africa by demic or cultural diffusion. Here, we report a newly discovered Y-chromosome-specific polymorphism that defines haplogroup E3b1f-M293. This polymorphism reveals the monophyletic relationship of the majority of haplotypes of a previously paraphyletic clade, E3b1-M35*, that is widespread in Africa and southern Europe. To elucidate the history of the E3b1f haplogroup, we analyzed this haplogroup in 13 populations from southern and eastern Africa. The geographic distribution of the E3b1f haplogroup, in association with the microsatellite diversity estimates for populations, is consistent with an expansion through Tanzania to southern-central Africa. The data suggest this dispersal was independent of the migration of Bantu-speaking peoples along a similar route. Instead, the phylogeography and microsatellite diversity of the E3b1f lineage correlate with the arrival of the pastoralist economy in southern Africa. Our Y-chromosomal evidence supports a demic diffusion model of pastoralism from eastern to southern Africa ≈2,000 years ago.

  • pastoralism
  • phylogeography
  • population genetics
  • Y-chromosome
  • Khoisan

A major focus of early research on the autosomal and mitochondrial population genetics of Europeans was to assess the genetic impact of the Neolithic agricultural expansion throughout Europe, ≈8,000 years ago (1–4). Contrasting models of demic and cultural diffusion for the spread of agriculture from the Near East to Europe predict different genetic signatures. Namely, demic diffusion predicts the replacement of old indigenous European lineages with Near Eastern ones or substantial admixture with Near Eastern lineages across the continent. The cultural diffusion model predicts low levels of gene flow from the Near East to Europe, allowing the majority of European lineages to reflect earlier Upper Paleolithic migration patterns. Although detailed genetic investigations of the European Neolithic are ongoing (5, 6), similar investigations of agropastoral expansions within Africa have been limited by patchy genetic sampling and a sparser archaeological record. The migration of agricultural Bantu-speaking cultures from western Africa appears to have had a major impact on patterns of genetic variation in Africa, but specific models have rarely been tested. In this article, we have investigated Y-chromosomal data from pastoral, agricultural, and hunter-gatherer populations in both eastern and southern Africa, to explore the possibility of demic diffusion of agropastoral practices during the Holocene.

Previously, Arredi et al. (7) suggested that a subset of E3b1-related Y chromosomes were associated with a demic diffusion of the Neolithic culture within northern and eastern Africa. Although representative subclades of E3b1-M35 are broadly distributed across Africa and Europe, reaching informative frequencies in many instances, a paraphyletic subset of chromosomes, E3b1-M35*, has a more restricted distribution. Frequencies of E3b1-M35* >25% are primarily found in samples from eastern African populations, including: the Hema in northeastern Democratic Republic of Congo; the Maasai in Kenya; and the Wairak, Datog, Sandawe, and Burunge of Tanzania (8–11). Paragroup E3b1-M35* is also found at moderate frequencies in Ethiopia and Somalia (8). Within southern Africa, the Khwe (Kxoe) and !Kung of Namibia and Angola, respectively, show high to moderate frequencies of M35*. Additionally, E3b1-M35* Y-chromosomes are scattered throughout Europe and northern Africa at very low frequencies (8, 12).

The relatively high frequency of M35* in both eastern (e.g., Sandawe, Datog, Maasai) and southern (e.g., Kxoe) African populations is intriguing given previous studies showing substantial isolation between the two regions (10, 11, 13, 14). Jointly, these studies suggest two temporally distinct migration events between the regions. The earlier event has been dated at >30,000 years ago and may have involved people with Khoisan linguistic affiliations moving into southern Africa, as evidenced by the Y-chromosome A and B clades [supporting information (SI) Fig. S1] (10, 13, 14) and mtDNA L0d clade (10). The second event involves the migration of Bantu-speaking agropastoralists from eastern Africa into southern Africa ≈1,500 years ago (15).

Here, we report a Y-chromosome-specific polymorphism that defines Y-chromosome haplogroup E3b1f-M293, accounting for many of the previous E3b1-M35*s. We generated Y-chromosome M293 SNP and microsatellite data for 13 populations of eastern and southern Africa to test the possibility of a pastoralist migration between these regions. Given its estimated age (below); the M293 locus is informative regarding migrations and demographic events occurring during the Holocene in Africa.

Results

To date the time of origin of E3b1f-M293, we generated and analyzed microsatellite data for 10 associated Y-chromosome loci. Because the DYS389I locus did not appear to follow the single-step mutation model, we initially excluded this locus in our temporal estimates of haplogroup E3b1f based upon network analysis (data not shown) and the ρ estimator (16, 17). Analysis of the nine locus Y-chromosome short tandem repeat (STR) dataset yielded an estimate of 9,300–11,100 years for the age of the M293 mutation, assuming the mutation rate (μ) calculated in Zhivotovsky et al. (18). With DYS389I included, we also dated the two components of the network separately [i.e., just DYS389I (repeat size 10) representatives vs. DYS389I (repeat size 13 or higher) individuals] (Fig. 1). The inclusion of the DYS389I locus but exclusion of DYS389I-10+ haplotypes yielded a date of 12,200–10,100 years ago (ya) (ρ = 3.024, min ρ = 2.512). The M293 (DYS389I-10) associated Y-STR network yielded a date of 9,400 ya (ρ = 2.081, SE = 1,070 y, μ adjusted for nine loci given the invariant status of DYS389I). The difference in the ages of the two network hemispheres indicates that the DYS389I-10 allele is most likely derived.

Fig. 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 1.

Median-joining network of 10-locus Y-STR haplotypes for E3b1f-M293 positive individuals. Network was generated using Network 4.1.1.1 (Fluxus Engineering). Black arrows indicate roots used for ρ estimates; gray arrow indicates the node giving a minimum ρ estimate for M293*. Networks were processed first by the reduced-median method, and then by the median-joining method (16, 42). DYSA7.2 was removed from the analyses due to an apparently high level of homoplasy. Networks were generated initially without weighting any of the STR loci; then the three most homoplastic mutations were down-weighted (by half) for the final network. Such weighting did not significantly change the structure of the network (data not shown). ε was set to zero. ρ estimates were converted to years using the average Y-STR mutation rate estimated in ref. 18.

While the spatial frequency distribution of paragroup E3b1-M35*(former, uniformed by M293) is pan-African, these haplotypes are rarely identified in western and central African populations (8, 11, 12, 19) or in North Africa (7) (Fig. 2 a). Rather, its regionalization is greatest in both eastern and southern Africa. The geographic distribution of the E3b1f-M293 clade, while more confined, has corresponding frequency maxima in Tanzania and southern Africa (Fig. 2 b).

Fig. 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 2.

Contour maps of the frequencies of E3b1-M35*(former) (A) and E3b1f-M293 (B) in Africa. Populations without M293 in B are based on unpublished data. The geographic distributions of M35*(former) and M293 frequencies across Africa were created using the Kringing method in Surfer 8 (Golden Software). Locations of populations in Table 1 are indicated by cross hatches. Because M35* is a paraphyletic haplogroup, the sharing of M35* does not indicate a close genetic relationship. Areas of high frequency are similar in the two maps as 90% of the M35*(former) samples are M293+. M293 is only found in sub-Saharan Africa, indicating a separate phylogenetic history for M35*(former) samples further north.

The haplogroup E3b1f distribution spans different language phyla and subsistence economies. When the Wafiome are excluded because of low sample size (n = 2), the Tanzanian Datog population has both the highest haplogroup E3b1f-M293 frequency (43%) and Y-STR diversity (Table 1) of any group surveyed. The Datog are pastoralists who speak a Southern Nilotic language. The Burunge are Afro-Asiatic agropastoralists, who display the M293 clade at a frequency of 28%. M293 associated Y-STR diversity is also quite high in the Burunge, second only to the Datog (Table 1). Network analysis indicates that the Burunge have several haplotypes that fall on fairly long branches distinct from other populations, but they also share Y-STR haplotypes with individuals representing the Hadza, Datog, and Sandawe (Fig. 1). The frequency of M293 chromosomes in the Sandawe, a Tanzanian click-speaking (Khoisan) population, is 24%. Sandawe haplotypes are broadly scattered across the M293 network with the majority concentrated within the M293(DYS389I-10) clade (Fig. 1). The sample from the Hadza, a small click-speaking population that until recently subsisted via hunting and gathering, has considerably less M293-specific Y-STR diversity than do the other Tanzanian samples (Table 1). In two cases, Y-chromosomes of Hadza and Datog individuals share identical haplotypes. This pattern reflects the current geographic proximity of these two populations in northern Tanzania. The Sandawe, Hadza, and Datog populations all have frequencies of the residual E3b1-M35* at ≈5%.

View this table:
  • View inline
  • View popup
Table 1.

Population descriptions and corresponding M35* and M293 frequencies

The presence of M293 at low to moderate frequencies in Bantu-speaking populations of eastern and southern Africa (Table 1) likely reflects recent admixture with local populations after the Bantu-speakers migrated out of western Africa (8). Paragroup E3b1-M35*(former) is also rare in northern African populations, with a maximum frequency of 9% for a sample from an Egyptian population (7, 11, 12, 20). No E3b1-M35*(former) individual has yet been reported in central Africa (8, 11, 19). We focus therefore on populations of eastern and southern Africa as potential sources of M293 diversification.

The M293 mutation is common in the Datog (43%), Kxoe (31%), and Burunge (28%) samples (in descending order of frequency). The high level of Y-STR diversity on the M293 background in the Datog population, coupled with highest frequency, suggests that the Datog have carried M293 longer than any other population. Datog haplotypes occur within both hemispheres of the E3b1f-M293 network (Fig. 1). The Burunge also carry a significant percentage of M293 haplotypes, although the frequency and genetic diversity is somewhat reduced compared with that of the Datog. The Kxoe (known in earlier studies as the Khwe), from southern Africa, have only one Y-STR haplotype on the M293* ancestral background which is shared by four individuals. They are thus unlikely to be the source population for M293. Two of the 11 Bantu-speaking Kenyan males in the Human Genome Diversity Panel (HGDP) panel reported as E-M35*(former) (8) are derived at M293. Without information about M293 in the Maasai, Hema, and other populations in Kenya, Sudan, and Ethiopia, we cannot pinpoint the precise geographic source of M293 with greater confidence. However, the available evidence points to present-day Tanzania as an early and important geographic locus of M293 evolution.

Population genetic studies of migration from eastern Africa to southern Africa have confirmed the impact of Bantu-speaking populations arriving in southern Africa ≈1,500 ya (21–24). Before this contact, separation of eastern and southern African populations is likely to have occurred at least 15 ka, and may have occurred as early as 40–50 ka, with little subsequent gene flow (10, 13, 14, 25). The apparent molecular age and distribution of the M293 haplogroup underscores that this locus is exceptionally informative with regards to the context of this important intermediate time period in the history of eastern and southern Africa.

E3b1-M35*(former) was found at relatively high frequencies in southern African click-speaking populations (31% and 11% in the Kxoe and !Kung, respectively) (8). All of these individuals now exhibit the M293 mutation (present study). With the exception of one haplotype found in the Kxoe sample, the southern African M293+ Y-chromosomes all carry the derived M293(DYS389I-10) allele (Fig. 1). Three Sandawe and two Kxoe share an identical M293(DYS389I-10) Y-STR haplotype. Most !Kung individuals have Y-chromosomes that appear to derive from this central haplotype (see Fig. 1). One !Kung individual shares a Y-STR haplotype with Hadza and Datog individuals. These Tanzanian and Khoe-San individuals with identical Y-STR haplotypes across 10 loci are likely to have very recent common ancestry. Using analytical methods developed in Walsh (26), we can estimate the time to the most recent common ancestor (TMRCA) of two individuals who share identical haplotypes across 10 loci. Assuming an infinite alleles model and an average Y-STR mutation rate from Zhivotovsky et al. (18), the mean TMRCA is 1,800y (95% credible region 40–6,680 y). Assuming a binary stepwise STR model, the mean TMRCA of two identical 10 locus haplotypes is 2,065 y. The median estimate under a binary stepwise model is 1,200 y (95% credible region 40–5,070 y). The mean and median estimates are somewhat different due to the exponential shape of the TMRCA curve.

The recent common ancestry between southern African Khoe-San- and northern Tanzanian M293-derived individuals seems to be independent of the Bantu-speaking populations. Out of our sample of 94 individuals from eastern African Bantu-speaking populations, only one individual carried the M293(DYS389I-10) allele. The only other Bantu-speaking individual with this allele was sampled from South Africa. Furthermore, only the M293* haplotype from one set of Kxoe individuals falls within three steps of the haplotype of an eastern African Bantu-speaking individual (Fig. 1). The distance between M293 Y-STR haplotypes of Bantu speakers and southern African Khoe-San speakers strongly suggests a migration of non-Bantu-speakers to southern Africa distinct from the Bantu migration 1,500 ya. The direct haplotype sharing between Sandawe/Kxoe and !Kung/Hadza/Datog leads us to argue for a migration between Tanzania and southern-central Africa (specifically, northern Namibia and southern Angola).

The polarity of this migration is inferred from the restricted M293 Y-STR haplotype diversity in southern Africa relative to that of eastern Africa (Table 1, Fig. 1). Southern African individuals are also almost exclusively M293(DYS389I-10), whereas the Datog and Sandawe have a range of haplotypes both with and without M293(DYS389I-10). Several scenarios are plausible, including the hypothesis that a group carrying the M293(DYS389I-10) allele may have moved from northern Tanzania, southward through Tanzania into eastern Namibia/southern Angola. Alternatively, this group may have originated outside of Tanzania and split into two populations with one population moving into northern Tanzania and the other circumventing Tanzania via the Democratic Republic of Congo and rapidly moving southward into central-southern Africa. The almost complete absence of E3b1-M35*(former) in Rwanda and Mozambique (Paula Sánchez, personal communication) lends some support to a migration through Tanzania, but study of populations of the southern Democratic Republic of Congo would prove more conclusive.

To infer a date for this migration event, we estimated ρ by taking the M293(DYS389I-10) haplotype from which most southern African haplotypes are derived as the root (17) (see Fig. 1). Southern African individuals that were clearly derived from another root haplotype (e.g., shared haplotype with an eastern African individual) were excluded from this estimate. Using the same mutation rate described in Fig. 1, we estimated a maximum age of 2,700 ya (ρ = 0.6, n = 10) with a standard error (SE) of 1,100 y. Although a rough estimate, subject to several caveats, the restricted M293 diversity among the !Kung and Kxoe suggests a relatively recent migration of individuals with M293 into southern Africa.

Discussion

The inferred date and geographic route of this migration is particularly relevant in the context of archaeological evidence for the spread of pastoralism to southern Africa, presumably along a tsetse fly-free corridor (27). Archaeologists continue to debate the mode of transmission of pastoralism to this region (28, 29). A major unresolved question is: were early instances of sheep, cows, pottery, and other pastoralist markers transmitted to southern-central Africa by demic or cultural diffusion? According to one model, pastoralism arrived as the result of population movement from eastern Africa. This pastoralist population migrated into southern-central Africa along with livestock. Subsequently, these pastoralists either mixed with local populations, or expanded in number without substantial genetic exchange with local groups. According to a second model, pastoralism was transmitted with little to no population movement as people passed stock and pastoralist practices from eastern Africa to southern-central Africa.

Our Y-chromosome data support the former demic diffusion model. Direct sharing of two haplotypes (Sandawe/Kxoe and Hadza/Datog/!Kung) combined with the similarity of other southern African M293 haplotypes to those of Tanzanian individuals is evidence of human migration between the two regions. Given the directionality and dating discussed previously, we conclude that eastern African individuals contributed M293 to southern African populations within the last few thousand years. The scale of this migration may have been small, minimally four male individuals (based on the two shared eastern/southern haplotypes; the Kxoe haplotype, which is one step from an eastern haplotype; and the shared Kxoe/!Kung type, which is several steps removed from any neighbors). It is possible that other male individuals who did not carry M293 were also involved in this migration, but the A clade and B2b-M112 haplotypes in eastern and southern Africa are geographically structured with no evidence of recent contact (10, 14). E3a-M2 may have been involved in this earlier pastoralist migration, but we cannot differentiate between early E3a-M2 chromosomes and E3a-M2 chromosomes that are known to have been introduced later during the expansion of Bantu-speaking peoples. Furthermore, without samples from the other regions of Namibia, Botswana and South Africa we cannot address the question of how pastoralism spread after it reached south-central Africa (28, 29).

A second set of questions in the pastoralism literature focus on the linguistic/ethnic affiliation of the population that introduced pastoralism to southern Africa. Two different linguistic groups have been proposed as likely vectors. Bantu-speaking agropastoralists could have introduced sheep, and later cattle, either by direct admixture with local Khoe-San populations or in a cultural contact zone preceding the actual arrival of Bantu-speakers into southern Africa (30). Archaeological remains of Iron Age ceramics (associated with early Bantu-speaking populations) occur in southern Zambia at 2,300 ya, although the ceramics are not clearly associated with caprine or cattle remains (31, 32). A second model is based primarily on linguistic evidence, with some archaeological correlates. Ehret (33) proposed that elements of the Khwe language, specifically words associated with pastoralism, had been borrowed from an East Sahelian language. Intriguingly, the Bambata-ware pottery found at early pastoralist sites in northern Namibia, northern Botswana and Zambia has stylistic similarities to spouted pottery found at Ngamuriak on the border of Kenya/Tanzania (29, 34). Ngamuriak is a pastoralist site considered part of the Elmenteitan culture. Southern Nilotic languages (a subset of East Sahelian) correlate with the Elmenteitan archaeological culture from 2,500 ya (33, 35).

As argued above, the M293 clade provides evidence of a migration independent of the initial Bantu expansion. If indeed M293 is indicative of the spread of pastoralism, then the Y-chromosome does not support the model of Bantu-speaking agropastoralists initially introducing sheep to southern Africa. Instead, haplogroup E3b1f-M293 points toward a different source population for the immigrating pastoralists. The shared Sandawe/Kxoe and Hadza/Datog/!Kung haplotype supports a connection between radically different branches of Khoisan. However, it is also possible that a third population contributed the same haplotypes to both the Sandawe, Hadza, !Kung, and Kxoe within a relatively short period. The Hadza/Datog/!Kung haplotype sharing supports this hypothesis. More than any other East African population in our dataset, the Datog dominate the M293(DYS389I-10) diversity (Fig. 1) and overall M293 diversity (Table 1). Newman (36), in his study of the Sandawe subsistence strategies, describes one Sandawe clan, the Alagwa, which is derived from people with Barabaig heritage. Barabaig is a dialect of Datog, a Southern Nilotic language, and Barabaig individuals self-report their ethnicity as Datog. This Barabaig clan became incorporated into the Sandawe because of their purported rainmaking abilities and eventually came to occupy a dominant position within the Sandawe society (36). Ethnographic evidence and shared Y-STR haplotypes support exchange between Tanzanian click-speaking groups and Southern Nilotic-speaking groups in Tanzania (10). Given the high frequency and diversity of E3b1f-M293 in the Datog, our data provide tentative support for a Southern Nilotic linguistic affiliation of the population responsible for introducing pastoralism to southern Africa.

Although genetic data can be a powerful tool for inferring past human migrations, the additional inference of cultural processes associated with migration will always be more tenuous because culture is necessarily decoupled from the genes. Exceptions include aDNA from skeletal remains that are directly associated in situ with archaeological artifacts (6). In rare cases, natural selection for genes that have coevolved with cultural practices, such as lactose persistence, can provide more direct evidence for the adoption of agropastoralism (37). These inferences rely on the synthesis of genetic and a variety of types of nongenetic information.

Although the nongenetic evidence surrounding this migration is consistent with a pastoralist component, a nonpastoralist interpretation is also possible. As demonstrated above, the distribution of the E3b1f-M293 haplogroup is consistent with a migration from eastern to southern Africa within the last 5,000 y. Agriculture is generally linked to the appearance of iron-producing Bantu-speaking peoples in southern Africa ≈1,500 years ago (15, 32). However, the southern African M293 haplotypes are unlikely to have originated among Bantu speakers (see above). Thus, if this migration did not involve agricultural or pastoral populations, then it would have involved, minimally, male hunter-gatherer individuals moving between the regions. Historically, hunter-gatherer populations in Tanzania are best represented by the click-speaking Hadza and Sandawe, although there were almost certainly other hunter-gatherer populations in the region that did not persist as independent groups up to the present day. A connection between eastern and southern African hunter-gatherer groups, although possible, is somewhat unlikely given genetic support for isolation between the populations for >30,000 years (10).

In summary, the E3b1f-M293 lineage accounts for a large fraction of the paraphyletic set of E3b1-M35*(former) Y-chromosome lineages found in eastern and southern Africa. The discovery of even a single SNP, when well characterized in the context of a phylogeny, can illuminate previously obscure connections between southern and eastern African populations. Associated STR diversity has informed questions about both the mode of transmission of pastoralism and the population affiliation of the southern African pastoralists. These data provide evidence compatible with a model of demic diffusion accompanying the spread of pastoralism to southern Africa, and suggest an eastern African Southern Nilotic-speaking population as the most likely source for pastoralism in south-central Africa. Genetic estimates indicate that gene flow between eastern and southern populations most likely occurred between 1,200 (credible region bounded by 40–5,000 ya) and 2,700 years ago (SE bounded by 1,600–3,800 ya). The combination of the two dating methods indicates that gene flow between eastern and southern populations most likely occurred ≈2,000 years ago (bounded by 40–5,000 ya). With more extensive population sampling, E3b1f-M293 lineages may also yield insights into the migration of the Khoekhoe pastoralists within southern Africa, where they eventually reached as far south as the Cape of Good Hope (28, 29).

Materials and Methods

Here, we report a Y-chromosome SNP, M293, originally ascertained by P.S. and P.J.O. in a South African Bantu individual, that defines a haplogroup, E3b1f, independent of the others (Fig. S1). The E3b1-M35* designation used here refers to Y chromosomes that display the derived allele for M35, but retain ancestral alleles for M78, M81, M123, M281, V6, and M293 binary polymorphisms. The previous definition of E3b1-M35* (uninformed by M293) will be referred to as “E3b1-M35*(former)” throughout the present article.

We considered Y-chromosome data for 454 individuals from 13 populations of eastern and southern Africa. DNA samples were collected from the following Tanzanian populations by S.A.T.: Hadza, Sandawe, Burunge, Datog, Turu, Wafiome, Mbugwe, and Sukuma in the Arusha and Dodoma provinces of Tanzania. Individuals were grouped according to self-identified ethnicity, and only samples from unrelated individuals who could trace ancestry to the same ethnic group as far back as the grandparents were included in the study. Written informed consent was obtained from all donors and Institutional Review Board approval and permits from Commission for Science and Technology (COSTECH) and National Institute for Medical Research (NIMR) in Tanzania were obtained before sample collection. The Kenyan Bantu and South African Bantu samples were obtained from the HDGP–Centre d'Etude du Polymorphisme Humain collection. The Khwe (Kxoe) and !Kung samples were collected, with appropriate consent, in Kimberley, Northern Cape Province, South Africa (38). However, these individuals are recent immigrants to the area. The Kxoe were relocated from the Caprivi strip in Namibia, and the !Kung are from southern Angola. Ethiopian samples are described in Seielstad et al. (39) and Underhill et al. (19) and were collected with informed consent.

Of these sampled populations, 87 individuals had previously been reported as E3b1-M35*(former) (Table 1, Dataset S1). All such E3b1-M35*(former) individuals were subsequently genotyped at the M293 locus, using denaturing high performance liquid chromatography (DHPLC) methodology. The specifications for marker M293 (rs9341316 in dbSNP) are a T to G transversion at nucleotide position 130 within a 338-bp fragment amplified using PCR primer sequences: forward 5′-3′ = gatattagtattgaagaaaccag and reverse 5′-3′ = gctggctaatacttccacagag. The results showed that haplogroup E3b1f-M293 accounts for the majority of the former E3b1-M35* samples in eastern and southern Africa. Specifically, a total of 76 individuals carried the derived G allele at M293. Using the Promega PowerPlex Y System, we then determined Y-STR haplotypes for each M293 derived individual (Dataset S1). Y-STR loci include: DYS19, 389I and II, 439, 390, 391, 392, and 393. In addition, two more Y-STR loci, DYS388 and A7.2, were typed as described in ref. 40. The DYS389II repeat allele number was determined by subtracting the DYS389I repeat number (41). The DYS389I locus was of particular interest.

Approximately half of the M293+ individuals carry the DYS389I-10 repeat allele, which is unusually rare in the Y-STR Haplotype Reference Database (www.yhrd.org). Only 0.004% of African individuals (12 of n = 3,087) in the database carry the DYS389I-10 repeat allele; all of them were from South Africa or Mozambique or identified as African American. The other half of our M293+ individuals carry an allele with 13 or more repeats, suggestive of a three-step mutational event. To determine whether the DYS389I-10 mutation was a unique event, we examined 407 samples from a diverse set of African populations for which Y-SNP and DYS389I information was available. All individuals carrying the DYS389I-10 allele (38 individuals, all from this study) were uniquely positive for the M293 derived allele and no intermediate 11 or 12 repeat-size alleles where observed for the 76 haplogroup E3b1f chromosomes. We concluded that generation of the DYS389I-10 allele was a rare, and possibly unique, multistep event, and that individuals positive for the M35 mutation that also carry the DYS389I-10 allele are also very likely to be derived for the M293 the mutation. In one case only, derived M293 status was inferred for a M35 derived Datog individual with the DYS389I-10 allele because DNA was exhausted: an assumption bolstered by the overall 10 locus Y-STR profile identical to a known M293 derived Datog individual.

Acknowledgments

We thank Antonel Olkers (North-West University, Potchefstroom, South Africa) for collecting the !Kung and Khwe samples from South Africa. We thank R. Klein and K. A. Horsburgh for helpful discussion. P.J.O. was supported by BayGene (the Bavarian Genome Network). S.A.T.'s funding sources for field work/sample collection included: National Science Foundation (NSF) grants BCS-0196183 and BCS-0552486 and Wenner Gren Foundation grants and a L.S.B. Leakey Foundation grant. J.L.M.'s funding sources included: NSF grant BCS-9905574 and National Institutes of Health Grant GM28428 and a L.S.B. Leakey Foundation grant.

Footnotes

  • ‡To whom correspondence should be addressed. E-mail: bmhenn{at}stanford.edu
  • Author contributions: B.M.H. and P.A.U. designed research; C.G., A.A.L., P.J.O., P.S., R.S., and F.C. performed research; R.S., F.C., and S.A.T. contributed new reagents/analytic tools; B.M.H., C.G., and P.A.U. analyzed data; and B.M.H., C.G., J.L.M., and P.A.U. wrote the paper

  • ↵ §Present address: 23andMe, Inc., Mountain View, CA 94043.

  • The authors declare no conflict of interest.

  • This article is a PNAS Direct Submission.

  • Data deposition: The sequence reported in this paper has been deposited in the Single Nucleotide Polymorphism (dbSNP) database, www.ncbi.nlm.nih.gov/projects/SNP (accession no. rs9341316).

  • This article contains supporting information online at www.pnas.org/cgi/content/full/0801184105/DCSupplemental.

  • © 2008 by The National Academy of Sciences of the USA

References

  1. ↵
    1. Barbujani G ,
    2. Bertorelle G
    (2001) Genetics and the population history of Europe. Proc Natl Acad Sci USA 98:22–25.
    OpenUrlAbstract/FREE Full Text
  2. ↵
    1. Cavalli-Sforza LL ,
    2. Menozzi P ,
    3. Piazza A
    (1993) Demic expansions and human evolution. Science 259:639–646.
    OpenUrlAbstract/FREE Full Text
  3. ↵
    1. Chikhi L ,
    2. et al.
    (1998) Clines of nuclear DNA markers suggest a largely neolithic ancestry of the European gene pool. Proc Natl Acad Sci USA 95:9053–9058.
    OpenUrlAbstract/FREE Full Text
  4. ↵
    1. Richards M ,
    2. et al.
    (2000) Tracing European founder lineages in the near eastern mtDNA pool. Am J Hum Genet 67:1251–1276.
    OpenUrlCrossRefPubMed
  5. ↵
    1. Dupanloup I ,
    2. Bertorelle G ,
    3. Chikhi L ,
    4. Barbujani G
    (2004) Estimating the impact of prehistoric admixture on the genome of Europeans. Mol Biol Evol 21:1361–1372.
    OpenUrlAbstract/FREE Full Text
  6. ↵
    1. Haak W ,
    2. et al.
    (2005) Ancient DNA from the first European farmers in 7500-year-old Neolithic sites. Science 310:1016–1018.
    OpenUrlAbstract/FREE Full Text
  7. ↵
    1. Arredi B ,
    2. et al.
    (2004) A predominantly neolithic origin for Y-chromosomal DNA variation in North Africa. Am J Hum Genet 75:338–345.
    OpenUrlCrossRefPubMed
  8. ↵
    1. Cruciani F ,
    2. et al.
    (2004) Phylogeographic analysis of haplogroup E3b (E-M215) y chromosomes reveals multiple migratory events within and out of Africa. Am J Hum Genet 74:1014–1022.
    OpenUrlCrossRefPubMed
  9. ↵
    1. Luis J ,
    2. et al.
    (2004) The Levant versus the Horn of Africa: evidence for bidirectional corridors of human migrations. Am J Hum Genet 74:532–544.
    OpenUrlCrossRefPubMed
  10. ↵
    1. Tishkoff S ,
    2. et al.
    (2007) History of Click-Speaking Populations of Africa Inferred from mtDNA and Y Chromosome Genetic Variation. Mol Biol Evol 24:2180–2195.
    OpenUrlAbstract/FREE Full Text
  11. ↵
    1. Wood E ,
    2. et al.
    (2005) Contrasting patterns of Y chromosome and mtDNA variation in Africa: evidence for sex-biased demographic processes. Eur J Hum Genet 13:867–876.
    OpenUrlCrossRefPubMed
  12. ↵
    1. Semino O ,
    2. et al.
    (2004) Origin, diffusion, and differentiation of Y-chromosome haplogroups E and J: inferences on the neolithization of Europe and later migratory events in the Mediterranean area. Am J Hum Genet 74:1023–1034.
    OpenUrlCrossRefPubMed
  13. ↵
    1. Knight A ,
    2. et al.
    (2003) African Y chromosome and mtDNA divergence provides insight into the history of click languages. Curr Biol 13:464–473.
    OpenUrlCrossRefPubMed
  14. ↵
    1. Semino O ,
    2. et al.
    (2002) Ethiopians and Khoisan share the deepest clades of the human Y-chromosome phylogeny. Am J Hum Genet 70:265–268.
    OpenUrlCrossRefPubMed
  15. ↵
    1. Phillipson DW
    (2005) African Archaeology (Cambridge Univ Press, Cambridge, UK).
  16. ↵
    1. Bandelt H ,
    2. Forster P ,
    3. Röhl A
    (1999) Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16:37–48.
    OpenUrlAbstract
  17. ↵
    1. Forster P ,
    2. Harding R ,
    3. Torroni A ,
    4. Bandelt H
    (1996) Origin and evolution of Native American mtDNA variation: A reappraisal. Am J Hum Genet 59:935–945.
    OpenUrlPubMed
  18. ↵
    1. Zhivotovsky L ,
    2. et al.
    (2004) The effective mutation rate at Y chromosome short tandem repeats, with application to human population-divergence time. Am J Hum Genet 74:50–61.
    OpenUrlCrossRefPubMed
  19. ↵
    1. Underhill P ,
    2. et al.
    (2000) Y chromosome sequence variation and the history of human populations. Nat Genet 26:358–361.
    OpenUrlCrossRefPubMed
  20. ↵
    1. Bosch E ,
    2. et al.
    (2001) High-resolution analysis of human Y-chromosome variation shows a sharp discontinuity and limited gene flow between northwestern Africa and the Iberian Peninsula. Am J Hum Genet 68:1019–1029.
    OpenUrlCrossRefPubMed
  21. ↵
    1. Beleza S ,
    2. et al.
    (2005) The genetic legacy of western Bantu migrations. Hum Genet 117:366–375.
    OpenUrlCrossRefPubMed
  22. ↵
    1. Cruciani F ,
    2. et al.
    (2002) A back migration from Asia to sub-Saharan Africa is supported by high-resolution analysis of human Y-chromosome haplotypes. Am J Hum Genet 70:1197–1214.
    OpenUrlCrossRefPubMed
  23. ↵
    1. Pereira L ,
    2. et al.
    (2001) Prehistoric and historic traces in the mtDNA of Mozambique: Insights into the Bantu expansions and the slave trade. Ann Hum Genet 65:439–458.
    OpenUrlCrossRefPubMed
  24. ↵
    1. Salas A ,
    2. et al.
    (2002) The making of the African mtDNA landscape. Am J Hum Genet 71:1082–1111.
    OpenUrlCrossRefPubMed
  25. ↵
    1. Scozzari R ,
    2. et al.
    (1999) Combined use of biallelic and microsatellite Y-chromosome polymorphisms to infer affinities among African populations. Am J Hum Genet 65:829–846.
    OpenUrlCrossRefPubMed
  26. ↵
    1. Walsh B
    (2001) Estimating the time to the most recent common ancestor for the Y chromosome or mitochondrial DNA for a pair of individuals. Genetics 158:897–912.
    OpenUrlAbstract/FREE Full Text
  27. ↵
    1. Gifford-Gonzalez D
    (2000) Animal disease challenges to the emergence of pastoralism in sub-Saharan Africa. Afr Archaeol Rev 17:95–139.
    OpenUrlCrossRef
  28. ↵
    1. Sadr K
    (1998) The First Herders at the Cape of Good Hope. Afr Archaeol Rev 15:101–132.
    OpenUrlCrossRef
  29. ↵
    1. Smith A
    (2005) The concepts of ‘Neolithic’ and ‘Neolithisation’ for Africa? Before Farming 1:1–6.
    OpenUrl
  30. ↵
    1. Sampson CG ,
    2. Hart TJG ,
    3. Wallsmith DL ,
    4. Blagg JD
    (1989) The ceramic sequence in the upper Seacow Valley: Problems and implications. S Afr Archaeol Bull 44:3–16.
    OpenUrlCrossRef
  31. ↵
    1. Mitchell P
    (2002) The Archaeology of Southern Africa (Cambridge Univ Press, Cambridge, UK).
  32. ↵
    1. Phillipson DW
    (1989) The earliest South African pastoralists and the early Iron Age. Nsi 6:127–134.
    OpenUrl
  33. ↵
    1. Ehret C
    (1998) An African Classical Age: Eastern and Southern Africa in World History, 1000 BC to AD 400 (University of Virginia Press, Charlottesville).
  34. ↵
    1. Marshall F
    (1990) Origins of specialized pastoral production in East Africa. Am Anthropol 92:873–894.
    OpenUrlCrossRef
  35. ↵
    1. Ambrose SH
    1. Ehret C ,
    2. Posnansky M
    (1982) in The Archaeological and Linguistic Reconstruction of African History, eds Ehret C , Posnansky M (University of California Press, Berkeley), pp 104–157.
  36. ↵
    1. Newman JL
    (1970) The Ecological Basis for Subsistence Change Among the Sandawe of Tanzania (National Academy of Sciences, National Research Council, Washington, DC).
  37. ↵
    1. Tishkoff SA ,
    2. et al.
    (2007) Convergent adaptation of human lactase persistence in Africa and Europe. Nat Genet 39:31–40.
    OpenUrlCrossRefPubMed
  38. ↵
    1. Scozzari R ,
    2. et al.
    (1997) Differential structuring of human populations for homologous X and Y microsatellite loci. Am J Hum Genet 61:719–733.
    OpenUrlPubMed
  39. ↵
    1. Seielstad M ,
    2. et al.
    (1999) A view of modern human origins from Y chromosome microsatellite variation. Genome Res 9:558–567.
    OpenUrlAbstract/FREE Full Text
  40. ↵
    1. Cinnioğlu C ,
    2. et al.
    (2004) Excavating Y-chromosome haplotype strata in Anatolia. Hum Genet 114:127–148.
    OpenUrlCrossRefPubMed
  41. ↵
    1. Cooper G ,
    2. Amos W ,
    3. Hoffman D ,
    4. Rubinsztein D
    (1996) Network analysis of human Y microsatellite haplotypes. Hum Mol Genet 5:1759–1766.
    OpenUrlAbstract/FREE Full Text
  42. ↵
    1. Bandelt H ,
    2. Forster P ,
    3. Sykes B ,
    4. Richards M
    (1995) Mitochondrial portraits of human populations using median networks. Genetics 141:743–753.
    OpenUrlAbstract/FREE Full Text
View Abstract
PreviousNext
Back to top
Article Alerts
Email Article

Thank you for your interest in spreading the word on PNAS.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Y-chromosomal evidence of a pastoralist migration through Tanzania to southern Africa
(Your Name) has sent you a message from PNAS
(Your Name) thought you would like to see the PNAS web site.
Citation Tools
Y-chromosomal evidence of a pastoralist migration through Tanzania to southern Africa
Brenna M. Henn, Christopher Gignoux, Alice A. Lin, Peter J. Oefner, Peidong Shen, Rosaria Scozzari, Fulvio Cruciani, Sarah A. Tishkoff, Joanna L. Mountain, Peter A. Underhill
Proceedings of the National Academy of Sciences Aug 2008, 105 (31) 10693-10698; DOI: 10.1073/pnas.0801184105

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
Y-chromosomal evidence of a pastoralist migration through Tanzania to southern Africa
Brenna M. Henn, Christopher Gignoux, Alice A. Lin, Peter J. Oefner, Peidong Shen, Rosaria Scozzari, Fulvio Cruciani, Sarah A. Tishkoff, Joanna L. Mountain, Peter A. Underhill
Proceedings of the National Academy of Sciences Aug 2008, 105 (31) 10693-10698; DOI: 10.1073/pnas.0801184105
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Mendeley logo Mendeley

More Articles of This Classification

Biological Sciences

  • Highly disordered histone H1−DNA model complexes and their condensates
  • Thermodynamic favorability and pathway yield as evolutionary tradeoffs in biosynthetic pathway choice
  • SUMO protease SENP1 deSUMOylates and stabilizes c-Myc
Show more

Anthropology

  • Salt and marine products in the Classic Maya economy from use-wear study of stone tools
  • Synchronization of energy consumption by human societies throughout the Holocene
  • Impact of climate change on the transition of Neanderthals to modern humans in Europe
Show more

Related Content

  • No related articles found.
  • Scopus
  • PubMed
  • Google Scholar

Cited by...

  • Fine-Scale Human Population Structure in Southern Africa Reflects Ecogeographic Boundaries
  • Environments and trypanosomiasis risks for early herders in the later Holocene of the Lake Victoria basin, Kenya
  • Genome-wide evidence of Austronesian-Bantu admixture and cultural reversion in a hunter-gatherer group of Madagascar
  • Genomic Variation in Seven Khoe-San Groups Reveals Adaptation and Complex African History
  • Rapid, global demographic expansions after the origins of agriculture
  • Working toward a synthesis of archaeological, linguistic, and genetic data for inferring African population history
  • Scopus (68)
  • Google Scholar

Similar Articles

You May Also be Interested in

Better understanding how the truffles reproduce has major implications for farmers, chefs, and foodies enamored with the expensive, pungent fungus. Image courtesy of Shutterstock/Vitalina Rybakova.
Inner Workings: The mysterious parentage of the coveted black truffle
Better understanding how the truffles reproduce has major implications for farmers, chefs, and foodies enamored with the expensive, pungent fungus.
Image courtesy of Shutterstock/Vitalina Rybakova.
PNAS QnAs with NAS foreign associate and metabolic engineer Sang Yup Lee
PNAS QnAs
PNAS QnAs with NAS foreign associate and metabolic engineer Sang Yup Lee
Researchers report a species of early bird with a combination of bird-like and dinosaur-like bone morphologies, and the structure of the bird’s shoulder girdle highlights the role of developmental plasticity in the early evolution of birds, according to the authors.
Dinosaur-like forms in early bird shoulders
Researchers report a species of early bird with a combination of bird-like and dinosaur-like bone morphologies, and the structure of the bird’s shoulder girdle highlights the role of developmental plasticity in the early evolution of birds, according to the authors.
Honey bee. Image courtesy of Vivian Abagiu (The University of Texas at Austin, Austin, TX).
Effect of glyphosate on honey bee gut
A study suggests that the herbicide glyphosate disrupts bee gut microbiota, increasing bees’ susceptibility to pathogens, and that glyphosate’s effects may contribute to the largely unexplained increase in honey bee colony mortality.
Image courtesy of Vivian Abagiu (The University of Texas at Austin, Austin, TX).
HIV. Image courtesty of Pixabay/typographyimages.
Ancient retrovirus and intravenous drug use
A study finds that a fragment of an ancient retrovirus, integrated in human ancestors before the emergence of Neanderthals, is found more frequently in people who contracted HIV and hepatitis C through intravenous drug use, compared with control populations.
Image courtesty of Pixabay/typographyimages.
Proceedings of the National Academy of Sciences: 115 (41)
Current Issue

Submit

Sign up for Article Alerts

Jump to section

  • Article
    • Abstract
    • Results
    • Discussion
    • Materials and Methods
    • Acknowledgments
    • Footnotes
    • References
  • Figures & SI
  • Info & Metrics
  • PDF
Site Logo
Powered by HighWire
  • Submit Manuscript
  • Twitter
  • Facebook
  • RSS Feeds
  • Email Alerts

Articles

  • Current Issue
  • Latest Articles
  • Archive

PNAS Portals

  • Classics
  • Front Matter
  • Teaching Resources
  • Anthropology
  • Chemistry
  • Physics
  • Sustainability Science

Information

  • Authors
  • Reviewers
  • Press
  • Site Map

Feedback    Privacy/Legal

Copyright © 2018 National Academy of Sciences.