New Research In
Physical Sciences
Social Sciences
Featured Portals
Articles by Topic
Biological Sciences
Featured Portals
Articles by Topic
- Agricultural Sciences
- Anthropology
- Applied Biological Sciences
- Biochemistry
- Biophysics and Computational Biology
- Cell Biology
- Developmental Biology
- Ecology
- Environmental Sciences
- Evolution
- Genetics
- Immunology and Inflammation
- Medical Sciences
- Microbiology
- Neuroscience
- Pharmacology
- Physiology
- Plant Biology
- Population Biology
- Psychological and Cognitive Sciences
- Sustainability Science
- Systems Biology
Large numbers of vertebrates began rapid population decline in the late 19th century
Contributed by Wen-Hsiung Li, October 12, 2016 (sent for review July 22, 2016; reviewed by William J. Murphy and Jianzhi Zhang)

Significance
The current rate of species extinction is ∼1,000 times the background rate of extinction and is attributable to human impact, ecological and demographic fluctuations, and inbreeding due to small population sizes. The rate and the initiation date of rapid population decline (RPD) can provide important clues about the driving forces of population decline in threatened species, but they are generally unknown. We analyzed the genetic diversity data in 2,764 vertebrate species. Our population genetics modeling suggests that in many threatened vertebrate species the RPD on average began in the late 19th century, and the mean current size of threatened vertebrates is only 5% of their ancestral size. We estimated a ∼25% population decline every 10 y in threatened vertebrate species.
Abstract
Accelerated losses of biodiversity are a hallmark of the current era. Large declines of population size have been widely observed and currently 22,176 species are threatened by extinction. The time at which a threatened species began rapid population decline (RPD) and the rate of RPD provide important clues about the driving forces of population decline and anticipated extinction time. However, these parameters remain unknown for the vast majority of threatened species. Here we analyzed the genetic diversity data of nuclear and mitochondrial loci of 2,764 vertebrate species and found that the mean genetic diversity is lower in threatened species than in related nonthreatened species. Our coalescence-based modeling suggests that in many threatened species the RPD began ∼123 y ago (a 95% confidence interval of 20–260 y). This estimated date coincides with widespread industrialization and a profound change in global living ecosystems over the past two centuries. On average the population size declined by ∼25% every 10 y in a threatened species, and the population size was reduced to ∼5% of its ancestral size. Moreover, the ancestral size of threatened species was, on average, ∼22% smaller than that of nonthreatened species. Because the time period of RPD is short, the cumulative effect of RPD on genetic diversity is still not strong, so that the smaller ancestral size of threatened species may be the major cause of their reduced genetic diversity; RPD explains 24.1–37.5% of the difference in genetic diversity between threatened and nonthreatened species.
Although preservation of biodiversity is vital to a sustainable human society, rapid population decline (RPD) continues to be widespread across taxa (1⇓–3). When RPD occurs, it is accompanied by a loss of genetic diversity. Genetic diversity is reflected in the genetic differences among individuals and is essential for populations to adapt to changing environments (4). The start date and the rate of RPD provide useful information for effective conservation of threatened species and are important for promotion of public awareness of the threat. However, these two key parameters are difficult to estimate because there are virtually no time-series data on population size over hundreds of years. For most species, the population size may only be traced back to 40 y (2). Therefore, an alternative approach is to estimate the start date and the rate of RPD, using mathematical modeling.
Changes in population size over thousands of years could be inferred for a species from genome-wide DNA polymorphism data (5⇓–7). However, it remains a formidable technical challenge to infer the event of RPD because the signal of such an event is weak in the typical time scale of observable polymorphisms (8). To overcome the limited resolution power of the genetic data from a single species, we propose an approach that draws conclusions based on the collective support from many species. The central premise of our approach is that the threat of extinction of thousands of species was primarily due to a common cause in the past that led to a significant depletion of available habitats and resources. Consequently, we were able to draw conclusions based on present-day polymorphism data from a large number of threatened species and their nonthreatened relatives. Our method is depicted in Fig. 1. Here we studied RPD in vertebrates, because vertebrates have been more extensively investigated in the past. However, our conclusions should have some generality because vertebrate species live in a wide range of ecosystems. Moreover, the proposed method is also suitable for studying nonvertebrate species.
Schematic inference on the start date and the rate of RPD under one particular demographic model. The coalescence simulations were conducted conditional on the sample sizes, the numbers of loci, the pattern of missing data, the generation times, the census sizes, the species distributions, and the years of sampling. The data were summarized as the relative difference in four genetic diversity measurements between two species groups. The species categorized as near threatened (NT) and least concern (LC) are treated as the nonthreatened species. The threatened species include those listed as critically endangered (CR), endangered (EN), and vulnerable (VU) (6). The uncategorized species include those that are listed as data deficient and have not been evaluated by the IUCN.
Results and Discussion
Data Collected.
We reviewed more than 10,000 peer-reviewed papers published in the last two and half decades, among which ∼2,500 papers in 164 scientific journals were found to have surveyed the genetic diversity of at least one vertebrate species. The level of genetic diversity was measured with one of the following summary statistics (9): the expected and observed heterozygosity (
Categories of the 2,764 vertebrate species used in this study. (A) The number and relative proportion of the species in each taxon category. (B) IUCN Red List categories of the examined species and their relative proportions. CR, critically endangered; DD, data deficient; EN, endangered; LC, least concern; NE, not evaluated; NT, near threatened; VU, vulnerable. The threatened species include the species of critically endangered, endangered, and vulnerable, and the nonthreatened species include the near-threatened and least-concern species.
Comparison of Genetic Diversity Between Nonthreatened and Threatened Species.
Following a previous study (11), we compared the genetic diversity between nonthreatened and threatened vertebrate species using the permutation test (12). The establishment of those IUCN categories does not rely on the information of genetic diversity. Although the distributions of genetic diversity of nonthreatened and threatened species overlap (Fig. 3 A and B and SI Appendix, Fig. S1), the mean genetic diversity of nonthreatened species is significantly higher than that of related threatened species in all 16 comparisons (Fig. 3 C–E and SI Appendix, Table S1), generally agreeing with the previous finding (11). The results remain the same when we recompiled the data with different numbers of microsatellite loci (
Comparisons of genetic diversity between nonthreatened and threatened vertebrate species. (A) Empirical distributions of
To examine whether differences in population structure can explain the reduction in genetic diversity of threatened species, we first compared the Fst values (an indicator of recent population structure estimated from microsatellite loci) between two species groups and found no significant difference (P = 0.25) (SI Appendix, Table S2). Next, we calculated the one-tailed P values of Tajima’s D (13) for the mitochondrial DNA polymorphism data, which is sensitive to ancient but not recent population structure (14). There was also no significant difference between the two species groups (P = 0.67) (SI Appendix, Table S2). Thus, population structure differences are unlikely the principle cause of the difference in genetic diversity between nonthreatened and threatened species.
To assess the impact of recent demographic change on genetic diversity of threatened species, we considered pairs of nonthreatened and threatened species from the same family. For each pair we calculated the ratio of the long-term effective population size (
Ratios of long-term effective population size (circles, measured as
We suggest that the recent impacts on population size could be measured by
Demographic Models.
We used a model-based approach to quantify the RPD. One model is illustrated in Fig. 5A. The essential premise is that many threatened species began the RPD at similar times due to the increased impact of human activities and habitat losses. Specifically, the model assumes that each threatened species began an exponential decrease in size
Coalescence-based modeling and analysis. (A) The two-phase model of exponential population decline for threatened species with a constant effective population size of nonthreatened species during both phases. (B) The likelihood surface obtained from the analysis. (C) The estimates based on the data from all studied species and subgroups of species. The estimates (
To estimate the two parameters (
The surface of the likelihood
The effect of RPD on the reduction in genetic diversity is relatively weak because the time period of RPD is short and because the observed differences in genetic diversity between the two species groups were small. For example, the initial simulated heterozygosity of threatened species was 0.572, and the reduced heterozygosity after RPD was 0.558. That is, only 2.4% of the heterozygosity was lost due to RPD because the time period of RPD was only 123 y.
Comparison with Historical Data.
It has been documented that accelerated land use by humans might have caused the decrease of biodiversity since the middle of the 19th century (21) and extinction rates increased sharply over the past 200 y (22). Therefore, our estimated
Unbiasedness and Robustness of the Method.
We examined the bias of our inference method. First, the sampling effect among species was studied, and it was found that smaller subsets of data gave unbiased estimates but with large variances, as expected (Fig. 5C). Second, RPDs were simulated (10,000 replicates) with the parameter values
The robustness of our estimates was investigated in a number of ways. Several scenarios of the ancient demography before RPD were simulated, and we found that the ancient demography has almost no impact on the estimates (SI Appendix, Fig. S4). We also found that even if the population size of species categorized as near-threatened was assumed to decrease by half since RPD the estimates were virtually unchanged (SI Appendix, Fig. S5A). We also considered the uncertainty of assessment on the genetic diversity due to small sample size, the uncertainty of assessment on the threatened status of species, the uncertainty of estimating
Our large collection of species allowed us to conduct refined inferences. First, we reanalyzed each taxon class (Fig. 5D). We found that threatened birds and fish species seem to have experienced a more recent RPD than other threatened vertebrates. Thus, the estimated
In summary, this study demonstrates a utility of genetic polymorphism data in conservation biology. The model with the two parameters
Methods
Collecting Genetic Diversity Data.
For over a decade we carefully examined 2,475 peer-reviewed papers published in 164 scientific journals (Dataset S1) and collected DNA polymorphism data for vertebrate species living in various ecosystems. We focused on vertebrate species because available data were mostly from vertebrates. The genetic diversity in each of these species was typically assayed by one or two molecular techniques: allelic variation at microsatellite loci and sequence variation in the control (D-loop) and coding regions of the mitochondrial DNA. Therefore, the genetic diversity data presented here represent the polymorphism level in both nuclear and mitochondrial genomes.
To ensure data quality, if an inconsistency (in the sample size, the number of haplotypes, the number of alleles, or any related key information) was found in a paper we contacted the authors for their confirmation or discarded the data if we received no response. We also excluded studies using museum samples because the number of such studies is very limited. To increase nomenclatural consistency, the standard world checklists (version 2014.3) on the IUCN Red List were used (www.iucnredlist.org/technical-documents/information-sources-and-quality).
The within-species polymorphism data of 2,764 vertebrate species are presented in Dataset S1. In this dataset, there were 400 vertebrate species surveyed by more than one method, so we have 3,219 nonredundant entries in total, where each entry is composed of one to three summary statistics. If the microsatellite loci of a vertebrate species were surveyed, the sample size (n), the number of microsatellite loci, the expected heterozygosity (
According to the IUCN, the threatened species (TS) include those listed as critically endangered (CR), endangered (EN), and vulnerable (VU) (3). The species categorized as near threatened (NT) and least concern (LC) are treated as the nonthreatened species (NS). The uncategorized species include those that are listed as data deficient (DD) and have not been evaluated by the IUCN. A taxon is listed as data deficient when there is inadequate information to make an assessment of its risk of extinction (3). Generally, the uncategorized species were excluded from our analyses (Fig. 1), unless stated otherwise.
Collecting Species Distribution and Generation Time Data.
The geographic distributions of 2,552 vertebrate species were retrieved from the IUCN Red List (version 2014.3) and a reptile database (www.reptile-database.org). The data are given in Dataset S1. The generation times of 3,146 vertebrate species were obtained from different published resources (Dataset S1), which formed the basis for our generation time estimates. Assume that the generation time of a species is
Ratio of Effective Population Sizes Between Nonthreatened and Threatened Species.
It is difficult to estimate the effective population size at a specific time of a species with fluctuating population size (5⇓–7). However, the ratio of
where
Based on the current census of 1,868 vertebrate species obtained from IUCN, a professional book (30), and peer-reviewed literature (Dataset S1), we estimated
Demographic Model and Likelihood Inference of RPD.
Demographic model.
We assumed that the effective population size of a nonthreatened species remains constant. Denote the ancestral effective population size of a threatened species at time
Under the assumption of constant population size during the first phase and
Assumption about sampling times.
The sampling time is likely different in different studies, but the time duration between the time of sampling and the time of publication is generally much shorter than
Coalescence-based simulations.
The coalescence-based simulations followed the standard procedure (31, 32). To simulate the microsatellite polymorphism data for the i-th species, we (randomly) chose
We first calculated
where
where
To model the heterogeneity of mutation rates among microsatellite loci, we assumed that the mutation rate for a randomly selected locus follows a lognormal distribution, with the coefficient of variation equal to 1 (33). We also assumed that the microsatellite loci are independent and are autosomal.
If the evolution of the i-th species followed a nonconstant size model, the time in the unit of years was transformed to the unit of
To simulate the single-nucleotide polymorphism data on the D-loop region, we followed the procedures described above with two modifications. First,
Likelihood inference of RPD.
The observed genetic diversity of a species can be represented by a vector
For the summary statistic He on the microsatellite loci, we denote
The above procedure was applied to multiple summary statistics with only minor modifications. For the microsatellite dataset, we jointly considered He and
Then,
All of the collected species, including those with small sample size (
Likelihood ratio test.
To conduct the likelihood ratio test and obtain likelihood-based confidence intervals, we obtained the empirical distribution of the likelihood ratio
Effect of RPD on the Difference in Genetic Diversity Between Nonthreatened and Threatened Species.
We first conducted the coalescence-based modeling described above, conditional on
Robustness Analysis Under Various Demographic Models.
To examine the robustness of the estimates, we conducted reanalyses under various demographic models. The likelihood method described above is very flexible, and little modification is needed to analyze other demographic models. First, we considered a slow population decline in a nonthreatened vertebrate species categorized as near-threatened (NT) based on the rationale that human activities may also have an impact on those nonthreatened vertebrate species. We assumed that their population size declined by half
Second, we considered an ancestral population with varying size (SI Appendix, Fig. S4). Under the ancestral instantaneous expansion model (SI Appendix, Fig. S4 A and B), we assumed that
To ensure that the simulated genetic diversity level is equal to the observed one in a nonthreatened species with varying population size,
To determine
Acknowledgments
We thank Feng Gao for technical assistance; Chun Ye, Dongsheng Lu, and Xixian Ma for help during data collection; Jing Luo, Xuemei Lü, Peng Shi, and John Parsch for their comments; and Sara Barton for editorial assistance. This work was supported by the Strategic Priority Research Program of the Chinese Academy of Sciences Grant XDB13040800, National Natural Science Foundation of China Grant 91531306, and 973 Project Grant 2012CB316505 (all to H.L., J.X.-Y., G.D., Z.G., C.M., and Z.Y.). This work was also supported in part by Grant nos. 91231120 and 91631304 from the National Natural Science Foundation of China and an endowment from The University of Texas Health Science Center at Houston (to Y.-X.F.). Y.-P.Z. is supported in part by the Yunnan Provincial Science and Technology Department and the National Natural Science Foundation of China.
Footnotes
↵1H.L. and J.X.-Y. contributed equally to this work.
- ↵2To whom correspondence may be addressed. Email: wli{at}uchicago.edu, Yunxin.Fu{at}uth.tmc.edu, or zhangyp{at}mail.kiz.ac.cn.
Author contributions: H.L., J.X.-Y., W.-H.L., Y.-X.F., and Y.-P.Z. designed research; H.L., J.X.-Y., G.D., Z.G., C.M., Z.Y., and Y.-X.F. performed research; H.L. and Y.-X.F. contributed new reagents/analytic tools; H.L., J.X.-Y., and Y.-X.F. analyzed data; and H.L., J.X.-Y., G.D., Z.G., C.M., Z.Y., O.A.R., W.-H.L., Y.-X.F., and Y.-P.Z. wrote the paper.
Reviewers: W.J.M., Texas A&M University; and J.Z., University of Michigan.
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1616804113/-/DCSupplemental.
Freely available online through the PNAS open access option.
References
- ↵.
- Stuart SN, et al.
- ↵.
- WWF
- ↵.
- IUCN
- ↵.
- Frankham R,
- Ballou JD,
- Briscoe DA
- ↵
- ↵
- ↵
- ↵.
- Fu Y-X
- ↵.
- Hartl DL,
- Clark AG
- ↵
- ↵.
- Spielman D,
- Brook BW,
- Frankham R
- ↵.
- Good PI
- ↵.
- Tajima F
- ↵.
- Fu Y-X
- ↵.
- Xu H,
- Fu Y-X
- ↵
- ↵
- ↵.
- Tavaré S,
- Balding DJ,
- Griffiths RC,
- Donnelly P
- ↵
- ↵.
- Tajima F
- ↵
- ↵.
- Ceballos G, et al.
- ↵
- ↵.
- Collen B,
- Nicholson E
- ↵.
- Bazin E,
- Glémin S,
- Galtier N
- ↵
- ↵.
- Martin AP,
- Palumbi SR
- ↵Koepfli KP, Paten B, Genome 10K Community Scientists, O’Brien SJ (2015) The Genome 10K Project: A way forward. Annu Rev Anim Biosci 3:57-111..
- ↵
- ↵.
- Nowak RM
- ↵.
- Futuyma D,
- Antonovics J
- Hudson RR
- ↵.
- Griffiths RC,
- Tavaré S
- ↵.
- Renwick A,
- Davison L,
- Spratt H,
- King JP,
- Kimmel M
Citation Manager Formats
Sign up for Article Alerts
Article Classifications
- Biological Sciences
- Evolution

















