Twenty-million-year relationship between mammalian diversity and primary productivity
- aSenckenberg Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft für Naturforschung, 60325 Frankfurt, Germany;
- bInstitute of Ecology, Evolution and Diversity, Goethe University, 60438 Frankfurt, Germany;
- cDepartment of Geosciences and Geography, University of Helsinki, 00014 Helsinki, Finland;
- dInstitute of Biology, Leipzig University, 04103 Leipzig, Germany;
- eDepartment of Ecology and Evolutionary Biology, Brown University, Providence, RI 02912;
- fSchool of Earth Sciences, University of Bristol, Clifton, Bristol BS8 1RJ, United Kingdom;
- gInstitute of Geosciences, Goethe University, 60438 Frankfurt, Germany;
- hDepartment of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794
See allHide authors and affiliations
Edited by David Jablonski, The University of Chicago, Chicago, IL, and approved August 2, 2016 (received for review February 9, 2016)

Significance
Our study links diversity dynamics of fossil large mammals through time to primary productivity, i.e. net production of plant biomass. Spatial diversity patterns of extant terrestrial animals are often correlated with present-day primary productivity, but it is unclear whether the relationship holds throughout the geological past. Here we show that higher primary productivity was consistently associated with higher mammalian diversity throughout the geological period of the Neogene, supporting the hypothesis that energy flow from plants to consumers is a key factor determining the level of biodiversity. Our comparison of the fossil diversity–productivity relationship with present-day data suggests that human activity and Pleistocene climate change have conspired to dissolve the relationship that has characterized our planet over 20 My.
Abstract
At global and regional scales, primary productivity strongly correlates with richness patterns of extant animals across space, suggesting that resource availability and climatic conditions drive patterns of diversity. However, the existence and consistency of such diversity–productivity relationships through geological history is unclear. Here we provide a comprehensive quantitative test of the diversity–productivity relationship for terrestrial large mammals through time across broad temporal and spatial scales. We combine >14,000 occurrences for 690 fossil genera through the Neogene (23–1.8 Mya) with regional estimates of primary productivity from fossil plant communities in North America and Europe. We show a significant positive diversity–productivity relationship through the 20-million-year record, providing evidence on unprecedented spatial and temporal scales that this relationship is a general pattern in the ecology and paleo-ecology of our planet. Further, we discover that genus richness today does not match the fossil relationship, suggesting that a combination of human impacts and Pleistocene climate variability has modified the 20-million-year ecological relationship by strongly reducing primary productivity and driving many mammalian species into decline or to extinction.
One ubiquitous pattern in ecology is the positive relationship between the diversity of terrestrial organisms and primary productivity (1⇓⇓–4). For consumers, this relationship is thought to arise because primary productivity limits energy flow to and total biomass at higher trophic levels (5). Because primary productivity depends largely on climatic conditions (1), and spatial richness patterns of extant species are often strongly correlated with climate at global and continental scales, the productivity hypothesis has been successful in explaining spatial patterns of diversity (1⇓⇓⇓–5). However, the present-day diversity–productivity relationship may not be representative for Earth’s history, because present-day conditions have been strongly shaped by human activity (6, 7). Exponential increases in human population size and in biomass of a few domesticated species, such as cattle, pigs, and poultry, have resulted in increasing appropriation of the net primary production of biomass (NPP) since the beginning of the Holocene (8, 9). Today, human activity removes up to 30% of the global NPP from natural ecosystems, mostly through harvesting, deforestation, and grazing (10). Increasing human impact and strong glacial–interglacial climate oscillations superimposed on Pleistocene environmental changes have dramatically reduced the number of extant large mammal species (7, 11). Here, we test the diversity–productivity relationship in large mammals by analyzing the Neogene fossil record, which precedes Pleistocene climate change and human dominance of natural ecosystems.
To date, the generality of the terrestrial diversity–productivity relationship over long geological timescales remains elusive. Although temporal changes in terrestrial fossil diversity have been linked to changing productivity and temperature (12⇓–14), the few quantitative analyses to date have been performed at highly disparate spatial scales, either global to continental or for single fossil locations (15⇓–17). The evidence for terrestrial diversity–climate relationships from these studies is equivocal, calling into question the universality of the diversity–productivity relationship. Some of the discrepancies may arise because quantitative studies on large spatial scales have used global paleo-climate reconstructions based on marine records (13, 15, 17), which are unlikely to represent terrestrial climatic conditions adequately. Temperature also could be an indirect or secondary driver of terrestrial diversity, because present-day spatial diversity patterns are often better explained by combinations of proxy variables for energy and water availability than by temperature alone (2⇓–4).
To evaluate the mammalian diversity–productivity relationship through the Neogene, we combine Northern hemisphere mammalian fossil data for stratigraphic stages covering the Miocene and Pliocene epochs ∼23–1.8 Mya (Table S1) with regional terrestrial NPP estimates derived from fossil plant communities (18), covering 23–2.6 Mya in Europe and 17–2.6 Mya in North America (Fig. 1). Our mammalian dataset contains 14,083 fossil occurrence records for 690 genera (orders Artiodactyla, Carnivora, Perissodactyla, Primates, and Proboscidea) in 1,567 locations, divided into three North American and three European regions (Fig. 1) based on biogeographic history (12, 19). We focus on large terrestrial mammals because of their comparatively well-resolved taxonomy, their high preservation rates, and their well-sampled and comprehensive Neogene fossil record. To account for preservation and spatial sampling biases still present in the record, we estimate regional and continental mammalian γ diversity on the genus level with a first-order Jackknife approach (20) separately within each global stratigraphic stage and each continent-specific land mammal age or unit (Table S1). Terrestrial NPP is estimated from paleobotanical data accounting for temporal uncertainty, uncertainty of climatic reconstruction, and spatial structure. We evaluate the fossil relationship between NPP and γ diversity through time (i) across the two continents and (ii) across focal regions. Finally, we compare predictions from this fossil diversity–productivity relationship with observed present-day diversity and NPP to test whether the Neogene relationship has persisted into the present despite Pleistocene climate change and increasing human impact.
Spatial coverage of the Neogene paleobotanical and mammalian fossil records in focal regions (black outlines) in (A) Western, Central, and Eastern North America and (B) Western, Eastern Europe, and the Caucasus. Based on 145 paleobotanical locations (green diamonds), we estimate the NPP across each continent and within each of the three best-covered regions (Western North America and Western and Eastern Europe). The coverage of 1,567 fossil locations for large terrestrial mammals is shown as the number of localities (unique combinations of spatial location and stratigraphic stage, gray shading) in 1° latitudinal–longitudinal grid cells.
Definitions of time intervals across North America and Europe and summary statistics of mammalian and paleobotanical Neogene fossil records
Results and Discussion
The temporal dynamics of mammalian γ diversity, i.e., of the estimated regional diversity of genera, differ strongly between the two continents and across our focal regions (Fig. 2 A–H). Miocene γ diversity peaked earlier in North America (stratigraphic stage Burdigalian) than in Europe (Tortonian), a difference that has been linked to earlier drying and cooling in North America (14, 21, 22). In our terrestrial plant datasets, North America shows consistently lower NPP than Europe in the Miocene but not in the Early Pliocene (Fig. 2 I and J). Because terrestrial NPP data are available only at the resolution of stratigraphic stages, we use the stage-level mammalian diversity estimates in the following analyses. These stage-level diversity estimates generally track estimates in the more finely resolved land mammal ages (Pearson’s correlation coefficients between diversity estimates for stages and diversity estimates for the contemporary land mammal ages: r = 0.701, t = 5.29, df = 29, P < 0.001 for continents, r = 0.688, t = 8.31, df = 77, P < 0.001 for regions) (Fig. 2), although diversity is elevated in long stratigraphic stages compared with the corresponding land mammal ages (e.g., Tortonian in Europe; Fig. 2B). Because diversity estimates may partly reflect temporal turnover of genera within a stratigraphic stage, we assess the effects of temporal resolution by repeating analyses with the diversity estimates in land mammal ages, averaged within each stage.
Temporal dynamics of Neogene mammalian diversity (A–H) and net primary production (J–K) in North America and Europe. (A and I) North America continent-wide. (C) Western North America. (E) Central North America. (G) Eastern North America. (B and J) Europe continent-wide. (D) Western Europe. (F) Eastern Europe. (H) Caucasus. Patterns of γ diversity for large terrestrial mammals (genus-level, first-order Jackknife estimation) are largely consistent in global stratigraphic stages (black trend line shows natural cubic spline interpolation; vertical bars indicate SEs) and continent-specific land mammal ages (red stepped line and error bars). Only time intervals with more than five mammalian locations are shown. Present-day observed genus richness (blue squares) is markedly lower than fossil diversity. The fossil NPP estimates in the two continents within stratigraphic stages (orange and green symbols; symbol size indicates the number of grid cell values underlying the estimate; error bars indicate the entire range between average minimum and maximum values across the grid cells) were very similar with two approaches to allocating paleoclimatic estimates to stratigraphic stages, i.e., whether paleobotanical records were assigned automatically following absolute dates given in source datasets (orange) or were assigned manually according to stratigraphic information in source datasets (green). Neogene estimates were generally much higher than the present-day estimates (potential NPP, blue squares with SEs too small to see). Stratigraphic stages (see Table S1): Aquitanian (Aq); Burdigalian (Bu); Langhian (La); Serravallian (Se); Tortonian (To); Messinian (Me); Early Pliocene (EP); Late Pliocene (LP); Pleistocene (Pl).
Our analyses show a significant positive relationship of fossil mammalian γ diversity with NPP across the continents and stratigraphic stages (Fig. 3A and Table 1). We fit generalized linear mixed-effects models (GLMMs) with Poisson-distributed errors and account for the temporal and spatial data structure through random effects. Further, we account for covariates describing known effects on richness by fitting the area of the region or continent and the duration of the stratigraphic stage as fixed effects. Because of the relatively low availability of paleobotanical locations where the taxonomic composition has been analyzed and NPP could be inferred, the regional analyses are restricted to three focal regions with highest data coverage and best spatial and temporal match of mammalian and paleobotanical locations: Western North America and Western and Eastern Europe (Figs. 1 and 2). These regional analyses confirm the continental-scale results (Fig. 3B and Table 1). All patterns reported here are robust to the well-known limitations associated with the analysis of fossil data (6), because we find no or little effect (SI Materials and Methods) of range-through genera (Figs. S1 and S2), diversity estimator algorithm (Fig. S2), location definition (Fig. S3), and temporal resolution (for analyses using diversity in land mammal ages, see Table S2; for different methods of allocating paleobotanical data to stratigraphic stages, see Fig. 2 I and J). Supplemental simulations based on present-day data indicate that first-order Jackknife estimation performs well in the parameter space likely to be important for our high-quality mammalian fossil record (Fig. S4). We also estimate fossil NPP taking climatic uncertainty into account (Fig. S5) and validate the NPP model through comparisons with present-day data (Fig. S6).
Models of the fossil mammalian diversity–productivity relationship in (A) continents and (B) focal regions across stratigraphic stages in the Neogene (black symbols) and visual comparison with present-day data (gray and colored symbols). GLMMs (black continuous lines) account for temporal and spatial data structure with random effects (dotted lines) and show consistent effects of NPP on fossil γ diversity. Black symbols represent mean conditional response values for stratigraphic stages (as in Fig. 2) predicted for median values of the fixed-effect covariates (Table 1). Present-day observed data (gray symbols), data adjusted for human appropriation of NPP (blue symbols), and data adjusted for end-Pleistocene and Holocene extinctions (red symbols) fall below the fossil model predictions (dashed lines).
Model results for the fossil mammalian diversity–productivity relationship
Observed genus richness, number of range-through genera, and number of locations in the Neogene mammalian fossil record in North America (A–D) and Europe (E–H). Temporal dynamics of the number of observed mammalian genera (top of each graph, left axis) and the number of locations with mammalian fossils (bottom of each graph, right axis) were very similar. Numbers of genera are shown in global stratigraphic stages (black symbols, trend line fitted by natural cubic spline interpolation) and continent-specific land mammal ages (red stepped line), with error bars denoting the number of range-through genera (those genera with an occurrence gap for that single time interval). Numbers of locations are shown on a logarithmic scale for visual clarity, in stratigraphic stages (dark blue bars, blue outline) and in land mammal ages (light blue bars, black outline); the horizontal dotted line shows the cutoff of five locations (diversity estimation was performed only with at least six locations). Stratigraphic stages for land mammal ages (Table S1) are Aquitanian (Aq) (23.0–19.5 Mya), Burdigalian (Bu) (19.5–16.0 Mya), Langhian (La) (16.0–13.6 Mya), Serravallian (Se) (13.6–11.6 Mya), Tortonian (To) (11.6–7.2 Mya), Messinian (Me) (7.2–5.3 Mya), Early Pliocene (EP) (5.3–2.6 Mya), and Late Pliocene (LP) (2.6–1.8 Mya). (A) Continent-wide North America. (B) Western North America. (C) Central North America. (D) Eastern North America. (E) Continent-wide Europe. (F) Western Europe. (G) Eastern Europe. (H) Caucasus.
Estimation of mammalian diversity with different richness estimators. (A–J) Matrix of scatter plots comparing γ diversity estimates for large terrestrial mammals through the Neogene, generated with five different richness estimators: the two richness estimators based on occupancy within continents or regions (A–D, ACE; D–G, Chao1) and the three estimators based on presence–absence location matrices for each continent or region (C and G–I, Bootstrap; B, F, I, and J, first-order Jackknife; A, E, H, and J, Chao). (K and L) Scatter plots of the relationship of γ diversity values, which were estimated with first-order Jackknife (K) and Chao1 (L), with the sum of observed genera and range-through genera (i.e., genera with an occurrence gap for that single time interval). Each plot shows all continental (black) and regional (red) diversity estimates within each time interval with more than five mammalian locations (stratigraphic stages and land mammal ages), Pearson’s correlation coefficients across continental and regional estimates, and the 1:1 line.
Effects of location definition on the estimation of mammalian diversity. (A) Scatter plot showing the number of fossil locations that were spatially grouped into 1° grid cells (compare with Fig. 1) in Western Europe (black) and Eastern Europe (red). (B) Scatter plot showing that mammalian γ diversity estimates (first-order Jackknife) based on fossil locations and on 1° grid cells were strongly correlated in Western Europe (black) and Eastern Europe (red) across time intervals with more than five mammalian locations or grid cells (the 1:1 line is shown for comparison). (C and D) The temporal dynamics of Neogene mammalian diversity in Western Europe (C) and Eastern Europe (D) as estimated based on either the 1° grid cells (black and red lines; vertical bars indicate SEs) or on the original fossil locations (gray and orange lines with SE bars moved to the right to facilitate comparison). Patterns of γ diversity were very similar with the two location definitions and were largely consistent in stratigraphic stages (black and gray symbols; trend lines fitted by natural cubic spline interpolation) and MN units (red and orange stepped lines and error bars). For stratigraphic stages and MN units see Fig. S1 and Table S1.
Full model results for the Neogene diversity–productivity relationship in mammals
Performance assessment of the estimation of mammalian diversity with simulations based on present-day data. We randomly sampled differing numbers of locations from the set of fossil Neogene locations and used the presence–absence matrices of extant species at these selected locations to estimate mammalian γ diversity in North America (black) and Europe (red) (see SI Materials and Methods for details). The procedure was repeated 100 times (bold short horizontal lines indicate the median of simulated values; bold vertical lines indicate the interquartile range of simulated values; thin vertical lines indicate the total range between minimum and maximum simulated values). (A and B) Performance of first-order Jackknife diversity estimation, measured as the proportion of the true genera known to occur from present-day range maps (compare with Fig. S7A) that was estimated in simulations. Performance was robust (dashed line shows perfect performance) when the number of locations (A) and the proportion of genera preserved within the set of 100 locations (B) were varied. (C and D) The use of the richness estimator consistently improved the estimation of the number of living genera within each continent from different sets of locations: The number of genera observed within sampled locations (C) was usually much lower than the number observed across all locations within a continent (dotted line in C), whereas the number of genera from sampled locations estimated with Jackknife (D) was substantially closer to the number known to exist in the continent (dotted line in D).
Calculation of NPP estimates from paleobotanical data. (A) Original locations of paleobotanical records at fossil plant community locations (green dots) were summarized into 1° grid cells, and regional or continental average values were calculated across these cells (red outline). (B) Average NPP values within each grid cell were derived by generating a distribution of values for each variable, MAT and total annual precipitation (green bars symbolize ranges of paleo-climatic estimates; the gray curve shows fitted frequency distribution). From these distributions, the highest-density points for temperature [HDR(T)] and precipitation [HDR(P)] (blue dashed vertical lines) and the upper and lower limits of the intervals containing 50% of the original values (the central 50% credibility intervals, blue horizontal lines) were extracted. These values were used to calculate the NPP with the Miami model formula, generating an estimate of NPPi with a 50% credibility interval for grid cell i. (C) Grid-cell values of NPP that fell within the same stratigraphic stage were averaged on the regional or continental level by taking a weighted mean across grid cells, with the weight wi being the inverse of the credibility interval within each cell.
Global variation of present-day NPP and its human appropriation and comparison of the Miami model with a DGVM. (A–D) Global maps showing different measures of present-day NPP in 1° latitudinal–longitudinal grid cells. (A) Potential NPP calculated from temperature and precipitation with the Miami model. (B) Potential NPP derived from a DGVM. (C) Correction factor for HANPP, calculated as the proportion of potential NPP (estimated with the DGVM, B) that remains after human modification and harvest. (D) Remaining NPP adjusted for human appropriation, derived by multiplying the Miami-model NPP values by the HANPP correction factor. (E) Scatter plot showing a strong global relationship across 1° grid cells (linear regression model: adjusted R2 = 0.77, slope estimate = 1.96, P < 0.001) between present-day potential NPP values derived from the Miami model (A) and the DGVM (B). Symbols are semitransparent to show overlapping data points.
Our results provide strong support for the hypothesis that the terrestrial diversity–productivity relationship is a general pattern in ecology and paleo-ecology that persists in time and in space, at least in the Neogene across the two continents analyzed here. Our study might reconcile previous large-scale paleontological studies, e.g., those reporting that North American diversity of mammals is not consistently related to global temperature through the Cenozoic (15), even though major transitions between evolutionary faunas match periods of climate change in the same region (17, 22). This inconsistency, combined with the significant diversity–productivity relationship found here, could suggest that primary productivity is a more important or more direct driver of terrestrial mammalian diversity than temperature (3), although we do not directly compare temperature and productivity effects. In addition, the plant records in our study recovered regional variation in terrestrial NPP that could not be captured by the single global temperature curve from marine isotope data used in previous work, even though the terrestrial records have patchy spatial coverage and lower temporal resolution (18).
Next, we visually compare whether present-day diversity and NPP estimates were in agreement with the Neogene diversity–productivity relationship. We observe that present-day genus richness of large mammals in North America and Europe falls far below the predictions from the fossil relationship that has prevailed over 15–20 My (Fig. 3, gray symbols). Additionally, adjusting present-day values for both human appropriation of NPP (HANPP) (Fig. 3, blue symbols) and end-Pleistocene and Holocene extinctions (red symbols) would seem to reconcile present-day values with the fossil relationship, suggesting that increasing HANPP (8⇓–10) and the end-Pleistocene and Holocene extinctions (11, 23) have impacted the temporal diversity–productivity relationship in large mammals since the end of the Neogene. However, conclusions from these comparisons must be taken cautiously. We could not fit a combined model across fossil and present-day data points because of substantial differences, particularly in the underlying timescale: The average stratigraphic stage in the Neogene lasted 2.6 My, whereas the present-day data are a snapshot of the last 10,000 y at most. The large differences in diversity and NPP between fossil and present-day data could result from this differing timescale and mean that present-day data must be compared with fossil model predictions that are made outside the range of diversity and NPP values ever recorded in the Neogene (dashed line in Fig. 3). Nevertheless, we observe that the differences between the fossil diversity–productivity relationship and the observed present-day data points are striking (Fig. 3) and might reflect a fundamental change in the diversity–productivity relationship that occurred between the Neogene and today.
If the diversity–productivity relationship has been changed since the Neogene, we would expect the present-day relationship in space to be weakest in those regions most impacted by climatic oscillations and mammalian extinctions, such as North America and Europe. Across the globe, we find a significant present-day spatial relationship of mammalian diversity with terrestrial NPP (adjusted for human appropriation; see SI Materials and Methods and Fig. S7), in agreement with previous studies (2⇓–4). In contrast, we show that the present-day spatial relationship within the focal regions Western North America, Western Europe, and Eastern Europe is much weaker (Fig. S7), as could be expected because of climatic and anthropogenic impacts since the end of the Pliocene. Presumably, increasing HANPP in these regions has prevented a recovery from the numerous mammalian declines and extinctions that occurred in the Pleistocene and Holocene and are ongoing (8, 9) and has changed the diversity–productivity relationship through time. These results could be specific to large mammals, because they have been most strongly affected by past extinctions and experience high extinction risk today (7, 24). The applicability of our fossil and present-day diversity–productivity relationships to small mammals is unclear, because small mammals may be less susceptible to climate oscillations and have experienced fewer end-Pleistocene and Holocene extinctions (7, 13). Future studies could test the prediction that the diversity–productivity relationship through time is consistent with present-day patterns in other taxa, including those less affected by climate oscillations and human impact.
The present-day diversity–productivity relationship across space in extant large mammals. (A) Spatial variation of present-day genus richness for 861 extant species (267 genera) of large mammals (orders Artiodactyla, Carnivora, Perissodactyla, Primates, and Proboscidea) in 1° grid cells across the globe. Insets show genus richness within our focal regions (44 genera in total). (B and C) Results of generalized linear models with Poisson error distributions on a scatter plot (B) and corresponding map (C) investigating the spatial relationship between present-day genus richness (A) and NPP (adjusted for human appropriation; Fig. S6D) across different sets of 1° grid cells: within our focal regions (blue) and across the globe (gray), where we used the full NPP gradient or the fossil gradient only (darker shading, i.e., cells where the present-day NPP values lie within the limits of the NPP gradient estimated from the Neogene paleobotanical data, shown by dashed vertical lines in B). Symbols in B are semitransparent to show overlapping data points. Generalized linear models with Poisson errors showed significant and strong positive spatial relationships across all grid cells globally (continuous gray line in B: n = 12,730, residual deviance = 46,883, slope estimate = 0.0006, z = 169.5, P < 0.001) and across all cells falling within the fossil NPP gradient (black line in B: n = 4,531, residual deviance = 26,289, slope estimate = 0.0005, z = 75.1, P < 0.001). In contrast, an equivalent model across only the grid cells within our focal regions (red line in B) showed a significant but much weaker relationship (n = 1,460, residual deviance = 1,118, slope estimate < 0.0001, z = 3.2, P < 0.01). All these models were refitted accounting for spatial autocorrelation in the residuals; the results of these models were virtually identical to the models presented here (SI Materials and Methods).
Because of the large spatial and temporal scales of our diversity–productivity analysis, we cannot fully disentangle the ultimate underlying ecological and evolutionary mechanisms: Because resources drive consumer abundances and biomass, productivity could directly limit the diversity that can exist in a given region or it could influence extinction and speciation processes (5, 25). It is clear from our fossil results that productivity is not the only factor influencing diversity and that mammalian diversity does not perfectly track productivity through time. In our Neogene models, the effects of area are stronger than the effects of productivity, and the duration of the stratigraphic stage is also a significant covariate in most models. Additionally, there is a surprisingly large amount of scatter in the global present-day diversity–productivity relationship (Fig. S7). Presumably, our Neogene relationship captures the large-scale temporal transition from tropical and subtropical wet environments to much drier and colder temperate systems today (14) rather than a fine-scale temporal correlation between diversity and productivity. Also, the variability in primary productivity might have a cumulative effect, so that regions with a stable paleo-climatic history accumulate high diversity over long timespans (26). Although we did not test this possibility explicitly, the weak spatial diversity–productivity relationship in our focal regions today in comparison with the stronger global spatial relationship could support this idea, because the focal regions were influenced by glaciations until relatively recently.
SI Materials and Methods
Estimation of Mammalian γ Diversity.
We calculated mammalian γ diversity with five different estimation algorithms (Chao, first-order Jackknife, Bootstrap, Chao1, and ACE) as described in Materials and Methods in the main text and found that the resulting diversity estimates were strongly correlated across the different methods (Fig. S2 A–J). Nonetheless, we used two algorithms (first-order Jackknife and Chao) in all analyses because they represent two different treatments of occurrence data, are considered reliable estimators (40), and are comparatively weakly correlated in our analyses (Fig. S2 A–J). The first-order Jackknife estimates diversity based on the presence or absence of location matrices (Materials and Methods), and the unbiased variant of the Chao estimator (Chao1, presented below) estimates diversity based on genus occupancy within continents or regions.
Despite the differences in underlying occurrence or occupancy data, the diversity patterns of the Chao1 estimator did not differ substantially from those for the Jackknife estimator (median absolute difference in regional and continental γ diversity estimates: five genera, maximum: 65 in continents, 78 in regions; Fig. S2F). Absolute diversity had a slight tendency to be higher with Chao1 than with Jackknife (53% of time intervals). In addition, some estimated errors were extremely large with Chao1; however, differences between diversity estimates were nearly always smaller than the associated errors (in 88% of time intervals for Chao1 errors). We analyzed the diversity–productivity relationship at the continental scale and in the three regions with highest data coverage using the stratigraphic stages (because paleo-climatic estimates were available only at that temporal resolution). In the continental datasets and the selected regional datasets, differences in diversity estimates across stratigraphic stages were very small between the two methods (median difference in γ diversity was four genera). Therefore, we conclude that the γ diversity estimates included in our analyses were robust to the diversity estimator algorithm.
We evaluated how data characteristics influenced both richness estimators and quantified the performance of Jackknife diversity estimation if true genus richness is known (36, 38, 43). To this end, we present three sets of analyses to (i) assess the effect of range-through genera with the Jackknife and Chao1 richness estimators, (ii) determine any effects of the different location definition used in North American and European fossil datasets on diversity estimation with Jackknife, and (iii) assess the performance of Jackknife estimation with simulations based on present-day data.
Effects of range-through genera.
Our γ diversity estimation within each stratigraphic stage and land mammal age implicitly assumes that time intervals are independent of each other. This assumption increases the likelihood of underestimating diversity especially in short time intervals with few locations, because these often miss genera that were recorded before and after the interval but not within it. Here, we define range-through genera as those that have an occurrence gap during one or more time intervals but are present in intervals either side of the gap (note that this definition differs from that of ref. 59 and others, in which such genera are termed “Lazarus taxa,” denoted “Nr−”). Given that occurrence gaps may reflect either true fluctuations in diversity or data biases, we counted the number of range-through genera for gaps of either one or two intervals in length (note that one-interval range-through genera as defined here are equivalent to “part-timers” in ref. 59, denoted “Npt”). Then, as a simple test for underestimation of genus richness by the diversity estimation methods, we investigated whether the total genus number (the sum of observed and range-through genera) was above our γ diversity estimates in each of the time intervals.
We found that the number of range-through genera for gaps of either one or two intervals in length were usually low relative to the number of observed genera (see Fig. S1 for range-through genera with gaps of a single interval). For gaps of a single stratigraphic stage, the median number of range-through genera was 1.5 (maximum = 13) in continents and 0.5 (maximum = 11) in regions. The median number of range-through genera with single-interval gaps in continent-specific land mammal ages was 3 (maximum = 20) for continents and was 1 (maximum = 23) for regions. For gaps lasting two time intervals, the numbers decreased markedly: the median number of range-through genera was 0 (maximum = 2) in stratigraphic stages for the continental data, and 0 (maximum = 3) for the regional data. The median number of range-through genera with gaps lasting two land mammal ages was 3 (maximum = 7) for continents and was 1 (maximum = 5) for regions. Virtually all γ diversity estimates with both Jackknife and Chao1 estimators within a given time interval were above its total genus number for both continental and regional datasets (Fig. S2 K and L). The only exceptions were the stratigraphic stage Langhian and the MN05 zone in Eastern Europe (Fig. S2 K and L), where the number of locations was low and the proportion of range-through was genera high (10 locations with 19 genera observed, and 10 range-through genera in the Langhian and 8 in MN05) (Fig. S1G). However, because the numbers of range-through genera were low, especially in the stratigraphic stages, and because our diversity estimates were greater than the total genus number for almost all time intervals and regions, we argue that our diversity estimation effectively took the relatively weak effects of range-through genera into account.
Effects of different spatial location definitions in North American and European datasets.
Spatial locations in the Eurasian source dataset were the unique spatial fossil sites, but locations in North America were based on fossil sites that were grouped if they were close in space and belonged to the same stratigraphic formation (see ref. 30 for details). To assess the impact of this difference in location definition between our source datasets, we ran a sensitivity analysis across the best-sampled part of Europe (following refs. 36 and 43) and determined the extent to which diversity estimates with first-order Jackknife varied when locations are spatially grouped. We sampled all locations in Western and Eastern Europe into the 1° grid (Fig. 1). This grouping of locations is probably at a much larger spatial scale than the grouping inherent in the North American dataset, so it should represent a robust test case for the effects of spatial grouping on diversity estimation. Up to 31 locations (within a given time interval) were grouped into one grid cell (Fig. S3A), and 8,875 occurrences of 411 genera in 1,222 locations across Western and Eastern Europe in the original location dataset were transformed into 6,208 occurrences in 257 grid cells (of the same genera).
There was a strong and significant correlation between Jackknife γ diversity estimates calculated using the original and grouped locations (Pearson’s r = 0.996, t = 71.9, df = 40, P < 0.001) (Fig. S3B). Temporal diversity patterns for the two European regions (Fig. S3 C and D) were highly robust to spatial grouping (median difference between estimates across stratigraphic stages and MN units: 2 genera; maximum: 12), although the confidence intervals were slightly larger with spatial grouping (mean error: 8 genera with locations, 10 genera with grid cells). Given these results, we conclude that our comparisons of North American and European diversity estimates are not affected by differences in spatial grouping of locations between the source datasets.
Performance assessment with simulations based on present-day data.
We conducted a sensitivity analysis of Jackknife estimator performance with the present-day mammalian occurrence data (see Materials and Methods in the main text and discussion below for data description). To maintain the nonrandom spatial sampling structure of our fossil data in the present-day simulation dataset, we extracted present-day species lists by overlaying the species’ range maps with the actual locations from the Neogene fossil dataset (298 locations in North America and 1,225 in Europe which could be assigned to a single continent-specific land mammal age; compare with Fig. 1). With this dataset, we assessed the effects of varying (i) the number of locations and (ii) the preservation of genera within locations on the estimation of γ diversity. To simulate the variation in number of locations observed in our fossil record, we randomly picked 100 sets of locations within each continent consisting of 10, 50, 100, or 200 locations each. These location numbers reflect typical values from the Neogene fossil data: Stratigraphic stages contained 166 locations on average (minimum: 38 locations; maximum: 485 locations) in Europe, and 47 locations on average in North America (minimum: 21 locations; maximum: 93 locations), whereas the more finely resolved continent-specific time intervals contained fewer locations (European MN units contained 1–190 locations and an average of 86, and North American NALMAs contained 11–48 locations and 25 on average). We also explored preservation of genera, because fossil locations preserve only a subset of the genera that actually lived there. For this exploration, we randomly deleted 20, 40, or 60% of the genus occurrences in each location from our randomized subsets of 100 locations. We then calculated γ diversity with the first-order Jackknife algorithm across the 100 randomizations in each location and preservation set and compared the results with the true number of genera from the present-day range maps.
Our simulations showed a robust performance of γ diversity estimation (Fig. S4). With only 50 locations, a median of 93% of the genus richness in North America and 78% in Europe was estimated (Fig. S4A). Diversity in North America was estimated well with as few as 10 locations (median 81%); in Europe performance with 10 locations was lower (median 58%) but increased to median values of 85% and 89% with 100 and 200 locations, respectively. Random deletion of genus occurrences within each location had comparatively little effect (Fig. S4B). In North America, reducing the genus occurrences by as much as 60% led to virtually the same median estimate as that of the full set (25.98 vs. 25.99 genera). In Europe, a decrease in performance was observed with reduction of genus occurrences, but it was weak (median estimate 74% of genera estimated with 60% of occurrences deleted, vs. 85% performance with all occurrences) (Fig. S4B). Our simulations also showed that even though not all genera known to be present in the focal regions were sampled in the complete location set, we were able to approximate the correct number of genera with a reasonable number of locations (compare the ranges of estimates and the horizontal dotted lines in Fig. S4 C and D).
These results indicate that γ diversity can be well estimated, even when data are much more incomplete than our mammalian fossil data are likely to be. Our analyses of the fossil diversity–productivity relationship focused on the stratigraphic stages, which on average contained more locations than necessary to achieve a performance of above 85% in our simulations. Further, provided enough locations were selected, the effect of undetected or nonpreserved genera on diversity estimation was low, assuming random loss of genera and unbiased preservation rates. Recent evidence shows that these assumptions are reasonable for large datasets such as our mammalian fossil record (60). Although for practical and philosophical reasons we can never be sure that we are estimating the true diversity (39, 40), our evaluation of the diversity estimation clearly indicated that estimating mammalian γ diversity from the fossil occurrence data with a first-order Jackknife approach constituted a robust and appropriate way to quantify regional and continental genus richness.
Estimation of Present-Day Diversity and End-Pleistocene and Holocene Extinctions.
To calculate present-day mammalian diversity, we edited the range maps from the IUCN Red List Global Mammal Assessment 2008 (www.iucnredlist.org/initiatives/mammals) to match our taxonomy (35), as described previously (44). We excluded marine species (the pinniped families Phocidae, Otariidae, and Odobenidae, the sea otter Enhydra lutris, and the polar bear Ursus maritimus), ranges classified as “presence uncertain,” “historical range,” and “introduced,” and humans and domesticated species (Equus caballus, Bos taurus, Capra hircus, Ovis aries, Felis catus), but not wild boar (Sus scrofa), wildcat (Felis silvestris), or wolf (Canis lupus).
We compiled lists of extinct genera for our focal regions and mammalian orders from literature (11, 23). The following 14 genera went extinct within Europe (listed by order): Artiodactyla: Myotragus, Ovibos, Spirocerus, Bos, Megaloceros, Praemegaceros, Hippopotamus, and Phanourios; Carnivora: Crocuta; Perissodactyla: Coelodonta, and Stephanorhinus; Proboscidea: Elephas, Mammuthus, and Paleoloxodon. In North America, we counted 25 extinct genera: Artiodactyla: Stockoceros, Tetrameryx, Bootherium, Bos, Euceratherium, Saiga, Camelops, Hemiauchenia, Paleolama, Bretzia, Cervalces, Navahoceros, Torontoceros, Mylohyus, and Platygonus; Carnivora, Homotherium, Miracinonyx, Smilodon, Arctodus, Tremarctos, and Platygonus; Perissodactyla: Equus and Tapirus; Proboscidea: Cuvieronius, Mammut, and Mammuthus. Because our focal taxa also contained smaller-bodied carnivores that are generally not counted in estimates of megafaunal extinctions (11), these numbers are likely to underestimate the true number of extinctions for our focal taxa.
Paleobotanical Data.
Accounting for temporal uncertainty in allocation of stratigraphic stages.
We obtained paleo-climatic data for temperature and precipitation from several public datasets, all of which were compilations across many original source studies that derived paleo-climatic estimates from fossil plant communities (18, 45⇓–47). We identified and removed duplicate climatic locations (i.e., those included in several of these compilation datasets) based on the original source literature citations and on spatial and temporal proximity and overlap. All climatic records for the Pliocene were allocated to the stratigraphic stage of the Early Pliocene following previous work (46, 47). Age estimates for the other climatic records were usually given as absolute start and end times of stratigraphic stages (18, 45). Because absolute age definitions for any given stratigraphic time interval can change substantially depending on the stratigraphic reference scheme, we had to assign records from these datasets to our stratigraphic stages in a consistent fashion rather than following the original, disparate stratigraphic schemes in the different compilations, which were often not consistent with each other. We therefore allocated the non-Pliocene climatic records to our stratigraphic stage scheme (Table S1) following two different approaches to account for the uncertainty of date assignment. In both approaches, all records allocated to more than two stratigraphic stages were removed to reduce the temporal uncertainty of paleo-climatic estimates.
Our first approach (hereafter “automatic allocation”) was to assign stages automatically based on the minimum and maximum absolute age estimates given in the source data using a temporal accuracy of 0.3 Mya. To do so, we identified the uppermost and lowermost stage assigned to a climatic record as the most recent stages with start dates smaller or equal to the maximum and minimum estimated age ±0.3 Mya, respectively. The upper- and lowermost stage and all stratigraphic stages between these then were assigned to the climatic record. This first approach assigned many stages to most climatic records, so when we excluded records that were allocated to more than two stages, the automatic allocation dataset was severely reduced. As an alternative second approach (hereafter “stratigraphic allocation”), we assigned the nearest equivalent stage manually based on the stratigraphic reference scheme (or more detailed stratigraphic information if given) in the compilation datasets (18, 45). A table showing the allocated stages for each age range of paleo-climatic records is available online; a link to the files is given in Materials and Methods in the main text. In both approaches we excluded climatic records that had no age information or that were allocated to stages older than Aquitanian or younger than Early Pliocene.
After deriving the NPP from the paleo-climatic estimates (see Materials and Methods in the main text and details below), we evaluated the influence of temporal uncertainty in the assignment of stratigraphic stages on the final NPP estimates for continents and regions. We found that the temporal allocation of stratigraphic stages had little effect on the final NPP estimates, which were very similar when the automatic and stratigraphic allocation methods were compared (correlation across the stratigraphic stages: Pearson’s r = 0.96, t = 10.5, df = 10, P < 0.001) (see Fig. 2 I and J for continental estimates). The mean absolute difference in continental NPP between the two datasets was only 48.1 g dry matter⋅m−2⋅y−1 (maximum 191.8 g dry matter⋅m−2⋅y−1 in the North American Messinian, where estimates were based on two grid cells) (Fig. 2I). We therefore used the dataset based on manual stratigraphic allocation in the diversity–productivity analyses, because it was based on more original paleo-climatic data points.
Accounting for climatic uncertainty in deriving paleo-climatic estimates for each grid cell.
Averaging paleo-climatic records across whole continents or regions directly does not account for spatial variation in sampling or uncertainty of climatic values. Instead, we averaged paleo-climatic estimates across locations within 1° grid cells taking climatic uncertainty into account, calculated the NPP and its uncertainty within these grid cells, and then averaged NPP estimates across grid cells weighted by the uncertainty (the latter two steps are described in the next section). Almost all paleo-climatic records provided ranges between a minimum and a maximum climatic estimate for a given fossil plant community, and the distribution of possible climatic values usually is assumed to be uniform between these endpoints (18). We used this distribution of climatic estimates, or uncertainty range, to calculate mean climatic estimates within each grid cell (Fig. S5). We generated a distribution of climatic values for a given grid cell by allocating each climatic range between a given minimum and maximum value into bins (bin size for temperature: 0.01 °C; bin size for precipitation: 1 mm/y). The mean climatic value assigned to the cell was the highest-density point of the resulting frequency distribution of binned values (Fig. S5B). To incorporate the variation within the grid cell and the uncertainty associated with the climatic estimates, we also extracted the upper and lower limits of the credibility interval containing 50% of the binned values (Fig. S5B).
In the two datasets, i.e., automatic and stratigraphic stage allocation, between 103 and 129 grid cell-by-stratigraphic stage combinations (in 64–82 grid cells) contained only one record for either temperature or precipitation. In these cases, the mean for the grid cell was calculated as the average of the given minimum and maximum climatic value. To derive a credibility interval around this mean, the limits of the 50%-interval of climatic values were approximated as the mean ± 1 SD (this interval should contain 68.2% of values if data are normally distributed), with the SD calculated as the total range (maximum minus minimum) divided by 4 (because in a normal distribution 95% of values fall into the interval of 4 SD units around the mean). Further, a few climatic records did not supply a range of values but instead supplied point estimates: Between 13 and 16 grid cell-by-stratigraphic stage combinations (in 12–14 grid cells) contained only a single record with a climatic point estimate. In these cases, the SD was set to 1 °C or 120 mm/y to derive credibility intervals as above, based on the approximate median values across all climatic estimates given as ranges in the two datasets.
Calculation of NPP from Paleo-Climatic Estimates.
Within each grid cell that contained paleo-climatic data, we calculated the NPP with the Miami model formula (Fig. S5B). The Miami model assumes positive relationships of MAT and MAP with the NPP and that the NPP is limited by either temperature or precipitation (Fig. S5B) (51). In effect, NPP values are calculated for each of MAT and MAP separately, and the smaller value then is chosen to reflect the final NPP. In our case, final mean NPP values were based on precipitation in the majority of grid cells. The credibility interval values for NPP within a grid cell were always derived from the same variable as the mean values. Finally, regional and continental values were obtained using the mean across grid cells weighted by the climatic uncertainty (inverse of the size of the 50% credibility interval; see Materials and Methods in the main text and Fig. S5C).
Analyses of the Mammalian Diversity–Productivity Relationship.
Main models of the fossil relationship.
We ran separate analyses for continents and regions (see also Materials and Methods in the main text). These spatial units were not simple replicates at different spatial aggregation levels but instead represented the best-available current knowledge at a given spatial scale. Because of the lower data coverage in some regions, such as Caucasus and Eastern North America, we excluded these regions in regional analyses. We used GLMMs with Poisson-distributed errors because the response was a count variable (γ diversity estimates were rounded to integers to meet the model assumptions). We tested different model specifications (following ref. 56) to account for the temporal and spatial data structure (Table S2) and determined the best model based on comparison of the AIC as a random slope and intercept GLMM. This kind of model assumes that random effects act on the variance of fixed-effect estimates but not on their mean values (see Table S2 for model comparisons and formula).
We fitted random slope effects of the midpoint of the stratigraphic stage (age in Mya) within each continent or region and random intercept effects for continent or region identity. Preliminary analyses with generalized least-square models containing autoregressive terms indicated that temporal autocorrelation had no significant effect of on variance heterogeneity and that the fitted random effects in the GLMM should account for the temporal trend and the spatial structure in the data (56). The variable of interest (NPP, log-transformed), as well as the two covariates area (in square kilometers, log-transformed) and duration of the stratigraphic stage (in My, log-transformed), were fitted as fixed effects. To approximate the area for which γ diversity was estimated in each stratigraphic stage, area was measured with a convex hull around all the mammalian locations occurring in each stage (using a Mollweide projection). We accounted for area and stage duration because these parameters are known to influence diversity in fossil samples significantly even after controlling for spatial and temporal sampling biases. Correlation coefficients between the variables fitted as fixed effects were low (all absolute r < 0.34 for continents and r < 0.24 for regions), indicating that collinearity was unlikely to affect the models.
Supplemental models of the fossil relationship.
We fitted the same GLMMs to γ diversity estimates averaged across land mammal ages within each stratigraphic bin to test the effect of estimating fossil mammalian diversity in the relatively long stratigraphic stages as opposed to the finer temporal resolution of the land mammal ages (supplemental models in Table S2). Because of the strong effects of area in our models, we also ran additional models to check that the tectonic extension of Western North America during the Miocene did not drive our results. Between 15 and 5 Mya, Western North America experienced a major extension in area that is estimated to have doubled its east–west extent over that timeframe (61). The area estimates for Western North America were rescaled to approximate the extension process with the following multiplication factors: Aquitanian and Burdigalian 0.5×, Langhian 0.6×, Serravallian 0.7×, Tortonian 0.8×, Messinian 0.9×. We then fitted the GLMMs across stratigraphic stages as before and found that results of the continental analysis did not change qualitatively when area values were adjusted for the tectonic extension of North America (supplemental models in Table S2). The NPP remained as a significant predictor of mammalian γ diversity in the continental model but was nonsignificant in the regional model that accounted for tectonic extension (Table S2). We conclude that the diversity–productivity relationship found in the continental models was robust to the influence of North American tectonic changes on the covariate area, but the effect of area extension on the regional level warrants further study.
Models of the present-day spatial relationship.
The substantial differences between fossil and present-day data, especially in their sampling and underlying timescale, precluded a joint statistical analysis of the diversity–productivity relationship (see Results and Discussion in the main text). Therefore, we compared the present-day estimates with the model predictions from the fossil record only visually (Fig. 3). To evaluate the present-day diversity–productivity relationship across the globe, we followed a purely spatial approach, as commonly applied in macroecology, fitting generalized linear models with a Poisson error distribution across the 1° grid cells (Fig. S7). To check the influence of spatial autocorrelation, we refitted these models including a spatial autocovariate as additional predictor variable, which was calculated from the distance-weighted spatial correlation matrix of the residuals from the initial models that included only the NPP as predictor variable (62). The models controlling for spatial autocorrelation were virtually identical to the models presented in Fig. S7: The present-day diversity–productivity relationship across all grid cells globally was significant and strongly positive (n = 12,730, residual deviance = 6,455, slope estimate = 0.0005, z = 149.6, P < 0.001), as was the relationship across all cells falling within the fossil NPP gradient (n = 4,531, residual deviance = 3,553.9, slope estimate = 0.0006, z = 75.2, P < 0.001). The relationship across only the grid cells within our focal regions was significant but much weaker (n = 1,460, residual deviance = 112, slope estimate < 0.0001, z = 3.2, P < 0.01).
Conclusions
There has been increasing interest in reconciling paleontological and neontological perspectives on diversity (27), but this integration has been challenging because of the inherent differences in sampling, timescale, and taxonomy (6). Here we successfully use the fossil record to test an ecological pattern over geological timescales and pioneer large-scale quantitative analyses that directly link fossil occurrence datasets to terrestrial paleo-environmental proxy data. Our results suggest that general ecological rules cannot be inferred exclusively either from the geological past or from present-day data alone. Mammalian diversity and terrestrial primary production are currently much lower than over the last 23 My and seem to be inconsistent with the universal diversity–productivity relationship we find through the Neogene. This difference renders predictions of future diversity dynamics based on knowledge of past and present relationships more challenging than previously thought. In fact, accelerating human impacts strongly decrease the probability of a rebound of diversity (8, 9, 28), supporting the hypothesis that an irreversible anthropogenic state shift of the biosphere has already taken place (29).
Materials and Methods
Mammalian Fossil Data.
We extracted geo-referenced and dated fossil species and genus occurrences of nonmarine members of the mammalian orders Artiodactyla, Carnivora, Perissodactyla, Primates, and Proboscidea throughout the Miocene and Pliocene for North America (30, 31) and for Eurasia (NOW, the New and Old Worlds Database of Fossil Mammals, www.helsinki.fi/science/now/). Original data will be publicly available through the NOW database during 2016, and our cleaned datasets, processed data for analyses, and R scripts are available online (dataportal-senckenberg.de/database/metacat/bikf.10018.1/bikf). Fossil locations were only included if they could be unambiguously assigned to one time interval. The sources used two different chronologies (Table S1): the North American Land Mammal Ages (NALMA) (32) and the Mammal Neogene (MN) units (33). We evaluated mammalian diversity within these land mammal ages or units to gain a detailed view of temporal diversity dynamics, but we also combined occurrence data into a set of broader, global stratigraphic stages (34) (Table S1). Although these global stages were less well resolved in time, they were comparable across continents and matched the temporal resolution of the paleobotanical data.
We followed the taxonomy of our sources for fossil (NOW database and refs. 30 and 31) and extant species (35). The raw data were corrected on the species level for taxonomic errors, and we unified the taxonomy across the data sources (using a taxonomic look-up table available at dataportal-senckenberg.de/database/metacat/bikf.10018.1/bikf) to avoid biases in genus counts arising from synonyms. We performed all analyses at the genus level because the sampling bias inherent in the fossil record should be less influential on diversity estimates calculated at higher taxonomic levels (6). Additionally, morphological disparity at the genus level in fossil mammals has been shown to approximate disparity at the species level in extant mammals (36, 37). The final dataset contained a total of 1,688 unique species in 663 genera, plus 27 genera for which we had only genus-level occurrences (the full dataset is available at dataportal-senckenberg.de/database/metacat/bikf.10018.1/bikf). We performed analyses at two spatial extents. Continental datasets of North America and Europe included all their respective locations. For regional analyses, focal regions defined based on existing knowledge of biogeographic history (12, 19) were small enough to capture biogeographically meaningful units but were large enough to contain a sufficient number of mammalian fossil locations within the stratigraphic stages. Fig. 1 shows the final regions delimited on a 1° latitudinal–longitudinal grid.
Estimation of Mammalian γ Diversity.
The number of genera varied considerably across time intervals (Table S1) and was significantly correlated with the number of locations (r = 0.83, t = 9.8, df = 45, P < 0.001 across all time intervals for continents; r = 0.76, t = 13.3, df = 130, P < 0.001 for regions) (Fig. S1). We corrected for this sampling bias with algorithms to estimate γ diversity (i.e., the region- or continent-wide genus richness) based on the occurrence of genera (20, 38⇓–40). We applied the richness estimators Chao, Jackknife, and Bootstrap (20, 40) to a genus-by-location matrix of presences and absences for each time interval within each focal region and each continent. From these matrices, we also calculated genus-level occupancy for each subset, i.e., the number of locations where a genus was present (36). We applied the site-specific, abundance-based richness estimators of Chao1 (the unbiased variant of the Chao estimator) and Abundance-Coverage Estimator (ACE) to these occupancy data, treating a region or continent as one site (20, 39). Analyses were performed in R with the vegan package (41, 42), and estimates based on fewer than six locations were excluded.
Values of γ diversity from different estimators were strongly correlated (Fig. S2 A–J), so we present results with first-order Jackknife here (see SI Materials and Methods for details of estimator selection and results with different estimators). One central issue is that diversity in the relatively long stratigraphic stages is likely to represent signals of both standing diversity and temporal turnover of genera within a stage. We were restricted to global stratigraphic stages for comparisons between the two continents and because terrestrial NPP data were available only at that temporal resolution. To assess the effect of temporal resolution directly, we repeated analyses with the diversity estimates in the more finely resolved land mammal ages, which were then averaged within stratigraphic stages. Further, we assessed key assumptions and the performance of diversity estimation in supplemental analyses and simulations (following refs. 36, 38, and 43); see SI Materials and Methods and Figs. S2–S4).
Estimation of Present-Day Diversity and End-Pleistocene and Holocene Extinctions.
To estimate present-day γ diversity for the same five orders of large mammals, we extracted occurrences by overlaying species’ range maps with our 1° grid (Fig. 1). We edited the range maps from the International Union for the Conservation of Nature (IUCN) Red List Global Mammal Assessment 2008 (www.iucnredlist.org/initiatives/mammals) to match our taxonomy (35) as described previously (44), excluding humans, domesticated and marine species, and uncertain, historical, and introduced ranges (see SI Materials and Methods for details). The dataset included a total of 861 extant species in 267 genera across the globe and 86 species in 44 genera in our regions (Fig. S7A). To adjust for the effects of end-Pleistocene and Holocene extinctions, we compiled available lists of extinct species (11, 23), selected the species recorded for our focal regions, and cross-checked them with our extant dataset (see SI Materials and Methods for the final list). We then adjusted present-day mammalian diversity in each continent and focal region by adding the number of extinct genera to the present-day observed genus richness.
Paleobotanical Data.
Paleo-climatic data were obtained from several public sources that covered the Neogene as a whole (18), exclusively the Miocene (45), or exclusively the Pliocene (46, 47). We used terrestrial estimates of mean annual temperature (MAT) and precipitation (MAP) inferred from fossil plant communities which allowed us to calculate spatially explicit values of terrestrial NPP for each region or continent. We consider these datasets appropriate for the large temporal and spatial scales addressed here (48) and accounted for the temporal and climatic uncertainties associated with paleobotanical climate reconstructions as follows (see SI Materials and Methods and Fig. S5 for details). We allocated paleo-climatic records to our stratigraphic scheme following two different approaches to account for temporal assignment uncertainty (SI Materials and Methods) but found no substantial differences between the resulting NPP datasets (Fig. 2 I and J). To account for the spatially clumped data structure (Fig. 1), we summarized the paleo-climatic records that fell into our set of focal regions (344 records in 182 locations, or 439 location-by-stratigraphic-stage combinations, available at dataportal-senckenberg.de/database/metacat/bikf.10018.1/bikf) into the 1° grid (Fig. S5). Because paleo-climatic records often provided ranges between minimum and maximum estimates that reflect climatic uncertainty for each fossil plant community (18), we took the entire distribution of climatic estimates into account when calculating mean estimates (and 50% credibility intervals) for each grid cell (SI Materials and Methods and Fig. S5).The great majority of paleo-climatic records were from Europe (Fig. 1). The sparseness of records in North America results from the known rarity of settings suitable for the preservation of paleobotanical material in the arid Neogene there (49), and hardly any alternative terrestrial paleo-climatic records exist for our spatial and temporal scales (50). We excluded paleo-climatic data derived from Neogene paleosols in North America (49) because these showed very low spatial and temporal congruence with our data (most paleosol data were for Central North America) and because similar paleosol compilations are lacking in Europe.
Calculation of NPP from Paleo-Climatic Estimates.
We calculated NPP (in grams of dry matter per square meter per year) with the Miami model equation (Fig. S5B) (51) within each of the 1° grid cells that contained an estimate of MAT (in degrees centigrade) and an estimate of MAP (in millimeters per year). The Miami model is commonly applied to fossil data when no other NPP estimates or environmental drivers for more complex modeling are available and is considered robust at large spatial scales (52). We further demonstrated the robustness of NPP estimates from the Miami model with present-day data (see below). Our methods assume no effects of temporal changes in atmospheric CO2 levels on paleo-climatic estimation from plant fossils or on conversion of paleo-climatic values to NPP estimates, because past CO2 levels are still under debate, and recent vegetation models suggest that they are likely comparable to preindustrial levels since at least the late Miocene (50). Additionally, the influence of CO2 fertilization on paleo-climatic reconstruction is considered negligible, particularly in areas where water is not the main limiting factor (18). For each stratigraphic stage and each region and continent, we calculated weighted mean NPP based on all grid cells with both a MAT and a MAP estimate (excluding stages with only one cell). To account for uncertainty in underlying climatic estimates, we used our measure of the paleo-climatic variance within grid cells as weights, i.e., we calculated a mean that was weighted with the inverse values of the width of the 50% credibility interval from the binned distribution of original paleo-climatic estimates (SI Materials and Methods and Fig. S5C).
Present-Day NPP Data and Human Appropriation.
To obtain comparable NPP estimates for the present day, we calculated NPP with the Miami model (51) from contemporary climate records. Data on MAT and total annual precipitation from the Climate Research Unit (CRU) time-series (TS) dataset (version 3.21) for the years 1960–2010 (53) were resampled to our 1° grid. We calculated average present-day NPP within grid cells based on the arithmetic means across the 50 y (Fig. S6A) and regional and continental estimates as the average across all respective grid cells. We did not use remote-sensing data because these show actual NPP (including human impact), whereas the NPP estimated from potential vegetation is more appropriate for comparison with the fossil record. To investigate the robustness of NPP estimates, we showed that the potential NPP values derived with the Miami model correlated strongly with potential NPP estimated from a dynamic global vegetation model (DGVM) (Fig. S6 B–E). DGVMs are sophisticated models of plant population dynamics in response to abiotic parameters and perform well in the biomes covered by our focal regions (54); the DGVM used here was based on plant physiology, atmospheric CO2, climate, hydrology, and soil (10). Our comparison (Fig. S6E) showed that NPP estimates derived with the Miami model provided a realistic picture of present-day potential NPP in the absence of human impact at the global scale. Finally, we estimated the actual primary productivity available in natural ecosystems today by adjusting NPP values for HANPP with a correction factor (Fig. S6C), which was the proportion of potential NPP (modeled by the DGVM, Fig. S6B) that remains after human modification and harvest (10). Remaining NPP adjusted for human appropriation (Fig. S6D) was calculated for each grid cell by multiplying potential NPP from the Miami model with the HANPP factor.
Analyses of the Mammalian Diversity–Productivity Relationship.
We analyzed the temporal relationship of fossil γ diversity with NPP separately on the continental and regional scales and across stratigraphic stages for which we had sufficient data (more than five mammalian locations and more than one grid cell with NPP estimate), from the Aquitanian (starting 23 Mya, Europe only) or Langhian (starting 17 Mya, both continents) to the Early Pliocene (ending 2.6 Mya) (datasets and R scripts are available at dataportal-senckenberg.de/database/metacat/bikf.10018.1/bikf). We fitted GLMMs with Poisson-distributed errors using Maximum Likelihood with the lme4 package for R (55). We chose a particular model structure because it best represented the hypothesis we wanted to test, i.e., whether γ diversity was related to NPP when accounting for effects of area and duration of the time interval (38) as well as for the temporal and spatial structure in the data (see Table 1, SI Materials and Methods, and Table S2 for details). These models were the best GLMMs from a selection of possible model specifications we tested (Table S2) following a standardized protocol (56). Marginal and conditional R2 values for GLMMs were calculated with the MuMIn package (57, 58).
Acknowledgments
We thank A. Barnosky, D. Currie, T. Hickler, T. Müller, M. Lawing, S. Pauls, R. O’Hara, E.-M. Gerstner, U. Salzmann, and B. Schmid for providing data, advice, and discussion. Our work was supported by German Research Foundation Emmy Noether Fellowship DFG FR 3246/2-1 (to S.A.F.), by the sFossil workshop at the Synthesis Centre for Biodiversity Sciences (iDiv) DFG FZT 118 (to J.S. and A.M.), by European Commission Marie Skłodowska-Curie Grant FP7-PEOPLE-2012-IEF-329645 (to J.T.E. and A.M.), by the Landesoffensive zur Entwicklung wissenschaftlich-ökonomischer Exzellenz (LOEWE) funding program of Hesse’s Ministry of Higher Education, Research, and the Arts, and by the Kone Foundation (to J.T.E.). C.H.G. holds German Research Foundation Mercator Guest Professorship DFG INST 161/723-1 at Goethe University Frankfurt. This article is a contribution to the integrative Climate Change Biology (iCCB) programme.
Footnotes
- ↵1To whom correspondence should be addressed. Email: sfritz{at}senckenberg.de.
↵2Present address: BIOS Research Unit, 00510 Helsinki, Finland.
Author contributions: S.A.F., J.T.E., J.S., C.H., C.M.J., A.M., K.B.-G., and C.H.G. designed research; S.A.F., J.T.E., J.S., C.H., and C.H.G. performed research; J.T.E. and C.M.J. contributed new reagents/analytic tools; S.A.F., J.T.E., J.S., and C.H. analyzed data; and S.A.F., J.T.E., J.S., C.H., C.M.J., A.M., K.B.-G., and C.H.G. wrote the paper.
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
Data deposition: Data and R codes for analysis are available at dataportal-senckenberg.de/database/metacat/bikf.10018.1/bikf.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1602145113/-/DCSupplemental.
References
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Dirzo R, et al.
- ↵.
- Barnosky AD
- ↵
- ↵.
- Haberl H, et al.
- ↵
- ↵
- ↵
- ↵
- ↵.
- Alroy J,
- Koch PL,
- Zachos JC
- ↵.
- Badgley C, et al.
- ↵.
- Figueirido B,
- Janis CM,
- Pérez-Claros JA,
- De Renzi M,
- Palmqvist P
- ↵
- ↵.
- Bernor RL,
- Fahlbusch V,
- Mittmann H-W
- Fortelius M, et al.
- ↵.
- Colwell RK,
- Coddington JA
- ↵.
- Eronen JT, et al.
- ↵.
- Eronen JT,
- Janis CM,
- Chamberlain CP,
- Mulch A
- ↵.
- Turvey ST
- ↵.
- Turvey ST,
- Fritz SA
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Janis CM,
- Scott KM,
- Jacobs LL
- ↵.
- Janis CM,
- Gunnell GF,
- Uhen MD
- ↵.
- Woodburne MO
- ↵.
- Rössner GE,
- Heissig K
- Steininger FF
- ↵.
- Gradstein FM,
- Ogg JG,
- Schmitz MD,
- Ogg GM
- ↵.
- Wilson DE,
- Reeder DM
- ↵
- ↵
- ↵.
- Barnosky AD,
- Carrasco MA
- ↵
- ↵.
- Magurran AE,
- McGill BJ
- Gotelli NJ,
- Colwell RK
- ↵.
- R Development Core Team
- ↵.
- Oksanen J, et al.
- ↵.
- Saarinen J,
- Oikarinen E,
- Fortelius M,
- Mannila H
- ↵.
- Fritz SA,
- Purvis A
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Forrest M, et al.
- ↵.
- Lieth H,
- Whittaker RH
- Lieth H
- ↵
- ↵
- ↵
- ↵.
- Bates D, et al.
- ↵.
- Zuur AF,
- Ieno EN,
- Walker NJ,
- Saveliev AA,
- Smith GM
- ↵
- ↵.
- Bartoń K
- ↵.
- Alroy J
- ↵
- ↵.
- Mix HT,
- Mulch A,
- Kent-Corson ML,
- Chamberlain CP
- ↵