Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2014 May 27;111(23):8524–8529. doi: 10.1073/pnas.1406664111

Commonness and rarity in the marine biosphere

Sean R Connolly a,1, M Aaron MacNeil b, M Julian Caley b, Nancy Knowlton c,1, Ed Cripps d, Mizue Hisano a, Loïc M Thibaut a, Bhaskar D Bhattacharya e, Lisandro Benedetti-Cecchi f, Russell E Brainard g, Angelika Brandt h, Fabio Bulleri f, Kari E Ellingsen i, Stefanie Kaiser j, Ingrid Kröncke k, Katrin Linse l, Elena Maggi f, Timothy D O’Hara m, Laetitia Plaisance c, Gary C B Poore m, Santosh K Sarkar e, Kamala K Satpathy n, Ulrike Schückel k, Alan Williams o, Robin S Wilson m
PMCID: PMC4060690  PMID: 24912168

Significance

Tests of biodiversity theory have been controversial partly because alternative formulations of the same theory seemingly yield different conclusions. This has been a particular challenge for neutral theory, which has dominated tests of biodiversity theory over the last decade. Neutral theory attributes differences in species abundances to chance variation in individuals’ fates, rather than differences in species traits. By identifying common features of different neutral models, we conduct a uniquely robust test of neutral theory across a global dataset of marine assemblages. Consistently, abundances vary more among species than neutral theory predicts, challenging the hypothesis that community dynamics are approximately neutral, and implicating species differences as a key driver of community structure in nature.

Keywords: metacommunities, marine macroecology, species coexistence, Poisson-lognormal distribution

Abstract

Explaining patterns of commonness and rarity is fundamental for understanding and managing biodiversity. Consequently, a key test of biodiversity theory has been how well ecological models reproduce empirical distributions of species abundances. However, ecological models with very different assumptions can predict similar species abundance distributions, whereas models with similar assumptions may generate very different predictions. This complicates inferring processes driving community structure from model fits to data. Here, we use an approximation that captures common features of “neutral” biodiversity models—which assume ecological equivalence of species—to test whether neutrality is consistent with patterns of commonness and rarity in the marine biosphere. We do this by analyzing 1,185 species abundance distributions from 14 marine ecosystems ranging from intertidal habitats to abyssal depths, and from the tropics to polar regions. Neutrality performs substantially worse than a classical nonneutral alternative: empirical data consistently show greater heterogeneity of species abundances than expected under neutrality. Poor performance of neutral theory is driven by its consistent inability to capture the dominance of the communities’ most-abundant species. Previous tests showing poor performance of a neutral model for a particular system often have been followed by controversy about whether an alternative formulation of neutral theory could explain the data after all. However, our approach focuses on common features of neutral models, revealing discrepancies with a broad range of empirical abundance distributions. These findings highlight the need for biodiversity theory in which ecological differences among species, such as niche differences and demographic trade-offs, play a central role.


Determining how biodiversity is maintained in ecological communities is a long-standing ecological problem. In species-poor communities, niche and demographic differences between species can often be estimated directly and used to infer the importance of alternative mechanisms of species coexistence (13). However, the “curse of dimensionality” prevents the application of such species-by-species approaches to high-diversity assemblages: the number of parameters in community dynamics models increases more rapidly than the amount of data, as species richness increases. Moreover, most species in high-diversity assemblages are very rare, further complicating the estimation of strengths of ecological interactions among species, or covariation in different species’ responses to environmental fluctuations. Consequently, ecologists have focused instead on making assumptions about the overall distribution of demographic rates, niche sizes, or other characteristics of an assemblage, and then deriving the aggregate assemblage properties implied by those assumptions (48). One of the most commonly investigated of these assemblage-level properties is the species abundance distribution (SAD)—the pattern of commonness and rarity among species (911). Ecologists have long sought to identify mechanisms that can explain common features of, and systematic differences among, the shapes of such distributions, and have used the ability to reproduce empirical SADs as a key test of biodiversity theory in species-rich systems (4, 6, 1114).

Over the last decade, one of the most prevalent and influential approaches to explaining the structure of high-diversity assemblages has been neutral theory of biodiversity (12, 15, 16). Neutral models assume that individuals are demographically and ecologically equivalent, regardless of species. Thus, variation in relative abundance among species arises purely from demographic stochasticity: chance variation in the fates of individuals (i.e., birth, death, immigration, and speciation events). Most studies investigating neutral theory aim to determine whether community structure in nature is consistent with the theory’s core species equivalence assumption. This is typically done by assessing the fit of a neutral model to empirical data, sometimes relative to a putatively nonneutral alternative (1720). However, although all neutral models share the species equivalence assumption, they differ with respect to auxiliary assumptions, such as the mode of speciation assumed, leading to different predictions for SADs and other ecological patterns. Indeed, attempts to draw conclusions from tests of neutral theory are almost invariably disputed, largely due to arguments about the extent to which alternative auxiliary assumptions can materially alter neutral theory’s ability to explain the data (11, 12, 18, 21).

An alternative, potentially more robust approach to evaluating neutral theory was proposed by Pueyo (22), based on approximating neutral and nonneutral dynamics as successively higher-order perturbations of a model for the idealized case of pure random drift in abundances. This approach predicts that a gamma distribution should approximate the distribution of species abundances for small departures for random drift, whereas assemblages exhibiting greater departures from neutrality should be better approximated by a lognormal distribution. This raises the possibility that a comparison of gamma and lognormal SADs could offer a robust test for the signature of nonneutrality in species abundance data, provided that the gamma distribution provides a sufficiently close approximation to SADs produced by neutral models.

Here, we evaluate Pueyo’s framework and apply it to patterns of commonness and rarity in 1,185 SADs from 14 marine ecosystems ranging from shallow reef platforms to abyssal depths, and from the tropics to polar regions (Fig. 1 and Tables S1 and S2). First, we test the gamma neutral approximation against several models of community dynamics that share the core neutrality assumption of species equivalence, but make different assumptions about the speciation process, spatial structure of the metacommunity, and the nature of competition between individuals. Then, we analyze the marine species abundance data, to evaluate whether they are consistent with the hypothesis that marine assemblages are neutrally structured. Finally, we ask whether patterns of commonness and rarity deviate from neutral expectation in idiosyncratic ways, or whether there are particular features of real SADs that cannot be captured by neutral models.

Fig. 1.

Fig. 1.

Sampling locations of SADs. Color and symbol combinations correspond to particular ecosystems. These symbols are reproduced in the surrounding panels, which show observed and fitted SADs for the site-level data (averaged across sites) of the corresponding ecosystem. The bars represent the mean proportion of species at each site in different octave classes of abundance, across all sites in the corresponding dataset [the first bar represents species with abundance 1, then abundances 2–3, abundances 4–7, etc. (10)]. The blue and red lines show the mean of fitted values from site-by-site fits of the Poisson-gamma and Poisson-lognormal distributions to the data, respectively.

Results

A gamma distribution of species abundances closely approximates several alternative neutral models across a broad range of neutral model parameter values (Fig. S1; see SI Results for further discussion). Moreover, the gamma consistently outperforms the lognormal when fitted to data simulated from neutral models. Specifically, as the number of distinct species abundance values in the simulated data increases, the relative support for the gamma distribution becomes consistently stronger for all of the neutral models we considered (Fig. 2A). This reflects the fact that datasets with only a small number of abundance values (e.g., a site containing 11 species, 10 of which are only represented by one individual) provide very little information about the shape of the SAD, whereas those with more abundance values provide more information (e.g., a site with 100 species whose abundances are spread over 10–20 different values).

Fig. 2.

Fig. 2.

Species abundances are better approximated by (A) a gamma distribution for simulated neutral communities, but (B) a lognormal distribution for the empirical data. Percentage support for the lognormal versus the gamma is plotted as a function of the number of observed distinct species abundances. In A, different neutral models are plotted with different colors, and each point represents a particular neutral model parameter combination from Fig. S1. In B, each combination of symbol and color represents a different marine ecosystem, whereas increasing symbol size indicates the increasing scale at which abundances were pooled (site, mesoscale, and regional).

In contrast to their relative fit to simulated neutral SADs, the lognormal consistently outperforms the gamma distribution when fitted to real marine species abundance data. When considered in terms of average support per SAD, relative support for the lognormal becomes consistently stronger as the number of observed species abundance values increases, in direct contrast with the simulated neutral data (Fig. 2B). Moreover, when the strength of evidence is considered cumulatively across all sites for each dataset, the lognormal has well over 99% support as the better model in each case (Table 1). This substantially better fit of the lognormal is retained in every case when data are pooled to the mesoscale, and, in all cases save one, when data are pooled at the regional scale (Table 1, Fig. 2B, and Figs. S2 and S3). The lognormal also remained strongly favored when we tested the robustness of our results by classifying species into taxonomic and ecological guilds, and restricting our analysis to the most species-rich guild within each dataset (see SI Results and Table S3).

Table 1.

Model selection for analysis at whole-dataset level

Site Mesoscale Regional
Dataset LN Gamma LN Gamma LN Gamma
Antarctic Isopods (ANI) >0.9999 <0.0001 >0.9999 <0.0001 >0.9999 <0.0001
Antarctic Molluscs (ANM) 0.9981 0.0019 0.9995 0.0005 0.9992 0.0008
Tuscany Archipelago Fish (TAP) >0.9999 <0.0001 0.9999 0.0001 0.9983 0.0017
Indo-Pacific Coral Crustaceans (IPC) >0.9999 <0.0001 NA NA >0.9999 <0.0001
SE Australia: Shelf Fish (SEF) >0.9999 <0.0001 >0.9999 <0.0001 >0.9999 <0.0001
W Australia: Deep Fish (WAF) >0.9999 <0.0001 >0.9999 <0.0001 >0.9999 <0.0001
Scotian Shelf Fish (SSF) >0.9999 <0.0001 >0.9999 <0.0001 0.9984 0.0016
Eastern Bass Strait Invertebrates (EBS) >0.9999 <0.0001 >0.9999 <0.0001 0.0087 0.9913
Sunderban Zooplankton (SUZ) >0.9999 <0.0001 NA NA 0.9892 0.0108
Great Barrier Reef Fish (GBR) >0.9999 <0.0001 >0.9999 <0.0001 >0.9999 <0.0001
Central Pacific Reef Fish (CPF) >0.9999 <0.0001 >0.9999 <0.0001 >0.9999 <0.0001
Norwegian Shelf Macrobenthos (NSM) >0.9999 <0.0001 >0.9999 <0.0001 >0.9999 <0.0001
North Sea Invertebrates (NSI) >0.9999 <0.0001 >0.9999 <0.0001 >0.9999 <0.0001
Bass Strait Intertidal (BSI) >0.9999 <0.0001 >0.9999 <0.0001 0.9998 0.0002
Overall >0.9999 <0.0001 >0.9999 <0.0001 >0.9999 <0.0001

Values indicate percentage support for the gamma and lognormal (LN) models fitted to the species abundance data at three scales: site level, mesoscale, and regional. Each row represents a different dataset. For IPC and SUZ, there were too few SADs to create mesoscale groupings. The last row is an overall test, based on summing the log-likelihoods across all datasets. Where one of the models has at least 95% support, the model’s weight is shown in bold.

Inspection of the lack of fit of the gamma neutral approximation indicates that it deviates from the data in highly consistent ways: real SADs exhibit substantially more heterogeneity than the gamma distribution can generate (Fig. 3). Specifically, the gamma is unable to simultaneously capture the large number of rare species and the very high abundances of the most common species. For abundance distributions lacking an internal mode (i.e., where the leftmost bar in the SAD is the largest one), this is manifested as an excess of rare species and paucity of species with intermediate abundance, relative to the best-fit neutral approximation (Fig. 3A, blue lines). Conversely, when an internal mode is present in the data, the abundances of the most highly abundant species are consistently higher than the gamma distribution can produce (Fig. 3B, blue lines). In contrast, discrepancies between the data and the lognormal are much smaller in magnitude, and more symmetrically distributed around zero, compared with the gamma (Fig. 3, red lines).

Fig. 3.

Fig. 3.

Residuals (observed minus predicted number of species) of gamma (blue lines) versus lognormal (red lines) fits, illustrating why the gamma performs worse than the lognormal. Each line represents one ecosystem at one spatial scale, with log2 abundance class normalized so that each dataset’s uppermost octave is 1.0. (A) Residuals for abundance distributions lacking an internal mode, plotted on an arithmetic scale to highlight the lack of fit for the smaller (leftmost) abundance classes. (B) Residuals for abundance distributions with an internal mode, plotted on a logarithmic scale to illustrate the lack of fit for the larger (rightmost) abundance classes.

Detailed analysis of variation in the strength of evidence against neutrality, within and among datasets, indicates that the relative performance of the lognormal over the gamma is substantially driven by the fact that the most abundant species is, on average, too dominant to be captured by the gamma neutral approximation. After controlling for the effects of the number of abundance values in the sample on statistical power, the relative abundance of the most-abundant species explained over one-half of the variation in the strength of support for the lognormal over the gamma, for site-level, mesoscale, and regional-scale abundance distributions (Table S4, Fig. S4, and SI Results). Conversely, the prevalence of rarity was a poor predictor of the strength of evidence against the gamma neutral approximation (Table S4, Fig. S5, and SI Results).

In addition to outperforming the gamma neutral approximation, tests of the absolute goodness of fit of the lognormal suggest that it approximates the observed species abundance data well. Statistically significant lack of fit (at α = 0.05) to the lognormal was detected in 4.8% of sites, approximately equal to what would be expected by chance, under the null hypothesis that the SADs are in fact lognormal. Moreover, lognormal-based estimates of the number of unobserved species in the regional species pool are realistic, and very similar to those produced by an alternative, nonparametric jackknife method that relies on presence–absence rather than abundance data (Fig. 4).

Fig. 4.

Fig. 4.

Agreement between lognormal-based and nonparametric estimates of the total number of species in the community. Points on the horizontal axis are richness estimates produced by the nonparametric jackknife, based on presence–absence data across sites. The points on the vertical axis are estimates produced by the lognormal model, fitted to the pooled regional abundance distributions. Error bars are 95% confidence intervals. The solid line is the unity line, where the lognormal and the nonparametric jackknife produce the same estimate of the number of unobserved species.

Discussion

Recently, the use of SADs to test biodiversity theory has been criticized because different species abundance models often generate very similar predictions, which can be difficult to distinguish when fitted to species abundance data (9). Consequently, some researchers have focused on other properties of assemblages, such as community similarity (12), species–area and species–time relationships (23, 24), and relationships between species traits or phylogeny and species abundance (25, 26). Such approaches are powerful when evaluating the performance of particular species abundance models. However, because models combine multiple assumptions, attributing a model’s failure to one assumption in particular, such as species equivalence, is problematic. Indeed, in the debate over neutral theory of biodiversity, studies that show failure of a neutral model (12, 25, 27) are almost invariably followed by responses showing that packaging neutrality with a different set of alternative assumptions can explain the data after all (11, 28, 29). Although the identification of alternative auxiliary assumptions that preserve a theory’s core prevents premature abandonment of a promising theory, it also can hinder progress by inhibiting the reallocation of scientific effort to more promising research programs (30). Given the proliferation of alternative theories of biodiversity (8, 14, 31, 32), identifying and testing predictions that are robust to auxiliary assumptions, and therefore better target a theory’s core assumptions, should be a high priority.

Here, we showed that, as previously hypothesized (22), a gamma distribution successfully captures features common to several models that share the core neutrality assumptions of species equivalence, but make very different auxiliary assumptions. We then found that this approximation cannot simultaneously account for the large number of very rare species, and the very high abundances of the most abundant species, in marine species abundance data. These discrepancies are highly consistent across diverse taxa, habitat types, and geographical locations, suggesting that this shortcoming of neutral theory is likely to be widespread in nature.

In contrast to the simulated neutral assemblages, analysis of the real data reveals that the lognormal distribution captures much better the observed heterogeneity in species abundances, and produces realistic estimates of the number of unobserved species at the regional scale. Lognormal SADs are generated by models that make a variety of different assumptions about how variation in abundances is determined, but all those proposed to date explicitly or implicitly incorporate niche or demographic differences among species. For instance, Engen and Lande (33) derived the lognormal from stochastic community dynamics models that allow for differences in species’ intrinsic growth rates and carrying capacities, and for arbitrary patterns of covariation in species’ responses to environmental fluctuations (4). More generally, Pueyo (22) derived the lognormal as a generic approximation for deviations from neutral community dynamics. Alternatively, niche apportionment models explain variation in species abundances based on variation in niche size, rather than stochastic population fluctuations (8). The lognormal can arise from the hierarchical apportionment of niche space due to a central limit theorem-like phenomenon (34). It can also be expected to arise more generally when there are multiple niche dimensions, and niche overlap may occur. As niche dimensions increase, the niche hypervolume becomes the product of idiosyncratically varying niche widths along an increasingly large number of niche axes. Provided abundance is proportional to the niche hypervolume (potentially along with other factors, such as body size and the proportion of resources within the niche that have not been preempted by other species), abundance will tend toward a lognormal distribution, again by the central limit theorem (13, 18).

The fairly general conditions under which lognormal distributions can arise suggests that a broad range of nonneutral characteristics, such as high-dimensional niche differences, low-dimensional life history trade-offs, or differential responses to environmental fluctuations, may give rise to approximately lognormal abundance distributions, such as those found in this study (13, 18, 22). Thus, although the comparatively poor performance of the gamma relative to the lognormal provides strong evidence against neutral dynamics as the principal driver of variation in abundances among species, the seemingly excellent fit of the lognormal distributions cannot, by itself, unambiguously identify which nonneutral processes are most responsible for the heterogeneity in species abundances in a given assemblage. Consequently, methods for apportioning the heterogeneity in species abundances among its potential causes, and which can yield inferences about species-rich assemblages, warrant further development. Some such methods, such as those based on relationships between abundance and species’ functional traits (34), or on spatiotemporal trends in relative abundances (35), explicitly predict lognormal SADs, and therefore are likely to be particularly promising.

Conclusions

Neutral theory explains variation in the abundances and distribution of species entirely as a consequence of demographic stochasticity—chance variation in the fates of individuals (15, 36). Although proponents of neutral theory have always acknowledged the existence of ecological differences between species, neutral theory assumes that those differences are overwhelmed by the phenomena that are explicitly included in neutral models (14, 36). The formulation and testing of neutral theory has drawn attention to the potential importance of demographic stochasticity as a process that contributes to differences in species abundances that are unrelated to species’ ecological traits, such as niche size or competitive ability. Such effects should be particularly important among rare species (4). Indeed, our finding that there are common features of different neutral models suggests that it can play a role as a robust null expectation, at least for some aspects of community structure (16). However, the most abundant few species often numerically dominate communities and play a disproportionately large role in community and ecosystem processes (37). We have shown that neutral theory consistently underestimates among-species heterogeneity in abundances across a broad range of marine systems. The fact that its performance is closely linked to abundances of the most common species indicates that it is the ecological dominance of these very highly abundant species that cannot be explained by neutral processes alone. Commonness itself is poorly understood, but the identities of the most common species in ecosystems tend to remain quite consistent over ecological timescales (38). Thus, the key to understanding the distribution of abundances in communities, even species-rich ones, may lie as much in understanding how the characteristics of common species allow them to remain so abundant, as in understanding the dynamics and persistence of rare species.

Materials and Methods

Approximating Neutrality.

Pueyo’s framework starts with a stochastic differential equation for random drift in population size (i.e., birth rate equals death rate, no density dependence, immigration, emigration, or speciation) and considers approximating departures from this model in terms of successively higher-order perturbations to it. Here, we take as our candidate neutral approximation the gamma distribution and, as our alternative model, the lognormal distribution. More specifically, because species abundance data are discrete, whereas the gamma and lognormal are continuous distributions, we use the Poisson-gamma (i.e., negative binomial) and Poisson-lognormal mixture distributions, as these distributions are commonly used to approximate discrete, random samples from underlying gamma or lognormal community abundance distributions (see SI Materials and Methods for further details).

To assess whether the Poisson-gamma distribution provides a good approximation to the SADs produced under neutrality, we tested it against five different neutral models: Hubbell’s original neutral model (39), a protracted speciation neutral model (21), a fission speciation model (40), an independent species model (11, 41), and a spatially explicit neutral model (42). We chose these five models because they encompass models that relax key assumptions of neutral theory as originally formulated; moreover, each of them meets a strict definition of neutrality: every individual has the same demographic rates, and the same per-capita effects on other individuals, regardless of species. We tested the approximation in two ways. First, we assess how closely (in absolute terms) the Poisson-gamma can approximate neutral abundance distributions. Second, we assess whether the Poisson-gamma outperforms the Poisson-lognormal when fitted to data generated according to neutral model assumptions (see SI Materials and Methods for details).

Empirical Data.

Data were contributed to the Census of Marine Life (CoML) project and represent a diverse range of taxa, ocean realms, depths, and geographic locations (Table S1). To be included in our analysis, contributed data needed to meet several criteria (see SI Materials and Methods for details). Where datasets included samples over multiple years from the same sites, only the most recent year of data was used. Finally, we only fitted SADs if they contained more than five distinct species abundance values, to minimize convergence problems associated with fitting species abundance models to very sparse data. However, the data from such sites were still used in the analyses that pooled abundance distributions at larger scales.

Fitting Models to Species Abundance Data.

For both the simulated neutral data, and the real species abundance data, we fitted our models using maximum-likelihood methods (see SI Materials and Methods for details). For the empirical data, in addition to fitting our species abundance models at the site level, we also fitted pooled species abundances at a mesoscale level, and at the regional (whole-dataset) level. For datasets that were spatially hierarchically organized, we used this hierarchy to determine how to pool sites at the mesoscale [e.g., for the Central Pacific Reef Fish (CPF) data, sites were nested within islands, so pooling was done to the island level]. For data that were not explicitly hierarchically organized [Antarctic Isopods (ANI), Antarctic Molluscs (ANM), Scotian Shelf Fish (SSF), Bass Strait Intertidal (BSI)], cluster analysis was used to identify mesoscale-level groupings. In two cases [Sunderban Zooplankton (SUZ), Indo-Pacific Coral Crustaceans (IPC)], there were only a few sites sampled, and no natural hierarchical structure, so these data were omitted from the mesoscale analysis.

For both the analysis of the marine species abundance data, and the analysis of the simulated neutral communities, model selection was based on Akaike weights, which are calculated from Akaike’s information criterion values and estimate the probability (expressed on a scale of 0–1) that a model is actually the best approximating model in the set being considered. Because the Poisson-gamma and the Poisson-lognormal have the same number of estimated parameters, this is equivalent to calculating model weights based on the Bayesian information criterion. For the empirical data, model selection was done at the whole-dataset level by summing the log-likelihoods for all individual sites (for the site-level analysis) or mesoscale (for the mesoscale analysis) abundance distributions for a dataset, and calculating Akaike weights based on these values (Table 1). However, this approach does not make sense for the analysis of the simulated neutral SADs, because an arbitrary degree of confidence can be obtained by simulating a large number of sites. Therefore, we instead calculated an expected level of model support on a per-SAD basis, for each neutral model and parameter combination, by calculating the mean difference in log-likelihoods across the 100 simulated datasets, and converting this mean into an Akaike weight. We examined these Akaike weights as functions of the number of distinct observed species abundance values, because we would expect our ability to distinguish between alternative models to increase as the number of distinct observed species abundance values increases. For comparison, we also calculated per-SAD Akaike weights for the marine species abundance data. This approach is less powerful than the aggregate whole-dataset comparisons shown in Table 1, but it facilitates visualization of the differences between the simulated neutral SADs (Fig. 2A) and the real marine SADs (Fig. 2B).

Analysis of Variation in Performance of Neutral Approximation.

The discrepancies between the data and the gamma neutral approximation suggest that real data exhibit too much heterogeneity in species’ abundances to be captured by the neutral approximation. To better understand this, we examined whether the relative model support varied systematically within or among datasets as a function of the prevalence of rare species, and the abundances of the most abundant species. As a measure of relative model support, we used a per-observation difference in log-likelihoods (see SI Materials and Methods for details). We first confirmed that this standardization controlled for the effect of sample size on statistical power (i.e., the trend illustrated in Fig. 2B). Then, we asked whether the variation in standardized model support was better explained by the numerical dominance of the most common species, or by the prevalence of very rare species, using mixed-effects linear models.

Testing the Absolute Fit of the Lognormal Distribution.

Goodness of fit of the lognormal distribution to the empirical data was assessed with parametric bootstrapping (see SI Materials and Methods for details). Also, for each dataset’s regional-scale SAD, we compared lognormal-based estimates of total number of species in the species pool with estimates using the nonparametric jackknife (10). See SI Materials and Methods for further details.

Supplementary Material

Supporting Information

Acknowledgments

U.S. acknowledges S. Ehrich and A. Sell for providing ship time. The authors thank all participants in the Census of Marine Life project, particularly S. Campana, M. Sogin, K. Stocks, and L. A. Zettler. They also thank R. Etienne for providing advice for obtaining numerical solutions of the fission speciation neutral model, and J. Rosindell and S. Cornell for sharing simulated neutral community data from their spatially explicit neutral model. The authors thank T. Hughes for comments on an early version of the manuscript. K.E.E. acknowledges The Norwegian Oil and Gas Association for permitting use of data. A.B. acknowledges the support of the Ministry for Science and Technology and the German Research Foundation (Deutsche Forschungsgemeinschaft) for support of the Antarctic benthic deep-sea biodiversity (ANDEEP) and ANDEEP-System Coupling (SYSTCO) expeditions, as well as five PhD positions. A.B. also thanks the Alfred-Wegener-Institute for Polar and Marine Research for logistic help, as well as the crew of the vessel and all pickers, sorters and identifiers of the extensive deep-sea material. The Census of Marine Life funded the assembly of the metadataset. Analysis of the data was made possible by funding from the Australian Research Council (to S.R.C.).

Footnotes

The authors declare no conflict of interest.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1406664111/-/DCSupplemental.

References

  • 1.Angert AL, Huxman TE, Chesson P, Venable DL. Functional tradeoffs determine species coexistence via the storage effect. Proc Natl Acad Sci USA. 2009;106(28):11641–11645. doi: 10.1073/pnas.0904512106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Levine JM, HilleRisLambers J. The importance of niches for the maintenance of species diversity. Nature. 2009;461(7261):254–257. doi: 10.1038/nature08251. [DOI] [PubMed] [Google Scholar]
  • 3.Adler PB, Ellner SP, Levine JM. Coexistence of perennial plants: An embarrassment of niches. Ecol Lett. 2010;13(8):1019–1029. doi: 10.1111/j.1461-0248.2010.01496.x. [DOI] [PubMed] [Google Scholar]
  • 4.Sæther BE, Engen S, Grøtan V. Species diversity and community similarity in fluctuating environments: Parametric approaches using species abundance distributions. J Anim Ecol. 2013;82(4):721–738. doi: 10.1111/1365-2656.12068. [DOI] [PubMed] [Google Scholar]
  • 5.Scheffer M, van Nes EH. Self-organized similarity, the evolutionary emergence of groups of similar species. Proc Natl Acad Sci USA. 2006;103(16):6230–6235. doi: 10.1073/pnas.0508024103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.MacArthur JW. On the relative abundance of species. Am Nat. 1960;94(874):25–34. [Google Scholar]
  • 7.Clark JS. Individuals and the variation needed for high species diversity in forest trees. Science. 2010;327(5969):1129–1132. doi: 10.1126/science.1183506. [DOI] [PubMed] [Google Scholar]
  • 8.Tokeshi M. Species Coexistence: Ecological and Evolutionary Perspectives. Oxford: Blackwell Science; 1999. [Google Scholar]
  • 9.McGill BJ, et al. Species abundance distributions: Moving beyond single prediction theories to integration within an ecological framework. Ecol Lett. 2007;10(10):995–1015. doi: 10.1111/j.1461-0248.2007.01094.x. [DOI] [PubMed] [Google Scholar]
  • 10.Connolly SR, Hughes TP, Bellwood DR, Karlson RH. Community structure of corals and reef fishes at multiple scales. Science. 2005;309(5739):1363–1365. doi: 10.1126/science.1113281. [DOI] [PubMed] [Google Scholar]
  • 11.Volkov I, Banavar JR, Hubbell SP, Maritan A. Patterns of relative species abundance in rainforests and coral reefs. Nature. 2007;450(7166):45–49. doi: 10.1038/nature06197. [DOI] [PubMed] [Google Scholar]
  • 12.Dornelas M, Connolly SR, Hughes TP. Coral reef diversity refutes the neutral theory of biodiversity. Nature. 2006;440(7080):80–82. doi: 10.1038/nature04534. [DOI] [PubMed] [Google Scholar]
  • 13.May RM. In: Ecology and Evolution of Communities. Cody ML, Diamond JM, editors. Cambridge, MA: Belknap Press of Harvard Univ Press; 1975. pp. 81–120. [Google Scholar]
  • 14.Hubbell SP. The Unified Neutral Theory of Biodiversity and Biogeography. Princeton: Princeton Univ Press; 2001. [DOI] [PubMed] [Google Scholar]
  • 15.Clark JS. Beyond neutral science. Trends Ecol Evol. 2009;24(1):8–15. doi: 10.1016/j.tree.2008.09.004. [DOI] [PubMed] [Google Scholar]
  • 16.Rosindell J, Hubbell SP, He FL, Harmon LJ, Etienne RS. The case for ecological neutral theory. Trends Ecol Evol. 2012;27(4):203–208. doi: 10.1016/j.tree.2012.01.004. [DOI] [PubMed] [Google Scholar]
  • 17.McGill BJ. A test of the unified neutral theory of biodiversity. Nature. 2003;422(6934):881–885. doi: 10.1038/nature01583. [DOI] [PubMed] [Google Scholar]
  • 18.Connolly SR, Dornelas M, Bellwood DR, Hughes TP. Testing species abundance models: A new bootstrap approach applied to Indo-Pacific coral reefs. Ecology. 2009;90(11):3138–3149. doi: 10.1890/08-1832.1. [DOI] [PubMed] [Google Scholar]
  • 19.Etienne RS, Olff H. Confronting different models of community structure to species-abundance data: A Bayesian model comparison. Ecol Lett. 2005;8(5):493–504. doi: 10.1111/j.1461-0248.2005.00745.x. [DOI] [PubMed] [Google Scholar]
  • 20.Muneepeerakul R, et al. Neutral metacommunity models predict fish diversity patterns in Mississippi-Missouri basin. Nature. 2008;453(7192):220–222. doi: 10.1038/nature06813. [DOI] [PubMed] [Google Scholar]
  • 21.Rosindell J, Cornell SJ, Hubbell SP, Etienne RS. Protracted speciation revitalizes the neutral theory of biodiversity. Ecol Lett. 2010;13(6):716–727. doi: 10.1111/j.1461-0248.2010.01463.x. [DOI] [PubMed] [Google Scholar]
  • 22.Pueyo S. Diversity: Between neutrality and structure. Oikos. 2006;112(2):392–405. [Google Scholar]
  • 23.Adler PB. Neutral models fail to reproduce observed species-area and species-time relationships in Kansas grasslands. Ecology. 2004;85(5):1265–1272. [Google Scholar]
  • 24.McGill BJ, Hadly EA, Maurer BA. Community inertia of Quaternary small mammal assemblages in North America. Proc Natl Acad Sci USA. 2005;102(46):16701–16706. doi: 10.1073/pnas.0504225102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ricklefs RE, Renner SS. Global correlations in tropical tree species richness and abundance reject neutrality. Science. 2012;335(6067):464–467. doi: 10.1126/science.1215182. [DOI] [PubMed] [Google Scholar]
  • 26.Bode M, Connolly SR, Pandolfi JM. Species differences drive nonneutral structure in Pleistocene coral communities. Am Nat. 2012;180(5):577–588. doi: 10.1086/667892. [DOI] [PubMed] [Google Scholar]
  • 27.Wills C, et al. Nonrandom processes maintain diversity in tropical forests. Science. 2006;311(5760):527–531. doi: 10.1126/science.1117715. [DOI] [PubMed] [Google Scholar]
  • 28.Lin K, Zhang DY, He FL. Demographic trade-offs in a neutral model explain death-rate—abundance-rank relationship. Ecology. 2009;90(1):31–38. doi: 10.1890/07-2079.1. [DOI] [PubMed] [Google Scholar]
  • 29.Chen AP, Wang SP, Pacala SW. Comment on “Global correlations in tropical tree species richness and abundance reject neutrality”. Science. 2012;336(6089):1639. doi: 10.1126/science.1222534. author reply 1639. [DOI] [PubMed] [Google Scholar]
  • 30.Lakatos I. The Methodology of Scientific Research Programmes. Cambridge, UK: Cambridge Univ Press; 1978. [Google Scholar]
  • 31.McGill BJ. Towards a unification of unified theories of biodiversity. Ecol Lett. 2010;13(5):627–642. doi: 10.1111/j.1461-0248.2010.01449.x. [DOI] [PubMed] [Google Scholar]
  • 32.Harte J. Maximum Entropy and Ecology: A Theory of Abundance, Distribution, and Energetics. Oxford: Oxford Univ Press; 2011. [Google Scholar]
  • 33.Engen S, Lande R. Population dynamic models generating the lognormal species abundance distribution. Math Biosci. 1996;132(2):169–183. doi: 10.1016/0025-5564(95)00054-2. [DOI] [PubMed] [Google Scholar]
  • 34.Sugihara G, Bersier LF, Southwood TRE, Pimm SL, May RM. Predicted correspondence between species abundances and dendrograms of niche similarities. Proc Natl Acad Sci USA. 2003;100(9):5246–5251. doi: 10.1073/pnas.0831096100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Engen S, Lande R, Walla T, DeVries PJ. Analyzing spatial structure of communities using the two-dimensional poisson lognormal species abundance model. Am Nat. 2002;160(1):60–73. doi: 10.1086/340612. [DOI] [PubMed] [Google Scholar]
  • 36.Alonso D, Etienne RS, McKane AJ. The merits of neutral theory. Trends Ecol Evol. 2006;21(8):451–457. doi: 10.1016/j.tree.2006.03.019. [DOI] [PubMed] [Google Scholar]
  • 37.Gaston KJ. Ecology. Valuing common species. Science. 2010;327(5962):154–155. doi: 10.1126/science.1182818. [DOI] [PubMed] [Google Scholar]
  • 38.Gaston KJ. Common ecology. Bioscience. 2011;61(5):354–362. [Google Scholar]
  • 39.Etienne RS, Alonso D. A dispersal-limited sampling theory for species and alleles. Ecol Lett. 2005;8(11):1147–1156. doi: 10.1111/j.1461-0248.2005.00817.x. [DOI] [PubMed] [Google Scholar]
  • 40.Etienne R, Haegeman B. The neutral theory of biodiversity with random fission speciation. Theor Ecol. 2011;4(1):87–109. [Google Scholar]
  • 41.He F. Deriving a neutral model of species abundance from fundamental mechanisms of population dynamics. Funct Ecol. 2005;19(1):187–193. [Google Scholar]
  • 42.Rosindell J, Cornell SJ. Universal scaling of species-abundance distributions across multiple scales. Oikos. 2013;122(7):1101–1111. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES