Abstract
Introductions of nonindigenous species can have significant effects. It is commonly claimed that the rate of species introductions to the United States has increased over time. This claim is based in part on the increasing rate of discoveries of introduced species. This discovery rate is influenced by factors other than the introduction rate. These include the sampling rate and population growth in the introduced species. In this article, we show that the discovery rate can increase even when there is no increase in either the introduction rate or the sampling rate. This suggests that the basis for some claims regarding an increasing rate of introductions may be invalid.
The introduction of nonindigenous species to both the terrestrial and marine environments can have significant ecological and economic effects. These introductions pose a major threat to biological diversity (1). Fifty thousand nonindigneous species are believed to exist in the United States alone, where their predation on and competition with native species has been implicated in the decline of at least 400 of 958 species on the endangered species list (2, 3). The annual damage caused by introduced species in the United States has been estimated to be more than $100 billion (3).
It is often claimed that the rate of species introductions to the United States has increased over time. This claim is based, in part, on the observation that the rate of discovery of introduced species has increased over time. For example, Cohen and Carlton (4) showed that, even after correcting for extraordinary collecting effort, the time series of the cumulative number of introduced species discovered in the San Francisco Estuary is convex, that is, increasing at an increasing rate, over the period 1850–1995; Fig. 1 is based on Cohen and Carlton's adjusted time series. On the basis of this finding, they concluded that the rate of increase of actual introductions had also increased over time.
Figure 1.
Cumulative counts of nonindigenous species in San Francisco Estuary, 1850–1995, correcting for extraordinary collection effort. This figure is based on figure 1B in ref. 4.
The discovery curve for introduced species reflects, in part, the process by which specimens are collected. In interpreting this curve, it is necessary to account for this collection process. For example, the rate at which introduced species are discovered will depend, to some extent, on collection effort. Cohen and Carlton (4) recognized this and excluded from their analysis species that were discovered through “an extraordinary collection effort.” In this article, we go a step further and use a simple model of the discovery process to show that an increasing rate of discoveries need not imply an increasing rate of introductions even when collection effort is constant. This somewhat surprising result suggests that, by itself, a convex discovery curve cannot be taken as evidence of an increasing introduction rate.
A Model of Discovery
Let the random variable Y(t) be the cumulative number of introduced species that have been discovered by time t. This random variable is given by
![]() |
1 |
where Ij(t) is the binary random variable taking value 1 if the jth introduced species is discovered by time t and 0 otherwise and n(t) is the cumulative number of species introduced by time t. It follows that the expected value of Y(t) is
![]() |
2 |
where qj(t) = 1 − prob(Ij(t) = 1) is the probability that the jth species has not been discovered through time t. We will refer to E[Y(t)] as the expected discovery curve.
To proceed, it is necessary to specify a model for qj(t). We will adopt the following simple model. The basic unit of sampling is the individual organism. Individuals are sampled at random from an effectively infinite collection of individuals representing both indigenous and introduced species. Sampling is random in the sense that, if the kth individual is sampled at time tk, then the probability that it belongs to the jth introduced species is the relative abundance πj(tk) of this species at this time. It follows that
![]() |
3 |
where r(t) is the cumulative number of individuals sampled by time t and, upon combining Eqs. 2 and 3 that
![]() |
4 |
The expression in Eq. 4 shows that, in qualitative terms, the expected discovery curve depends on the temporal patterns of (i) introductions, (ii) sampling, and (iii) relative abundances of the introduced species. As noted, interpretations of empirical discovery curves have focused almost exclusively on the pattern of introductions. Specifically, convexity of empirical discovery curves is taken as evidence that n(t) is also convex, that is, that the rate of introductions has increased over time. This conclusion is occasionally tempered by noting the potential importance of variations over time in discovery effort. However, in the next section, we show that the expected discovery curve is typically convex even when both the introduction rate and the sampling rate are not increasing.
Some Results
In this section, we present some results for the case where the rate at which individuals are sampled is constant. Specifically, we will assume that r(t) = rt and, for convenience, we will rescale time so that r = 1. This amounts to measuring time in units of the constant sampling interval. To begin with, suppose that n species are introduced at t = 1, and that no subsequent introductions take place, so that n(t) = n for all values of t. Also, suppose that these species are identical, in the sense that πj(t) = π(t) for all values of j. In this case, the expected discovery curve is
![]() |
5 |
A necessary and sufficient condition for convexity of Eq. 5 is
![]() |
6 |
for all t. Provided that the relative abundance of each introduced species increases, but remains small (so that the denominator of Eq. 6 is close to 1), over the period of observation, the expected discovery curve will be convex even with no new species introductions.
This result implies that the expected discovery curve will also be convex if species with identical growth patterns are introduced at a constant rate, so that n(t) = nt, provided that
![]() |
7 |
for all j and t > j, where πj(t) is the relative abundance at time t of a species introduced at time j for all j and t > j. This follows from the previous case because the overall expected discovery curve is the sum of expected discovery curve is the sum of expected discovery curves for the species introduced at each time. Provided that Eq. 7 holds, each of these curves will be convex and, as the sum of convex curves is also convex, so will be the overall expected discovery curve. Note that, even if all species have the same pattern of postintroduction population growth, their relative abundances will, in general, depend on the time of introduction, if only because of the growth in the number of introductions. The condition in Eq. 7 will be met if, over the period of observation, the postintroduction abundance of each introduced species remains a small fraction of, but grows faster than, the total abundance of both indigenous and introduced species. This condition is sufficient, but not necessary for convexity under a constant introduction rate. For example, suppose that there are N indigenous species with average abundance scaled to 1. At each time, n species are introduced at average abundance 1 and there is no population growth thereafter. Under these assumptions, the expected discovery curve is
![]() |
8 |
It can be shown by induction that Eq. 8 is convex provided N > 1.
We have shown that, even if the rate of species introductions is constant, the expected discovery curve can be convex both when postintroduction population growth is rapid and when there is no postintroduction population growth. It is not surprising that the same result can hold for the intermediate case in which the population of each introduced species first grows to, and then remains at, a constant level. For example, suppose that there are N = 15,000 indigenous species with average abundance of 2. Each time period (measured in months in this example), n = 2 species are introduced at abundance 0.002 and thereafter each population grows according to the logistic model with carrying capacity K = 2 and intrinsic growth rate r. In Fig. 2, expected discovery curves are shown for r = 0.1, 1, and 10. The corresponding times until the population reaches 0.99 K are 115, 11.5, and 1.15, respectively. Parameter values have been chosen to demonstrate that our model can yield discovery rates similar to those depicted by Cohen and Carlton (4). Other combinations of parameters could yield similar results. In all cases, the expected discovery curve is convex over the period of observation. As the total relative abundance of introduced species grows, there is a tendency for the expected discovery curve to become linear. For the obvious reason, this occurs faster the greater r.
Figure 2.
Expected cumulative discovery curves through time under the logistic growth model. Each curve represents a different value of the intrinsic growth rate, r.
Discussion
The goal of this article has been to show that the pattern of discoveries of introduced species may not reflect the true pattern of introductions. In broad terms, the basic result is that the expected discovery curve for introduced species will be convex even if the introduction rate is constant, provided the relative abundance of each introduced species remains small in absolute terms. This result is quite robust and, in particular, does not depend on the details of the distribution of relative abundances. Thus, after correcting for sampling effort, an increasing rate of discoveries of introduced species does not necessarily imply that the rate of introductions has also increased. This is a theoretical result and does not imply that the actual rate of introduction of species to the United States (or elsewhere) has not increased over time. Instead, it points out that this conclusion cannot be based on the convexity of the empirical discovery curve alone. To interpret the pattern of discoveries and obtain rigorous empirical estimates of the true rates of introduction would require careful analysis of both the sampling process and the population processes of introduced species beyond the scope of this article. We have shown, though, that this model can yield discovery rates that closely correspond to those empirically observed. As an alternative to detailed modeling, conclusions about the rate of introductions can be based on other types of information.
The sampling model on which our results are based is widely used, either explicitly or implicitly, in the analysis of sampled species data. Although this model is clearly extremely simple, there is nothing in this simplicity that unfairly favors these results.
Acknowledgments
We thank Tin Klanjscek for useful suggestions about the convexity results and Carol McAusland and an anonymous referee for helpful comments.
Footnotes
This paper was submitted directly (Track II) to the PNAS office.
References
- 1.Williamson M. Ecography. 1999;22:5–12. [Google Scholar]
- 2.Wilcove D, Rothstein D, Dubow J, Phillips A, Losos E. Bioscience. 1998;48:607–615. [Google Scholar]
- 3.Pimentel D, Lack L, Zuniga R, Morrison D. Bioscience. 2000;50:53–67. [Google Scholar]
- 4.Cohen A, Carlton J. Science. 1998;279:555–558. doi: 10.1126/science.279.5350.555. [DOI] [PubMed] [Google Scholar]










