ABSTRACT
Scatterplots of biological datasets often have no‐data zones, which suggest constraint or promotion of dependent variables. Although methods exist to estimate boundary lines—that is, to fit lines to the edges of scatters of data points—there are, to our knowledge, none available to assess the significance of the areal extents of no‐data zones. Accordingly, we propose a flexible boundary line definition paired with a permutation test of the magnitude of no‐data zones—rather than testing the shape or slope of the line as current methods do. Our proposed permutation test can be used with any method of defining a boundary line. We demonstrate our approach with empirical datasets, find no‐data zones that methods such as quantile regressions fail to detect, and discuss how our approach can quantify constraint and promotion relationships that are not always apparent with other statistics.
Keywords: bivariate data, boundary line, non‐parametric statistics, quantile regression
Scatterplots of biological datasets often have no‐data zones, which suggest constraint or promotion of dependent variables by independent variables. We propose a flexible boundary line definition paired with a permutation test of the magnitude of areas without data—rather than testing the shape or slope of that line as previously developed methods do. Using empirical datasets, we demonstrate instances where—for example—quantile regressions fail to identify significant patterns.
1. Introduction
In biological datasets, one or more corners of a scatterplot are often free of data (Figure 1). Although lines can be drawn around the scatter of data points using methods such as mathematical models (Guo, Brown, and Enquist 1998; Hao et al. 2016; Li et al. 2022), partitioned regressions (Thomson et al. 1996), isolation of data points (Blackburn, Lawton, and Perry 2009), and quantile regression (Cade and Noon 2003; Cade, Terrell, and Schroeder 1999), there are to our knowledge no statistical methods available to calculate the probability that the area extents of these no‐data zones are greater than that expected by chance. A boundary line—also referred to as a constraint line, envelope (e.g., Hao et al. 2016) or limit line (Carling, Jonathan, and Su 2022)—delineates the edge of a scatter of data points as well as a zone where no data occurs. Values of the dependent variable are unlikely to occur beyond such an edge into the no‐data zone across a given range of the independent variable (Grubb 2016; Webb 1972). Since first mentioned (Webb 1972), boundary lines have been used throughout the biological sciences—particularly in ecology (e.g., Puglielli, Hutchings, and Laanisto 2021), agronomy (e.g., Evanylo and Sumner 1987; Schnug, Heym, and Achwan 1996; Walworth, Letzsch, and Sumner 1986), and forestry (e.g., Zhang et al. 2005). In all these disciplines, the probabilities of the sizes of no‐data zones are potentially of major biological significance (Guo, Brown, and Enquist 1998). This is because calculating them could assist in assessing whether the dependent variable is being constrained or promoted by the independent variable over a certain range of the independent variable (Milne, Ferguson, and Lark 2006; Walworth, Letzsch, and Sumner 1986; Webb 1972). Moreover, the presence of underlying boundary lines or triangular relationships (sensu Maller et al. 1983; Maller 1990) can be indicative of unmeasured factors—closely correlated with the independent variable—contributing to variation in the dependent variable (Grubb 2016; Hao et al. 2016; Mills et al. 2009).
In this paper, we describe: (i) a flexible and assumption‐free method to define a boundary line, delineating a no‐data zone; and (ii) a non‐parametric method to assess the significance of the position and magnitude of the no‐data zone—as opposed to assessing the shape or slope of the boundary line. Our method to assess the significance of no‐data zones is independent of the method used to define a boundary line.
2. Rationale and Related Methods
To illustrate the utility of an areal‐extent boundary line method and the features of a dataset it describes, we compare it with two other statistical methods often applied to bivariate scatterplot data: simple linear and quantile regression (Table 1). Our approach is not to present an exhaustive comparison of related methods but rather to illustrate the distinction between methods that assess the significance of the curves' parameters and methods that assess the significance of the area extent of no‐data zones. Quantile regressions—and other methods—simultaneously fit and test a line, whereas we propose a method to test the areas bound by a line. Although simple linear regression is inappropriate for defining a boundary line—as is evident below—we include its comparison as an introduction for readers unfamiliar with boundary line concepts and because it is so frequently used in ecology and broader biological sciences.
TABLE 1.
Linear regression | Quantile regression | Proposed permutation test of no‐data zones | |
---|---|---|---|
Assumptions | |||
Independent observations | Yes | Yes | Yes |
Residuals ~ N | Yes | No | Not applicable |
Linear response | Yes | No | No |
Homoscedasticity | Yes | No | No |
Sensitivity to outliers | Some | Limited | Potentially extreme |
Core estimate/statistic | Slope of mean | Slope of quantile | Area (Q) of no‐data zone |
Null hypothesis (H 0) | H 0: slope = 0 | H 0: quantile slope = 0 | H 0: |
p‐value meaning | P (slope|H 0) | P (quantile slope|H 0) | P (Q|H 0) |
Appropriate as a boundary line method | No | Yes | Yes |
Simple linear regression estimates the ‘line of best fit’—the mean value of the dependent variable conditional across the range of the independent variable, assuming the relationship follows a straight line. Although simple linear regression can be generalised to describe non‐linear relationships (e.g., with link functions), we only discuss linear forms of regression here to simplify comparison across methods. Quantile regression—an extension of linear regression (Koenker and Bassett 1978)—predicts a given quantile (𝜏) of the dependent variable conditional across the range of the independent variable. For example, if the model is specified to regress for the median (i.e., the 50% quantile; 𝜏 = 0.5), a quantile regression of the form Y = β 0 + β 1 X 1 uses the median to estimate the central tendency of the dependent variable across the range of the independent variable. When specified with high or low quantiles (e.g., 𝜏 = 0.05 or 0.95), quantile regressions describe patterns in the extremes of the response variable, not the central tendency. Indeed, in many studies, this has been used to describe the ‘edges’ of the scatter of points to test whether the independent variable constrains or promotes the dependent variable (e.g., Anderson and Jetz 2005; Horning 2012; Kelt and Van Vuren 2001; Lessin, Dyer, and Goldberg 2001; Medinski et al. 2010; Mills et al. 2006, 2009, 2013; Strong 2011; Scharf, Juanes, and Sutherland 1998). However, constraint or promotion may be evident in features other than the shape of the extremes of a scatter of data points; it may also be evident in the ‘tightness’ of such relationships. For regression lines, the spread of data can be described with confidence intervals about the line or with metrics like R 2. Additionally, the shape of such spread can be described by regressing both an upper and lower quantile separately. For example, in datasets where the 95% quantile has a steep positive slope and the 5% quantile has a relatively shallow slope, the spread of the dependent variable increases as the independent variable increases—that is, a triangular relationship.
An underlying boundary line delineates the values of the dependent variable—across the range of the independent variable—beyond which observations are unlikely to occur. For example, the observed boundary lines for the upper‐left and ‐right no‐data zones (Figure 1) describe the maximum observed Y‐values for given observed X‐values. In our proposed method, the estimate of interest is the areal extent of regions of the scatterplot with no data—as opposed to, for example, the slope of the relationship between X and Y (Table 1). These regions are defined using the corners of the scatterplot, such that the areal extent of each no‐data zone is expressed relative to that of the full dataset. With the method we propose, the probability of a given no‐data zone of the observed extent is described relative to permutations of the raw data—the p‐value is associated with the area (Q; Table 1) of the no‐data zone. This contrasts with the P‐values associated with the slope terms of linear and quantile regressions—that is, the probability that the observed trend in the dependent variable is a product of chance. In those cases, the p‐values are associated with the locations and slopes of relationships between X and Y (Table 1). Although confidence intervals about such regression lines can also be used to describe the likely range of values for the dependent variable, these intervals are based on the statistic that a regression estimates (i.e., the slope) and not an estimate of where data is likely—or unlikely—to occur. We postulate that investigating constraint or promotion effects is better done by assessing the magnitude by which the dependent variable is restricted rather than assessing the shape (i.e., slope) of that restriction.
In Figure 1, for example, it would be useful to know whether no‐data Zone A is greater in area extent than would be expected by chance. If it is greater, then it would be reasonable to conclude that the dependent variable is constrained by the independent variable—or a closely correlated variable—over the first third of the range of the independent variable. Similarly, if Zone D is greater than expected by chance, then the dependent variable is likely to be promoted by the independent variable—or a closely correlated variable—over the last third of the range of the independent variable.
3. Defining a Boundary Line
To test the significance of the area extent of a no‐data zone, a first step is delineating the edge of that no‐data zone—that is, identifying which points in the scatterplot form the ‘boundary’. Such lines can be defined with multiple approaches—for example, quantile regressions (Mills et al. 2006; Grubb 2016), Pareto fronts (Shoval et al. 2012; Sheftel et al. 2013), convex hulls, or kernel density estimation (Carmona, Pavanetto, and Puglielli 2024). The most flexible and assumption‐free definition, we propose, uses the four extreme points of the scatter—the data points with the minimum and maximum X‐ and Y‐values. To delineate such a boundary, straight lines are used to join points—progressively nearer the centre of the scatter in one direction but further from the centre in the direction perpendicular to that—starting from one of the four extreme points. The resulting boundary line is equivalent to a Pareto front. Using the boundary line for Zone A as an example, the procedure is described below.
Start with the point with the minimum X‐value.
Only considering points with X‐ and Y‐values greater than those of the starting point, select the point closest in the X‐direction to connect to the boundary.
Repeat steps 1 and 2, with the selected point as the new starting point, until there are no points with greater Y‐values remaining.
Connect the resulting sequence of points with the corner of the plot—that is, an imaginary point at the horizontal minimum and vertical maximum coordinates of the scatterplot.
The area of this polygon can then be computed. As boundary lines can be defined in other ways, we suggest that in such cases the curve described be used to delineate the no‐data zone along with the corners of the scatterplot. The significances of the areal extents of no‐data zones can then be tested with the permutation method we describe here. Using regression methods to delineate boundary lines—irrespective of the significance of the slope of that regression—has its limitations, in that the form of the relationship must be defined a priori and is forced to be a smooth line. This can result in seemingly inappropriate or uncertain fits (e.g., Figures 2b versus 3c). Kernel density estimation methods, while more flexible than regression lines, are dependent on a priori or parametric bandwidth selection (e.g., Steury et al. 2010). It is, however, noteworthy that both regression and kernel density estimation methods are less statistically sensitive to outliers than the boundary line procedure described above (Table 1).
4. Permutation Testing
As is common practice in ecology and evolutionary biology, we apply a non‐parametric test (Gotelli and Ellison 2004; Ives 2022; Pielou 1966) to our measure of interest—that is, the areal extent of a no‐data zone. In instances where parametric null distributions and observed no‐data zones need to be compared for a priori reasons, permutations can be generated from Monte Carlo simulations. For example, Díaz et al. (2016) and Puglielli, Hutchings, and Laanisto (2021) tested the significances of the observed magnitudes (in those cases, volumes) of data in trait space using Monte Carlo approaches. While those studies did not test the significances of no‐data zones, Monte Carlo simulations could be used to parametrically generate a null distribution of such zones.
Here, we propose a permutation‐based method for assessing the significance of an observed boundary line's corresponding no‐data zone (Figure 4)—irrespective of how the boundary was defined. The proposed method uses permutations of the observed data to generate a random null distribution against which the observed data is compared. Using Zone A as an example again (see Figure 5, illustrated with empirical data), the procedure is described below.
Measure the area of Zone A in the observed data.
Randomly reorder the X and Y coordinates separately, creating a new dataset of X–Y data point pairs (but comprising the same X‐ and Y‐values).
Measure the area of Zone A in the new permuted dataset.
Repeat steps 2 and 3 a total of 10,000 times, for example.
Calculate the average extent of the 10,000 Zone A areas generated in step 4.
Use a one‐tailed Z‐test (see Figure 5c) to determine the probability of the area calculated in step 1 being greater than the average area calculated in step 5.
Described mathematically, the vectors X and Y represent the coordinates for a series of observations (x i and y i ), such that the area Q obs. of a given no‐data zone (A, B, C or D) is based on some function of X obs. and Y obs., as described above (see also Figure 4a–c). Sampling without replacement, one can permute X obs. and Y obs., giving X perm. and Y perm. (Figure 4d), and determine a given no‐data zone's area (Q perm.) accordingly (Figure 4f). Doing this, for example, 10,000 times generates a set of 10,000 areas based on random permutations of the data, such that Q perm. follows a normal distribution (Figure 4g). The probability of observing an area at least as great as a given Q obs. can then be calculated using a one‐tailed Z‐test—that is, the difference between Q obs. and the mean of Q perm. (), in terms of the standard deviation of Q perm. (s):
5. Examples With Empirical Data
In addition to the empirical data used to illustrate the logic of our proposed permutation test (Figure 5) (Sankaran et al. 2005), we also demonstrate here further example applications of this test to other empirical datasets (Figure 3) (Mills et al. 2006, 2013).
When comparing soil infiltrability and water‐dispersible clay content (Figure 3a), permutation tests identify significant no‐data Zones B and C—that is, a significant lack of observations where infiltrability and water‐dispersible clay content are great (Zone B) as well as small (Zone C). While similar conclusions can be drawn using quantile regression (e.g., for Zone C; Figure 2a), the latter method estimates a smaller no‐data zone than our proposed method, with relatively wide confidence intervals.
The extents to which woody cover is constrained at different levels of rainfall, soil silicon and soil zinc (Figure 3b–d) all exemplify the case of ‘humped’ constraint–promotion datasets. Our method distinguishes between patterns of constraint in these datasets that are significant (minor amounts of rainfall and soil silicon, and large amounts of soil zinc) and illusory (large amounts of rainfall and soil silicon, and minor amounts of soil zinc) (Figure 3b–d). To test for a humped relationship with another method, we performed a quantile regression with a parabolic form between woody cover and soil silicon. This quantile regression (Figure 2b) failed to identify and highlight the significance of no‐data Zone B (as in Figure 3c). Furthermore, the non‐parametric observed boundary line in Figure 3c shows the easily undetected—yet ecologically important—pattern of decreasing woody cover as soil silicon increases beyond 45%. This contrasts with the easily detected pattern shown with the parabolic quantile regression in Figure 2b of increasing woody cover as soil silicon increases above 30%. A shortcoming of regression methods is apparent here: the form of the relationship to be fit must be explicit and chosen a priori. The inappropriate fit (Figure 2b) is specific to parabolic curves: parabolas are necessarily symmetrical, while the humped pattern of woody cover versus silicon is potentially asymmetrical. As permutation tests of observed boundary lines can be performed regardless of how they are defined, our method offers the flexibility to accommodate such cases. For example, if kernel density estimates are used to delineate the boundary line, the significance of the no‐data zone bounded by that line can also be assessed using our method.
In addition to the above demonstrations of the significance of observed no‐data zones, we note that this significance is sensitive to the spread of the rest of the dataset used in permutation tests. When applying our method to the full woody cover versus soil silicon dataset, Zone B is not significant (Figure 3c). However, Zone B is significant when only considering the woodland‐savanna subset of that data (inset in Figure 3c). Notwithstanding the value of determining the significances of no‐data zones in subsets of data, we encourage the use of our method with prudence, applying it only to subsets of data that have been predetermined in the specific context of an investigation.
6. Concluding Remarks
Our proposed permutation test of observed boundary lines could be applied to a variety of datasets, not just ecological or agronomic—for example, soil temperature dynamics (Chmura et al. 2023), physical properties of cancer cells (Naghavian et al. 2023), neurons in the visual cortex (Fişek et al. 2023), cloud cover dynamics (Vo et al. 2023), and planetary temperatures (Peterson et al. 2023). Across datasets as varied as these, this method is likely to assist in describing relationships that are not immediately apparent with regression lines.
Author Contributions
Anthony J. Mills: conceptualization (lead), formal analysis (equal), methodology (lead), visualization (supporting), writing – original draft (lead), writing – review and editing (equal). Ruan van Mazijk: data curation (lead), formal analysis (equal), methodology (supporting), software (lead), visualization (lead), writing – original draft (supporting), writing – review and editing (equal).
Conflicts of Interest
The authors declare no conflicts of interest.
Acknowledgements
We thank Martin Fey, Res Altwegg, Martin kidd, Daan Nel, Ruan de Wet, Ruan Parrot, Shantelle Parrot, Dirk Snyman and Nicholas Salonen for discussions that assisted in developing the conceptual framework presented. Our thanks also go to Gugu Mabuza, Oscar Maeyer and Rion Cuthill for their comments on a draft version of the manuscript. Lastly, we thank Giacomo Puglielli and an anonymous reviewer for their constructive feedback during the review process.
Data Availability Statement
The authors have nothing to report.
References
- Anderson, K. J. , and Jetz W.. 2005. “The Broad‐Scale Ecology of Energy Expenditure of Endotherms.” Ecology Letters 8, no. 3: 310–318. 10.1111/j.1461-0248.2005.00723.x. [DOI] [Google Scholar]
- Beitman, B. 2009. “Brains Seek Patterns in Coincidences.” Psychiatric Annals 39, no. 5: 255–264. 10.3928/00485713-20090421-02. [DOI] [Google Scholar]
- Blackburn, T. , Lawton J., and Perry J.. 2009. “A Method of Estimating the Slope of Upper Bounds of Plots of Body Size and Abundance in Natural Animal Assemblages.” Oikos 65, no. 1: 107–112. 10.2307/3544892. [DOI] [Google Scholar]
- Cade, B. , and Noon B.. 2003. “A Gentle Introduction to Quantile Regression for Ecologists.” Frontiers in Ecology and the Environment 1, no. 8: 412–420. [Google Scholar]
- Cade, B. S. , Terrell J. W., and Schroeder R. L.. 1999. “Estimating Effects of Limiting Factors With Regression Quantiles.” Ecology 80, no. 1: 311–323. 10.2307/176999. [DOI] [Google Scholar]
- Carling, P. A. , Jonathan P., and Su T.. 2022. “Fitting Limit Lines (Envelope Curves) to Spreads of Geoenvironmental Data.” Progress in Physical Geography: Earth and Environment 46, no. 2: 272–290. 10.1177/03091333211059995. [DOI] [Google Scholar]
- Carmona, C. P. , Pavanetto N., and Puglielli G.. 2024. “Funspace: An R Package to Build, Analyse and Plot Functional Trait Spaces.” Diversity and Distributions 30, no. 4: e13820. 10.1111/ddi.13820. [DOI] [Google Scholar]
- Chmura, H. E. , Duncan C., Burrell G., Barnes B. M., Buck C. L., and Williams C. T.. 2023. “Climate Change Is Altering the Physiology and Phenology of an Arctic Hibernator.” Science 380, no. 6446: 846–849. 10.1126/science.adf5341. [DOI] [PubMed] [Google Scholar]
- Díaz, S. , Kattge J., Cornelissen J. H., et al. 2016. “The Global Spectrum of Plant Form and Function.” Nature 529, no. 7585: 167–171. 10.1038/nature16489. [DOI] [PubMed] [Google Scholar]
- Evanylo, G. K. , and Sumner M. E.. 1987. “Utilization of the Boundary Line Approach in the Development of Soil Nutrient Norms for Soybean Production.” Communications in Soil Science and Plant Analysis 18, no. 12: 1379–1401. 10.1080/00103628709367906. [DOI] [Google Scholar]
- Fişek, M. , Herrmann D., Cloves M., et al. 2023. “Cortico‐Cortical Feedback Engages Active Dendrites in Visual Cortex.” Nature 617, no. 7962: 769–776. 10.1038/s41586-023-06007-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gotelli, N. J. , and Ellison A. M.. 2004. A Primer of Ecological Statistics. Sunderland, UK: Sinauer Associates. [Google Scholar]
- Grubb, P. J. 2016. “Trade‐Offs in Interspecific Comparisons in Plant Ecology and How Plants Overcome Proposed Constraints.” Plant Ecology and Diversity 9, no. 1: 3–33. 10.1080/17550874.2015.1048761. [DOI] [Google Scholar]
- Guo, Q. , Brown J., and Enquist B.. 1998. “Using Constraint Lines to Characterize Plant Performance.” Oikos 83, no. 2: 237–245. 10.2307/3546835. [DOI] [Google Scholar]
- Hao, R.‐F. , Yu D.‐Y., Wu J.‐G., Guo Q.‐F., and Liu Y.‐P.. 2016. “Constraint Line Methods and the Applications in Ecology.” Chinese Journal of Plant Ecology 40, no. 10: 1100–1109. 10.17521/cjpe.2016.0152. [DOI] [Google Scholar]
- Horning, M. 2012. “Constraint Lines and Performance Envelopes in Behavioral Physiology: The Case of the Aerobic Dive Limit.” Frontiers in Physiology 3: 31231. 10.3389/fphys.2012.00381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ives, A. R. 2022. “Random Errors Are Neither: On the Interpretation of Correlated Data.” Methods in Ecology and Evolution 13, no. 10: 2092–2105. 10.1111/2041-210X.13971. [DOI] [Google Scholar]
- Kelt, D. A. , and Van Vuren D. H.. 2001. “The Ecology and Macroecology of Mammalian Home Range Area.” American Naturalist 157: 637–645. 10.1086/320621. [DOI] [PubMed] [Google Scholar]
- Koenker, R. , and Bassett G.. 1978. “Regression Quantiles.” Econometrica 46, no. 1: 33–50. 10.2307/1913643. [DOI] [Google Scholar]
- Lessin, L. M. , Dyer A. R., and Goldberg D. E.. 2001. “Using Upper Boundary Constraints to Quantify Competitive Response of Desert Annuals.” Oikos 92, no. 1: 153–159. 10.1034/j.1600-0706.2001.920118.x. [DOI] [Google Scholar]
- Li, Z. , Guan D., Zhou L., and Zhang Y.. 2022. “Constraint Relationship of Ecosystem Services in the Yangtze River Economic Belt, China.” Environmental Science and Pollution Research 29: 12484–12505. 10.1007/s11356-021-13845-2. [DOI] [PubMed] [Google Scholar]
- Maller, R. A. 1990. “Some Aspects of a Mixture Model for Estimating the Boundary of a Set of Data.” ICES Journal of Marine Science 46, no. 2: 140–147. 10.1093/icesjms/46.2.140. [DOI] [Google Scholar]
- Maller, R. A. , de Boer E. S., Joll L. M., Anderson D. A., and Hinde J. P.. 1983. “Determination of the Maximum Foregut Volume of Western Rock Lobsters ( Panulirus cygnus ) From Field Data.” Biometrics 39: 543–551. 10.2307/2531082. [DOI] [Google Scholar]
- Medinski, T. V. , Mills A. J., Esler K. J., Schmiedel U., and Jürgens N.. 2010. “Do Soil Properties Constrain Species Richness? Insights From Boundary Line Analysis Across Several Biomes in South Western Africa.” Journal of Arid Environments 74, no. 9: 1052–1060. 10.1016/j.jaridenv.2010.03.004. [DOI] [Google Scholar]
- Mills, A. J. , Fey M. V., Donaldson J., Todd S., and Theron L.. 2009. “Soil Infiltrability as a Driver of Plant Cover and Species Richness in the Semi‐Arid Karoo, South Africa.” Plant and Soil 320: 321–332. 10.1007/s11104-009-9904-5. [DOI] [Google Scholar]
- Mills, A. J. , Fey M. V., Gröngröft A., Petersen A., and Medinski T. V.. 2006. “Unravelling the Effects of Soil Properties on Water Infiltration: Segmented Quantile Regression on a Large Data Set From Arid South‐West Africa.” Soil Research 44, no. 8: 783–797. 10.1071/SR05180. [DOI] [Google Scholar]
- Mills, A. J. , Milewski A. V., Fey M. V., Gröngröft A., Petersen A., and Sirami C.. 2013. “Constraint on Woody Cover in Relation to Nutrient Content of Soils in Western Southern Africa.” Oikos 122, no. 1: 136–148. 10.1111/j.1600-0706.2012.20417.x. [DOI] [Google Scholar]
- Milne, A. , Ferguson R., and Lark R.. 2006. “Estimating a Boundary Line Model for a Biological Response by Maximum Likelihood.” Annals of Applied Biology 149, no. 2: 223–234. 10.1111/j.1744-7348.2006.00086.x. [DOI] [Google Scholar]
- Naghavian, R. , Faigle W., Oldrati P., et al. 2023. “Microbial Peptides Activate Tumour‐Infiltrating Lymphocytes in Glioblastoma.” Nature 617, no. 7962: 807–817. 10.1038/s41586-023-06081-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peterson, M. S. , Benneke B., Collins K., et al. 2023. “A Temperate Earth‐Sized Planet With Tidal Heating Transiting an M6 Star.” Nature 617, no. 7962: 701–705. 10.1038/s41586-023-05934-8. [DOI] [PubMed] [Google Scholar]
- Pielou, E. C. 1966. “The Measurement of Diversity in Different Types of Biological Collection.” Journal of Theoretical Biology 13: 131–144. 10.1016/0022-5193(66)90013-0. [DOI] [Google Scholar]
- Puglielli, G. , Hutchings M. J., and Laanisto L.. 2021. “The Triangular Space of Abiotic Stress Tolerance in Woody Species: A Unified Trade‐Off Model.” New Phytologist 229, no. 3: 1354–1362. 10.1111/nph.16952. [DOI] [PubMed] [Google Scholar]
- Sankaran, M. , Hanan N. P., Scholes R. J., et al. 2005. “Determinants of Woody Cover in African Savannas.” Nature 438, no. 7069: 846–849. 10.1038/nature04070. [DOI] [PubMed] [Google Scholar]
- Scharf, F. S. , Juanes F., and Sutherland M.. 1998. “Inferring Ecological Relationships From the Edges of Scatter Diagrams: Comparison of Regression Techniques.” Ecology 79, no. 2: 448–460. 10.1890/0012-9658(1998)079[0448:IERFTE]2.0.CO;2. [DOI] [Google Scholar]
- Schnug, E. , Heym J., and Achwan F.. 1996. “Establishing Critical Values for Soil and Plant Analysis by Means of the Boundary Line Development System (BOLIDES).” Communications in Soil Science and Plant Analysis 27, no. 13–14: 2739–2748. 10.1080/00103629609369736. [DOI] [Google Scholar]
- Sheftel, H. , Shoval O., Mayo A., and Alon U.. 2013. “The Geometry of the Pareto Front in Biological Phenotype Space.” Ecology and Evolution 3, no. 6: 1471–1483. 10.1002/ece3.528. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shoval, O. , Sheftel H., Shinar G., et al. 2012. “Evolutionary Trade‐Offs, Pareto Optimality, and the Geometry of Phenotype Space.” Science 336: 1157–1160. 10.1126/science.1217405. [DOI] [PubMed] [Google Scholar]
- Steury, T. D. , McCarthy J. E., Roth T. C., Lima S. L., and Murray D. L.. 2010. “Evaluation of Root‐n Bandwidth Selectors for Kernel Density Estimation.” Journal of Wildlife Management 74, no. 3: 539–548. 10.2193/2008-327. [DOI] [Google Scholar]
- Strong, W. L. 2011. “Tree Canopy Effects on Understory Species Abundance in High‐Latitude Populus tremuloides Stands, Yukon, Canada.” Community Ecology 12: 89–98. 10.1556/ComEc.12.2011.1.11. [DOI] [Google Scholar]
- Thomson, J. D. , Weiblen G., Thomson B. A., Alfaro S., and Legendre P.. 1996. “Untangling Multiple Factors in Spatial Distributions: Lilies, Gophers, and Rocks.” Ecology 77, no. 6: 1698–1715. 10.2307/2265776. [DOI] [Google Scholar]
- Vo, T. T. , Hu L., Xue L., Li Q., and Chen S.. 2023. “Urban Effects on Local Cloud Patterns.” Proceedings of the National Academy of Sciences 120, no. 21: e2216765120. 10.1073/pnas.2216765120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walworth, J. , Letzsch W., and Sumner M.. 1986. “Use of Boundary Lines in Establishing Diagnostic Norms.” Soil Science Society of America Journal 50, no. 1: 123–128. 10.2136/sssaj1986.03615995005000010024x. [DOI] [Google Scholar]
- Webb, R. A. 1972. “Use of the Boundary Line in the Analysis of Biological Data.” Journal of Horticultural Science 47, no. 3: 309–319. 10.1080/00221589.1972.11514472. [DOI] [Google Scholar]
- Zhang, L. , Bi H., Gove J. H., and Heath L. S.. 2005. “A Comparison of Alternative Methods for Estimating the Self‐Thinning Boundary Line.” Canadian Journal of Forest Research 35, no. 6: 1507–1514. 10.1139/x05-070. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The authors have nothing to report.