Examination of the Effects of Curvature in Geometrical Space on Accuracy of Scaling Derived Projections of Plant Biomass Units: Applications to the Assessment of Average Leaf Biomass in Eelgrass Shoots

Héctor Echavarría-Heras; Cecilia Leal-Ramírez; Enrique Villa-Diharce; Abelardo Montesinos-López

doi:10.1155/2019/3613679

. 2019 Apr 23;2019:3613679. doi: 10.1155/2019/3613679

Examination of the Effects of Curvature in Geometrical Space on Accuracy of Scaling Derived Projections of Plant Biomass Units: Applications to the Assessment of Average Leaf Biomass in Eelgrass Shoots

Héctor Echavarría-Heras ^1,^✉, Cecilia Leal-Ramírez ¹, Enrique Villa-Diharce ², Abelardo Montesinos-López ³

PMCID: PMC6507111 PMID: 31179319

Abstract

Conservation of eelgrass relies on transplants and evaluation of success depends on nondestructive measurements of average leaf biomass in shoots among other variables. Allometric proxies offer a convenient way to assessments. Identifying surrogates via log transformation and linear regression can set biased results. Views conceive this approach to be meaningful, asserting that curvature in geometrical space explains bias. Inappropriateness of correction factor of retransformation bias could also explain inconsistencies. Accounting for nonlinearity of the log transformed response relied on a generalized allometric model. Scaling parameters depend continuously on the descriptor. Joining correction factor is conceived as the partial sum of series expansion of mean retransformed residuals leading to highest reproducibility strength. Fits of particular characterizations of the generalized curvature model conveyed outstanding reproducibility of average eelgrass leaf biomass in shoots. Although nonlinear heteroscedastic regression resulted also to be suitable, only log transformation approaches can unmask a size related differentiation in growth form of the leaf. Generally, whenever structure of regression error is undetermined, choosing a suitable form of retransformation correction factor becomes elusive. Compared to customary nonparametric characterizations of this correction factor, present form proved more efficient. We expect that offered generalized allometric model along with proposed correction factor form provides a suitable analytical arrangement for the general settings of allometric examination.

1. Introduction

The model of relative growth of Huxley [1] is formally stated by means of a scaling relationship of the form

\begin{matrix} w = β a^{α}, \end{matrix}

(1)

where w and a are measurable traits and the parameter α is designated as the allometric exponent, while β is identified as the normalization constant. This model, also termed equation of simple allometry, has been extensively used in research problems in biology [1–5], physics [6], economics [7], earth sciences [8], resource management, and conservation [9, 10], among other fields.

Eelgrass provides nursery for waterfowl and fish species. By trapping sediment and stumping wave energy, this seagrass promotes shoreline stabilization. Eelgrass services also include nutrient recycling, water filtration, and carbon dioxide removal. Current anthropogenic influences threaten eelgrass permanence. Conservation efforts rely on plot transplanting in a fundamental way. Monitoring effectiveness depends on measurements of standing stock and productivity through time. This makes the assessment of average leaf biomass in shoots a necessary input. But traditional estimation of eelgrass leaf biomass units relies on destructive methods. This could alter shoot density in a developing transplant. Thus evaluation renders indirect assessment methods necessary [10, 11]. Results show that an allometric scaling of the form of (1) for eelgrass leaf biomass w and associated area a is consistent [11]. Derived projections of individual leaf biomass convey useful surrogates for mean leaf biomass in shoots. Moreover, estimates of the parameters α and β are invariant within a given geographical region [10–12]. Hence, estimates fitted at site can endow suitable projections of leaf biomass values currently observed in other places of the region. This bears the referred allometric projections of a convenient nondestructive feature ([11, 13]).

Simplicity of (1) makes allometric projection of eelgrass leaf biomass units convenient. But there are caveats on dependability. For instance, even though α and β are invariant, environmental influences can induce a relative extent of variability on local estimates ([12, 14]). Besides, the response in the power function-like scaling of (1) is very sensitive to variation of parameter estimates. Then, accuracy of derived proxies is subject of error propagation of estimates. In addition, there are factors of biological scaling which can influence precision of estimates (e.g., [10, 11, 13, 15]). Packard [16] questioned results of Mascaro et al. [17] on allometric examination, and Mascaro et al. [18] responded to criticism. Going over this exchange highlights the relevance of procedural factors in determining precision of parameter estimates of allometric scaling. It also offers a convenient framework for the aims of the present research.

An important factor influencing precision of estimates of allometric parameters is analysis method. A widespread approach is the traditional analysis method of allometry (TAMA hereafter). It relies on a log transformation of data in arithmetical scale in order to contemplate a linear regression model in geometrical scale. Then, the fitted line is back-transformed to yield a two-parameter power function in the original scale. But embracing this procedure fuels a vivid unresolved debate. Views assert that this protocol can lead to biased results (e.g., [16, 19–29]). And other practitioners consistently wed to the idea that logarithmic transformations are necessary (e.g., [18, 30–39]). An alternative to the TAMA approach is using nonlinear regression methods in the direct scale of the data [16]. Echavarria-Heras et al. [11] concluded that producing allometric projections of average leaf biomass in eelgrass shoots must rely on this protocol. Yet a direct nonlinear regression approach in allometry is also not unfailing. For instance, inadequate identification of the inherent error structure can lead to significant bias [18]. Besides, Lai et al. [30] found that estimates of allometric parameters fitted by nonlinear regression can exhibit a high sensitivity to the largest values of the covariate. Therefore, evaluation of analysis method suitability in acquiring consistent eelgrass leaf biomass proxies needs revision.

The adoption of methods of curvature in geometrical space could offer a way to overcome inadequacy of the TAMA procedure ([27, 40–42]). In particular, it is pertinent to examine if taking curvature into account leads to improved accuracy of eelgrass leaf biomass proxies. But, according to Mascaro et al [18], curvature could manifest because of methodological factors of data gathering. Thus, an examination of the effect of curvature in eelgrass leaf biomass allometry must also take into account a possible participation of data quality effects. Mascaro et al. [18] reminds on three ways of handling curvilinearity in geometrical space. One is by separating data to contemplate different local linear models to account for heterogeneity of effects of the covariate [43–45]. A second one is by fitting a polynomial model [46–49]. A third approach endorses direct nonlinear regression assuming a heteroscedastic error structure as contemplated by Mascaro et al. [17]. Either approach above bears complexity beyond the linear model in geometrical space that associates with the customary bivariate power function of allometry. This suggested putting forward a generalized allometric model intended to deal with curvature in geometrical space. This paradigm incorporates parameters that change as continuous functions of the log transformed covariate ([27, 39, 46]). As we explain further on, the curvature arrangements recommended by Mascaro et al. [18] can be all derived from the offered formalization. Moreover, a nonzero intercept power function that Packard [50] recommends to handle curvilinearity in geometrical space also derives from the presented generalized scaling model.

But any scheme addressing curvature in geometrical space depends on a factor for correction of bias of retransformation of the regression error. In the general settings, if ϵ stands for the regression error, then the said correction factor, through denoted using the symbol δ(ϵ), is given by the mean of the exponentiated error random variable; that is, δ(ϵ) = E(e^∈) [51–53]. Furthermore, the TAMA approach relies on the essential assumption that ϵ is additive, normally distributed, and homoscedastic [33]. When this happens, δ(ϵ) takes its lognormal-mean form [51–53]. But if ϵ fails to be normally distributed, there are two possibilities. If the distribution of ϵ is known, we could derive a closed form for δ(ϵ). In turn, if the error distribution is not identified a priori, a widespread approach is taking δ(ϵ) in the nonparametric form given by the smearing estimator of bias of Duan [54]. Still, there are provisions on this. A smearing estimate form can fail to compensate the downward retransformation bias of logged data ([53, 55, 56]). Thus, in a circumstance where ϵ is unspecified, characterizing δ(ϵ) seems elusive. Here, we put forward an arrangement for δ(ϵ) aimed to get around this circumstance. Zeng and Tang [57] proposed a nonparametric alternate to the smearing form. It matches the first three terms' partial sum of a power series expression of E(e^∈), assuming E(ϵ) = 0. Form suggested here corresponds to a generalization of this construct. It does not abide the restriction E(ϵ) = 0 and matches an n −terms partial sum approximation of the exponential series representation of E(e^∈). The partial sum that maximizes reproducibility strength of retransformed mean response sets criterion to choose n.

Present results show that a consideration of curvature in geometrical space, as well as a suitable characterization of the correction factor of retransformation bias, offers consistent allometric proxies of observed mean leaf biomass in eelgrass shoots. Hence, contrary to views asserting direct nonlinear regression as mandatory in allometric examination, our findings validate a parallel reliability of log transformation based methods. This is well in line with claims of Mascaro et al. [18] and many others about blaming the use of logarithms of incongruent results in allometric analysis. Moreover, keeping the analysis in geometrical space unraveled heterogeneity in the inherent leaf biomass scaling pattern. This could not be achieved by clinging to direct nonlinear regression in arithmetic space as the only valid approach of allometric examination. Offered analytical arrangement is expected to be applicable in the general settings of allometry.

2. Materials and Methods

2.1. Data

For the aims of the present research, we relied on an extensive eelgrass data set collected in San Quintin Bay, a coastal lagoon on the Pacific side of the Baja California Peninsula, México (30°30' N – 116°00′W), and through a 13 months' long sampling period covering a whole-year cycle. Data composes measurements of length (mm), width (mm), and dry weight w (g) of a total of 10412 individual eelgrass leaves taken from 20 randomly thrown 400cm² quadrats every monthly visit to the site. A sampling visit will be further referred to as “sampling time” in the text. The length times width proxy [11] provided estimations of leaf area a (mm²). In order to test for methodological influences of data gathering, we processed raw data set according to a mean plus or minus two standard deviations outlier's removal procedure [58]. Appendix A presents results of an exploratory analysis of data.

2.2. Models

As above specified symbols, w and a stand for the biomass of an individual eelgrass leaf and its respective area one to one. Echavarría-Heras et al. [11] assert that these variables can be related through the bivariate allometric model of (1). One procedure to acquire estimates for the parameters α and β is fitting directly in arithmetical scale a nonlinear homoscedastic regression model. Besides, we can use a TAMA approach, that is, fitting the linear regression model

\begin{matrix} v = \ln β + α u + ϵ, \end{matrix}

(2)

where v = ln⁡w, u = ln⁡a, and ϵ is a random error term assumed to be normally distributed with zero mean and variance σ², that is, ϵ ~ N(0, σ²).

We conceive curvature in geometrical space as a circumstance where fitting results of regression model of (2) are inconsistent. Dealing with this situation amounts for considering complexity beyond incorporated by (1). One possible approach to address curvature is assuming that scaling parameters α and β in (2) depend continuously on the covariate ([27, 39, 46]). This is consistent with the generalized allometric model,

\begin{matrix} w = β (a) a^{α (a)}, \end{matrix}

(3)

with β(a) and α(a) intended to be continuous and differentiable functions defined on R⁺ and with β(a) being positive. Certainly a log transformation v = ln⁡w, u = ln⁡a of (3) establishes the regression model

\begin{matrix} v = v_{C} (a, u) + ϵ, \end{matrix}

(4)

where

\begin{matrix} v_{C} (a, u) = \ln β (a (u)) + α (a (u)) u, \end{matrix}

(5)

where ϵ, a residual error term, is conceived as a random variable that in the general settings is ψ-distributed with mean μ and variance set by a function σ²(u) of the covariate u, that is, ϵ ~ ψ(μ, σ²(u)).

Setting β(a(u)) = β and α(a(u)) = α with α and β constants reduces (4) to the regression model of (2). In Appendix B, we explain that (3) accommodates all curvature paradigms suggested by Mascaro et al. [18]. These include a biphasic and a polynomial model in geometrical space, as well as the nonlinear heteroscedastic model referred to by Mascaro et al. [17] in direct arithmetical space. Moreover, as shown in Appendix B, the three-parameter power function chosen as an alternate standard for curvature [59] also derives from (3).

2.2.1. Biphasic Model in Geometrical Space

In order to characterize the model of (3) in a biphasic mode, we let β(a) = β_B(a) and α(a) = α_B(a), such that

\begin{matrix} α_{B} (a) = \sum_{1}^{2} ϑ_{i} (u (a)) α_{i} \\ β_{B} (a) = \exp (\sum_{1}^{2} ϑ_{i} (u (a)) \ln β_{i}) \end{matrix}

(6)

including parameters β_i and α_i and the function ϑ_i(u(a)) given by

\begin{matrix} ϑ_{i} (u (a)) = \{\begin{matrix} (2 - i) (1 - H (u - u_{b})) & for i = 1 \\ (i - 1) H (u - u_{b}) & for i = 2, \end{matrix} \end{matrix}

(7)

for i = 1,2. H(u − u_b) is a Heaviside function H(z) [60], evaluated at z = u − u_b and correspondingly u_b = ln⁡a_b, with a_min ≤ a_b ≤ a_max being a point separating growth phases u ≤ u_b and u > u_b. Then, denoting by means of v_B(a, u) the resulting form of v_C(a, u) from (4), we get the biphasic regression model

\begin{matrix} v = v_{B} (a, u) + ϵ, \end{matrix}

(8)

where ϵ is a random error term as defined in (4) and

\begin{matrix} v_{B} (a, u) = \sum_{1}^{2} ϑ_{i} (u (a)) f_{i} (u (a)) \end{matrix}

(9)

with

\begin{matrix} f_{i} (u (a)) = \ln β_{i} + α_{i} u (a), \end{matrix}

(10)

where β_i and α_i for i = 1,2 parameters are to be estimated from data.

2.2.2. Polynomial Model in Geometrical Space

Similarly, assume that α(a) = α_P(a) and β(a) = β_P(a), with

\begin{matrix} α_{P} (a) = \sum_{0}^{n} {α_{k} u (a)}^{k} \\ β_{P} (a) = e^{\sum_{0}^{n} {β_{k} u (a)}^{k}} . \end{matrix}

(11)

α_k and β_k for 1 ≤ k ≤ m are coefficients; one can acquire a polynomial representation v_P(a, u) for the generalized mean response function in geometrical space v_C(a, u). This way the polynomial form of regression (4) becomes

\begin{matrix} {v = v}_{P} (a, u) + ϵ, \end{matrix}

(12)

where ϵ is a random error term as defined in (4), with

\begin{matrix} v_{P} (a, u) = \sum_{0}^{m} {p_{k} u (a)}^{k}, \end{matrix}

(13)

and p_k, for k = 0,2,…, m parameters.

2.2.3. Nonlinear Heteroscedastic Model in Arithmetical Space

As we explain ahead, direct algebraic manipulation of (3) leads to the consideration of the nonlinear heteroscedastic regression model addressed by Mascaro et al. [17]; namely,

\begin{matrix} w = {β_{θ} a}^{α_{θ}} + a^{θ} ϵ, \end{matrix}

(14)

with α_θ, β_θ, and θ being parameters and ϵ being a zero mean normally distributed error term with a covariate dependent variance σ²(a) = σ²a^2θ, that is, ϵ ~ N(0, σ²a^2θ).

A nonlinear homoscedastic form derives from (14) by setting θ = 0; that is,

\begin{matrix} w = β_{o} a^{α_{o}} + ϵ, \end{matrix}

(15)

And, again ϵ is an additive error term assumed to be normally distributed with zero mean and variance σ², that is, ϵ ~ N(0, σ²).

Appendix A deals with exploratory analysis of data Appendix B presents notation convention and also explains how all addressed paradigms derive from the generalized model of (3). Appendix C explains the addressed forms of correction factor for bias of retransformation of the regression error. Fitting results of the geometrical space based models appear in Appendix D. Those corresponding to the nonlinear heteroscedastic and homoscedastic models pertain to Appendix E. Agreement between observed and projected values is commonly evaluated by analyzing values of Lin's Concordance Correlation Coefficient (CCC) [61]. This correspondence index is commonly denoted by means of the symbol ρ. Agreement will be defined as poor whenever ρ < 0.90, moderate for 0.90 ≤ ρ < 0.95, good for 0.95 ≤ ρ < 0.99, or excellent for ρ ≥ 0.99 [62]. Besides CCC values, we assessed reproducibility by comparing goodness-of-fit statistics, such as the coefficient of determination (CD), standard error of estimate (SEE), mean prediction error (MPE), total relative error (TRE), average systematic error (ASE), and mean percent standard error (MPSE) ([63–65]). For statistical tasks, we relied on the R package release 3.5.

3. Results

Exploratory analysis in Appendix A identifies maximum, minimum, and sample mean values for observed leaf area values a and associated dry weights w. We also explain distribution of variables in terms of quantiles of probability 0.1, 0.25, 0.50, 0.75, and 0.90, for both crude and processed data. Statistical exploration extends to log transformed values of these variables. We present Q-Q plots (quantile-quantile) for comparison of distribution patterns, as well as boxplots for the 13 months' long sampling scheme for both crude and processed data. We can learn that, from month 2 to month 6, a reduction in the values of weight and area occurred; this perhaps is explained by an increase in temperature during the period. We can be also aware that a similar variation pattern over time is shown in both raw and processed data sets.

3.1. Fitting Results of Geometrical Space Models

In order to validate curvature in geometrical space, we compared the linear model derived from (1) as well as biphasic and polynomial alternates derived from the generalized model of (3). Appendix B explains formal matters. Tables 1 and 2 summarize notation convention. Equations numbered beyond (15) belong to the appendices. Appendix D explains corresponding regression protocols.

Table 1.

Summary of notation convention of the bivariate scaling model of (1) and its generalization to account for curvature as given by (3). GS stands for geometrical space, AS stands for arithmetical space, and CF means correction factor of retransformation bias.

	Typical bivariate	Eq.	Generalized for curvature	Eq.
Basic form	w = βa^α	(1)	w = β(a)a^α(a)	(3)

Regression equation in GS	v = ln⁡β + αu + ϵ	(2), (D.1)	v = v_C(a, u) + ϵ	(4)

Mean response GS	E _T(v \| u) = ln⁡β + ln⁡α	(B.4)	E _C(v \| u) = ln⁡β(a) + α(a)u(a)	(B.1)

Back transformation AS	w = e^{E_T(v \| u)}e^ϵ	(B.5)	w = e^{E_C(v \| u)}e^ϵ	(B.2)

Mean response AS	E _Tδ(w \| a) = βa^αδ(ϵ) δ(ϵ) = E(e^ϵ)	(B.6)	E _Cδ(w \| a) = β(a)a^α(a)δ(ϵ) δ(ϵ) = E(e^ϵ)	(B.3)

CF Baskerville	E _TB(w \| a) = βa^α δ_B(ϵ) δ_B(ϵ) = e^σ²/2	(B.6), (C.1)	E _CB(w \| a) = β(a)a^α(a) δ_B(ϵ)	(B.3), (C.1)

CF Duan	E _TD(w \| a) = βa^αδ_D(ϵ) $δ_{D} (ϵ) = \frac{\sum_{1}^{m} e^{ϵ_{j}}}{m}$	(B.6), (C.2)	E _C(w \| a) = β(a)a^α(a)δ_D(ϵ)	(B.3), (C.2)

CF Zeng and Tang	E _TZT(w \| a) = βa^αδ_ZT(ϵ) $δ_{Z T} (ϵ) = 1 + \frac{σ^{2}}{2}$	(B.6), (C.3)	E _CZT(w \| a) = β(a)a^α(a)δ_ZT(ϵ)	(B.3), (C.3)

CF n-partial sum	E _Tn(w \| a) = βa^αδ_n(ϵ) $δ_{n} (ϵ) = \sum_{0}^{n} \frac{E (ϵ^{k})}{k!}$	(B.6), (C.4)	E _Cn(w \| a) = β(a)a^α(a)δ_n(ϵ)	(B.3), (C.4)

Open in a new tab

Table 2.

Notation convention for curvature models in geometrical space. We include the biphasic and polynomial characterizations of the generalized allometric model of (3).GS stands for geometrical space, AS stands for arithmetical space, and CF means correction factor of retransformation bias of the regression error.

	Biphasic model	Eq.	Polynomial model	Eq.
Basic form	w = β_B(a)a^α_B(a)	(B.11)	w = β_P(a)a^α_P(a)	(B.27)

Regression equation	v = v_B(a, u) + ϵ	(8), (D.4)	v = v_P(a, u) + ϵ	(B.32), (D.8)

Mean response GS	E _B(v \| u) = ∑₁²ϑ_i(u(a))f_i(u(a))	(B.18)	E _P(v \| u) = ∑₀^mp_ku(a)^k	(B.33)

Back transformation AS	w = β_B(a)a^α_B(a)e^ϵ	(B.21)	w = β_P(a)a^α_P(a)e^ϵ	(B.34)

Mean response AS	E _Bδ(w \| a) = β_B(a)a^α_B(a)δ(ϵ) δ(ϵ) = E(e^ϵ)	(B.22)	E _P(w \| a) = β_P(a)a^α_P(a)δ(ϵ) δ(ϵ) = E(e^ϵ)	(B.35)

CF Baskerville	E _BB(w \| a) = β_B(a)a^α_B(a) δ_B(ϵ)	(B.22), (C.1)	E_PB(w \| a) = β_P(a)a^α_P(a)δ_B(ϵ)	(B.35), (C.1)

CF Duan	E _BD(w \| a) = β_B(a)a^α_B(a)δ_D(ϵ)	(B.22), (C.2)	E_PD(w \| a) = β_P(a)a^α_P(a)δ_D(ϵ)	(B.35), (C.2)

CF Zeng and Tang	E _BZT(w \| a) = β_B(a)a^α_B(a)δ_ZT(ϵ)	(B.22), (C.3)	E_PZT(w \| a) = β_P(a)a^α_P(a)δ_ZT(ϵ)	(B.35), (C.3)

CF n-partial sum	E _Bn(w \| a) = β_B(a)a^α_B(a)δ_n(ϵ)	(B.22), (C.4)	E _Pn(w \| a) = β_P(a)a^α_P(a)δ_n(ϵ)	(B.35), (C.4)

Open in a new tab

Fitting results of the TAMA arrangement of (2) appear in Appendix D. Figure 1 shows the spread about TAMA's linear mean response function E_T(v∣u). We can visually ascertain that deviations from the linear mean response function E_T(v∣u) suggest curvature (red dots). Thus, data processing removed inconsistent replicates but shown spread still deviates from a linear mean response. This suggests that curvature in geometrical space could not be explained by methodological factors related to data gathering.

Spread about the TAMA's linear mean response function E_T(v | u). Deviations about E_T(v | u) shown by red dots suggest curvature.

Fitting results of the biphasic protocol of (8) are summarized in Appendix D. Figure 2 displays the spread about mean response function E_B(v∣u) in geometrical space. Compared with Figure 1, we can ascertain that the biphasic fit provides a consistent account of different variation patterns among smaller and larger leaves. We can visually ascertain that fit produced consistent results. This confirms a judgement that identified curvature might be due to intrinsic factors of leaf growth rather than methodological influences related to data gathering.

Spread about the biphasic mean response function E_B(v | u). Compared with the plot in Figure 1, we observe that the biphasic choice offers a better account of variability of the log transformed response than the TAMA alternate.

Appendix D presents fitting results of the polynomial model (m = 6). Figure 3 displays dispersion about the polynomial mean response function in geometrical space E_P(v∣u). A polynomial representation also exhibits higher consistency than the TAMA arrangement. Recalling the biphasic scheme, the polynomial suggests a smooth transition between two growing phases.

Spread about the polynomial mean response function E_P(v | u). Compared with the plot in Figure 1, we can be aware that, as opposite to the TAMA protocol, the polynomial scheme offers a consistent account of curvature.

3.2. Model Selection in Geometrical Space

Assessment of models fitted on geometrical space relied on goodness-of-fit statistics, that is, the coefficient of determination, standard error of estimate, mean prediction error, total relative error, average systematic error, and mean percent standard error ([63–65]). Besides, we took into account concordance correlation coefficient [61] and Akaike's information index [66]. Table 3 presents results. Goodness-of-fit statistics and ρ and AIC values disfavored the TAMA protocol. On the contrary, comparison indices favored the biphasic model. Moreover, differences among indices but TRE and ASE for this scheme and the polynomial (m = 6) are slight. Particularly, the highest AIC is associated with the TAMA protocol (AIC = 15069.9, ∆AIC = 2491.9). Therefore, this model bears the less support. The biphasic choice delivered the smallest AIC's value (AIC = 12578.0, ∆AIC = 0). Nevertheless, difference in AIC is just barely relative to the m = 6 polynomial model, since this choice conveyed (AIC = 12615.0 and ∆AIC = 37). Thus, model confrontation shows that the TAMA protocol is unsuited, thus backing the assertion that whatever model aims to be consistent with the present data, it ought to be nonlinear in geometrical space.

Table 3.

Assessment of geometrical space fitted models. Comparison took into account goodness-of-fit statistics, that is, the coefficient of determination (R²) standard error of estimate (SEE), mean prediction error (MPE), total relative error (TRE), average systematic error (ASE), and mean percent standard error (MPSE) ([63–65]). Besides, we considered concordance correlation coefficient (ρ) [61] and Akaike's information index (AIC) [66].

Method	AIC	ρ	R²	SEE	TRE	ASE	MPE	MPSE
TAMA	15069.9	0.9505	0.9045	0.5135	-0.0286	-0.2462	-0.1879	5.7877
Biphasic	12578.0	0.9614	0.9256	0.4530	0.0037	0.0286	-0.1658	5.1446
Polynomial	12615.0	0.9614	0.9241	0.4576	0.8497	1.2008	-0.1675	5.3187

Open in a new tab

3.3. Retransformation Results

The TAMA protocol was not supported by the model selection criteria. Anyway, for comparison, corresponding retransformation results are included in Appendix D. Related with the TAMA protocol, fitting results of the biphasic model display a relatively improved distribution of residuals about the zero line. Nevertheless, normal Q-Q plot still shows heavier tails than those expected for a normal distribution. And, again both test statistics and p values of an Anderson-Darling test [67] provide evidence against normality of residuals. This justifies choosing the nonparametric forms δ_D(ϵ), δ_ZT(ϵ), or δ_n(ϵ) for compensation of downward bias induced by retransformation of the regression error ([11, 52, 53]). Table 4 displays comparison statistics for the reproducibility strength of the biphasic mean response E_Bδ(w∣a) as shaped by the different forms of δ(ϵ).

Table 4.

Comparison of reproducibility strength statistic for the biphasic mean response E_Bδ(w | a) as calculated by different forms of the correction factor for bias of retransformation of the regression error ϵ. The ρ(δ(ϵ)) symbol stands for the concordance correlation value associated with δ(ϵ).

CF	ρ(δ(ϵ))	R²	SEE	TRE	ASE	MPE	MPSE
δ _D(ϵ)	0.9664	0.9256	0.0049	-9.2196	4.48e-13	0.8133	31.2839
δ _ZT(ϵ)	0.9687	0.9319	0.0047	-7.5530	1.8359	0.7780	31.4388
δ _n(ϵ)	0.9727	0.9464	0.0042	1.9516	12.3058	0.6903	33.9525

Open in a new tab

We can learn that agreement between the biphasic mean response and leaf biomass data is best for δ_n(ϵ). Figure 4 shows spread of processed leaf biomass values about the biphasic mean response function E_Bδ(w∣a) as shaped by the considered forms of δ(ϵ). We can observe that both δ_D(ϵ) and δ_ZT(ϵ) overcompensate the bias correction by δ_n(ϵ). Moreover results show that as opposed to the TAMA a biphasic protocol along with the δ_n(ϵ) form offers consistent proxies of individual leaf biomass. But it is worth mentioning that in spite of the fact that model selection favored the biphasic scheme, examination of the polynomial model output reveals similar predictive strength to the biphasic alternate.

Spread about the biphasic mean response function E_Bδ(w | a) in arithmetical space. Black lines relate to δ_D(ϵ), green to δ_ZT(ϵ), and red ones to δ_n(ϵ).

3.4. Assessing Curvature by Direct Nonlinear Regression

As suggested by Mascaro et al. [18], effects of curvature in geometrical space can be analyzed by means of the direct nonlinear heteroscedastic regression model of (14). In Appendix B, we explain that such a protocol also derives from the generalized bivariate allometric model of (3). Table 5 presents pertinent notation convention. For comparison, we also present results for the associated homoscedastic case.

Table 5.

Notation convention for the nonlinear heteroscedastic and homoscedastic models of (14) and (15), respectively.

Nonlinear model	Heteroscedastic	Eq.	Homoscedastic	Eq.
Regression equation	w = β_θ a^α_θ + a^θϵ ϵ = N(0, a^2θσ²)	(14), (E.1)	w = β_o a^α_o + ϵ ϵ ~ N(0, σ²)	(15), (E.5)
Mean response function	E _θ(w \| a) = β_θa^α_θ	(B.45)	E _o(w \| a) = β_o a^α_o	(B.46)

Open in a new tab

Fitting results of the heteroscedastic and homoscedastic models appear in Appendix E. We can learn that estimates for the normalization constant and scaling exponent parameters are very similar. Certainly, corresponding 95% confidence intervals display some overlap. As a result, we can expect similar reproducibility features for both models. Table 6 presents comparison statistics.

Table 6.

Assessment of models fitted in arithmetical space. This includes the nonlinear homoscedastic and heteroscedastic protocols.

Method	AIC	ρ	R²	SEE	TRE	ASE	MPE	MPSE
Heteroscedastic	-92761.13	0.972	0.9467	0.0041	0.0311	26.1073	0.6882	50.2781
Homoscedastic	-81386.51	0.972	0.9467	0.0041	0.2954	28.4527	0.6881	51.7400

Open in a new tab

We can be aware that model assessment backs the heteroscedastic model. But selection here is mainly on qualitative grounds. It actually concerns the ability of the heteroscedastic model to identify an expected dependence of variance in the covariate. Certainly, the reproducibility strengths of both paradigms are equivalent. Indeed, Figure 5 shows that mean response curves E_θ(w∣a) and E_o(w∣a) differ just barely.

Spread about the mean response, curves E_θ(w | a) and E_o(w | a) associated with the nonlinear heteroscedastic and homoscedastic regression models of (14) and (15) one to one. The mean response E_θ(w | a) is shown in black lines and those corresponding to E_o(w | a) in red.

Results show that as it occurred for models fitted in geometrical space, data cleaning failed to correct a heavy tails problem for the nonlinear fits. This can be ascertained from the normal Q-Q plot of residuals. This strengthens our point on the consideration of a different error structure from the one assumed here. Exploring the effects of error structure in the fitting of models for curvature addressed here will be a matter of further research. Interestingly, both the homoscedastic and heteroscedastic models seem to induce the same reproducibility strengths.

3.5. Model Assessment in Arithmetical Space

The model selection assay in geometrical space summarized in Table 3 favored the biphasic protocol. Correspondingly, statistics in Table 6 support the nonlinear heteroscedastic model. Results of Table 4 endure δ_n(ϵ) as required for largest reproducibility of retransformation output. Table 7 allows assessment of these models. We can learn that half the number of comparison indices coincide (ρ, R², SEE, and MPE). In addition, the biphasic model is favored by AIC, ASE, and MPSE. This sets criterion for selection of curvature in geometric space as a consistent paradigm for the present data. Accordingly, the biphasic model bears adequate.

Table 7.

Assessment of models in arithmetical space. This includes the nonlinear heteroscedastic and biphasic protocols.

Method	AIC	ρ	R²	SEE	TRE	ASE	MPE	MPSE
Heteroscedastic	-92761.13	0.972	0.946	0.004	0.031	26.107	0.690	50.278
Biphasic	-96879.30	0.972	0.946	0.004	1.951	12.305	0.690	33.952

Open in a new tab

3.6. Implications for Allometric Proxies of Mean Leaf Biomass in Eelgrass Shoots

We in turn consider allometric proxies for average leaf biomass in eelgrass shoots. In getting these surrogates, we aggregate allometric projections of individual leaf biomass conforming a shoot. For comparison, we consider individual leaf biomass surrogates produced by the different projection methods. Table 8 compares resulting reproducibility strengths.

Table 8.

Comparison of reproducibility strengths of proxies of average leaf biomass in shoots resulting from the biphasic polynomial or nonlinear heteroscedastic protocols.

Method	ρ	R²	SEE	TRE	ASE	MPE	MPSE
Biphasic	0.997	0.994	8.729e-04	2.408	2.229	3.486	5.080
Polynomial (m=6)	0.996	0.992	0.001	-2.456	-1.033	4.807	5.366
Heteroscedastic	0.997	0.994	7.722e-04	0.983	-0.726	3.084	5.645

Open in a new tab

Results in [11] stablished that proxies derived from the TAMA protocol are inconsistent with observed values. This endorsed nonlinear regression in the direct scale as a requirement for reliability of allometric projections of mean leaf biomass in eelgrass shoots. But Table 8 shows that a curvature model fitted in geometrical space can offer proxies entailing similar predictive power to a nonlinear regression protocol. Plots in Figure 6 allow getting a glimpse of this assertion.

Average leaf biomass in shoots calculated from observed-processed data compared with their allometric projections. Projection lines resulting from the biphasic and polynomial models relied on the δ_n(ϵ) form.

4. Discussion

The customary bivariate allometric model of (1) offers nondestructive surrogates for average leaf biomass in eelgrass shoots [11]. But there are methodological factors that could influence dependability. Views assert that parameter identification based on logarithmic transformations leads to biased projections [20–29]. But other practitioners clung to this approach as meaningful and necessary in allometric examination [30–39]. This going over suggests that surpassing this controversy amounts to considering curvature in geometrical space. For that aim, we proposed the generalized model of (3). Approaches such as direct nonlinear heteroscedastic regression, as well as biphasic and polynomial protocols in geometrical space [18], became logical resultants from this construct. For present data model selection validated maintenance of the analysis in geometrical space. Nevertheless, at an empirical level, addressed protocols produced allometric projections of individual leaf biomass of correspondent precision. This was also verified for concomitant projections of average leaf biomass in shoots. But, from a qualitative standpoint, the nonlinear regression protocol mainly contributed by identifying expected dependence of the variance on covariate. Moreover. Figure 7(a) depicts manifest differences in mean response trends between the polynomial fit and the nonlinear heteroscedastic model. Nonetheless, those in Figure 7(b) corresponding to this and the biphasic models differ but only barely. Then, a nonlinear regression scheme at best shaped a reasonable approximation of the mean response function resultant from curvature methods.

Trends of mean response functions calculated from the retransformed outputs of polynomial (a) and biphasic models (b) involving the δ_n(ϵ) form of δ(ϵ). We can observe a differentiation in trends relative to that of a power function fitted in directly by means of the nonlinear heteroscedastic protocol of (14). Although relative deviations manifest for the biphasic model, differences in mean response patterns are more clearly depicted for the polynomial choice.

Differences in patterns of the biphasic and polynomial mean response functions relative to the nonlinear protocol exhibit that clinging to this last paradigm could impair detection of the true allometric relationship. Moreover, relying on direct nonlinear regression impairs identification of heterogeneity in the log transformed response as covariate changes. This further stresses on limitations of this device as a tool for allometric examination ([18, 30]). Oppositely, output of the selected biphasic model shown in Figure 2 suggests differentiation of growth patterns among smaller and larger leaves. Besides, the polynomial mean response in Figure 3 suggests a gradual transition between different growth phases. Thus, as opposed to direct nonlinear regression, a consideration of curvature in geometrical space could elucidate an inherent leaf growth pattern. This strengthens a judgement that the log transformation step, essential to traditional allometric examination, cannot be thrown away without losing relevant information ([33, 37]).

Mascaro et al. [18] conceived curvature in geometrical space as related to methodological factors of data gathering. But present examination corroborated consistency of curvature models for processed data. This suggests that manifestation of curvature is rather explained by intrinsic factors in leaf growth. Additionally, data processing failed to amend the heavy tails problem detected on Q-Q plots. This indicates departure of residuals from an assumed error structure. As a result, numerical values of the addressed correction factor forms turned out to be different, thus conveying ambiguity in selection of suitable mean response of models fitted in geometrical space. This could entail the only advantage of nonlinear regression method over log-transformation-curvature paradigms. But, also for this analysis method, an inadequate postulation of inherent error structure can lead to significant bias [18]. It seems then reasonable considering that a suitable characterization of error structure could lead to robustness of built allometric proxies, even when they derive from crude data. Steering to an error structure different from what is assumed here is a worthwhile subject of further research.

When dealing with similar data we suggest taking into account recommendations that come up from this examination. First, it is highly advisable to perform a preliminary examination of the spread around the straight line in geometrical space resulting from the model of (1). If further statistical exploration confirms that linearity and assumed error structure are consistent with data, Huxley's bivariate allometric model could suit. Otherwise, the arrangement of curvature, error structure, and correction factor form such as proposed here could be called into account for the analysis. The use of data cleaning procedures in order to achieve a better fit is controversial [11]. Instead of performing data processing a posteriori, it is highly advisable to rely on standardized data gathering procedures. This will prevent proliferation of inconsistent replicates that could exacerbate a heavy tails problem on Q-Q plots.

5. Conclusion

Failure to perform both a preliminary exploration of spread of log transformed allometric data and a sound evaluation of model adequacy could impair detecting a possible manifestation of curvature. As a consequence, the output of a traditional analysis method could set biased predictions of observed values. This circumstance could result in dismissal of a log transformation step in the analysis, giving way to contemplation of direct nonlinear regression as the only protocol to acquire reliable parameter estimates [11]. Results of this examination suggest that consideration of curvature in geometrical space as set by the model of (3) could offer dependable allometric proxies of average leaf biomass in eelgrass shoots.

From a general perspective, complexity as encompassed by the model of (3) can stand for curvature as conceived in allometric examination. Particularly, biphasic or polynomial protocols in geometrical space, as well as a direct nonlinear heteroscedastic regression model, derive as particular characterizations of this paradigm. Moreover all statistical models for accurate estimates of relative growth contemplated by Bervian et, al., [68] can be also accomodated by the present generalization of the model of simple allometry of Huxley. But empirical convenience on its own does not validate adoption of this paradigm as a general tool. Certainly, the Weierstrass approximation theorem [69] backs a polynomial regression model as a reasonable identification device for the generalized allometric model expressed in geometrical space. But suitability of retransformation results will sensibly depend on correction factor form. And a mean function resulting from a polynomial fitted in geometrical space will not enable characterization of functions α(a) and β(a) one to one. Furthermore, complexity of (3) could pose significant difficulties while attempting its identification through direct nonlinear regression methods. Needless to say, biological interpretation of the scaling functions α(a) and β(a) is also pending. A quest for efficient tools of nondestructive assessment of plant biomass units justifies addressing these examinations in a further research.

Acknowledgments

Funding for this research was institutionally granted. This was in part sponsored by an allowance from Mexico's Consejo Nacional de Ciencia y Tecnología.

Appendix

A. Data Exploratory Analysis

This appendix presents an exploratory analysis of raw and processed data sets contemplated in this examination.

Table 9 describes the distribution pattern, in terms of quantiles, for a sample of 10412 measurements of eelgrass leaf weights and related areas taken over 13 months conforming the present raw data. The first four columns in the uppermost row label the minimum followed by quantiles of probability 0.1, 0.25, and 0.50. Correspondingly, the fifth column in the first row presents the sample mean followed by quantiles of probability 0.75 and 0.90 before the maximum. Second row presents leaf dry weight values (w) and third row shows corresponding areas a. Third and fourth rows present transformed ln⁡w and ln⁡a values one to one. Similarly, Table 10 shows the variation pattern of the 10023 observations resulting after applying to raw data a mean plus or minus two standard deviations outlier's removal procedure [58] designed to remove replicates considered significantly discrepant from the mean response function of a simple allometric model for a leaf dry weight response in terms of a leaf area covariate.

Table 9.

Values of minimum, maximum, sample mean, and quantiles for measurements of eelgrass dry weight (w[gr]) and area (a[mm²]) (and their logarithms) of a sample of 10412 leaves before applying a data-cleaning procedure aimed to eliminate outliers.

	Min	0.10	0.25	0.50	Mean	0.75	0.90	Max
w[gr]	0.00001	0.00042	0.00154	0.00564	0.01293	0.01477	0.035443	0.38058
a[mm²]	2.00	32.50	127.5	355.2	690.5	836.0	1859.00	7868.0
ln⁡w	-11.513	-7.7753	-6.4760	-5.1779	-5.4001	-4.2152	-3.33983	-0.9661
ln⁡a	0.6931	3.48124	4.8481	5.8728	5.6729	6.7286	7.527794	8.9706

Open in a new tab

Table 10.

Values of minimum, maximum, sample mean, and quantiles for measurements of eelgrass dry weight (w[gr]) and area (a[mm²]) (and their logarithms) of a sample of 10023 leaves remaining after applying a data-cleaning procedure aimed to eliminate outliers.

	Min	0.10	0.25	0.50	Mean	0.75	0.90	Max
w[gr]	0.00001	0.00041	0.00144	0.00529	0.01211	0.01373	0.03381	0.15096
a[mm²]	2.00	30.00	124.00	352.50	672.83	828.00	1835.80	6240.00
ln⁡w	-11.5129	-7.7994	-6.5431	-5.2419	-5.4593	-4.2878	-3.3868	-1.8907
ln⁡a	.6931	3.4012	4.8203	5.8651	5.6518	6.7190	7.5152	8.7387

Open in a new tab

Comparing quantile values for the leaf dry weight (w) and corresponding area (a) variables reported in Tables 9 and 10 for crude and processed data respectively, we can ascertain that in spite of removing a large number of discrepant observations by data cleaning, both original and remnant sets show an equivalent distribution pattern. This similarity in distribution before and after data processing of the data can be better perceived in the Q-Q (quantile-quantile) graphs observed in Figure 8 for raw (a) and processed data (b), respectively.

Q-Q plots (quantile-quantile) comparing distribution pattern of observations of eelgrass leaf area a and associated dry weight w for raw (a) and processed data (b).

Q-Q plots in Figure 8 compare the distributions (regardless what they are) of the leaf area a and associated dry weight w variables. The linear relationship observed between the quantiles of both variables underlines that both variables have similar distributions, although with different parameters. This similarity between distributions of both variables is observed for both sets of observations. The great similarity of the fitted straight lines for both sets of observations in Table 11 confirms that the referred distribution pattern does not change after data processing.

Table 11.

Parameter estimates of straight lines fitted on raw (10412) and processed data (10023) and shown in the Q-Q plots of Figure 8.

	Estimate	Standard Error
Raw data
Intercept	141.22	3.88
slope	42493.46	163.29

Processed data
Intercept	111.20	2.50
Slope	46360.00	114.90

Open in a new tab

Figures 9 and 10 display boxplots for the 13 months' long sampling scheme. We can learn that, for raw data, from month 2 to month 6, a reduction in the values of both leaf dry weight and linked area occurred. This is perhaps explained by an increase in temperature during those months. Processed data exhibits similar dynamics through time for these variables. Moreover, the overall variation patterns of raw and processed data, throughout the 13 months of sampling, are similar

Boxplots for values of eelgrass leaf dry weight w (a) and linked area a(b) classified by month, as indicated in the horizontal axis. The data is from a sample of 10412 observations, before applying data processing.

B. Derivation of Regression Protocols

In this appendix, we first present notation convention for statistics related to the generalized bivariate model of (3). We then explain how the different regression protocols addressed in this examination derive from this paradigm. We also include notation convention for related statics.

B.1. Generalized Bivariate Allometric Model

A transformation v = ln⁡w and u(a) = ln⁡a of (3) establishes the generalized regression model in geometrical space given by (4). The term v_C(a, u) stands for the corresponding mean response function. Using the customary notation convention, we represent v_C(a, u) through symbol E_C(v∣u); that is,

\begin{matrix} E_{C} (v ∣ u) = \ln β (a) + α (a) u (a) . \end{matrix}

(B.1)

Furthermore, back-transformation w = e^v of (4) leads to the result

\begin{matrix} w = e^{E_{C} (v ∣ u)} e^{ϵ} . \end{matrix}

(B.2)

Thus, e^ϵ is understood as a multiplicative error term. Denoting by the symbol E_Cδ(w∣a) the corresponding mean response function in arithmetical space, from (B.1) and (B.2), we have

\begin{matrix} E_{C δ} (w ∣ a) (w ∣ a) = β (a) a^{α (a)} δ (ϵ) \end{matrix}

(B.3)

with δ(ϵ) = E(e^ϵ) interpreted as a correction factor of bias of retransformation of the regression error ϵ.

B.2. Traditional Analysis Method of Allometry (TAMA)

The bivariate allometric model of (1) derives from the model of (3) setting the scaling parameters constant; that is, β(a) = β and α(a) = α. The resultant regression model in geometrical space is given by (2). Corresponding mean response function will be denoted here by means of the symbol E_T(v∣u). Hence, from (2) we have

\begin{matrix} E_{T} (v ∣ u) = \ln β + α u . \end{matrix}

(B.4)

Retransformation w = e^v of (2) leads to the result

\begin{matrix} w = e^{E_{T} (v ∣ u)} e^{ϵ} . \end{matrix}

(B.5)

And linked mean response function in arithmetical space denoted through E_Tδ(w∣a) becomes

\begin{matrix} E_{T δ} (w ∣ a) = {β a}^{α} δ (ϵ) \end{matrix}

(B.6)

B.3. Biphasic Model in Geometrical Space

Here, we explain the result of (8), as well as the notation convention for related statistics. In order to characterize a biphasic form of the model of (3), we introduce fixed values a_min and a_max, so that the covariate a takes values in a range a_min ≤ a ≤ a_max. We then conceive a_b as a fixed value of a satisfying a_min ≤ a_b ≤ a_max. And recalling the transformation v = ln⁡w and u(a) = ln⁡a, we take u_b = ln⁡a_b as a breaking point for transition between two different phases of the variation of v. Moreover, in the model of (3), we let β(a) = β_B(a) and α(a) = α_B(a) given by

\begin{matrix} α_{B} (a) = \sum_{1}^{2} ϑ_{i} (u (a)) α_{i} \end{matrix}

(B.7)

\begin{matrix} β_{B} (a) = \exp (\sum_{1}^{2} ϑ_{i} (u (a)) \ln β_{i}), \end{matrix}

(B.8)

\begin{matrix} ϑ_{i} (u (a)) = \{\begin{matrix} (2 - i) (1 - H (u - u_{b})) & for i = 1 \\ (i - 1) H (u - u_{b}) & for i = 2 \end{matrix} \end{matrix}

(B.9)

where H(x) is the Heaviside [60] function defined through

\begin{matrix} H (x) = \{\begin{matrix} 0 & x < 0 \\ 1 & x \geq 0 . \end{matrix} \end{matrix}

(B.10)

Then, the biphasic form of (3) is formally represented by

\begin{matrix} w = β_{B} (a) a^{α_{B} (a)} . \end{matrix}

(B.11)

Denoting by v_B(a, u) the resulting form of v_C(a, u), (5) yields

\begin{matrix} v_{B} (a, u) = \ln β_{B} (a) + α_{B} (a) u (a) . \end{matrix}

(B.12)

Then, (B.7), (B.8), and (B.12) imply that

\begin{matrix} v_{B} (a, u) = \sum_{1}^{2} ϑ_{i} (u (a)) f_{i} (u (a)), \end{matrix}

(B.13)

where

\begin{matrix} f_{i} (u (a)) = \ln β_{i} + α_{i} u (a) . \end{matrix}

(B.14)

This explains (8). Since w as given by (3) is assumed to be a continuous function of a, this property ought to be satisfied by the biphasic model; that is, we require to set a condition, f₁(u_b) = f₂(u_b). This leads to the equation

\begin{matrix} \ln \frac{β_{1}}{β_{2}} = (α_{2} - α_{1}) u_{b} . \end{matrix}

(B.15)

Moreover, v_B(a, u) as given by (8) can be equivalently represented by

\begin{matrix} v_{B} (a, u) = \{\begin{matrix} \ln β_{1} + α_{1} u & u \leq u_{b} \\ \ln β_{2} + α_{2} u & u > u_{b}, \end{matrix} \end{matrix}

(B.16)

with the continuity condition of (B.15).

Correspondingly, the regression model of (4) in its biphasic form becomes

\begin{matrix} v = v_{B} (a, u) + ϵ . \end{matrix}

(B.17)

As we explained in (4), ϵ is a residual error term conceived as a random variable distributed according to an unknown distribution ψ having mean μ and variance set by a function σ²(u) of the covariate u; that is, ϵ ~ ψ(μ, σ²(u)).

The form of the generalized mean response E_C(v∣u) for the biphasic model is denoted through E_B(v∣u) and becomes

\begin{matrix} E_{B} (v ∣ u) = \sum_{1}^{2} ϑ_{i} (u (a)) f_{i} (u (a)) . \end{matrix}

(B.18)

It follows from (B.16) that the expression for E_B(v∣u) can be equivalently represented by

\begin{matrix} E_{B} (v ∣ u) = \{\begin{matrix} \ln β_{1} + α_{1} u & u \leq u_{b} \\ \ln β_{2} + α_{2} u & u > u_{b} . \end{matrix} \end{matrix}

(B.19)

Back-transformation w = e^v of (B.17) leads to the result

\begin{matrix} w = e^{E_{B} (v ∣ u)} e^{ϵ}, \end{matrix}

(B.20)

which can be written in the form

\begin{matrix} w = β_{B} (a) a^{α_{B} (a)} e^{ϵ} . \end{matrix}

(B.21)

Then, the mean response function in arithmetical space E(w∣a) becomes

\begin{matrix} E_{B δ} (w ∣ a) = β_{b} (a) a^{α_{b} (a)} δ (ϵ) . \end{matrix}

(B.22)

B.4. Polynomial Model in Geometrical Space

In this section, we explain how (12) can be derived from the generalized structure of model of (3). We also set the notation convention for related statistics. For these aims, we begin by considering polynomials ϕ_β(a) and ϕ_α(a) given by

\begin{matrix} ϕ_{β} (a) = \sum_{0}^{n} β_{k} u {(a)}^{k} \end{matrix}

(B.23)

\begin{matrix} ϕ_{α} (a) = \sum_{0}^{n} α_{k} u {(a)}^{k}, \end{matrix}

(B.24)

where, for 1 ≤ k ≤ m, α_k and β_k stand for coefficients. Now, let α(a) = α_P(a) and β(a) = β_P(a), where

\begin{matrix} α_{P} (a) = ϕ_{α} (a) \end{matrix}

(B.25)

\begin{matrix} β_{P} (a) = e^{ϕ_{β} (a)} . \end{matrix}

(B.26)

This way, we obtain a representation of the model of (3) in the form

\begin{matrix} w = β_{P} (a) a^{α_{P} (a)} . \end{matrix}

(B.27)

Now, denoting by means of v_P(a, u) the associated form of v_C(a, u), according to (5), we have

\begin{matrix} v_{P} (a, u) = \ln β_{P} (a) + α_{P} (a) u (a) . \end{matrix}

(B.28)

From (B.23) through (B.28), we obtain

\begin{matrix} v_{P} (a, u) = \sum_{0}^{n} β_{k} u {(a)}^{k} + \sum_{0}^{n} α_{k} u {(a)}^{k + 1} . \end{matrix}

(B.29)

Rearranging, we ascertain that v_B(u, a) takes on the polynomial form

\begin{matrix} v_{P} (a, u) = \sum_{0}^{m} p_{k} u {(a)}^{k} \end{matrix}

(B.30)

where m = n + 1, and

\begin{matrix} p_{0} = β_{0}, \\ p_{k} = β_{k} + α_{(k - 1)} f o r 1 \leq k \leq m - 1, \\ p_{m} = α_{m} . \end{matrix}

(B.31)

This explains the result of (12). Correspondingly, a polynomial characterization of the regression model of (4) takes the form

\begin{matrix} v = v_{P} (a, u) + ϵ, \end{matrix}

(B.32)

with ϵ being a residual error as described in (4). The form of the generalized mean response E_c(v∣u) for the polynomial characterization is denoted through E_P(v∣u). It becomes

\begin{matrix} E_{P} (v ∣ u) = \sum_{0}^{m} p_{k} u {(a)}^{k} . \end{matrix}

(B.33)

Back-transformation w = e^v of (B.32) leads to

\begin{matrix} w = e^{E_{P} (v ∣ u)} e^{ϵ} . \end{matrix}

(B.34)

And the mean response function in arithmetical space E_Pδ(w∣a) becomes

\begin{matrix} E_{P δ} (w ∣ a) = β_{P} (a) a^{α_{P} (a)} δ (ϵ) \end{matrix}

(B.35)

B.5. Nonlinear Regression Models

In this section, we explain how the nonlinear heteroscedastic model of (14) can be also associated with (3). We also explain the notation convention for related statistics. For these aims, we begin by noticing that v_C(a, u) in (5) can be also written in the form

\begin{matrix} v_{C} (a, u) = \ln β + α u + λ (a) \end{matrix}

(B.36)

where u = ln⁡a and

\begin{matrix} λ (a) = \ln (\frac{β (a)}{β}) + (α (a) - α) \ln a . \end{matrix}

(B.37)

Now, if β(a) and α(a) are as defined in (3), then (B.36) turns out to be nonlinear in geometrical space. Moreover, since from (B.36) we have

\begin{matrix} e^{v_{C} (a, u)} = {β a}^{α} e^{λ (a)} \end{matrix}

(B.38)

noticing that, from (4), e^{v_C(a, u)} = β(a)a^α(a), then solving for e^λ(a) above yields

\begin{matrix} e^{λ (a)} = \frac{β (a) a^{α (a)}}{{β a}^{α}} . \end{matrix}

(B.39)

Then, we have that the generalized curvature model of (3) can be also written in the form

\begin{matrix} w = {β a}^{α} e^{λ (a)} \end{matrix}

(B.40)

Thus, defining

\begin{matrix} ε (a) = β a^{α - θ} (e^{λ (a)} - 1), \end{matrix}

(B.41)

(B.39) implies

\begin{matrix} w = {β_{θ} a}^{α_{θ}} + a^{θ} ε (a), \end{matrix}

(B.42)

which suggests the nonlinear heteroscedastic regression model of (14). That is,

\begin{matrix} w = {β_{θ} a}^{α_{θ}} + a^{θ} ϵ, \end{matrix}

(B.43)

which can be considered as a tool to analyze curvature in geometrical space. Particularly, by setting θ = 0, we can consider the homoscedastic model

\begin{matrix} w = {β_{0} a}^{α_{0}} + ϵ, \end{matrix}

(B.44)

where in (B.43) and (B.44) the error term ϵ is assumed to be normally distributed random variable having zero mean and homogeneous variance; that is, ϵ ~ N(0, σ²).

It turns out that the mean response function E_θ(w∣a) associated with the heteroscedastic model becomes

\begin{matrix} E_{θ} (w ∣ a) = {β_{θ} a}^{α_{θ}} . \end{matrix}

(B.45)

Similarly, the mean response function E₀(w∣a) associated with the homoscedastic model becomes

\begin{matrix} E_{0} (w ∣ a) = β_{0} a^{α_{0}} . \end{matrix}

(B.46)

Finally, by setting α(a) = α and β(a) = β + c/a^α in (3), a log transformation v = ln⁡w and u = ln⁡a leads to the nonlinear regression model in geometrical space:

\begin{matrix} v = \ln (β + c e^{- α u}) + α u + ϵ \end{matrix}

(B.47)

with the error term assumed to be normally distributed with zero mean and constant variance σ². Then, back-transformation w = e^v of (B.47) to arithmetical space yields

\begin{matrix} w = (β a^{α} + c) e^{ϵ} . \end{matrix}

(B.48)

The mean response function in arithmetical space becomes

\begin{matrix} E (w ∣ a) = (β a^{α} + c) δ (ϵ) \end{matrix}

(B.49)

Packard [59] asserts that, commonly, any data set on arithmetical scale that is consistently described by a three-parameter power function will track a curved path when transformed to the logarithmic scale. Equations (B.45) through (B.47) provide a formal set-up accommodating such a statement.

C. Forms of the Correction Factor of Bias of Retransformation of the Regression Error

In this appendix, we explain the different characterizations of δ(ϵ), defined as a correction factor for bias of retransformation of the regression error ϵ introduced by (B.3) for E_Cδ(w∣a).

If ϵ is normally distributed with zero mean and constant variance σ², that is ϵ ~ N(0, σ²), resulting form of δ(ϵ), denoted here by means of the symbol δ_B(ϵ), is given by

\begin{matrix} δ_{B} (ϵ) = e^{σ^{2} / 2} . \end{matrix}

(C.1)

δ_B(ϵ) will be referred to as the Baskerville [51] form of δ(ϵ).

In case of a nonnormally distributed residual error term ϵ, Newman [52] asserts that δ(ϵ) must take the form provided by Duan's smearing estimate of bias [54]. Here, this form is correspondingly represented by means of the symbol δ_D(ϵ), and it is calculated by means of

\begin{matrix} δ_{D} (ϵ) = \frac{\sum_{1}^{m} e^{ϵ_{j}}}{m}, \end{matrix}

(C.2)

with ϵ_j standing for the j − th residual of the contemplated regression model. Nonetheless, this form of δ(ϵ) could produce bias overcompensation ([53, 55]). Moreover, δ_D(ϵ) corresponds to the sample mean of retransformed residuals. Therefore, whenever outliers occur, the actual central tendency of retransformed residuals data could not be represented by δ_D(ϵ). Under such a circumstance, a suitable form of the correction factor δ(ϵ) seems indefinite. Moreover, Zeng and Tang [57] suggest a distribution-free form of δ(ϵ) represented here by means of δ_ZT(ϵ) and given by

\begin{matrix} δ_{Z T} (ϵ) = 1 + \frac{σ^{2}}{2} . \end{matrix}

(C.3)

We notice that δ_ZT(ϵ) is actually an approximation to δ(ϵ) as it corresponds to a three terms' partial sum of the power series expression of δ(ϵ) assuming E(ϵ) = 0. By the same token, we can consider an alternative approximation for δ(ϵ); this is represented through the symbol δ_n(ϵ) and given by an n-terms partial sum of the series representation of E(e^ϵ); that is,

\begin{matrix} δ_{n} (ϵ) = \sum_{0}^{n} \frac{E (ϵ^{k})}{k!} . \end{matrix}

(C.4)

The value of n leading to the highest concordance correlation coefficient value between E_Cδ(w∣a) projections and observed values sets the explicit form of δ_n(ϵ) in (C.4).

D. Fitting Results of Regresion Models in Geometrical Space

This appendix presents fitting results of the geometrical space protocols derived from the generalized model of (3). This includes TAMA and biphasic and polynomial schemes.

D.1. Fitting Results of the TAMA Protocol

Since we are dealing with bivariate data (w_i, a_i), in arithmetical scale, according to the TAMA protocol, we have to consider pairs (v_i, u_i), where v_i = ln⁡w_i and u_i = ln⁡a_i for i = 1,2,….n. Moreover, the linear regression model of (2) becomes

\begin{matrix} v_{i} = \ln β + α u_{i} + ε_{i}, \end{matrix}

(D.1)

where the error term ε_i is assumed to be normally distributed with zero mean and constant variance σ². Hence, the response v_i has normal distribution N(μ_i, σ²) with constant variance σ² and a mean μ_i(α, β, u_i) expressed as a function of the covariate u_i; namely,

\begin{matrix} μ_{i} (α, β, u_{i}) = \ln β + α u_{i} . \end{matrix}

(D.2)

Identification method contemplated here relies on finding parameter estimates that maximize the log-likelihood function l(β, α, σ). This is given by

\begin{matrix} l (β, α, σ) = - \frac{n}{2} \log (2 π) - \sum_{1}^{n} \log (σ) \\ - \frac{1}{2} \sum_{1}^{n} {(\frac{v_{i} - \ln β - α u_{i}}{σ})}^{2} . \end{matrix}

(D.3)

Fitting results of the linear model of (D.1) through (D.3) to raw data appear in Table 12.

Table 12.

Fitting results of regression model of the TAMA protocol (cf. (D.1) through (D.3)). CI stands for confidence interval, RSE stands for standard error of residuals, RMS means multiple R-squared, and ARS is adjusted R-squared.

Statistics of residuals
Minimum	1Q	Median	3Q		Maximum
-4.7088	-0.2344	0.0281	0.2244		4.7169

Parameters	Estimate	Std. Error	t value	Pr(>\|t\|)	C.I. (95%)
α	1.028222	0.003335	308.27	<2e-16	(1.021684, 1.03476)
ln⁡β	-11.270569	0.019534	-576.94	<2e-16	(-11.308861, -11.23228)

RSE σ		0.5131 on 10021 df
MRS		0.9046
ARS		0.9046
F-statistic		9.504e+04 on 1 and 10021 df
p-value		< 2.2e-16

Open in a new tab

Figure 11(a) displays a biased distribution of residuals around the zero line. The normal Q-Q plot in Figure 11(b) can be considered as a visual test of goodness of fit. In this case, this provides primary evidence against normality of residuals. Indeed, we can learn of a heavy-tails pattern in this plot. Moreover, an Anderson-Darling [67] goodness-of-fit test resulted in a test statistic of 246.7 and in a p value of <2.2e-16, which confirms lack of normality of residuals.

Residuals and normal probability plots ((a) and (b), resp.) resulting from fitting the linear regression model of the TAMA protocol (cf. (D.1) through (D.3)). A biased distribution of residuals around the zero line is depicted. Also, the normal Q-Q plot shows heavier tails than those expected for a normal distribution.

Figure 12 shows spread about TAMA's mean response E_Tδ(w∣a) as produced by different forms of the correction factor δ(ϵ). Plot suggests moderate bias about E_Tδ(w∣a) for leaves of small-to-medium-sized leaves. Nevertheless, deviations between projected and processed data are notorious for larger area values. Thus, spread in Figure 12 suggests that the TAMA protocol is unsuited for the analysis of present data.

Spread about the mean response function E_Tδ(w | a) obtained by retransforming the linear model of the TAMA protocol and using the addressed forms of the correction factor δ(ϵ) (Table 1). Black lines associated with E_TD(w | a) green lines go with E_TZT(w | a), and red ones go with E_Tn(w | a) and in yellow we show those for E_TB(w | a). We can learn of a biased spread of observed values about the mean response functions E_Tδ(w | a) even after data processing.

D.2. Fitting Results of the Biphasic Model

As before, for data pairs (v_i, u_i), with v_i = ln⁡w_i and u_i = ln⁡a_i, for i = 1,2,….n, (B.13) and (B.17) yield the regression model

\begin{matrix} v_{i} = (ϑ_{1} (u_{i}) \ln β_{1} + ϑ_{2} (u_{i}) \ln β_{2}) \\ + (ϑ_{1} (u_{i}) α_{1} + ϑ_{2} {(u_{i}) α}_{2}) u_{i} + ε_{i}, \end{matrix}

(D.4)

with the additive errors ε₁, ε₂,…, ε_n assumed to be normally distributed with zero mean and constant variance σ². Moreover, the response v_i has normal distribution N(μ_i, σ²) with constant variance σ² and a mean μ_i(α, β, u_i) expressed as a function of the covariate u_i; namely,

\begin{matrix} ϑ_{k} (u_{i}) = \{\begin{matrix} (2 - k) (1 - H (u_{i} - u_{b})) & k = 1 \\ (k - 1) H (u_{i} - u_{b}) & k = 2, \end{matrix} \end{matrix}

(D.5)

and, hence, the response v_i as given by (D.4) is normally distributed having mean μ_i(u_i, π) with π standing for the parameter vector (β₁, β₂, α₁, α₂, u_b) and given by

\begin{matrix} μ_{i} (u_{i}, π) = (\ln β_{1} + H (u_{i} - u_{b}) \ln (\frac{β_{2}}{β_{1}})) \\ + (α_{1} + H (u_{i} - u_{b}) (α_{2} - α_{1})) u_{i} \end{matrix}

(D.6)

and variance σ². Therefore, the associated log-likelihood function l(β₁, β₂, α₁, α₂, u_b, σ) becomes

\begin{matrix} l (β_{1}, β_{2}, α_{1}, α_{2}, u_{b}, σ) \\ = - \frac{n}{2} \log (2 π) - \sum_{1}^{n} \log (σ) \\ - \frac{1}{2} \sum_{1}^{n} {(\frac{w_{i} - μ_{i} (u_{i}, β_{1}, β_{2}, α_{1}, α_{2}, u_{b})}{σ})}^{2} . \end{matrix}

(D.7)

Table 13 Presents fitting results of the biphasic regression model of (D.4) through (D.7). It is worth mentioning that in this case we are performing a nonlinear fit in geometrical space.

Table 13.

Fitting results of the biphasic regression model of (D.4) through (D.7) to processed data. CI stands for confidence interval and RSE for standard error of residuals.

Residual statistics
Minimum	1Q		Median	3Q	Maximum
-4.4406	-0.1948		0.0240	0.2072	4.4626

Parameters	Estimate	Std. Error	t value	Pr(>\|t\|)	CI (95%)
ln⁡β₁	-9.190321	0.052812	-174.0	<2e-16	(-9.2938, -9.0868)
ln⁡β₂	-11.9381	0.093895	-127.1	<2e-16	(-12.1221, -11.7541)
u _b	3.550709	0.032744	108.4	<2e-16	(3.4865, 3.6149)
α ₁	0.336339	0.020042	16.8	<2e-16	(0.2971, 0.3756)
α ₂	1.164558	0.004248	274.1	<2e-16	(1.1562, 1.1729)

RSE σ		0.452947 on 10018 df

Open in a new tab

Comparing the behavior of the residuals of the TAMA and biphasic models from Figure 13, we can learn of a relative stabilization of variability of the residues. Nevertheless, the biphasic normal Q-Q plot still shows heavier tails than those expected for a normal distribution. This amounts to primary evidence against normality of the residuals. An Anderson-Darling goodness-of-fit test [67] that resulted in a test statistic of 254.5 and a p value <2.2e-16 confirms a lack of normality for the involved residuals. Nevertheless, compared with the TAMA fit, the normal Q-Q plot for the biphasic model reveals a larger region where data track a normal distribution pattern.

Residuals and normal probability plots ((a) and (b) one to one) for the fitting of the biphasic regression model to processed data. Compared with the TAMA fit, distribution of residuals about the zero line improved. Moreover, the Q-Q plot shows heavier tails than those expected for a normal distribution.

D.3. Fitting Results of the Polynomial Model

We now explain the statistical analysis involved in the identification of the polynomial model of (B.27). Again, for data pairs (v_i, u_i), with v_i = ln⁡w_i and u_i = ln⁡a_i, for i = 1,2,….n, from (B.29) and (B.30), we get the regression model

\begin{matrix} v_{i} = \sum_{0}^{m} {p_{k} u_{i}}^{k} + ε_{i}, \end{matrix}

(D.8)

where the additive errors ε₁, ε₂,…, ε_n are independent and identically distributed, with distribution N(0, σ²). Hence, the response v_i above is normally distributed having mean μ_i(u_i, π) with π standing for the parameter vector (p₀, p₁,…, p_m); namely,

\begin{matrix} μ_{i} (u_{i}, π) = \sum_{0}^{m} p_{k} {u_{i}}^{k} . \end{matrix}

(D.9)

Therefore, the associated log-likelihood function l(p₀, p₁,…, p_m, σ) becomes

\begin{matrix} l (p_{0}, p_{1}, \dots p_{m}, σ) = - \frac{n}{2} \log (2 π) - \sum_{1}^{n} \log (σ) \\ - \frac{1}{2} \sum_{1}^{n} {(\frac{w_{i} - μ_{i} (u_{i}, π)}{σ})}^{2} . \end{matrix}

(D.10)

Fitting results of the polynomial regression model of (D.8) through (D.10) are shown in Table 14.

Table 14.

Fitting results of the polynomial regression model. CI stands for confidence interval, RSE stands for standard error of residuals, RMS means multiple R-squared, and ARS is adjusted R-squared.

Residual statistics
Minimum	1Q		Median	3Q	Maximum
-4.4207	-0.1947		0.0248	0.2072	4.5703

Parameters	Estimate	Std. Error	t value	Pr(>\|t\|)	CI (95%)
p ₀	-11.080	0.4136	-26.783	<2e-16	(-11.889e+1, -10.267e+1)
p ₁	4.602	0.7411	6.209	5.54e-10	(3.1489, 6.0543)
p ₂	-3.2120	0.5027	-6.390	1.74e-10	(-4.1976, -2.2267)
p ₃	1.0620	0.1671	6.355	2.17e-10	(0.7344, 1.3894)
p ₄	-0.1678	0.02911	-5.765	8.43e-09	(-0.22483, -0.11072)
p ₅	0.01287	0.002549	5.047	4.57e-07	(0.0078689, 0.017863)
p ₆	- 3.854e-04	8.854e-05	-4.353	1.36e-05	(-0.00055897, -0.00021186)

RSE σ		0.4538 on 10016 df
MRS		0.9254
ARS		0.9254
F-statistic		2.071e+4 on 6 and 10016 df
p-value		< 2.2e-16

Open in a new tab

Reviewing fitting results of the polynomial model, we observe similar results to those we found earlier for the fit of the biphasic model. Moreover, Figure 14 displays similar patterns to those shown in Figure 13. Also for this fit, an Anderson-Darling normality test [67] delivered a test statistic 257.1 and a p value <2.2e-16 that yields remarkable evidence against normality of residuals. Again, comparing with TAMA results, distribution of residuals about the zero line improved. Moreover, the Q-Q plot shows a larger plateau, where residuals agree to a normal distribution pattern.

Residuals and normal probability plots ((a) and (b) one to one) for the fitting of the polynomial regression model of (D.8) through (D.10). Compared with TAMA results, distribution of residuals about the zero line improved. Normal Q-Q plot reveals a large plateau, where residuals conform to a normal distribution, but anyhow heavier tails than expected for such a pattern remain.

E. Fitting Results of Direct Nonlinear Regression Models

This appendix presents the exploration of curvature by means of direct nonlinear regression methods. For that aim, we present fitting results of the nonlinear heteroscedastic model of (14). For comparison, we include fitting results of the homoscedastic case of (15).

Since we deal with data pairs (w_i, a_i) for the heteroscedastic case, the regression equation to be considered turns out to be

\begin{matrix} w_{i} = β_{θ} {a_{i}}^{α_{θ}} + {a_{i}}^{θ} ϵ_{i}, \end{matrix}

(E.1)

where ϵ_i is a normally distributed zero mean random variable with standard deviation σ. Then the response w_i is normally distributed having mean function μ_i(a_i,α_θ, β_θ) given by

\begin{matrix} μ_{i} (a_{i,} α_{θ}, β_{θ}) = β_{θ} {a_{i}}^{α_{θ}} . \end{matrix}

(E.2)

And a variance set as a function σ_i²(a_i) of covariate a_i is, namely,

\begin{matrix} σ_{i}^{2} (a_{i}) = σ^{2} a_{i}^{2 θ} . \end{matrix}

(E.3)

Again, fitting this regression model relied on a likelihood approach, that is, obtaining estimates for the parameters α_θ, β_θ, θ, and σ, which result in maximizing the log-likelihood function l(β, α_θ, β_θ, σ) expressed by

\begin{matrix} l (β, α_{θ}, β_{θ}, σ) = - \frac{n}{2} \log (2 π) - \sum_{1}^{n} \log (σ a_{i}^{θ}) \\ - \frac{1}{2} \sum_{1}^{n} {(\frac{w_{i} - β_{θ} {a_{i}}^{α_{θ}}}{σ a_{i}^{θ}})}^{2}, \end{matrix}

(E.4)

and in turn the linked homoscedastic regression model is

\begin{matrix} w_{i} = β_{o} {a_{i}}^{α_{o}} + ϵ_{i} \end{matrix}

(E.5)

for i = 1,2,…, n and with error terms, ϵ₁, ϵ₂,…, ϵ_n, independent and identically distributed normally with common mean μ set to zero and having an invariant standard deviation σ. The response w_i above is normally distributed with mean function μ_i(a_i, α_o, β_o) given by

\begin{matrix} μ_{i} (a_{i}, α_{o}, β_{o}) = β_{o} {a_{i}}^{α_{o}} . \end{matrix}

(E.6)

Corresponding log-likelihood function l(α_o, β_o, σ) becomes

\begin{matrix} l (α_{o}, β_{o}, σ) = - \frac{n}{2} \log (2 π) - \sum_{1}^{n} \log (σ) \\ - \frac{1}{2} \sum_{1}^{n} {(\frac{w_{i} - β_{o} {a_{i}}^{α_{o}}}{σ})}^{2} . \end{matrix}

(E.7)

It can be ascertained from Table 15 that estimates for the parameters α_o and β_o for raw data are very similar to α_θ, as well as β_θ, for the heteroscedastic regression model of (E.1) and (E.4), respectively.

Table 15.

Maximum likelihood estimates of parameters for the heteroscedastic and homoscedastic regression models ((E.1) and (E.5) one to one). CI stands for confidence interval.

Heteroscedastic fit

Parameter	Estimate	Std. Error	CI (95%)
α _θ	1.1298	0.003414	(1.1230, 1.1366)
β _θ	7.073e-06	1.804e-07	(6.7123e-06, 7.434e-06)
θ	0.4415	0.003378	(0.4347, 0.4483)
σ	1.951e-04	3.972e-06	(1.8716e-04, 2.0304e-04)

Homoscedastic fit

Parameter	Estimate	Std. Error	CI (95%)
α _o	1.1365	0.003560	(1.1293, 1.1437)
β _o	6.724e-06	1.883e-07	(6.347e-06, 7.10e-06)
σ	0.004173	2.948e-05	(0.00411, 0.004229

Open in a new tab

Confidence intervals in Table 15 show some overlap. Therefore, it can be established that parameter estimates differ but just barely. This statement is supported by the estimated value of ρ, Lin's concordance correlation coefficient between projected and observed leaf biomass values. For both regression models, homoscedastic and heteroscedastic, the estimate was ρ = 0.972. Likewise, plotting the mean response curves w_i = β_θa_i^α_θ and w_i = β_oa_i^α_o on a dispersion diagram of weight and area values reveals that curves corresponding to parameter estimates fitted considering heteroscedasticity or not are very similar (Figure 5). Nevertheless, the heteroscedastic fit clearly identifies for the variance a variation pattern that grows in a power function as leaf area increases. Indeed, Figure 15 shows remarkable differences in scatter patterns of dry weight residuals against leaf area for the heteroscedastic and homoscedastic models ((a) and (b), resp.). This can be ascertained from values in Table 6, unveiling that the heteroscedastic model is selected by most agreement indices.

Diagram of dispersion of leaf dry weight residuals compared to leaf area for the fits heteroscedastic and homoscedastic models of (E.1) and (E.5), respectively. Region bounded by red lines determines (95%) confidence intervals for residuals.

Nevertheless, normal Q-Q plots of residuals of the fits of the heteroscedastic model shown in Figure 16 reveal that in spite of data cleaning we can be aware of a severe problem of heavy tails. This points to consideration of a different error structure from the normal one assumed here.

Residuals and normal probability plots ((a) and (b) one to one) for the fitting of the nonlinear heteroscedastic regression model of (E.1). We can observe a slightly biased distribution of residuals around the zero line. Also, normal Q-Q plot in (b) displays heavier tails than those expected for a normal distribution.

Data Availability

Data will be provided by the corresponding author upon usage agreement.

Disclosure

This research was completed while Enrique Villa-Diharce (CVU:208964) was on sabbatical at CICESE.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

1.Huxley J. S. Problems of Relative Growth. London, UK: Methuen; 1932. http://www.archive.org/details/problemsofrelati033234mbp. [Google Scholar]
2.Lu M., Caplan J. S., Bakker J. D., et al. Allometry data and equations for coastal marsh plants. Ecology. 2016;97(12):3554–3554. doi: 10.1002/ecy.1600. [DOI] [Google Scholar]
3.Savage V. M., Gillooly J. F., Woodruff W. H., et al. The predominance of quarter-power scaling in biology. Functional Ecology. 2004;18(2):257–282. doi: 10.1111/j.0269-8463.2004.00856.x. [DOI] [Google Scholar]
4.Myhrvold N. P. Dinosaur metabolism and the allometry of maximum growth rate. PLoS ONE. 2016;11(11) doi: 10.1371/journal.pone.0163205.e0163205 [DOI] [PMC free article] [PubMed] [Google Scholar]
5.West G. B., Brown J. H. The origin of allometric scaling laws in biology from genomes to ecosystems: Towards a quantitative unifying theory of biological structure and organization. Journal of Experimental Biology. 2005;208(9):1575–1592. doi: 10.1242/jeb.01589. [DOI] [PubMed] [Google Scholar]
6.Newman M. E. J. Power laws, Pareto distributions and Zipf's law. Contemporary Physiscs. 2005;46(5):323–351. doi: 10.1080/00107510500052444. [DOI] [Google Scholar]
7.Samaniego H., Moses M. E. Cities as organisms: allometric scaling of urban road networks. Journal of Transport and Land Use. 2008;1:21–39. [Google Scholar]
8.Maritan A., Rigon R., Banavar J. R., Rinaldo A. Network allometry. Geophysical Research Letters. 2002;29(11):3-1–3-4. doi: 10.1029/2001GL014533. [DOI] [Google Scholar]
9.De Robertis A., Williams K. Weight-length relationships in fisheries studies: The standard allometric model should be applied with caution. Transactions of the American Fisheries Society. 2008;137(3):707–719. doi: 10.1577/T07-124.1. [DOI] [Google Scholar]
10.Echavarría-Heras H., Leal-Ramírez C., Villa-Diharce E., Cazarez-Castro N. R. The effect of parameter variability in the allometric projection of leaf growth rates for eelgrass (Zostera marina L.) II: the importance of data quality control procedures in bias reduction. Theoretical Biology and Medical Modelling. 2015;12(30) doi: 10.1186/s12976-015-0025-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Echavarría-Heras H., Leal-Ramírez C., Villa-Diharce E., Cazarez-Castro N. On the suitability of an allometric proxy for nondestructive estimation of average leaf dry weight in eelgrass shoots I: sensitivity analysis and examination of the influences of data quality, analysis method, and sample size on precision. Theoretical Biology and Medical Modelling. 2018;15(4) doi: 10.1186/s12976-018-0076-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Solana-Arellano E., Echavarría-Heras H., Leal-Ramírez C., Lee K.-S. The effect of parameter variability in the allometric projection of leaf growth rates for eelgrass (Zostera marina L.) Latin American Journal of Aquatic Research. 2014;42(5):1099–1108. doi: 10.3856/vol42-issue5-fulltext-14. [DOI] [Google Scholar]
13.Montesinos-López A., Villa-Diharce E., Echavarría-Heras H., Leal-Ramírez C. Correction to: Improved allometric proxies for eelgrass conservation. Journal of Coastal Conservation. 2018;23(1):71–91. doi: 10.1007/s11852-018-0639-4. [DOI] [Google Scholar]
14.Agutter P. S., Tuszynski J. A. Analytic theories of allometric scaling. Journal of Experimental Biology. 2011;214(7):1055–1062. doi: 10.1242/jeb.054502. [DOI] [PubMed] [Google Scholar]
15.Hui D., Jackson R. B. Uncertainty in allometric exponent estimation: a case study in scaling metabolic rate with body mass. Journal of Theoretical Biology. 2007;249(1):168–177. doi: 10.1016/j.jtbi.2007.07.003. [DOI] [PubMed] [Google Scholar]
16.Packard G. C. Is logarithmic transformation necessary in allometry? Biological Journal of the Linnean Society. 2013;109(2):476–486. doi: 10.1111/bij.12038. [DOI] [Google Scholar]
17.Mascaro J., Litton C. M., Hughes R. F., Uowolo A., Schnitzer S. A. Minimizing bias in biomass allometry: model selection and log-transformation of data. Biotropica. 2011;43(6):649–653. doi: 10.1111/j.1744-7429.2011.00798.x. [DOI] [Google Scholar]
18.Mascaro J., Litton C. M., Hughes R. F., Uowolo A., Schnitzer S. A. Is logarithmic transformation necessary in allometry? ten, one-hundred, one-thousand-times yes. Biological Journal of the Linnean Society. 2014;111(1):230–233. doi: 10.1111/bij.12177. [DOI] [Google Scholar]
19.Thompson D. W. On Growth and Form. New York, NY, USA: Macmillan; 1943. [Google Scholar]
20.Smith R. J. Allometric scaling in comparative biology: problems of concept and method. American Journal of Physiology-Regulatory, Integrative and Comparative Physiology. 1984;246(2):R152–R160. doi: 10.1152/ajpregu.1984.246.2.R152. [DOI] [PubMed] [Google Scholar]
21.Lovett D. L., Felder D. L. Application of regression techniques to studies of relative growth in crustaceans. Journal of Crustacean Biology. 1989;9(4):529–539. doi: 10.2307/1548585. [DOI] [Google Scholar]
22.Bales G. S. Heterochrony in brontothere horn evolution: allometric interpretations and the effect of life history scaling. Paleobiology. 1996;22(4):481–495. doi: 10.1017/S009483730001647X. [DOI] [Google Scholar]
23.Lagergren R., Svensson J.-E., Stenson J. A. E. Models of ontogenetic allometry in cladoceran morphology studies. Hydrobiologia. 2007;594(1):109–116. doi: 10.1007/s10750-007-9085-2. [DOI] [Google Scholar]
24.Sartori A. F., Ball A. D. Morphology and postlarval development of the ligament of Thracia phaseolina (bivalvia: Thraciidae), with a discussion of model choice in allometric studies. Journal of Molluscan Studies. 2009;75(3):295–304. doi: 10.1093/mollus/eyp029. [DOI] [Google Scholar]
25.Packard G. C. Multiplicative by nature: logarithmic transformation in allometry. Journal of Experimental Zoology Part B: Molecular and Developmental Evolution. 2014;322(4):202–207. doi: 10.1002/jez.b.22570. [DOI] [PubMed] [Google Scholar]
26.Packard G. C. Quantifying the curvilinear metabolic scaling in mammals. Journal of Experimental Zoology Part A: Ecological Genetics and Physiology. 2015;323(8):540–546. doi: 10.1002/jez.1946. [DOI] [PubMed] [Google Scholar]
27.Packard G. C. Relative growth by the elongated jaws of gars: a perspective on polyphasic loglinear allometry. Journal of Experimental Zoology Part B: Molecular and Developmental Evolution. 2016;326(3):168–175. doi: 10.1002/jez.b.22673. [DOI] [PubMed] [Google Scholar]
28.Packard G. C. Is complex allometry in field metabolic rates of mammals a statistical artifact? Comparative Biochemistry and Physiology Part A: Molecular & Integrative Physiology. 2017;203:322–327. doi: 10.1016/j.cbpa.2016.10.005. [DOI] [PubMed] [Google Scholar]
29.Packard G. C. The essential role for graphs in allometric analysis. Biological Journal of the Linnean Society. 2017;120:468–473. [Google Scholar]
30.Lai J., Yang B., Lin D., Kerkhoff A. J., Ma K., Bond-Lamberty B. The allometry of coarse root biomass: log-transformed linear regression or nonlinear regression? PLoS ONE. 2013;8(10) doi: 10.1371/journal.pone.0077007.e77007 [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Klingenberg C. P. Heterochrony and allometry: the analysis of evolutionary change in ontogeny. Biological Reviews. 1998;73(1):79–123. doi: 10.1017/s000632319800512x. [DOI] [PubMed] [Google Scholar]
32.Nevill A. M., Bate S., Holder R. L. Modeling physiological and anthropometric variables known to vary with body size and other confounding variables. Yearbook of Physical Anthropology. 2005;48:141–153. doi: 10.1002/ajpa.20356. [DOI] [PubMed] [Google Scholar]
33.Kerkhoff A. J., Enquist B. J. Multiplicative by nature: why logarithmic transformation is necessary in allometry. Journal of Theoretical Biology. 2009;257(3):519–521. doi: 10.1016/j.jtbi.2008.12.026. [DOI] [Google Scholar]
34.Xiao X., White E. P., Hooten M. B., Durham S. L. On the use of log-transformation vs. nonlinear regression for analyzing biological power laws. Ecology. 2011;92(10):1887–1894. doi: 10.1890/11-0538.1. [DOI] [PubMed] [Google Scholar]
35.White E. P., Xiao X., Isaac N. J. B., Sibly R. M. Methodological tools. In: Sibly R. M., Brown J. H., Kodric-Brown A., editors. Metabolic Ecology: A Scaling Approach. Oxford, UK: Wiley-Blackwell; 2012. pp. 9–20. [Google Scholar]
36.Ballantyne F. Evaluating model fit to determine if logarithmic transformations are necessary in allometry: a comment on the exchange between Packard (2009) and Kerkhoff and Enquist (2009) Journal of Theoretical Biology. 2013;317:418–421. doi: 10.1016/j.jtbi.2012.09.035. [DOI] [PubMed] [Google Scholar]
37.Glazier D. S. Log-transformation is useful for examining proportional relationships in allometric scaling. Journal of Theoretical Biology. 2013;334:200–203. doi: 10.1016/j.jtbi.2013.06.017. [DOI] [PubMed] [Google Scholar]
38.Niklas K. J., Hammond S. T. Assessing scaling relationships: uses, abuses, and alternatives. International Journal of Plant Sciences. 2014;175(7):754–763. doi: 10.1086/677238. [DOI] [Google Scholar]
39.Lemaître J. F., Vanpé C., Plard F., Pélabon C., Gaillard J. M. Response to Packard: make sure we do not throw out the biological baby with the statistical bath water when performing allometric analyses. Biology Letters. 2015;11(6):p. 20150144. doi: 10.1098/rsbl.2015.0144. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Strauss R. E., Huxley J. S. The study of allometry since Huxley. In: Thompson D. H., editor. Problems of Relative Growth. Baltimore, UK: Johns Hopkins University Press; 1993. [Google Scholar]
41.Knell R. J. On the analysis of non-linear allometries. Ecological Entomology. 2009;34(1):1–11. doi: 10.1111/j.1365-2311.2008.01022.x. [DOI] [Google Scholar]
42.MacLeod C. D. Assessing the shape and topology of allometric relationships with body mass: a case study using testes mass allometry. Methods in Ecology and Evolution. 2010;1(4):359–370. doi: 10.1111/j.2041-210X.2010.00037.x. [DOI] [Google Scholar]
43.Mori S., Yamaji K., Ishida A., et al. Mixed-power scaling of whole-plant respiration from seedlings to giant trees. Proceedings of the National Acadamy of Sciences of the United States of America. 2010;107(4):1447–1451. doi: 10.1073/pnas.0902554107. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Snelling E. P., Taggart D. A., Maloney S. K., Farrell A. P., Seymour R. S. Biphasic allometry of cardiac growth in the developing Kangaroo Macropus fuliginosus. Physiological and Biochemical Zoology. 2015;88(2):216–225. doi: 10.1086/679718. [DOI] [PubMed] [Google Scholar]
45.Duffy J. E. Ecology and eusociality in sponge-dwelling shrimp. In: Organisms., Duffy J. E., Thiel M., editors. Evolutionary Ecology of Social and Sexual Systems, Crustaceans as Model. New York, NY, USA: Oxford University Press; 2007. pp. 1–520. [Google Scholar]
46.Batterham A. M., George K. P. Allometric modeling does not determine a dimensionless power function ratio for maximal muscular function. Journal of Applied Physiology. 1997;83(6):2158–2166. doi: 10.1152/jappl.1997.83.6.2158. [DOI] [PubMed] [Google Scholar]
47.Chave J., Andalo C., Brown S., et al. Tree allometry and improved estimation of carbon stocks and balance in tropical forests. Oecologia. 2005;145(1):87–99. doi: 10.1007/s00442-005-0100-x. [DOI] [PubMed] [Google Scholar]
48.Niklas K. J. Size-dependent allometry of tree height, diameter and trunk-taper. Annals of Botany. 1995;75(3):217–227. doi: 10.1006/anbo.1995.1015. [DOI] [Google Scholar]
49.Niklas K. J. Mechanical properties of black locust (Robinia pseudoacacia L.) Wood. Size-and age-dependent variations in sap- and heartwood. Annals of Botany. 1997;79(3):265–272. doi: 10.1006/anbo.1996.0340. [DOI] [Google Scholar]
50.Packard G. C. Why allometric variation in mammalian metabolism is curvilinear on the logarithmic scale. Journal of Experimental Zoology Part A: Ecological and Integrative Physiology. 2017;327(9):537–541. doi: 10.1002/jez.2129. [DOI] [PubMed] [Google Scholar]
51.Baskerville G. L. Use of logarithmic regression in the estimation of plant biomass. Canadian Journal of Forest Research. 1972;2(1):49–53. doi: 10.1139/x72-009. [DOI] [Google Scholar]
52.Newman M. C. Regression analysis of log-transformed data: statistical bias and its correction. Environmental Toxicology and Chemistry. 1993;12(6):1129–1133. doi: 10.1002/etc.5620120618. [DOI] [Google Scholar]
53.Smith R. J. Logarithmic transformation bias in allometry. American Journal of Physical Anthropology. 1993;90(2):215–228. doi: 10.1002/ajpa.1330900208. [DOI] [Google Scholar]
54.Duan N. Smearing estimate: a nonparametric retransformation method. Journal of the American Statistical Association. 1983;78(383):605–610. doi: 10.1080/01621459.1983.10478017. doi: 10.1080/01621459.1983.10478017. [DOI] [Google Scholar]
55.Koch R. W., Smillie G. M. Comment on “River loads underestimated by rating curves” by R. I. Ferguson. Water Resources Research. 1986;22(13):2121–2122. doi: 10.1029/WR022i013p02121. [DOI] [Google Scholar]
56.Manning W. G. The logged dependent variable, heteroscedasticity, and the retransformation problem. Journal of Health Economics. 1998;17(3):283–295. doi: 10.1016/S0167-6296(98)00025-3. [DOI] [PubMed] [Google Scholar]
57.Zeng W. S., Zeng W. S., Tang S. Z. Bias correction in logarithmic regression and comparison with weighted regression for nonlinear models. Nature Precedings. 2011 doi: 10.1038/npre.2011.6708.1. [DOI] [Google Scholar]
58.Miller J. Short report: reaction time analysis with outlier exclusion: bias varies with sample size. The Quarterly Journal of Experimental Psychology Section A. 1991;43(4):907–912. doi: 10.1080/14640749108400962. [DOI] [PubMed] [Google Scholar]
59.Packard G. C. Is non-loglinear allometry a statistical artifact? Biological Journal of the Linnean Society. 2012;107(4):764–773. doi: 10.1111/j.1095-8312.2012.01995.x. [DOI] [Google Scholar]
60.Berg E. J. Heaviside's Operational Calculus. 2nd. New York, NY, USA: McGraw-Hill; 1936. [Google Scholar]
61.I-Kuei Lin L. A concordance correlation coefficient to evaluate reproducibility. Biometrics. 1989;45(1):255–268. doi: 10.2307/2532051. [DOI] [PubMed] [Google Scholar]
62.McBride G. B. HAM2005-062. Hamilton, New Zeeland: National Institute of Water & Atmospheric Research; 2005. A Proposal for strength-of-agreement criteria for Lin’s Concordance Correlation Coefficient. [Google Scholar]
63.Zeng W. S., Zhang L. J., Chen X. Y., Cheng Z. C., Ma K. X., Li Z. H. Construction of compatible and additive individual-tree biomass models for Pinus tabulaeformis in China. Canadian Journal of Forest Research. 2017;47(4):467–475. doi: 10.1139/cjfr-2016-0342. [DOI] [Google Scholar]
64.Zeng W. S., Tang S. Z. Goodness evaluation and precision analysis of tree biomass equations. Sci Silvae Sinicae. 2011;47:106–113. [Google Scholar]
65.Zeng W., Duo H., Lei X., et al. Individual tree biomass equations and growth models sensitive to climate variables for Larix spp. in China. European Journal of Forest Research. 2017;136(2):233–249. doi: 10.1007/s10342-017-1024-9. [DOI] [Google Scholar]
66.Akaike H. A new look at the statistical model identification. IEEE Transactions on Automatic Control. 1974;19(6):716–723. doi: 10.1109/TAC.1974.1100705. [DOI] [Google Scholar]
67.Anderson T. W., Darling D. A. Asymptotic theory of certain goodness of fit criteria based on stochastic processes. Annals of Mathematical Statistics. 1952;23:193–212. doi: 10.1214/aoms/1177729437. [DOI] [Google Scholar]
68.Bervian G., Fontoura N. F., Haimovici M. Statistical model of variable allometric growth: otolith growth in Micropogonias furnieri(Actinopterygii, Sciaenidae) Journal of Fish Biology. 2006;68(1):196–208. doi: 10.1111/j.0022-1112.2006.00890.x. [DOI] [Google Scholar]
69.Brosowski B., Deutsch F. An elementary proof of the Stone-Weierstrass theorem. Proceedings of the American Mathematical Society. 1981;81(1):89–92. doi: 10.2307/2043993. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data will be provided by the corresponding author upon usage agreement.

[B1] 1.Huxley J. S. Problems of Relative Growth. London, UK: Methuen; 1932. http://www.archive.org/details/problemsofrelati033234mbp. [Google Scholar]

[B2] 2.Lu M., Caplan J. S., Bakker J. D., et al. Allometry data and equations for coastal marsh plants. Ecology. 2016;97(12):3554–3554. doi: 10.1002/ecy.1600. [DOI] [Google Scholar]

[B3] 3.Savage V. M., Gillooly J. F., Woodruff W. H., et al. The predominance of quarter-power scaling in biology. Functional Ecology. 2004;18(2):257–282. doi: 10.1111/j.0269-8463.2004.00856.x. [DOI] [Google Scholar]

[B4] 4.Myhrvold N. P. Dinosaur metabolism and the allometry of maximum growth rate. PLoS ONE. 2016;11(11) doi: 10.1371/journal.pone.0163205.e0163205 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] 5.West G. B., Brown J. H. The origin of allometric scaling laws in biology from genomes to ecosystems: Towards a quantitative unifying theory of biological structure and organization. Journal of Experimental Biology. 2005;208(9):1575–1592. doi: 10.1242/jeb.01589. [DOI] [PubMed] [Google Scholar]

[B6] 6.Newman M. E. J. Power laws, Pareto distributions and Zipf's law. Contemporary Physiscs. 2005;46(5):323–351. doi: 10.1080/00107510500052444. [DOI] [Google Scholar]

[B7] 7.Samaniego H., Moses M. E. Cities as organisms: allometric scaling of urban road networks. Journal of Transport and Land Use. 2008;1:21–39. [Google Scholar]

[B8] 8.Maritan A., Rigon R., Banavar J. R., Rinaldo A. Network allometry. Geophysical Research Letters. 2002;29(11):3-1–3-4. doi: 10.1029/2001GL014533. [DOI] [Google Scholar]

[B9] 9.De Robertis A., Williams K. Weight-length relationships in fisheries studies: The standard allometric model should be applied with caution. Transactions of the American Fisheries Society. 2008;137(3):707–719. doi: 10.1577/T07-124.1. [DOI] [Google Scholar]

[B10] 10.Echavarría-Heras H., Leal-Ramírez C., Villa-Diharce E., Cazarez-Castro N. R. The effect of parameter variability in the allometric projection of leaf growth rates for eelgrass (Zostera marina L.) II: the importance of data quality control procedures in bias reduction. Theoretical Biology and Medical Modelling. 2015;12(30) doi: 10.1186/s12976-015-0025-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11.Echavarría-Heras H., Leal-Ramírez C., Villa-Diharce E., Cazarez-Castro N. On the suitability of an allometric proxy for nondestructive estimation of average leaf dry weight in eelgrass shoots I: sensitivity analysis and examination of the influences of data quality, analysis method, and sample size on precision. Theoretical Biology and Medical Modelling. 2018;15(4) doi: 10.1186/s12976-018-0076-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B12] 12.Solana-Arellano E., Echavarría-Heras H., Leal-Ramírez C., Lee K.-S. The effect of parameter variability in the allometric projection of leaf growth rates for eelgrass (Zostera marina L.) Latin American Journal of Aquatic Research. 2014;42(5):1099–1108. doi: 10.3856/vol42-issue5-fulltext-14. [DOI] [Google Scholar]

[B13] 13.Montesinos-López A., Villa-Diharce E., Echavarría-Heras H., Leal-Ramírez C. Correction to: Improved allometric proxies for eelgrass conservation. Journal of Coastal Conservation. 2018;23(1):71–91. doi: 10.1007/s11852-018-0639-4. [DOI] [Google Scholar]

[B14] 14.Agutter P. S., Tuszynski J. A. Analytic theories of allometric scaling. Journal of Experimental Biology. 2011;214(7):1055–1062. doi: 10.1242/jeb.054502. [DOI] [PubMed] [Google Scholar]

[B15] 15.Hui D., Jackson R. B. Uncertainty in allometric exponent estimation: a case study in scaling metabolic rate with body mass. Journal of Theoretical Biology. 2007;249(1):168–177. doi: 10.1016/j.jtbi.2007.07.003. [DOI] [PubMed] [Google Scholar]

[B17] 16.Packard G. C. Is logarithmic transformation necessary in allometry? Biological Journal of the Linnean Society. 2013;109(2):476–486. doi: 10.1111/bij.12038. [DOI] [Google Scholar]

[B18] 17.Mascaro J., Litton C. M., Hughes R. F., Uowolo A., Schnitzer S. A. Minimizing bias in biomass allometry: model selection and log-transformation of data. Biotropica. 2011;43(6):649–653. doi: 10.1111/j.1744-7429.2011.00798.x. [DOI] [Google Scholar]

[B16] 18.Mascaro J., Litton C. M., Hughes R. F., Uowolo A., Schnitzer S. A. Is logarithmic transformation necessary in allometry? ten, one-hundred, one-thousand-times yes. Biological Journal of the Linnean Society. 2014;111(1):230–233. doi: 10.1111/bij.12177. [DOI] [Google Scholar]

[B19] 19.Thompson D. W. On Growth and Form. New York, NY, USA: Macmillan; 1943. [Google Scholar]

[B20] 20.Smith R. J. Allometric scaling in comparative biology: problems of concept and method. American Journal of Physiology-Regulatory, Integrative and Comparative Physiology. 1984;246(2):R152–R160. doi: 10.1152/ajpregu.1984.246.2.R152. [DOI] [PubMed] [Google Scholar]

[B21] 21.Lovett D. L., Felder D. L. Application of regression techniques to studies of relative growth in crustaceans. Journal of Crustacean Biology. 1989;9(4):529–539. doi: 10.2307/1548585. [DOI] [Google Scholar]

[B22] 22.Bales G. S. Heterochrony in brontothere horn evolution: allometric interpretations and the effect of life history scaling. Paleobiology. 1996;22(4):481–495. doi: 10.1017/S009483730001647X. [DOI] [Google Scholar]

[B23] 23.Lagergren R., Svensson J.-E., Stenson J. A. E. Models of ontogenetic allometry in cladoceran morphology studies. Hydrobiologia. 2007;594(1):109–116. doi: 10.1007/s10750-007-9085-2. [DOI] [Google Scholar]

[B24] 24.Sartori A. F., Ball A. D. Morphology and postlarval development of the ligament of Thracia phaseolina (bivalvia: Thraciidae), with a discussion of model choice in allometric studies. Journal of Molluscan Studies. 2009;75(3):295–304. doi: 10.1093/mollus/eyp029. [DOI] [Google Scholar]

[B25] 25.Packard G. C. Multiplicative by nature: logarithmic transformation in allometry. Journal of Experimental Zoology Part B: Molecular and Developmental Evolution. 2014;322(4):202–207. doi: 10.1002/jez.b.22570. [DOI] [PubMed] [Google Scholar]

[B26] 26.Packard G. C. Quantifying the curvilinear metabolic scaling in mammals. Journal of Experimental Zoology Part A: Ecological Genetics and Physiology. 2015;323(8):540–546. doi: 10.1002/jez.1946. [DOI] [PubMed] [Google Scholar]

[B27] 27.Packard G. C. Relative growth by the elongated jaws of gars: a perspective on polyphasic loglinear allometry. Journal of Experimental Zoology Part B: Molecular and Developmental Evolution. 2016;326(3):168–175. doi: 10.1002/jez.b.22673. [DOI] [PubMed] [Google Scholar]

[B28] 28.Packard G. C. Is complex allometry in field metabolic rates of mammals a statistical artifact? Comparative Biochemistry and Physiology Part A: Molecular & Integrative Physiology. 2017;203:322–327. doi: 10.1016/j.cbpa.2016.10.005. [DOI] [PubMed] [Google Scholar]

[B29] 29.Packard G. C. The essential role for graphs in allometric analysis. Biological Journal of the Linnean Society. 2017;120:468–473. [Google Scholar]

[B30] 30.Lai J., Yang B., Lin D., Kerkhoff A. J., Ma K., Bond-Lamberty B. The allometry of coarse root biomass: log-transformed linear regression or nonlinear regression? PLoS ONE. 2013;8(10) doi: 10.1371/journal.pone.0077007.e77007 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B31] 31.Klingenberg C. P. Heterochrony and allometry: the analysis of evolutionary change in ontogeny. Biological Reviews. 1998;73(1):79–123. doi: 10.1017/s000632319800512x. [DOI] [PubMed] [Google Scholar]

[B32] 32.Nevill A. M., Bate S., Holder R. L. Modeling physiological and anthropometric variables known to vary with body size and other confounding variables. Yearbook of Physical Anthropology. 2005;48:141–153. doi: 10.1002/ajpa.20356. [DOI] [PubMed] [Google Scholar]

[B33] 33.Kerkhoff A. J., Enquist B. J. Multiplicative by nature: why logarithmic transformation is necessary in allometry. Journal of Theoretical Biology. 2009;257(3):519–521. doi: 10.1016/j.jtbi.2008.12.026. [DOI] [Google Scholar]

[B34] 34.Xiao X., White E. P., Hooten M. B., Durham S. L. On the use of log-transformation vs. nonlinear regression for analyzing biological power laws. Ecology. 2011;92(10):1887–1894. doi: 10.1890/11-0538.1. [DOI] [PubMed] [Google Scholar]

[B35] 35.White E. P., Xiao X., Isaac N. J. B., Sibly R. M. Methodological tools. In: Sibly R. M., Brown J. H., Kodric-Brown A., editors. Metabolic Ecology: A Scaling Approach. Oxford, UK: Wiley-Blackwell; 2012. pp. 9–20. [Google Scholar]

[B36] 36.Ballantyne F. Evaluating model fit to determine if logarithmic transformations are necessary in allometry: a comment on the exchange between Packard (2009) and Kerkhoff and Enquist (2009) Journal of Theoretical Biology. 2013;317:418–421. doi: 10.1016/j.jtbi.2012.09.035. [DOI] [PubMed] [Google Scholar]

[B37] 37.Glazier D. S. Log-transformation is useful for examining proportional relationships in allometric scaling. Journal of Theoretical Biology. 2013;334:200–203. doi: 10.1016/j.jtbi.2013.06.017. [DOI] [PubMed] [Google Scholar]

[B38] 38.Niklas K. J., Hammond S. T. Assessing scaling relationships: uses, abuses, and alternatives. International Journal of Plant Sciences. 2014;175(7):754–763. doi: 10.1086/677238. [DOI] [Google Scholar]

[B39] 39.Lemaître J. F., Vanpé C., Plard F., Pélabon C., Gaillard J. M. Response to Packard: make sure we do not throw out the biological baby with the statistical bath water when performing allometric analyses. Biology Letters. 2015;11(6):p. 20150144. doi: 10.1098/rsbl.2015.0144. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B40] 40.Strauss R. E., Huxley J. S. The study of allometry since Huxley. In: Thompson D. H., editor. Problems of Relative Growth. Baltimore, UK: Johns Hopkins University Press; 1993. [Google Scholar]

[B41] 41.Knell R. J. On the analysis of non-linear allometries. Ecological Entomology. 2009;34(1):1–11. doi: 10.1111/j.1365-2311.2008.01022.x. [DOI] [Google Scholar]

[B42] 42.MacLeod C. D. Assessing the shape and topology of allometric relationships with body mass: a case study using testes mass allometry. Methods in Ecology and Evolution. 2010;1(4):359–370. doi: 10.1111/j.2041-210X.2010.00037.x. [DOI] [Google Scholar]

[B43] 43.Mori S., Yamaji K., Ishida A., et al. Mixed-power scaling of whole-plant respiration from seedlings to giant trees. Proceedings of the National Acadamy of Sciences of the United States of America. 2010;107(4):1447–1451. doi: 10.1073/pnas.0902554107. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B44] 44.Snelling E. P., Taggart D. A., Maloney S. K., Farrell A. P., Seymour R. S. Biphasic allometry of cardiac growth in the developing Kangaroo Macropus fuliginosus. Physiological and Biochemical Zoology. 2015;88(2):216–225. doi: 10.1086/679718. [DOI] [PubMed] [Google Scholar]

[B45] 45.Duffy J. E. Ecology and eusociality in sponge-dwelling shrimp. In: Organisms., Duffy J. E., Thiel M., editors. Evolutionary Ecology of Social and Sexual Systems, Crustaceans as Model. New York, NY, USA: Oxford University Press; 2007. pp. 1–520. [Google Scholar]

[B46] 46.Batterham A. M., George K. P. Allometric modeling does not determine a dimensionless power function ratio for maximal muscular function. Journal of Applied Physiology. 1997;83(6):2158–2166. doi: 10.1152/jappl.1997.83.6.2158. [DOI] [PubMed] [Google Scholar]

[B47] 47.Chave J., Andalo C., Brown S., et al. Tree allometry and improved estimation of carbon stocks and balance in tropical forests. Oecologia. 2005;145(1):87–99. doi: 10.1007/s00442-005-0100-x. [DOI] [PubMed] [Google Scholar]

[B48] 48.Niklas K. J. Size-dependent allometry of tree height, diameter and trunk-taper. Annals of Botany. 1995;75(3):217–227. doi: 10.1006/anbo.1995.1015. [DOI] [Google Scholar]

[B49] 49.Niklas K. J. Mechanical properties of black locust (Robinia pseudoacacia L.) Wood. Size-and age-dependent variations in sap- and heartwood. Annals of Botany. 1997;79(3):265–272. doi: 10.1006/anbo.1996.0340. [DOI] [Google Scholar]

[B50] 50.Packard G. C. Why allometric variation in mammalian metabolism is curvilinear on the logarithmic scale. Journal of Experimental Zoology Part A: Ecological and Integrative Physiology. 2017;327(9):537–541. doi: 10.1002/jez.2129. [DOI] [PubMed] [Google Scholar]

[B51] 51.Baskerville G. L. Use of logarithmic regression in the estimation of plant biomass. Canadian Journal of Forest Research. 1972;2(1):49–53. doi: 10.1139/x72-009. [DOI] [Google Scholar]

[B52] 52.Newman M. C. Regression analysis of log-transformed data: statistical bias and its correction. Environmental Toxicology and Chemistry. 1993;12(6):1129–1133. doi: 10.1002/etc.5620120618. [DOI] [Google Scholar]

[B53] 53.Smith R. J. Logarithmic transformation bias in allometry. American Journal of Physical Anthropology. 1993;90(2):215–228. doi: 10.1002/ajpa.1330900208. [DOI] [Google Scholar]

[B54] 54.Duan N. Smearing estimate: a nonparametric retransformation method. Journal of the American Statistical Association. 1983;78(383):605–610. doi: 10.1080/01621459.1983.10478017. doi: 10.1080/01621459.1983.10478017. [DOI] [Google Scholar]

[B55] 55.Koch R. W., Smillie G. M. Comment on “River loads underestimated by rating curves” by R. I. Ferguson. Water Resources Research. 1986;22(13):2121–2122. doi: 10.1029/WR022i013p02121. [DOI] [Google Scholar]

[B56] 56.Manning W. G. The logged dependent variable, heteroscedasticity, and the retransformation problem. Journal of Health Economics. 1998;17(3):283–295. doi: 10.1016/S0167-6296(98)00025-3. [DOI] [PubMed] [Google Scholar]

[B57] 57.Zeng W. S., Zeng W. S., Tang S. Z. Bias correction in logarithmic regression and comparison with weighted regression for nonlinear models. Nature Precedings. 2011 doi: 10.1038/npre.2011.6708.1. [DOI] [Google Scholar]

[B58] 58.Miller J. Short report: reaction time analysis with outlier exclusion: bias varies with sample size. The Quarterly Journal of Experimental Psychology Section A. 1991;43(4):907–912. doi: 10.1080/14640749108400962. [DOI] [PubMed] [Google Scholar]

[B59] 59.Packard G. C. Is non-loglinear allometry a statistical artifact? Biological Journal of the Linnean Society. 2012;107(4):764–773. doi: 10.1111/j.1095-8312.2012.01995.x. [DOI] [Google Scholar]

[B60] 60.Berg E. J. Heaviside's Operational Calculus. 2nd. New York, NY, USA: McGraw-Hill; 1936. [Google Scholar]

[B61] 61.I-Kuei Lin L. A concordance correlation coefficient to evaluate reproducibility. Biometrics. 1989;45(1):255–268. doi: 10.2307/2532051. [DOI] [PubMed] [Google Scholar]

[B62] 62.McBride G. B. HAM2005-062. Hamilton, New Zeeland: National Institute of Water & Atmospheric Research; 2005. A Proposal for strength-of-agreement criteria for Lin’s Concordance Correlation Coefficient. [Google Scholar]

[B63] 63.Zeng W. S., Zhang L. J., Chen X. Y., Cheng Z. C., Ma K. X., Li Z. H. Construction of compatible and additive individual-tree biomass models for Pinus tabulaeformis in China. Canadian Journal of Forest Research. 2017;47(4):467–475. doi: 10.1139/cjfr-2016-0342. [DOI] [Google Scholar]

[B64] 64.Zeng W. S., Tang S. Z. Goodness evaluation and precision analysis of tree biomass equations. Sci Silvae Sinicae. 2011;47:106–113. [Google Scholar]

[B65] 65.Zeng W., Duo H., Lei X., et al. Individual tree biomass equations and growth models sensitive to climate variables for Larix spp. in China. European Journal of Forest Research. 2017;136(2):233–249. doi: 10.1007/s10342-017-1024-9. [DOI] [Google Scholar]

[B66] 66.Akaike H. A new look at the statistical model identification. IEEE Transactions on Automatic Control. 1974;19(6):716–723. doi: 10.1109/TAC.1974.1100705. [DOI] [Google Scholar]

[B67] 67.Anderson T. W., Darling D. A. Asymptotic theory of certain goodness of fit criteria based on stochastic processes. Annals of Mathematical Statistics. 1952;23:193–212. doi: 10.1214/aoms/1177729437. [DOI] [Google Scholar]

[B69] 68.Bervian G., Fontoura N. F., Haimovici M. Statistical model of variable allometric growth: otolith growth in Micropogonias furnieri(Actinopterygii, Sciaenidae) Journal of Fish Biology. 2006;68(1):196–208. doi: 10.1111/j.0022-1112.2006.00890.x. [DOI] [Google Scholar]

[B68] 69.Brosowski B., Deutsch F. An elementary proof of the Stone-Weierstrass theorem. Proceedings of the American Mathematical Society. 1981;81(1):89–92. doi: 10.2307/2043993. [DOI] [Google Scholar]

PERMALINK

Examination of the Effects of Curvature in Geometrical Space on Accuracy of Scaling Derived Projections of Plant Biomass Units: Applications to the Assessment of Average Leaf Biomass in Eelgrass Shoots

Héctor Echavarría-Heras

Cecilia Leal-Ramírez

Enrique Villa-Diharce

Abelardo Montesinos-López

Abstract

1. Introduction

2. Materials and Methods

2.1. Data

2.2. Models

2.2.1. Biphasic Model in Geometrical Space

2.2.2. Polynomial Model in Geometrical Space

2.2.3. Nonlinear Heteroscedastic Model in Arithmetical Space

3. Results

3.1. Fitting Results of Geometrical Space Models

Table 1.

Table 2.

Figure 1.

Figure 2.

Figure 3.

3.2. Model Selection in Geometrical Space

Table 3.

3.3. Retransformation Results

Table 4.

Figure 4.

3.4. Assessing Curvature by Direct Nonlinear Regression

Table 5.

Table 6.

Figure 5.

3.5. Model Assessment in Arithmetical Space

Table 7.

3.6. Implications for Allometric Proxies of Mean Leaf Biomass in Eelgrass Shoots

Table 8.

Figure 6.

4. Discussion

Figure 7.

5. Conclusion

Acknowledgments

Appendix

A. Data Exploratory Analysis

Table 9.

Table 10.

Figure 8.

Table 11.

Figure 9.

Figure 10.

B. Derivation of Regression Protocols

B.1. Generalized Bivariate Allometric Model

B.2. Traditional Analysis Method of Allometry (TAMA)

B.3. Biphasic Model in Geometrical Space

B.4. Polynomial Model in Geometrical Space

B.5. Nonlinear Regression Models

C. Forms of the Correction Factor of Bias of Retransformation of the Regression Error

D. Fitting Results of Regresion Models in Geometrical Space

D.1. Fitting Results of the TAMA Protocol

Table 12.

Figure 11.

Figure 12.

D.2. Fitting Results of the Biphasic Model

Table 13.

Figure 13.

D.3. Fitting Results of the Polynomial Model

Table 14.

Figure 14.

E. Fitting Results of Direct Nonlinear Regression Models

Table 15.

Figure 15.

Figure 16.

Data Availability

Disclosure

Conflicts of Interest

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles