Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2017 Jun 27.
Published in final edited form as: Ecology. 2016 Feb;97(2):286–293. doi: 10.1890/15-0086.1

Improving phylogenetic regression under complex evolutionary models

Florent Mazel 1, T Jonathan Davies 2, Damien Georges 3, Sébastien Lavergne 4, Wilfried Thuiller 5, Pedro R Peres-Neto 6
PMCID: PMC5486445  EMSID: EMS73200  PMID: 27145604

Abstract

Phylogenetic Generalised Least Square (PGLS) is the tool of choice among phylogenetic comparative methods to measure the correlation between species features such as morphological and life-history traits or niche characteristics. In its usual form it assumes that the residual variation follows a homogenous model of evolution across the branches of the phylogenetic tree. Since a homogenous model of evolution is unlikely to be realistic in nature, we explored the robustness of the phylogenetic regression when this assumption is violated. We did so by simulating a set of traits under various heterogeneous models of evolution, and evaluating the statistical performance (type I error -the % of tests based on samples that incorrectly rejected a true null hypothesis- and power -the % of tests that correctly rejected a false null hypothesis) of classical phylogenetic regression. We found that PGLS has good power but unacceptable type I error rates. This finding is important since this method has been increasingly used in comparative analyses over the last decade. To address this issue, we propose a simple solution based on transforming the underlying variance-covariance matrix to adjust for model heterogeneity within PGLS. We suggest that heterogeneous rates of evolution might be particularly prevalent in large phylogenetic trees, whilst most current approaches assume a homogenous rate of evolution. Our analysis demonstrates that overlooking rate heterogeneity can result in inflated type I errors, thus misleading comparative analyses. We show that it is possible to correct for this bias even when the underlying model of evolution is not known a priori.

Keywords: Comparative methods, PGLS, non-stationarity, statistical performance

Introduction

Comparative methods are among the key tools for understanding ecological and evolutionary processes (Felsenstein 1985; Harvey & Pagel 1991) and are used to test hypotheses about the correlated evolution of traits (e.g. Pearman et al. 2014). Since species share common ancestry, they should not be considered statistically independent units, thus traditional statistical methods such as Ordinary Least Square (OLS) regression are not appropriate for analyzing comparative data. When analyzed by OLS, the two major issues that arise from shared evolution are increased type I error when traits are uncorrelated with each other and reduced precision in parameter estimation when traits are correlated with each other (Revel 2010). Therefore, interspecific trait data should be analysed within a phylogenetic framework (Harvey and Pagel 1991, Freckleton et al. 2002, Revell 2010).

Multiple methods have been proposed to account for phylogenetic non-independence of species when regressing two (or more) continuous or categorical traits (Felsenstein 1985, Grafen 1989, Maddison 1990, Lynch 1991, Garland et al. 1992, Martins and Hansen 1997, Diniz-Filho et al. 1998, Freckleton et al. 2002, Paradis and Claude 2002, Ives and Garland 2010). The phylogenetic regression based on a generalised least square, where the inverse of the phylogenetic covariance matrix is used as weights, is perhaps now the most widely adopted (Grafen 1989; Martins & Hansen 1997): it is a generalisation of phylogenetic independent contrasts (Rohlf 2001) originally proposed by Felsenstein (1985) and a particular case of general linear models (Rencher & Schaalje, 2007). Phylogenetic regression assumes that the model residual error ε is distributed according to σε2C where and σε2 represents the residual variance and C is a n × n matrix (n is the number of tips, i.e., species in most cases) describing the evolutionary relationships among species (i.e., a phylogenetic covariance matrix with diagonal elements estimated as the total branch length between each tip and the root, and off-diagonal elements estimated as the evolutionary time shared by each species pair).

In its simplest form, PGLS assumes a Brownian Motion (BM, Edwards & Cavalli-Sforza 1964) model of evolution with a single rate, σ2. Nevertheless, PGLS is highly flexible and it can be extended to alternative evolutionary models (Martins and Hansen 1997). For example, recent PGLS implementations have incorporated tree transformation models that capture different modes of evolution (e.g. early vs. late trait diversification, continuous vs. punctual evolution, Pagel 1997, 1999; Freckleton et al. 2002; Revell 2010) or selective regimes (e.g. Ornstein–Uhlenbeck [OU] models, Hansen 1997; Butler & Kings 2004; Lavin et al. 2008). This model flexibility has been key in reducing type I errors due to model misspecification, one of major issues in comparative biology (Freckleton 2009).

Despite recent advances, however, current PGLS implementations still assume that the tempo and mode of evolution remain constant across the phylogenetic tree (although they allow for rate variation over time), whereas it is likely that both are highly heterogeneous (Simpson 1944, Gould 2002), particularly in the case of large phylogenetic trees (O’Meara 2012). The construction of very large (Jetz et al. 2012, Zanne et al. 2014) hand in hand with the increased availability of corresponding large trait datasets (e.g. Wilman et al. 2014) has generated an increasing need to consider more complex models of evolution within the regression framework. Heterogeneous trait evolution, where trait evolution has been markedly different across multiple clades is a potential source of bias that has been largely overlooked. If two traits evolved under complex models of evolution, standard PGLS (i.e. assuming a single rate of evolution) may not be appropriate. Since it has been demonstrated that an incorrectly defined VCV matrix in PGLS increases the type I error rate (Revell 2010) for simple homogeneous models, it naturally follows that for large comparative phylogenetic datasets, where evolutionary processes are likely heterogeneous, there will also be an increase in type I error rates and/or reduced statistical power.

One potential solution arises from the development of heterogeneous models of evolution, which allow the fit of highly complex VCV matrices. Variation in evolutionary rates across the phylogenetic tree can be modeled, for instance, with a heterogeneous BM process where σ2 varies across the phylogenetic tree (O’Meara et al. 2006, Thomas et al. 2006). Similarly, heterogeneous OU models with multiple optima, strength of selection and evolutionary rates have be proposed to model adaptive peaks (Beaulieu 2013; Ingram & Mahler, 2013). These models have been successfully applied in the literature to investigate how a single trait has evolved across the phylogeny (e.g. to study the evolution of a clade life form, Adams et al. 2009; Boucher et al. 2012) but they are not yet incorporated into the toolbox of comparative analyses of trait correlation. Transforming the phylogenetic tree according to a heterogeneous model of trait evolution fitted with the data and using this transformed tree to derive a new VCV matrix could help in increasing the flexibility of the PGLS framework, but this potential has remained unexplored.

Here using simulations, we first explore the performance (type I error rate and statistical power) of PGLS under models of heterogeneous trait evolution. We simulate multiple models of trait evolution and cross correlations among traits, and contrast power and type I error rates. We show that complex models of evolution lead to inflated type I error rates, but PGLS is able to handle such complexities when the correct VCV matrices is known. We then propose an implementation of PGLS that has valid type I error rates even under models with large rate heterogeneity where the underlying model of evolution is not known a priori

Methods & Results

Simulating traits under complex evolutionary models

Simulation model

Our simulations considered two traits (X and Y) generated by the following basic equation, setting a to zero (following Revel 2010):

Y=a+βX+ε (Eqn. 1)

We defined evolutionary models by manipulating the phylogenetic covariance structure and rates of evolution (see below) and simulate X and the residuals error ε assuming normally distributed values N(0, σ2); where σ2 represents the instantaneous rate of evolution that was set depending on the specific scenarios of homogenous (one single evolutionary rate) and heterogeneous (multiple changes in evolutionary rates) models of trait evolution (see below). Type I error for a given method was assessed by simulating data with β=0 while statistical power was assessed by simulating data with β=1 (see Supplementary Material 1). In both cases, we tested the null hypothesis H0: β=0 and reported the percentage of simulations in which H0 was rejected with an alpha level of 0.05 (i.e., type I error rate as number of rejections when β=0 and statistical power as the number of rejections when β=1). For β=0, Y=ε and the evolution of X and Y are simulated independently. For β=1, the evolutionary model for Y was a function of the two evolutionary processes generating X and ε. We consider three different scenarios representing different evolutionary models for X and ε (see Supp. Mat. 1). We generated X and ε under (1) the same evolutionary model, (2) different models or (3) assuming that only ε followed an evolutionary model while X was drawn from a normal distribution (with mean of 0 and a standard deviation of one).

Phylogenetic trees

To make sure our results were not just representative of a particular phylogenetic topology, we ran all analyses on two very different phylogenetic topologies of 128 species each (rescaled so that their total depth equaled one). One tree was completely balanced, whereas the second one was obtained using a pure birth process leading to a more realistic unbalanced tree (see Supp. Mat. 2).

Homogeneous models of trait evolution

We considered three classic models (Brownian motion (BM, Edwards & Cavalli-Sforza 1964), Ornstein–Uhlenbeck (OU, Hansen 1997) and the lambda (λ) tree transformation (Pagel 1999). For BM, the change in species traits over time was expressed as:

dX(t)=σdB(t) (Eq. 2)

where dX(t) is the change in trait X over time period dt. The parameter σ measures the rate of evolution, while the term B(t) is random noise ~ N(0, dt).

For an OU process, the change in species traits over time was expressed as:

dX(t)=α[θX(t)]dt+σdB(t) (Eq. 3)

where θ represents the mean trait value (often interpreted as the trait optimum) and alpha measures the rate of decay of trait similarity through time (often interpreted as the intensity of stabilizing selection). When α = 0, the OU model simplifies to a BM model (see Eq. 2).

The λ tree transformation model simply rescales the phylogenetic tree before applying the classic BM model. In our case, λ is the multiplier of internal branches and we considered values between 1 (no transformation: the trait evolved under a classical BM) and 0 (the tree is a star phylogeny and the trait has no ‘phylogenetic signal’, i.e. related species do not tend to share similar trait value). Continuous characters (representing X and ε) under each model had a starting value of 0 at the root of the tree and were evolved tipward according to each model.

Heterogeneous models of trait evolution

For heterogeneous BM models, we simulated traits with two different rates of evolution occurring in different parts of the phylogenetic tree. In the simplest case, one sub-clade evolved with a rate σ2=1 while the other sub-clade evolved with one of the following σ2: 1/1000, 1/100, 1/10, 1/4, 1/2 or 3/4. In this case we simulated a single rate shift near the root of the phylogenetic trees so that the two major sub-clades of the phylogeny evolved under different σ2 (see Supp. Mat. 2). We also generated traits evolving under multiple rate shifts (3, 5 and 9) within each of the two major clades, but we kept the total number of rate values restricted to two (see Supp. Mat. 2).

For heterogeneous OU models, we simulated optima shifts and kept σ2 and α constant at 1 and 0.5, respectively. The initial OU regime started with an optimal value of θ (either 1, 2, 3 or 4) and then shifts to θ at the same node as described for BM rate shifts above (see Supp. Mat. 2).

For heterogeneous λ-transformed models, we simulated a single shift occurring near the root of the tree (i.e. separating two major clades A and B, see Supp. Mat. 3) by multiplying the internal branch length of clade A by λ, keeping the internal branches lengths of clade B unchanged. The resulting sub-clades therefore differed in their root-to-tip distances and tip-to-internal branch length ratios. If we had used a single σ2 for this transformed tree, the trait evolution of the two sub-clades would thus have different rates and phylogenetic signal. Because we were interested here in the differences in phylogenetic signal only, we rescaled the transformed branches of clade A so that all species in the complete tree had the same root-to-tip distance (same overall evolutionary rate) but differed in their ratio of tip-to-internal branch length (see Supp. Mat. 3). For λ, we restricted our analysis to the balanced tree (see above) because it is not straightforward to retain tree ultrametricity for more complex tree topologies, potentially confounding comparisons between our alternative models. As for the homogenously evolved traits, traits had a starting value of 0 at the root of the tree and were evolved tipward according to each model and evolutionary rates σ2 that were clade dependent.

Result 1: assessing power and type 1 error rates of the phylogenetic regression

We fitted two classical linear regression models to each of the different simulated datasets (scenarios). We first fitted an OLS regression, for which we tested the significance of the slope with a t-test using n-2=126 d.f. Note that a t-test was used given its common usage in the comparative analysis literature, though a likelihood test contrasting the slope and the intercept-only models could had been equally applied. Implementation of the latter could be the object of future studies. Second we fitted a PGLS (using the pgls function in the caper R package) that also simultaneously optimized a single λ value for the residuals of the model (λ=”ML” in the pgls function, ‘PGLSglobal_λ’ hereafter). The λ parameter (Pagel 1999) is a multiplier for the off-diagonal elements (i.e. the internal phylogenetic branches) of the VCV matrix and usually varies between 0 and 1. If λ=1, the VCV is left unchanged and the PGLS assumes a BM while if λ=0, the VCV is a diagonal matrix and the PGLS reduces to an OLS. Any value between 0 and 1 indicates that the phylogenetic strength in the residuals is reduced in contrast to a BM. As such, the optimization of λ in PGLS allows encompassing both classical OLS and PGLS. As for OLS, we then tested the significance of the slope with a t-test with n-2-1=125 d.f. (λ is an additional parameter).

Power analysis

First, we simulated X and ε independently from each other to generate Y (Y = βX + ε with β=1).

Under this scenario, both OLS and PGLSglobal_λ had good statistical power (i.e. they both detected the simulated correlation) for all models of evolution tested (Table 1 and Supplementary material 4).

Table 1.

summarizes the statistical performance (type I error -the % of tests based on samples that incorrectly rejected a true null hypothesis- and power -the % of tests that correctly rejected a false null hypothesis) of classical PGLS (optimized for a single λ) under different simulation scenarios. Complex evolutionary models were used to simulate either X and the residuals of Y (column 1), only X (column 2) or only the residuals (column 3). ‘KO’ indicates reduced power or inflated type I error, ‘OK’ indicates good power and correct type I errors.

Complex Evolutionary model in: Residuals & X Only X (residuals are normally distributed) Only residuals (X is normally distributed)
Statistical performance:
Type 1 error KO OK OK
Power OK OK OK

Type 1 error analysis

Second, we simulated Y and X independently from each other (Y = βX + ε with β=0, so ε=Y), and evaluated the percentage of simulations where a correlation was (incorrectly) detected (i.e. false positives). The type I error of a valid test should equal the alpha value selected when assessing test significance (5% here). Results differed across the three simulated scenarios (Table 1 and see Supp. Mat. 5): (1) When X was simulated without phylogenetic signal and Y followed different models of trait evolution (i.e. heterogeneous BM and OU models), both OLS and PGLS had correct type I error rates (Supp. mat. 6.1); (2) When X and Y followed different models of trait evolution (e.g. X followed a heterogeneous BM model and Y followed a heterogeneous OU model), OLS showed inflated type I error while PGLSglobal_λ still performed well (i.e., it had correct type I error rates, Supp. Mat. 6.2); (3) When X and Y followed the same heterogeneous model of trait evolution results were more mixed. When we simulated BM with heterogeneous Pagel λ, all methods had an inflated type I error, which covaried with the heterogeneity in the strength of the phylogenetic signal (see Figure 1). When we simulated BM models with heterogeneous rates of evolution (different σ2 across the tree), type I error rates were also inflated, and varied with the strength of the evolutionary rate variation, but were only weakly influenced by the number of rate shifts (Fig. 2). We did not evaluate type I error rates under a heterogeneous OU model of evolution for reasons described in the Discussion section PGLS and hidden selective trends. Unbalanced and balanced tree gave qualitatively similar results (see Fig. 2 and Supp. mat. 6.3, respectively), for brevity we report only values for the unbalanced tree in the main text.

Figure 1. Effect of heterogeneous phylogenetic signal on type I error.

Figure 1

Effect of variation in phylogenetic signal heterogeneity on Type I error rates for the different comparative methods: classical OLS, PGLS that jointly optimises a single λ value for the residuals together with parameter estimates (‘PGLSglobal_λ‘) and a PGLS that uses the true VCV matrix (‘PGLSTrueVCV’). The X-axis represents λ of clade B (λ of clade A is set to one). Plotted below the X-axis are the corresponding transformed trees for a homogeneous signal (λ[Clade A] = λ[Clade B] =1) and a heterogeneous signal (λ[Clade A] = 1; λ[Clade B] =0.01). The type I error represents the percentage of simulation that detected a significant correlation at the 5% level (1000 simulations) between the two traits (black horizontal line).

Figure 2. Effect of heterogeneous rate of trait evolution on type I error.

Figure 2

Effect of variation in evolutionary rate heterogeneity (i.e. the ratio of rate evolution between clades) on Type I error rates for different comparative methods (see legend of Figure 1). Different models of rate heterogeneity are presented (i.e. from 1 to 9 rate shifts). For the simplest case (one single rate shift), we plotted below the X-axis the corresponding transformed trees for a homogeneous rate (σ2 [Clade 1] = σ2 [Clade 2] =1) and a heterogeneous signal (σ2 [Clade 1] = 1; σ2 [Clade 2] =0.01). The type I error represents the percentage of simulation that detected a significant correlation at the 5% level (1000 simulations) between the two traits (black horizontal line).

Providing the correct VCV to PGLS led to correct type I error rates in all cases (Fig. 2 and Supp. Mat. 6.3). However, this is obviously not a viable solution for most empirical studies since the true VCV is not known a priori, but it nonetheless shows that PGLS is able to deal with complex models of evolution when they are correctly estimated.

Result 2: a solution to correct for inflated type 1 error

Our simulations showed that PGLS had inflated type I error rates under heterogeneous BM but that it could theoretically handle such models when a correct VCV was provided.

To correct for inflated type I errors when the correct VCV is not known, we studied the statistical performance of the following three step procedure: 1) fit heterogeneous BM models of trait evolution to the raw OLS residuals; 2) use this fit to modify the VCV matrix used in the standard PGLS; and 3) apply a significance criterion (see Modified method for significance testing below) that allows for proper inference (i.e., correct type I error) associated to the two initial fits that themselves involve one statistical test each. For clarity, note that the correction in step 3 is completely independent from the issues of inflated type I errors in comparative analysis involving multiple rates of evolution (the focus of this paper). However, because our proposed framework is based on a two-step procedure, one would incur additional type I errors if a multiple testing criterion was not applied.

We tested two approaches to detect rate shifts on the OLS residuals: the auteur bayesian approach (Eastman et al. 2011, implemented in the R package geiger) and the trait medusa approach (Thomas and Freckleton 2012, implemented in the R package motmot). As auteur and trait medusa yielded similar results but auteur being much faster, we present in the main text the detailed procedure and results for auteur (PGLSauteur hereafter). The details and results of the procedure involving trait medusa are provided in Supp. Mat 7. We provide R code and example data of both procedures as a supplement (Supp. Mat. 8 & 9).

Detecting evolutionary rate shifts with auteur

auteur (Eastman et al. 2011) is a recently developed Bayesian approach to model evolutionary rate heterogeneity along a phylogeny. It uses a reversible jump Markov chain Monte Carlo procedure to sample a distribution of multi-rate BM models in inverse proportion to their poorness of fit. In the optimization procedure, the Markov chain jumps from models of differing complexities (i.e. number and position of the shifts in the phylogenetic tree).

We sampled parameters every 500 generations for a total of 20,000 generations. We removed 25% of the sampled parameters as burn-in to obtain marginalized distributions of relative rates for each branch of the tree.

Using fitted multi-rate models in standard PGLS

We then used the parameter estimates from auteur to rescale the phylogenetic tree using the function rescale in the geiger package. The rescaled tree was then used in a standard PGLS procedure (PGLSauteur, function pgls in the R package caper) as described above. We then tested the significance of the slope with a t-test with n-2 d.f. (PGLSauteur, see Fig. 3). As described above, this stepwise procedure may be expected to have a slight inflated type I error rate because it represents a two-step procedure (Fig. 3), and each step has independent errors (the same reasoning applies to the PGLStrait medusa procedure, see Supp. Mat. 7). Step 1 estimates the evolutionary model of the OLS residuals, and step 2 fits PGLS using the estimated VCV matrix in step 1. As before, because two statistical tests were involved here, we applied a modified significance testing criterion.

Figure 3. Type I error rates for modified PGLS.

Figure 3

Comparison of type I error rate for classical (OLS, PGLSglobal_λ, and PGLSTrueVCV) and modified (PGLSauteur and PGLScombination) comparative methods as a function of evolutionary rate heterogeneity between clades. We show here the result for the simplest model of rate heterogeneity (i.e. one single rate shift) and plot below the X-axis the corresponding transformed trees for a homogeneous rate (σ2 [Clade 1] = σ2 [Clade 2] =1) and a heterogeneous rate (σ2 [Clade 1] = 1; σ2 [Clade 2] =0.01). The type I error represents the percentage of simulation that detected a significant correlation at the 5% level (1000 simulations) between the two traits (black horizontal line).

Modified method for significance testing

Statistical testing when using a two-step procedure is likely to have inflated incorrect family wise type I error. This has been previously recognized by ter Braak et al. (2012) when testing for the links between trait to environmental variation. Their procedure was based on a two-step procedure in which the links between species trait variation and species distributions, and the links between environmental variation and species distributions were both tested. They adopted a rejection criterion for establishing the significance of the links between environment and trait variation in which the largest probability of the two tests involved needed to be below the pre-established alpha-level (i.e., 0.05). Retaining the largest p-value is equivalent to conducting only one statistical test instead of two, assuring correct type I errors. Following ter Braak et al. (2012), we thus considered the relationship between Y and X (PGLScombination) and retained the largest p-value between the PGLSglobal_λ and PGLSauteur. This approach produced correct type I error rate in all cases (Fig. 3).

Discussion

With the increasingly availability of well resolved phylogenetic trees, PGLS has become routinely employed in the analysis of interspecific data over the past decades (Felsenstein 1985, Grafen 1989, Martins and Hansen 1997, Freckleton et al. 2002). By assuming an explicit model of evolution, PGLS contrasts with some other approaches, for example, non-parametric eigenvector decomposition (Diniz-Filho et al. 1998, Freckleton et al. 2011). While it has been argued that the inclusion of an explicit evolutionary model within PGLS allows for increased efficiency of estimation, decreased variance in parameter estimates, and decreased Type I errors (Freckleton et al. 2011), misspecifying the evolutionary model may have important consequences for hypotheses testing (Revell 2010) (though not for parameter estimation, see e.g. Rohlf 2006). One solution is to simultaneously estimate the parameters of the PGLS model and the model of evolution of the residuals. For some simple models of trait evolution, in which both the tempo and mode of evolution remain constant across the phylogenetic tree, it is possible to adjust the PGLS model residuals using Pagel’s (1999) lambda tree transformations (Freckleton 2002). However, such simple models of evolution are likely rare, particularly for large trees (O’Meara 2012), which have become widely used in the comparative literature with the increasingly availability of large scale mega-phylogenies including several thousand species. Here we evaluated the performance (power and type I errors) of PGLS methods, under more complex evolutionary scenarios. We show that most developed methods around PGLS have good power, but unacceptably high type I errors under some scenarios with heterogeneous evolutionary rates. Nonetheless, PGLS methods perform well when the VCV matrix is estimated properly.

X and Y follow different models of evolution

When there is no phylogenetic signal in the independent variable (i.e. X was normally distributed and independent from phylogeny) but the residuals (of Y) follow a heterogeneous model of evolution (either BM or OU), all methods (OLS and PGLS) showed correct type I error rate. Similar results have been reported for homogeneous BM models (see Revell 2010). However, when we simulated a heterogeneous BM in the independent variable (X) and the dependent variable (Y) followed a heterogeneous OU, OLS shows inflated type I error rates, but PGLS still performs well.

X and Y follow the same model of evolution

When we simulated X and Y with the same heterogeneous model of evolution all classical methods (OLS and PGLS) showed inflated type I error rates. Nevertheless PGLS is theoretically able to handle such bias (Martins and Hansen 1997), and we demonstrated empirically that providing the correct VCV leads to valid test of correlated evolution with appropriate type I error rates.

Because knowing the correct VCV transformation a priori is difficult for traits evolving under complex evolutionary models, we implemented a simple approach that allowed us to estimate the model of evolution on the residuals of the OLS, and used this to transform the VCV matrix for PGLS. Here we used two methods (a Bayesian approach and a bootstrap approach in conjunction with an algorithm for estimating multiple rate shifts) to identify the appropriate model for transforming the VCV matrix, though we acknowledge that alternative methods are available (see Revell et al. 2012 for an example). We show that assuming the transformed VCV matrix, PGLS had appropriate type I errors after correcting for the two-step model selection procedure (ter Braak et al. 2012). However, we caution that one of the methods used is sensitive to overfitting (a known problem with the medusa algorithm assuming the AICc stopping criterion) and that a wrongly defined VCV matrix can lead to type I error rates as high as for raw OLS models. It is crucial, therefore, to use a methodology that correctly assigns rate shifts, or to test overfitting by parametric bootstraping (Boettiger et al. 2012).

It is possible that more efficient strategies will become soon available for fitting PGLS under heterogeneous models of evolution, nonetheless we have shown that our approach works well, and is reasonably transparent. Due to computational constrains, it was not feasible to explore the full range of possible evolutionary models, tree shapes, and trees sizes (the combinations of which are effectively infinite), but we see no reason why our approach should fail under different conditions except when clades sizes for which rate changes have occurred are too small to accurately infer the correct evolutionary model.

Last, we note that we do not provide a solution for the heterogeneous phylogenetic signal scenario; however, future advances that allow the fit of heterogeneous phylogenetic signal to the tree could be easily implemented within our framework.

PGLS and hidden selective trends

We did not evaluate type I error rates under a heterogeneous OU model of evolution. Under this scenario, even for B=0 (i.e. no expected correlation between X and Y), the two traits might be significantly correlated if both have followed the same selective trends (i.e. optima and selection strength shifts). In this case, we are no longer considering type I error rates, but rather Power of the method and it is not straightforward to derive expectations if the two traits follow slightly different models. For example, one could imagine that the two traits show the same general trends but with slightly different optima, strength of selection or location of shifts. One interesting idea would be to simulate traits with different OU parameters and positions, and plot the percentage of detected correlations against the amount of difference between the two models using a version of PGLS that takes into account the drift part of the OU model. It might also be informative to explore models where traits co-evolve, or the evolution of one is driven by the evolution of the other, causing consistent time-lags between their evolutionary shifts. It would then be possible to compare the likelihood of such a model with another in which traits evolve independently. This would avoid the direct correlation of extant species traits. Finally, in the present study we have assumed a parametric method for estimating changes in evolutionary rates in the sense that the procedures here estimate variance-covariance phylogenetic matrices based on families of known models of evolution. Another potential solution, not explored here, would be to use an iterative weighted least-square implementation (e.g., Björck 1996) in order to estimate an appropriate variance-covariance structure that would make model residuals independent.

Taken together, we have shown that currently implemented phylogenetic comparative methods have unacceptable type I error rate when species’ traits evolve under heterogeneous models of evolution. We proposed a flexible solution based on PGLS and showed that it had correct type I error. Our framework is potentially extendible to most complex evolutionary scenarios.

Supplementary Material

Supplementary Materials

Acknowledgements

FM would like to thank Rafael Wüest for stimulating discussions on the PGLS approach. The research leading to these results had received funding from the European Research Council under the European Community’s Seven Framework Programme FP7/2007-2013 Grant Agreement no. 281422 (TEEMBIO). FM, DG, SL and WT belong to the Laboratoire d’Écologie Alpine, which is part of Labex OSUG@2020 (ANR10 LABX56). All computations performed using the CiGRI platform of the CIMENT infrastructure (https://ciment.ujf-grenoble.fr), which is supported by the Rhône-Alpes region (GRANT CPER07_13 CIRA) and France-Grille (http://www.france-grilles.fr). TJD and PRPN were supported by a team grant from the Fonds de recherche du Québec – Nature et technologies.

Contributor Information

Florent Mazel, Univ. Grenoble Alpes & CNRS, Laboratoire d’Écologie Alpine (LECA), F-38000 Grenoble, France.

T. Jonathan Davies, Department of Biology, McGill University, 1205, Avenue Docteur Penfield, Montreal, Quebec, Canada. African Centre for DNA Barcoding, University of Johannesburg, APK Campus, PO Box 524, Auckland Park 2006, Johannesburg, South Africa.

Damien Georges, Univ. Grenoble Alpes, Laboratoire d’Écologie Alpine (LECA), F-38000 Grenoble, France ; CNRS, Grenoble, France.

Sébastien Lavergne, Univ. Grenoble Alpes & CNRS, Laboratoire d’Écologie Alpine (LECA), F-38000 Grenoble, France.

Wilfried Thuiller, Univ. Grenoble Alpes & CNRS, Laboratoire d’Écologie Alpine (LECA), F-38000 Grenoble, France.

Pedro R. Peres-Neto, Canada Research Chair in Spatial Modelling and Biodiversity, Département des sciences biologiques, Université du Québec à Montréal, Montréal, QC H3C 3P8, Canada

References

  1. Adams DC, Berns CM, Kozak KH, Wiens JJ. Are rates of species diversification correlated with rates of morphological evolution? Proceedings of the Royal Society of London, Series B. 2009;276:2729–2738. doi: 10.1098/rspb.2009.0543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Björck Å. Numerical Methods for Least Squares Problems. SIAM; Philadeplphia, PA: 1996. [Google Scholar]
  3. Boettiger C, Coop G, Ralph P. Is your phylogeny informative? Measuring the power of comparative methods. Evolution. 2012;66:2240–2251. doi: 10.1111/j.1558-5646.2011.01574.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Boucher FC, Thuiller W, Roquet C, Douzet R, Aubert S, Alvarez N, Lavergne S. Reconstructing the origins of high-alpine niches and cushion life form in the genus Androsace sl (Primulaceae) Evolution. 2012;66:1255–1268. doi: 10.1111/j.1558-5646.2011.01483.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ter Braak C, Cormont A, Dray S. Improved testing of species traits-environment relationships in the fourth-corner problem. Ecology. 2012;93:1525–1526. doi: 10.1890/12-0126.1. [DOI] [PubMed] [Google Scholar]
  6. Butler MA, Kings AA. Phylogenetic comparative analysis: A modeling approach for adaptive evolution. The American naturalist. 2004;164:683–695. doi: 10.1086/426002. [DOI] [PubMed] [Google Scholar]
  7. Diniz-Filho JAF, de Sant’Ana CER, Bini LM. An Eigenvector Method for Estimating Phylogenetic Inertia. Evolution. 1998;52:1247–1262. doi: 10.1111/j.1558-5646.1998.tb02006.x. [DOI] [PubMed] [Google Scholar]
  8. Eastman JM, Alfaro ME, Joyce P, Hipp AL, Harmon LJ. A novel comparative method for identifying shifts in the rate of character evolution on trees. Evolution; international journal of organic evolution. 2011;65:3578–89. doi: 10.1111/j.1558-5646.2011.01401.x. [DOI] [PubMed] [Google Scholar]
  9. Edwards AWF, Cavalli-Sforza LL. Reconstruction of evolutionary trees. In: Heywood VH, McNeill J, editors. Phenetic and phylogenetic classification. London: 1964. pp. 67–76. [Google Scholar]
  10. Felsenstein J. Phylogenies and the comparative method. The American Naturalist. 1985:1–15. doi: 10.1086/703055. [DOI] [PubMed] [Google Scholar]
  11. Freckleton R. The seven deadly sins of comparative analysis. Journal of evolutionary biology. 2009;22:1367–1375. doi: 10.1111/j.1420-9101.2009.01757.x. [DOI] [PubMed] [Google Scholar]
  12. Freckleton RP, Cooper N, Jetz W. Comparative methods as a statistical fix: the dangers of ignoring an evolutionary model. The American naturalist. 2011;178:E10–E17. doi: 10.1086/660272. [DOI] [PubMed] [Google Scholar]
  13. Freckleton RP, Harvey PH, Pagel M. Phylogenetic analysis and comparative data: a test and review of evidence. The American Naturalist. 2002;160:712–726. doi: 10.1086/343873. [DOI] [PubMed] [Google Scholar]
  14. Garland T, Harvey P, Ives A. Procedures for the analysis of comparative data using phylogenetically independent contrasts. Systematic Biology. 1992;41:18–32. [Google Scholar]
  15. Gould SJ. The Structure of Evolutionary Theory. Harvard University Press; 2002. [Google Scholar]
  16. Grafen A. The phylogenetic regression. Philosophical Transactions of the Royal Society of London B. 1989:119–157. doi: 10.1098/rstb.1989.0106. [DOI] [PubMed] [Google Scholar]
  17. Hansen TF. Stabilizing Selection and the Comparative Analysis of Adaptation. Evolution. 1997;51:1341. doi: 10.1111/j.1558-5646.1997.tb01457.x. [DOI] [PubMed] [Google Scholar]
  18. Harvey P, Pagel M. the Comparative Method in Evolutionary Biology. Oxford University Press; Oxford, UK: 1991. [Google Scholar]
  19. Ingram T, Mahler D. SURFACE: detecting convergent evolution from comparative data by fitting Ornstein-Uhlenbeck models with stepwise Akaike Information Criterion. Methods in Ecology and Evolution. 2013;4:416–425. [Google Scholar]
  20. Ives A, Garland T. Phylogenetic logistic regression for binary dependent variables. Systematic Biology. 2010;59:9–26. doi: 10.1093/sysbio/syp074. [DOI] [PubMed] [Google Scholar]
  21. Jetz W, Thomas G, Joy J, Mooers AØ. The global diversity of birds in space and time. Nature. 2012;491:444–448. doi: 10.1038/nature11631. [DOI] [PubMed] [Google Scholar]
  22. Lavin SR, Karasov WH, Ives AR, Middleton KM, Garland T. Morphometrics of the avian small intestine compared with that of nonflying mammals: a phylogenetic approach. Physiological and biochemical zoology : PBZ. 2008;81:526–50. doi: 10.1086/590395. [DOI] [PubMed] [Google Scholar]
  23. Lynch M. Methods for the analysis of comparative data in evolutionary biology. Evolution. 1991;45:1065–1080. doi: 10.1111/j.1558-5646.1991.tb04375.x. [DOI] [PubMed] [Google Scholar]
  24. Maddison WP. A Method for Testing the Correlated Evolution of Two Binary Characters: Are Gains or Losses Concentrated on Certain Branches of a Phylogenetic Tree? Evolution. 1990;44 doi: 10.1111/j.1558-5646.1990.tb05937.x. 223–539–557. [DOI] [PubMed] [Google Scholar]
  25. Martins EP, Hansen TF. Phylogenies and the comparative method: a general approach to incorporating phylogenetic information into the analysis of interspecific data. The American naturalist. 1997;149:646–667. [Google Scholar]
  26. O’Meara B. Evolutionary inferences from phylogenies: a review of methods. Annual Review of Ecology, Evolution, and Systematics. 2012;43:267–285. [Google Scholar]
  27. O’Meara BC, Ané C, Sanderson MJ, Wainwright PC. Testing for different rates of continuous trait evolution using likelihood. Evolution. 2006;60:922–933. [PubMed] [Google Scholar]
  28. Pagel M. Inferring evolutionary processes from phylogenies. Zoologica Scripta. 1997;26:331–348. [Google Scholar]
  29. Pagel M. Inferring the historical patterns of biological evolution. Nature. 1999;401:877–884. doi: 10.1038/44766. [DOI] [PubMed] [Google Scholar]
  30. Paradis E, Claude J. Analysis of Comparative Data Using Generalized Estimating Equations. Journal of Theoretical Biology. 2002;218:175–185. doi: 10.1006/jtbi.2002.3066. [DOI] [PubMed] [Google Scholar]
  31. Pearman P, Lavergne S, Roquet C, Wuest R, Zimmermann NE, Thuiller W. Phylogenetic patterns of climatic, habitat and trophic niches in a European avian assemblage. Global Ecology and Biogeography. 2014;23:414–424. doi: 10.1111/geb.12127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Rencher AC, Schaalje GB. Linear Models in Statistics. John Wiley & Sons, Inc; Hoboken, NJ, USA: 2007. [Google Scholar]
  33. Revell L. Phylogenetic signal and linear regression on species data. Methods in Ecology and Evolution. 2010;1:319–329. [Google Scholar]
  34. Revell L, Mahler D, Peres-Neto P, Redelings B. A new phylogenetic method for identifying exceptional phenotypic diversification. Evolution. 2012;66:135–146. doi: 10.1111/j.1558-5646.2011.01435.x. [DOI] [PubMed] [Google Scholar]
  35. Rohlf F. A comment on phylogenetic correction. Evolution. 2006;60:1509–1515. doi: 10.1554/05-550.1. [DOI] [PubMed] [Google Scholar]
  36. Rohlf FJ. Comparative methods for the analysis of continuous variables: geometric interpretations. Evolution. 2001;55:2143–2160. doi: 10.1111/j.0014-3820.2001.tb00731.x. [DOI] [PubMed] [Google Scholar]
  37. Simpson GG. Tempo and Mode in Evolution. Columbia University Press; 1944. [Google Scholar]
  38. Thomas GH, Freckleton RP. MOTMOT: models of trait macroevolution on trees. Methods in Ecology and Evolution. 2012;3:145–151. [Google Scholar]
  39. Thomas GH, Freckleton RP, Székely T. Comparative analyses of the influence of developmental mode on phenotypic diversification rates in shorebirds. Proceedings Biological sciences / The Royal Society. 2006;273:1619–24. doi: 10.1098/rspb.2006.3488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Wilman H, Belmaker J, Simpson J, de la Rosa C, Rivadeneira MM, Jetz W. EltonTraits 1.0: Species-level foraging attributes of the world’s birds and mammals. Ecology. 2014;95:2027–2027. [Google Scholar]
  41. Zanne AE, Tank DC, Cornwell WK, Eastman JM, Smith SA, FitzJohn RG, McGlinn DJ, O’Meara BC, Moles AT, Reich PB, Royer DL, et al. Three keys to the radiation of angiosperms into freezing environments. Nature. 2014;506:89–92. doi: 10.1038/nature12872. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Materials

RESOURCES