Skip to main content
PLOS Genetics logoLink to PLOS Genetics
. 2025 Sep 23;21(9):e1011871. doi: 10.1371/journal.pgen.1011871

A method for identifying local adaptation in structured populations

Isabela do O 1,*, Oscar E Gaggiotti 2, Pierre de Villemereuil 3,4,#, Jerome Goudet 1,5,*,#
Editor: Michael Edge6
PMCID: PMC12479014  PMID: 40986628

Abstract

Species occupy diverse, heterogeneous environments, which expose populations to spatially varied selective pressures. Populations in different environments can diverge due to local adaptation. However, neutral evolution can also drive population divergence. Thus, testing for local adaptation requires a neutral baseline for population differentiation. The classical QST-FST comparison was developed for this purpose. Yet, QST-FST frequently fails to account for the complexities of population structure because the theory underlying this comparison assumes that all subpopulations are equally related, resulting in inflated false positive rates in metapopulations that deviate from the island model. To address this limitation we use estimates of between- and within-population relatedness to model population structure. Using those relatedness matrices, we infer the between- and within-population ancestral additive genetic variances under a mixed-effects model. Under neutrality, these inferred variances are expected to be equal. We propose here a test to detect selection based on the comparison of these two estimates of the ancestral variance and we compare its performance with earlier solutions. We find our method is well calibrated across various population structures and has high power to detect adaptive divergence.

Author summary

Populations of the same species often face different environmental pressures, driving them to adapt locally. However, even in the absence of adaptation, subpopulations can diverge due to random genetic drift and limited migration. Distinguishing between adaptive evolution and random divergence is a central challenge in evolutionary biology. Traditional methods, such as QSTFST comparison, assume equal relatedness among subpopulations — a simplification that rarely holds in complex real-world scenarios, leading to flawed conclusions. To overcome this limitation, we developed a novel method that incorporates genetic relatedness among subpopulations, leveraging quantitative genetic theory to estimate ancestral additive genetic variances. Our approach provides a powerful tool for testing local adaptation, reliably distinguishing adaptive divergence from drift across a variety of population structures.

Introduction

Species are generally distributed in heterogeneous environments. Over time, subpopulations exposed to different environments diverge genetically and phenotypically due to genetic drift, natural selection, and limited gene flow. Environmentally divergent subpopulations are likely to be subjected to different selective pressures, promoting their adaptive divergence. At the same time, distant subpopulations are more isolated, leading to their neutral differentiation [1]. Thus, before concluding that the observed difference is adaptive, one must first reject the hypothesis of neutral divergence by establishing a theoretically justified neutral expectation [2].

FST, the fixation index, is a classic measure used to quantify genetic differentiation among populations. For neutral loci, the degree of differentiation between subpopulations measured by FST depends on the demographic parameters of the population, such as the effective population size and migration rates. When selection is at play, FST for the loci under selection will also depend on the strength and mode of selection. When selection acting on a locus is similar across environments (e.g. spatially-homogeneous global adaptation; cf. [3]) FST will be lower at this locus compared to neutral loci. When selection varies across subpopulations (e.g. local adaptation), then the selected locus will have a larger FST in comparison to neutral loci.

Many traits of adaptive significance, such as morphological [4], physiological [5] and fitness-related [6] phenotypes, are quantitative traits and result from the interaction of multiple genes [2,7,8]. In this context of polygenic adaptation, FST measured for neutral loci serves as the null expectation. For these traits, the standard procedure is to compare FST with its quantitative analogue, QST, which describes the proportion of additive genetic variance between subpopulations relative to the total additive genetic variance (Eq 1) [9]:

QST=VBVB+2VW, (1)

VB represents the between-population additive genetic variance, and VW is the within-population additive genetic variance. Since QST is based on additive genetic variances, its estimation rests on being able to use controlled breeding designs and environmental effects [10]. Under the assumption that the investigated trait is neutral, we expect QST to be equal, on average, to FST. If QST>FST, it suggests adaptive divergence, while QST<FST may imply spatially-homogeneous global adaptation (c.f. [3]; but note that [9,11] use ‘balancing selection’ when referring to this scenario).

However, QST estimates are subject to significant uncertainty and potential biases. First, ratio estimates, like QST, can be biased since the expected value of a ratio differs from the ratio of expectations [12]. Second, the number of subpopulations considered strongly affects the reliability of QST estimates, with fewer populations sampled increasing the variability and reducing the power of the statistic [13,14]. Because of the high variability of QSTFST comparisons [14] proposed that instead of relying on a direct comparison between the two statistics, one should find where QSTFST falls in the expected neutral distribution of QSTFST values. Building on this, [15] developed a simulation-based approach to test whether QST is consistent with neutral divergence by comparing the observed QSTFST to a simulated distribution of the neutral expectation. This expected neutral distribution is generated using parametric bootstrapping, drawing from a χ2 distribution following approximations by [16].

While the improvement of the QSTFST method proposed by [15] reduces the false positive rate (FPR), it still assumes an isotropic population structure. For example, when considering instead a stepping-stone model, the p-values do not follow the expected distribution under neutrality, resulting in a non-calibrated test [17,18]. As highlighted in [18], the main issue lies in how between-population variance (VB) is estimated in these models. Typically, VB is derived from a mixed-effects model where population-level random effects are treated as independent, assuming equal relatedness between all subpopulations. This isotropic assumption does not hold for most natural populations, which often have complex genealogical relationships and migration patterns, leading to the lack of calibration reported above.

To address this, [19] proposed a model based on between- and within-population coancestry to estimate additive genetic variance accounting for non-uniform migration and drift patterns. The coancestry between two populations is the average coancestry between all pairs of individuals one from each population, and the coancestry between pairs of individuals is the probability that randomly sampled alleles from these individuals are identical by descent (IBD). [19] method extends the animal model to the metapopulation level, using coancestry to reflect genealogical relationships between subpopulations. They assume that all studied populations trace back to a common ancestral population. By accounting for population structure, their method provides a more accurate estimation of neutral expectations. Nevertheless, [19] solution still relies on a metapopulation model with specific assumptions, the admixture F-model [20], which has been shown to suffer from issues in estimating coancestries [21]. Besides, it is still unknown whether the test Ovaskainen et al. [19] developed can be considered calibrated, given that it does not provide a null-hypothesis framework per se.

In this study, we introduce LogAV, a new method for testing the null hypothesis of neutral divergence by comparing the log-ratio of two estimates of the same ancestral additive genetic variance: one derived from a between-population effect and the other from a within-population effect. Under a neutral scenario of phenotypic evolution, these two estimates of the ancestral variance should be equal. This constitutes our null hypothesis in a two-tailed test, which would suggest local adaptation if the estimated ancestral variance is larger when derived from the between-population effects than when derived from the within-population effects. Conversely, it would suggest spatially-homogeneous global adaptation (cf. [3]) when the opposite is true.

We evaluate our method (LogAV) along with both [15]’s method and the Driftsel approach [22], which implements [19] ideas, by applying the methods on neutrally evolving simulated phenotypes. We also assessed LogAV’s results for traits evolving under divergent selection in different strengths of selection and population structures, considering a range of population structures, including highly non-isotropic configurations. We found LogAV well calibrated for all tested population structures and able to detect selection when present. Finally we show an application of LogAV to a real already published data set by [23] which had previously been analyzed using driftsel

Description of the method

LogAV is designed to test whether neutral processes alone can account for the observed divergence between related subpopulations of a metapopulation. We assume that all studied subpopulations originated from a single, panmictic, ancestral population, and through a process of divergence and restricted gene flow, their genetic diversity becomes spatially structured. As shown by [19], it is possible to write an inferential model to estimate the expected ancestral additive variance (V𝒜 - A definition of the relevant symbols used throuhout this manuscript can be found on Table 1) under neutrality. We define the ancestral additive genetic variance V𝒜 as the additive genetic variance in the quantitative trait that existed in the ancestral population. This model provides a baseline for understanding the contributions of neutral processes and selection to the current genetic makeup of each subpopulation. Our approach examines the historical processes that occurred as they diverged from their common ancestor, as well as the ongoing evolutionary process, including current migration patterns and inbreeding.

Table 1. Table describing relevant symbols, their dimensions, and definitions used throughout the method.

Symbol Dimensions Definition
V𝒜 Scalar Ancestral additive genetic variance.
VW Scalar Additive genetic variance within subpopulations.
VB Scalar Additive genetic variance between subpopulations.
V^𝒜,W Scalar Ancestral additive genetic variance estimated from within subpopulation effects.
V^𝒜,B Scalar Ancestral additive genetic variance estimated from between subpopulation effects.
ω Scalar Width of the fitness function peak inversely related to the strength of stabilizing selection
Θ nT×nT Individual-level metapopulation-wide coancestry matrix.
Θx nx×nx Block of coancestry values related to population x.
Θp r×r Population-level metapopulation-wide coancestry matrix.
Θ^p r×r Estimated Θp.
Θxp Scalar Element x of the diagonal of matrix Θp corresponding to coancestry within-population x.
M nT×nT Block-matrix of individual-level relatedness within subpopulations.
𝐌^ nT×nT Estimated M.
𝐌x nx×nx A block of matrix M describing relatedness of individuals from subpopulation x.
A i,j Scalar Allele sharing between individuals i and j.
A¯sP r×r Matrix of population-level mean allele-sharing of parental generation.
z Scalar Phenotypic value.
μ Scalar Grand meta-population average.
ap r×1 Average breeding values per subpopulation.
ai nT×1 Individual breeding value.
Log𝒜V Scalar Log-ratio of ancestral variance from between and within-population effect.
n T Scalar Total number of individuals.
r Scalar Number of subpopulations.
i,j Indices Indices for individuals.
x,y Indices Indices for subpopulations.

In quantitative genetics, an animal model is generally used to estimate additive genetic variance. This model partitions phenotypic variance into different genetic and environmental components. To quantify the covariance of genetic effects between individuals, this model depends on the use of an individual-level relatedness matrix. Extending this principle to structured populations requires quantifying relationships between subpopulations as well as between individuals within those subpopulations [19]. Demographic history and spatial metapopulation configuration influence the distribution of genetic variation, which affects additive genetic variance within and between subpopulations. Thus, any test of local adaptation requires that the neutral differentiation baseline used accounts for these effects.

LogAV estimates V𝒜 in two separate ways: using the between-population and the within-population coancestries. We then compare these two estimates of the additive ancestral variance. We used a method of moment estimator of coancestries [21] to measure (i) kinship within subpopulations and infer ancestral variance from within-population variance, and (ii) mean kinship between subpopulations and infer ancestral variance from between-populations variance. Since the mathematical framework is constructed to reference the same coalescence tree, it accurately represents the segregation of alleles associated with the same variance. Thus, in the absence of selection, both variances should be equal.

To illustrate the concepts above, let us first consider a scenario involving two subpopulations. In this simple case, we can use FST to characterize the population structure of our metapopulation to establish the expected relationship under neutrality between the current genetic variances and the ancestral genetic variance. Namely, the proportion of the total genetic diversity described by the diversity between subpopulations is 2FST, and the remaining proportion, 1 − FST, is the one within subpopulations. Thus, at the limit of low mutation rate, and under neutrality, we should expect that the between-population additive genetic variance (VB) and the within-population additive genetic variance (VW) are respectively (Eq 2):

VB=2V𝒜FST, (2)

and (Eq 3)

VW=V𝒜(1FST), (3)

as demonstrated by [24], and earlier by [25].

However, as noted by others before [20,26,27] FST is a global summary parameter that does not capture differences in demographic histories among subpopulations under complex population structures. These differences in demography could be for example variation in population size, different branching times between subpopulations, and variation in migration rates between pairs of subpopulations. To capture the complexity of more biologically relevant population structures, we use a matrix of coancestries Θp following [19,21].

Θp contains the expected coancestries between pairs of subpopulations relative to the ancestral population (Fig 1). The superscript “p” refers to it being a population-level parameter. Element Θi,jp describes the probability of IBD between a random pair of alleles, one coming from subpopulations i and the other from subpopulation j. Diagonal elements of Θp describe such probability for any pair of alleles from distinct individuals within a subpopulation. Θp accounts for the non-independence between subpopulations, shared evolutionary history, asymmetric migration rates, and differences in population sizes, addressing limitations of using the global summary parameter FST in unevenly structured metapopulations [21]. FST and Θp are related as FST is a linear function of the average of the diagonal elements of Θp (Eq 4):

Fig 1. Cartoon description of Θp.

Fig 1

Given r subpopulations, Θp will be a r×r matrix. In this figure Θp is represented in different shades of green describing the average level of coancestry between random alleles sampled from different pairs of subpopulations, with the diagonal being the average of random pairs of alleles from distinct individuals in the same subpopulation. Darker shades in the cells within the matrix represent higher coancestry between the pair of subpopulations being compared. Θp reflects the past and present demographic metapopulations’ histories and connectivity.

FST=(i=1rΘi,ip)rθB1θB, (4)

where θB=2r(r1)i=2rj=1(i1)Θi,jp is the average of the off-diagonal elements of Θp.

While Θp describes average co-ancestries across and within subpopulations, the M matrix Fig 2, [19] describes relatedness between pairs of individuals. As it is general practice, we assume no cross-population breeding was performed, and thus that M is a block diagonal matrix this assumption can be lifted and cross-population breeding accounted for in M, (see [19]), i.e. relatedness for pairs of individuals from different subpopulations are already defined with Θp, so they are set to zero in M. The blocks in M describe the relatedness between all pairs of individuals within the subpopulations, relative to the average relatedness within each subpopulation, and discounting relatedness already accounted for in Θp. More precisely, M contains, within each subpopulation block x, the relatedness for each pair of individuals in that subpopulation multiplied by 1Θx,xp.

Fig 2. Cartoon description of nT×nT matrix M.

Fig 2

Similar to Fig 1, darker shades describe higher coancestry, but here, the coancestry is measured for each pair of individuals within subpopulations. Without cross-population breeding in the experimental setup, M is a block diagonal matrix.

We develop a test where we estimate, in the same statistical model, the ancestral additive variances considering either (i) the variance in mean phenotype between subpopulations (V𝒜,B) or (ii) the additive genetic variance in phenotypes within subpopulations V𝒜,W. Under a model of neutral evolution, we expect the two estimates to be equal to the same, unique ancestral additive genetic variance. We thus test the null hypothesis (described in Eq 5 that the two variances are identical:

V𝒜,B=V𝒜,W. (5)

A scenario where V𝒜,B>V𝒜,W would be compatible with local adaptation, while a scenario where V𝒜,B<V𝒜,W would be compatible with spatially-homogeneous global adaptation.

The use of our method ideally requires phenotypic data of the offspring generation (which we will call F1) reared in a common environment (i.e. “common garden”), neutral genetic markers from the parental (P) generation, and neutral genetic markers or a pedigree for the F1 (we discuss alternatives to this ideal setting in S7 Text). The neutral genetic markers are used to estimate the relative mean coancestry between all pairs of subpopulations and the relatedness between all pairs of F1 individuals within subpopulations. Fig 3 illustrates at which time point Θp and M are estimated and the chosen breeding design.

Fig 3. Description of expected experimental design to estimate Θp, M, and z.

Fig 3

The second line of this illustrative table, the “Ancestral population” line, describes the assumed ancestral panmictic state all subpopulations derive from. The third line, “Parental generation” (P) describes the studied subpopulations. From these studied subpopulations, individuals are sampled and exposed to a breeding design. At this step Θp is estimated. On the fourth line, we show the breeding design in the common garden setting used in our simulations: all males, first row of circles, are crossed with all females, second row of circles, following a North Carolina II design (as described by [28]), and two offspring from each cross are phenotyped. We estimate M using genotypic (or pedigree information) and phenotypic data from the F1 generation.

Estimating coancestries using the method of moments

We use the allele-sharing method of moments developed by [21,27] to estimate the co-ancestry, using the minimum allele-sharing between subpopulations as a reference point. We use the method of moments to obtain Θp and M (M can also be obtained using pedigree information if available). The allele-sharing-based estimator can be used for species with any ploidy level and makes no assumptions about the mating system [21]. For a diploid bi-allelic locus l, allele-sharing Ai,jl between two individuals i and j takes a value 1 if they are homozygous for the same allele, 0 if they are homozygous for different alleles, and 1/2 if at least one of the two individuals is heterozygous. Over all loci, Ai,j=1Ll=1LAi,jl.

To estimate Θp, we measure allele-sharing among individuals from the parental generation. The estimate Θ^p is computed as shown in Eq 6:

Θ^p=A¯sPmin(A¯sP)1min(A¯sP), (6)

where A¯sP is a r×r matrix made of elements A¯s,(x,y)P, the average allele-sharing Ai,j over all distinct pairs of individuals, one from subpopulation x and the other from subpopulation y; and min(A¯sP) denotes the minimum off-diagonal entry of A¯sP [21]. This minimum entry is our best proxy for allele-sharing in the ancestral population, we therefore make the mean coancestry entries of Θ^p relative to the mean coancestry in the ancestral population see [21,29].

Next, we need to estimate M, whose elements are the relatedness among the phenotyped individuals (F1) within each subpopulation discounted for their shared ancestries. To obtain relatedness, with pedigrees within each subpopulation, we could simply use the additive relationship matrix, but we can also make use of genotypic information for these F1 individuals if it is available, which we assume here. For each subpopulation, we estimate kinship among all parents and F1 individuals as shown in Eq 7:

k^i,j=Ai,jA¯i,j1A¯i,j, (7)

where A¯i,j is the average allele-sharing among pairs of distinct individuals in the subpopulation [30]. We double the kinship to obtain relatedness. To obtain 𝐌^x, the block of the 𝐌^ matrix corresponding to subpopulation x, and to account for the shared ancestry of individuals in subpopulation x relative to the ancestral subpopulation, we multiply the relatedness estimates of subpopulation x by the scalar (1Θ^xp), where Θ^xp is the xth element of the diagonal of Θ^p. Finally we store all the 𝐌^x in a block diagonal matrix (see Fig 2).

Linear mixed model

We fit a mixed-effects model with the phenotypic trait z as the response variable, accounting for population and individual effects as random-effect components:

zμ+𝐚i+𝐚p+ϵ, (8)

where μ is the overall phenotypic mean, 𝐚p is the population-level additive genetic effect, 𝐚i is the individual-level additive genetic effect and ϵ is the residual error. We assume the population-level and the individual-level additive genetic effects follow a multivariate normal distribution with their respective variance-covariance matrices depending on the above-defined co-ancestries and the between- (V𝒜,B) and within-population (V𝒜,W) ancestral additive genetic variances, whose distributions are described by Eq 9 and Eq 10:

𝐚p𝒩(0,2ΘpV𝒜,B) (9)
𝐚i𝒩(0,𝐌V𝒜,W) (10)

Note that assuming different ancestral additive genetic variances for 𝐚i and 𝐚p is another notable departure from [19]. We implement this model using a Bayesian framework, yielding notably posterior distributions for the ancestral additive genetic variances (V^𝒜,B,V^𝒜,W). We used the R package brms [31], based on the Hamiltonian Monte Carlo algorithm from the STAN library [32], with default priors for the fixed and random effects.

Hypothesis testing

Verification and comparison

We evaluated our method through simulations conducted in QuantiNemo2 [33], modeling metapopulations evolving under neutral and selective conditions (detailed settings for the simulation are described in S5 Text). Fig 4 illustrates the population structures used for the neutrally evolving metapopulations, along with an example of a coancestry matrix following the expected pattern under neutrality. The simulations were designed to investigate population structures deviating from the standard Island Model, while also varying the number of subpopulations. We chose particularly the 1-dimensional stepping stones to give continuity to the results described in [18] where the authors show the issue with the QSTFST based test on a 20 subpopulation 1D stepping stones. For Stepping Stones we show results for two migration rates, leading to FST of 0.2 and 0.37. The “139” (read: one, three, nine) population structure was inspired by the structure described by [22] and serves as an example of a hierarchical structure. In this structure one population diverges into three subpopulations which each diverges into three new subpopulations, and the data is sampled from the nine subpopulations at the end of the simulation.

Fig 4. Description of studied neutrally evolving population structures along with an example of the Θp matrix for neutral markers.

Fig 4

All simulations started from an initial large gene pool that splits instantaneously into 8 subpopulations for the island model and 20 for the stepping-stone model.

In all simulated scenarios, we sampled five males and five females from each parental subpopulation. A common garden breeding design was implemented, wherein the five females from each subpopulation were mated with the five males, producing two offspring per pair. This yielded 25 families of two offspring, thus 50 offspring per subpopulation.

Phenotypes were controlled by 100 bi-allelic loci with identical and fully additive effects across all scenarios, while 2000 neutral markers were simulated for genetic analysis. In the selection scenarios, we used stabilizing selection with a different optimum between two halves of the subpopulations, we included three cases of selection for island and for stepping stones model. We were interested in testing which selection intensities were necessary for selection to be identified by LogAV, thus, the selection scenarios differ in the value taken by the parameter ω2 (see Supporting infromation S5 Text, Eq S3, which describes the stabilizing fitness function). Larger ω2 describes wider fitness peak, thus a weaker selection, and we studied three cases: ω=10, 22, 50. These specific values describe a foreign fitness of respectively, 0.6, 0.9, 0.96. In the case of the stepping stones structure, subpopulations sharing the same environment were grouped. The specific parameters for each scenario are outlined in Table 2 and S1 Table.

Table 2. Parameters used in the different simulated scenarios.

Population Structure N. subpop. Subpop. size Environment ω Optima Mig. Rate F ST
Island model 8 500 Neutral 0.001 0.13
Island model 8 500 Selective 10 -5 or 5 0.001 0.15
Island model 8 500 Selective 22 -5 or 5 0.001 0.15
Island model 8 500 Selective 50 -5 or 5 0.001 0.15
Stepping stones 20 1000 Neutral 0.001 0.37
Stepping stones 20 1000 Neutral 0.01 0.2
Stepping stones 20 1000 Selective 10 -5 or 5 0.0009 0.18
Stepping stones 20 1000 Selective 22 -5 or 5 0.0009 0.18
Stepping stones 20 1000 Selective 50 -5 or 5 0.0009 0.18
139 9 900 Neutral 0 0.2

To compare QSTFST [15] and Driftsel [22] to our method, we focused on the false positive rates (see below), thus we only run these methods over neutrally evolving metapopulations. We analyzed the same set of neutrally evolving metapopulations using the three different methods, QSTFST, Driftsel, and LogAV, and compared their results. By analyzing the methods over 500 replicates of each simulated neutral scenario we obtained distributions of p-values (in the case of QSTFST and LogAV) and S-values (in the case of Driftsel - further details in S4 Text, specifically Eq S1). For our method and Driftsel [22], we used the method of moments [21,27] to estimate coancestry. We made this choice due to results in [21] which show more accurate estimates of coancestry for the method of moments compared to the Admixture F-model (AFM) method proposed by [20].

Under neutrality, p-values are expected to follow a uniform distribution, which is also the case for our Bayesian p-value due to the asymptotical convergence of the posterior toward the sampling distribution, see [34,35], thus a threshold of 0.05 should correspond to a false positive rate (FPR) of 5%. The FPR is calculated as the proportion of p-values falling below 0.05. For Driftsel, under neutrality, S-values are expected to follow a normal distribution centered around 0.5 (see S4 Text). Following the criteria outlined by [22], significant S-values are defined as being below 0.2 or above 0.8. Although there is, strictly speaking, no theoretical expectation for what the variance of S should be under the null hypothesis, we chose to conform to the values advocated by [22] to measure a “realized” false positive rate when using their method. We performed a binomial test to compare the FPR obtained to the expected one.

Results

Under neutrality, p-values obtained with the log-ratio of ancestral variance method (LogAV) are uniformly distributed for all models of population structure, including those that diverge from the Island Model, and thus yield the expected proportion of significant tests (Fig 5). Conversely, while being correctly calibrated for the island model of population structure, the QST-FST method and Driftsel show poor behavior under population structures diverging from the Island Model structure.

Fig 5. Summary of the results of analyses conducted on neutrally evolving metapopulations under varying population structures.

Fig 5

The first column specifies the population structures examined. Subsequent columns present the outcomes for each method: QSTFST, Driftsel, and LogAV. For QSTFST and LogAV. The results are shown as distributions of p-values, while for Driftsel, the results are presented as distributions of S-values. False positive rates (FPR) significantly deviating from the 5% expectation are marked with asterisks.

As previously shown by [18], QSTFST tests have inflated FPR under Stepping Stone population structures. We find they are significantly different 80 times out of 500 (p-value = 2.2 × 10−10) when we expect 25 significant results. Similarly, Driftsel results for Stepping Stone (FST = 0.35) metapopulations deviate from the distribution expected under neutrality, with inflated empirical FPR (85 significant results out of 500, p-value < 2.2 × 10−16). Driftsel also exhibits inflated empirical FPR for the hierarchical structure 139 (44 significant results out of 500, p-value = 4.2×104). For Stepping stones with lower differentiation between demes, i.e. a metapopulation with average FST of 0.2, results are qualitatively similar for QSTFST. However, QSTFST tests have even higher FPR. We find significance for 132 replicates out of 500 (p-value < 2.2 × 10−16). As of Driftsel, we find lower FPR than the expected 5% in this lower FST setting, with 15 out of 500 (p-value  = 0.03). The results for neutrally evolving metapopulations are summarized in Fig 5.

For a further understanding of the differences between the results provided by the different methods we show a graphical comparison between the results obtained using QSTFST and LogAV in Fig S1. Additionally we present the QQ-plots of the theoretical expectation under neutrality for the distribution of statistics (either p-value or S-statistics) for all three methods in Fig S3 to S5.

We also evaluate the LogAV test ability to detect selection. Under the stepping-stone population structure, the test achieved a 100% success rate across all metapopulations and selection scenarios (Fig 6). By contrast, under the island model, we found that a large peak width of ω=50 produced results that closely resembled neutrality. This reduced power is consistent with the similarity in phenotypic distributions between the island model with ω=50 and the neutrally evolving population, as shown in Fig S2.

Fig 6. Distributions of log-ratios from the LogAV test under different selection intensities and population structures.

Fig 6

Each violin plot represents the distribution of log(V^𝒜,B/V^𝒜,W) across replicate simulations. We compare two types of metapopulation structure — Island Model (blue tones) and Stepping Stones (green tones) — evolving either neutrally or under selection with different strengths (ω=10, 22, 50). Higher log-ratios indicate stronger evidence for diversifying selection. The LogAV test reliably detects selection in all Stepping Stones scenarios and in the Island Model scenarios with selection strength corresponding to ω=10,22, while it did not detect selection more often than in the neutral simulations for Island Model simulations with weaker selection (ω=50), where phenotypic divergence is less distinct.

Applications

We applied LogAV to a previously published data set of nine-spine stickleback from four different populations raised in a common garden [23]. This study analyzed the variation of morphological and behavioral traits using Driftsel [22]. Here, we applied LogAV to the individual traits.

For morphological traits, the original study reports only results from a multivariate analysis, where they observed a signal of diversifying selection. With LogAV, we also find a signal of diversifying selection for four of the traits (Table 3; significant traits are Standard length, body depth, head length, pelvic girdle length).

Table 3. Results of univariate tests of local adaptation using the LogAV method.

For each trait, the table shows the mean log ratio and the associated P-value. Significant evidence for local adaptation (e.g., P-value <0.05) is highlighted.

Trait Mean Log Ratio P-value
Standard Length (SL) 4.3955 0.0000
Body Depth 4.8672 0.0000
Caudal Peduncle Length (cp.length) 1.8753 0.1210
Head Length 4.4902 0.0000
Aggression (n.attack) 3.7116 0.0488
Pelvic Girdle Length (pg.length) 3.2546 0.0047
Risk Taking 1 (t.fullout)a 0.6225 0.6512
Risk Taking 2 (feed)b 2.1483 0.1285
Time to Orient (t.orient) -1.3450 0.6328

Note. Traits were tested individually using the LogAV method. A significant result suggests evidence for local adaptation in the trait.

aTime in seconds until the fish fully emerged from a refuge.

bBinary response: whether the fish bit food within 300 seconds after a simulated attack.

For the behavioral traits, the univariate results we obtain with LogAV are similar to those reported with Driftsel in the original article, indicating that “aggressiveness” (number of attacks) is likely under selection, while the others (“risk taking 1” and “risk taking 2”) could not be shown to be under selection.

Discussion

In this study, we present LogAV, a novel method based on the log-ratio of ancestral variances to disentangle the effects of selection and neutral evolution on phenotypic differentiation in spatially structured populations. This method builds on the conceptual foundation established by [19], who highlighted the importance of incorporating between-population relatedness to estimate phenotypic differentiation. Through simulations, we demonstrate that LogAV is well-calibrated across all tested population structures. These simulations also showed that existing alternatives such as Driftsel and QSTFST, are poorly calibrated under neutral scenarios that depart from the island model. LogAV’s reliability comes from (i) using coancestry matrices computed via the method of moments [21,27], which accurately capture genealogical ties within and between subpopulations; and (ii) using a theoretically justified null hypothesis test framework by comparing two estimates of the ancestral additive genetic variance. This robustness makes it well-suited for real-world use.

Real-world metapopulations often deviate significantly from the assumptions of the island model. For instance, [36] documented a metapopulation of quolls (Dasyurus hallucatus) in northern Australia characterized by a hierarchical structure, comprising three main clusters with further subclusters nested within each. Similarly, the bat Miniopterus schreibersii demonstrates isolation-by-distance patterns, with additional fine-scale structuring observed within localities [37].

A good example of how structure can potentially impact studies of local adaptation is found in freshwater snails Galba truncatula. In western Switzerland, these snails were found in a hierarchical population structure. [38] found even physically close subpopulations, a few meters apart, showed large differentiation, and the differentiation increased as the distances between subpopulations grew. However, the geographic distance alone was not enough to describe the neutral genetic distance, with some far-away subpopulations being undifferentiated, while a small stream was sufficient to create genetic differentiation between physically close subpopulations. Using the same subpopulations, [39] found QST to be significantly larger (on average) than FST between temporary and permanent habitats, but significantly smaller among subpopulations within habitats. It is possible, however, that these results are due to not accounting for the complex population structure in this system.

Studies on local adaptation in a wide range of species have demonstrated that ecological factors strongly influence population structure. For instance, in Daphnia species, differentiation, and genetic diversity strongly depend on habitat size and type [4042]. Consequently, assuming the Island Model for these species may be an oversimplification, which can lead to high false positive rates in tests of local adaptation. Beyond ecology and life history traits, neutral genetic structure in metapopulations can also be shaped by historical events, such as the recolonization of previously isolated populations following glacial cycles or anthropogenic activities, as is documented for instance in the metapopulations of sticklebacks in western North America [43]. These examples underscore the complexity of natural metapopulation structures, where migration rates and genetic connectivity often vary between pairs of subpopulations. Analyzing phenotypic divergence in such systems requires methods capable of accounting for this structural complexity. Our results show that LogAV is robust to varying population structures and should be a reliable method for testing for local adaptation in the aforementioned species.

In addition to its good performance under neutrality, the preliminary tests of LogAV using traits under selection are promising. So far, we have observed that LogAV demonstrates high sensitivity to selective scenarios. These results were consistent throughout the two different population structures tested under selection. However, further and more thorough tests, especially considering different selection strengths and sample size, are needed to identify its limits.

LogAV should be tested in scenarios that explore the interplay of various parameters, such as migration rates between subpopulations and selection patterns. Tests should include varying both the types of selection patterns and the levels of relatedness, with, for example, closely related populations being exposed to similar selective pressures. It would also be important to assess the method’s performance under varying effective population sizes, particularly in scenarios where some populations have much smaller effective sizes than others. We expect LogAV to be robust to such variations, as the inclusion of population coancestries should capture these differences in effective sizes. Additionally, the impact of sampling effort should be examined: for instance, determining the minimum number of phenotyped individuals and the number of subpopulations. Another point that deserves investigation is how the chosen breeding design will influence the outcome of LogAV [44].

We also suggest further research to test different scenarios of ancestry. The efficacy of the proposed method depends on accurately estimating the ancestral variance from inferred relatedness, which may be impacted by assumptions of panmixia in the ancestral population. Deviations from this assumption, such as in cases of historical population substructure or admixture events, could bias variance estimates. Substructuring of the ancestral population can lead to incomplete lineage sorting and thus to non-uniform coalescent times, which could impact the estimation of the ancestral additive genetic variance. Future research should investigate to what extent the proposed method is affected by the lack of panmixia in the ancestral population.

It is important to mention that other methods have also considered the problem of population structure, and have provided potential solutions considering that little to no information on population of origin and ancestral populations is known. [45] proposed a method for detecting local adaptation using principal component analysis of the relatedness matrix. An advantage of LogAV is the more classic statistical test using an analogue of the p-value. [45]’s method depends on the user choosing the threshold of their tests, more specifically the cutoff for which PCs represent between- and within-population variance.

At broader evolutionary scales, other methods have considered similar principles [46,47] as creating neutral expectations from molecular data, and using variance comparisons. [46] proposed a phylogenetic-scale neutrality index by normalizing within and between taxa variance. At a species-level, [47] create a neutral expectation for gene expression evolution by also developing an index but considering sequence divergence and polymorphism in expression data. Still at this broader phylogenetic scale, [48] proposed a framework that bridges statistical genetics and phylogenetic comparative methods. Their work emphasizes how both fields are actually using the same general model, and the importance of considering relatedness when comparing trait associations, whether studying populations or species

Further, we recognize the relevance of conducting additional research to investigate the effect of genetic architecture on the performance of methods for detecting local adaptation. [49] conducted a study that partly addresses this issue by analyzing the impact of genetic architecture on different variations of the QSTFST. Specifically, they tested the impact of varying numbers of loci impacting the trait on the distribution of QST. Despite the theoretical expectation of QST not depending on the number of loci underlying the phenotype in which the statistic is measured, [49] observed that a higher number of loci leads to higher variance in the QST distribution. We have not explored the impact of number of loci, and it is also unclear how dominance and epistatic interactions might impact the results of our method.

Evidence suggests that the classical QSTFST would be uncalibrated when there is dominance. However, the error direction is not resolved. [50] and [51] show that dominance reduces the mean value of QST, meaning that dominance makes the QSTFST comparison more conservative for divergent selection. While [52] show that under different population structures, QST may exceed FST. Since the empirical effect of dominance on genetic variance needs further investigation as some studies show little influence (e.g. [53]) of dominance while others show large influence (e.g. [54]), it is unclear which effects dominance would have on LogAV. Thus, future research should investigate the extent to which our method could be affected by non-additive effects on the studied phenotype.

A noteworthy difference between our method and QSTFST is that it is very general regarding the breeding design used within the common garden experiment. In particular, the QSTFST improvements proposed by [15] can be difficult to extend to accommodate breeding designs other than half-sib families. Specifically, their approach is constrained to balanced datasets, wherein offspring are related as half-sibs through shared fathers. Although [55] suggested an extension to unbalanced datasets, further extensions will be restricted to specific cross or nested breeding designs. Similarly to [19], our method adopts a framework akin to the animal model, which creates flexibility for the breeding design. Further, and extending the flexibility of Driftsel [22], the within-population relatedness matrices can be obtained either from pedigree information or, alternatively, from genetic marker data in the F1 generation, enabling accurate estimation of relatedness [30] even when the exact crosses to generate the F1 are unknown, a situation often encountered for investigations in plants, where seeds are sampled from the field and grown in a common garden environment.

It has been repeatedly suggested in the literature that, by relying on the ‘phenotypic gambit’ [56], it is possible to alleviate the need for an experimental common garden environment, using direct phenotypic measurements in wild conditions. In such case, a ratio of between-population to the total phenotypic variance, named PST is computed and compared to the FST [57]. Our method will then suffer from the same issues as the QSTFST does, especially due to the confounding effect of phenotypic plasticity and genotype-by-environment interaction [10,11]. While there exist workarounds for simple occurrences of the first issue [58] and although the phenotypic gambit works to some extent [11], it remains a hardly generalizable simplification. Our method is explicitly based on the assumption that phenotypic traits were evaluated in a common garden settings.

Although the relatedness information used in Θp and M can be obtained from a wild population settings, the issue of phenotypic plasticity and, especially, genotype-by-environment interaction invalidates our null hypothesis assumption of estimates of the ancestral additive genetic variance, V^𝒜,B and V^𝒜,W. Our method could possibly be made more robust to such issues by exploring parametrized scenarios of genotype-by-environment interaction, following the spirit of [58], but this would bear the same limits as such approach, and is beyond the scope of this study.

Additionally, we recommend that Θp and M be calculated respectively from a founder population and from individuals measured in a common garden. While it may be possible to estimate both from the same set of F1 genotypes, doing so may lead to inflated values of Θp due to the added effects of drift across generations in the common garden environment. Potentially, reconstructing parental genotypes from the F1 genotype could be a potential solution [59], though we have not investigated this solution in this study.

In summary, our results validate the log-ratio of ancestral variance as a robust method for distinguishing adaptive divergence from neutral processes in structured populations. Its well-calibrated performance across diverse population structures demonstrates that it provides a reliable alternative to QSTFST and Driftsel. We did so by adopting a general framework based on well-established quantitative genetic and statistical principles. From quantitative genetic theory, we derive two distinct approaches to calculate ancestral additive genetic variances, V𝒜,W and V𝒜,B. Then we compare the two estimates of ancestral variance using a log-ratio approach that has been adopted by many hypothesis testing frameworks (e.g., likelihood ratio test) and has proven to be very robust in a wide range of applications requiring the comparison of variances.

Supporting information

S1 Text. Explanations for S1 Fig.

(PDF)

pgen.1011871.s001.pdf (64.3KB, pdf)
S2 Text. Estimation of P-values from a Bayesian posterior.

(PDF)

pgen.1011871.s002.pdf (93.6KB, pdf)
S3 Text. Breeding design description.

(PDF)

pgen.1011871.s003.pdf (29.6KB, pdf)
S4 Text. Properties of the S-statistic, and why it is normally distributed.

(PDF)

pgen.1011871.s004.pdf (95.6KB, pdf)
S5 Text. Computer simulation details.

(PDF)

pgen.1011871.s005.pdf (105.4KB, pdf)
S6 Text. Quantinemo initialization parameters.

(PDF)

pgen.1011871.s006.pdf (62.1KB, pdf)
S7 Text. Type of data necessary to estimate coancestries and relatedness.

(PDF)

pgen.1011871.s007.pdf (84KB, pdf)
S8 Text. Description for S3 Fig.

(PDF)

pgen.1011871.s008.pdf (54.7KB, pdf)
Fig S1. Distribution of p-values of the different methods under neutrality.

(PDF)

pgen.1011871.s009.pdf (150.9KB, pdf)
Fig S2. Distribution of trait values for the different scenarios.

(PDF)

pgen.1011871.s010.pdf (314.4KB, pdf)
Fig S3. Quantile plots for the QSTFST tests.

(PDF)

pgen.1011871.s011.pdf (142.6KB, pdf)
Fig S4. Quantile plots for Driftsel tests.

(PDF)

pgen.1011871.s012.pdf (117.1KB, pdf)
Fig S5. Quantile plots for the LogAV tests.

(PDF)

pgen.1011871.s013.pdf (121.6KB, pdf)
Table S1. Summary of simulation parameters for each scenario.

(PDF)

pgen.1011871.s014.pdf (32.4KB, pdf)

Acknowledgments

We thank Otso Ovaskainen and Juha Merila for fruitful discussions and suggestions. We also thank Thibault Latrille, Ehouarn Le Faou, and Michael Whitlock for their comments and suggestions on the manuscript.

Data Availability

All necessary scripts and data to reproduce this study can be found at https://github.com/isadoo/MethodsLocalAdaptation_LogAV.

Funding Statement

This study was funded by the Swiss Science National Foundation grant 31003A_179358, 310030_215709 to JG. The funders did not play any role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Lande R. Neutral theory of quantitative genetic variance in an island model with local extinction and colonization. Evolution. 1992;46(2):381–9. doi: 10.1111/j.1558-5646.1992.tb02046.x [DOI] [PubMed] [Google Scholar]
  • 2.Lynch M, Walsh B. Genetics and analysis of quantitative traits. Sunderland, MA: Sinauer; 1998. [Google Scholar]
  • 3.Yeaman S. Evolution of polygenic traits under global vs local adaptation. Genetics. 2022;220(1):iyab134. doi: 10.1093/genetics/iyab134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Dittberner H, Korte A, Mettler-Altmann T, Weber APM, Monroe G, de Meaux J. Natural variation in stomata size contributes to the local adaptation of water-use efficiency in Arabidopsis thaliana. Mol Ecol. 2018;27(20):4052–65. doi: 10.1111/mec.14838 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Côte J, Roussel J-M, Le Cam S, Guillaume F, Evanno G. Adaptive divergence in embryonic thermal plasticity among Atlantic salmon populations. J Evol Biol. 2016;29(8):1593–601. doi: 10.1111/jeb.12896 [DOI] [PubMed] [Google Scholar]
  • 6.de Miguel M, Rodríguez-Quilón I, Heuertz M, Hurel A, Grivet D, Jaramillo-Correa JP, et al. Polygenic adaptation and negative selection across traits, years and environments in a long-lived plant species (Pinus pinaster Ait., Pinaceae). Mol Ecol. 2022;31(7):2089–105. doi: 10.1111/mec.16367 [DOI] [PubMed] [Google Scholar]
  • 7.Fisher RA. The genetical theory of natural selection: A complete variorum edition. Oxford University Press; 1999. [Google Scholar]
  • 8.Pritchard JK, Di Rienzo A. Adaptation – Not by sweeps alone. Nat Rev Genet. 2010;11(10):665–7. doi: 10.1038/nrg2880 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Spitze K. Population structure in Daphnia obtusa: Quantitative genetic and allozymic variation. Genetics. 1993;135(2):367–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Pujol B, Wilson AJ, Ross RIC, Pannell JR. Are Q(ST)-F(ST) comparisons for natural populations meaningful?. Mol Ecol. 2008;17(22):4782–5. doi: 10.1111/j.1365-294X.2008.03958.x [DOI] [PubMed] [Google Scholar]
  • 11.Leinonen T, McCairns RJS, O’Hara RB, Merilä J. Q(ST)-F(ST) comparisons: Evolutionary and ecological insights from genomic heterogeneity. Nat Rev Genet. 2013;14(3):179–90. doi: 10.1038/nrg3395 [DOI] [PubMed] [Google Scholar]
  • 12.O’Hara RB, Merilä J. Bias and precision in QST estimates: Problems and some solutions. Genetics. 2005;171(3):1331–9. doi: 10.1534/genetics.105.044545 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Goudet J, Büchi L. The effects of dominance, regular inbreeding and sampling design on Q(ST), an estimator of population differentiation for quantitative traits. Genetics. 2006;172(2):1337–47. doi: 10.1534/genetics.105.050583 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Whitlock MC. Evolutionary inference from QST. Mol Ecol. 2008;17(8):1885–96. doi: 10.1111/j.1365-294X.2008.03712.x [DOI] [PubMed] [Google Scholar]
  • 15.Whitlock MC, Guillaume F. Testing for spatially divergent selection: Comparing Q ST to F ST. Genetics. 2009;183(3):1055–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Lewontin RC, Krakauer J. Distribution of gene frequency as a test of the theory of the selective neutrality of polymorphisms. Genetics. 1973;74(1):175–95. doi: 10.1093/genetics/74.1.175 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Koch EM. The effects of demography and genetics on the neutral distribution of quantitative traits. Genetics. 2019;211(4):1371–94. doi: 10.1534/genetics.118.301839 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.de Villemereuil P, Gaggiotti OE, Goudet J. Common garden experiments to study local adaptation need to account for population structure. J Ecol. 2020;110(5):1005–9. doi: 10.1111/1365-2745.13528 [DOI] [Google Scholar]
  • 19.Ovaskainen O, Karhunen M, Zheng C, Arias JMC, Merilä J. A new method to uncover signatures of divergent and stabilizing selection in quantitative traits. Genetics. 2011;189(2):621–32. doi: 10.1534/genetics.111.129387 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Karhunen M, Ovaskainen O. Estimating population-level coancestry coefficients by an admixture F model. Genetics. 2012;192(2):609–17. doi: 10.1534/genetics.112.140871 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Goudet J, Weir BS. An allele-sharing, moment-based estimator of global, population-specific and population-pair FST under a general model of population structure. PLoS Genet. 2023;19(11):e1010871. doi: 10.1371/journal.pgen.1010871 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Karhunen M, Merilä J, Leinonen T, Cano JM, Ovaskainen O. DRIFTSEL: An R package for detecting signals of natural selection in quantitative traits. Mol Ecol Resour. 2013;13(4):746–54. doi: 10.1111/1755-0998.12111 [DOI] [PubMed] [Google Scholar]
  • 23.Karhunen M, Ovaskainen O, Herczeg G, Merilä J. Bringing habitat information into statistical tests of local adaptation in quantitative traits: A case study of nine-spined sticklebacks. Evolution. 2014;68(2):559–68. doi: 10.1111/evo.12268 [DOI] [PubMed] [Google Scholar]
  • 24.Whitlock MC. Neutral additive genetic variance in a metapopulation. Genetics Research. 1999 ;74(3):215–221. [DOI] [PubMed]
  • 25.Wright S. Evolution and the genetics of populations. The University of Chicago Press; 1969. [Google Scholar]
  • 26.Weir BS, Hill WG. Estimating F-statistics. Ann Rev Genet. 2002;36(1):721–50. [DOI] [PubMed] [Google Scholar]
  • 27.Weir BS, Goudet J. A unified characterization of population structure and relatedness. Genetics. 2017;206(4):2085–103. doi: 10.1534/genetics.116.198424 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Suza W, Lamkey K. Quantitative genetics for plant breeding. Iowa State University Digital Press; 2023. 10.31274/isdup.2023.146 [DOI] [Google Scholar]
  • 29.Ochoa A, Storey JD. Estimating FST and kinship for arbitrary population structures. PLoS Genet. 2021;17(1):e1009241. doi: 10.1371/journal.pgen.1009241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Goudet J, Kay T, Weir BS. How to estimate kinship. Mol Ecol. 2018;27(20):4121–35. doi: 10.1111/mec.14833 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Bürkner P-C. Advanced Bayesian Multilevel Modeling with the R Package brms. R J. 2018;10(1):395. doi: 10.32614/rj-2018-017 [DOI] [Google Scholar]
  • 32.Hoffman MD, Gelman A. The no-u-turn sampler: Adaptively setting path lengths in Hamiltonian Monte Carlo. J Mach Learn Res. 2014;15:1593–623. [Google Scholar]
  • 33.Neuenschwander S, Michaud F, Goudet J. QuantiNemo 2: A Swiss knife to simulate complex demographic and genetic scenarios, forward and backward in time. Bioinformatics. 2019;35(5):886–8. doi: 10.1093/bioinformatics/bty737 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian data analysis. 2nd ed. Boca Raton, Florida (US): Chapman & Hall/CRC Press; 2004. [Google Scholar]
  • 35.Shi H, Yin G. Reconnecting p-value and posterior probability under one-and two-sided tests. Am Stat. 2021;75(3):265–75. [Google Scholar]
  • 36.von Takach B, Ranjard L, Burridge CP, Cameron SF, Cremona T, Eldridge MDB, et al. Population genomics of a predatory mammal reveals patterns of decline and impacts of exposure to toxic toads. Mol Ecol. 2022;31(21):5468–86. doi: 10.1111/mec.16680 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Dufresnes C, Dutoit L, Brelsford A, Goldstein-Witsenburg F, Clément L, López-Baucells A, et al. Inferring genetic structure when there is little: Population genetics versus genomics of the threatened bat Miniopterus schreibersii across Europe. Sci Rep. 2023;13(1):1523. doi: 10.1038/s41598-023-27988-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Trouve S, Degen L, Goudet J. Ecological components and evolution of selfing in the freshwater snail Galba truncatula. J Evol Biol. 2005;18(2):358–70. [DOI] [PubMed] [Google Scholar]
  • 39.Chapuis E, Trouve S, Facon B, Degen L, Goudet J. High quantitative and no molecular differentiation of a freshwater snail (Galba truncatula) between temporary and permanent water habitats. Mol Ecol. 2007;16(16):3484–96. doi: 10.1111/j.1365-294X.2007.03386.x [DOI] [PubMed] [Google Scholar]
  • 40.Haag CR, Riek M, Hottinger JW, Pajunen VI, Ebert D. Founder events as determinants of within-island and among-island genetic structure of Daphnia metapopulations. Heredity (Edinb). 2006;96(2):150–8. doi: 10.1038/sj.hdy.6800774 [DOI] [PubMed] [Google Scholar]
  • 41.Vanoverbeke J, De Gelas K, De Meester L. Habitat size and the genetic structure of a cyclical parthenogen, Daphnia magna. Heredity (Edinb). 2007;98(6):419–26. doi: 10.1038/sj.hdy.6800958 [DOI] [PubMed] [Google Scholar]
  • 42.Walser B, Haag CR. Strong intraspecific variation in genetic diversity and genetic differentiation in Daphnia magna: The effects of population turnover and population size. Mol Ecol. 2012;21(4):851–61. doi: 10.1111/j.1365-294X.2011.05416.x [DOI] [PubMed] [Google Scholar]
  • 43.Catchen J, Bassham S, Wilson T, Currey M, O’Brien C, Yeates Q, et al. The population structure and recent colonization history of Oregon threespine stickleback determined using restriction-site associated DNA-sequencing. Mol Ecol. 2013;22(11):2864–83. doi: 10.1111/mec.12330 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Fraimout A, Guillaume F, Li Z, Sillanpää MJ, Rastas P, Merilä J. Dissecting the genetic architecture of quantitative traits using genome-wide identity-by-descent sharing. Mol Ecol. 2024;33(6):e17299. doi: 10.1111/mec.17299 [DOI] [PubMed] [Google Scholar]
  • 45.Josephs EB, Berg JJ, Ross-Ibarra J, Coop G. Detecting adaptive differentiation in structured populations with genomic data and common gardens. Genetics. 2019;211(3):989–1004. doi: 10.1534/genetics.118.301786 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Latrille T, Bastian M, Gaboriau T, Salamin N. Detecting diversifying selection for a trait from within and between-species genotypes and phenotypes. J Evol Biol. 2024;37(12):1538–50. doi: 10.1093/jeb/voae084 [DOI] [PubMed] [Google Scholar]
  • 47.Warnefors M, Eyre-Walker A. A selection index for gene expression evolution and its application to the divergence between humans and chimpanzees. PLoS One. 2012;7(4):e34935. doi: 10.1371/journal.pone.0034935 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Schraiber JG, Edge MD, Pennell M. Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations. PLoS Biol. 2024;22(10):e3002847. doi: 10.1371/journal.pbio.3002847 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Liu JJ, Edge MD. Error rates in Q_ST–F_ST comparisons depend on genetic architecture and estimation procedures. bioRxiv. 2024:2024–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Goudet J, Martin G. Under neutrality, Q(ST) < or = F(ST) when there is dominance in an island model. Genetics. 2007;176(2):1371–4. doi: 10.1534/genetics.106.067173 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Cubry P, Scotti I, Oddou-Muratorio S, Lefèvre F. Generalization of the QST framework in hierarchically structured populations: Impacts of inbreeding and dominance. Mol Ecol Resour. 2017;17(6):e76–83. doi: 10.1111/1755-0998.12693 [DOI] [PubMed] [Google Scholar]
  • 52.López-Fanjul C, Fernández A, Toro MA. The effect of dominance on the use of the QST-FST contrast to detect natural selection on quantitative traits. Genetics. 2007;176(1):725–7. doi: 10.1534/genetics.106.062901 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Hill WG, Goddard ME, Visscher PM. Data and theory point to mainly additive genetic variance for complex traits. PLoS Genet. 2008;4(2):e1000008. doi: 10.1371/journal.pgen.1000008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Class B, Brommer JE. Can dominance genetic variance be ignored in evolutionary quantitative genetic analyses of wild populations?. Evolution. 2020;74(7):1540–50. doi: 10.1111/evo.14034 [DOI] [PubMed] [Google Scholar]
  • 55.Gilbert KJ, Whitlock MC. QST-FST comparisons with unbalanced half-sib designs. Mol Ecol Resour. 2015;15(2):262–7. doi: 10.1111/1755-0998.12303 [DOI] [PubMed] [Google Scholar]
  • 56.Grafen A. Natural selection, kin selection and group selection. In: Krebs JR, Davies NB, editors. Behavioural ecology: An evolutionary approach. Malden, MA: Wiley-Blackwell; 1984. [Google Scholar]
  • 57.Leinonen T, Cano JM, Mäkinen H, Merilä J. Contrasting patterns of body shape and neutral genetic divergence in marine and lake populations of threespine sticklebacks. J Evol Biol. 2006;19(6):1803–12. doi: 10.1111/j.1420-9101.2006.01182.x [DOI] [PubMed] [Google Scholar]
  • 58.Brommer JE. Whither Pst? The approximation of Qst by Pst in evolutionary and conservation biology. J Evol Biol. 2011;24(6):1160–8. doi: 10.1111/j.1420-9101.2011.02268.x [DOI] [PubMed] [Google Scholar]
  • 59.Jones OR, Wang J. COLONY: A program for parentage and sibship inference from multilocus genotype data. Mol Ecol Resour. 2010;10(3):551–5. doi: 10.1111/j.1755-0998.2009.02787.x [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Kelly A Dyer, Michael Edge

8 May 2025

PGENETICS-D-25-00294

A method for identifying local adaptation in structured populations

PLOS Genetics

Dear Dr. Goudet,

Thank you for submitting your manuscript to PLOS Genetics. After careful consideration, we feel that it has merit but does not fully meet PLOS Genetics's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript within 60 days Jul 07 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosgenetics@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pgenetics/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to any formatting updates and technical items listed in the 'Journal Requirements' section below.

* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

We look forward to receiving your revised manuscript.

Kind regards,

Michael Edge

Guest Editor

PLOS Genetics

Kelly Dyer

Section Editor

PLOS Genetics

Aimée Dudley

Editor-in-Chief

PLOS Genetics

Anne Goriely

Editor-in-Chief

PLOS Genetics

Additional Editor Comments :

Three experts have now reviewed the manuscript, and I have read it as well. All readers are agreed that the proposed approach represents a worthwhile addition to the set of methods available for testing for divergent selection among related natural subpopulations, and all are positive about the fit of the paper with PLoS Genetics. At the same time, the reviewers all suggested additions to the manuscript that would make it clearer or more convincing. In particular, reviewers 1 and 2 ask for additional theoretical and methodological detail, and reviewers 1 and 3 suggest additional simulations that would strengthen the manuscript. Reviewers 2 and 3 both ask the authors to consider whether other schemes for estimating pieces of input to the model that might allow for wider application of the method. In a revised version, I would ask that you comment on these possibilities and/or expand on the necessity for the design proposed.

Journal Requirements:

1) We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex. If you are providing a .tex file, please upload it under the item type u2018LaTeX Source Fileu2019 and leave your .pdf version as the item type u2018Manuscriptu2019.

2) Your manuscript is missing the following sections: Verification and Comparison, and Applications. Please ensure that your article adheres to the standard Methods article layout and order of Abstract, Author Summary, Introduction, Description of the Method, Verification and Comparison, Applications, Discussion, Acknowledgements, References, and Supplementary Information. For details on what each section should contain, see our Methods article guidelines:

https://journals.plos.org/plosgenetics/s/submission-guidelines#loc-manuscript-organization.

3) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines: 

https://journals.plos.org/plosgenetics/s/figures

4) We notice that your supplementary Figures, Table, and information are included in the manuscript file. Please remove them and upload them with the file type 'Supporting Information'. Please ensure that each Supporting Information file has a legend listed in the manuscript after the references list.

Reviewers' comments:

Reviewer's Responses to Questions

Reviewer #1: Summary

The authors present a simple, robust method to perform QST-FST analyses under a variety of population and breeding structures. They combine and extend previous work on estimating kinship matrices and applying the animal model to study local adaptation. I think that the developed test quantity of the ratio of the between to within population estimates of the ancestral genetic variance is a good idea that will likely apply to a wide range of scenarios. I do not see any major issues with the paper. The structure of the paper is that of advocating a new methodological approach without providing software to apply the method or testing it on real data. I generally think this is fine, but I think that a bit more should be done to justify the approach and identify conditions under which it will fail.

Major comments:

The approach is described in terms of estimating the additive ancestral genetic variance and the provided derivation uses a model of population structure where subpopulations result from one ancestral, panmictic population. However, it is clear that the authors intend for the model to be applicable in scenarios where this is not true. All but one of the simulated models lack any ancestral population. It is therefore strange to see the discussion suggest investigating “to what extent the proposed method is affected by the lack of panmixia in the ancestral population.” The method was tested on three models without any ancestral population at all, and seems to perform quite well, but this is not really discussed or interpreted at all. The authors should do more to at least discuss this generality. In the absence of any software or empirical examples the main focus must be on the theoretical properties.

How should one interpret estimates of the ancestral genetic variance in situations like models 2 and 3? It seems like the ancestral genetic variance may be related to the point at which lineages have been sufficiently shuffled by migration to become exchangeable. Some sort of explanation along these lines would be helpful.

The authors should be more explicit about the variance decomposition leading to equation (8). I was thrown by the seeming possibility for negative values in equation (7) so additional rationale is needed there.

One simple robustness check that could be accomplished with the simulations already performed would be to misassign some individuals to the wrong populations. These may represent recent migrants or incorrect labeling. There is some natural worry with QST methods that populations are correctly defined in the first place. My guess would be that such mislabeling does not lead to false positives since it would inflate within population variance, which the extra between population relatedness would be ignored. This would be good to see explicitly.

Given the focus on the properties of the allele-sharing estimator of coancestry, it would be good to explicitly see whether using other estimators inflates the rate of false positives.

While no software or empirical examples are presented (presumably to be given by future work), it would still be good to know whether the method could be applied to the re-analysis of published data. Driftsel appears to have been applied in at least a handful of published works. Would re-analysis be straightforward?

Minor comments:

144-145: FST is not proportional to the average of the diagonal element of Theta. Doubling this average while maintaining between population relatedness would not double FST?

165: I think stabilizing selection on the phenotypic trait, rather than balancing selection, is meant here.

219: It is not sufficient to say that a Bayesian estimation framework is employed without specifying priors and how the model was fit. I don’t see this in the supplement but it should be there.

269: AFM was not previously defined.

Reviewer #2: In this paper, Isabela et al. developed a new method to estimate the genetic variance between- (Vb) and within-populations (Va) for Qst-Fst comparisons that takes within and between-population relatedness into account. They call this approach LogAV and show that its Type I error is better calibrated than other approaches (e.g. traditional Qst-Fst and DriftSel) for a range of population structures.

The approach makes sense and the authors convincingly show that the errors are well-calibrated. My main comments are related to the lack of sufficient details of the methods and simulations:

I found the details of the inference to be a bit sparse in the main text and supplement. I understand that a Bayesian framework was used to estimate Va and Vb but what were the priors? I assume uniform given their discussion on the equivalence of p-values and posterior probabilities but please make this explicit. What was the range of the prior distribution? Which algorithm was used to sample from the posterior? The code on github suggests MCMC but this and the specific packages used should be explained in more detail in the paper, especially since this is primarily a methods paper.

Please provide more details on the simulation parameters. 100 causal and 2000 neutral markers were used for the simulations. What were their frequencies? Were they drawn from some distribution? Is there a burn-in period before divergence? How exactly were the genetic values simulated to ensure heritability of 0.8 across all scenarios?

Was coancestry estimated from neutral markers only or were the causal variants included? This matters because the frequency distribution and Fst can differ between neutrally evolving markers and those under selection and it's important to evaluate how that would affect estimation of variance components with LogAV.

Fig. S2 is supposed to show 100% success of LogAV in detecting selection. The figure is difficult to interpret beyond showing that the p-values under the neutral scenario are uniformly distributed and under selection to be very low. A better, more informative visualization would be an ROC curve.

Do you strictly need a design where genotypes from the parents are available? My intuition is that it should be possible to get an estimate of theta^p from the offspring generation by methods such as REAP (Thornton et al.) and PC-RELATE (Conomos et al.) to estimate it.

Given that this is primarily a method paper, these details should be made more explicitly available so we can evaluate the performance of the method under different scenarios. It would also broaden the scope of the paper to describe the performance of the method under a wider range of scenarios considering different strengths of selection and genetic architectures. Without this, the paper comes off as more of a preliminary analysis of a promising method as opposed to a fully realized approach.

References:

Thornton, T., Tang, H., Hoffmann, T. J., Ochs-Balcom, H. M., Caan, B. J., & Risch, N. (2012). Estimating kinship in admixed populations. American Journal of Human Genetics, 91(1), 122–138. https://doi.org/10.1016/j.ajhg.2012.05.024

Conomos, M. P., Reiner, A. P., Weir, B. S., & Thornton, T. A. (2016). Model-free Estimation of Recent Genetic Relatedness. American Journal of Human Genetics, 98(1), 127–148. https://doi.org/10.1016/j.ajhg.2015.11.022

Reviewer #3: The manuscript by do O et al explores a new test for selection on quantitative traits by building a linear mixed model and testing for equivalent of the "ancestral population variance" and the within population variance. The test is conceptually similar to Qst-Fst tests, but, as the authors note, expected to be more robust because it can account for more complicated forms of population structure. It also produces something akin to a calibrated p-value distribution, which seems robust to population structure.

I think that the method overall sounds quite reasonable: the authors form two genetic relatedness matrices, one account for the relationship between populations and one accounting for the relationships within each population. They then cleverly use this to build a a statistical test that the within and between population variances are the same. They assess significance using a Bayesian p-value, which is not a concept I was familiar with.

I think that their simulation results, though somewhat sparse, suggest that this method controls false discoveries significantly better than alternative approaches, particularly when population structure deviates from an island model.

While the authors mostly focus on false positive rates, and do a thorough job exploring different scenarios, they only spend a short time (and one supplementary figure) on the power of the test, which they find to be 100% in the scenario they explored. I think that an easy way to make the manuscript have a little more meat would be for the authors to explore a bit more about the power of the method, for example by exploring power across a grid of omega or optimum shift values. It would also be interesting to compare power to the other methods---although given their elevated false positive rate, the authors would need to be somewhat careful about how they frame it.

Another thought that struck me during the reading of the manuscript is that the authors imagine a two step procedure for data collection: first, sampling parents in the field and using them to build the \Theta matrix, and then doing a common garden experiment using the offspring of those samples to measure phenotypes and to measure genotypes to build the M matrix. First of all, it seems like (and perhaps I'm overlooking some aspect of double dipping here) that the same F1 genotypes could be used to construct M and \Theta, which could potentially be useful in some settings?

Secondly, and maybe more interestingly, what if phenotypes and genotypes are measured exclusively from the parental generation? There are many systems in which a common garden is not feasible, but questions of this sort are still interesting. This framework, based entirely on genetic relatedness matrices, seems like it could in principle be applied using purely parental genotypes and phenotypes. Obviously a complication here is that environmental differences between the parental populations will confound the estimation of V_{A,B}. However, it would be interesting to see how strong of an effect would be necessary to really mess up the test. I'm not suggesting that the authors have to do this, as I think he manuscript is already very interesting, but it could be of interest to many practitioners---and anyway, I have a strong suspicion some might just go ahead and do it!

Finally I think that the authors test bears some similarity to a few tests proposed elsewhere in the literature that I think the authors should engage with, e.g. Warnefors and Eyre-Walker (2012 and Latrille et al (2024), which if I recall correctly test a very similar hypothesis about equal variances but motivate it from a McDonald-Kreitman-like perspective. To be clear I absolutely do not think the current manuscript is duplicative of the work in these papers, but they are similar enough that it's probably worth discussing.

I prefer to sign my reviews. My name is Joshua Schraiber.

References:

Warnefors, M., & Eyre-Walker, A. (2012). A selection index for gene expression evolution and its application to the divergence between humans and chimpanzees. PLoS One, 7(4), e34935.

Latrille, T., Bastian, M., Gaboriau, T., & Salamin, N. (2024). Detecting diversifying selection for a trait from within and between-species genotypes and phenotypes. Journal of evolutionary biology, 37(12), 1538-1550.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Evan M Koch

Reviewer #2: No

Reviewer #3: Yes: Joshua G Schraiber

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

Figure resubmission:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. If there are other versions of figure files still present in your submission file inventory at resubmission, please replace them with the PACE-processed versions.

Reproducibility:

To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Decision Letter 1

Kelly A Dyer, Michael Edge

27 Jul 2025

PGENETICS-D-25-00294R1

A method for identifying local adaptation in structured populations

PLOS Genetics

Dear Dr. Goudet,

Thank you for resubmitting your manuscript to PLOS Genetics. The reviewers from the first round all agreed to re-read the manuscript and have provided their updated assessments. All three are largely satisfied and positive about eventual publication. However, there are some minor issues that require further attention in a minor revision. In a revised version, please address Reviewer 1's requests in regard to ensuring that the available code base is complete, Reviewer 1's comments about figure clarity, and Reviewer 2's request for clarification.

Please submit your revised manuscript within 30 days Aug 26 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosgenetics@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pgenetics/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below.

* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

We look forward to receiving your revised manuscript.

Kind regards,

Michael Edge

Guest Editor

PLOS Genetics

Kelly Dyer

Section Editor

PLOS Genetics

Aimée Dudley

Editor-in-Chief

PLOS Genetics

Anne Goriely

Editor-in-Chief

PLOS Genetics

Journal Requirements:

1) We have noticed that you have uploaded Supporting Information files, but you have not included a list of legends. Please add a full list of legends for your Supporting Information files after the references list.

Note: Supporting Information Legends should not be uploaded as a separate file in the online submission form. 

2) Your current Financial Disclosure states, "JG31003A_179358, 310030_215709Swiss Science National Foundationhttps://www.snf.ch/enThe funders did not play any role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript."

However, your funding information on the submission form indicates receiving no funds. Please modify your funding information.

Please ensure that the funders and grant numbers match between the Financial Disclosure field and the Funding Information tab in your submission form. Note that the funders must be provided in the same order in both places as well. 

If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

Reviewers' comments:

Reviewer's Responses to Questions

Reviewer #1: I thank the authors for their responses to my comments and for the changes made to clarify their analysis and improve the manuscript. While the authors did not investigate some of the robustness issues I brought up in my original review, the power simulations and empirical examples are probably more worthwhile additions. The only necessary additions are code and data to reproduce the empirical results. It would also be good to have power simulations of spatially-homogeneous global adaptation, but this is arguably not as interesting as local adaptation.

Code availability:

The GitHub link provides sufficient code and documentation for the generating and analysis of simulated data. I am very happy to see the empirical example where LogAV is applied to the Stickleback data. I would only add that the scripts and ideally data files used to produce Table 3 should also be added to the repository.

Population structure in simulations:

My original confusion with the models of population structure in simulations stemmed from the diagrams in figures 4 and 5 as well as the description in the first paragraph of the “Verification and Comparison” section. No lines are drawn to indicate an ancestral population. In particular, the standard island and stepping stone models of population structure are most often at equilibrium, where the structure persists indefinitely into the past. The authors have pointed out that because of how these simulations are initialized, and the fact that they are not run long enough to reach equilibrium, it is equivalent to having an ancestral population. I don’t expect it makes that much difference to the results, it would be good to explicitly state in the caption to Figure 4. I don’t see any reason not to and it would help clarify things a bit for readers who may naturally assume that such models are the equilibrium versions. I don’t think anything else is necessary here.

Selection simulations:

The addition of Figure 6 is an improvement over what was given in the previous version of the manuscript and the addition of simulations of weaker diversifying selection is welcome as well.

It would have been nice to have matching power simulations under the spatially-homogeneous global adaptation model, but I understand that studies using the method would be more interested in finding local adaptation.

Re-analysis of stickleback data:

I find the empirical example to be a great addition to the paper. This demonstrates that the method will work on real-world data even though the full software implementation is not yet available. I would only ask the authors to make these scripts available.

Reviewer #2: I am overall satisfied with the revisions made by the authors and have no further major revisions to suggest. I would recommend one minor clarification:

Regarding the possibility of estimating theta^p and M from the offspring, the authors give two different reasons for cautioning against this: (i) using all F1 genotypes will lead to inflated theta^p values (in response to my comment), and (ii) inflated theta^p values due to the added drift across generations in the common garden experiment (in the Methods section of the main text). Could they elaborate on their reasoning or provide an appropriate citation? If my intuition is correct, if we have allele frequencies from the sub-populations (e.g. from a reference set), we should be able to estimate the drift in the sub-populations, and therefore, theta^p fairly easily. Could the authors clarify?

Reviewer #3: The revision by do O et al addresses some of my main concerns. I continue to think the manuscript is a little bit terse, but I think the authors added some meat and improved it by adding some more detailed power analysis and by including an analysis of real data. Though the authors did not do any simulations under a model where phenotypes are only measured "in the wild", I think their discussion of the point is acceptable. Regarding the asks in my original review, I feel satisfied by the authors' response.

I prefer to sign my reviews. My name is Joshua Schraiber.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: No: An empirical example using stickleback trait and genetic data has been added but the data and code to generate these results are not present.

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Arslan Zaidi

Reviewer #3: Yes: Joshua Schraiber

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

Figure resubmission:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. If there are other versions of figure files still present in your submission file inventory at resubmission, please replace them with the PACE-processed versions.

Reproducibility:

To enhance the reproducibility of your results, we recommend that authors deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Decision Letter 2

Kelly A Dyer, Michael Edge

4 Sep 2025

Dear Dr Goudet,

We are pleased to inform you that your manuscript entitled "A method for identifying local adaptation in structured populations" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/ click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Michael Edge

Guest Editor

PLOS Genetics

Kelly Dyer

Section Editor

PLOS Genetics

Aimée Dudley

Editor-in-Chief

PLOS Genetics

Anne Goriely

Editor-in-Chief

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-25-00294R2

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Acceptance letter

Kelly A Dyer, Michael Edge

PGENETICS-D-25-00294R2

A method for identifying local adaptation in structured populations

Dear Dr Goudet,

We are pleased to inform you that your manuscript entitled "A method for identifying local adaptation in structured populations" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

You will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Zsofia Freund

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Explanations for S1 Fig.

    (PDF)

    pgen.1011871.s001.pdf (64.3KB, pdf)
    S2 Text. Estimation of P-values from a Bayesian posterior.

    (PDF)

    pgen.1011871.s002.pdf (93.6KB, pdf)
    S3 Text. Breeding design description.

    (PDF)

    pgen.1011871.s003.pdf (29.6KB, pdf)
    S4 Text. Properties of the S-statistic, and why it is normally distributed.

    (PDF)

    pgen.1011871.s004.pdf (95.6KB, pdf)
    S5 Text. Computer simulation details.

    (PDF)

    pgen.1011871.s005.pdf (105.4KB, pdf)
    S6 Text. Quantinemo initialization parameters.

    (PDF)

    pgen.1011871.s006.pdf (62.1KB, pdf)
    S7 Text. Type of data necessary to estimate coancestries and relatedness.

    (PDF)

    pgen.1011871.s007.pdf (84KB, pdf)
    S8 Text. Description for S3 Fig.

    (PDF)

    pgen.1011871.s008.pdf (54.7KB, pdf)
    Fig S1. Distribution of p-values of the different methods under neutrality.

    (PDF)

    pgen.1011871.s009.pdf (150.9KB, pdf)
    Fig S2. Distribution of trait values for the different scenarios.

    (PDF)

    pgen.1011871.s010.pdf (314.4KB, pdf)
    Fig S3. Quantile plots for the QSTFST tests.

    (PDF)

    pgen.1011871.s011.pdf (142.6KB, pdf)
    Fig S4. Quantile plots for Driftsel tests.

    (PDF)

    pgen.1011871.s012.pdf (117.1KB, pdf)
    Fig S5. Quantile plots for the LogAV tests.

    (PDF)

    pgen.1011871.s013.pdf (121.6KB, pdf)
    Table S1. Summary of simulation parameters for each scenario.

    (PDF)

    pgen.1011871.s014.pdf (32.4KB, pdf)
    Attachment

    Submitted filename: Response_to_reviewers_July8.pdf

    pgen.1011871.s015.pdf (163.8KB, pdf)
    Attachment

    Submitted filename: responses_to_reviewers.pdf

    pgen.1011871.s016.pdf (49.2KB, pdf)

    Data Availability Statement

    All necessary scripts and data to reproduce this study can be found at https://github.com/isadoo/MethodsLocalAdaptation_LogAV.


    Articles from PLOS Genetics are provided here courtesy of PLOS

    RESOURCES