Abstract
Convergent adaptation is the independent evolution of similar traits conferring a fitness advantage in two or more lineages. Cases of convergent adaptation inform our ideas about the ecological and molecular basis of adaptation. In judging the degree to which putative cases of convergent adaptation provide an independent replication of the process of adaptation, it is necessary to establish the degree to which the evolutionary change is unexpected under null models and to show that selection has repeatedly, independently driven these changes. Here, we discuss the issues that arise from these questions particularly for closely related populations, where gene flow and standing variation add additional layers of complexity. We outline a conceptual framework to guide intuition as to the extent to which evolutionary change represents the independent gain of information owing to selection and show that this is a measure of how surprised we should be by convergence. Additionally, we summarize the ways population and quantitative genetics and genomics may help us address questions related to convergent adaptation, as well as open new questions and avenues of research.
This article is part of the theme issue ‘Convergent evolution in the genomics era: new insights and directions’.
Keywords: convergent adaptation, population genomics, information theory, convergence, adaptation
1. Introduction
Convergence is the independent evolution of similar features in two or more lineages [1]. Examples of convergence have played a fundamental role in our understanding of adaptive evolution (see [1], for review). Classically, convergent evolution has been used as one line of evidence for the adaptive value of a trait. When convergence is the result of natural selection producing the same solution in response to repeatedly encountered biotic and abiotic pressures, we treat these as natural replicates of a similar process to understand the ecological and phenotypic bases of convergence [2] and the molecular basis of evolution [3–6].
The broad definition of convergence permits us to think of convergence occurring at multiple levels. At the broadest scale, convergence in function may occur in species facing similar ecological challenges, consistent with view that there are sometimes many phenotypic ways to achieve a similar function [7]. For example, carnivorous plants have evolved similar strategies to adapt to their low-nitrogen environments but have done so via many distinct morphological and physiological mechanisms across five orders of flowering plants [8]. Convergence may also occur on the level of a specific phenotype. Carnivorous pitcher plants have striking convergence for trap morphology [9]. When convergent phenotypes arise in highly diverged species, it has often been presumed that these changes are owing to divergent genetic mechanisms (see [5], for review). However, perhaps surprisingly, there is increasing evidence that some phenotypes often converge because of selection on similar molecular mechanisms, with selection occurring at the same gene or even the same mutational change [3,5,10]. Again within carnivorous plants, Fukushima [11] found convergent amino acid substitutions in the evolution of digestive enzymes across four species. Despite these distinctions, all levels can be encapsulated in the term convergence. Additionally, following Arendt & Reznick [12], the single term can be used to describe both parallelism and convergence, regardless of the phylogenetic scale, ancestral state, or whether the underlying genetic mechanisms are the same or different. In this opinion piece, we discuss how adaptive convergence at the level of phenotypes, genes and allele frequency change can be studied with population genomic data. In establishing a case of convergent adaptation, two main conditions must be satisfied. First, the character must be adaptive. That is, it enhances the fitness of the organisms that bear it, relative to the ancestral state [13], with many definitions additionally requiring adaptations to be the product of natural selection for their current role [14,15]. Second, the adaptive changes in the different populations must be independent from each other. Below, we expand upon these conditions and show how they might be quantified through measures of the independent work done by natural selection in different populations. Throughout this article, we focus on how population-level genomic data both forces us to grapple with the issues that arise in identifying convergent adaptation among closely related taxa and provides novel ways to approach these and related questions.
(a). Identifying adaptations
Many different lines of evidence from developmental biology to field studies must often be brought together to demonstrate that a character is an adaptation. Often, one of the first steps in assessing whether some set of convergent characters are adaptations is to establish that the changes observed are unexpected by chance (i.e. under simple null evolution models). To address this problem for convergence at macro-evolutionary scales, phylogenetic null models and methods have been developed both for discrete and continuous traits [16–21], as well as for testing the association of a character with some specific ecological or environmental variable [2,22]. Convergence changes that are not because of the vagaries of divergence or genetic drift are good candidates for adaptations. However, even if the convergence is unexpected, this does not necessarily mean that it is adaptive. For example, it could be owing to developmental constraint or releases from constraint in similar ecological environments. Therefore, it is important to provide evidence that the changes are associated with increased fitness in the relevant environments. One way of doing this is to provide evidence that directional selection repeatedly drove the convergent change. The field of population genetics has a long history of discerning phenotypic and allele frequency shifts owing to selection from those due to neutral processes. In this article, we review some of the ways these approaches can be used and extended to studying cases of convergence.
(b). Establishing independence
Showing that changes observed were truly independent is necessary to gain insight about adaptation from studies of convergence; otherwise, they may just represent a single replicate of the adaptive process. Establishing that changes were truly independent is difficult. Even with a well-resolved phylogeny, with no gene flow and nodes with sufficient temporal separation such that there is no incomplete lineage sorting (ILS), secondary loss must be distinguished between convergent evolution to show that a trait is not shared from a common ancestor [23]. Establishing independence for more closely related taxa can be even more challenging. Closely related populations may be more likely to share independently derived states that were absent or rare in their ancestor. This may occur not only through independent mutations recurrently becoming a substitution in different parts of the phylogeny, but from shared variation present in their ancestor and introgression. These latter modes are expected to be more prevalent between closely related taxa because of their higher likelihood of interbreeding and retention of ancestral polymorphism.
The sharing of traits or alleles incongruent with the population or species tree owing to ancestral variation (i.e. ILS shown in figure 1a) has been termed hemiplasy (to contrast it with homoplasy [23, 24]). This term has also been more generally applied to alleles shared by gene flow and with some authors seeing it as an alternative explanation to convergence [25–27]. While these approaches to assess discordance between gene trees and species trees that lead to spurious inference of convergence are important, we argue that the sharing of an allele owing to ILS or introgression does not invalidate the case for convergent adaptation. Rather, these questions are focused on identifying the level of independence. When convergence at the level of the same molecular substitution between distant species is observed, we can usually assume that this was owing to multiple, independent mutations (i.e. convergent mutations). However, the key question is whether selection has independently increased the allele to fixation multiple times (i.e. there has been convergent substitutions). Taxa sharing the same substitution owing to ILS or gene flow means there has not been independence at the level of the mutational change; however, the allele frequency change at the locus can still be independent, that is convergent, across populations. For example, following populations 1 and 3 in figure 1a, even if an allele was shared owing to ILS, selection may have repeatedly driven up the allele in each population separately.
In highlighting levels of independence, it is not our intention to add more vocabulary to a perhaps already complex area. Instead, we aim to make clear the questions we seek to address when studying convergence at various levels. We, in particular, are interested in dissecting these levels of independence in cases of convergence at the same gene [28], to gain insights about adaptation into how a particular variant or haplotype offers a selective benefit. For example, seeing different mutations is informative about the mutational target for adaptation. Yet, the repeated use of standing variation, especially if those alleles are very rare, still informs us about the mutational target size. An example of both instances exist in marine populations of threespine stickleback that have repeatedly, independently have adapted to freshwater environments via changes at the EDA and PITX1 genes. The fact that many independent mutations at PITX1 underlie adaptation [29,30] suggests that this state is relatively accessible to the population by mutation or that other possible mutations may be too constrained by pleiotropy. Conversely, the EDA haplotype, which had been standing and reused many times by adaptation in freshwater sticklebacks [31], is suggestive that populations cannot access a similar, relatively low pleiotropy state more rapidly by mutation. The frequency of these different modes of convergence speaks to the extent to which populations are mutation-limited and to the role of standing variation and gene flow in adaptation [32]. Thus, regardless of the precise terminology we use, it is important to keep clear statements about the level of independence and the role selection has played in these changes. In this article, we introduce a conceptual framework to quantify the degree to which allele frequency shifts have been independent. We then review current approaches in population and quantitative genetics and genomics and how those can be extended to include more explicit tests for independence.
2. Quantifying the independence of convergent adaptation, the accumulation of information, or when to be surprised
One conceptual way forward in quantifying convergent adaptation may be to assess the degree to which selection has independently driven phenotypic and allele frequency change. Quantifying the work done by selection among taxa can help to guide intuition as to when to be impressed by convergence. One reason why adaptive convergence among lineages that are phylogenetically distant is surprising is that these are clear examples of selection independently, repeatedly reaching similar states that would be unlikely under chance or drift alone. One measure of this is the extent to which selective deaths (or births) underlying an adaptation in two or more populations are independent. For example, if light coloured pelts are an adaptation to artic environments, we have high confidence in calling the light coloured pelts of arctic foxes and hares convergent adaptations because the individuals who died owing to predation pressure, that drove the adaptation via allele frequency change, were clearly different sets of individuals (being foxes and hares, respectively).
More formally, we can calculate the number of selective deaths needed to generate a given allele frequency change. In a population of size N, the selective deaths imposed under a deterministic, directional selection model where the fittest homozygote has selection coefficient s, move an additive allele from frequency x1 to x2 in t1 to t2 generations is N L where L is
2.1 |
as originally derived by Haldane [33] in his discussion of the substitution load. (We return to the issues with Haldane’s load argument below.) We will focus on fixed alleles such that
2.2 |
This measure of the work done in fixing an allele owing to selection does not depend on the strength of selection because it takes a longer time to fix a more weakly selected allele than a strongly selected allele so the same total number of selective deaths are incurred. Additionally, most of the work done by selection to increase fitness occurs when the allele is very rare in the population. When the allele becomes common in the population, the majority of individuals carry the allele so fewer individual deaths are owing to differences in the genotype at this locus. Under this measure of work done by selection, we should be much more impressed by selection that repeatedly moves an allele from 10−4 to 50% than 50% to 100%, despite being comparable frequency changes and taking roughly the same time under an additive model.
We can extend this idea to obtain a metric of the proportion of the work done by selection that is shared between populations that both have experienced allele frequency change at the same locus. Here, we apply this to two cases shown in figures 1b,c. We first study the case that a selected allele has increased in frequency in the ancestral population from x1A to x2A, and following a population split increased from x2A to fixation in both of the daughter populations (figure 1b). Here, the work done by selection in the ancestral population (LA, i.e. that shared between our populations), the work independently done in the two daughter populations (LD), and their ratio can be written as
2.3 |
2.4 |
2.5 |
Thus, if the allele selectively increased from a low to intermediate frequency in the ancestral population before the split (x1A ≪ x2A), much of the work done by selection is shared between the daughter populations. In the absence of other information, it is parsimonious to assume that 100% of the selective deaths are shared between the two sister populations and that the allele fixed in the ancestral population (x2A = 1). This agrees with our intuition and ideas of phylogenetic parsimony that, in the absence of other evidence, all of the change occurred only once in the ancestral population. That is, we should not infer the repeated work of selection where it is unnecessary. This measure also emphasizes that we should be much more impressed by repeated adaptation from very rare standing variation than common variation. Similarly, in the ILS example shown in figure 1a, in the absence of other information it is parsimonious to assume that the allele fixed in populations 1 and 3 was relatively common in the ancestral population; so, its repeated fixation by drift was not unlikely. Only in the case where we have other information, for example that neutral ILS is very rare given the timespan involved, do we have some evidence that convergence is unusual.
As a final scenario, we consider the case when a selective allele increases in frequency in one population (i) owing to selection before being introduced into a second population (ii) by gene flow and fixing in the second population (figure 1c). For simplification, we assume a single pulse of admixture that introduces the allele into the second population at frequency p. The work done by selection in populations 1 and 2 are L1 = −2log (x1) and L2 = −2log (p), respectively, for a total load of LT = −2 (log (x1) + log (p)). Therefore, the proportion of selective deaths that are shared is
2.6 |
Because most of the selective deaths involved in fixing an allele occurs when the allele is rare, they are shared between populations unless the frequency that the allele is introduced into population 2 is small (i.e. p close to x1). For example, if the allele fixed from an initial frequency of 1 in 1000 in population 1 and admixture into population 2 initially pushed the allele to 10% frequency, a large portion, 75%, of the work done by selection is shared.
We can also relate this measure to the amount of information gained independently in different populations. As noted by Kimura [34], the load involved in fixing a single substitution (equation (2.2)) is twice the log of the probability of a neutral allele fixing from x1 and so is a measure of the information accumulated by selection as compared to a neutral model (in an information theoretic sense, [35,36]). We explain this connection in appendix Aa and how it follows from the fact that equation (2.2) can be seen as the average log-likelihood ratio of the probability of an allele fixing under the selective model compared to the neutral model. This information gain under a model of selection compared to neutrality is a measure of the unlikeliness of the state of the system encoded by selection as compared to a neutral model. Extending this argument in appendix Aa, we show that the decomposition of the shared and independent work done by selection through selective deaths, e.g. equations (2.3)–(2.6), corresponds to the shared and independent gain of information owing to selection. Specifically, we can express the additional work done by selection in a second population in terms of how improbable the allele frequency change was in a second population under a model of genetic drift, given the change already achieved in the first or ancestral population. We also show how this measure of convergence, as the independent work performed by selection, can also be expressed as the statistical ‘surprise’, under a neutral model, as observing what has been achieved in a second population conditional on what happened in a first or ancestral population. While we have considered just two populations here, we can extend this framework to larger numbers of populations. However, the main insight, that we want to see repeated large allele frequency changes from initially rare alleles, is clear from just the two population cases.
This framework may be useful in aiding our intuition about the degrees to which selective events have been independent and so can be considered to be convergent. At the macro-evolutionary scale, with no gene flow or ILS, convergent substitutions in different taxa can be thought of as completely independent allele frequency changes. Thus, seeing a specific, selection-driven substitution twice offers double the information gain by selection. It seems natural to then think of simply summing up the information gained over multiple substitutions across loci. However, one issue with this approach is that we cannot simply sum the work done by selection (i.e. selective deaths) across multiple loci, unless the loci contribute multiplicatively to fitness. The same information content can be accumulated for far fewer deaths under negative fitness epistasis (e.g. most efficiently owing to truncation selection). This argument forms the basis of the rejection of Haldane’s substitution load argument, and Kimura’s gain of information extension, as a limit on the rate of evolution [37–39]. Combining information over loci additively, that is assuming multiplicative epistasis, seems a fine first approximation, but this limitation should be borne in mind. However, pushing beyond that limitation the information gained through selection is a more general concept of a statistical measure of the work done by selection in pushing evolution towards highly improbable states. A number of authors have explored the idea of statistical information as a measure of work done by selection and more broadly the role of selection in decreasing the entropy of states including in the presence of different forms of epistasis [40–44]. These ideas extend naturally to quantitative traits [40,41,45] as well as substitution models from molecular evolution [41]. The null models that we judge our gain of information owing to selection against can also include mutation and mutational biases. Therefore, convergent adaptation, at the genetic or phenotypic level, can potentially be conceptualized in this way as the independent gain of information owing to selection.
3. Using population genetics to identify convergent adaptation
Population and quantitative genetics and genomics can provide evidence for convergent adaptation both by helping to establish the degree to which evolutionary changes among populations are independent and by providing evidence that, historically, individuals with a given allele had enhanced fitness relative to the ancestral state. In this section, we review how single locus and quantitative trait tests, and haplotypic or linked variation data, can be used to test the conditions of convergence.
(a). Single locus and quantitative trait tests
In evolutionary studies of characters across species, comparative methods [2,46] make use of the phylogeny to judge whether the sharing of a character among taxa is expected under some null model (e.g. independent evolution down a phylogeny) and whether characters are correlated with some ecological variable beyond null expectations (see [22], for a recent review). For example, in phylogenetic mixed models, the phenotypes of a set of species is regressed on some environmental variable as a fixed effect and divergence along the phylogeny is incorporated as a random effect, with the covariance structure of the random effect being specified by the matrix of shared branch lengths [47,48]. Comparative methods provide a series of tools for evaluating convergence for discrete and continuous characters and use convergence to gain independent information to test evolutionary hypotheses.
There is an equivalent body of work in population genetics which involves shorter evolutionary time scales. In answering these questions on short time scales, one equivalent to the phylogeny is the relatedness or kinship matrix [49,50]. There are different ways of defining this matrix, but broadly the elements of this matrix specify the amount of sharing of alleles between two individuals or populations, above that expected in a reference population (often some hypothetical ancestral population or the total sample), owing to shared coancestry and genetic drift [51]. This is conceptually similar to how a species phylogeny specifies the shared branch lengths for pairs of species back to their common ancestors and hence the degree of shared ancestry relative to the most recent common ancestor of all the samples. However, using the relatedness matrix is a somewhat different statement from the importance of species phylogeny in judging trait similarity because introgression and ILS increase the chance that alleles and traits can be discordant with a species phylogeny [25–27]. This issue is well accommodated for by using the relatedness matrix which naturally accounts for allele sharing that is discordant with the species tree. An additional benefit is that models based on the relatedness matrix can be used in situations where population relationships are not well described by a simple branching phylogeny, for example, isolation by distance, which is frequently encountered when dealing with populations separated by short timespans.
Tests of selection involving an empirical relatedness matrix have become more common with the increase in data from large numbers of closely related populations. For example, the association between allele frequencies at a locus and an ecological variable can be assessed by using the relatedness matrix to model the null expectations of populations that vary and covary in their deviations owing to genetic drift [52–54]. A similar body of work tests for non-neutral population differentiation while accounting for population structure by using the relatedness matrix or the eigen-vectors (principal components) of this matrix [55–58]. By incorporating the relatedness matrix as a null model into population-genetic tests, shared population history and gene flow is naturally accommodated for in assessing environmental correlations or non-neutral allele frequency divergence. While these tests use the independent evidence of allele frequencies across populations to test for deviations away from a null model, they do not generally test explicitly for adaptive convergence. This is because they are not usually used to take the extra step of determining whether similar patterns of non-neutral allele change in distinct subsets of the species range are, in fact, independent or can be explained by selection-driven allele frequency change in one region followed by gene flow and drift spreading the change to other regions. However, researchers do already informally test for convergence of allele frequency change across species ranges, for instance in studying parallel selective clines on multiple continents or at smaller geographical scales. (See, for example, the work on Drosophila genomic clines of [59–61].) One possibility to formalize a test of convergence is to condition on the observed allele frequency change in one subset of the species range, using the relatedness matrix, while testing for non-neutral allele frequency change elsewhere. Additionally, frameworks that use admixture graphs, parameterized versions of the relatedness matrix [62], to determine whether multiple instances of selection on distinct branches are being developed to examine allele frequency change across populations [63]. Therefore, we are now in position to move towards formally testing for non-neutral convergence in allele frequency changes among populations.
In studying phenotypes, we can also address similar questions across closely related populations. Quantitative geneticists have long asked whether common-garden phenotypes are too variable across populations compared to the allele frequency differentiation (often using QST – FST comparisons; see [64,65] for reviews). These tests were originally restricted to relatively simple models of population structure. Recently, there has been a move to using kinship matrices to test for non-neutral divergence in phenotypes or polygenic scores across populations, allowing shared neutral trait divergence owing to shared population history and gene flow to be more fully accounted for [66–70]. These tests are predicated on the idea that the additive genetic phenotypic variation among populations is the sum of allele frequency changes at many loci. Therefore, its neutral properties are well captured by the relatedness matrix in a conceptually similar way to how Brownian-motion models capture divergence owing to phenotypic effects of many small substitutions on a phylogeny. These relatedness-based models can be used to ask whether there is support for phenotypes covarying with environments more than expected under models of genetic drift [66,67,69] and to assess non-neutral divergence along different axes of population structure. These approaches have been used to quantify the degree to which non-neutral genetic phenotypic change among populations is independent versus a consequence of shared history. For example, assessing how unexpected the phenotypes or polygenic scores of one set of populations are, conditional on another distinct subset [66,67]. These ideas have also been extended into explicit tests of non-neutral phenotypic or polygenic score change on admixture graphs [71] that could be used to detect repeated changes among populations.
In conclusion, methods to assess non-neutral, repeated evolutionary change across many populations are rapidly being developed. These developments mirror many of aspects of phylogenetic comparative methods, but extend them to situations where population history is not well represented by a single tree. In moving forward, we need to more explicitly test for convergence and incorporate more intuitive measures of when this convergence is surprising.
(b). Linked variation
Beyond single-locus tests or tests at the level of the phenotype, knowledge of the population genetic variation at a set of linked sites helps address questions related to convergence that may not be possible otherwise. As discussed above, convergent evolution at the level of a gene or allele is most surprising when selection repeatedly and independently pulls alleles up from rare frequency. By comparing haplotype patterns across populations, we can hope to build evidence towards demonstrating this as well as establishing independence to identify convergence.
One common signal leveraged in population genomic scans for selection is the hitchhiking effect. The rapid increase of a rare, beneficial allele gives little time for mutation and recombination to introduce genetic variation on the haplotypic background of the allele, resulting in a decrease in allelic diversity in this region [72,73]. Thus, a strong signal of a selective sweep at a locus is evidence that selection has led to a gain of information in the population. With multiple populations exposed to some novel selection pressure, linked variation and haplotype patterns not just within but among populations can be informative. Strong allele frequency differentiation among populations can be used as additional evidence of selection which can be linked to the specific selection pressure driving the change. However, seeing the signal of a selective sweep across several populations does not ensure that selection was independent. For example, there may have been recent selection for the trait once in the ancestor of the present-day populations (i.e. the scenario in figure 1b where the sweep went to fixation fully in the ancestor or the allele frequency change from x2A to fixation was owing to drift). Additionally, there may have been selection in a single population and sufficient gene flow and genetic drift to increase the frequency of the selected haplotype with no additional selection in the recipient populations (i.e. figure 1c where now the increase in the recipient population was not owing to selection in population 2 but only migration pressure and drift).
Population genomic data helps dissect whether sweeps across multiple populations are indeed true examples of convergent adaptation. If, for example, populations adapt from different or independent mutations within the same gene or different genes that arose and swept, little overlap is expected in the haplotypes among these populations (figure 2). This builds support for the case that adaptation was probably independent and that information gain owing to selection was compounded across populations. Convergent adaptation owing to selection on the same mutation, i.e. independent adaptive allele frequency change on the same variant (independent substitutions but not independent mutations), may be also distinguishable from cases where most of the change is owing to shared selection. In [28], we developed an approach to distinguish between the ways convergent adaptation can arise using neutral allele frequency data from sites linked to beneficial loci. Specifically, we derived coancestry coefficients (terms proportional to the relatedness and kinship coefficients) both within and between populations experiencing the same selective pressure based on coalescent probabilities. We then used these terms to calculate the composite likelihoods of observing a set of neutral allele frequencies given a specific scenario of selection and selected allele sharing. Previously, we aimed to distinguish among convergent adaptation owing to convergent mutations, independent selection on shared ancestral variation, and independent selection on an introgressed variant, as well as a neutral model with no selection. In appendix Ab, we derive similar coancestry coefficients for the scenario where there has been a complete sweep in the ancestral population to exemplify how this approach can be extended to model scenarios that are not truly convergent adaptation in hopes to distinguish between these signals as well. Other authors have explored models and developed methods for sweeps from standing variation [74–76], sweeps from selected alleles spread by migration or introgression [77–80], and ancestral sweeps [81]. We hope that our approach, which combines insights from many of these models into a single framework, can aid investigators in identifying, and distinguishing among models of, convergent adaptation. Additionally, since our approach does not assume an underlying population tree, this may be particularly useful for closely related populations that may have short branching times and recent or ongoing gene flow.
Now, to build intuition, we briefly summarize the types of patterns we expect to observe under various scenarios that result in an adaptive allele being fixed in two populations, as illustrated in figure 2b. We focus on four cases: (i) independent selection on independent mutations for a beneficial allele at the same locus, (ii) independent selection on an allele that was standing in the ancestor of the selected populations and independently in the daughter populations for a moderate amount of time, (iii) introgression followed by independent selection, and (iv) a complete sweep in the ancestral population of the two adapted populations prior to their splitting. The first three cases fall under our definition of convergent adaptation while the last would not because of its lack of independence or replication. For each scenario, we focus on three signals: the haplotype patterns in our two adapted populations (figure 2c), the gene tree at the beneficial allele for the two adapted populations plus an outgroup (figure 2c), and the coancestry coefficients that our approach uses, which are another statement about the haplotypic similarity within and between populations, as a function of recombination distance from the locus experiencing selection (figure 2d). The latter two signals shown were generated from simulations as outlined briefly in appendix Ab.
The difference between the independent mutations scenario and the others is perhaps the most striking. Here, the beneficial allele arose on different haplotypes in the two adapted populations and, after fixation, we observe high levels of similarity within the two populations, but none between. This sharing within a population that experienced selection is owing to the fact that variants linking the beneficial allele also increased in frequency. As we move further away from the selected locus, recombination is more likely to occur and this association breaks down. This is shown in the coancestry coefficients as a function of recombination distance from the selected site. In this case, there is an increase in coancestry within a selected population but no increase in coancestry (beyond neutral, shared population history) between the populations. We can also see this in the tree generated at the selected site. Here, all lineages within a population coalesce rapidly, while between them, the lineages do find a common ancestor faster than on the time-scale of typical, neutral lineages.
The other three scenarios have a key similarity. Now, there is some haplotypic similarity in both adapted populations because the beneficial allele has a single, shared origin. Still, there are features that differentiate these scenarios that may be useful to distinguish them. In the case that selection was on a variant that was standing in the ancestor of the adapted populations, there is a region of a shared, core haplotype between populations and, as a result, increased coancestry between the populations. However, we observe a faster decay in coancestry with increasing recombination distance between the populations than within a population. This is owing to the fact that while the allele was standing independently in both populations prior to the onset of selection, recombination broke down the original haplotype on which the selected allele arose. Seeing a strong sweep within populations, but only a partially shared haplotype among the populations, is evidence that selection has independently dragged up the same variant from low frequency. By contrast, the final two scenarios have higher amounts of sharing both within and between the adapted populations. In the introgression case, we expect little to no differentiation in the haplotypes between the populations if migration occurs during the sweep. The gene tree depicts that all lineages with the beneficial allele coalesce rapidly at the selected site, regardless of which population they were sampled from. In the coancestry coefficients, there is now a slow decay in the within-population coancestries and a similar decay between populations. The final scenario, which does not represent convergent adaptation but rather a single, ancestral sweep, looks very similar to introgression because there is a lot of sharing in haplotypes within and between populations. The key difference is, if selection occurred sufficiently far in the past, we will observe new mutations independently arising in the adapted populations (indicated by the pink bars on the haplotypes). Specifically, in the tree, we still observe all lineages at the selected site coalesce at the same time in the past, but now there are branches leading to our present-day lineages on which new mutations have occurred. Thus, to demonstrate that the beneficial allele swept in multiple populations, the between-population coalescent times should be much more recent than those in the rest of the genome. Observing very recent coalescence (i.e. short branch lengths) between and among populations in the swept region would allow us to rule out that the sweep is shared ancestrally or spread by neutral migration pressure. Such sweep patterns, that are independent among multiple populations, can provide evidence of adaptive convergence and the independent gain of information owing to selection.
4. Discussion
We emphasize the importance in carefully defining the questions we are trying to address in studying convergent adaptation. Convergent evolution has a wide range of overlapping definitions; it is obviously fine if our discussion does not line up with your definition (indeed one of the authors has varied in his definitions, [82,83]). However, we assert that when there has been convergent substitutions at a locus among taxa, it is of interest that selection favoured their increase in frequency multiple times, no matter their origin.
There is a growing body of theory on when we should expect various modes of convergent adaptation among distantly related populations (owing to independent mutations [84,85]) and among closely related populations (owing to independent mutations, standing variation, and the spread of alleles by migration [82,83,86–89]). Dissecting whether the same or different mutations gave rise to the adaptive alleles will help us gain insights into questions related to mutational target size or epistatic constraints, but these questions can be somewhat orthogonal to more basic questions about adaptation. As we learn more about repeated adaptation across populations, the need to differentiate convergent mutations from convergent allele frequency change will increase. For example, human populations have repeatedly adapted to different environments through changes in skin pigmentation. This convergent adaptation on the level of the phenotype comes from a mixture of selection on old standing variation, both derived and ancestral variants, and recent mutations [90–92]. In the case of repeated evolution probably involving highly polygenic traits (such as human height and the convergent evolution of small body size; [93,94]), populations may be convergently adapting via very similar shared sets of alleles. This may reduce the independent information we can hope to glean about the genetics of height, but establishing that the phenotypic change occurred convergently is still important.
We demonstrated a potential way to quantify the impressiveness of various instances of convergent adaptation. One fair question is whether expressing these ideas in terms of information gained by selection and surprise add something new to the discussion. We ourselves are still a little unsure. After all, log-likelihood ratios are regularly computed as a means of summarizing support for various selection models over the null model. However, usually the log-likelihood is viewed as a test statistic that should be compared to some null distribution, to see if the null model can be rejected. Indeed, we often tell ourselves that we should not be too invested in the value of log-likelihood ratio as it is influenced by our sample size, etc. However, in calculating log-likelihoods of a particular allele being fixed in different populations, the likelihood ratio is a statement about the outcome of an evolutionary process and its replications across populations. Therefore, thinking about the log-likelihood of outcomes across populations as representations of the information gained by selection, at least under some models, focuses us on the view that natural selection is a force that drives the origination of the improbable, and in the case of convergent adaptation, its repeated origination. More work is needed to demonstrate that this is indeed useful and how it could be practically applied, but the idea itself may have merit.
In this article, we focus on how population-level genomic data, specifically from closely related taxa, forces us to recognize new challenges in identifying convergent adaptation while also enabling us to address these issues in new ways. Additionally, the sharing of characters among closely related taxa brings its own set of opportunities. Repeated adaptation among recently diverged populations potentially allows us to use population genomics, genetic crosses, and association studies to map genotype to phenotype, as well as the option to perform experimental manipulations in replicated populations across relatively uniform genetic backgrounds. Population genetic tools can be useful in helping to assess the two criteria necessary to have convergent adaptation. There are many existing approaches, that often nicely parallel those from phylogenetics, to test that changes we observe are unusual, relative to some null model. More work is needed to fully flesh out the potential of these methods, but these tests can be incorporated into existing frameworks allowing the field to move forward with more explicit tests for convergent adaptation. We are excited by what new data and tools can help us learn about convergence. Still, it is important to emphasize that cases which meet these two criteria may help in identifying loci and traits that may be interesting to investigate further, but they might not be convergent adaptations on the phenotypic level. For example, two loci may have swept the same haplotype to fixation, we cannot be certain that the haplotypes, while sharing some alleles, do not also harbour different alleles conferring different phenotypic effects on fitness. Therefore, the hard work of linking the genotype and phenotype to fitness will remain an obstacle in describing convergent adaptation. However, the growing potential for identifying new cases of convergence in closely related populations offers a rich seam for these efforts.
Acknowledgements
We wish to thank Nick Barton, Jeremy Berg, Doc Edge, Emily Josephs, Matt Osmond, and members of the Coop laboratory for helpful discussions and feedback. We would also like to apologize to authors whose work we could not cite on this broad topic owing to limited space.
Appendix A
(a) Information
Kimura [34] argued that Haldane’s [33] substitution load for a single selected allele is also related to the accumulation of genetic information. In this appendix, we discuss ideas about convergent allele frequency change in terms of information theory.
The information gain of the probability distribution p( ) over q( ) is defined as
A 1 |
This is the log-likelihood of x averaged over the probability distribution p( ). It measures how much more concentrated p( ) is on particular states in x compared to q( ). It is also called the Kullback–Leibler divergence, or relative entropy of the first model to the second model.
We can write the gain of information under our selection model compared to a neutral model as
A 2 |
i.e. the log-likelihood ratio under our selected compared to our neutral model averaged over outcomes of our selected model.
To see Kimura’s [34] argument, imagine an allele starting at frequency x1 in the population. If it were neutral, it would fix in the population with probability x1. If it is selected, it will fix with probability 1 (assuming x1 ≫ 1/(2Nes), where Ne is the effective population size). Then, the gain of information under the selection model compared to the neutral model is
A 3 |
as originally outlined by Kimura [34].
We can consider the gain of information in moving from frequency x1 to x2A, as in our population split case (figure 1b). Under genetic drift, the probability that a neutral allele starting from frequency x1A ≤ x2A reaches frequency x2A before loss is (x1/x2). An advantageous, additive allele achieves this change deterministically (again assuming (x1A ≫ (1/2Nes)). Therefore, we can write the gain of information of moving from frequency x1A to x2A under our selection model to our neutral model as
A 4 |
This shows that our selective deaths in achieving this frequency change (equation (2.3)) in the ancestral population may also be thought of as a selective gain in information. (Note in expressing things this way there is an ambiguity in that we state that the frequency at the moment the populations split is x2A. Therefore, an alternative formulation of the neutral probability may be the transitory stationary distribution at x2A of the neutral diffusion starting from frequency x1A. A different approach that would give us equation (A 4) is to think about the difference in the gain of information owing to selection in the system when going from frequency x1 → 1 compared to going from x2 → 1.)
Thus, we can decompose the total information gain under the selection model of going from frequency x1A to x2A and then from x2A to fixation into LD + LA. Note that in doing this we have not specified the time it took selection to achieve this gain. If we have knowledge of the timing, this could be incorporated by using the allele-frequency transition probabilities for the neutral and selection model in equation (A 2). (Note that incorporating the timing would lead to the information gain to depend on the strength of selection.) Using transition probabilities would also allow us to incorporate genetic drift into the selection model, which would allow the amount of work done by selection to be averaged over likely values of the allele frequency change owing to the combined actions of genetic drift and selection. Such considerations would need more work if this were to be useful in a practical setting.
Similarly, the total information accumulated in fixing the allele in populations 1 and 2 in our admixture example (figure 1c) is
A 5 |
Therefore, one measure of adaptive convergence may be the additional information gained by selection independently in different populations. In the context of information theory, equations (2.5) and (2.6) are called the coefficient of constraint which is a measure of how well we can predict the information in population 2 from that given to us by population 1.
Surprise! We can also formulate our results in terms of gain of information as the ‘surprise’ at our allelic state across populations. In information theory, the negative log probability of an outcome under a particular model is called the surprise (or the self information). The lower the probability of a particular outcome under a model, the greater the surprise when we see that outcome. Another good feature, following from that of log probabilities, is that surprise is additive so you are doubly surprised if you see the same rare event twice independently.
As everything is done under a effective deterministic selection model (i.e. starting from a frequency ≫ 1/(2Nes)), all of our statements about information gain only include the log probabilities under the neutral model. Therefore, they are the surprise at the outcome under the neutral model. Our measure of the work done by neutral probabilities can be decomposed into a product of conditional neutral, probabilities, e.g. equation (A 5) and so a sum of log probabilities. This has the nice feature that we can ask how surprised we are at the convergence in a second population given a change in the first population.
(b) Coancestry coefficients
Knowledge of variation linked to a selected site may enable us to address questions of convergence that would not be possible if we only observed the state of the selected populations. If we observe sister populations sharing a selective event, we can ask if the selection was independent or if the selective event occurred in their ancestor. If the selected event is shared and the sweep is recent, we might expect the regions around the selected site to be more similar than if the sweeps happened independently and are truly convergent. We can formalize this in terms of coancestry between populations.
We define coancestry coefficient fij as the probability that a pair of alleles sampled from populations i and i coalesce before the ancestral population of all sampled populations (see [28], for more details). Thinking about the populations in figure 1b, we can define the coancestry owing to both neutral processes and selection for populations 1 and 2 as where f12 alone specifies the coancestry between the populations owing to drift and admixture. If the selective events occurred independently from a new mutation of the beneficial allele in both populations, the coancestry between them at loci near the selected allele is simply what we expect under neutrality, f12.
The coancestry will increase if selection is on the same ancestral standing variant present in the ancestor of the populations, and is a function of the frequency of the standing variant, g (i.e. x2A in figure 1b) and the amount of time, t, once populations 1 and 2 have split from their ancestor before selection started. Assuming the standing variant was previously neutral or was maintained in the population at some low frequency by balancing selection,
A 6 |
where y2 represents the probability of both linked lineages failing to recombine off the beneficial allele. This can be approximated as where ts is the duration of the sweep phase. This increase in coancestry, that decays with distance from the selected site, is owing to the fact the region around the beneficial allele looks more similar if the variant is shared. However, as the amount of time the beneficial allele is standing independently in the sister populations before selection occurs increases, we expect this similarity to decrease as recombination is occurring independently in the populations. See [28] for full derivation.
If the selective event is shared completely (i.e. the sweep occurs in the ancestor of populations 1 and 2 before they split), there is a further increase in coancestry. In this case, the fraction of shared deaths is 1:
A 7 |
Even if the selective event is partially shared such that selection occurs in the ancestral population (for time t1) and continues independently in the daughter populations (for time t2), this takes the same form as (A 8) if t1 + t2 = ts:
A 8 |
Therefore, it may not be possible to distinguish cases where the fraction of shared death is greater than zero such that the selective event is shared completely or partially. However, it may be possible to detect if truly convergent, i.e. there is no overlap in selected deaths.
Simulation details. We performed coalescent simulations under the four scenarios outlined above using mssel, a modified version of ms [95] that allows for the incorporation of selection at single site and stochastically generated allele frequency trajectories. See [28] for more details. These simulations were used to generate both coancestry coefficients averaged over 100 simulations and the gene trees. For all, we specified a 5% reduction in fitness for heterozygotes not carrying the beneficial allele and assumed a model of additivity. For the ancestral sweep model, we simulated a sweep that finished 0.1 coalescent generations in the past. For the standing variant model, we specified that the variant was standing at 1% frequency for 0.075 coalescent generations after the populations split. Additionally, for the migration model, we allowed for a fraction of 1 × 10−4 migrants per generation over the duration of the sweep from the source population into the recipient population.
Data accessibility
This article has no additional data.
Competing interests
We declare we have no competing interests.
Funding
This work was supported by the National Science Foundation Graduate Research Fellowship awarded to K.M.L. (grant no. 1148897) and by grants from the National Science Foundation (no. 1353380) and the National Institute of General Medical Sciences of the National Institutes of Health (grant no. NIH R01 GM108779) awarded to G.C.
References
- 1.Losos JB. 2011. Convergence, adaptation, and constraint. Evolution 65, 1827–1840. ( 10.1111/evo.2011.65.issue-7) [DOI] [PubMed] [Google Scholar]
- 2.Harvey PH, Pagel MD. 1991. The comparative method in evolutionary biology, p. 239 Oxford, UK: Oxford University Press. [Google Scholar]
- 3.Martin A, Orgogozo V. 2013. The loci of repeated evolution: a catalog of genetic hotspots of phenotypic variation. Evolution 67, 1235–1250. [DOI] [PubMed] [Google Scholar]
- 4.Rosenblum EB, Parent CE, Brandt EE. 2014. The molecular basis of phenotypic convergence. Annu. Rev. Ecol. Evol. Syst. 45, 203–226. ( 10.1146/annurev-ecolsys-120213-091851) [DOI] [Google Scholar]
- 5.Stern DL. 2013. The genetic causes of convergent evolution. Nat. Rev. Genet. 14, 751 ( 10.1038/nrg3483) [DOI] [PubMed] [Google Scholar]
- 6.Storz JF. 2016. Causes of molecular convergence and parallelism in protein evolution. Nat. Rev. Genet. 17, 239 ( 10.1038/nrg.2016.11) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wainwright PC. 2007. Functional versus morphological diversity in macroevolution. Annu. Rev. Ecol. Evol. Syst. 38, 381–401. ( 10.1146/annurev.ecolsys.38.091206.095706) [DOI] [Google Scholar]
- 8.Givnish TJ. 2015. New evidence on the origin of carnivorous plants. Proc. Natl Acad. Sci. USA 112, 10–11. ( 10.1073/pnas.1422278112) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Thorogood CJ, Bauer U, Hiscock SJ. 2018. Convergent and divergent evolution in carnivorous pitcher plant traps. New Phytol. 217, 1035–1041. ( 10.1111/nph.14879) [DOI] [PubMed] [Google Scholar]
- 10.Christin P-A, Weinreich DM, Besnard G. 2010. Causes and evolutionary significance of genetic convergence. Trends Genet. 26, 400–405. ( 10.1016/j.tig.2010.06.005) [DOI] [PubMed] [Google Scholar]
- 11.Fukushima K. 2017. Genome of the pitcher plant cephalotus reveals genetic changes associated with carnivory. Nat. Ecol. Evol. 1, 59 ( 10.1038/s41559-016-0059) [DOI] [PubMed] [Google Scholar]
- 12.Arendt J, Reznick D. 2008. Convergence and parallelism reconsidered: what have we learned about the genetics of adaptation? Trends Ecol. Evol. 23, 26–32. ( 10.1016/j.tree.2007.09.011) [DOI] [PubMed] [Google Scholar]
- 13.Futuyma D. 2013. Evolution. Sunderland, MA: Sinauer. [Google Scholar]
- 14.Baum DA, Larson A. 1991. Adaptation reviewed: a phylogenetic methodology for studying character macroevolution. Syst. Biol. 40, 1–18. ( 10.1093/sysbio/40.1.1) [DOI] [Google Scholar]
- 15.Gould SJ, Vrba ES. 1982. Exaptation–a missing term in the science of form. Paleobiology 8, 4–15. ( 10.1017/S0094837300004310) [DOI] [Google Scholar]
- 16.Castoe TA, de Koning AJ, Kim H-M, Gu W, Noonan BP, Naylor G, Jiang ZJ, Parkinson CL, Pollock DD. 2009. Evidence for an ancient adaptive episode of convergent molecular evolution. Proc. Natl Acad. Sci. USA 106, 8986–8991. ( 10.1073/pnas.0900233106) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mahler DL, Ingram T, Revell LJ, Losos JB. 2013. Exceptional convergence on the macroevolutionary landscape in island lizard radiations. Science 341, 292–295. ( 10.1126/science.1232392) [DOI] [PubMed] [Google Scholar]
- 18.Stayton CT. 2008. Is convergence surprising? an examination of the frequency of convergence in simulated datasets. J. Theor. Biol. 252, 1–14. ( 10.1016/j.jtbi.2008.01.008) [DOI] [PubMed] [Google Scholar]
- 19.Thomas GW, Hahn MW. 2015. Determining the null model for detecting adaptive convergence from genomic data: a case study using echolocating mammals. Mol. Biol. Evol. 32, 1232–1236. ( 10.1093/molbev/msv013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Yeaman S, Gerstein AC, Hodgins KA, Whitlock MC. 2018. Quantifying how constraints limit the diversity of viable routes to adaptation. PLoS Genet. 14, 1–25. ( 10.1371/journal.pgen.1007717) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Zhang J, Kumar S. 1997. Detection of convergent and parallel evolution at the amino acid sequence level. Mol. Biol. Evol. 14, 527–536. ( 10.1093/oxfordjournals.molbev.a025789) [DOI] [PubMed] [Google Scholar]
- 22.Harmon LJ. 2018 Phylogenetic comparative methods: learning from trees. Self published under a CC-BY-4.0 license. See https://lukejharmon.github.io/pcm/ .
- 23.Wake DB, Wake MH, Specht CD. 2011. Homoplasy: from detecting pattern to determining process and mechanism of evolution. Science 331, 1032–1035. ( 10.1126/science.1188545) [DOI] [PubMed] [Google Scholar]
- 24.Avise JC, Robinson TJ. 2008. Hemiplasy: a new term in the lexicon of phylogenetics. Syst. Biol. 57, 503–507. ( 10.1080/10635150802164587) [DOI] [PubMed] [Google Scholar]
- 25.Guerrero RF, Hahn MW. 2018. Quantifying the risk of hemiplasy in phylogenetic inference. Proc. Natl Acad. Sci. USA 115, 12 787–12 792. ( 10.1073/pnas.1811268115) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hahn MW, Nakhleh L. 2016. Irrational exuberance for resolved species trees. Evolution 70, 7–17. ( 10.1111/evo.2016.70.issue-1) [DOI] [PubMed] [Google Scholar]
- 27.Mendes FK, Fuentes-González JA, Schraiber JG, Hahn MW. 2018. A multispecies coalescent model for quantitative traits. eLife 7, 1–24. ( 10.7554/eLife.36482) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lee KM, Coop G. 2017. Distinguishing among modes of convergent adaptation using population genomic data. Genetics 207, 1591–1619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Chan YF. et al. 2010. Adaptive evolution of pelvic reduction in sticklebacks by recurrent deletion of a Pitx1 enhancer. Science 327, 302–305. ( 10.1126/science.1182213) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Shapiro MD, Marks ME, Peichel CL, Blackman BK, Nereng KS, Jónsson B, Schluter D, Kingsley DM. 2004. Genetic and developmental basis of evolutionary pelvic reduction in three-spine sticklebacks. Nature 428, 717 ( 10.1038/nature02415) [DOI] [PubMed] [Google Scholar]
- 31.Colosimo PF, Hosemann KE, Balabhadra S, Villarreal G, Dickson M, Grimwood J, Schmutz J, Myers RM, Schluter D, Kingsley DM. 2005. Widespread parallel evolution in sticklebacks by repeated fixation of ectodysplasin alleles. Science 307, 1928–1933. ( 10.1126/science.1107239) [DOI] [PubMed] [Google Scholar]
- 32.Welch JJ, Jiggins CD. 2014. Standing and flowing: the complex origins of adaptive variation. Mol. Ecol. 23, 3935–3937. ( 10.1111/mec.12859) [DOI] [PubMed] [Google Scholar]
- 33.Haldane JBS. 1957. The cost of natural selection. J. Genet. 55, 511 ( 10.1007/BF02984069) [DOI] [Google Scholar]
- 34.Kimura M. 1961. Natural selection as the process of accumulating genetic information in adaptive evolution. Genet. Res. 2, 127–140. ( 10.1017/S0016672300000616) [DOI] [Google Scholar]
- 35.Cover TM, Thomas JA. 2012. Elements of information theory. New York, NY: John Wiley & Sons. [Google Scholar]
- 36.Shannon CE. 1948. A mathematical theory of communication. Bell Syst. Tech. J. 27, 3–55. ( 10.1002/bltj.1948.27.issue-3) [DOI] [Google Scholar]
- 37.Felsenstein J. 1971. On the biological significance of the cost of gene substitution. Am. Nat. 105, 1–11. ( 10.1086/282698) [DOI] [Google Scholar]
- 38.Maynard-Smith J. 1968. ‘Haldane’s dilemma’ and the rate of evolution. Nature 219, 1114–1116. ( 10.1038/2191114a0) [DOI] [PubMed] [Google Scholar]
- 39.Sved J. 1968. Possible rates of gene substitution in evolution. Am. Nat. 102, 283–293. ( 10.1086/282542) [DOI] [Google Scholar]
- 40.Barton N. 2017. How does epistasis influence the response to selection? Heredity 118, 96–109. ( 10.1038/hdy.2016.109) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Iwasa Y. 1988. Free fitness that always increases in evolution. J. Theor. Biol. 135, 265–281. ( 10.1016/S0022-5193(88)80243-1) [DOI] [PubMed] [Google Scholar]
- 42.Mustonen V, Lässig M. 2010. Fitness flux and ubiquity of adaptive evolution. Proc. Natl Acad. Sci. USA 107, 4248–4253. ( 10.1073/pnas.0907953107) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Peck JR, Waxman D. 2010. Is life impossible? Information, sex, and the origin of complex organisms. Evol.: Int. J. Org. Evol. 64, 3300–3309. ( 10.1111/evo.2010.64.issue-11) [DOI] [PubMed] [Google Scholar]
- 44.Watkins C. 2002. The channel capacity of evolution: ultimate limits on the amount of information maintainable in the genome. In Proceedings of the Third International Conference on Bioinformatics of Genome Regulation and Structure, Novisibirsk, Russia, vol. 2, pp. 58–60. [Google Scholar]
- 45.Frank SA. 2012. Natural selection. v. how to read the fundamental equations of evolutionary change in terms of information theory. J. Evol. Biol. 25, 2377–2396. ( 10.1111/jeb.12010) [DOI] [PubMed] [Google Scholar]
- 46.Felsenstein J. 1985. Phylogenies and the comparative method. Am. Nat. 125, 1–15. ( 10.1086/284325) [DOI] [Google Scholar]
- 47.Housworth EA, Martins EP, Lynch M. 2004. The phylogenetic mixed model. Am. Nat. 163, 84–96. ( 10.1086/380570) [DOI] [PubMed] [Google Scholar]
- 48.Lynch M. 1991. Methods for the analysis of comparative data in evolutionary biology. Evolution 45, 1065–1080. ( 10.1111/evo.1991.45.issue-5) [DOI] [PubMed] [Google Scholar]
- 49.Hadfield J, Nakagawa S. 2010. General quantitative genetic methods for comparative biology: phylogenies, taxonomies and multi-trait models for continuous and categorical characters. J. Evol. Biol. 23, 494–508. ( 10.1111/jeb.2010.23.issue-3) [DOI] [PubMed] [Google Scholar]
- 50.Stone GN, Nee S, Felsenstein J. 2011. Controlling for non-independence in comparative analysis of patterns across populations within species. Phil. Trans. R. Soc. B 366, 1410–1424. ( 10.1098/rstb.2010.0311) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Goudet J, Kay T, Weir BS. 2018. How to estimate kinship. Mol. Ecol. 27, 4121–4135. ( 10.1111/mec.2018.27.issue-20) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Coop G, Witonsky D, Di Rienzo A, Pritchard JK. 2010. Using environmental correlations to identify loci underlying local adaptation. Genetics 185, 1411–1423. ( 10.1534/genetics.110.114819) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Frichot E, Schoville SD, Bouchard G, François O. 2013. Testing for associations between loci and environmental gradients using latent factor mixed models. Mol. Biol. Evol. 30, 1687–1699. ( 10.1093/molbev/mst063) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Hancock AM, Witonsky DB, Gordon AS, Eshel G, Pritchard JK, Coop G, Di Rienzo A. 2008. Adaptations to climate in candidate genes for common metabolic disorders. PLoS Genet. 4, e32 ( 10.1371/journal.pgen.0040032) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Bonhomme M, Chevalet C, Servin B, Boitard S, Abdallah JM, Blott S, San Cristobal M. 2010. Detecting selection in population trees: the Lewontin and Krakauer test extended. Genetics 186, 241–262. ( 10.1534/genetics.110.117275) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Duforet-Frebourg N, Luu K, Laval G, Bazin E, Blum MG. 2015. Detecting genomic signatures of natural selection with principal component analysis: application to the 1000 genomes data. Mol. Biol. Evol. 33, 1082–1093. ( 10.1093/molbev/msv334) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Galinsky KJ, Bhatia G, Loh P-R, Georgiev S, Mukherjee S, Patterson NJ, Price AL. 2016. Fast principal-component analysis reveals convergent evolution of ADH1B in Europe and East Asia. Am. J. Human Genet. 98, 456–472. ( 10.1016/j.ajhg.2015.12.022) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Günther T, Coop G. 2013. Robust identification of local adaptation from allele frequencies. Genetics 195, 205–220. ( 10.1534/genetics.113.152462) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Adrion JR, Hahn MW, Cooper BS. 2015. Revisiting classic clines in Drosophila melanogaster in the age of genomics. Trends Genet. 31, 434–444. ( 10.1016/j.tig.2015.05.006) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Machado HE, Bergland AO, O’Brien KR, Behrman EL, Schmidt PS, Petrov DA. 2016. Comparative population genomics of latitudinal variation in Drosophila simulans and Drosophila melanogaster. Mol. Ecol. 25, 723–740. ( 10.1111/mec.13446) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Sedghifar A, Saelao P, Begun DJ. 2016. Genomic patterns of geographic differentiation in Drosophila simulans. Genetics 202, 1229–1240. ( 10.1534/genetics.115.185496) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Pickrell JK, Pritchard JK. 2012. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 8, e1002967 ( 10.1371/journal.pgen.1002967) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Refoyo-Martínez A, da Fonseca RR, Halldórsdóttir K, Árnason E, Mailund T, Racimo F. 2018. Identifying loci under positive selection in complex population histories. bioRxiv, 453092 ( 10.1101/453092) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Leinonen T, McCairns RS, O’hara RB, Merilä J. 2013. QST–FST comparisons: evolutionary and ecological insights from genomic heterogeneity. Nat. Rev. Genet. 14, 179 ( 10.1038/nrg3395) [DOI] [PubMed] [Google Scholar]
- 65.Whitlock MC. 2008. Evolutionary inference from QST. Mol. Ecol. 17, 1885–1896. ( 10.1111/mec.2008.17.issue-8) [DOI] [PubMed] [Google Scholar]
- 66.Berg J, Zhang X, Coop G. 2017. Polygenic adaptation has impacted multiple anthropometric traits. bioRxiv, 167551 ( 10.1101/167551) [DOI] [Google Scholar]
- 67.Berg JJ, Coop G. 2014. A population genetic signal of polygenic adaptation. PLoS Genet. 10, e1004412 ( 10.1371/journal.pgen.1004412) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Josephs EB, Berg JJ, Ross-Ibarra J, Coop G. 2019. Detecting adaptive differentiation in structured populations with genomic data and common gardens. Genetics 211, 989–1004. ( 10.1101/368506) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Karhunen M, Merilä J, Leinonen T, Cano J, Ovaskainen O. 2013. DRIFTSEL: an R package for detecting signals of natural selection in quantitative traits. Mol. Ecol. Resour. 13, 746–754. ( 10.1111/men.2013.13.issue-4) [DOI] [PubMed] [Google Scholar]
- 70.Ovaskainen O, Karhunen M, Zheng C, Arias JMC, Merilä J. 2011. A new method to uncover signatures of divergent and stabilizing selection in quantitative traits. Genetics 189, 621–632. ( 10.1534/genetics.111.129387) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Racimo F, Berg JJ, Pickrell JK. 2018. Detecting polygenic adaptation in admixture graphs. Genetics 208, 1565–1584. ( 10.1534/genetics.117.300489) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Kaplan NL, Hudson RR, Langley CH. 1989. The ‘hitchhiking effect’ revisited. Genetics 123, 887–899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Maynard Smith J, Haigh J. 1974. The hitch-hiking effect of a favourable gene. Genet. Res. 23, 23–35. ( 10.1017/S0016672300014634) [DOI] [PubMed] [Google Scholar]
- 74.Berg JJ, Coop G. 2015. A coalescent model for a sweep of a unique standing variant. Genetics 201, 707–725. ( 10.1534/genetics.115.178962) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Peter BM, Huerta-Sanchez E, Nielsen R. 2012. Distinguishing between selective sweeps from standing variation and from a de novo mutation. PLoS Genet. 8, 1–14. ( 10.1371/journal.pgen.1003011) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Roesti M, Gavrilets S, Hendry AP, Salzburger W, Berner D. 2014. The genomic signature of parallel adaptation from shared genetic variation. Mol. Ecol. 23, 3944–3956. ( 10.1111/mec.12720) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Bierne N. 2010. The distinctive footprints of local hitchhiking in a varied environment and global hitchhiking in a subdivided population. Evolution 64, 3254–3272. ( 10.1111/evo.2010.64.issue-11) [DOI] [PubMed] [Google Scholar]
- 78.Kim Y, Maruki T. 2011. Hitchhiking effect of a beneficial mutation spreading in a subdivided population. Genetics 189, 213–226. ( 10.1534/genetics.111.130203) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Santiago E, Caballero A. 2005. Variation after a selective sweep in a subdivided population. Genetics 169, 475–483. ( 10.1534/genetics.104.032813) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Slatkin M, Wiehe T. 1998. Genetic hitch-hiking in a subdivided population. Genet. Res. 71, 155–160. ( 10.1017/S001667239800319X) [DOI] [PubMed] [Google Scholar]
- 81.Racimo F. 2016. Testing for ancient selection using cross-population allele frequency differentiation. Genetics 202, 733–750. ( 10.1534/genetics.115.178095) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Ralph PL, Coop G. 2010. Parallel adaptation: one or many waves of advance of an advantageous allele? Genetics 186, 647–668. ( 10.1534/genetics.110.119594) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Ralph PL, Coop G. 2015b. The role of standing variation in geographic convergent adaptation. Am. Nat. 186, S5–S23. ( 10.1086/682948) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Chevin L-M, Martin G, Lenormand T. 2010. Fisher’s model and the genomics of adaptation: restricted pleiotropy, heterogenous mutation, and parallel evolution. Evol.: Int. J. Org. Evol. 64, 3213–3231. ( 10.1111/evo.2010.64.issue-11) [DOI] [PubMed] [Google Scholar]
- 85.Orr HA. 2005. The probability of parallel evolution. Evolution 59, 216–220. ( 10.1111/evo.2005.59.issue-1) [DOI] [PubMed] [Google Scholar]
- 86.Gordo I, Campos PR. 2006. Adaptive evolution in a spatially structured asexual population. Genetica 127, 217–229. ( 10.1007/s10709-005-4012-9) [DOI] [PubMed] [Google Scholar]
- 87.MacPherson A, Nuismer S. 2017. The probability of parallel genetic evolution from standing genetic variation. J. Evol. Biol. 30, 326–337. ( 10.1111/jeb.13006) [DOI] [PubMed] [Google Scholar]
- 88.Paulose J, Hermisson J, Hallatschek O. 2019. Spatial soft sweeps: patterns of adaptation in populations with long-range dispersal. PLoS Genetics 15, e1007936 ( 10.1101/299453) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Ralph PL, Coop G. 2015a. Convergent evolution during local adaptation to patchy landscapes. PLoS Genet. 11, e1005630 ( 10.1371/journal.pgen.1005630) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Crawford NG. et al. 2017. Loci associated with skin pigmentation identified in African populations. Science 358, eaa8433 ( 10.1126/science.aan8433) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Edwards M, Bigham A, Tan J, Li S, Gozdzik A, Ross K, Jin L, Parra EJ. 2010. Association of the OCA2 polymorphism his615arg with melanin content in east Asian populations: further evidence of convergent evolution of skin pigmentation. PLoS Genet. 6, 1–8. ( 10.1371/journal.pgen.1000867) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Norton HL. et al. 2007. Genetic evidence for the convergent evolution of light skin in Europeans and east Asians. Mol. Biol. Evol. 24, 710–722. ( 10.1093/molbev/msl203) [DOI] [PubMed] [Google Scholar]
- 93.Perry GH, Dominy NJ. 2009. Evolution of the human pygmy phenotype. Trends Ecol. Evol. 24, 218–225. ( 10.1016/j.tree.2008.11.008) [DOI] [PubMed] [Google Scholar]
- 94.Tucci S. et al. 2018. Evolutionary history and adaptation of a human pygmy population of Flores Island, Indonesia. Science 361, 511–516. ( 10.1126/science.aar8486) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Hudson RR. 2002. Generating samples under a Wright–Fisher neutral model of genetic variation. Bioinformatics 18, 337–338. ( 10.1093/bioinformatics/18.2.337) [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
This article has no additional data.