Abstract
Given the many small-effect loci uncovered by genome-wide association studies (GWAS), polygenic scores have become central to genomic medicine, and have found application in diverse settings including evolutionary studies of adaptation. Despite their promise, polygenic scores have been found to suffer from limited portability across human populations. This at first seems in conflict with the observation that most common genetic variation is shared among populations. We investigate one potential cause of this discrepancy: stabilizing selection on complex traits. Counterintuitively, while stabilizing selection constrains phenotypic evolution, it accelerates the loss and fixation of alleles underlying trait variation within populations (GWAS loci). Thus even when populations share an optimum phenotype, stabilizing selection erodes the variance contributed by their shared GWAS loci, such that predictions from GWAS in one population explain less of the phenotypic variation in another. We develop theory to quantify how stabilizing selection is expected to reduce the prediction accuracy of polygenic scores in populations not represented in GWAS samples. In addition, we find that polygenic scores can substantially overstate average genetic differences of phenotypes among populations. We emphasize stabilizing selection around a common optimum as a useful null model to connect patterns of allele frequency and polygenic score differentiation.
This article is part of the theme issue ‘Celebrating 50 years since Lewontin's apportionment of human diversity’.
Keywords: polygenic scores, stabilizing selection, Lewontin 1972, portability, genome-wide association studies
1. Introduction
Lewontin’s foundational early work found that most common genetic variation is shared among human populations [1]. Thus, it strongly refuted the view that genetic variation is partitioned among mostly invariable populations and has become a classic work discrediting discrete human races. Lewontin’s finding is reflected by low estimates of FST among populations [2], i.e. only a small proportion (approx. 10%) of the total allelic variance is attributable to differences in frequency between populations (a finding that has been replicated for many different marker types genome-wide; [2–7]). While some loci that are highly differentiated among human populations have been uncovered (e.g. underlying loci in skin pigmentation and infectious disease immunity; see [8] for a review), there are relatively few such strongly selected loci [9,10].
Lewontin’s results also have implications for our a priori expectations of the partitioning of phenotypic variation within and among human populations. For phenotypes evolving neutrally in diploids, we expect the proportion of additive genetic variance attributable to among-population differences to be ≈2 FST when all individuals are measured in a common set of environments [11–16]. Thus in humans, we expect only approximately 18% of additive genetic variance to be due to among-population differences when measured in a common set of environments. Compared to this null, phenotypes subject to divergent selection among populations are expected to be over-dispersed, while phenotypes subject to stabilizing selection with the same selective optimum are expected to be less differentiated among populations. To distinguish among potential contributors to genetic differences among populations, researchers often turn to settings like common gardens in an effort to eliminate among-population environmental variation. However, it is not feasible to measure human phenotypes in a common environment (a major drawback of studies that investigated population-level phenotypic variation; [17,18]). Thus for the majority of complex traits, we do not know the role of genetics, let alone natural selection, in explaining phenotypic differences among human populations.
These questions have received renewed interest in human genetics due to genome-wide association studies (GWAS), which have found common variation associated with many phenotypes within populations. GWAS have revealed that most phenotypes are highly polygenic within populations [19–21] and confirmed that much of the genetic variance is additive (reviewed in [22]). These observations have motivated phenotype prediction through additive genetic values, the additive contribution of polymorphisms to phenotypic differences among individuals. One common approach to predict an individual’s additive genetic value is based on a polygenic score, the sum across trait-associated loci of genotypes weighted by their estimated effects. Predictions based on polygenic scores are being explored in a number of clinical settings and more generally as a tool for understanding the genetic basis of disease and phenotypic variability. However, the generalizability of polygenic scores across populations is concerning because GWAS samples are strongly biased towards European populations and studies of other populations are much smaller [23–26]. There is wide agreement that these portability issues must be addressed so that the future clinical use of polygenic scores does not further compound inequalities in healthcare [23,24,26].
Currently, for most complex traits, polygenic scores poorly predict additive genetic values, and therefore phenotypes. This issue stands even in samples closely related to the GWAS population, because polygenic scores aggregate across many loci with slightly mis-estimated effects. These problems increase as we move to populations that are genetically and environmentally more distant from the GWAS populations. For one, the associated loci are usually not the causal loci underlying trait variation; instead they tag the effects of linked causal sites. The interpretation of the effect size of an associated variant can be tricky because of: (i) linkage disequilibrium (LD), whereby it absorbs the effects at correlated causal sites [27–31]; (ii) population stratification, whereby it absorbs the effects of covarying environments [32–36]; and (iii) gene-by-environment (GxE) or gene-by-gene (GxG) interactions, whereby its estimate is averaged over the interacting environmental contexts or genetic backgrounds in the sample [28,37–47]. As all of these factors can and will vary across populations, the effect sizes of alleles will differ among them and so polygenic scores will have lower prediction accuracy, i.e. imperfect portability, across populations.
Even with perfectly estimated effects at the causal loci with significant trait associations, the prediction accuracy of polygenic scores will be limited because GWAS only identify loci with common alleles that contribute enough variance to exceed some significance threshold determined by the sample size. Therefore, an allele that is rare in the GWAS sample but common elsewhere will not be discovered. This would lead to a greater reduction in the phenotypic variance accounted for, or prediction accuracy, in populations not represented in the GWAS sample (hereafter ‘unrepresented populations’; [30,40,44,48–52]). Indeed, many variants contributing to trait variation in European GWAS samples are not at a high enough frequency to be detected in other populations, suggesting that different sets of polymorphisms contribute to the trait variance in different populations [53,54]. Genetic differentiation likely contributes to the reduction in the prediction accuracy of polygenic scores in unrepresented populations, as groups with increasing genetic distance from GWAS samples experience a greater loss in prediction accuracy [55–58].
The factors that reduce the utility of polygenic scores for individual-level prediction may also complicate the interpretation of average polygenic score differences across populations. Such issues arise in studies of adaptation that use polygenic scores to assess the contribution of selection to the genetic basis of phenotypic differentiation among human populations. An early application of this approach found polygenic signals of selection on height within Europe [59,60], where polygenic scores were over-differentiated among populations compared to the neutral prediction based on FST. Importantly, the null distribution of this test of neutrality at the level of the phenotype does not rely on accurate polygenic scores and so changes in LD, GxG and GxE should not cause false signals. However, the results are very sensitive to slight biases in estimated effect sizes due to population structure, and indeed the signal of polygenic selection on height turned out to be almost entirely due to stratification [61–63]. More generally, the imperfect portability of polygenic scores and the fact that the mean environmental contribution to phenotypes can vary greatly between populations raise concerns about over-interpreting differences in mean polygenic scores as genetic differences in the average phenotype among populations [60,64–67].
The consequence of genetic differentiation on the low generalizability of GWAS results for phenotype prediction may appear to contradict Lewontin’s observation of minor allele frequency (MAF) differences between populations. However, allele frequency differentiation for complex traits under selection can occur at a rate faster than drift. This would lead to noisy estimates of additive genetic values, and so reduced prediction accuracy of polygenic scores, for unrepresented populations. In order to understand the impact of this turnover on portability, we need models of allele frequency differentiation that are informed by plausible forms of natural selection on complex traits.
Stabilizing selection with a constant fitness optimum is a sensible null model for the evolution of complex traits. Indeed, studies of its influence on allelic dynamics have set a conceptual foundation for interpreting and designing GWAS within populations [68]. Under stabilizing selection, intermediate trait values have the highest fitness, with decreasing fitness with distance from that optimum (figure 1a). Many quantitative traits have been shown to experience stabilizing selection in humans (e.g. [69]) as well as across many other species (e.g. [70–72]). Stabilizing selection also contributes to the lack of variation in morphological traits within and between closely related species and the morphological constancy of traits in the fossil record [73–77].
Figure 1.
Stabilizing selection to a constant phenotypic optimum constrains the evolution of mean phenotypes through underdominant selection at the locus-level. (a) Gaussian fitness function for phenotypic stabilizing selection (teal) and resulting density of simulated phenotypes in the population (coral). Lines correspond to the intermediate strength of stabilizing selection that we simulated (w = 5, where w determines the width of the fitness peak; see §2). Note how stabilizing selection keeps the genetic variance of the population (VP) small compared to the width of the selection peak (w). (b) Change in mean phenotype from the ancestor (generation 0) under stabilizing selection (orange) and neutral phenotypic evolution (pink). Each line corresponds to a single simulation; here 10 simulations are shown for each strength of selection. (c) Deterministic allele frequency trajectories, assuming an infinite population size, based on the underdominant model, at a locus that contributes to the variance of a trait under stabilizing selection (here w = 5). The trajectory differs according to effect size and starting frequency. Note the symmetry for starting frequencies that are the same distance from their closest boundaries.
Here, we show how stabilizing selection on complex traits reduces portability and increases the chance of false signals of directional polygenic adaptation, despite low overall genetic differentiation and no genetically based trait differentiation among populations. Under parameter ranges estimated from empirical studies, we combine simulations of stabilizing selection on complex traits with analytical models of its effect on genetic differentiation to investigate how stabilizing selection drives these results. We focus only on scenarios in which the trait optimum is shared among populations, leading to levels of trait differentiation among populations much smaller than neutrality. In our baseline scenario, stabilizing selection occurs on a single additive trait, where alleles have the same effect in different populations (reflecting no GxG or GxE interactions and no pleiotropy; we relax these assumptions later).
We study differentiation between a pair of populations, either an ancestral and descendant population or a pair of contemporary populations, in which the results of a GWAS in one of those populations is used to make trait predictions in the other. In doing so we provide a polygenic score perspective on earlier investigations into the relationship between population structure, stabilizing selection and quantitative trait variation (e.g. [78–83]). We show how these factors reduce the prediction accuracy of polygenic scores and can readily lead to patterns of polygenic score differentiation rife with the potential for misinterpretation.
2. Model background
An individual’s additive genetic value Gi is the sum of the additive effects of all alleles they carry,
2.1 |
where al represents the additive effect of an allele relative to another at locus l and gil is the number of copies of that allele carried by the individual. All of these loci denoted by l are polymorphic within a specified population from which the individual was drawn. Note that the additive genetic value does not represent an absolute measure of an individual’s phenotype, and instead represents the additive contribution of the polymorphisms they carry to their deviation from their population’s mean.
Under a constant selective environment, stabilizing selection keeps the population mean phenotype close to the optimum and decreases the phenotypic variance in the population because individuals on both tails of the distribution have lower fitness (figure 1a,b; [12,84,85]). To understand the process by which stabilizing selection reduces the phenotypic variance, we focus on the additive genetic variance
2.2 |
in which the first component refers to the additive genic variance (Va) and the second component accounts for the contribution of linkage disequilibrium (LD) among loci that contribute to the variance. In the short term, stabilizing selection reduces the phenotypic variance by generating negative LD between like-effect alleles, thereby limiting extreme phenotypes (known as the Bulmer effect; [86]). The additive genetic variance quickly reaches an equilibrium reflecting a balance between selection producing negative LD, and recombination and chromosome segregation breaking up that generated LD [87].
The long-term genetic response to stabilizing selection is driven by a reduction in the additive genic variance, in particular the variance in genotypes at a locus (Var(gl); the expected heterozygosity). Yet if the genetic basis of trait variation was truly infinitesimal, i.e. made up of loci of infinitely small effect, only genetic drift would erode variation in the long term. However, while the loci discovered by GWAS contribute very small effects, these effects are not infinitesimally small and so they can be directly acted on by selection [68,88]. Selection at the individual loci underlying trait variation is in many cases well approximated by underdominant selection, in which the more common allele fixes and the minor allele is lost (figure 2a; [84,89]). Owing to underdominant selection at the locus level, stabilizing selection removes polymorphisms at a faster rate than neutrality, with selection coefficients proportional to their squared effect sizes (see [68,88,90] for recent applications of such models to understand GWAS variation within populations). Meanwhile, under moderate strengths of stabilizing selection and a constant environment, the population mean phenotype stays very close to the optimum through rapid, small fluctuations at many individual loci.
Figure 2.
The effect of stabilizing selection, distribution of effect sizes and GWAS ascertainment scheme on the portability of a GWAS to another population. (a) Reduction in heterozygosity at loci that contributed to the variance 1500 generations ago in a population with Ne = 10 000. Each open point represents the midpoint of the effect size bin of width 0.01, within which we averaged heterozygosity from 200 simulations. Each filled point represents our analytical predictions for that midpoint. (b) Reduction over time in the total variance contributed by polymorphisms in an ancestral population. Lighter lines show results from 200 simulations and darker lines show analytical predictions. The axis at the top shows the expected neutral FST between the ancestral and descendant population for the time scales of divergence shown on the bottom axis. The inset on the left shows the distribution from which mutation effect sizes are drawn. For the results shown we used a normal distribution with standard deviation of 0.1. The inset on the right shows the equilibrium density of variance contributed from each effect size. As the width of the fitness peak increases, and thus stabilizing selection weakens, mutations with large effects can drift to higher frequencies and contribute a greater proportion of the trait’s variance. (c) The reduction over time in the total variance contributed by polymorphisms in an ancestral population for three different mutation effect size distributions, holding the strength of phenotypic stabilizing selection constant (w = 5). The inset shows the three mutation effect size distributions: two normal (green and blue) and a mixture of normals to produce a heavy-tailed distribution (yellow). The blue lines correspond to the same distribution used to produce sub-figure (b), thus replicating the orange lines in sub-figure (b). Lighter lines show results from simulations (100 each for yellow and green) and darker lines show analytical predictions. Predictions in (b) and (c) are based on the diffusion approximation; see equations (A 17) and (A 18).
We note that the full analysis of models of stabilizing selection is challenging because changes in allele frequencies and covariances must be tracked over many loci (e.g. [87,91]). Indeed, when we simulated physically linked loci with low recombination, we found that selection on an allele was weaker than we predicted due to the persistence of selection-generated LD between alleles with opposing effects (the Bulmer effect; electronic supplementary material, figure S1). To understand the long-term turnover in the genetic basis of trait variation, from here on we focus on the additive genic variance, Va, under the assumption that loci contributing to the variance are physically unlinked.
We begin by thinking of an ancestral population at equilibrium and the loss of ancestral phenotypic variance over time. When the population is at mutation-drift-selection equilibrium, stabilizing selection and genetic drift remove variation such that as time goes on less and less of the variance in the descendant population is contributed by ancestral polymorphisms. Instead, the variance contributed by new mutations that are private to the descendant population will increase, replenishing what was removed by drift and selection.
We assume a model of Gaussian stabilizing selection where the width of the fitness peak is determined by w (w2 is equivalent to VS in other models of stabilizing selection, e.g. [92]), which will approximate any symmetric quadratic stabilizing selection model when the mean trait is close to its optimum. We also assume thousands of unlinked loci contributing to trait variation and explore dynamics resulting from three different mutation effect size distributions (figure 2c); see appendix A, §(a) for simulation details.
We can predict the reduction in additive genic variance contributed by ancestral variants (anc) in the descendant population (desc) after t generations for alleles with effect size a using a common approximation for the per-generation loss (e.g. [93])
2.3a |
2.3b |
2.3c |
where N is the population size, VP is the total population trait variance and VP ≪ w2, S = 2Na2/w2 is the population-scaled selection coefficient of the allele, and FST ≈ t/2N for neutral polymorphisms. On the right-hand side of equation (2.3a), the first bracketed term is the per-generation reduction due to drift and the second term is the reduction due to stabilizing selection. Looking at the exponent in equation (2.3c), we see that the decay of the variance (heterozygosity) contributed by alleles depends on FST, but will be increased for alleles whose effect sizes are large enough such that their population-scaled selection is appreciable (S > 1). Equations (2.3) offer good intuition; however, in the remainder of the main text we show results from a diffusion approximation that we developed that is better for our purposes (extending from [68], see appendix A, §(c)).
The reduction in additive genic variance contributed by a particular polymorphism with effect size a is the same as the reduction in heterozygosity at that site. Selection causes a stronger reduction when the fitness peak is narrower (i.e. when w is smaller) and when the allele’s effect is larger (figure 2a; see electronic supplementary material, figure S2 for the approximation in equation (2.3a)). We can average the reduction in heterozygosity across all sites, weighting by the distribution of effect sizes and genic variance contributed by a given effect size, to predict the total remaining variance and thus total reduction in Va over time (figure 2b; equation (A 17)). In the example shown in figure 2, this total reduction tends to be weaker than what we see for the largest effect polymorphisms, because most sites that contribute to the variance are of small effect. This is because (i) under our mutational distribution most alleles have small effects, and (ii) under stabilizing selection, the equilibrium distribution of observed effects is narrower than the distribution of mutation effects.
The form of the decay in ancestral variance strongly depends on the distribution of mutation effects. Consider a case in which a higher proportion of introduced mutations are strongly selected due to their large effects (S > 10). Some of these alleles are still capable of drifting to intermediate frequencies, and so a higher proportion of the ancestral variance will be contributed by larger effect polymorphisms [68]. Therefore, stabilizing selection will cause a steeper reduction in the ancestral variance (figure 2c). Moreover, if most of the other mutations are nearly neutral, the early and steep decline in ancestral variance will be followed by a decline more consistent with neutrality (yellow lines in figure 2c; see [68] for a discussion of selection regimes). The extent to which ancestral variance is depleted determines the amount of shared additive genetic variance between diverging populations and thus the portability of polygenic scores.
3. Accuracy of polygenic score predictions
To understand the effect of stabilizing selection and drift on the prediction accuracy of polygenic scores in isolation from other sources of bias, we make the simplifying assumption that GWAS identify associations between polymorphisms and trait variation only at causal loci. For a GWAS within a population to identify a locus as being associated with the trait, the locus has to be polymorphic in that population and its phenotypic association has to achieve some level of statistical significance (i.e. contribute above some level of variance to the trait). For those causal loci with significant associations, we also assume that their effects are estimated perfectly and for the moment that these true effects do not vary within the sample, the population the sample was drawn from, or across populations (i.e. populations experience the same set of environments). We also assume that their effects are strictly additive. At the trait-associated loci, one can sum the additive effects of all alleles that an individual carries at a predefined set of markers to form a polygenic score.
We are interested in the reduction in prediction accuracy for a population not represented in the GWAS sample, relative to the prediction accuracy of those represented populations. To explore this we consider the genetic differentiation between a pair of populations, A and B, in which the GWAS sample is drawn from population A but not population B. When the effects of alleles do not vary between populations, we can quantify the reduction in phenotype prediction accuracy (r2) of polygenic scores, compared to using additive genetic values, constructed for any population as
3.1 |
where Si is an individual’s polygenic score and Gi is their additive genetic value. Since an individual’s polygenic score is only part of their additive genetic value when a GWAS estimates true effects, this reduction can be understood as the proportion of the total additive genetic variance, or proportion of the heritability, explained by GWAS-significant sites (appendix A, §(b)). The ratio of in population B to population A quantifies the reduction in prediction accuracy due to a lack of GWAS representation for population B. If these populations experience the same selective environment, we expect the same Var(G) for each population, so this reduction due to a lack of representation would simply be the ratio of the variances explained by polygenic scores.
Note that our definition of the prediction accuracy in population B is the squared correlation of the deviation of an individual’s polygenic score with its deviation in phenotype, where both of these deviations are with respect to the mean phenotype in population B. This definition matches typical polygenic score practices where predictions are statements about the departure of an individual from their ancestry group’s mean genetic value, rather than a prediction of their departure from the mean genetic value of the GWAS population. However, this definition of prediction accuracy does not include discrepancies from the evolution of mean polygenic scores and phenotypes among populations, meaning it does not account for a systematic shift in polygenic scores between populations. We turn to this point in §4.
(a) . Ascertainment of all causal loci in the GWAS sample
We begin with the simplified case in which all causal loci that are polymorphic in population A have been identified, such that polygenic scores equal the additive genetic values in population A. As polygenic scores from ancient DNA are being used to investigate the phenotypic diversity of ancient human populations [94], we first consider the prediction accuracy in a population ancestral to population A of a polygenic score constructed using variants found in population A. The reduction in polygenic score prediction accuracy for this ancestral population is approximately the reduction in variance contributed by that ancestral population to the present (figure 2b,c; electronic supplementary material, figure S4; see appendix A, §c(ii) for an explanation). While ancient individuals were likely not drawn from populations directly ancestral to present-day populations, we should observe the same general patterns with genetic differentiation between the ancient and present-day population (see also [95] for approximations of this decline).
For the remainder of this article, we consider A and B to be contemporary populations; we assume for simplicity that they split from a common ancestral population without subsequent gene flow. In figure 3a, we use simulations and analytical predictions to show how the prediction accuracy in population B decreases with increasing time since its common ancestor with population A. The span of neutral genetic differentiation was chosen to reflect a scale along which various human populations could fall. We see slightly weaker reductions in prediction accuracy with time when the variance of the mutation effect size distribution is quartered and stronger reductions when the mutation effect size distribution has a heavy tail (figure 3b; electronic supplementary material, figure S5). All variance-contributing loci were ascertained in population A, so the only reason the full genic variance in population B was not captured is because private polymorphisms contribute to the phenotypic variance in each population. At the time of divergence between the pair of populations, they entirely share their genetic basis of trait variation. Then as stabilizing selection and drift remove polymorphisms (at equilibrium), new mutations replenish them at different sites in each population, leading to the same total variance but different genetic bases of that variance (assuming a very large mutational target). Without gene flow between the pair of populations, the polymorphisms that arose since their common ancestor will remain private to each population and will thus not contribute to the variance in the other population. Therefore, the polymorphisms in population A that also contribute to the variance in population B must be ancestrally shared polymorphisms, specifically at those loci in which ancestral polymorphisms were not removed by drift or selection in either descendant population. Note that these shared polymorphisms on average contribute less variance in the descendant populations than in their ancestor. Thus we see a loss in the additive genetic variance shared between populations over time due to a reduction in both the number of shared ancestral polymorphisms and the variance they each contribute. In appendix A, §c(ii), we describe our analytical predictions for this process, which match well to simulations.
Figure 3.
Reduction of polygenic score prediction accuracy in unrepresented population B relative to contemporary, represented population A increases with the strength of stabilizing selection, density of large-effect mutations and power-based GWAS discovery schemes. (a) Reduction of prediction accuracy in population B, when all variance-contributing polymorphisms in population A were ascertained, with increasing time since the common ancestor of populations A and B. Paler lines show results from simulations and darker lines show results from analytical predictions. (b) Distribution of the reduction in prediction accuracy for population B relative to population A from simulations with w = 5 when different sets of variance-contributing sites in population A were ascertained. Results are shown for two mutation effect size distributions. The Gaussian distribution corresponds to the blue distribution and the heavy tail to the yellow distribution in the inset of figure 1c. The same Gaussian distribution was used to produce results in sub-figure (a).
(b) . Ascertainment of a subset of causal loci in the GWAS sample
To more realistically explore the impact of GWAS ascertainment, we explore power- and frequency-based variant discovery. In the main text, we focus on a case in which a GWAS uncovered the top 5% of variance-contributing polymorphisms in population A. Under our choice of effect size distribution and strengths of stabilizing selection, these polymorphisms explain just under 50% of the additive genic variance in population A (similar to that of height in Europeans; [96–98]). We also consider a case in which a GWAS uncovered all causal loci with an MAF that exceeds 1%. As expected, under any ascertainment scheme we observe a substantial drop in the variance explained in population B compared to the case in which all polymorphisms in population A are ascertained (figure 3b). If this reduction in variance for each population is the same, then the decline in prediction accuracy in population B relative to population A over time will be similar to the case when all polymorphisms in population A are ascertained. We find that this is approximately the case for a Gaussian mutation effect size distribution but not for the heavy-tail effect size distribution that we simulate (figure 3c). These differences arise from differences in the strength of selection on and variance contributed by ancestrally shared polymorphisms; see electronic supplementary material, §1.2 for details. We present results under GxE, pleiotropy, and directional selection in electronic supplementary material, §1.3.
4. Difference in polygenic means among populations
In the previous section, we described how stabilizing selection and ascertainment in the GWAS sample can reduce the prediction accuracy of individual genetic values, but what are the consequences for the mean polygenic score of populations? The mean polygenic score of the population is twice the sum of population allele frequencies weighted by effect sizes. If a trait is neutrally evolving, the loci contributing to its variation are just like other neutrally evolving loci, and so differences among populations in their mean polygenic score just reflect a weighted sum of neutral allele frequency differences. Naively, as trait-increasing alleles underlying a neutral trait are equally likely to drift up or down, one might think that over many loci we expect only a small mean difference between populations. However, the polygenic score is a sum rather than a mean, and so each locus we add into the score is like an additional step in the random walk that two populations take away from each other [99]. We expect the variance among populations, i.e. the average squared difference between population means and the global mean, to be 2VA FST [11,14]. We first explore the differentiation of mean polygenic scores under a constant optimum and constant environment, and then relax these assumptions.
(a) . Stabilizing selection to a constant optimum
Under a constant selective environment, stabilizing selection keeps population mean phenotypes close to their optimum in the face of genetic drift and mutation, such that the difference in mean phenotypes among populations with the same optimum should be minor relative to neutral expectations (even accounting for the lower genetic variance within populations under stabilizing selection; figure 4a). This reduction in the divergence of the mean phenotype between populations reflects the fact that if trait-increasing alleles accidentally drift up in frequency, thus pushing the population mean away from its selective optimum, trait-decreasing alleles are subject to directional selection in their favour (and vice versa).
Figure 4.
Polygenic scores can overestimate differences in additive genetic values between populations for traits under stabilizing selection. (a,b) Absolute standardized mean polygenic score differences between populations A and B, , either when (a) all polymorphisms in both populations were ascertained, or when (b) the top 5% of variance-contributing polymorphisms in population A were ascertained. This measure has the same interpretation as QX; it equals . (c) Partitioning of mean polygenic score differences between the ascertained and non-ascertained sets of polymorphisms. Ascertained polymorphisms are from the top 5% of variance-contributing sites in population A. Points represent results for a single simulation. Ellipses denote the 95% confidence interval. The dashed grey line is the line of exactly opposing effects.
To explore population differences in mean polygenic scores, we can use the polygenic score equivalent of QST/2FST [100,101], where QST measures the proportion of total variance in additive genetic values attributable to among-population differences. Under neutrality, for strictly additive traits we expect QST to equal 2FST estimated from neutral polymorphisms [14,15], and their ratio should be χ2-distributed with degrees of freedom equal to one fewer than the number of populations (an extension of the Lewontin–Krakauer test; [102]). The polygenic score analogue is QX [60], where additive genetic values are substituted by polygenic scores. For our pair of populations, our QX statistic is
4.1 |
where Z is the mean polygenic score in the population denoted in the subscript. When population mean phenotypes are over-dispersed relative to neutral expectations, QX will be larger than 1, potentially resulting in a statistically significant p-value under the null distribution. When population means are under-dispersed relative to neutral expectations, as we would expect for traits under stabilizing selection with the same optimum across populations, QX will be much smaller than 1, and the p-values under the null will be large.
Stabilizing selection to the same optimum tightly constrains the difference in mean additive genetic values between populations, i.e. the difference in mean polygenic scores using all of the variation, to be much lower than the difference for neutral traits, even after standardizing for the lower overall levels of variation (figure 4a). This leads to a distribution of QX that is skewed toward lower values than the neutral χ2-distribution (electronic supplementary material, figure S12). However, when we ascertain the top 5% of variance-contributing polymorphisms in population A, the level of standardized polygenic score differentiation under stabilizing selection becomes more similar to neutral levels (figure 4b), with a comparable level of false positive signals of adaptive differentiation as in the neutral case (electronic supplementary material, figure S12). While this result is more specific to our choice of mutation effect size distribution and strengths of stabilizing selection, we can generally conclude that for cases of stabilizing selection with limited ascertainment, estimates of standardized mean polygenic score differences will approach and perhaps exceed what we observe under neutrality. Note that under the neutral case, the distribution of standardized mean polygenic score differences stays about the same between ascertainment levels, whether ascertaining all polymorphisms from both populations or just the top 5% in population A. This is because, under neutrality, all alleles exhibit the same behaviour and do not evolve in coordination with one another. Thus stabilizing selection, combined with incomplete and asymmetric ascertainment, causes the inflation of estimated mean standardized population differences above that seen for the underlying genetic values.
Since under stabilizing selection to the same optimum we expect only small differences between populations in their mean genetic value, the disparity between the (true) differences in mean genetic values and (estimated) differences in mean polygenic scores represents the mean difference contributed by non-ascertained polymorphisms. Intuitively this occurs because if population B has a larger value of an ascertained polygenic score than population A, then the ascertained trait-increasing alleles have by chance drifted up in population B (compared to A) and this imbalance will have induced directional selection for the rest of the trait-increasing alleles to decrease their frequency to keep the population close to the optimum (under high polygenicity). In line with our expectations, we find that the mean polygenic score differences calculated from ascertained sites and mean polygenic score differences calculated from non-ascertained sites are close to opposite one another (figure 4b). The countervailing effect of the non-ascertained loci is noisier when stabilizing selection is weaker because with weaker selection, population means can drift further from their optimum. These results confirm that mean polygenic scores calculated from all polymorphisms (i.e. additive genetic values) should closely match between populations experiencing stronger and similar selection pressures and highlight how incomplete ascertainment can by chance generate misleading differences between them.
(b) . Adapting to a changing optimum or environment
The combination of changes in the stabilizing selection regime and incomplete ascertainment of causal polymorphisms can also generate misleading differences in mean polygenic scores and signals of differential selection among populations. When the optimal phenotype shifts from its ancestral value equally in both descendant populations, this directional selection generates responses in the true mean genetic values in each population that track each other extremely closely. However, because our polygenic score based on ascertained polymorphisms explains less of the variance in population B than in population A, we capture less of its response to selection and thus artificially generate a difference between populations in their mean polygenic scores (figure 5a). This shift in the mean polygenic score between populations, as well as a decrease in the total variance explained, can push the distribution of QX towards larger values than the neutral case (figure 5b). The chances of getting a p-value below a significance threshold of 0.05 are highest when stabilizing selection is strongest (w = 4; up to 30% chance depending on time since the optimum shift) but tend to be greater than neutrality for the strengths of selection we investigated (electronic supplementary material, figure S6). These signals of polygenic selection are consistent with the idea that QX informs us about directional selection on polygenic scores, as selection has driven an increase in the polygenic score of A relative to B. However, they also highlight the very incomplete picture that we obtain about selection and genetic differences in the phenotype.
Figure 5.
Polygenic scores can generate false signals of adaptive differentiation when populations adapt to the same optimum shift. (a) Change in mean phenotype or polygenic score over time since the optimum shifted by 2 standard deviations of the phenotype distribution. True phenotypes are different from additive genetic values in that they account for the contribution of substitutions. (b) Quantile–quantile plot of observed p-values against expected p-values under neutrality (uniform distribution). The dashed lines show equality; points that lie above these lines indicate that the observed distribution has a higher density of low p-values than the neutral distribution, and points that lie below them indicate the opposite.
5. Discussion
Our work builds on population genetics theory of stabilizing selection to investigate why polygenic scores perform poorly in populations not represented in GWAS samples (unrepresented populations) and how they can be misleading about average genetic differences among populations. Specifically, we provide a theoretical foundation to understand how stabilizing selection on complex traits impacts the population differentiation of polygenic scores without a difference in environments or trait optima. Negative selection has been invoked as an explanation for low portability, consistent with large effect alleles discovered by GWAS being rare in the population [30,54]. Here we explore these predictions under a model of stabilizing selection that has the advantage of offering clear-cut connections between an allele’s phenotypic and fitness effects. With this approach to modelling allele frequency dynamics, we obtained a detailed understanding of how a lack of population representation in GWAS can reduce the utility of polygenic scores in explaining phenotypic variation within and among populations. We discuss the dynamics contributing to low portability, considerations for those interested in explaining variation beyond the GWAS sample, and necessary discretion when using polygenic scores to understand group differences.
(a) . Dynamics contributing to low portability
While stabilizing selection around the same optimum among populations reduces phenotypic differentiation relative to neutrality, somewhat counterintuitively it increases genetic differentiation at trait-influencing loci compared to neutral polymorphisms. Genetic differentiation increases because stabilizing selection drives the turnover of polymorphisms that contribute to trait variation within populations; thus polymorphisms that are ancestrally shared between populations are eventually lost and replaced by private mutations. The consequences of this differentiation at large-effect QTLs and a few loci have been explored by various authors [82,83,103], and here we investigated its implications on GWAS results and downstream analyses. Over time, the sets of polymorphisms that contribute substantial variance in each population diverge, such that stabilizing selection reduces the additive genic variance in one population that can be explained by polymorphisms in another. Thus, polygenic scores constructed from polymorphisms ascertained in a GWAS will have reduced prediction accuracy in unrepresented populations compared to represented ones. The prediction accuracy for unrepresented populations decreases with increasing strengths of stabilizing selection and time since the common ancestor with represented populations. Occasional fluctuations in the fitness optima would negligibly influence our portability results, because such directional selection would cause very minor shifts in frequency over the short timescales we consider [90].
The distribution of mutation effect sizes is critical to understanding the effects of stabilizing selection on the turnover of genetic variation among populations. With increasing weight towards larger mutation effects, stabilizing selection leads to a faster reduction in the shared additive genetic variance, and thus polygenic score portability. In addition, the decline in prediction accuracy for unrepresented populations relative to represented ones will depend on frequency- and power-based ascertainment schemes; since polymorphisms shared between populations will more likely be of small effect and common, the discovery of polymorphisms based on MAF will lead to a weaker decline than discovery based on variance contributed. Alternatively, if the trait was truly infinitesimal, stabilizing selection would not contribute to the loss of heterozygosity, such that the sharing of genetic variance would be well predicted by genetic drift, or neutral FST. While the loci mapped by GWAS are often of small effect, and the traits highly polygenic, we know that their effects are not infinitesimal, as loci must make a reasonable contribution to the variance to be discovered by a GWAS (see [68] for a detailed population genetic model).
While we do not consider migration between represented and unrepresented populations here, the general decline in portability with increasing genetic differentiation between groups should hold, though the exact prediction will differ. Pleiotropy and GxE complicate the predictions of stabilizing selection for portability, which we discuss in electronic supplementary material, §1.3. Differences in the environment among populations can lead to changes in the effect sizes of alleles (GxE), which reduces the portability because (i) effect sizes will be only partially correlated between populations and (ii) stabilizing selection will purge more of the ancestral variation shared between populations since GxE increases the overall trait variance when the interacting environment changes. Pleiotropy, in our implementation, leads to the opposite effect. Holding the average strength of selection on all alleles constant, when an allele can independently affect multiple traits under stabilizing selection, the correlation between its effect size on the trait of interest and its selection coefficient weakens. Thus stabilizing selection purges less of the shared variation and causes a weaker reduction in portability.
(b) . Considerations for applications of GWAS results across groups
Like much of the recent work on portability, our work emphasizes the reduction in prediction accuracy for populations not represented in GWAS [29,31,48,104,105]. A natural conclusion is that polygenic predictions that work well across populations will require GWAS across a range of diverse ancestries, in line with other calls to reduce Euro-centrism in GWAS [23,24,26]. In addition to changes in allele frequency and linkage disequilibrium, the relationship of polygenic scores to phenotypes will vary across populations due to variation in genetic effects, assortative mating, and differences in GxG and GxE. Indeed the prediction accuracy of polygenic scores for some traits was recently shown to be quite variable across different groups within an ancestry, suggesting that GxE and environmental variation are quite prevalent for some traits [45]. In addition, associated variants on the same haplotype were found to have differing effects in European–Americans and admixed African–Americans, suggesting that genetic or environmental interactions modify additive effect sizes across groups [47]. Thus, we caution that a better understanding of the portability of polygenic scores across populations also requires a stronger understanding of the causes of variation in prediction accuracy within populations.
With the rise of ancient DNA sequencing, GWAS from contemporary populations have also been used to construct polygenic scores for ancient individuals (reviewed in [94]). These scores have been used to provide a window into the phenotypic diversity of past populations [106–108] and to disentangle genetic and environmental contributors to temporal phenotypic variation (e.g. at the Neolithic transition; [109–111]). Such studies are most convincing when there are relevant phenotypic measurements on at least some ancient individuals and polygenic prediction accuracies can be judged. However, investigators will often not have this luxury, leaving unclear the insight these approaches can provide. Some studies using ancient DNA have identified reduced rates of disease alleles in the past compared to present-day populations. While we have focused here on quantitative traits, rather than disease traits, we caution that purifying selection against risk alleles will lead modern day populations to systematically underrepresent the diversity of disease alleles in the past [107,112–114].
(c) . Misinterpretations of group differences based on polygenic scores
When using polygenic scores constructed from the ascertained set of polymorphisms, we increase the possibility of generating misleading signals of differentiation between populations. Stabilizing selection to a constant optimum alone does not generate more false signals of directional selection than what we expect under the neutral evolution of complex traits. However, the standardized difference between populations will be systematically over-estimated with ascertained polygenic scores. We see this result because with stabilizing selection, the unascertained portion of the variance tends to act exactly counter to the trend seen in the ascertained portion. Misleading signals of adaptive polygenic differentiation can also be generated when the stabilizing selection regime shifts in the same way in each population (such that there is still minor phenotypic differentiation between them) as the ascertained polymorphisms only capture a shift in the ascertainment population. This issue arises because stabilizing selection lowers the proportion of the additive genic variance explained in the unrepresented population and so we capture a lower proportion of that population’s response to directional selection. Thus this issue of missing the parallel adaptive response across ancestries can be expected in many situations with imperfect portability. Such signals of polygenic adaptation can be useful as QX is correctly detecting that directional selection has acted on the genetic variation along the branch leading to the represented population, but the signal is very open to the misinterpretation that the unrepresented population has not also responded to the same selection pressures.
Many traits have likely experienced a mixture of stabilizing selection and bursts of directional selection across human history. Even if the populations share the same phenotypic optimum, if an environmental change systematically shifts one population away from this optimum, there would be directional polygenic adaptation to move that population back towards the optimum, resulting in a difference in polygenic scores but no difference in the mean phenotypes between populations [67]. Therefore, under current ascertainment schemes and pervasive stabilizing selection, the difference in mean polygenic scores among populations provides very unreliable information about the potential role of selection in generating phenotypic differences among populations.
A polygenic score is a prediction of how an individual’s phenotype is expected to deviate away from the sample mean given their genotype at some (large) number of polymorphisms. Sometimes they are quoted as absolute values, but that is always based on an empirical phenotypic mean. The mean polygenic score in a population cannot inform us of the mean phenotype in a population, even if it was constructed from all polymorphisms and averaged across all individuals. This is because these scores are based on polymorphic genotypes alone whereas the absolute phenotype of an individual represents the end product of the entire genome and environment played out through a vast number of developmental processes. Yet it is easy to fall into the trap of believing that a difference in polygenic scores between groups is a strong statement about the difference in the mean phenotype of those groups, which also differ in a myriad of environmental and cultural factors (a related set of issues are present in epidemology; [115]). Thus while the field of human genetics is increasing its power to predict phenotypic variance among individuals within groups, it remains a poor guide to the causes of phenotypic variance among groups with greater environmental and genetic differentiation [116,117].
Acknowledgements
We thank Molly Przeworski, Guy Sella, Michael Turelli, Vince Buffalo, Carl Veller, Sohini Ramachandran, Doc Edge, Noah Rosenberg, Yuval Simons and two anonymous reviewers for their valuable comments on earlier drafts of our manuscript. We also thank the members of the Coop lab for helpful discussions. This work used the Extreme Science and Engineering Discovery Environment (XSEDE) computing resource PSC Bridges-2 through allocation TG-MCB200074 to S.Y. and TG-BIO210159 to G.C.
Appendix A
(a) Simulation details
In SLiM version 3.6 [119], we simulated stabilizing selection on additive traits and divergence between a pair of populations with the same optimal phenotype (set at zero). Each simulation consisted of a constant population size of 10 000 diploid individuals and genomes comprising 4000 1bp quantitative trait loci and 1000 1bp neutral loci, with free recombination between loci. Mutations arose at rate 2 × 10−6 per base pair and for quantitative trait loci their effect size was randomly assigned from a normal distribution or a mixture of normal distributions. We used a Gaussian fitness function setting the fitness of an individual to
A 1 |
where r = ||r|| is the phenotypic distance from the optimum of an individual with a vector of phenotypes r and w quantifies the width of the fitness peak. The form of Gaussian selection is consistent with any form of quadratic selection if the population mean is near the optimum (see §Aa(ii)). We simulated quantitative traits under neutrality (w = ∞) and under varying widths of the fitness peak, w, in which larger values of w indicate weaker stabilizing selection. We chose strengths of stabilizing selection and an effect size distribution that correspond to observations in human populations (see §§Aa(i) and (ii) for details). We calculated an individual’s fitness in each generation based on equation (A 1), where their phenotype was the sum of the effects of alleles they carried at both variable and fixed sites.
We burned in each simulation for 60 000 (6N) generations before splitting the population into two for 3500 generations. We recorded mutation effect sizes and frequencies in each population every 100 generations for the first 500 generations, and then every 500 generations. To reduce computation times, groups of 10 simulation replicates shared the first 40 000 generations of their burn-in. The remaining 20 000 generations provided more than enough time for a complete turnover in the genetic basis of trait variation, such that simulations within a group were effectively independent. We simulated 100 or 200 replicates of each parameter combination.
(i) Choice of parameter values for strength of selection on the trait
We simulated under four different strengths of stabilizing selection (w), chosen to range from no selection (neutral trait) to the highest strength of stabilizing selection estimated from the UK BioBank [69]. These estimates were determined from regressions of relative fitness (based on estimates of lifetime reproductive success) on squared standard normal phenotypes, r2, where half of the coefficient of the quadratic term is the quadratic selection gradient, γ, in which negative values imply stabilizing selection, and the intercept is the mean fitness in the population, . The authors report estimates of γ from their regressions, and so to get strengths of stabilizing selection (w) under our slightly different fitness model, we make the following approximations to fitness. At equilibrium, the distribution of phenotypes is closely centred around the optimal phenotype relative to the width of the fitness gradient, and therefore we assume that mean fitness . Additionally assuming that the mean phenotype in the population is the optimal phenotype, we can then approximate that the quadratic fitness function is equivalent to ours when γ = −1/w2, because γ/2r2 + 1 ≈ exp (γr2/2). Based on incremental γ-values of 0, −0.02, −0.04 and −0.06 (the maximum estimated), we use the following values of the strength of stabilizing selection on the trait: w = ∞ (neutral trait), w = 7, w = 5 and w = 4.
(ii) Choice of effect size distribution
Effect sizes of new mutations were drawn from a normal distribution or mixture of normal distributions with mean zero. Provided that the strengths of selection we chose correspond to selection on standard normal phenotypes, we used a standard deviation that would result in a genetic variance close to 1 at equilibrium. However, we note that the size of the mutational target provides a simple inflation factor to the variance and cancels out when calculating the reduction in prediction accuracy; the mutation effect size distribution that we use is more important for determining the distribution of selection coefficients. See equation (A 13) for the solution to the equilibrium genic variance. Since the strength of selection helps determine the equilibrium genic variance, we used the weakest strength of selection to determine the variance of this effect size distribution (w = 7). For a Gaussian distribution of effect sizes, this leads to a solution for the standard deviation (σa) of 0.1. This distribution generates mutations that are mostly nearly neutral or weakly selected. To explore how different selection regimes lead to different declines in prediction accuracy, we also simulated mutation effect size distributions that involve a higher density of nearly neutral mutations or a higher density of strongly selected mutations. We used a Gaussian distribution with σa = 0.05 for the former and a mixture of Gaussians for the latter, which we call the ‘heavy tail’ distribution. The heavy tail distribution generates mutations from a Gaussian distribution with σa = 0.05 that has mean 0 with probability 0.65 and mean ±0.25 with probability 0.175. We chose these specifications for the heavy tail distribution in order to match the median absolute deviation of selection coefficients for the standard Gaussian with σa = 0.1.
(iii) Extension: directional selection
We extended our baseline stabilizing selection scenario for directional selection by imposing a positive optimum shift of either one or two standard deviations of the phenotypic distribution. The optimum shifted in the same way 2500 generations after the pair of populations diverged. We started directional selection simulations from the states of the baseline scenario that were recorded at the time of the optimum shift. To calculate the extent of the optimum shift in each simulation, we averaged the phenotypic standard deviation of each population in that simulation. Simulation details and results for other extensions can be found in electronic supplementary material, §1.3.
(b) Relative prediction accuracy of polygenic scores
The prediction accuracy of a polygenic score is the proportion of the phenotypic variance explained by that polygenic score (the squared correlation R2). We write out the correlation between an individual’s polygenic score constructed from unlinked polymorphisms discovered by GWAS, Si, and the true additive genetic value, Gi, in a particular target population. We assume that a GWAS discovers associations only at causal loci, and that at these loci with significant associations, it estimates their true effects. There are Ls loci with significant associations in the study and a remaining Lr that contribute to trait variation in a particular target population. Therefore, an individual’s predicted and true genetic values are
A 2 |
in which their true genetic value is a sum of their predicted genetic value and genetic value contributed by the remainder of sites contributing to trait variation. Because the predicted genetic value is a portion of the true genetic value, the covariance between them is the variance in predicted genetic values
A 3 |
Thus, the correlation between the predicted and true genetic values is
A 4 |
meaning that the correlation between the predicted and true additive genetic values is the square root of the proportion of the trait variance explained by polymorphisms ascertained by the GWAS, or that the variance in true genetic values explained by polygenic scores is the proportion of the genetic variance explained by the polymorphisms used to construct the polygenic scores. This value also represents the reduction in the prediction accuracy of polygenic scores compared to additive genetic values, when predicting true additive phenotypes (with contributions from both genetics and environment). We denote the additive environmental contribution to the phenotype in an individual as Ei. The prediction accuracy of an individual’s full additive phenotype (Gi + Ei) when using polygenic scores is
A 5 |
and when using additive genetic values is
A 6 |
such that the reduction of prediction accuracy when using polygenic scores instead of additive genetic values is
A 7 |
Thus represents the prediction accuracy of polygenic scores for additive genetic values, as well as the reduction in prediction accuracy for full additive phenotypes due to incomplete ascertainment.
(i) Modification for gene-by-environment interactions
While we assume that the effects of a causal allele are perfectly estimated in the GWAS sample, these effects may differ in populations that experience different environments (GxE). Thus for populations not represented in the GWAS, the correlation between polygenic scores and true additive genetic values will be lower than without GxE. Imagine that we use the set of effect sizes from GWAS population A (al,A) to construct polygenic scores for population B where the true effect sizes (al,B) differ. The covariance of the polygenic scores with true additive genetic values is then
A 8 |
Therefore, the correlation between polygenic scores and additive genetic values is
A 9 |
For a neutrally evolving trait, effect sizes and heterozygosities are independent and so
A 10 |
such that the neutral drop in prediction accuracy due to GxE is simply the decrease from 1 of the correlation of effect sizes between A and B.
(c) Modelling details
(i) Background
Assuming Gaussian stabilizing selection and that the population mean stays close to its optimum value, selection at an individual locus is well described by a model of underdominance where the per-generation change in an allele’s frequency x can be described by its mean and variance
A 11 |
where s = a2/w2 is the selection coefficient, with a being the additive effect of the allele with frequency x relative to the alternate allele, and N is the diploid population size. This approximation further assumes that the phenotypic variance is much smaller than the width of the fitness peak under stabilizing selection (σ2 ≪ w).
Assuming an infinite sites model, the mean time that a mutation spends in a certain frequency interval before fixation or loss (at a single site) multiplied by its population mutation rate is equivalent to the expected number of alleles segregating at those frequencies at a single time point [120]. The mutation rate of an allele with effect a is , where U is the per-generation mutation rate for the entire mutational target and is the proportion of mutations with effect a (with density f(a|σa), where σa is the standard deviation of the mutation effect size distribution). The mean time τ spent in frequency interval (x, x + dx) for a new mutation with effect a can be solved using the Green’s function (see eqn A 18 of [68]), such that
A 12 |
where erf is the error function and . Thus, we can solve for the expected total additive genic variance at equilibrium by summing over the heterozygosities contributed by all sites
A 13 |
(ii) Results
In our models, we focus on divergence without gene flow between a pair of populations, in which case they only share polymorphisms that arose in their common ancestor and were not lost in either population. To predict the loss of ancestral heterozygosity and probability that the descendant populations share polymorphisms, and thus describe the proportion of additive genic variance in one population explained by polymorphisms identified by a GWAS in the other, we consider the frequency trajectory of ancestrally segregating variants. To fully calculate these quantities, we would need the diffusion transition density with underdominant selection to calculate the distribution of the frequency in the present given the ancestral frequency, but we can build simple approximations of the combined effects of selection and drift. If we assume that the frequency of our allele x is close to 0 such that 1 − x ≈ 1, then we can approximate its expected per-generation change (equation (A 11)) as
A 14 |
such that its deterministic frequency trajectory follows the logistic function from starting frequency x0,
A 15 |
To consider the effects of genetic drift on the frequency trajectory, we assume that xt is normally distributed around its deterministic frequency trajectory as follows:
A 16 |
Reduction in variance due to drift and selection. We can predict the additive genic variance contributed by all ancestral polymorphisms (anc) to the descendent (desc) after t generations by averaging over their effect sizes, starting frequencies and trajectories,
A 17 |
The total reduction in the additive genetic variance contributed by ancestral polymorphisms after time t is thus given by E(Va[anc,desc](t, w, σa))/E(Va(w, σa)).
To obtain the reduction in heterozygosity used in figure 2a, we use the ratio of expected heterozygosities for the midpoints of the effect size bins evaluated. The equations we use have a similar structure to those where we predict the total variance explained by ancestral polymorphisms at time 0 and t, except that we do not integrate over all effect sizes and remove a2 to calculate the heterozygosity instead of variance
A 18 |
where Ht refers to the heterozygosity at time t.
Variance in ancestor explained by polymorphisms in descendant. To understand the ancestral variance attributable to present-day polymorphisms, we need to think of the properties of present-day alleles backward in time. For alleles with effect sizes large enough for selection to act strongly, selection (usually) constrains them from reaching appreciable frequencies and selection acts approximately like additive selection against the allele (eqn (A 14)). Conditional on the present-day frequency, the distribution of a deleterious, additive allele’s trajectory backward in time is the same as the process forward in time, such that g(xt|x0, a, w) = g(x0|xt, a, w) [121]. Therefore, we can write the expected additive genic variance in the ancestral population explained by variation segregating in the descendant population, in which they have diverged for t generations, as
A 19 |
which is identical to the additive genic variance contributed by all ancestral polymorphisms to the descendent (with just a flip in the subscripts; see equation (A 17)). Thus, the decay in the additive genic variance segregating in an ancestral population to the present day is the same as loss in prediction accuracy when using present-day polymorphisms to predict phenotypes in the ancestral population.
Variance in population B explained by polymorphisms in population A. When the population in which we make predictions is contemporary with the population represented in the GWAS, we again consider the variance explained by all polymorphisms that contribute to the variance in the prediction population. In our model of population divergence, these sites that contribute to the variance in the prediction population are sites that were segregating in the common ancestor that remain segregating in both the GWAS and prediction population. We first account for the variance contributed by ancestrally segregating sites in the descendant population in which we make predictions (B), as explained in equation (A 17)), and then for the possibility that each site is segregating, and thus ascertained, in the population from which the GWAS sample was drawn (A),
A 20 |
where x0 is the frequency of an allele in the common ancestor of populations A and B, xt,A and xt,B are its frequencies after time t in each of those descendant populations, respectively, and the probability that the ancestral variant is segregating in population A for ascertainment is given by
A 21 |
Data accessibility
Code used to generate simulations and process output can be found at https://github.com/SivanYair/SLiMsims_StabilizingSelection. The data are provided in electronic supplementary material [118].
Authors' contributions
S.Y.: conceptualization, formal analysis, investigation, methodology, software, visualization, writing—original draft, writing—review and editing; G.C.: conceptualization, formal analysis, funding acquisition, investigation, methodology, supervision, writing—original draft, writing—review and editing.
Both authors gave final approval for publication and agreed to be held accountable for the work performed therein.
Competing interests
We declare we have no competing interests.
Funding
Funding was provided by the National Institute of General Medical Sciences of the National Institutes of Health (NIH R01 GM108779 and R35 GM136290, awarded to G.C.).
References
- 1.Lewontin RC. 1972. The apportionment of human diversity. In Evolutionary biology (eds T Dobzhansky, MK Hecht, WC Steere), ch. 14, pp. 381–398. New York, NY: Appleton-Century-Crofts.
- 2.Li JZ, et al. 2008. Worldwide human relationships inferred from genome-wide patterns of variation. Science 319, 1100-1104. ( 10.1126/science.1153717) [DOI] [PubMed] [Google Scholar]
- 3.Barbujani G, Magagni A, Minch E, Cavalli-Sforza LL. 1997. An apportionment of human DNA diversity. Proc. Natl Acad. Sci. USA 94, 4516-4519. ( 10.1073/pnas.94.9.4516) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Jorde LB, Watkins WS, Bamshad MJ, Dixon ME, Ricker CE, Seielstad MT, Batzer MA. 2000. The distribution of human genetic diversity: a comparison of mitochondrial, autosomal, and Y-chromosome data. Am. J. Hum. Genet. 66, 979-988. ( 10.1086/302825) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rosenberg NA, Pritchard JK, Weber JL, Cann HM, Kidd KK, Zhivotovsky LA, Feldman MW. 2002. Genetic structure of human populations. Science 298, 2381-2385. ( 10.1126/science.1078311) [DOI] [PubMed] [Google Scholar]
- 6.Conrad DF, Jakobsson M, Coop G, Wen X, Wall JD, Rosenberg NA, Pritchard JK. 2006. A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nat. Genet. 38, 1251-1260. ( 10.1038/ng1911) [DOI] [PubMed] [Google Scholar]
- 7.Bergström A, et al. 2020. Insights into human genetic variation and population history from 929 diverse genomes. Science 367, eaay5012. ( 10.1126/science.aay5012) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Fan S, Hansen MEB, Lo Y, Tishkoff SA. 2016. Going global by adapting local: a review of recent human adaptation. Science 354, 54-59. ( 10.1126/science.aaf5098) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Coop G, Pickrell JK, Novembre J, Kudaravalli S, Li J, Myers RM, Cavalli-sforza LL, Feldman MW, Pritchard JK. 2009. The role of geography in human adaptation. PLoS Genet. 5, e1000500. ( 10.1371/journal.pgen.1000500) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hernandez RD, Kelley JL, Elyashiv E, Melton SC, Auton A, McVean G, 1000 Genomes Project, Sella G, Przeworski M. 2011. Classic selective sweeps were rare in recent human evolution. Science 257, 920-924. ( 10.1109/4.563681) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wright S. 1951. The genetical structure of populations. Ann. Eugen. 15, 323-354. ( 10.1111/j.1469-1809.1949.tb02451.x) [DOI] [PubMed] [Google Scholar]
- 12.Lande R. 1976. Natural selection and random genetic drift in phenotypic evolution. Evolution 30, 314-334. ( 10.1111/j.1558-5646.1976.tb00911.x) [DOI] [PubMed] [Google Scholar]
- 13.Rogers AR, Harpending HC. 1983. Population structure and quantitative characters. Genetics 105, 985-1002. ( 10.1093/genetics/105.4.985) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lande R. 1992. Neutral theory of quantitative genetic variance in an Island model with local extinction and colonization. Evolution 46, 381-389. ( 10.1111/j.1558-5646.1992.tb02046.x) [DOI] [PubMed] [Google Scholar]
- 15.Whitlock MC. 1999. Neutral additive genetic variance in a metapopulation. Genet. Res. 74, 215-221. ( 10.1017/s0016672399004127) [DOI] [PubMed] [Google Scholar]
- 16.Edge MD, Rosenberg NA. 2015. A general model of the relationship between the apportionment of human genetic diversity and the apportionment of human phenotypic diversity. Hum. Biol. 87, 313-337. ( 10.13110/humanbiology.87.4.0313) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Relethford JH, Lees FC. 1982. The use of quantitative traits in the study of human population structure. Am. J. Phys. Anthropol. 25, 113-132. ( 10.1002/ajpa.1330250507) [DOI] [Google Scholar]
- 18.Chakraborty R. 1990. Quantitative traits in relation to population structure: why and how are they used and what do they imply? Hum. Biol. 62, 147-162. [PubMed] [Google Scholar]
- 19.Loh PR, et al. 2015. Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis. Nat. Genet. 47, 1385-1392. ( 10.1038/ng.3431) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Shi H, Kichaev G, Pasaniuc B. 2016. Contrasting the genetic architecture of 30 complex traits from summary association data. Am. J. Hum. Genet. 99, 139-153. ( 10.1016/j.ajhg.2016.05.013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Boyle EA, Li YI, Pritchard JK. 2017. An expanded view of complex traits: from polygenic to omnigenic. Cell 169, 1177-1186. ( 10.1016/j.cell.2017.05.038) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hill WG, Goddard ME, Visscher PM. 2008. Data and theory point to mainly additive genetic variance for complex traits. PLoS Genet. 4, 1-10. ( 10.1371/journal.pgen.1000008) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Li YR, Keating BJ. 2014. Trans-ethnic genome-wide association studies: advantages and challenges of mapping in diverse populations. Genome Med. 6, 1-14. ( 10.1186/s13073-014-0091-5) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Popejoy AB, Fullerton SM. 2016. Genomics is failing on diversity. Nature 538, 161-164. ( 10.1038/538161a) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Macarthur J, et al. 2017. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucelic Acids Res. 45, 896-901. ( 10.1093/nar/gkw1133) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Martin AR, Kanai M, Kamatani Y, Okada Y, Neale BM, Daly MJ. 2019. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 51, 584-591. ( 10.1038/s41588-019-0379-x) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Vilhjálmsson BJ, et al. 2015. Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am. J. Hum. Genet. 97, 576-592. ( 10.1016/j.ajhg.2015.09.001) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Horikoshi M, Sofer T, Mahajan A, Kitajima H, Franceschini N, McCarthy MI, Kidney Consortium, Td-genes Consortium, Morris AP. 2017. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution. Hum. Mol. Genet. 26, 3639-3650. ( 10.1093/hmg/ddx280) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Lam M, Chen C-Y, Li Z, Martin AR, Bryois J. 2019. Comparative genetic architectures of schizophrenia in East Asian and European populations. Nat. Genet. 51, 1670-1678. ( 10.1038/s41588-019-0512-x) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Wang Y, Guo J, Ni G, Yang J, Visscher PM, Yengo L. 2020. Theoretical and empirical quantification of the accuracy of polygenic scores in ancestry divergent populations. Nat. Commun. 11, 1-9. ( 10.1038/s41467-020-17719-y) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Weissbrod O, et al. 2021. Leveraging fine-mapping and non-European training data to improve trans-ethnic polygenic risk scores. medRxiv. ( 10.1101/2021.01.19.21249483) [DOI]
- 32.Haworth S, et al. 2019. Apparent latent structure within the UK Biobank sample has implications for epidemiological analysis. Nat. Commun. 10, 333. ( 10.1038/s41467-018-08219-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kerminen S, et al. 2019. Geographic variation and bias in the polygenic scores of complex diseases and traits in Finland. Am. J. Hum. Genet. 104, 1169-1181. ( 10.1016/j.ajhg.2019.05.001) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Sakaue S, et al. 2020. A global atlas of genetic associations of 220 deep phenotypes. medRxiv, pp. 1–52. ( 10.1101/2020.10.23.20213652) [DOI]
- 35.Trochet H, Pelletier J, Tadros R, Hussin J. 2021. Comparison of polygenic risk scores for heart disease highlights obstacles to overcome for clinical use. bioRxiv. ( 10.1101/2020.08.09.243287) [DOI]
- 36.Isshiki M, Watanabe Y, Ohashi J. 2021. Geographic variation in the polygenic score of height in Japan. Hum. Genet. 140, 1097-1108. ( 10.1007/s00439-021-02281-4) [DOI] [PubMed] [Google Scholar]
- 37.Brown BC, Asian Genetic, Epidemiology Network, Type Diabetes, Ye CJ, Price AL, Zaitlen N. 2016. Transethnic genetic-correlation estimates from summary statistics. Am. J. Hum. Genet. 99, 76-88. ( 10.1016/j.ajhg.2016.05.001) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Coram MA, Fang H, Candille SI, Assimes TL, Tang H. 2017. Leveraging multi-ethnic evidence for risk assessment of quantitative traits in minority populations. Am. J. Hum. Genet. 101, 218-226. ( 10.1016/j.ajhg.2017.06.015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Adhikari K, et al. 2019. A GWAS in Latin Americans highlights the convergent evolution of lighter skin pigmentation in Eurasia. Nat. Commun. 10, 1-16. ( 10.1038/s41467-018-08147-0) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Bentley AR, et al. 2019. Multi-ancestry genome-wide gene–smoking interaction study of 387, 272 individuals identifies new loci associated with serum lipids. Nat. Genet. 51, 636-648. ( 10.1038/s41588-019-0378-y) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Galinsky KJ, Reshef YA, Finucane HK, Loh PR, Zaitlen N, Patterson NJ, Brown BC, Price AL. 2019. Estimating cross-population genetic correlations of causal effect sizes. Genet. Epidemiol. 43, 180-188. ( 10.1002/gepi.22173) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Grinde KE, Thornton TA, Chan KHK, Reiner AP. 2019. Generalizing polygenic risk scores from Europeans to Hispanics / Latinos. Genet. Epidemiol. 43, 50-62. ( 10.1002/gepi.22166) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Veturi Y, Yi N, Huang W, Vazquez AI. 2019. Modeling heterogeneity in the genetic architecture of ethnically diverse groups using random effect interaction models. Genetics 211, 1395-1407. ( 10.1534/genetics.119.301909) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Wojcik GL, et al. 2019. Genetic analyses of diverse populations improves discovery for complex traits. Nature 570, 514-518. ( 10.1038/s41586-019-1310-4) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Mostafavi H, Harpak A, Agarwal I, Conley D, Pritchard JK, Przeworski M. 2020. Variable prediction accuracy of polygenic scores within an ancestry group. eLife 9, e48376. ( 10.7554/eLife.48376) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Mathieson I. 2021. The omnigenic model and polygenic prediction of complex traits. Am. J. Hum. Genet. 108, 1-6. ( 10.1016/j.ajhg.2021.07.003) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Patel RA, et al. 2021. Effect sizes of causal variants for gene expression and complex traits differ between populations. bioRxiv. ( 10.1101/2021.12.06.471235) [DOI]
- 48.Martin AR, Gignoux CR, Walters RK, Wojcik GL, Neale BM, Gravel S, Daly MJ, Bustamante CD. 2017. Human demographic history impacts genetic risk prediction across diverse populations. Am. J. Hum. Genet. 100, 635-649. ( 10.1016/j.ajhg.2017.03.004) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Martin AR, et al. 2017. An unexpectedly complex architecture for skin pigmentation in Africans. Cell 171, 1340-1353. ( 10.1016/j.cell.2017.11.015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Curtis D. 2018. Polygenic risk score for schizophrenia is more strongly associated with ancestry than with schizophrenia. Psychiatr. Genet. 28, 85-89. ( 10.1097/YPG.0000000000000206) [DOI] [PubMed] [Google Scholar]
- 51.Kim MS, Patel KP, Teng AK, Berens AJ, Lachance J. 2018. How genetic disease risks can be misestimated across global populations. Genome Biol. 19, 179-193. ( 10.1101/195768) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Conti DV, et al. 2021. Trans-ancestry genome-wide association meta-analysis of prostate cancer identifies new susceptibility loci and informs genetic risk prediction. Nat. Genet. 53, 65-75. ( 10.1038/s41588-020-00748-0) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Liu JZ, et al. 2015. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat. Genet. 47, 979-986. ( 10.1038/ng.3359) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Durvasula A, Lohmueller KE. 2021. Negative selection on complex traits limits phenotype prediction accuracy between populations. Am. J. Hum. Genet. 108, 620-631. ( 10.1016/j.ajhg.2021.02.013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Scutari M, Mackay I, Balding D. 2016. Using genetic distance to infer the accuracy of genomic prediction. PLoS Genet. 12, 1-19. ( 10.1371/journal.pgen.1006288) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Bitarello BD, Mathieson I. 2020. Polygenic scores for height in admixed populations. G3: Genes, Genomes, Genetics 10, 4027-4036. ( 10.1534/g3.120.401658) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Cavazos TB, Witte JS. 2021. Inclusion of variants discovered from diverse populations improves polygenic risk score transferability. Hum. Genet. Genomics Adv. 2, 100017. ( 10.1016/j.xhgg.2020.100017) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Privé F, Aschard H, Carmi S, Folkersen L, Hoggart C, O’Reilly PF, Vilhjàlmsson BJ. 2022. Portability of 245 polygenic scores when derived from the UK biobank and applied to 9 ancestry groups from the same cohort. Am. J. Hum. Genet. 109, 12-23. ( 10.1016/j.ajhg.2021.11.008) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Turchin MC, Chiang CWK, Palmer CD, Sankararaman S, Reich D, Hirschhorn JN. 2012. Evidence of widespread selection on standing variation in Europe at height-associated SNPs. Nat. Genet. 44, 1015-1019. ( 10.1038/ng.2368) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Berg JJ, Coop G. 2014. A population genetic signal of polygenic adaptation. PLoS Genet. 10, e1004412. ( 10.1371/journal.pgen.1004412) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Berg JJ, et al. 2019. Reduced signal for polygenic adaptation of height in UK Biobank. eLife 8, e39725. ( 10.7554/eLife.39725) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Sohail M, et al. 2019. Polygenic adaptation on height is overestimated due to uncorrected stratification in genome-wide association studies. eLife 8, e39702. ( 10.7554/eLife.39702) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Refoyo-Martínez A, Liu S, Jørgensen AM, Jin X, Albrechtsen A, Martin AR, Racimo F. 2020. How robust are cross-population signatures of polygenic adaptation in humans? eLife 2019;8:e39702. ( 10.7554/eLife.39702) [DOI]
- 64.Novembre J, Barton NH. 2018. Tread lightly interpreting polygenic tests of selection. Genetics 208, 1351-1355. ( 10.1534/genetics.118.300786) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Coop G. 2019. Reading tea leaves? Polygenic scores and differences in traits among groups. arXiv 1909.00892. ( 10.48550/arXiv.1909.00892) [DOI]
- 66.Rosenberg NA, Edge MD, Pritchard JK, Feldman MW. 2019. Interpreting polygenic scores, polygenic adaptation, and human phenotypic differences. Evol. Med. Public Health 2019, 26-34. ( 10.1093/emph/eoy036) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Harpak A, Przeworski M. 2021. The evolution of group differences in changing environments. PLoS Biol. 19, e3001072. ( 10.1371/journal.pbio.3001072) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Simons YB, Bullaughey K, Hudson RR, Sella G. 2018. A population genetic interpretation of GWAS findings for human quantitative traits. PLoS Biol. 16, e2002985. ( 10.1371/journal.pbio.2002985) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Sanjak JS, Sidorenko J, Robinson MR, Thornton KR, Visscher PM. 2018. Evidence of directional and stabilizing selection in contemporary humans. Proc. Natl Acad. Sci. USA 115, 151-156. ( 10.1073/pnas.1806837115) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Bumpus HC. 1899. The Elimination of the Unfit as Illustrated by the Introduced Sparrow, Passer Domesticus: (a Fourth Contribution to the Study of Variation). Biological lectures delivered at the Marine Biological Laboratory of Wood’s Hole. Gin.
- 71.Kingsolver JG, Hoekstra HE, Hoekstra JM, Berrigan D, Vignieri SN, Hill CE, Hoang A, Gibert P, Beerli P. 2001. The strength of phenotypic selection in natural populations. Am. Nat. 157, 245-261. ( 10.1086/319193) [DOI] [PubMed] [Google Scholar]
- 72.de Villemereuil P. 2020. Fluctuating optimum and temporally variable selection on breeding date in birds and mammals. Proc. Natl Acad. Sci. USA 117, 31 969-31 978. ( 10.1073/pnas.2009003117) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Gingerich PD. 1983. Rates of evolution: effects of time and temporal scaling. Science 222, 159-161. ( 10.1126/science.222.4620.159) [DOI] [PubMed] [Google Scholar]
- 74.Weber KE. 1990. Selection on wing allometry in Drosophila melanogaster. Genetics 126, 975-989. ( 10.1093/genetics/126.4.975) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Barton NH, Keightley PD. 2002. Understanding quantitative genetic variation. Nat. Rev. Genet. 3, 11-21. ( 10.1038/nrg700) [DOI] [PubMed] [Google Scholar]
- 76.Hill WG, Kirkpatrick M. 2010. What animal breeding has taught us about evolution. Ann. Rev. Ecol. Evol. Syst. 41, 1-19. ( 10.1146/annurev-ecolsys-102209-144728) [DOI] [Google Scholar]
- 77.Houle D, Bolstad GH, Van Der Linde K, Hansen TF. 2017. Mutation predicts 40 million years of fly wing evolution. Nature 548, 447-450. ( 10.1038/nature23473) [DOI] [PubMed] [Google Scholar]
- 78.Cohan FM. 1984. Can uniform selection retard random genetic divergence between isolated conspecific populations? Evolution 38, 495-504. ( 10.1111/j.1558-5646.1984.tb00315.x) [DOI] [PubMed] [Google Scholar]
- 79.Narain P, Chakraborty R. 1987. Genetic differentiation of quantitative characters between populations or species II: optimal selection in infinite populations. Heredity 59, 199-212. ( 10.1038/hdy.1987.114) [DOI] [PubMed] [Google Scholar]
- 80.Lande R. 1991. Isolation by distance in a quantitative trait. Genetics 128, 443-452. ( 10.1093/genetics/128.2.443) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Goldstein DB, Holsinger KE. 1992. Maintenance of polygenic variation in spatially structured populations: roles for local mating and genetic redundancy. Evolution 46, 412-429. ( 10.1111/j.1558-5646.1992.tb02048.x) [DOI] [PubMed] [Google Scholar]
- 82.Latta RG. 1998. Differentiation of allelic frequencies at quantitative trait loci affecting locally adaptive traits. Am. Nat. 151, 283-292. ( 10.1086/286119) [DOI] [PubMed] [Google Scholar]
- 83.Le Corre V, Kremer A. 2003. Genetic variability at neutral markers, quantitative trait loci and trait in a subdivided population under selection. Genetics 1219, 1205-1219. ( 10.1093/genetics/164.3.1205) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Wright S. 1935. Evolution in populations in approximate equilibrium. J. Genet. 30, 257-266. ( 10.1007/BF02982240) [DOI] [Google Scholar]
- 85.Lande R, Arnold SJ. 1983. The measurement of selection on correlated characters. Evolution 37, 1210-1226. ( 10.2307/2408842) [DOI] [PubMed] [Google Scholar]
- 86.Bulmer MG. 1971. The effect of selection on genetic variability. Am. Nat. 105, 201-211. ( 10.1086/282718) [DOI] [Google Scholar]
- 87.Turelli M, Barton NH. 1994. Genetic and statistical analyses of strong selection on polygenic traits: what, me normal? Genetics 138, 913-941. ( 10.1093/genetics/138.3.913) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Sella G, Barton NH. 2019. Thinking about the evolution of complex traits in the era of genome-wide association studies. Annu. Rev. Genomics Hum. Genet. 20, 461-493. ( 10.1146/annurev-genom-083115-022316) [DOI] [PubMed] [Google Scholar]
- 89.Robertson A. 1956. The effect of selection against extreme deviants based on deviation or on homozygosis. J. Genet. 54, 236-248. ( 10.1007/BF02982779) [DOI] [Google Scholar]
- 90.Hayward LK, Sella G. 2021. Polygenic adaptation after a sudden change in environment. bioRxiv. ( 10.1101/792952) [DOI]
- 91.Turelli M, Barton NH. 1990. Dynamics of polygenic characters under selection. Theor. Popul. Biol. 38, 1-57. ( 10.1016/0040-5809(90)90002-D) [DOI] [Google Scholar]
- 92.Turelli M. 1984. Heritable genetic variation via mutation-selection balance: Lerch’s zeta meets the abdominal bristle. Theor. Popul. Biol. 25, 138-193. ( 10.1016/0040-5809(84)90017-0) [DOI] [PubMed] [Google Scholar]
- 93.Keightley PD, Hill WG. 1988. Quantitative genetic variability maintained by mutation-stabilizing selection balance in finite populations. Genet. Res. 52, 33-43. ( 10.1017/S0016672300027282) [DOI] [PubMed] [Google Scholar]
- 94.Irving-Pease EK, Muktupavela R, Dannemann M, Racimo F. 2021. Quantitative Human Paleogenetics: what can ancient DNA tell us about complex trait evolution? Front. Genet. 12, 703541. ( 10.3389/fgene.2021.703541) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Carlson MO, Rice DP, Berg JJ, Steinrücken M. 2021. Polygenic score accuracy in ancient samples: quantifying the effects of allelic turnover. bioRxiv. ( 10.1101/2021.09.21.461259) [DOI] [PMC free article] [PubMed]
- 96.Yang J, et al. 2010. Common SNPs explain a large proportion of the heritability for human height. Nat. Genet. 42, 565-569. ( 10.1038/ng.608) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Wood AR, et al. 2014. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat. Genet. 46, 1173-1186. ( 10.1038/ng.3097) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Yang J, et al. 2015. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat. Genet. 47, 1114-1120. ( 10.1038/ng.3390) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Chakraborty R, Nei M. 1982. Genetic differentiation of quantitative characters between populations or species I: mutation and random genetic drift. Genet. Res. 39, 303-314. ( 10.1038/hdy.1987.114) [DOI] [Google Scholar]
- 100.Prout T, Barker JSF. 1993. F statistics in Drosophila buzzatii: selection, population size and inbreeding. Genetics 134, 369-375. ( 10.1093/genetics/134.1.369) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Spitze K. 1993. Population structure in Daphnia obtusa: quantitative genetic and allozymic variation. Genetics 135, 367-374. ( 10.1093/genetics/135.2.367) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Lewontin RC, Krakauer J. 1973. Distribution of gene frequency as a test of the theory of the selective neutrality of polymorphisms. Genetics 74, 175-195. ( 10.1093/genetics/74.1.175) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Kremer A, Le Corre V. 2012. Decoupling of differentiation between traits and their underlying genes in response to divergent selection. Heredity 108, 375-385. ( 10.1038/hdy.2011.81) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Márquez-Luna C. 2017. Multiethnic polygenic risk scores improve risk prediction in diverse populations. Genet. Epidemiol. 41, 811-823. ( 10.1002/gepi.22083) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Duncan L, Shen H, Gelaye B, Meijsen J, Ressler K, Feldman M, Peterson R, Domingue B. 2019. Analysis of polygenic risk score usage and performance in diverse human populations. Nat. Commun. 10, 3328. ( 10.1038/s41467-019-11112-0) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Mathieson I, et al. 2015. Genome-wide patterns of selection in 230 ancient Eurasians. Nature 528, 499-503. ( 10.1038/nature16152) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Berens AJ, Cooper TL, Lachance J. 2017. The genomic health of ancient hominins. Hum. Biol. 89, 7-19. ( 10.13110/humanbiology.89.1.01) [DOI] [PubMed] [Google Scholar]
- 108.Martiniano R, et al. 2017. The population genomics of archaeological transition in west Iberia: investigation of ancient substructure using imputation and haplotype-based methods. PLoS Genet. 13, 1-24. ( 10.1101/134254) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Cox SL, Ruff CB, Maier RM, Mathieson I. 2019. Genetic contributions to variation in human stature in prehistoric Europe. Proc. Natl Acad. Sci. USA 116, 201910606. ( 10.1073/pnas.1910606116) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Cox SL, et al. 2022. Predicting skeletal stature using ancient DNA. Am. J. Biol. Anthropol. 177, 162-174. ( 10.1002/ajpa.24426) [DOI] [Google Scholar]
- 111.Marciniak S, et al. 2021. An integrative skeletal and paleogenomic analysis of prehistoric stature variation suggests relatively reduced health for early European farmers. bioRxiv 2021.03.31.437881. ( 10.1101/2021.03.31.437881). [DOI]
- 112.Aris-Brosou S. 2019. Direct evidence of an increasing mutational load in humans. Mol. Biol. Evol. 36, 2823-2829. ( 10.1093/molbev/msz192) [DOI] [PubMed] [Google Scholar]
- 113.Esteller-Cucala P, Maceda I, Børglum AD, Demontis D, Faraone SV, Cormand B, Lao O. 2020. Genomic analysis of the natural history of attention-deficit/hyperactivity disorder using Neanderthal and ancient Homo sapiens samples. Sci. Rep. 10, 8622. ( 10.1038/s41598-020-65322-4) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Simonti CN, Lachance J. 2021. Ancient DNA reveals that few GWAS loci have been strongly selected during recent human history. bioRxiv. 2021.04.13.439742. ( 10.1101/2021.04.13.439742) [DOI]
- 115.Rose G. 2001. Sick individuals and sick populations. Int. J. Epidemiol. 30, 427-432. ( 10.1093/ije/30.3.427) [DOI] [PubMed] [Google Scholar]
- 116.Lewontin RC. 1974. The analysis of variance and the analysis of causes. Am. J. Hum. Genet. 26, 400-411. ( 10.1093/ije/dyl062) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Feldman MW, Lewontin RC. 1975. The heritability hang-up. Science 190, 1163-1168. ( 10.1126/science.1198102) [DOI] [PubMed] [Google Scholar]
- 118.Yair S, Coop G. 2022. Population differentiation of polygenic score predictions under stabilizing selection. Figshare. [DOI] [PMC free article] [PubMed]
- 119.Haller BC, Messer PW. 2019. SLiM 3: forward genetic simulations beyond the Wright–Fisher model. Mol. Biol. Evol. 36, 632-637. ( 10.1093/molbev/msy228) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Sawyer SA, Hartl DL. 1992. Population genetics of polymorphism and divergence. Genetics 132, 1161-1176. ( 10.1534/genetics.107.073361) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Maruyama T, Kimura M. 1975. Moments for sum of an arbitrary function of gene frequency along a stochastic path of gene frequency change. Proc. Natl Acad. Sci. USA 72, 1602-1604. ( 10.1073/pnas.72.4.1602) [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Citations
- Yair S, Coop G. 2022. Population differentiation of polygenic score predictions under stabilizing selection. Figshare. [DOI] [PMC free article] [PubMed]
Data Availability Statement
Code used to generate simulations and process output can be found at https://github.com/SivanYair/SLiMsims_StabilizingSelection. The data are provided in electronic supplementary material [118].