Skip to main content
PLOS Biology logoLink to PLOS Biology
. 2024 Apr 11;22(4):e3002511. doi: 10.1371/journal.pbio.3002511

Interpreting population- and family-based genome-wide association studies in the presence of confounding

Carl Veller 1,*, Graham M Coop 2,*
Editor: Priya Moorjani3
PMCID: PMC11008796  PMID: 38603516

Abstract

A central aim of genome-wide association studies (GWASs) is to estimate direct genetic effects: the causal effects on an individual’s phenotype of the alleles that they carry. However, estimates of direct effects can be subject to genetic and environmental confounding and can also absorb the “indirect” genetic effects of relatives’ genotypes. Recently, an important development in controlling for these confounds has been the use of within-family GWASs, which, because of the randomness of mendelian segregation within pedigrees, are often interpreted as producing unbiased estimates of direct effects. Here, we present a general theoretical analysis of the influence of confounding in standard population-based and within-family GWASs. We show that, contrary to common interpretation, family-based estimates of direct effects can be biased by genetic confounding. In humans, such biases will often be small per-locus, but can be compounded when effect-size estimates are used in polygenic scores (PGSs). We illustrate the influence of genetic confounding on population- and family-based estimates of direct effects using models of assortative mating, population stratification, and stabilizing selection on GWAS traits. We further show how family-based estimates of indirect genetic effects, based on comparisons of parentally transmitted and untransmitted alleles, can suffer substantial genetic confounding. We conclude that, while family-based studies have placed GWAS estimation on a more rigorous footing, they carry subtle issues of interpretation that arise from confounding.


GWASs aim to estimate direct effects of genotype on an individual’s phenotype, but this can be subject to genetic and environmental confounds and "indirect" genetic effects of relatives’ genotypes. This study presents a theoretical analysis of the influence of confounders in population-based and within-family GWASs, showing that, while family-based studies are more rigorous, they still carry subtle issues that arise from confounding.

1 Introduction

Genome-wide association studies (GWASs) have identified thousands of genetic variants that are associated with a wide variety of traits in humans. In the standard “population-based” approach, the GWAS is conducted on a set of “unrelated” individuals. The associations that are detected can arise when a variant causally affects the trait or when it is in tight physical linkage with causal variants nearby.

Central to the aims of GWASs is the estimation of variants’ effect sizes on traits of interest. These effect-size estimates are important for identifying and prioritizing variants and implicated genes for functional followup, and may be used to form statistical predictors of trait values or to understand the causal or mechanistic role of genetic variation in traits. Understanding sources of error and bias in GWAS effect-size estimates is therefore crucial.

The interpretation of GWAS effect-size estimates is complicated by 4 broad factors [1,2]. First, the causal pathways from an allele to phenotypic variation need not reside in the individuals who enrolled in the GWAS, but can also reflect causal effects on the individual’s environment of the genotypes of their siblings, parents, other ancestors, and neighbors (indirect genetic effects or dynastic effects) [3]. Second, a phenotypic association can result from correlations between the allele and environmental causes of trait variation (environmental confounding) [4]. Third, a phenotypic association can be generated at a locus if it is genetically correlated with causal loci outside of its immediate genomic region (genetic confounding) [1]. Fourth, an allele’s effect on a trait might depend on the environment and the allele’s genetic background (gene–environment and gene–gene interactions, or G×E and G×G) [57].

Since our primary interest here will be genetic confounding, we briefly describe some potential sources of the long-range allelic associations that drive it: population structure, assortative mating, and selection on the GWAS trait.

Population structure leads to genetic correlations across the genome when allele frequencies differ across populations or geographic regions: sampled individuals from particular populations are likely to carry, across their genomes, alleles that are common in those populations, which induces correlations among these alleles, potentially across large genomic distances. Such genetic correlations persist even after the populations mix, as alleles that were more common in a particular source population retain their association until uncoupled by recombination.

Assortative mating brings alleles with the same directional effect on a trait (or on multiple traits, in the case of cross-trait assortative mating) together in mates, and therefore, bundles these alleles in offspring and subsequent generations. This bundling manifests as positive genetic correlations among alleles with the same directional effect [8,9], which can confound effect-size estimates in a GWAS on the trait.

Finally, natural selection on a GWAS trait can result in genetic correlations by favoring certain combinations of trait-increasing and trait-decreasing alleles. A form of selection that is expected to be common for many traits of interest is stabilizing selection, which penalizes deviations from an optimal trait value. By favoring compensating combinations of trait-increasing and trait-decreasing alleles, stabilizing selection generates negative correlations among alleles with the same directional effect [10,11], and therefore can confound effect-size estimates in a GWAS performed on the trait under selection or on a genetically correlated trait.

The potential for dynastic, environmental, and genetic confounds to bias GWAS effect-size estimates has long been recognized [4,12], and so a major focus of the literature has been to develop methods to control for these confounds [13,14]. Standard approaches include using estimates of genetic relatedness as covariates in GWAS regressions [15,16] or downstream analyses such as LD-Score regression [1719]. Such methods aim to control for both environmental and genetic confounding, but do so imperfectly (e.g., [20,21]). Further, it is often unclear what features of genetic stratification are being addressed [1,2]: assortative mating in particular may not be well accounted for by these methods [22]. Additionally, dynastic confounding will be particularly difficult to control for using population-level covariates.

One promising way forward is to estimate allelic effects within families, either by comparing the separate associations of parentally transmitted and untransmitted alleles with trait values in the offspring [2327], or by associating differences in siblings’ trait values with differences in the alleles they inherited from their parents [2830]. The idea is that, by controlling for parental genotypes, within-family association studies control for both environmental stratification and indirect/dynastic effects, while mendelian segregation randomizes alleles across genetic backgrounds. In principle, this allows the “direct genetic effect” of an allele—the causal effect of an allele carried by an individual on their trait value—to be estimated. Recognizing that a variant detected in a GWAS will usually not itself be causal for the trait variation but instead will only be correlated with true causal variants, the direct effect of a genotyped variant is usually interpreted as reflecting the direct causal effects of nearby loci that are genetically correlated with the focal locus [2]—but not the effects of more distant loci that might also be correlated with the focal genotyped locus (e.g., because of population structure or assortative mating).

Consistent with both the presence of substantial confounds in some population-based GWASs and the mitigation of these confounds in within-family GWASs, family-based estimates of direct effect sizes and aggregate quantities based on these estimates (e.g., SNP-based heritabilities) are substantially smaller than population GWAS estimates for a number of traits, most notably social and behavioral traits [3034]. Likewise, estimates of genetic correlations between traits are sometimes substantially reduced when calculated using direct effect estimates from within-family GWASs (e.g., [33]). While some of these findings could reflect the contribution of indirect genetic effects to population GWASs, it is also likely that, at least for some traits, standard controls for population stratification in population GWASs have been insufficient [20,21,3437].

Our aim in this paper is to study a general model of confounding in GWASs, to generate clear intuition for its influence on estimates of effect sizes in both population- and family-based designs. A number of the issues that we analyze have previously been raised, particularly in the context of population-based GWASs (e.g., [1,2,38,39]); here, we analyze them in a common framework that allows for comparison of multiple sources of confounding in both population and family-based GWASs. There is a large literature on GWASs in nonhuman organisms (e.g., [4043]). However, although the results and intuition that we derive here apply equally well to human and nonhuman GWASs, we shall interpret them primarily from the perspective of human GWASs, in which the inability to experimentally randomize environments, together with the small effects that investigators hope to detect, makes confounding a particular concern.

To focus on confounding, we assume no G×E and G×G interactions (the effects of G×E and G×G on estimates produced by population- and family-based designs are studied in ref. [44]). We derive expressions for estimators of direct effects in both population and within-family GWASs, as functions of the true direct and indirect effects at a locus and the genetic confounds induced by other loci. In doing so, we find that family-based estimates of direct effects are in fact susceptible to genetic confounding, contrary to standard interpretation. Reassuringly, in many of the models we consider, the resulting biases are likely to be small in humans. We also address a related case: family-based GWAS designs that consider transmitted and untransmitted parental alleles and in which the indirect (or “dynastic”) effect of an allele is estimated from its association with the offspring’s phenotype when carried by the parent but not transmitted to the offspring. We show that this estimator of indirect effects can be substantially biased by genetic and environmental confounds, in a similar way to population estimates of direct effects. Next, we consider various sources of genetic confounding—assortative mating, population structure, and stabilizing selection on GWAS traits—and how they influence estimates of direct effects in both population and within-family GWASs.

We then turn to sibling indirect effects, which are known to bias estimates of direct effects in sibling-based GWASs [2,34]. We characterize this bias in a simple model and contrast it to the bias caused by sibling indirect effects in a population GWAS.

2 Effect-size estimates in association study designs

Our primary focus will be on how genetic confounding can bias the estimation of direct genetic effects. These genetic confounds are due to associations between a genotyped variant at a GWAS locus and causal variants at other loci. As we will see, 2 kinds of association must be distinguished: cis-linkage disequilibrium (cis-LD) and trans-linkage disequilibrium (trans-LD). Genetic variants A and B are in positive cis-LD if, when an individual inherits A from a given parent, the individual is disproportionately likely to inherit B from that parent (Fig 1A). A and B are in positive trans-LD if, when an individual inherits A from one parent, the individual is disproportionately likely to inherit B from the other parent (Fig 1B). These covariances have also been called gametic and non-gametic LD, respectively (e.g., [45]). To quantify the degrees of cis-LD and trans-LD, we denote by Dij and D˜ij the allelic covariances between focal variants at loci i and j in cis and in trans, and we denote by rij and r˜ij the analogous allelic correlation coefficients. For some of our results, it will be important to distinguish the LD present in the sample on which the association study is performed and the LD present among the parents of the sample.

Fig 1. The influence of cis- and trans-LD on effect-size estimates in population-based and within-family association studies.

Fig 1

(A) The focal allele at an association study locus (solid red square) is in positive cis-LD with trait-increasing alleles at other loci (solid black squares) if it is disproportionately likely to be found alongside them on an individual’s maternally or paternally inherited genome. (B) The focal allele at the study locus is in positive trans-LD with trait-increasing alleles at other loci if it is disproportionately likely to be found across from them on the maternally and paternally inherited genomes. (C,D) In a population association study, both positive cis- and trans-LD between the focal allele at the study locus and trait-increasing alleles anywhere else in the genome—either on the same chromosome as the study locus or on different chromosomes—generate a spuriously high effect-size estimate at the study locus. (E,F) In a sibling association study, a trait-increasing allele causes a spuriously increased effect-size estimate at the study locus if the parent is a coupling double heterozygote for the focal and trait-increasing alleles, having inherited them from the same parent (E), but a spuriously decreased estimate if the parent is a repulsion double heterozygote, having inherited them from different parents (F). These biases arise only if the trait-affecting locus is on the same chromosome as the focal study locus. The net bias depends on the relative frequencies of coupling and repulsion double heterozygotes in the parents, which depends on the difference in the degrees of cis- and trans-LD.

Consider a trait Y influenced by genetic variants at a set of polymorphic loci L, each of which segregates for 2 alleles. For ease of interpretation, and without loss of generality, we designate the “focal” allele at locus lL to be the allele that directly increases the trait value, and we denote by pl the frequency of this allele. Allelic effects are assumed to be additive within and across loci, such that the trait value of an individual can be written

Y=Y*+lLglαlddirecteffects+lL(glm+glf)αliindirecteffects+ϵ. (1)

Here, gl, glm, and glf are the numbers (0, 1, or 2) of focal alleles carried at locus l by the individual, their mother, and their father, respectively, αld>0 is the direct effect of the focal allele at l, and αli is its indirect effect via the maternal and paternal genotypes. (For simplicity, we assume that indirect effects via the maternal and paternal genotypes are equal; this assumption is relaxed in S1 Text Section S1.) ϵ is the environmental noise, with E[ϵ]=0, and Y* is the expected trait value of the offspring of parents who carry only trait-decreasing alleles.

2.1 Population-based association studies

The variants at a genotyped locus will usually not themselves have causal effects on the trait, but will instead be in cis-LD with—and thus “tag”—causal variants at nearby loci. Thus, we typically think of the association at a focal genotyped locus as reflecting the direct contributions of a relatively small number of tightly linked loci, Llocal, found within tens or perhaps hundreds of kb from the focal locus [46]. For concreteness, we assume some fixed Llocal in our analyses. Informally, we intend “local” to capture only those loci that are within a sufficiently short recombination distance of the focal locus that the timescale of recombination between these loci is longer than that by which other processes, such as population structure, assortative mating, and selection, generate LD between more distant loci. We recognize that this definition is somewhat arbitrary, that in practice researchers seldom have a predefined number of “local” SNPs in mind, and that others might prefer alternative definitions (such as the entire chromosome of the focal locus) or to dispense with the idea of “local” altogether. However, we believe our definition is close to what researchers implicitly have in mind when they think of a genotyped locus as “tagging” causal loci. Moreover, many of the processes that we study below generate long-range LD not just within chromosomes but also across different chromosomes; to extend the definition of “tagging” to the entire chromosome of the focal locus, but not to the entire genome, is then also an arbitrary choice.

Under the additive model, the standard interpretation is that a population association study performed at a focal genotyped locus λ provides an estimate of the quantity

αλ=1pλ(1pλ)lLlocalDλlαld, (2)

where pλ is the frequency of the focal allele at λ, and Dλl is the degree of cis-LD between the focal allele at λ and a causal allele at a nearby locus lLlocal. It is reasonable to think of this quantity as the “direct effect” tagged by the focal variant at the genotyped locus λ: in the absence of confounding, it can be interpreted as the average phenotypic effect of randomly choosing a non-focal allele in the population and swapping it for a focal allele, where in this hypothetical swap, the causal alleles near the locus are included (though see ref. [44] for how G×E and G×G interactions can complicate such causal interpretations).

We shall interpret systematic deviations of effect-size estimates from Eq (2) as “biased.” This is because our focus is on the estimation of direct effects at individual loci rather than on other applications such as trait prediction. For the same reason, and following standard terminology in statistics, we refer to the causes of these biases as “confounds,” since they are a form of omitted variable bias in the marginal regression at the study locus λ. We recognize that these definitions of “bias” and “confounding” might be less appropriate in some other settings.

Effect-size estimation in a population GWAS is complicated by the presence of environmental and genetic stratification. Under the model in Eq (1), if we perform a standard population association study at locus λ, the estimated effect of the focal allele on the trait Y is

α^λpop=2Vλ(lLlocalDλlαld+lL\LlocalDλlαld+lLD˜λlαldgeneticconfounds,direct+lL[Dλl+D˜λl+2D˜λl]αligeneticconfounds,indirect+12Cov(gλ,ϵ)environmentalconfound), (3)

where, of the cis- and trans-LD terms, Dλl and D˜λl are defined in the GWAS sample while Dλl and D˜λl are defined in their parents (S1 Text Section S1.2). Vλ is the genotypic variance at λ, equal to 2pλ(1pλ)(1+Fλ), where Fλ is Wright’s coefficient of inbreeding at λ.

The environmental confound is Cov(gλ,ϵ)/Vλ; all non-local cis- and trans-LD terms in the study sample (Dλl and D˜λl,lLlocal) are direct genetic confounds (Fig 1C and 1D), and all cis- and trans-LD terms among parents of sampled individuals (Dλl and D˜λl), together with all trans-LD terms in the study sample (D˜λl), are indirect genetic confounds.

The direct genetic confounds arise because an allele carried by an offspring at λ is correlated with the alleles that they carry at other loci lL (via Dλl and D˜λl) that directly affect the trait value. The indirect genetic confounds arise because an allele carried by the offspring at λ—say, the maternal allele—is correlated with alleles carried by the offspring’s mother at other loci (Dλl and D˜λl) and alleles carried by their father (as reflected by the trans-LD in the offspring, D˜λl). These alleles in the parents can indirectly affect the offspring’s trait value.

Thus, as is now well appreciated, population-based GWASs potentially suffer from many types of confounds [1,2]. In practice, they can be reduced by including principal components—which capture genome-wide relatedness among GWAS participants—as regressors in a GWAS, or by using relatedness matrices in mixed models [15,16]. However, it is often unclear exactly what these methods control for in a given application [1,2], they can be sensitive to sample size [47], and they have been shown to be inadequate in important cases (e.g., [20,21]). When principal components (or other controls) fail to account fully for stratification, then Eq (3) can be interpreted as a decomposition of the remaining, uncontrolled-for confounding in the GWAS. (By the Frisch–Waugh–Lovell theorem ([48], pg. 36), Eq (3) is the estimate one obtains by first regressing the focal-locus genotype on the PCs, collecting the residuals from this regressions—which can be thought of as focal-locus genotype values stripped of whatever signal the PCs captured—and regressing the trait value on these residuals.)

2.2 Within-family association studies

The 2 within-family association study designs that we consider are parent–offspring GWASs and sibling GWASs. Other designs have been proposed to control for genetic and environmental confounding in the estimation of aggregate quantities such as heritability (e.g., [49]), but our primary focus is on the estimation of single-marker effect sizes. We do later turn to the interpretation of polygenic score (PGS) regressions within families.

Estimates of direct genetic effects

Parent–offspring studies can be used to estimate trait associations for parentally transmitted and untransmitted variants at a locus λ, α^λ(T), and α^λ(U), by regressing the trait value Y jointly on the transmitted and untransmitted genotypes, gλT and gλU [27] (note that here and throughout, we use “transmitted genotype” and “untransmitted genotype” as shorthand for the genotypes constructed from transmitted and untransmitted alleles, respectively). The aim is often to estimate the direct effect of a variant, α^λd, as the difference between these 2 regression coefficients:

α^λd,TU=α^λ(T)α^λ(U). (4)

A second aim is to treat α^λ(U) as an estimate of the indirect, or family, effect of the variant. We return to this second aim later.

In S1 Text Section S1.4, we show that, in the absence of interactions between parental and offspring genotypes, the estimate of the direct effect of a variant at locus λ in a parent–offspring study is

α^λd,TU=α^λ(T)α^λ(U)=2HλlL(12cλl)(DλlD˜λl)αld (5)
2Hλ(lLlocalDλlαld+lL\Llocal(12cλl)(DλlD˜λl)αldgeneticconfounds,direct), (6)

where Hλ is the fraction of parents who are heterozygous at locus λ and cλl is the sex-averaged recombination fraction between λ and l. The cis- and trans-LD terms Dλl and D˜λl are measured in the parents.

Similarly, an estimate of the direct effect can be obtained from pairs of siblings by regressing the differences in their phenotypes on the differences in their genotypes at the focal locus λ. In the presence of genetic confounds, this procedure yields the same estimate as Eq (6), in expectation (S1 Text Section S1.3):

α^λd,sib2Hλ(lLlocalDλlαld+lL\Llocal(12cλl)(DλlD˜λl)αldgeneticconfounds,direct). (7)

An assumption in sibling GWASs is that an offspring’s phenotype is not influenced by the genotypes of their siblings—i.e., that there are no sibling indirect genetic effects. We consider violations of this assumption later.

Both of the family-based estimates above implicitly make the further assumption that the association study is conducted before any selection has occurred in the offspring generation, so that the offspring genotypes reflect mendelian transmission from parents. This includes both natural selection and selection in which families/offspring are included in the sample (i.e., ascertainment).

In Eqs (6) and (7), there is no environmental confound, because family-based GWASs successfully randomize the environments of family members with respect to within-family genetic transmission; i.e., variation in offsprings’ genotypes around their parental or sibship means is randomly assigned with respect to environment.

The derivations above further show that, while population association studies are biased by sums of trans- and cis-LD between the focal locus and all causal loci (Eq (3)), within-family association studies are instead biased by differences between trans- and cis-LD, and moreover, that the biases in within-family studies are driven only by LD between the focal locus and causal loci on the same chromosome (cλl<1/2). To provide an intuition for this result, we focus our discussion on a sibling association study performed at λ; the intuition is identical for the analogous parent–offspring study.

Because the difference between 2 siblings in their maternally inherited genotypes is uncorrelated with the difference in their paternally inherited genotypes, we may consider maternal and paternal transmissions separately in studying how a locus lL can confound effect-size estimation at λ in a sibling association study. We will phrase our discussion in terms of maternal transmission.

For effect-size estimation at λ to be genetically confounded by maternal transmission at a distant locus l, the mother must be heterozygous at both loci. For if she were homozygous at l, then maternal transmission at l could not contribute to any trait differences between her offspring, while if she were homozygous at λ, maternal transmission would not result in genetic variation among her offspring at λ with which trait variation could be associated. Therefore, we restrict our focus to mothers who are heterozygous at both λ and l, or “double heterozygotes.” Two kinds exist (Fig 1E and 1F): coupling double heterozygotes who carry the focal alleles at λ and l on the same haploid genome (“in cis”) and repulsion double heterozygotes who carry them on opposite haploid genomes (“in trans”).

We first consider the case where the recombination rate between λ and l is small (cλl≪1/2). In this case, if the mother is a coupling double heterozygote, then her offspring will tend to inherit either both or neither of the focal alleles at λ and l (Fig 1E). Therefore, if one sibling inherits the focal allele at λ and another does not, the first sibling will tend to inherit the focal (trait-increasing, as we have defined it) allele at l and the second sibling will not, so that the effect of locus l positively confounds the association between λ and the trait (Fig 1E). If the mother is instead a repulsion double heterozygote, then her offspring will tend to inherit either the focal allele at λ or the focal allele at l, but not both (Fig 1F). In this case, if one sibling inherits the focal allele at λ and another does not, the second sibling will tend to inherit the focal (trait-increasing) allele at l and the first sibling will not, so that the effect of locus l negatively confounds the association between λ and the trait (Fig 1F). When λ and l are linked, therefore, the way in which l genetically confounds the effect-size estimate at λ depends, positively or negatively, on whether the fraction of coupling double heterozygotes among parents is greater or smaller, respectively, than the fraction of repulsion double heterozygotes.

In contrast, if λ and l are unlinked (cλl = 1/2), then transmissions from coupling and repulsion double heterozygote parents are equal, and so l cannot confound estimates at λ (Fig 1E and 1F). Put differently, meiosis in double heterozygotes fully randomizes joint allelic transmissions at λ and l, with offspring equally likely to inherit any possible combination of alleles at the 2 loci.

Therefore, only linked loci l can confound a family-based association study at λ, and they do so in proportion to (i) how small the recombination rate between λ and l is; and (ii) the difference between the fractions of parents who are coupling and repulsion double heterozygotes at λ and l. Accordingly, if we write these fractions of parents as Hλlcoup and Hλlrep, then DλlD˜λl=(HλlcoupHλlrep)/2, and so Eq (7) (and Eq (6)) can be rewritten in terms of the relative frequencies of the 2 kinds of double-heterozygotes:

α^λd,sib2Hλ(lLlocalDλlαld+lL\Llocal(12cλl)(HλlcoupHλlrep)αld).

In a species with many chromosomes, such as humans, for a given locus, there will be many more unlinked loci than linked loci. Therefore, the set of loci that can confound a family-based association study at a given locus will be much smaller than the set of loci that can confound a population association study at the locus. It will often be the case, therefore, that biases in the estimation of direct genetic effects will be smaller in family-based studies than in population studies, a point that we explore below when we consider sources of genetic confounding.

Estimates of indirect genetic effects

We now return to the coefficient on the untransmitted genotype in the joint regression of trait on transmitted and untransmitted genotype in parent–offspring GWASs, α^λ(U), which has sometimes been treated as an estimate of the indirect effect α^λi. Assuming equal indirect effects via maternal and paternal genotypes (an assumption that we relax in S1 Text Section S1.4),

α^λi=α^λ(U)=1Hλ(1+3F)(2lL[Dλl[(1+3F)cλl2F]+D˜λl[(1+3F)cλl+1+F]+(1F)D˜λl]αld+2(1F)lL[Dλl+D˜λl+2D˜λl]αli+(1+F)Cov(gλU,ϵ)2FCov(gλ,ϵ)). (8)

This expression is easiest to interpret when F = 0, in which case

α^λi=2Hλ(lLlocalDλlαlilocalindirecteffect+lL\Llocal(Dλlcλl+D˜λl(1cλl)+D˜λl)αldgeneticconfounds,direct+lL\Llocal(Dλl+D˜λl+2D˜λl)αligeneticconfounds,indirect+12Cov(gλU,ϵ)environmentalconfound). (9)

(This simplification makes use of the fact that, by our definition of Llocal, setting F = 0 at locus λ implies that D˜λl=D˜λl=0 for lLlocal—see Eq S.29 in S1 Text Section S1.4.)

The direct genetic confound reflects associations of the untransmitted alleles at the focal locus with alleles that are transmitted to the offspring at causal loci lL and which directly affect the offspring’s trait value (via αld). These associations are due to covariances among alleles in each parental genome (Dλl and D˜λl) and across the parental genomes (reflected as trans-LD in the offspring, D˜λl). The indirect genetic confound reflects associations of the untransmitted alleles to alleles at other loci in the parents, which can indirectly affect the offspring trait value (via αli). Finally, unlike in family-based estimates of direct genetic effects (Eqs (6) and (7)), family-based estimates of indirect effects suffer from environmental confounding, in the same way that population GWASs do (Eq (3)).

Therefore, using the coefficient on the untransmitted genotype in the joint regression of phenotype on transmitted and untransmitted genotype as an estimate of the indirect effect is highly susceptible to environmental confounding as well as both direct and indirect genetic confounding, in a similar way to estimating the direct effect via a population-based association study [50]. Adjustments for assortative mating in particular have been included in some PGS-based analyses of indirect effects (e.g., [27,34]). However, it is not clear how robust these adjustments are in the presence of multiple forms of confounding.

2.3 Polygenic scores and their phenotypic associations

A current drawback to family-based GWASs is that sample sizes are often small, limiting power to estimate direct genetic effects. Because of this limitation, instead of estimating per-locus effect sizes in family designs, investigators often measure the within-family phenotypic association of a combined linear predictor, a PGS, constructed using effect-size estimates across many loci from a population GWAS. In the sibling-based version of this study design, the difference in siblings’ population-based PGSs is regressed on their difference in phenotypes (e.g., [30,31]). In parent–offspring designs, offsprings’ trait values are regressed on their population-based PGSs, controlling for the midparent PGSs (e.g., [35]), or, equivalently, offsprings’ trait values are regressed jointly on their population-based PGSs constructed separately for transmitted and untransmitted alleles, and the difference in the slopes in this joint regression is taken (e.g., [27]).

When such PGS regressions are used within families for the same phenotype as the population GWAS, a non-zero slope of the PGS is usually interpreted as reflecting the fact that the PGS—despite having been calculated from a population GWAS and therefore subject to many potential confounds—nevertheless does capture the direct genetic effects of alleles. When the PGS for one phenotype is regressed within families on the value of another phenotype, non-zero slopes are often interpreted as evidence that direct genetic effects on the 2 phenotypes are causally related, for example, through pleiotropic effects of the alleles involved.

Suppose that we have performed a population GWAS for trait 1, generating effect-size estimates α^λ at a set of genotyped loci λ∈Λ. To construct a PGS for trait 1, these effect-size estimates are used as weights in a linear sum across an individual’s genotype:

PGS1=λΛgλα^λpop. (10)

In a sibling-based study (the results and intuition below will be the same for a parent–offspring study), the difference between siblings’ trait-1 PGSs, ΔPGS1, is regressed on the difference in their values for trait 2, ΔY2 (note that trait 2 could be the same as trait 1). If L is the set of loci that causally underlie variation in trait 2, and βl are the true direct genetic effects of variants at these loci on trait 2, then the numerator of the slope in this regression can be written as follows:

Cov(ΔPGS1,ΔY2)=2λΛlL(12cλl)(DλlD˜λl)α^λpopβl (11)

(see S1 Text Section S2). Note that, while the population-based effect-size estimates α^λ depend on cis- and trans-LD, as detailed by Eq (3), the patterns of LD may differ from those in the family study (the DλlD˜λl term in Eq (11)) if the population- and family-based studies differ in relevant aspects of sample composition.

The intuition for Eq (11) is similar to that for the single-locus effect-size estimate in a sibling GWAS (Eq (7)). The numerator of the difference in slopes of transmitted and untransmitted PGSs in a parent–offspring design takes a similar form to Eq (11).

In the absence of confounding and under some simplifying assumptions, the sibling PGS covariance measures the contribution of each locus included in the PGS to the additive genic covariance between traits 1 and 2 that is tagged by the genotyped variants included in the PGS (see Eq S.38 in S1 Text Section S2). Under these assumptions, the sibling PGS slope therefore does provide a measure of the underlying pleiotropy between the traits.

Interpretation of the sibling PGS slopes is more complicated in the presence of genetic confounding (see Eq S.37 in S1 Text Section S2), which is absorbed into the effect-size estimates α^λpop (Eq (3)) so that the PGS applies a potentially strange set of weights to the genotyped loci it includes. (A related problem occurs when indirect genetic effects absorbed by the population-based PGS change the interpretation of within-family PGS slopes [51,52].) A non-zero sibling PGS slope still establishes that the trait-1 PGS loci are in systematic signed intra-chromosomal LD with loci that causally affect trait 2. However, it no longer necessarily implies that traits 1 and 2 are causally related via pleiotropy, for 2 reasons. To understand these reasons, suppose that the causal loci for traits 1 and 2 are distinct, i.e., that there is in fact no pleiotropy. First, a SNP included in the trait-1 PGS could tag local variants that causally affect trait 1 but which are also, via sources of confounding such as cross-trait assortative mating, in systematic long-range LD with variants on the same chromosome that causally affect trait 2. Such SNPs will be predictive of sibling differences in trait 2, even though they locally tag only trait-1 causal variants. Second, LD between variants on the same or distinct chromosomes that are causal for trait 1 and trait 2 will cause some SNPs that locally tag trait-2 causal variants to be significantly associated with trait 1 in a population GWAS, and therefore to be included in the trait-1 PGS. These SNPs, since they tag trait-2 causal variants, will be predictive of sibling differences in trait 2.

In summary, in the presence of confounding, non-zero sibling PGS slopes cannot be viewed as de facto evidence for causal relationships between traits.

3 Sources of genetic confounding in association studies

As we have seen, genetic confounding of association studies depends, in ways that vary across study designs, on levels of non-local cis- and trans-LD between the study locus and loci that influence the study trait. Below, we consider various processes that give rise to non-local cis- and trans-LD, and their likely impact on the different association study designs. We focus our attention on the potential for these sources of LD to confound measurement of several key metrics. First, the average deviation of the estimated effect size from its true value, E[α^λαλ]. This metric indicates if effect sizes are systematically overestimated or underestimated because of genetic confounding. Second, the average squared effect-size estimate, weighted by heterozygosity: E[2pλ(1pλ)α^λ2]. The quantity 2pλ(1pλ)α^λ2 is proportional to the χ2-statistic of the association test at λ and determines whether the detected association is statistically significant. Its expectation is related to important measures such as the genetic variance and SNP-based heritability, and to LD-score regression [18] (we explore the connection of our results to LD-score regression in S1 Text Section S4). It is also directly related to the variance of effect-size estimates, and therefore captures the additional noise that genetic confounding creates in effect-size estimation at a given locus. Third, if GWASs have been performed on more than one trait, the covariance across loci of the effect-size estimates for 2 traits may be of interest. This covariance is determined by the average heterozygosity-weighted product E[2pλ(1pλ)α^λβ^λ], where α^λ and β^λ are the effect-size estimates at locus λ for traits 1 and 2.

In what follows, for simplicity, we ignore indirect effects and assume that there is no environmental confounding (i.e., no correlation between genotypes and the environmental effects ϵ). For each of the sources of genetic confounding that we consider, we calculate the 3 measures listed above both analytically and in whole-genome simulations carried out in SLiM 4.0 [53]. In our simulations, we use 2 recombination maps: (i) for illustrative purposes, a simple hypothetical map where the genome lies along a single chromosome of length 1 Morgan; and (ii) the human linkage map generated by Kong and colleagues [54]. A more detailed description of the simulations can be found in the Methods, and code is available at https://doi.org/10.5281/zenodo.10520811 and https://github.com/cveller/confoundedGWAS.

3.1 Assortative mating

Assortative mating is the tendency for mating pairs to be correlated for particular traits—either the same trait (same-trait assortative mating) or distinct traits (cross-trait assortative mating). For example, humans are known to exhibit same-trait assortative mating for height and cross-trait assortative mating for educational attainment and height (among many other examples, reviewed in refs. [37,55]). Assortative mating generates both cis- and trans-LD: It generates positive trans-LD among trait-increasing alleles because genetic correlations between mates translate to genetic correlations between maternally and paternally inherited genomes, and it generates positive cis-LD among trait-increasing alleles because, over generations, recombination converts trans-LD into cis-LD [9]. (In some cases, assortative mating can generate cis-LD by mechanisms additional to recombination—see ref. [56].)

Constant-strength assortative mating

If the strength of assortative mating, measured by the phenotypic correlation among mates ρ, is constant over time and there are no other sources of genetic confounding such as population structure, then, for a given pair of loci l,l′∈L, the positive cis-LD Dll will initially be smaller than the positive trans-LD D˜ll, but will gradually grow towards an equilibrium value equal to the trans-LD (Dll*=D˜ll*); in this equilibrium, assortative mating generates new cis-LD at the same rate as old cis-LD is destroyed by recombination (S1 Text Section S3.1; [9]).

Therefore, in a population GWAS, effect-size estimates will initially be biased upwards because of positive trans-LD, and the magnitude of the bias will grow over time as positive cis-LD too is generated from this trans-LD (Eq (3); Fig 2). In contrast, in a family-based GWAS, effect-size estimates will initially be biased downwards because the positive trans-LD exceeds the positive cis-LD (Eqs (6) and (7); Fig 2). However, as the cis-LD grows over time towards the value of the trans-LD, the magnitude of the downward bias will shrink, and, in equilibrium, the family-based GWAS will not be confounded by assortative mating (Fig 2).

Fig 2. Assortative mating systematically biases effect-size estimation in population and within-family GWASs, although the bias in within-family GWASs is expected usually to be small.

Fig 2

Here, cross-trait assortative mating between traits 1 and 2 occurs among parents for the first 19 generations, after which mating is random. Assortative mating is sex-asymmetric, with strength ρ = 0.2. Distinct sets of loci underlie variation in trait 1 and 2, with effect sizes at causal loci normalized to 1. Plotted are average estimated effects of the focal alleles at loci causal for trait 1 in population and within-family GWASs on trait 2, for a hypothetical genome with 1 chromosome of length 1 Morgan (A) and for humans (B). Since the traits have distinct genetic bases, the true effects on trait 2 of the alleles at trait-1 loci are zero. The horizontal lines at 0.1 are a theoretical “first-order” approximation of the asymptotic bias in a population GWAS (S1 Text Section S3.1). Profiles are averages across 10,000 replicate simulation trials. Simulation details can be found in the Methods and the code can be found at https://doi.org/10.5281/zenodo.10520811.

Under certain simplifying assumptions, we can calculate the average bias that assortative mating induces in a population GWAS in equilibrium, in the absence of other sources of genetic confounding such as population structure (S1 Text Section S3.1). In the case of same-trait assortative mating, effect-size estimates are inflated by an average factor of approximately 1/(1−h2ρ), where ρ is the phenotypic correlation among mates and h2 is the trait heritability (for similar calculations, see refs. [22,57]). As an example, the strength of assortative mating for human height has been estimated as ρ~0.25 [58], which, together with a heritability of h2~0.8, implies that effect-size estimates in a population-based GWAS would be inflated by a factor of about 1/(1−h2ρ)~1.25, a 25% amplification.

In the case of cross-trait assortative mating, if assortative mating is directional/asymmetric with respect to sex—i.e., the correlation ρ is between female trait 1 and male trait 2—then assortative mating generates spurious associations between trait 1 and alleles that affect trait 2 (and vice versa). If the loci underlying the 2 traits are distinct, then, in equilibrium, the spurious effect-size estimate at non-causal loci is approximately h2ρ/2 times the effect at causal loci, assuming the traits to have the same heritabilities and genetic architectures (horizontal dashed line in Fig 2; see S1 Text Section S3.1 for relaxations of these assumptions). If cross-trait assortative mating is bidirectional/symmetric with respect to sex, then, in equilibrium, the average spurious effect-size estimate at non-causal loci is approximately h2ρ times the effect at causal loci. Upward biases in effect-size estimates at causal loci are also expected under cross-trait assortative mating, but these are second-order relative to the biases at non-causal loci (S1 Fig).

The systematic over- and underestimation of effect sizes that assortative mating induces in population and family-based GWASs, respectively, will also affect our second measure of interest, the heterozygosity-weighted average squared effect-size estimate E[2pλ(1pλ)α^λ2] (and therefore, also downstream quantities such as SNP heritabilities). In a population GWAS, the presence of trans-LD and the gradual creation of cis-LD under assortative mating will increase the biases in effect-size estimates over time (Fig 2), which will concomitantly increase the average value of α^λ2 (Fig 3; also see ref. [22]). Moreover, cross-trait assortative mating will generate signals of genetic correlations among traits even in the absence of any pleiotropic effects of underlying variants [37]. In a family-based GWAS, the temporary attenuation of effect-size estimates owing to a transient excess of trans-LD over cis-LD under assortative mating will lead to a similar attenuation in the average squared effect-size estimate (Fig 3), although, like the bias in effect-size estimates themselves, this attenuation is expected to be small in humans (Fig 3B).

Fig 3. The impact of assortative mating on the average squared effect-size estimate in population and within-family GWASs.

Fig 3

Same-trait assortative mating of strength ρ = 0.2 occurs among parents in generations 0–20; mating is random before and after this period. Under random mating, the average squared effect-size estimates exceed the true average squared effect size (yellow line) because random drift generates chance local LD with causal alleles that inflates the variance of effect-size estimation (e.g., [18]). The magnitude of this variance inflation depends on the GWAS design and sample size, and the effect of assortative mating and its cessation should be judged in reference to it. To guide the eye in this judgment, the faint horizontal lines in (A) and (B) show the average squared effect-size estimate in the last 20 generations of the initial burn-in period of random mating. The inset in (B) zooms in on the sibling GWAS profile, omitting the population GWAS profile for clarity. Profiles are averages across 5,000 replicate simulation trials. Simulation details can be found in the Methods and the code can be found at https://doi.org/10.5281/zenodo.10520811.

As shown by Border and colleagues [22,37], the effects of assortative mating on estimates of heritability and genetic correlations described above are not well controlled for by LD Score regression [17,18]. The LD score of a variant proxies the amount of local causal variation the SNP tags, but because assortative mating generates long-range signed LD among causal variants, it causes local causal variants to be in long-range signed LD with other causal variants throughout the genome. Therefore, the slope of the LD score regression absorbs the effects of assortative mating, causing its estimates of heritability and of the degree of pleiotropy to be inflated (S1 Text Section S4). We note that the definition of heritability itself, and its measurement using various study designs, can be complicated by processes such as assortative mating (e.g., [59]).

Historical assortative mating

If, at some point in time, assortative mating for traits ceases and mating becomes random with respect to those traits, the positive trans-LD that was present under assortative mating will immediately disappear, leaving only the positive cis-LD that had built up; this cis-LD will then be gradually eroded by recombination. If equilibrium had been attained under assortative mating, the cis-LD would have grown to match the per-generation trans-LD. Therefore, in the first generation after assortative mating ceases, the upward bias in population-GWAS effect-size estimates would halve as the trans-LD disappears (Eq (3)); the bias would then shrink gradually to zero as the cis-LD erodes (Fig 2). A similar pattern will be observed for the heterozygosity-weighted average value of α^λ2 in the population GWAS, which eventually returns to its equilibrium level under random mating (Fig 3).

In contrast, with the disappearance of the positive trans-LD but the persistence of positive cis-LD, the bias in family-based effect-size estimates will suddenly become positive once assortative mating ceases (having temporarily been negative under assortative mating before equilibrium was attained); this bias too will then gradually shrink to zero as recombination erodes the remaining cis-LD (Fig 2). Concomitantly, the average squared effect-size estimate in the family GWAS will suddenly increase when assortative mating ceases, after which it too will gradually return to its equilibrium value under random mating (Fig 3).

Assortative mating between traits with different genetic architectures

An important practical question is how genetic confounding affects the GWAS loci we prioritize for functional follow-up and for use in the construction of PGSs. SNPs are usually prioritized on the basis of their GWAS p-value, which is proportional to the estimated variance explained by a SNP, 2pλ(1pλ)α^λ2 (where pλ is the minor allele frequency). The results above assume, in the case of cross-trait assortative mating, that the traits involved have similar genetic architectures (distribution of pl and αl at causal loci, and the total number of causal loci). In that case, if there is no pleiotropy between the traits, then while SNPs that tag trait-1 causal loci are predictive of the value of trait 2 owing to LD between trait-1 and trait-2 causal loci, we nonetheless expect the SNPs that tag trait-2 causal loci to be better predictors of trait 2, such that GWAS investigators would primarily pick out SNPs tagging trait-2 causal loci for prioritization and use in PGSs.

However, analysis of human GWASs suggests that quantitative traits can have widely different genetic architectures, with, in particular, substantial differences in the effective numbers of causal loci involved and in the distribution of minor allele frequencies ([60] and references therein). If 2 traits with distinct genetic bases show cross-trait assortative mating, but trait 1 has a denser genetic architecture (fewer causal loci) than trait 2, then the genetic signal of assortative mating—systematic LD between trait-1 and trait-2 causal loci—will be more heavily loaded per-locus onto trait-1 loci than onto trait-2 loci. In a GWAS on trait 2, this will inflate the magnitude of spurious effect-size estimates at SNPs that tag trait-1 loci relative to effect-size estimates at SNPs that tag causal trait-2 loci. In S1 Text Section S3.1, we quantify this effect, showing that, in a population GWAS for trait 2, the average magnitude of spurious effect-size estimates at trait-1 loci is proportional to |L2|/|L1|, where |L1| and |L2| are the numbers of loci underlying variation in traits 1 and 2, respectively. Thus, when trait 1 has a denser genetic architecture than trait 2 (|L2|/|L1| is large), the magnitudes of effect-size estimates at non-causal trait-1 loci could substantially overlap with those at causal trait-2 loci (as illustrated in Fig 4), potentially causing part of the apparent, mappable genetic architecture of the trait-2 GWAS to actually tag trait-1 loci.

Fig 4. Cross-trait assortative mating for traits with different genetic architectures can generate large spurious effect-size estimates in population GWASs.

Fig 4

Shown, for a population GWAS on trait 2, are estimated distributions of the magnitude of effect-size estimates at loci causal for trait 2 (gray) and at loci causal for trait 1 (greens), under random mating (light green) and after 20 generations of cross-trait assortative mating (sex-asymmetric, of strength ρ = 0.2) for traits 1 and 2 (dark green). Although the true effects of trait-1 loci on trait 2 are zero in these simulations (no pleiotropy), there is sampling noise in effect-size estimation at trait-1 loci under random mating (light green line), so that the mean magnitude of effect-size estimates is shifted away from zero (light green dot; dashed line displays 95th percentile under random mating). In (A), the traits have equal heritability, but the number of loci contributing variation to trait 1 is 10-fold smaller than that for trait 2. Under assortative mating, the magnitudes of the spurious effect-size estimates at trait-1 loci shift significantly rightward (dark green line), coming to overlap substantially with the distribution of effect-size estimate magnitudes at causal trait-2 loci (gray line; the distribution for trait-2 loci does not appreciably differ under random and assortative mating). In (B), equal numbers of loci contribute variation to the 2 traits. In this case, the rightward shift of the distribution of effect-size estimate magnitudes at trait-1 loci is not as pronounced as in (A). Densities are estimated from pooled effect-size estimates from 1,000 replicate simulations. Simulation details in Methods and the code can be found at https://doi.org/10.5281/zenodo.10520811.

3.2 Population structure

When a population GWAS draws samples from individuals of dissimilar ancestries, differences in the distribution of causal genotypes, and potentially of environmental exposures, can confound the association study [1,4]. Correcting for confounds due to population structure has therefore been an important pursuit in the GWAS literature [14,23,61].

For concreteness, consider a simple model where 2 populations diverged recently, with no subsequent gene flow between them. Genetic drift—and possibly selection—in the 2 populations will have led to allele frequency differences between them at individual loci. If allele frequencies have diverged at both a genotyped study locus and at loci that causally influence the study trait, these frequency differences will manifest as linkage disequilibria between the study locus and the causal loci in a sample taken across both populations, even if the loci are not in LD within either population. Specifically, if the frequencies of the focal allele at a given locus k are pk(1) and pk(2) in populations 1 and 2, then the cis-LD between the focal alleles at the association study locus λ and a causal locus l is

Dλl(S)=14(pλ(1)pλ(2))(pl(1)pl(2)) (12)

in a sample that weights the 2 populations equally, with the superscript (S) denoting that this LD is due to stratification. The trans-LD takes exactly the same form: D˜λl(S)=Dλl(S). From Eq (3), locus l therefore confounds estimation of the direct effect at λ in a population GWAS, by an amount proportional to

2(Dλl(S)+D˜λl(S))αld=(pλ(1)pλ(2))(pl(1)pl(2))αld. (13)

These genetic confounds are in addition to environmental confounding that would arise if the environments of the 2 populations alter their average trait values by different amounts.

In contrast, estimates of direct effects obtained from within-family association studies are not genetically confounded, because cis- and trans-LD are equal (Eqs (6) and (7)). Another way of seeing this is to consider that, by controlling for family, within-family GWASs control for the population, and in the scenario considered, by construction, there are no within-population LDs to confound effect-size estimation.

Allele frequency divergence due to drift

How do the confounds introduced by population structure affect the first of our measures of interest, the average deviation of effect-size estimates from their true values? The answer depends on the source of allele frequency differences between the 2 populations. If the differences are due to neutral genetic drift, they will be independent of each other (assuming causal loci are sufficiently widely spaced) and independent of the direction and size of effects at individual loci. Therefore, the LD induced by these allele frequency differences will, on average, not bias effect-size estimates in a population GWAS:

E[(pλ(1)pλ(2))(pl(1)pl(2))αld]=E[pλ(1)pλ(2)]E[pl(1)pl(2)]E[αld]=0, (14)

since E[pk(1)pk(2)]=0 at any locus k.

However, the LD induced by population structure will inflate the average squared effect-size estimate, and by extension the variance of effect-size estimates (Fig 5). In S1 Text Section S3.2, we quantify this effect for the same simple case of 2 separate populations. We find that the average squared effect-size estimate in a population GWAS is an increasing function of the divergence between the 2 populations (as measured by FST), the number of loci contributing variation to the study trait, and the true average squared effect size per locus (see also refs. [38,62]).

Fig 5. The impact of population structure and admixture on the average squared effect-size estimate in population and within-family GWASs.

Fig 5

Here, 2 populations are isolated until generation 0, at which point they mix in equal proportions. Initial allele frequencies are chosen independently for the 2 populations, such that allele frequency differences between the populations resemble those that would accumulate over time via random drift. As in Fig 3, the equilibrium value of the mean squared effect-size estimate under random mating is greater than the true mean squared effect size, in both the population and within-family GWAS, owing to linkage disequilibria among causal alleles that arise due to drift. This explains why, in the insets, the blue (population) and red (within-family) profiles do not shrink all the way down to the yellow (true) line after admixture, when mating is random. Note too the difference in scale of the y-axes in the insets: the return to equilibrium is much more rapid under the human genetic map (B) than for a hypothetical genome of one chromosome of length 1 Morgan (A), since, with more recombination, the ancestry-based linkage disequilibria are broken down more rapidly. Profiles are averages across 10,000 replicate simulation trials. Simulation details can be found in the Methods and the code can be found at https://doi.org/10.5281/zenodo.10520811.

In contrast, because effect-size estimates from within-family GWASs are not confounded in this model of isolated populations, the average squared effect-size estimate will not differ substantially from its expectation in an unstructured population (Fig 5).

While we have focused on a simple model of 2 isolated populations, the result that within-family association studies are not confounded holds for other kinds of population structure as well. Specifically, we may be concerned that a population GWAS suffers from genetic confounding along some given axis of population stratification. However, the family-based estimates will be unbiased by confounding along such an axis if the maternal and paternal genotypes at each locus are exchangeable with respect to each other along this axis (S1 Text Section S3.2). This requirement will be met in expectation under many models of local genetic drift in discrete populations or along geographic gradients. However, as we will shortly argue, migration and admixture introduce further complications.

Allele frequency divergence due to selection or phenotype-biased migration

Selection and phenotype-biased migration can also generate allele frequency differences among populations (for a review of phenotype-biased migration, see ref. [63]). Unlike genetic drift, both of these forces can lead to systematic directional associations between effect sizes and changes in allele frequencies between populations. For example, if selection has favored alleles that increase the trait in population 1 but not in population 2, then

E[(pl(1)pl(2))αld]>0 (15)

as directional selection causes systematic changes in allele frequencies across the loci l underlying variation in the trait under selection (e.g., [64]). Importantly, this form of selection can occur even if the mean phenotype of the 2 populations does not change [65,66]. Similarly, phenotype-biased migration, where, say, individuals with a higher value of the phenotype tend to migrate from population 2 to population 1, can also create a positive association between effect sizes and allele frequency differences (Eq (15)).

Unlike the case of neutral genetic drift in the 2 populations, where the sign of the LD between 2 alleles is independent of their effect sizes, the effect-size-correlated associations driven by selection or phenotype-biased migration can add up across loci, and thus lead to substantial, systematic biases in estimates of allelic effect sizes. This systematic genetic confounding would also substantially inflate the average squared effect-size estimate and thus measures of the genetic variance tagged by SNPs.

In addition, these systematic sources of genetic confounding can generate genetic correlations between traits with no overlap in their sets of causal loci—i.e., with no pleiotropic relationship. This will occur if 2 traits have both experienced selection or biased migration along the same axis. To take a concrete example, if people tend to migrate to cities in part based on traits 1 and 2, then these traits will become genetically correlated. If this axis is explicitly included as a covariate in the GWAS, then its influence on estimates of heritability and genetic correlations will be removed. However, its influence will not be removed by inclusion of genetic principal components or the relatedness matrix, if this axis (here, city versus non-city) is not a major determinant of genome-wide relatedness at non-causal loci [1]. Nor will LD score regression control for this influence, as the selection- or migration-driven differentiation of a variant along the axis will be correlated with the extent to which it tags long-range causal variants involved in either trait. This effect on LD score regression is similar to that discussed above for assortative mating [22,37]. Thus, like assortative mating, selection and phenotype-biased migration along unaccounted-for axes of population stratification can generate genetic correlations between traits. These selection- and migration-driven correlations should not necessarily be viewed as spurious, since genetic correlations should include those that arise from systematic long-range LD, but they complicate the interpretation of population-level genetic correlations as evidence for pleiotropy.

Again, these issues largely vanish in family-based studies, although phenotype-biased migration can cause transient differences in cis- and trans-LD that lead to biases in family-based estimates of direct effects (Eqs (6) and (7)).

3.3 Admixture

When populations that have previously been separated come into contact, alleles from the same ancestral population remain associated with each other in the admixed population until they are dissociated by recombination. If allele frequencies had diverged between the ancestral populations, this “ancestry disequilibrium” can translate to cis-LD between loci affecting a trait [67], potentially confounding GWASs performed in the admixed population. More generally, long-range LD will be an issue when there is genetic stratification and ongoing migration between somewhat genetically distinct groups.

For concreteness, we again consider a simple model where 2 populations have been separated for some time, allowing allele frequencies to diverge between them. The populations then come into contact and admix in the proportions A and 1−A. We assume that mating is random with respect to ancestry in the admixed population.

Suppose that, just before admixture, the frequencies of the focal allele at a given locus k were pk(1) and pk(2) in the 2 populations. Then, the initial degree of cis-LD between loci λ and l in the admixed population is given by Eq (12), weighted by the proportions in which the populations admix:

Dλl,0(A)=A(1A)(pλ(1)pλ(2))(pl(1)pl(2)); (16)

see, e.g., ref. [68]. This cis-LD subsequently decays at a rate cλl per generation, so that, t generations after admixture,

Dλl,t(A)=Dλl,0(A)(1cλl)t=A(1A)(pλ(1)pλ(2))(pl(1)pl(2))(1cλl)t. (17)

Because we assume that mating is random in the admixed population, the trans-LD is zero in every generation after admixture: D˜λl,t(A)=0. Note that the decay of cis-LD in an admixed population will be slowed if individuals mate assortatively by ancestry, because the trans-LD generated by assortative mating is continually converted by recombination to new cis-LD (as in our assortative mating model above; see ref. [69] for more discussion of this point in the context of population admixture).

Allele frequency divergence due to drift

How do these patterns of LD affect a population GWAS? If allele frequency differences between populations arose from neutral drift, they will be independent of effect sizes at causal loci and across loci, and therefore will not contribute, on average, a systematic directional bias to effect-size estimates. However, they will inflate the average squared effect-size estimate, by a smaller amount than for a population GWAS performed when the populations were still separated (because of the elimination of trans-LD under random mating in the admixed population). Moreover, this amount will decline in the generations after admixture as the remaining cis-LD is eroded by recombination (Eq (17); Fig 5). We quantify these effects in S1 Text Section S3.3 (see also refs. [38,68,70,71]).

Although within-family GWASs were not genetically confounded when the populations were separate (because cis- and trans-LDs were equal, as discussed above), they become genetically confounded in the admixed population, as all trans-LD is eliminated by random mating in the admixed population, leaving an excess of cis-LD relative to trans-LD that biases effect-size estimates (Eqs (6) and (7)). As in the case of the population GWAS, these biases will be zero on average if allele frequency differences between the ancestral populations were due to drift. However, after admixture, they will still inflate the average squared effect-size estimate (and thus the variance of effect-size estimates), which will thereafter decline in subsequent generations as the cis-LD is gradually broken down by recombination (Eq (17); Fig 5).

In comparing the average squared effect-size estimate in a population and a family-based GWAS, we observe that the value in the population GWAS rapidly declines to approximately the same level as the value in the within-family GWAS, despite the former having started at a much higher level in the initial admixed population (Fig 5). The explanation is that LD between unlinked loci confounds effect-size estimation in the population GWAS but not the within-family GWAS, such that (i) the average squared effect-size estimate from the population GWAS is initially much higher than that from a within-family GWAS, because it is inflated by LD between many more pairs of loci; and (ii) the average squared effect-size estimate from the population GWAS declines more rapidly, because LD between unlinked loci is broken down more rapidly than LD between linked loci.

Allele frequency divergence due to selection or phenotype-biased migration

In addition to drift, and as discussed above, selection and phenotype-biased migration can generate systematic, signed (effect-size correlated) LD, which would lead to systematic cis-LD in the descendent admixed population. These would lead to larger inflations of genetic variance and genetic correlations than would be expected had allele frequency divergence between the ancestral populations been due to drift alone, and would complicate interpretations of genetic correlations as being due to pleiotropy. Moreoever, if the admixed population is more than a few generations old such that LD between unlinked loci but not linked loci has largely been broken down, then population- and family-based estimates of these quantities might be similar.

Spurious genetic correlations due to confounding in population-based PGSs

Factors other than selection and phenotype-biased migration can also generate non-pleiotropic genetic correlation signals in family-based studies of admixed populations. In fact, the use of confounded population GWAS effect sizes can be sufficient. As an example of the confounding of genetic correlations in admixed populations due to a confounded GWAS for one trait, consider the GIANT-GWAS height PGS. Owing to confounding within Europe [20,21], the height PGS showed large differences between Northern Europeans and sets of individuals sampled in other locations, such as the African 1,000 genomes samples [72]. This confounding generated a spurious, systematic correlation between height effect sizes and allele frequency differences across populations, with height-increasing alleles that are more common among Northern Europeans being assigned larger effects [20]. As a result, in a PGS constructed from these effect-size estimates, larger PGS values are predictive of greater North European ancestry. Now imagine a sibling-based study performed in a sample with recently admixed “European” and “non-European” ancestry—African Americans, for example. An individual with a larger value than their sibling for the GIANT height PGS will, on average, carry more “European” ancestry. In African Americans, there will also be a systematic association of lighter skin pigmentation with recent “European” ancestry, and selection on skin pigmentation will have driven a signed difference in allele frequencies between European and West African ancestors. Putting these observations together, the GIANT height PGS, being predictive of the degree of European ancestry, may well be predictive of skin pigmentation differences between African American sibling pairs (Eq (11)), leading to the naive and incorrect conclusion that height and skin color are causally linked. In reality, this result would reflect the fact that alleles predicted to increase height and alleles that affect skin color are in systematic effect-signed admixture LD, as in Eq (16), as a consequence of stratification-biased effect-size estimates from the GIANT European GWAS.

3.4 Stabilizing selection

Stabilizing selection—selection against deviations from an optimal phenotypic value—is thought to be common [73], and has recently been argued to be consistent with the genetic architectures of many human traits [60]. By disfavoring individuals with too many or too few trait-increasing alleles, stabilizing selection generates negative LD among alleles with the same directional effect on the trait [10]. Thus, stabilizing selection will attenuate GWAS effect-size estimates at genotyped loci that tag these causal loci.

To quantify these biases, we consider the model of Bulmer [10,11], in which a large number of loci contribute to variation in a trait under stabilizing selection, with the population having adapted such that the mean trait value is equal to the optimum. Under this model, stabilizing selection rapidly reduces variance in the trait by generating, within each generation, negative cis- and trans-LD among trait-increasing alleles; this negative LD is then partially transmitted (because of recombination) to the next generation in the form of cis-LD.

If we make the simplifying assumption that all loci have equal effect sizes, then the equilibrium reduction in trait variance in a given generation, −d* (where d*<0), measured before the action of selection in that generation, can be calculated as a function of the genic variance Vg, the environmental noise VE, the strength of stabilizing selection VS/VP (scaled according to the phenotypic variance VP), and the harmonic mean recombination rate, c¯h, among loci underlying variation in the trait ([11]; S1 Text Section S3.4). Selection within the same generation will reduce the trait variance further, to a degree that can also be calculated as a function of Vg, VE, VS/VP, and c¯h (S1 Text Section S3.4; [11]).

Under the assumption of equal effect sizes across loci, we calculate in S1 Text Section S3.4 the average per-locus attenuation bias in effect-size estimates induced by stabilizing selection, (αlα^l)/αl. In a population GWAS performed before selection has acted in the sampled individuals’ generation, the attenuation bias is approximately

αlα^lpopαl=d*Vg,

while in a population GWAS performed after selection has acted in the sampled individuals’ generation, the attenuation bias is approximately

αlα^lpopαl=d*(1+2c¯h)Vg;

that is, a factor 1+2c¯h greater than in the pre-selection population GWAS.

In a within-family GWAS, the average proportionate bias in effect-size estimation is approximately

αlα^lfamαl=d*(12c¯h)Vg,

irrespective of whether the GWAS is performed before or after selection has acted (or is ongoing) in the generation from which the sample is drawn. The attenuation bias in a family-based GWAS is therefore smaller than that in a pre-selection population GWAS by a factor of 12c¯h, and smaller than that in a post-selection population GWAS by a factor of (12c¯h)/(1+2c¯h).

Thus, the bias in effect-size estimation can be calculated given estimates of the phenotypic variance and heritability of the trait, the harmonic mean recombination rate, and the strength of stabilizing selection (S1 Text Section S3.4). In the Methods, making some simplifying assumptions about the genetic architecture of the trait in question, we calculate an approximate value c¯h0.464 for humans. Using this value, Fig 6 shows the average proportionate reduction in GWAS effect-size estimates for various strengths of stabilizing selection and heritabilities of the trait. The range of selection strengths was chosen to match that inferred for human traits by Sanjak and colleagues [74].

Fig 6. Stabilizing selection attenutates GWAS effect-size estimates.

Fig 6

The calculations displayed here assume that genetic variation in the trait is contributed by 1,000 loci of equal effect spaced evenly along the human genome. Stabilizing selection is stronger if the width of the selection function scaled by the phenotypic variance, VS/VP, is smaller. The placement of the point for human height assumes a heritability of 0.8 and a strength of stabilizing selection of VS/VP = 30, as estimated in ref. [74]. Details of these calculations can be found in S1 Text Section S3.4. Note the different scales of the y-axes in (A), (B), and (C).

Attenuation of effect-size estimates is larger if stabilizing selection is stronger or if the trait is more heritable. Taking height as an example, heritability is ~0.8, VP≈7cm2, and Sanjak and colleagues [74] estimate a sex-averaged strength of stabilizing selection of VS/VP≈30. From these values, we calculate that a pre-selection population GWAS would systematically underestimate effect sizes at loci that causally influence height by about 2.7% on average, in the absence of other sources of LD (Fig 6A), while a post-selection population GWAS would systematically underestimate effect sizes by about 5.2%, on average (Fig 6B). More generally, within the range of reasonable strengths of stabilizing selection inferred by Sanjak and colleagues [74], we calculate average attenutations of pre-selection population-based effect-size estimates of up to 5% for highly heritable traits (h2≈1) under strong stabilizing selection (VS/VP≈20), down to 0.25% for less heritable traits (h2≈0.4) under weak stabilizing selection (VS/VP≈170) (Fig 6A); the analogous range is 0.5% to 10% for post-selection population-based effect-size estimates (Fig 6B).

Given the estimate c¯h0.464, the proportionate bias that stabilizing selection induces in within-family GWASs is expected to be a fraction 12c¯h7.2% that in pre-selection population GWASs and (12c¯h)/(1+2c¯h)3.7% that in post-selection population GWASs. Thus, for height, a within-family GWAS would underestimate effect sizes by only about 0.2% on average (Fig 6C).

The quantitative importance of these biases will vary by application. In situations where the goal is gene discovery, for example, 5% to 10% reductions in effect-size estimates are unlikely to flip the statistical significance of variants with large effects on a trait. However, the attenuations in effect-size estimates caused by stabilizing selection are systematic across loci, and therefore could substantially affect aggregate quantities based on these estimates. For example, the range of average reductions in population effect-size estimates calculated above for human traits would translate to reductions in naive estimates of SNP-based heritabilities of between 0.5% and 20% (approximately 5.4% in the case of height using effect-size estimates from a pre-selection population GWAS; approximately 10% using estimates from a post-selection population GWAS). If effect sizes are estimated by within-family GWAS, on the other hand, the reductions in these SNP-based heritability estimates would be much smaller.

As a further example, by generating negative LD between alleles with the same directional effect on the trait, the impact of stabilizing selection opposes, and therefore masks, the genetic impact of assortative mating [75]. A practical consequence is that stabilizing selection will tend to attenuate estimates of the strength of assortative mating based on GWAS effect sizes, which often use cross-chromosome correlations of PGSs (e.g., [57,76]). In humans, the phenotypic correlation among mates for height has been measured at about approximately 0.25 [58]. In S1 Text Section S3.4, we calculate that estimates of this correlation based on cross-chromosome correlations in PGSs will be biased downwards by about 20% (to approximately 0.20) because of stabilizing selection on height, if selection has not yet acted in the measured generation, and by about 40% (to 0.15) if selection has acted in the measured generation. Were assortative mating weaker, or stabilizing selection stronger, the genetic impact of assortative mating would be masked to an even greater extent (S1 Text Section S3.4).

As in our analysis of assortative mating above, if stabilizing selection ceases in some generation, the negative LD that built up during the period of stabilizing selection will decay over subsequent generations, rapidly for pairs of loci on different chromosomes and more slowly for linked pairs of loci. Patterns of selection on human traits have changed over time—for example, the strength of stabilizing selection on birth weight has relaxed [77]. In general, therefore, patterns of confounding reflect a composite of contemporary and historic processes.

3.5 Sibling indirect effects

Indirect effects of siblings’ genotypes on each other’s phenotypes are known to be a potential source of bias in sibling-based GWASs [34,52], and can be measured and corrected for only if, in addition to sibling genotypes, parental genotypes are also available, either directly or via imputation [27,34]. To generate intuition for their impact on GWASs, we consider a simple model of indirect sibling effects in the absence of G×E interactions and other confounding effects, focusing on a single-locus model for simplicity. We suppose that the indirect effect of an individual’s phenotype on their sibling’s (same) phenotype is β, so that the phenotypes of 2 siblings i and j can be written as follows:

Yi=Y*+αgi+βYj+ϵi,
Yj=Y*+αgj+βYi+ϵj. (18)

Taking their difference and rearranging, we find that

ΔY=α1+βΔg+11+βΔϵ. (19)

Therefore, in the absence of genetic confounding and G×E interactions, a sibling-based association study would return an effect-size estimate of

α^sib=α1+β (20)

in expectation. Thus, if sibling indirect effects are synergistic (β>0), they lead to underestimation of the direct genetic effect at the locus. In contrast, if sibling indirect effects are antagonistic (β<0), they lead to overestimation of the direct genetic effect.

How would a population GWAS be affected by the same sibling indirect effects? Sibling i’s phenotype can be written as follows:

Yi=Y*+αgi+βYj+ϵi
=Y*+αgi+β(Y*+αgj+βYi+ϵj)+ϵi
Yi=11β2(Y**+αgi+αβgj+ϵi+βϵj), (21)

where Y** = (1+β)Y*. Therefore, if we were to randomly choose 1 sibling from each sibship and estimate the effect size at the locus using a population association study across families, we would obtain

α^pop=Cov(gi,Yi)Var(gi)=11β2(α+αβrgsibs), (22)

where rgsibs=Cov(gi,gj)/Var(gi) is the genotypic correlation between sibs at the locus. Sibling indirect effects alter the effect-size estimate in a population GWAS via 2 channels. The first is through the factor 1/(1−β2) in Eq (22), which reflects second-order feedbacks of an individual’s phenotype on itself, via the sibling (see also ref. [78]). Since 1/(1−β2)>1, these feedbacks act to exacerbate the effects of causal alleles. For example, if sibling indirect effects are antagonistic (β<0), then a sibling with a large trait value will tend to indirectly reduce the trait value of their sibling, which in turn will indirectly further increase the trait value of the focal individual. This channel therefore pushes population GWASs towards overestimating the magnitude of direct genetic effects.

The other channel by which sibling indirect effects can influence a population GWAS is driven by the genotypic correlation among siblings, and is easiest to understand if we assume that sibling indirect effects are weak (β2≪1). In this case, α^popα+αβrgsibs. Since the genotypic correlation rgsibs>0, this channel of sibling indirect effects has the opposite effect to the one it has on a sibling GWAS: if sibling indirect effects are synergistic (β>0), the population GWAS overestimates the direct genetic effect at the locus, while if sibling indirect effects are antagonistic (β<0), the population GWAS underestimates the direct genetic effect. The reason for this difference is that a sibling GWAS is based on siblings whose genotypes differ at the focal locus, and whose genotypic values are therefore anticorrelated. If sibling indirect effects are synergistic (β>0), they will tend to attenuate the phenotypic differences between such siblings, and therefore attenuate effect-size estimates. In contrast, because siblings’ genotypes are positively correlated across the entire population, synergistic sibling indirect effects (β>0) will tend to exacerbate phenotypic differences across families, leading a population GWAS to overestimate effect sizes.

4 Discussion

It has long been recognized that population GWASs in humans can be biased by environmental and genetic confounding [1,4]. Currently, population GWASs attempt to control for these confounds by focusing on sets of individuals that are genetically more similar and by controlling for population stratification. However, these controls are imperfect and are not always well defined. For example, controlling for genome-wide patterns of population stratification based on common alleles does not control for the genetic and environmental confounding of rare variants [79]. Work on genetic confounding has uncovered increasing evidence that assortative mating may be leading to large biases in estimates of direct genetic effects and to large genetic correlations for a number of traits [22,37,57]; moreover, it can often be unclear whether genetic signals of assortative mating are due to trait-based mate choice or some other more general form of genetic confounding (e.g., [80]). Additionally, while we have focused primarily on genetic confounding, for a number of traits there are also signals of residual environmental confounding in GWAS signals [31,32,35,81]. Thus, subtle and often interwoven forms of genetic and environmental confounding remain a major issue in many GWASs [34], compromising the interpretation of GWAS effect-size estimates and downstream quantities such as SNP heritabilities and genetic correlations.

Effect-size estimates from within-family GWASs are less affected by these various confounds. They are not subject to environmental confounding across families, because the environments of family members are effectively randomized with respect to within-family genetic transmission. As we have shown, family-based estimates should also suffer substantially less from genetic confounding, because genetic transmission at unlinked loci (but not linked loci) is randomized by independent assortment of chromosomes in meiosis. Nonetheless, family-based GWASs can suffer from residual genetic confounding as well as sibling indirect effects (in sibling-based designs), and causal interpretations of the estimates they produce are complicated by G×E and G×G interactions [44]; they also raise a number of further conceptual problems that we discuss below.

Sources of genetic confounding

Genetic confounding is caused by long-range LD between loci that affect the trait or traits under study. To illustrate the potential for genetic confounds to bias GWAS effect-size estimates, we have considered several sources of long-range LD. Some of these—assortative mating, selection on GWAS traits, and phenotype-biased migration—can cause systematic directional biases in GWAS effect-size estimates. Others, such as neutral population structure, cause random biases that influence the variance of effect-size estimates and related quantities. Assortative mating and neutral population structure have received considerable theoretical attention in the GWAS literature (e.g., [22,37,38,57]). Here, we have further outlined how both selection and phenotyped-biased migration can drive systematic genetic confounding that may not be well accounted for by current methods of controlling for stratification (see also ref. [81]).

We wish to emphasize stabilizing selection in particular as a potential source of systematic confounding in GWASs. Stabilizing selection has been well studied in the quantitative genetics literature but less so in the context of GWASs, despite its expected ubiquity. By selecting for compensating combinations of trait-increasing and trait-decreasing alleles, stabilizing selection generates negative LD between alleles with the same directional effect on the trait [10,11], and can therefore bias GWAS effect-size estimates downwards. While the potential for stabilizing selection to confound effect-size estimation has been noted (e.g., [66,75,82]), the resulting biases have not, to our knowledge, been quantified. Our calculations suggest that these downward biases could, for some human traits, be as large as 5% to 10% systematically across all causal loci in population GWASs. While biases of this magnitude are unlikely to compromise some goals of GWASs, such as gene discovery, they could be quantitatively problematic for other GWAS aims, such as estimation of SNP heritabilities and the strength of assortative mating. Moreover, while our results pertain to (a particular model of) stabilizing selection, many kinds of selection generate LD between genetically distant loci—in fact, only multiplicative selection among loci does not ([83], pgs. 50 and 177). Therefore, the general result that selection can generate genetic confounding will hold more broadly. Even in the absence of natural and sexual selection, the ascertainment of samples for a population- or family-based GWAS is a form of selection. When participation in a GWAS sample is based partly on a genetically influenced phenotype [84], this sample-selection bias will generate LD between loci underlying participation. In family studies, this participation bias can violate the assumption of random assignment of genotypes, and thus potentially undermine the interpretation of these studies’ results.

For a given genotyped locus in a GWAS, there is no bright line between local “tagged” LD and long-range confounding LD, and one reasonable objection to the approach taken here is that that we have used an arbitrary definition of the causal loci that are locally tagged by a genotyped locus (Llocal in Eq (2)). All of the sources of genetic confounding that we have considered generate LD among causal loci both within and across chromosomes. Under these models, the within-chromosome LD that is generated is, in a sense, a continuation of the LD generated across chromsomes (moving from a recombination rate = 0.5 to ≤ 0.5). Thus, while investigators may prefer some looser definition of “local” when thinking about genotyped GWAS loci as tag SNPs, to extend that definition to include all loci on the same chromosome as the SNP would, by reasonable interpretation, be to include confounding into the desired estimator.

The extent to which the absorption of genetic confounding in estimated effect sizes is a problem depends on the application. In the case of polygenic prediction, the absorption of environmental effects, indirect effects, and the effects of untyped loci throughout the genome can help to improve prediction accuracy, although this does come at a cost to interpretability (and potentially also to portability across contexts). For GWAS applications focused on understanding genetic causes and mechanisms, the biases in effect-size estimates and spurious signals of pleiotropy among traits generated by genetic confounding will be more problematic.

Indirect genetic effects

Family GWASs are often interpreted as providing the opportunity to ask to what extent parental genotypes (or other family genotypes) causally affect a child’s phenotype (“genetic nurture” [27]). Viewed in this way, the association between untransmitted parental alleles and the child’s phenotype would seem, at first, a natural estimate of indirect genetic effects.

In practice, however, if the population GWAS suffers from genetic and environmental confounds, then the estimated effects of untransmitted alleles will absorb that confounding in much the same way that estimates of direct genetic effects from a population GWAS do (Eqs (8) and (9)) [50]. For example, in the case of assortative mating, a given untransmitted allele is correlated with alleles that were transmitted both by this parent and by their mate, and these transmitted alleles can directly affect the offspring’s phenotype. Thus, while family-based estimates of direct genetic effects benefit from the randomization of meiosis and from controlling for the environment, family-based estimates of indirect genetic effects lack both of these features and should be interpreted with caution. Indeed, recent work using parental siblings to control for grandparental genotypes has shown that little of the estimated “indirect genetic effect” may be causally situated in parents [36]. With empirical estimates of indirect genetic effects potentially absorbing a broad set of confounds [34,85], and few current studies of indirect effects having designs that allow such confounding to be disentangled, it is premature—and potentially invalid—to interpret associations of untransmitted alleles causally in terms of indirect genetic effects [3]. Rather, they should be treated agnostically in terms of “non-direct” effects.

Direct genetic effects

Mendelian segregation provides a natural randomization experiment within families [86], and so crosses in experimental organisms and family designs have long been an indispensable tool to geneticists in exploring genetic effects and causation. Growing concerns about GWAS confounding and the increasing availability of genotyped family members have led to a return of family-based studies to the association study toolkit [2]. Family-based estimates of direct genetic effects are often interpreted as being unbiased and discussed in terms of the counterfactual effect of experimentally substituting one allele for another [34,87,88].

As we have shown, family-based GWASs are indeed less subject to confounding than population-based GWASs: in the presence of genetic and environmental confounding, the family-based estimate of the effect size at a given locus provides a much closer approximation to the true effects of tightly linked causal loci than a population-based estimate does. The family-based estimate is not biased by environmental variation across families and avoids the correlated effects of the many causal loci that lie on other chromosomes. Still, the family-based estimate does absorb the effects of non-local causal loci on the same chromosome, and so cannot truly be said to be free of genetic confounding. Rather than considering a single allele being substituted between individuals, a better experimental analogy for the effect-size estimate would be to say that we are measuring the mean effect of transmission of a large chunk of chromosome surrounding the focal locus, potentially carrying many causal loci.

In addition, while within-family GWASs offer these advantages, in other ways, they move us further away from the questions about the sources and causes of variation among unrelated individuals that motivate population GWASs in the first place. Indeed, the presence of confounding and of G×E/G×G interactions introduces a number of conceptual issues in moving from within-family GWAS to the interpretation of differences among individuals from different families [44,89,90]. For example, in the presence of genetic confounding, the estimated effect of a causal allele of interest will depend on a set of weights: its LD to many other causal alleles. Family-based approaches weight these LD terms differently to population-based approaches, which, we argue, can complicate the interpretation of these estimates. For example, when previously isolated populations admix, same-ancestry alleles will be held together in long genomic blocks until these are broken up by recombination, which will happen very quickly for alleles on different chromosomes but more slowly for alleles on the same chromosome. A few generations after admixture, therefore, cross-chromosome ancestry LD will largely have dissipated, but contiguous ancestry tracts will still span substantial portions of chromosome lengths. Since both population and within-family GWASs are similarly confounded by the same-chromosome LD, their mean squared effect sizes will be similar in this case (Fig 5). Bearing in mind that the LD resulting from admixture is not present in the source populations, it becomes unclear which weighting of ancestry LD is appropriate if we want to interpret the resulting effect-size estimates as direct effects. As this example illustrates, while family-based GWASs are a useful device for dealing with confounding, it is not always obvious how to interpret the quantities that they measure.

A number of additional complications arise when, to compensate for the small effect sizes of individual loci, researchers combine many SNPs into a PGS and study the effects of PGSs within families (or use them as instruments in mendelian randomization analyses). For one, SNPs are usually chosen for inclusion in the PGS on the basis of their statistical significance in a population GWAS. This approach prioritizes SNPs whose effect-size estimates are amplified (or even wholly generated) by confounding (for an example of how this leads to residual environmental confounding in applications of sibling-based effect-size estimates, see ref. [79]). Second, the weights given to SNPs that are included in the PGS absorb the effects of confounding, and this confounding is heterogeneous across SNPs. Thus, when we study the correlates of trait-A PGS differences between siblings in the presence of GWAS confounding, we are not observing the average phenotypic outcomes of varying the genetic component of trait A between siblings. Rather, we are varying a potentially strangely weighted set of genetic correlates of trait A.

An observation that a population-GWAS PGS is predictive of phenotypic differences among siblings demonstrates that the PGS SNPs tag nearby causal loci, but beyond that, interpretation is difficult. Notably, if there is cross-trait assortative mating for traits A and B, but no pleiotropic link between the traits, then some of the SNPs identified as significant in a GWAS on trait A may be tightly linked to loci that causally affect trait B but not trait A. If these loci are included in the trait-A PGS, then when we study the effect of variation in the trait-A PGS on sibling differences, we are accidentally absorbing some components of the variation in trait B across siblings. Thus, we might observe a correlation between the trait-A PGS and differences in trait B between siblings, and this correlation may be lower than is observed at the population level, without there existing any pleiotropic (or causal) link between A and B. These effects can be exacerbated if the 2 traits have different genetic architectures (Fig 4). Instead of using a set of SNPs and weights from a population GWAS, genetic correlations between traits due to pleiotropy could be estimated from the correlation of effect sizes estimated within families [33]. Given current sample size constraints in family-based studies, the confidence intervals on these estimates are large. Moreover, significant family-based correlations need not reflect pure pleiotropy, since, as we have shown, they are not completely free of genetic confounding due to intra-chromosomal LD.

Also complicating the interpretation of family-based effect-size estimates are various types of interactions. Indirect effects between siblings can bias family estimates of direct genetic effects (Eq (20); [2,34,52]) in ways that are conceptually different from the biases they introduce to population-based estimates (Eq (22)). These sibling effects can potentially be addressed with fuller family information (e.g., parental genotypes in addition to sibling genotypes [27,34]).

In summary, family-based studies are a clear step forward towards quantifying genetic effects, with large-scale family studies carrying the potential to resolve long-standing issues in human genetics. However, these designs come with their own sets of caveats, which will be important to understand and acknowledge as family-based genetic studies become a key tool in the exploration of causal effects across disparate fields of study.

5 Methods

All simulations were carried out in SLiM 4.0 [53]. Code is available at https://doi.org/10.5281/zenodo.10520811.

For the purpose of carrying out sibling association studies in our simulations, we assumed a simple, monogamous mating structure: each generation, each female, and each male is involved in a single mating pair, and each mating pair produces exactly 2 offspring (who are therefore full siblings). To maintain the precisely even sex ratio required by this scheme, we assumed that a quarter of mating pairs produce 2 daughters, a quarter produce 2 sons, and half produce a son and a daughter. Population sizes were chosen to ensure that these numbers of mating pairs were whole numbers, and mating pairs were permuted randomly each generation before assigning brood sex ratios (to ensure that no artifact was introduced by SLiM’s indexing of individuals).

Each generation, per-locus effect size estimates were calculated for both population-wide and sibling GWASs. The former were calculated as the regression of trait values on per-locus genotypes, while the latter were calculated as the regression of sibling differences in trait values on sibling differences in per-locus genotypes.

In all simulations, the total population size was N = 40,000.

Assortative mating

For our general cross-trait assortative mating setup, traits 1 and 2 are influenced by variation at sets of bi-allelic loci L1 and L2, respectively. The effect sizes of the reference allele at locus l on traits 1 and 2 are αl and βl, respectively. An individual’s PGS is then P1=lL1glαl for trait 1 and P2=lL2glβl for trait 2. In all the scenarios we simulated, traits had heritability 1, so that individuals’ trait values are the same as their PGSs.

Our aim is to simulate a scenario where assortative mating is based on females’ values for trait 1 and males’ values for trait 2, such that, across mating pairs, the correlation of the mother’s PGS for trait 1, P1m, and the father’s PGS for trait 2, P2f, is a constant value ρ (in all of our simulations, ρ = 0.2). To achieve this, we use an algorithm suggested by Zaitlen and colleagues [69]: At the outset, we choose an accuracy tolerance ε such that, if by some assignment of mates the correlation of their PGSs falls within ε of the target value ρ, we accept that assigment. Each generation in which assortative mating occurs, we rank females in order of their PGSs for trait 1, and males in order of their PGSs for trait 2. We then calculate the PGS correlation across mating pairs, ρ0, if females and males were matched according to this ranking. If this (maximal) correlation is smaller than the upper bound of our target window (ρ0<ρ+ε, which very seldom occurred in our simulations), then females and males mate precisely according to their PGS rankings and we move on to the next generation. If, instead, ρ0 exceeds ρ+ε, then we follow the following iterative procedure until we have found a mating structure under which the correlation of PGSs falls within ε of the target value ρ.

First, we choose initial “perturbation sizes” ξ0 and ξ1 = 2ξ0. Suppose that, in iteration k of the procedure, the perturbation size is ξk and the chosen mating structure leads to a correlation among mates of ρk. If |ρkρ|<ε, we accept the mating structure and move on to the next generation. Otherwise, we choose a new perturbation size ξk+1: (i) if ρk−1, ρk>ρ, then ξk+1 = 2ξk; (ii) if ρk−1>ρ>ρk or ρk−1<ρ<ρk, then ξk+1 = (ξk−1+ξk)/2; (iii) if ρk−1, ρk<ρ, then ξk+1 = ξk/2. Once we have chosen ξk+1, for each individual we perturb their PGS (trait 1 for females; trait 2 for males) by a value chosen from a normal distribution with mean 0 and standard deviation ξk+1, independently across individuals. We then rank females and males according to their perturbed PGSs, and calculate the correlation ρk+1 of their true PGSs if they mate according to this ranking. (Since, in our experience, there can be substantial variance in the ρk+1 values that result from this procedure, we repeat it 5 times and choose the mating structure that produces the value of ρk+1 closest to the target value ρ.) We then decide if another iteration—i.e., another perturbation size ξk+2—is required.

Fig 2. Cross-trait assortative mating for traits with the same genetic architecture

In the simulations displayed in Fig 2, ρ = 0.2, and traits 1 and 2 have identical but non-overlapping genetic architectures: L1 and L2 are distinct sets of 500 loci each, with αl = 1 and βl = 0 for lL1, and αl = 0 and βl = 1 for lL2. Loci in L1 and L2 alternate in an even spacing along the physical (bp) genome. Fig 2A shows results for the “single chromosome” case where the recombination fraction between adjacent loci is c = 1/999 in both sexes (such that the single-chromosome genome receives, on average, one crossover per transmission). Fig 2B shows results for the case where recombination fractions between loci are calculated from the human female and male linkage maps generated by Kong and colleagues [54]. In both cases, we assumed no crossover interference.

At each locus, the initial frequency of the reference allele was 1/2, with reference alleles assigned randomly across diploid individuals and independently across loci such that, in expectation, Hardy–Weinberg and linkage equilibrium initially prevail. The assortative mating algorithm above was run for 19 generations, with a target correlation ρ = 0.2, a tolerance parameter ε = ρ/100, and an initial perturbation size ξ0=4[max({{P1m},{P2f}})min({{P1m},{P2f}})]. Thereafter, assortative mating was switched off, with mating pairs (still monogamous) being chosen randomly.

Fig 3. Same-trait assortative mating

The algorithm we followed to ensure assortative mating of a given strength was the same as that for Fig 2 above, but here traits 1 and 2 are identical, and 1,000 loci underlie variation in the trait, and are evenly spread along the physical genome. The effect size of the reference allele at each locus was drawn from a normal distribution with mean 0 and standard deviation 1, independently across loci. The initial frequency of the reference allele at each locus was drawn, independently across loci, from a uniform distribution on [MAF, 1−MAF]; in our simulations, we chose a minimum minor allele frequency of MAF = 0.1. Since here we are interested in quantifying the mean squared effect size estimate, which is directionally affected by drift-based local LD that may not be present in our initial configuration, we allowed 150 generations of random mating before switching on assortative mating (only the final 20 generations of this random mating burn-in are displayed in Fig 3). Assortative mating occurred for 20 generations, after which random mating occurred for a further 20 generations.

Fig 4. Cross-trait assortative mating for traits with different architectures

For Fig 4A, we again followed a similar procedure to that for Fig 2 above, but now, while traits 1 and 2 have distinct genetic bases, the numbers of loci contributing variation to traits 1 and 2 are |L1| = 100 and |L2| = 1,000. Trait-1 loci are placed evenly along the physical genome, with trait-2 loci then evenly spaced among the trait-1 loci; we used the human linkage map for these simulations. At both trait-1 and trait-2 loci, the initial frequency of the focal allele was drawn from a uniform distribution on [MAF, 1−MAF], with MAF = 0.1. At trait-2 loci, true effect sizes were randomly drawn from a normal distribution with mean zero and standard deviation 1; at trait-1 loci, true effect sizes were randomly drawn from a normal distribution with mean zero and standard deviation 10, so that traits 1 and 2 have equal genic variances. After a burn-in of 150 generation of random mating, assortative mating was switched on. We performed a population GWAS at the end of the period of random mating and after 20 generations of assortative mating. These GWASs were performed across 1,000 replicate trials, with the effect size estimates then pooled across trials. From these, we estimated the densities of the absolute values of effect size estimates using Matlab’s kernel density estimator ksdensity, specifying that the support of the distributions be positive. For Fig 4B, as a control, we also carried out the same simulations with |L1| = |L2| = 1,000, drawing effect sizes from a standard normal distribution.

Fig 5. Population structure and admixture

We wished first to simulate a situation where 2 populations of size N/2 have been separated for a length of time such that the value of FST between them is some predefined level (in our case, a mean FST per locus of 0.1). To do so without having to run the full population dynamics of 2 allopatric populations for a prohibitively large number of generations, we simply assigned allele frequencies to achieve the desired level of FST. We assumed 1,000 loci spread evenly over the physical genome. At each locus l, we chose an “ancestral” frequency pla for the reference allele independently from a uniform distribution on [MAF, 1−MAF], with MAF = 0.2. We then perturbed this allele frequency in populations 1 and 2 by independent draws from a normal distribution with mean 0 and variance 2pla(1pla)FST; if a perturbed allele frequency fell below 0 or above 1, we set it to 0 or 1, respectively. The population dynamics described above, with monogamous mating pairs chosen randomly, were then run for 50 generations.

In generation 50, the 2 populations merge, forming an admixed population of size N. The same population dynamics, with monogamous mating pairs chosen randomly, were then run for a further 50 generations.

Fig 6. Stabilizing selection

To calculate the bias in GWAS effect size estimation caused by stabilizing selection, we must first calculate the harmonic mean recombination rate. We focus on humans and consider only the autosomal genome. The set of loci underlying variation in the trait is L, which we apportion among the 22 autosomes according to their physical (bp) lengths (as reported in GRCh38.p11 of the human reference genome; https://www.ncbi.nlm.nih.gov/grc/human/data?asm=GRCh38.p11). For each chromosome, we spread its allotment of loci evenly over its sex-averaged genetic (cM) length, using the male and female linkage maps produced by Kong and colleagues [54]. (We use genetic lengths instead of physical lengths because, were we to spread loci evenly over the physical lengths of the chromosomes, some pairs of adjacent loci on some chromosomes might have a sex-averaged recombination fraction of 0, in which case the harmonic mean recombination rate would be undefined.) For each pair of linked loci, the recombination rate between them was estimated separately from the male and female genetic distance between them using Kosambi’s map function [91]. Pairs of loci on separate chromosomes have a recombination fraction of 1/2. With the sex-averaged recombination fraction cll thus calculated for every pair of loci (l,l′), the harmonic mean recombination fraction was calculated as c¯h=(|L|2)/(l,l1cll), where (|L|2)=|L|(|L|1)/2 is the number of pairs of distinct loci in L.

Performing this calculation with |L| = 1,000 loci, we obtain an estimate of c¯h0.464 for human autosomes. Substituting this estimate into S1 Text Eqs (S.106), (S.107), and (S.108) then defines the curves plotted in Figs 6A, 6B and 6C, respectively.

Supporting information

S1 Text. Supplementary information.

(PDF)

pbio.3002511.s001.pdf (486.1KB, pdf)
S1 Fig. Cross-trait assortative mating influences effect-size estimates at loci that affect the study trait, although this influence is second-order relative to that on effect-size estimates at loci that do not affect the study trait but do affect the other trait involved in assortative mating (cf. Fig 2; note the scales of the y-axes).

Simulations are the same as in Fig 2; the code can be found at https://doi.org/10.5281/zenodo.10520811.

(PDF)

pbio.3002511.s002.pdf (433.5KB, pdf)

Acknowledgments

We thank Jeremy Berg, Doc Edge, Arbel Harpak, Gibran Hemani, Hanbin Lee, Molly Przeworski, Alex Young, and members of the Coop lab for helpful discussions and comments on earlier drafts.

Data Availability

The authors confirm that all data underlying the findings are fully available without restriction. Simulation code is available at https://doi.org/10.5281/zenodo.10520811 and https://github.com/cveller/confoundedGWAS.

Funding Statement

Funding was provided by the National Institutes of Health (NIH R35 GM136290 awarded to GC) and a Branco Weiss fellowship to CV. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Vilhjálmsson BJ, Nordborg M. The nature of confounding in genome-wide association studies. Nat Rev Genet. 2013;14(1):1–2. doi: 10.1038/nrg3382 [DOI] [PubMed] [Google Scholar]
  • 2.Young AI, Benonisdottir S, Przeworski M, Kong A. Deconstructing the sources of genotype-phenotype associations in humans. Science. 2019;365(6460):1396–1400. doi: 10.1126/science.aax3710 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Wolf JB, Brodie ED III, Cheverud JM, Moore AJ, Wade MJ. Evolutionary consequences of indirect genetic effects. Trends Ecol Evol. 1998;13(2):64–69. doi: 10.1016/s0169-5347(97)01233-0 [DOI] [PubMed] [Google Scholar]
  • 4.Lander ES, Schork NJ. Genetic dissection of complex traits. Science. 1994;265(5181):2037–2048. doi: 10.1126/science.8091226 [DOI] [PubMed] [Google Scholar]
  • 5.Freeman G. Statistical methods for the analysis of genotype-environment interactions. Heredity. 1973;31(3):339–354. doi: 10.1038/hdy.1973.90 [DOI] [PubMed] [Google Scholar]
  • 6.Marchini J, Donnelly P, Cardon LR. Genome-wide strategies for detecting multiple loci that influence complex diseases. Nat Genet. 2005;37(4):413–417. doi: 10.1038/ng1537 [DOI] [PubMed] [Google Scholar]
  • 7.Gauderman WJ, Mukherjee B, Aschard H, Hsu L, Lewinger JP, Patel CJ, et al. Update on the state of the science for analytical methods for gene-environment interactions. Am J Epidemiol. 2017;186(7):762–770. doi: 10.1093/aje/kwx228 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Wright S, Systems of mating. III. Assortative mating based on somatic resemblance. Genetics. 1921;6(2):144–161. doi: 10.1093/genetics/6.2.144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Crow JF, Felsenstein J. The effect of assortative mating on the genetic composition of a population. Eugen Q. 1968;15(2):85–97. doi: 10.1080/19485565.1968.9987760 [DOI] [PubMed] [Google Scholar]
  • 10.Bulmer MG. The effect of selection on genetic variability. Am Nat. 1971;105(943):201–211. [Google Scholar]
  • 11.Bulmer MG. Linkage disequilibrium and genetic variability. Genet Res. 1974;23(3):281–289. doi: 10.1017/s0016672300014920 [DOI] [PubMed] [Google Scholar]
  • 12.Ewens WJ, Spielman RS. The transmission/disequilibrium test: history, subdivision, and admixture. Am J Hum Genet. 1995;57(2):455–464. [PMC free article] [PubMed] [Google Scholar]
  • 13.Pritchard JK, Rosenberg NA. Use of unlinked genetic markers to detect population stratification in association studies. Am J Hum Genet. 1999;65(1):220–228. doi: 10.1086/302449 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Price AL, Zaitlen NA, Reich D, Patterson N. New approaches to population stratification in genome-wide association studies. Nat Rev Genet. 2010;11(7):459–463. doi: 10.1038/nrg2813 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006;38(8):904–909. doi: 10.1038/ng1847 [DOI] [PubMed] [Google Scholar]
  • 16.Yang J, Zaitlen NA, Goddard ME, Visscher PM, Price AL. Advantages and pitfalls in the application of mixed-model association methods. Nat Genet. 2014;46(2):100–106. doi: 10.1038/ng.2876 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Bulik-Sullivan BK, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47(11):1236–1241. doi: 10.1038/ng.3406 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J, Patterson N, et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47(3):291–295. doi: 10.1038/ng.3211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bulik-Sullivan B. Relationship between LD score and Haseman-Elston regression. BioRxiv. 2015. doi: 10.1101/018283 [DOI] [Google Scholar]
  • 20.Berg JJ, Harpak A, Sinnott-Armstrong N, Joergensen AM, Mostafavi H, Field Y, et al. Reduced signal for polygenic adaptation of height in UK Biobank. Elife. 2019;8:e39725. doi: 10.7554/eLife.39725 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Sohail M, Maier RM, Ganna A, Bloemendal A, Martin AR, Turchin MC, et al. Polygenic adaptation on height is overestimated due to uncorrected stratification in genome-wide association studies. Elife. 2019;8:e39702. doi: 10.7554/eLife.39702 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Border R, O’Rourke S, de Candia T, Goddard ME, Visscher PM, Yengo L, et al. Assortative mating biases marker-based heritability estimators. Nat Commun. 2022;13(1):1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Spielman RS, McGinnis RE, Ewens WJ. Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). Am J Hum Genet. 1993;52(3):506–516. [PMC free article] [PubMed] [Google Scholar]
  • 24.Allison DB. Transmission-disequilibrium tests for quantitative traits. Am J Hum Genet. 1997;60(3):676–690. [PMC free article] [PubMed] [Google Scholar]
  • 25.Eaves LJ, Pourcain BS, Davey Smith G, York TP, Evans DM. Resolving the effects of maternal and offspring genotype on dyadic outcomes in genome wide complex trait analysis (“M-GCTA”). Behav Genet. 2014;44:445–455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Weiner DJ, Wigdor EM, Ripke S, Walters RK, Kosmicki JA, Grove J, et al. Polygenic transmission disequilibrium confirms that common and rare variation act additively to create risk for autism spectrum disorders. Nat Genet. 2017;49(7):978–985. doi: 10.1038/ng.3863 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kong A, Thorleifsson G, Frigge ML, Vilhjalmsson BJ, Young AI, Thorgeirsson TE, et al. The nature of nurture: Effects of parental genotypes. Science. 2018;359(6374):424–428. doi: 10.1126/science.aan6877 [DOI] [PubMed] [Google Scholar]
  • 28.Abecasis GR, Cardon LR, Cookson WOC. A general test of association for quantitative traits in nuclear families. Am J Hum Genet. 2000;66(1):279–292. doi: 10.1086/302698 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Visscher PM, Medland SE, Ferreira MAR, Morley KI, Zhu G, Cornes BK, et al. Assumption-free estimation of heritability from genome-wide identity-by-descent sharing between full siblings. PLoS Genet. 2006;2(3):e41. doi: 10.1371/journal.pgen.0020041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lee JJ, Wedow R, Okbay A, Kong E, Maghzian O, Zacher M, et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet. 2018;50(8):1112–1121. doi: 10.1038/s41588-018-0147-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Selzam S, Ritchie SJ, Pingault JB, Reynolds CA, O’Reilly PF, Plomin R. Comparing within-and between-family polygenic score prediction. Am J Hum Genet. 2019;105(2):351–363. doi: 10.1016/j.ajhg.2019.06.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Mostafavi H, Harpak A, Agarwal I, Conley D, Pritchard JK, Przeworski M. Variable prediction accuracy of polygenic scores within an ancestry group. Elife. 2020;9:e48376. doi: 10.7554/eLife.48376 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Howe LJ, Nivard MG, Morris TT, Hansen AF, Rasheed H, Cho Y, et al. Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects. Nat Genet. 2022;54(5):581–592. doi: 10.1038/s41588-022-01062-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Young AI, Nehzati SM, Benonisdottir S, Okbay A, Jayashankar H, Chanwook L, et al. Mendelian imputation of parental genotypes improves estimates of direct genetic effects. Nat Genet. 2022;54:897–905. doi: 10.1038/s41588-022-01085-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Okbay A, Wu Y, Wang N, Jayashankar H, Bennett M, Nehzati SM, et al. Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals. Nat Genet. 2022;54(4):437–449. doi: 10.1038/s41588-022-01016-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Nivard M, Belsky D, Harden KP, Baier T, Ystrom E, Lyngstad TH. Neither nature nor nurture: Using extended pedigree data to elucidate the origins of indirect genetic effects on offspring educational outcomes. PsyArXiv. 2022. doi: 10.31234/osf.io/bhpm5 [DOI] [Google Scholar]
  • 37.Border R, Athanasiadis G, Buil A, Schork A, Cai N, Young A, et al. Cross-trait assortative mating is widespread and inflates genetic correlation estimates. Science. 2022;378(6621):754–761. doi: 10.1126/science.abo2059 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Rosenberg NA, Nordborg M. A general population-genetic model for the production by population structure of spurious genotype–phenotype associations in discrete, admixed or spatially distributed populations. Genetics. 2006;173(3):1665–1678. doi: 10.1534/genetics.105.055335 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Platt A, Vilhjálmsson BJ, Nordborg M. Conditions under which genome-wide association studies will be positively misleading. Genetics. 2010;186(3):1045–1052. doi: 10.1534/genetics.110.121665 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Atwell S, Huang YS, Vilhjálmsson BJ, Willems G, Horton M, Li Y, et al. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature. 2010;465(7298):627–631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Hayes B, Goddard M. Genome-wide association and genomic selection in animal breeding. Genome. 2010;53(11):876–883. doi: 10.1139/G10-076 [DOI] [PubMed] [Google Scholar]
  • 42.Peiffer JA, Romay MC, Gore MA, Flint-Garcia SA, Zhang Z, Millard MJ, et al. The genetic architecture of maize height. Genetics. 2014;196(4):1337–1356. doi: 10.1534/genetics.113.159152 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Josephs EB, Stinchcombe JR, Wright SI. What can genome-wide association studies tell us about the evolutionary forces maintaining genetic variation for quantitative traits? New Phytol. 2017;214(1):21–33. doi: 10.1111/nph.14410 [DOI] [PubMed] [Google Scholar]
  • 44.Veller C, Przeworski M, Coop G. Causal interpretations of family GWAS in the presence of heterogeneous effects. bioRxiv. 2023. doi: 10.1101/2023.11.13.566950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Weir BS. Linkage disequilibrium and association mapping. Annu Rev Genomics Hum Genet. 2008;9(1):129–142. doi: 10.1146/annurev.genom.9.081307.164347 [DOI] [PubMed] [Google Scholar]
  • 46.Pritchard JK, Przeworski M. Linkage disequilibrium in humans: models and data. Am J Hum Genet. 2001;69(1):1–14. doi: 10.1086/321275 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Patterson N, Price AL, Reich D. Population structure and eigenanalysis. PLoS Genet. 2006;2(12):e190. doi: 10.1371/journal.pgen.0020190 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Greene WH. Econometric Analysis. 8th ed. New York: Pearson; 2018. [Google Scholar]
  • 49.Young AI, Frigge ML, Gudbjartsson DF, Thorleifsson G, Bjornsdottir G, Sulem P, et al. Relatedness disequilibrium regression estimates heritability without environmental bias. Nat Genet. 2018;50(9):1304–1310. doi: 10.1038/s41588-018-0178-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Shen H, Feldman MW. Genetic nurturing, missing heritability, and causal analysis in genetic statistics. Proc Natl Acad Sci U S A. 2020;117(41):25646–25654. doi: 10.1073/pnas.2015869117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Trejo S, Domingue BW. Genetic nature or genetic nurture? Introducing social genetic parameters to quantify bias in polygenic score analyses. Biodemography Soc Biol. 2018;64(3–4):187–215. doi: 10.1080/19485565.2019.1681257 [DOI] [PubMed] [Google Scholar]
  • 52.Fletcher J, Wu Y, Li T, Lu Q. Interpreting polygenic score effects in sibling analysis. BioRxiv. 2021. doi: 10.1101/2021.07.16.452740 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Haller BC, Messer PW. SLiM 4: multispecies eco-evolutionary modeling. Am Nat. 2023;201(5):E127–E139. doi: 10.1086/723601 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Kong A, Thorleifsson G, Gudbjartsson DF, Masson G, Sigurdsson A, Jonasdottir A, et al. Fine-scale recombination rate differences between sexes, populations and individuals. Nature. 2010;467(7319):1099–1103. doi: 10.1038/nature09525 [DOI] [PubMed] [Google Scholar]
  • 55.Horwitz TB, Balbona JV, Paulich KN, Keller MC. Evidence of correlations between human partners based on systematic reviews and meta-analyses of 22 traits and UK Biobank analysis of 133 traits. Nat Hum Behav. 2023;7(9):1568–1583. doi: 10.1038/s41562-023-01672-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Veller C, Muralidhar P, Haig D. On the logic of Fisherian sexual selection. Evolution. 2020;74(7):1234–1245. doi: 10.1111/evo.13944 [DOI] [PubMed] [Google Scholar]
  • 57.Yengo L, Robinson MR, Keller MC, Kemper KE, Yang Y, Trzaskowski M, et al. Imprint of assortative mating on the human genome. Nat Hum Behav. 2018;2(12):948–954. doi: 10.1038/s41562-018-0476-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Stulp G, Simons MJP, Grasman S, Pollet TV. Assortative mating for human height: A meta-analysis. Am J Hum Biol. 2017;29(1):e22917. doi: 10.1002/ajhb.22917 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Kemper KE, Yengo L, Zheng Z, Abdellaoui A, Keller MC, Goddard ME, et al. Phenotypic covariance across the entire spectrum of relatedness for 86 billion pairs of individuals. Nat Commun. 2021;12(1):1050. doi: 10.1038/s41467-021-21283-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Simons YB, Mostafavi H, Smith CJ, Pritchard JK, Sella G. Simple scaling laws control the genetic architectures of human complex traits. bioRxiv. 2022. doi: 10.1101/2022.10.04.509926 [DOI] [Google Scholar]
  • 61.Pritchard JK, Stephens M, Rosenberg NA, Donnelly P. Association mapping in structured populations. Am J Hum Genet. 2000;67(1):170–181. doi: 10.1086/302959 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Lee H, Lee MH. Disentangling linkage and population structure in association mapping. 2023. Available from: https://github.com/hanbin973/hanbin973.github.io/raw/master/_data/LeeAndLee2023a.pdf. [Google Scholar]
  • 63.Edelaar P, Bolnick DI. Non-random gene flow: an underappreciated force in evolution and ecology. Trends Ecol Evol. 2012;27(12):659–665. doi: 10.1016/j.tree.2012.07.009 [DOI] [PubMed] [Google Scholar]
  • 64.Hayward LK, Sella G. Polygenic adaptation after a sudden change in environment. Elife. 2022;11:e66697. doi: 10.7554/eLife.66697 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Harpak A, Przeworski M. The evolution of group differences in changing environments. PLoS Biol. 2021;19(1):e3001072. doi: 10.1371/journal.pbio.3001072 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Yair S, Coop G. Population differentiation of polygenic score predictions under stabilizing selection. Philos Trans R Soc B. 1852;2022(377):20200416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Nei M, Li WH. Linkage disequilibrium in subdivided populations. Genetics. 1973;75(1):213–219. doi: 10.1093/genetics/75.1.213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Pfaff CL, Parra EJ, Bonilla C, Hiester K, McKeigue PM, Kamboh MI, et al. Population structure in admixed populations: effect of admixture dynamics on the pattern of linkage disequilibrium. Am J Hum Genet. 2001;68(1):198–207. doi: 10.1086/316935 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Zaitlen N, Huntsman S, Hu D, Spear M, Eng C, Oh SS, et al. The effects of migration and assortative mating on admixture linkage disequilibrium. Genetics. 2017;205(1):375–383. doi: 10.1534/genetics.116.192138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Zaitlen N, Pasaniuc B, Sankararaman S, Bhatia G, Zhang J, Gusev A, et al. Leveraging population admixture to characterize the heritability of complex traits. Nat Genet. 2014;46(12):1356–1362. doi: 10.1038/ng.3139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Lee H, Lee MH. Theoretical Interpretation of Genetic Studies in Admixed Populations. 2023. Available from: https://github.com/hanbin973/hanbin973.github.io/raw/master/data/LeeAndLee2023b.pdf. [Google Scholar]
  • 72.Martin AR, Gignoux CR, Walters RK, Wojcik GL, Neale BM, Gravel S, et al. Human demographic history impacts genetic risk prediction across diverse populations. Am J Hum Genet. 2017;100(4):635–649. doi: 10.1016/j.ajhg.2017.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Sella G, Barton NH. Thinking about the evolution of complex traits in the era of genome-wide association studies. Annu Rev Genomics Hum Genet. 2019;20:461–493. doi: 10.1146/annurev-genom-083115-022316 [DOI] [PubMed] [Google Scholar]
  • 74.Sanjak JS, Sidorenko J, Robinson MR, Thornton KR, Visscher PM. Evidence of directional and stabilizing selection in contemporary humans. Proc Natl Acad Sci U S A. 2018;115(1):151–156. doi: 10.1073/pnas.1707227114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Brown BC, Price AL, Patsopoulos NA, Zaitlen N. Local joint testing improves power and identifies hidden heritability in association studies. Genetics. 2016;203(3):1105–1116. doi: 10.1534/genetics.116.188292 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Yamamoto K, Sonehara K, Namba S, Konuma T, Masuko H, Miyawaki S, et al. Genetic footprints of assortative mating in the Japanese population. Nat Hum Behav. 2023;7(1):65–73. doi: 10.1038/s41562-022-01438-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Ulizzi L, Terrenato L. Natural selection associated with birth weight V. The secular relaxation of the stabilizing component. Ann Hum Genet. 1987;51(3):205–210. [DOI] [PubMed] [Google Scholar]
  • 78.Carey G. Sibling imitation and contrast effects. Behav Genet. 1986;16:319–341. doi: 10.1007/BF01071314 [DOI] [PubMed] [Google Scholar]
  • 79.Zaidi AA, Mathieson I. Demographic history mediates the effect of stratification on polygenic scores. Elife. 2020;9:e61548. doi: 10.7554/eLife.61548 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Haworth S, Mitchell R, Corbin L, Wade KH, Dudding T, Budu-Aggrey A, et al. Apparent latent structure within the UK Biobank sample has implications for epidemiological analysis. Nat Commun. 2019;10(1):1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Abdellaoui A, Dolan CV, Verweij KJ, Nivard MG. Gene–environment correlations across geographic regions affect genome-wide association studies. Nat Genet. 2022;54(9):1345–1354. doi: 10.1038/s41588-022-01158-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Li A, Liu S, Bakshi A, Jiang L, Chen W, Zheng Z, et al. mBAT-combo: a more powerful test to detect gene-trait associations from GWAS data. Am J Hum Genet. 2023;110(1):30–43. doi: 10.1016/j.ajhg.2022.12.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Bürger R. The Mathematical Theory of Selection, Recombination, and Mutation. Chichester, UK: Wiley; 2000. [Google Scholar]
  • 84.Benonisdottir S, Kong A. Studying the genetics of participation using footprints left on the ascertained genotypes. Nat Genet. 2023;55(8):1413–1420. doi: 10.1038/s41588-023-01439-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Demange PA, Hottenga JJ, Abdellaoui A, Eilertsen EM, Malanchini M, Domingue BW, et al. Estimating effects of parents’ cognitive and non-cognitive skills on offspring education using polygenic scores. Nat Commun. 2022;13(1):4801. doi: 10.1038/s41467-022-32003-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Fisher RA. Statistical methods in genetics. Heredity. 1952;6(1):1–12. [Google Scholar]
  • 87.Morris TT, Davies NM, Hemani G, Davey Smith G. Population phenomena inflate genetic associations of complex social traits. Sci Adv. 2020;6(16):eaay0328. doi: 10.1126/sciadv.aay0328 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Brumpton B, Sanderson E, Heilbron K, Hartwig FP, Harrison S, Vie GÅ, et al. Avoiding dynastic, assortative mating, and population stratification biases in Mendelian randomization through within-family analyses. Nat Commun. 2020;11(1):3519. doi: 10.1038/s41467-020-17117-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Coop G, Przeworski M. Lottery, luck, or legacy. A review of “The Genetic Lottery: Why DNA matters for social equality”. Evolution. 2022;76(4):846–853.35225362 [Google Scholar]
  • 90.Coop G, Przeworski M. Luck, lottery, or legacy? The problem of confounding. A reply to Harden. Evolution. 2022;76(10):2464–2468. doi: 10.1111/evo.14588 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Crow JF. Mapping functions. Genetics. 1990;125(4):669–671. doi: 10.1093/genetics/125.4.669 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Roland G Roberts

31 Mar 2023

Dear Dr Veller,

Thank you for submitting your manuscript entitled "Interpreting population and family-based genome-wide association studies in the presence of confounding" for consideration as a Research Article by PLOS Biology.

Your manuscript has now been evaluated by the PLOS Biology editorial staff, as well as by an academic editor with relevant expertise, and I'm writing to let you know that we would like to send your submission out for external peer review.

However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. After your manuscript has passed the checks it will be sent out for review. To provide the metadata for your submission, please Login to Editorial Manager (https://www.editorialmanager.com/pbiology) within two working days, i.e. by Apr 04 2023 11:59PM.

If your manuscript has been previously peer-reviewed at another journal, PLOS Biology is willing to work with those reviews in order to avoid re-starting the process. Submission of the previous reviews is entirely optional and our ability to use them effectively will depend on the willingness of the previous journal to confirm the content of the reports and share the reviewer identities. Please note that we reserve the right to invite additional reviewers if we consider that additional/independent reviewers are needed, although we aim to avoid this as far as possible. In our experience, working with previous reviews does save time.

If you would like us to consider previous reviewer reports, please edit your cover letter to let us know and include the name of the journal where the work was previously considered and the manuscript ID it was given. In addition, please upload a response to the reviews as a 'Prior Peer Review' file type, which should include the reports in full and a point-by-point reply detailing how you have or plan to address the reviewers' concerns.

During the process of completing your manuscript submission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF.

Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission.

Kind regards,

Roli Roberts

Roland Roberts, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

Decision Letter 1

Roland G Roberts

12 Jun 2023

Dear Dr Veller,

Thank you for your patience while your manuscript "Interpreting population and family-based genome-wide association studies in the presence of confounding" was peer-reviewed at PLOS Biology. It has now been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by three independent reviewers.

In light of the reviews, which you will find at the end of this email, we would like to invite you to revise the work to thoroughly address the reviewers' reports.

You’ll see that reviewer #1 is positive, but raises two main issues; one is about the methodologies that you consider for family-based GWAS, and the other is your definition of bias/confounding; he suggests remedies for these. He also thinks you should shorten the manuscript and describe the simulations better. Reviewer #2 is very positive and only has a couple of points, one of which overlaps with rev #1 (the definition of bias). Reviewer #3 is also broadly positive, but thinks that you need to tie it in with underlying mechanistic processes (he also takes issue with the term “genetic confounding”). He has a number of other points that mostly relate to real-world examples and implications that might make the paper more accessible.

Given the extent of revision needed, we cannot make a decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is likely to be sent for further evaluation by all or a subset of the reviewers.

We expect to receive your revised manuscript within 3 months. Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension.

At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may withdraw it.

**IMPORTANT - SUBMITTING YOUR REVISION**

Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript:

1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript.

*NOTE: In your point-by-point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point.

You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response.

2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Revised Article with Changes Highlighted" file type.

*Re-submission Checklist*

When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist

To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record.

Please make sure to read the following important policies and guidelines while preparing your revision:

*Published Peer Review*

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*PLOS Data Policy*

Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5

*Blot and Gel Data Policy*

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Roli Roberts

Roland Roberts, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

------------------------------------

REVIEWERS' COMMENTS:

Reviewer #1:

[identifies himself as Alexander Young]

IMPORTANT: See also the comments on the attached PDF!

The authors present a manuscript that evaluates the influence of a variety of factors on both standard and family-based GWAS methods. The manuscript represents an important contribution to the literature on understanding what different GWAS designs are estimating, a topic of considerable importance since so much in human genetics relies upon the output of these studies. The results that I found most enlightening were those detailing how different population genetic processes can lead to forms of within-chromosome linkage disequilibrium (LD) that influence family-based and standard GWAS designs differently. Also important is the authors' warning about the interpretation of within-family analyses of polygenic predictors (PGS) derived from standard GWAS owing the influence of confounding in the original GWAS summary statistics.

While I found the manuscript made many valuable points, there are some issues with the theoretical results on family-based GWAS and on the definitions of 'bias' and 'confounding' the authors use. I outline the major issues here, with detailed comments given in an annotated PDF.

The first issue is the authors' derivation of results for family-based GWAS using transmitted and non-transmitted alleles. The authors define this method as proceeding by performing two separate univariate regressions on transmitted and untransmitted parental alleles, with direct effects estimated from the difference of these regression coefficients (A1.4). This is not the methodology used in family-based GWAS studies (e.g. Young et al. 2022) or in studies of indirect genetic effects (e.g. Kong et al. 2018). The method used in Young et al. 2022 is a regression of offspring phenotype jointly onto offspring genotype (or PGS) and combined parental genotype (or PGS). This is equivalent to a joint regression onto genotypes computed from the transmitted and non-transmitted parental alleles, with the direct effect computed by subtracting the coefficient on the non-transmitted genotype from the coefficient on the transmitted genotype, the method used by Kong et al. 2018. The equivalence between these two models is shown in Section 7.2 of the Supplementary Information of Okbay et al. 2022. While this is equivalent to performing separate regressions on transmitted and non-transmitted genotypes under random-mating, which means transmitted and non-transmitted genotypes (or PGS) are uncorrelated, this is not true in general, including for many of scenarios considered in the manuscript. I'm not sure if this makes much difference in some scenarios (e.g. analysis of a single variant affecting a polygenic phenotype affected by assortative mating) but could make a substantial difference under others (e.g. strong population structure, or a PGS that is strongly correlated between parents). However, most of the results in the main text use the sibling design, for which the author's derivation is correct as far as I can tell.

The second issue is how the authors define bias or confounding. This is a tricky issue in the case of GWAS designs since they are estimating marginal effects, which are inherently defined in terms of linkage disequilibrium with other variants. The authors define the true effect to be that of the local region around the focal SNP, i.e. the region with recombination fraction approximately 0. But this is not what GWAS is estimating even under random-mating with direct genetic effects only: it is estimating a linear combination of direct genetic effects of causal variants on the same chromosome, with weight in proportion to the LD between the focal variant and the causal variant. It would be better to define the true target parameter as such (as would be obtained from standard GWAS under random-mating with no indirect genetic effects) and then to examine bias in relation to that parameter.

A related issue is the authors examine the impact of various population genetic processes on a quantity they term the "heterozygosity weighted average squared effect size estimate". An issue with this quantity is that GWAS gives a biased estimate of the true value of this even under random-mating and direct genetic effects only (e.g. Figure 3). It therefore makes it hard to assess the impact of various population genetic processes on the bias introduced into this term, and it makes me doubt whether it is even a sensible thing to estimate with marginal GWAS effect estimates, whether from standard or family-based GWAS. It would both be clearer and increase the impact of this paper on the field if the authors could measure bias in some quantity or method that is generally used and (at least thought to be) understood, such as SNP heritability as estimated by LDSC. LDSC attempts to 'undo' the fact that marginal GWAS effect estimates are the outcome of LD with variants that are physically close to the focal variant when estimating SNP heritability. Results demonstrating how various population genetic processes impact LDSC estimates of SNP heritability (which should be approximately unbiased under a model of random mating and direct genetic effects only) from standard and family-based GWAS would be far more interesting and interpretable than results on 'heterozygosity weighted average squared effect size estimate'. (I appreciate there is a complicating factor that the true heritability changes with these different population genetic processes, but that is something I'm sure the authors could find a sensible way of dealing with and communicating. True SNP heritability could, for example, be computed from a PGS computed from the true direct effects of the causal SNPs.)

I also think the authors are stretching the definition of 'confounding' when they claim GxE and GxG interactions lead to confounding in family-based GWAS estimates. Under interaction models, all marginal effects are some sort of average effect, with the average taken over some distribution of the interacting factors. Such marginal effects are not typically thought of as 'confounded' by interactions in statistical modeling. The authors point that family-based and standard GWAS are estimating different average effects under certain interaction models is valid, but I don't think this can be construed as 'confounding' in family-based GWAS.

The authors should also include a description of the simulations in the manuscript. I followed the link to the git repository, and did not find it immediately enlightening.

Finally, the main text is rather long. While I thought the authors generally did a good job of making a technical topic more comprehensible, the writing could be tightened up significantly.

Alexander Young.

Reviewer #2:

This is a fantastic study by Veller & Coop that lays out how population structure impacts estimates of SNP effects and other derived quantities obtained across various experimental designs (population and within-family designs).

I only have minor comments.

My first comment relates to the word "bias", which seems overly used in this manuscript (x47 times). I will illustrate my point by simply talking about heritability. The heritability of a trait has a different meaning (and expectation) depending on whether the population undergoes random mating or not. Therefore, assuming that the random mating heritability is the true heritability and the other one biased does not seem to be the appropriate language. Even more so, given that a randomly mating population is a mere abstraction that would probably never exist in nature. The paper has a lot of this fallacious dichotomy. This is a minor comment so the only action that I request here is for the authors to emphasize that the notion of bias is subjective and will depend on what the application is.

My second comment regards the effectiveness of PCA to correct stratification. The authors echo what was reported before that PCA adjustment may not fully remove confounding. However, this is only half true as the power of PCA depends on sample size. Interestingly, Price (2006) clearly made that point and gave a rule of thumb to appreciate when PCA starts detecting (sub)structures. However, that argument tends to get lost in the literature. My suggestion is to emphasize that the power of PC depends on sample size or, equivalently, to add that PCA is inefficient in (meta-analyses of) small samples.

"Transmitted genotype" - Only alleles are transmitted. Please fix.

Reviewer #3:

[identifies himself as Gibran Hemani]

Veller and Coop provide a very detailed and careful theoretical analysis of subtle biasing structures in different GWAS designs, which I personally found to be valuable and educational. The authors use their analysis to propose caution over family-based methods for genetic association analysis, however a lack of contextualising with empirically known parameters of biasing mechanisms made this feel difficult to easily accept or action.

Main comments

Mechanistic processes. I did struggle with explanations such as "Genetic confounding is caused by long-range LD between loci that affect the trait or traits under study" which are used throughout - it equates an unobservable state of the data with an observable state of the data, however it would be better to equate a mechanistic process (e.g. stabilising selection, cross-trait assortative mating etc) with an observable state of the data (long range LD). Perhaps avoid the term genetic confounding. It appears that there are cross-trait mechanisms that introduce long-range LD, and within-trait mechanisms that drive issues at the causal locus. So start with the mechanisms, list what types of LD they can generate, and summarise the extent of bias likely arising from those mechanisms.

Empirical support. Many papers have focused on showing bias in population-based estimates. However few examine bias in family-based estimates, though increasing numbers of family-based estimates are appearing. Can you pinpoint specific examples of family-based estimates which are particularly liable to be incorrect due to some of these biasing mechanisms?

Sensitivity analysis. A useful outcome from this would be to allow researchers to examine the sensitivity of an estimate by being able to plug in parameters into some of the equations to explore how much e.g. assortative mating would be required to give rise to the observed results from their study.

Polygenicity. The simulations used quite low polygenicity (100 or 1000 causal variants). The more socially driven traits that are liable to many of the biasing mechanisms are likely much more polygenic. How sensitive are some of the conclusions to greater polygenicity.

Tagging SNPs vs other forms of LD. I think it's quite well understood that sentinel variants detected by genome wide scans are not necessarily the causal variants and that they may slightly underestimate the true causal effect of the locus. The other forms of LD bias are much less well known in my opinion. In the exposition it would be anchoring to have a clear distinction between these two ideas at the outset.

Minor comments

line 31 - it's more accurate to describe assortative mating as a collider/selection bias rather than confounding.

line 52 - could say that indirect/dynastic effects are particularly difficult to control using population-level covariates

line 52 - 'there is no bright line separating dynastic, environmental and genetic confounding' - The mechanism of dynastic effects is quite specific, and forms a special case of both environmental confounding. I think it would serve the reader better to clarify these distinctions rather than imply that they can't be clarified.

line 118 - perhaps italicise "other parent"

Trans-LD - can you provide a more clear mechanistic explanation about how this manifests? It seems like cross-trait assortative mating is the main driver? In any case I would suggest renaming some of these terms.

Line 249 - is it just sampling error that would drive differences in coupling and repulsion heterozygote frequencies or some other mechanism? If sampling error, does it contribute towards asymptotic bias? If there's a mechanism that drives differences then it would be more direct to re-factor this equation in terms of parameters for that mechanistic process.

Line 333 - Determining if a slope is non-zero depends on hypothesis testing which has a power component. There hasn't been any discussion of the magnitude of the biasing terms or their standard errors, i.e. for what sample size of sibling pairs, and for what level of cross trait assortative mating, polygenicity etc would be required in order for a sibling pair PGS slope to be determined to be non-zero?

Line 394 - the h^2 rho term is a bit unclear when there are two traits in cross-trait AM, each potentially having different h^2

Figure 3 - I think squared effects are being used to proxy h2-like metrics, however there is an issue with bias in h2 due to changes in phenotypic variance due to AM. Kemper et al 2021 discuss how there isn't an obvious 'true' value for h2

Attachment

Submitted filename: PBIOLOGY-D-23-00751_R1.pdf

pbio.3002511.s003.pdf (4.3MB, pdf)

Decision Letter 2

Roland G Roberts

12 Jan 2024

Dear Dr Veller,

Thank you for your patience while we considered your revised manuscript "Interpreting population- and family-based genome-wide association studies in the presence of confounding" for publication as a Research Article at PLOS Biology. This revised version of your manuscript has been evaluated by the PLOS Biology editors, the Academic Editor, and one of the original reviewers.

Based on the review, we are likely to accept this manuscript for publication, provided you satisfactorily address the remaining points raised by the reviewer. Please also make sure to address the following data and other policy-related requests:

IMPORTANT:

a) Please attend to the remaining comments from reviewer #1.

b) This is a minor point, but when reading your Title, several of us stumbled over the use of the gerund "confounding" at the end. It feels like an adjective with a missing noun, rather than a noun in its own right. Would "confounds" or "confounders" or some such be easier for a casual reader to parse?

c) Many thanks for providing the simulation code in Github. However, because Github depositions can be readily changed or deleted, please make a permanent DOI’d copy (e.g. in Zenodo) and provide this URL (see below).

d) Please cite the location of the simulation code clearly in all relevant main and supplementary Figure legends (Figs 2, 3, 4, 5, S1, I think), e.g. “The simulation code used to generate this Figure can be found in https://doi.org/10.5281/zenodo.XXXXX”

e) I note that you mention two of the reviewers (Young, Hemani) in the Acknowledgements. While we appreciate the sentiment, this is against PLOS policy, so please could you remove this?

As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript.

We expect to receive your revised manuscript within two weeks.

To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following:

- a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list

- a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable, if not applicable please do not delete your existing 'Response to Reviewers' file.)

- a track-changes file indicating any changes that you have made to the manuscript.

NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines:

https://journals.plos.org/plosbiology/s/supporting-information

*Published Peer Review History*

Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*Press*

Should you, your institution's press office or the journal office choose to press release your paper, please ensure you have opted out of Early Article Posting on the submission form. We ask that you notify us as soon as possible if you or your institution is planning to press release the article.

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please do not hesitate to contact me should you have any questions.

Sincerely,

Roli Roberts

Roland Roberts, PhD

Senior Editor,

rroberts@plos.org,

PLOS Biology

------------------------------------------------------------------------

CODE POLICY

Per journal policy, as the code that you have generated is important to support the conclusions of your manuscript, we require that you make it available without restrictions upon publication. Please ensure that the code is sufficiently well documented and reusable, and that your Data Statement in the Editorial Manager submission system accurately describes where your code can be found.

------------------------------------------------------------------------

DATA NOT SHOWN?

- Please note that per journal policy, we do not allow the mention of "data not shown", "personal communication", "manuscript in preparation" or other references to data that is not publicly available or contained within this manuscript. Please either remove mention of these data or provide figures presenting the results and the data underlying the figure(s).

------------------------------------------------------------------------

REVIEWER'S COMMENTS:

Reviewer #1:

[identifies himself as Alexander Young]

The authors have done a good job at justifying their choice of the heterozygosity weighted squared effect size as their metric of potential bias. While I personally would still be interested to see simulation results applying LDSC to the different scenarios examined in this manuscript, that is perhaps beyond the scope of what is already a comprehensive manuscript and which now includes a theoretical treatment of potential LDSC biases within the authors' theoretical framework.

The authors have corrected their model for trio-based GWAS estimation. However, I was looking at their equations for the non-transmitted coefficient, and I'm not sure that it is totally correct. Looking at Equation 9, the 'genetic confounding direct' component appears to imply that this coefficient can pickup some of the direct effect of the focal variant. If I set \\lambda=l, so that c_{\\lambda,l}=0, this equation implies a contribution from a term equal to (\\tilde{D}'_{l,l}+\\tilde{D}_{l,l})\\alpha_l^d, i.e. the sum of trans-LD in the offspring and parent generations at locus l times the direct genetic effect of the locus. I do not see how the non-trasnmitted coefficient can pick up part of the direct effect of the focal variant in a joint regression of phenotype onto genotype and parental genotype, since the direct effect of the focal locus will be fully captured by the offspring genotype at that locus, even when there is trans-LD within the locus. Can the authors comment on this?

(There is a simpler route to deriving the trio-GWAS regression coefficients that avoids having to deal with the complexities of the full joint regression calculations. One can reformulate the regression in terms of the offspring deviation genotype — g_l-(g_l^m+g_l^f)/2 — and the sum of the parents' genotypes. Since these terms have zero covariance even under non-random mating, one can obtain the coefficients for the univariate regressions of phenotype onto these two terms and then take the sum and rearrange in order to obtain the results for the joint regression of phenotype onto offspring and parental genotype. I take this approach in some lectures I gave available here: https://www.youtube.com/watch?v=Imc_gJyhahQ&ab_channel=SocialScienceGeneticsAssociationConsortium)

Also, on lines 1456-1458, the authors invoke the Frisch-Waugh-Lovell theorem to justify performing a regression on the offspring deviation genotype in order to derive the 'direct genetic effect' estimate. But they state that they only need to residualise the offspring genotype for the midparent genotype, whereas the FWL theorem also requires that the phenotype be residualised with respect to the midparent genotype.

A couple of minor things:

line 1553: should this be a '+' not an '='?

Line 2350: 'teh' -> 'the'

Decision Letter 3

Roland G Roberts

19 Jan 2024

Dear Dr Veller,

Thank you for the submission of your revised Research Article "Interpreting population- and family-based genome-wide association studies in the presence of confounding" for publication in PLOS Biology. On behalf of my colleagues and the Academic Editor, Priya Moorjani, I'm pleased to say that we can in principle accept your manuscript for publication, provided you address any remaining formatting and reporting issues. These will be detailed in an email you should receive within 2-3 business days from our colleagues in the journal operations team; no action is required from you until then. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have completed any requested changes.

Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process.

PRESS: We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have previously opted in to the early version process, we ask that you notify us immediately of any press plans so that we may opt out on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. 

Sincerely,

Roli Roberts

Roland G Roberts, PhD, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Supplementary information.

    (PDF)

    pbio.3002511.s001.pdf (486.1KB, pdf)
    S1 Fig. Cross-trait assortative mating influences effect-size estimates at loci that affect the study trait, although this influence is second-order relative to that on effect-size estimates at loci that do not affect the study trait but do affect the other trait involved in assortative mating (cf. Fig 2; note the scales of the y-axes).

    Simulations are the same as in Fig 2; the code can be found at https://doi.org/10.5281/zenodo.10520811.

    (PDF)

    pbio.3002511.s002.pdf (433.5KB, pdf)
    Attachment

    Submitted filename: PBIOLOGY-D-23-00751_R1.pdf

    pbio.3002511.s003.pdf (4.3MB, pdf)
    Attachment

    Submitted filename: Veller_Responses.pdf

    pbio.3002511.s004.pdf (91.7KB, pdf)
    Attachment

    Submitted filename: Veller_Responses.pdf

    pbio.3002511.s005.pdf (73.7KB, pdf)

    Data Availability Statement

    The authors confirm that all data underlying the findings are fully available without restriction. Simulation code is available at https://doi.org/10.5281/zenodo.10520811 and https://github.com/cveller/confoundedGWAS.


    Articles from PLOS Biology are provided here courtesy of PLOS

    RESOURCES