Power loss due to testing association between covariate adjusted traits and genetic variants

Pranav Yajnik; Michael Boehnke

doi:10.1002/gepi.22325

. Author manuscript; available in PMC: 2021 Sep 1.

Published in final edited form as: Genet Epidemiol. 2020 Jun 8;44(6):579–588. doi: 10.1002/gepi.22325

Power loss due to testing association between covariate adjusted traits and genetic variants

Pranav Yajnik ¹, Michael Boehnke ¹

PMCID: PMC7610149 NIHMSID: NIHMS1614707 PMID: 32511788

Abstract

Multiple linear regression is commonly used to test for association between genetic variants and continuous traits and estimate genetic effect sizes. Confounding variables are controlled for by including them as additional covariates. An alternative technique that is increasingly used is to regress out covariates from the raw trait and then perform regression analysis with only the genetic variants included as predictors. In the case of single-variant analysis, this adjusted trait regression (ATR) technique is known to be less powerful than the traditional technique when the genetic variant is correlated with the covariates We extend previous results for single-variant tests by deriving exact relationships between the single-variant score, Wald, likelihood-ratio, and F-test statistics and their ATR analogs. We also derive the asymptotic power of ATR analogs of the multiple-variant score and burden tests. We show that the maximum power loss of the ATR analog of the multiple-variant score test is completely characterized by the canonical correlations between the set of genetic variants and the set of covariates. Further, we show that for both single- and multiple-variant tests, the power loss for ATR analogs increases with increasing stringency of Type 1 error control ( $α$ ) and increasing correlation (or canonical correlations) between the genetic variant (or multiple variants) and covariates. We recommend using ATR only when maximum canonical correlation between variants and covariates is low, as is typically true.

Keywords: adjusted outcome, power loss, covariates, linear regression, genome-wide association study

INTRODUCTION

Multiple linear regression and the associated ordinary least-squares and F-test methodologies are effective and widely used approaches to test for association between genetic variants and quantitative traits and to estimate genetic effect sizes while controlling for the effects of other variables (covariates). Covariates may be included to account for confounding (e.g. due to population structure or assay batch effects), to reduce trait variability and consequently increase power, or to exclude associations that are driven primarily through the action of the variants on an intermediate trait.

Current genome-wide association studies (GWAS) typically assay hundreds of thousands to millions of genetic variants. Single-variant association tests are performed separately on each variant to test whether the variant is associated with the trait. Multi-variant, gene-, or region-based tests are performed to address the omnibus hypothesis that one or more in a set of variants are associated with the trait. Since the dependent variable and covariates are typically the same across all tests, some analysts use a two-stage approach for quantitative trait GWAS (Randall et al., 2013; UK10K Consortium, 2015; Tachmazidou et al., 2017; Kanai et al., 2018; Styrkarsdottir et al., 2019; Niarchou et al., 2020 are some examples of studies employing this methodology). In the first stage, an ‘adjusted’ trait is obtained as the residuals from the regression of the trait on covariates. In the second stage, association analyses are performed to test for association between the adjusted trait and each variant (or set of variants) without inclusion of other covariates. We term this strategy “adjusted-trait regression (without covariates)” (ATR).

Although ATR can be conceptualized as a two-stage method, we note that it bears no relation to the “two-stage least-squares” method used in structural equations modeling and estimation of causal effects using instrumental variables. We assume that the target of inference is the conditional association between the unadjusted trait and variants given the covariates rather than the association between the adjusted trait and variants unconditional on the covariates. Thus, we view ATR as a numerical technique to conveniently approximate the results that would have been obtained from analysis of the unadjusted trait (with covariates included). The strategy of analyzing a covariate-adjusted trait may be used for any statistical method that deals with linear models, including gene/region based tests like burden or SKAT (Lee et al., 2014) or methods for linear mixed-models.

We have not found any methods papers that recommend the use of ATR. Indeed, the research articles cited above make use of ATR without comment or justification. ATR results are not identical to results obtained from modeling the unadjusted trait along with covariates. Previous investigations of single-variant models showed that the ordinary least-squares ATR estimator of genetic effect is biased towards zero by a factor of 1 − R² (Demissie & Cupples, 2011; Xing et al., 2011; Che et al., 2012), where $R^{2}$ is the sample coefficient of determination obtained by regressing the genetic variant onto the covariates. These investigations used approximations and simulations to assess power and Type 1 error of the ATR-based tests assuming a Type 1 error rate of $α = 0.05$ and showed that ATR is typically less powerful than multiple linear regression when the sample correlation between a genetic variant and covariates is non-zero. More recently, Sofer et al. (2019) showed that the ATR-based single-variant score and multi-variant SKAT test statistics are numerically (deterministically) dominated by the corresponding test statistics obtained from analyzing the unadjusted trait with covariates leading to deflated p-values and loss of power.

We extend these previous results by deriving the exact relationship between ATR and multiple linear regression score, likelihood ratio, Wald, and F test-statistics for single-variant analysis. We use these relationships to derive (1) the exact finite sample distributions of the ATR test-statistics (hence, exact power and Type 1 error) under the assumption of independent and identically normally distributed errors and (2) the asymptotic relationship between the test-statistics for situations where the assumption is suspect. In addition, we derive the asymptotic distributions of ATR based analogs of two gene/region-based tests: the burden test and the (omnibus) score test, and show that these tests applied in the ATR framework may also suffer from loss of power compared to their multiple linear regression analogs. In particular, we show that the maximum possible power loss for gene-based ATR score tests depends on the maximum canonical correlation between the set of variants and the set of covariates, so that we expect power loss to be modest in typical GWAS with low to moderate population structure.

METHODS AND RESULTS

Definition of the ATR approach

We assume a model of the form:

Y_{i} = α + \sum_{j = 1}^{m} β_{j} g_{i j} + \sum_{l = 1}^{k} γ_{l} c_{i l} + ϵ_{i}

(M1)

Here $Y_{i}, 1 \leq i \leq n$ is the trait value for the $i^{t h}$ study participant, $g_{i j}$ the genotype (or genotype-imputation-based dosage) for the $j^{t h}$ variant for this study participant, $β_{j}$ the effect of the variant on the trait (conditional on the other m − 1 variants and covariates), $c_{i l}$ the value of the $l^{t h}$ covariate, $γ_{j}$ the (conditional) effect of the covariate, and $ϵ_{i}$ a random error. We assume the errors are independent and identically distributed across observations with $E (ϵ_{i}) = 0$ and $V a r (ϵ_{i}) = σ^{2}$ . For single-variant models, $m = 1$ and $β$ is the conditional effect of the variant on the trait given the covariates, but unconditional on any other variant.

The above model can be represented as $Y = G β + C γ + ϵ$ where $Y$ and $ϵ$ are $n \times 1$ vectors, $G$ is an $n \times m$ matrix, $β$ is a $m \times 1$ vector, $C$ is an $n \times (k + 1)$ matrix (including a column of ones for the intercept), and $γ = {(α, γ_{1}, \dots, γ_{k})}^{'}$ is a $(k + 1) \times 1$ vector. We have $V a r (Y| G, C) = V a r (ϵ) = σ^{2} I_{n}$ where $I_{n}$ is the n-dimensional identity matrix. We wish to test $H_{0} : β = 0$ . Further, we assume that the test statistic $T$ has the form $T = f (Y, G, C)$ . We note that the distribution of $T$ under the null may depend on $G$ and $C$ and on parameters that need to be estimated from the data. We assume that the (possibly estimated) parameter value $\hat{θ}$ required to define the distribution of $T$ under the null (for example, degrees of freedom for the F-statistic) also has the form $\hat{θ} = g (Y, G, C)$ .

Let $H_{C} = C {(C^{'} C)}^{- 1} C^{'}$ . Then $Y_{r} = Y - C \hat{γ} = (I_{n} - H_{C}) Y$ is the vector of residuals obtained by regressing $Y$ onto $C$ using ordinary least squares (with $\hat{γ} = {(C^{'} C)}^{- 1} C^{'} Y$ ). We define the ATR analog of $T$ to be $T_{A T R} = f (Y_{r}, G, J_{n})$ where $J_{n} = (1, \dots, 1)$ is the $n \times 1$ vector of ones denoting the intercept. Further, we assume that the parameter $θ$ for ATR is calculated as ${\hat{θ}}_{A T R} = g (Y_{r}, G, J_{n})$ . This definition of the ATR analog implies that inference based on $T_{A T R}$ can be performed by using existing software designed for inference with $T$ simply by replacing $Y$ and $C$ with $Y_{r}$ and $J_{n}$ . We note that if the parameter of the null distribution for a method depends on $Y$ and/or $C$ , we may have $\hat{θ} \neq {\hat{θ}}_{A T R}$ , and the ATR analog may reference a null distribution that differs from the one used by the unadjusted method to calculate p-values.

Ordinary least-squares estimation with ATR

The ordinary least-squares estimator of $β$ is given by $\hat{β} = {(G_{r}^{'} G_{r})}^{- 1} G_{r}^{'} Y_{r}$ where $G_{r} = (I_{n} - H_{C}) G$ is the matrix of residuals of variants regressed onto $C$ . This result is often referred to as the Frisch-Waugh-Lovell theorem (Frisch & Waugh, 1933; Lovell, 2008). In the appendix, we show that

{\hat{β}}_{A T R} = {(I}_{m} - R_{G C}^{2}) \hat{β}

where $R_{G C}^{2} = {[G^{'} (I_{n} - H_{1}) G]}^{- 1} G^{'} (H_{C} - H_{1}) G$ and $H_{1} = J_{n} J_{n}^{'} / n$ . Note that the eigenvalues of $R_{G C}^{2}$ are the sample canonical correlations between the set of genetic variants and the set of covariates. In particular, $R_{G C}^{2} = 0$ (the zero matrix) if and only if every genetic variant is uncorrelated with all covariates. Further, we have $E ({\hat{β}}_{A T R}) = (I_{m} - R_{G C}^{2}) β$ and, consequently, $E ({\hat{β}}_{A T R}) = 0$ if and only if none of the genetic variants are associated with the trait (conditional on covariates). Thus, any test that is valid for testing the omnibus hypothesis $H_{0} : E ({\hat{β}}_{A T R}) = 0$ is also valid for testing $H_{0} : β = 0$ .

In the case of single-variant analysis $(m = 1)$ , the above relationship simplifies to ${\hat{β}}_{A T R} = (1 - R^{2}) \hat{β}$ and we recover the result obtained previously (Demissie & Cupples, 2011; Xing et al., 2011; Che et al., 2012; Sofer et al., 2019). Thus, for single-variant analysis, the ATR ordinary least-squares estimator can only be biased towards the null. This is not true for individual elements of ${\hat{β}}_{A T R}$ when $m > 1$ . Indeed, $E {({\hat{β}}_{A T R})}_{j}$ is a linear combination of all the elements of the vector $β$ . In particular, $β_{j} = 0$ does not necessarily imply that $E {({\hat{β}}_{A T R})}_{j} = 0$ . Thus, a test that is valid for $H_{0} : E {({\hat{β}}_{A T R})}_{j} = 0$ is not necessarily valid for $H_{0} : β_{j} = 0$ (unless all remaining elements of $β$ are also $0$ ).

Single-variant association testing with ATR

Xing et al. (2011) showed that $W_{A T R} \leq W$ where $W$ is the Wald test statistic. Che et al. (2012) refined an approximation proposed by Demissie and Cupples (2011) for the F test statistic ( $F$ ) to $F_{A T R} = \frac{n - 2}{n - k - 2} (1 - \frac{R^{2}}{1 + R^{2} r^{2} (Y_{r}, G_{r}) - r^{2} (Y_{r}, G_{r})}) F$ where $r^{2} (Y_{r}, G_{r})$ is the sample squared correlation between $Y_{r}$ and $G_{r}$ and $F$ is the F statistic. Xing et al. (2011) and Che at al. (2012) used simulations to estimate power and Type 1 error rate for $α = 0.05$ .

We show that $S_{A T R} = (1 - R^{2}) S$ , where $S$ is the score test statistic for the above linear model when $m = 1$ . For linear models, the test statistics for the score, Wald, likelihood ratio, and F tests bear simple, deterministic relationships to each other (Vandaele 1981). Combining $S_{A T R} = (1 - R^{2}) S$ with these known relationships yields the following set of equalities:

F_{A T R} = \frac{n - 2}{n - k - 2} \times \frac{(1 - R^{2}) F}{1 + R^{2} F / (n - k - 2)}

W_{A T R} = \frac{(1 - R^{2}) W}{1 + R^{2} W / n}

{L R}_{A T R} = L R - n \log (1 + R^{2} [e^{L R / n} - 1])

where $L R$ denotes the likelihood ratio test statistic. We see that $S, W, and L R$ are always strictly greater than their ATR anologs if $R^{2} > 0$ and equal to them if $R^{2} = 0$ . P-values for the score, Wald, and likelihood ratio tests are standardly computed assuming the test statistics follow a chi-square distribution with $θ = s = 1$ degree of freedom ( $χ_{1}^{2}$ distribution). The ATR analogs of these methods also assume this same distribution and are less powerful than their counterparts if $R^{2} > 0$ .

In contrast, $F_{A T R} > F$ if $F < \frac{k}{R^{2}} - (n - 2)$ and the ATR analog of the F-test uses the F-distribution with $1$ and $n - 2$ degrees of freedom while the F-test assumes a distribution with $1$ and $n - k - 2$ degrees of freedom; in this case, ${\hat{θ}}_{A T R} \neq \hat{θ}$ since the denominator degrees of freedom depends on the number of covariates. Thus, the ATR analog of the F-test may be slightly anti-conservative if $R^{2} \approx 0$ and/or the number of covariates is large relative to the sample size. This is quite unlikely given the large sample sizes of current GWAS, the large values of the test statistic required to reject the null, and the fact that the expected value of the sample coefficient of determination increases with increasing number of predictors, even when the variant is independent of the predictors at the population level, in which case $E (R^{2}) \approx k / n$ for large samples.

For a fixed number of covariates, the score, Wald, likelihood ratio, and F test statistics asymptotically converge to the same random variable $T$ (almost surely) under the null and local alternatives ( $β = O (n^{1 / 2})$ i.e. when the effect size tends to zero asymptotically). Similarly, their ATR analogs each converge to $(1 - R^{2}) T$ . Asymptotically, each of the ATR test statistics follows a scaled $χ_{1}^{2}$ distribution whose scaling factor is less than or equal to one and are, thus, conservative when $R^{2} > 0$ . The exact finite sample distribution of the F statistic is known in the case where errors are normally distributed; the exact distributions of all the other test statistics can be derived easily given the above relationships.

For simplicity, we illustrate the conservative nature of ATR for single-variant tests under asymptotic conditions. Here, we have $P (T_{A T R} < α) = P (T < α / (1 - R^{2}))$ . The relationship between the p-values generated by the score test and its ATR analog is non-linear; the ATR test becomes more conservative as the p-value threshold for declaring significance ( $α$ ) becomes more stringent. Figure 1 shows power of the ATR test with $R^{2} = 0.05$ for $α$ values ranging from $10^{- 1}$ to $10^{- 10}$ where the effect size for each $α$ value is chosen to yield $80 %$ power for the score test. At the usual GWAS threshold of $α = 5 \times 10^{- 8}$ , the power of the ATR test is about 76%. Figure 2 shows how, for fixed $α = 5 \times 10^{- 8}$ , the ATR test becomes less powerful as $R^{2}$ increases (again, with effect size chosen to yield $80 %$ power for the score test).

Figure 1: — Power of ATR analog of single-variant score test when $R^{2} = 0.05$ with varying stringency of statistical significance $α$ displayed in the negative log ten scale. Effect sizes vary as a function of $α$ to yield 80% power for the score test.

Figure 2: — Power of ATR analog of single-variant score test with increasing $R^{2}$ for $α = 5 \times 10^{- 8}$ . The effect size was chosen to yield 80% power for the score test.

Burden tests with ATR

The relationships derived for the single-variant tests are directly applicable to burden tests. Burden tests typically assume the same multiple linear regression model presented in the previous section with $G$ replaced by $B = \sum_{i = 1}^{m} w_{i} G_{i} = G W$ where $G_{i}, \dots, G_{m}$ are $m$ genetic variants (columns of $G), w_{i}$ are weights (and $W = {(w_{1}, \dots, w_{s})}^{'}$ ), and $B$ is the (weighted) burden of alternate alleles (or genotype imputation-based dosages) from the $m$ variants. For burden tests, $R^{2}$ is the sample coefficient of determination obtained by regressing $B$ onto $C$ . Given $G$ and $C$ , the maximum possible value for $R^{2}$ is obtained when the weight vector $W$ is a scalar multiple of the eigenvector of $R_{G C}^{2}$ corresponding to the maximum eigenvalue and the maximum $R^{2}$ is equal to the maximum eigenvalue.

Classical omnibus tests with ATR

The omnibus null hypothesis that none of the $m$ variants are associated with trait (conditional on covariates) can be tested with the omnibus/multivariate score, Wald, likelihood ratio, and F tests. As before, these tests are asymptotically equivalent and we consider the score test as an exemplar. Unlike the single-variant case, no deterministic relationship exists between $S_{A T R}$ and $S$ when $m > 1$ (that is, $S_{A T R}$ can take multiple values for any given value of $S$ ). However, we show that

(1 - R_{m a x}^{2}) S \leq S_{A T R} \leq (1 - R_{m i n}^{2}) S

where $R_{m a x}^{2}$ and $R_{m i n}^{2}$ are the maximum and minimum canonical correlations between the variants and covariates. Recall that $S$ asymptotically follows a $χ_{m}^{2} (δ^{2})$ distribution with non-centrality parameter $δ^{2} = \frac{1}{σ^{2}} β^{'} G^{'} (I_{n} - H_{C}) G$ . Under the null, the distribution of $S$ depends only on the parameter $\hat{θ} = m$ . Asymptotically, $S_{A T R}$ follows the same distribution as the random variable $\sum_{i = 1}^{p} (1 - R_{i}^{2}) Z_{i}$ where $R_{1}^{2}, \dots, R_{p}^{2}$ are the distinct eigenvalues of $R_{G C}^{2}$ (in decreasing order so that $R_{1}^{2} = R_{m a x}^{2}$ and with $p$ possibly smaller than $m$ ) and the random variables $Z_{i}$ are mutually independent with $Z_{i} ~ χ_{ν_{i}}^{2} (λ_{i}^{2}), \sum_{i = 1}^{p} ν_{i} = m$ (see Appendix). Since $\hat{θ}$ is independent of $C$ , we have ${\hat{θ}}_{A T R} = \hat{θ}$ and p-values for $S_{A T R}$ are calculated assuming a central $χ_{s}^{2}$ distribution.

Note that the score test yields the same power for all effect size vectors $β$ such that $β^{'} G^{'} (I_{n} - H_{C}) β = c$ where $c \geq 0$ is a constant. Although the actual difference in power between $S$ and $S_{A T R}$ depends on the true value of $β$ , we show that, amongst all $β$ that yield the same power for the score test, the ATR analog achieves minimum power when $β$ is a scalar multiple of the eigenvector of $R_{G C}^{2}$ corresponding to the maximum eigenvalue (see Appendix). Here, $λ_{1}^{2} = δ^{2}$ and $λ_{i}^{2} = 0$ for $i = 2, \dots, p$ . Thus, the maximum possible power loss of the ATR analog of the score test (relative to the score test) is completely characterized by the set of canonical correlations between the variants and covariates.

Figure 3 shows, for fixed $α = 5 \times 10^{- 8}$ and $s = 10$ variants, the power of ATR analog across a range of $R_{m a x}^{2}$ with effect size chosen to yield $80 %$ power for the omnibus score test. We calculated tail probabilities for the distribution of $S_{A T R}$ using Davies’ method as implemented in the R package CompQuadForm (de Micheaux, P. L., & de Micheaux, M. P. L., 2017). We consider two situations. First, if the remaining canonical correlations are zero, the maximum possible power loss is slightly larger than that for the single-variant case for $m = 10$ and power loss increases as $m$ increases ( $m = 100$ shown in Figure 3). Second, if all canonical correlations are equal to $R_{m a x}^{2}, S_{A T R}$ follows the scaled chi-squared distribution $(1 - R_{m a x}^{2}) χ_{m}^{2} (δ^{2})$ , and the maximum possible power loss is equal to the minimum possible power loss; thus, for a given value of $R_{m a x}^{2}$ , this constitutes the worst-case scenario for ATR (Figure 3). Note that the maximum number of non-zero canonical correlations cannot exceed $m i n (m, k)$ . Thus, the second scenario is unlikely to occur in practice.

DISCUSSION

The ATR approach is often used in genetic association studies (Randall et al., 2013; UK10K Consortium, 2015; Tachmazidou et al., 2017; Kanai et al., 2018; Styrkarsdottir et al., 2019; Niarchou et al., 2020), and several papers have used simulation to assess its properties at modest significance thresholds (Demissie & Cupples, 2011; Xing et al., 2011; Che et al., 2012). However, to our knowledge no papers have presented analytic evaluations of ATR or considered significance thresholds appropriate for GWAS. The Frisch-Waugh-Lovell theorem (Frisch & Waugh, 1933; Lovell, 2008) demonstrates that when the target of inference is confined to a subset of predictors in the multiple linear regression model (e.g. genetic variants), OLS analysis can be achieved as a two-stage method by regressing the covariate adjusted trait onto the covariate adjusted variants. Thus, the ATR strategy of adjusting the trait but not the variants is formally justified in the context of multiple linear regression only when variants and covariates are uncorrelated.

It may seem that score-tests like those presented above or SKAT employ the same strategy as ATR. Indeed, for single-variant analyses the score-statistic for linear models ( $G^{'} Y_{r}$ ) is based on the adjusted trait and unadjusted variant. However, the score test-statistic (calculated by squaring the score-statistic and dividing by its estimated variance) does depend on the adjusted variants. Indeed, it can be shown that ATR over-estimates the variance of the score-statistic by a factor of ${(1 - R^{2})}^{- 1}$ due to using unadjusted variants in the variance calculation. Our derivations also show that single-variant OLS based inference can be fully recovered from the ATR based inference given the summary statistic $R^{2}$ for each variant. For multi-variant analyses, the entire $R_{G C}^{2}$ matrix is required.

For single-variant association tests, previous papers show by computer simulation that ATR is less powerful than the (theoretically justified) two-sided t and Wald tests when the variant is correlated with the covariates (Demissie & Cupples, 2011; Xing et al., 2011; Che et al., 2012; Sofer et al., 2019). We extend previous results by deriving the exact distribution of the ATR analogs for single-variant Wald, likelihood ratio, score, and F tests, and the asymptotic distributions for gene-based burden and score tests, and assessing size and power at significance levels appropriate for GWAS.

For single-variant tests, we show that the loss of power of the ATR method is completely characterized by the coefficient of determination ( $R^{2}$ ) obtained by regressing the variant onto the covariates, with the power loss increasing with increasing $R^{2}$ . Further, we show that loss of power increases as the p-value cutoff used to declare significance becomes more stringent. Characterizing power loss for the ATR analogs of gene-based tests is more complex. For gene-based score tests, the power loss depends on both the (true) strength of association between each variant and the outcome, and the correlation between each variant and the covariates. Power loss is greater when the subset of variants driving the association is also the subset that is driving the canonical correlation between variants and covariates. For the ATR analogs of the multiple linear regression omnibus test of association, we show that the maximum possible power loss is completely characterized by the canonical correlations between the variants and covariates with maximum power loss increasing with increasing values of any of the canonical correlations. When there is only a single non-zero canonical correlation, the maximum power loss is similar to the single-variant case.

At the significance threshold of $α = 5 \times 10^{- 8}$ typically used in GWAS, an $R^{2}$ of $0.1$ results in power decreasing from 80% (for the two-sided t test) to about 71% for the single-variant ATR test. Thus, we recommend that ATR based methods only be used when the $R^{2}$ for the majority of variants is expected be substantially less than 0.1. We re-iterate that sets of covariates not associated with the variant do not result in loss of power due to using ATR; in fact, they increase power if they explain some of the trait variance (Robinson & Jewell, 1991). Covariates that are associated with the trait but not genetic variants in a population based sample may be associated with genetic variants in studies that sample participants non-randomly (Munafo et al., 2018; Greenland et al., 1999); for example, two variables that both cause a disease but are independent in a population will be associated in a case-control sample (Monsees et al., 2009).

In GWAS, the most commonly included covariates that are likely to be correlated with a large number of variants are indicators of genetic ancestry (e.g. principal components). The distribution of correlation depends on the degree of population structure in the sample and the mean $R^{2}$ across variants is (approximately) the sample $F_{s t}$ . For intra-continental samples, typically $F_{s t} < 0.05$ but for inter-continental samples it can be $> 0.1$ [The 1000 Genomes Project Consortium, 2015]. As a further example, we calculated $R^{2}$ between ~750,000 genotyped variants and the first 2, 5, and 10 genetic principal components for ~409,000 participants with white-British ancestry in the UK Biobank (details of SNP QC and PCA generation in Bycroft et al., 2018) and found all $R^{2}$ values were < 0.05. In the analysis including the remaining 78,000 non-white participants (total sample size ~487,000), 6% of variants showed $R^{2} > 0.05$ and 2.5% showed $R^{2} > 0.10$ (the results were approximately similar with 2, 5, and 10 PCs).

Other commonly included covariates that may be correlated with variants are intermediate traits lying in between the gene and primary trait in the causal pathway, and indicators of sample processing or batch effects. For intermediate traits that are genetically complex, values of $R^{2}$ will typically be much smaller than 0.1. The situation with batch effects is less clear, especially for sequencing data which are sensitive to both sample processing and genotype calling methods. Finally, variants which are known to be associated with the trait may also be included as covariates, especially in fine mapping analyses or while searching for multiple independent signals within the same locus. Here, we recommend against using ATR based methods since there is potentially a large power loss for variants in even moderate linkage disequilibrium with the associated variant.

In multiple-variant tests such as burden and omnibus tests (like the F-test or SKAT), we note that least-squares effect size estimator for any particular variant may be biased either towards or away from the null for ATR. Thus, although ATR based tests are valid for the omnibus hypothesis that none of the variants are associated, an ATR based test for the conditional effect of a variant given the remaining variants may not be valid. This is of particular importance for post-hoc testing when the omnibus test is rejected and the analyst wishes to identify the subset of variants driving the association. We recommend against using ATR for such purposes.

When the distribution of the trait differs substantially from the normal distribution, ATR based methods are commonly used in conjunction with applying the inverse normal transform to the adjusted trait. Sofer et al. (2019) show that testing for association between the transformed adjusted trait and unadjusted variants may lead to increased Type 1 error and instead recommend using adjusted variants. McCaw et al. (2019) implement an omnibus test with this strategy.

Finally, we have assumed throughout that the multiple linear model (M1) is appropriate to answer the research question at hand and that $β$ truly measures the effect of interest. This necessitates including certain covariates (e.g. confounders), excluding others (e.g. colliders; see Greenland et al., 1999) and accounting for sample-selection effects (Munafo et al., 2018). For example, Aschard et al. (2015) show that simply adjusting for heritable covariates may lead to biased estimates of the direct (unmediated) effect of the variant on the trait and may lead to increased Type 1 error. We note that when OLS analysis of the full regression model results in increased Type 1 error, ATR will also be unable to fully control Type 1 error (although, the magnitude of Type 1 error will be lower with increasing $R^{2}$ ). Thus, ATR is invalid whenever OLS analysis of the full regression model is invalid.

In summary, we derive distributions of the ATR analogs of commonly used association test statistics. We show that ATR based methods are conservative when variants are correlated with covariates. We quantify the power loss and recommend that ATR based methods be used only when the squared correlation between variants and covariates can be confidently bounded to be substantially smaller than 0.1. We note that for commonly included covariates like age, gender and known or inferred ancestry, this is typically true and ATR based methods will likely result in negligible power loss. However, we reiterate that ATR is an ad-hoc methodology. Thus, we recommend that analysts carefully choose covariates based on a plausible causal model (accounting for sample-selection effects) and employ estimation/hypothesis-testing methods that are theoretically justified for those models.

Acknowledgments

Grant Number: NIH NHGRI HG009976

Appendix

All notation in the Appendix is as defined in the main text.

ATR estimator for $β$

The OLS estimator for $β$ is given by $\hat{β} = {[G^{'} (I_{n} - H_{C}) G]}^{- 1} G^{'} Y_{r}$ where $Y_{r} = (I_{n} - H_{C}) Y$ is the residual vector obtained from regressing $Y$ onto $C$ , and $H_{C} = C {(C^{'} C)}^{- 1} C^{'}$ . Note that $V a r (\hat{β}) = {σ^{2} [G^{'} (I_{n} - H_{C}) G]}^{- 1}$ . Since ATR simply replaces $Y a n d C$ with $Y_{r} a n d J_{n}$ , we have

{\hat{β}}_{A T R} = {[G^{'} (I_{n} - H_{1}) G]}^{- 1} G^{'} (I_{n} - H_{1}) Y_{r} = {[G^{'} (I_{n} - H_{1}) G]}^{- 1} G^{'} Y_{r} = {[G^{'} (I_{n} - H_{1}) G]}^{- 1} [G^{'} (I_{n} - H_{C}) G] \hat{β} = (I_{m} - {[G^{'} (I_{n} - H_{1}) G]}^{- 1} [G^{'} (H_{C} - H_{1}) G]) \hat{β} \overset{def}{=} (I_{m} - R_{G C}^{2}) \hat{β}

The second equality holds because $(I_{n} - H_{1}) Y_{r} = Y_{r} - \bar{Y_{r}}$ (where $\bar{Y_{r}}$ is the sample mean of $Y_{r}$ ) and $\bar{Y_{r}} = 0$ . The third equality holds because $G^{'} Y_{r} = {[G}^{'} (I_{n} - H_{C}) G] \hat{β}$ which follows from the expression for $\hat{β}$ . The fourth equality follows with straightforward algebra. Note that the eigenvalues of $R_{G C}^{2} \overset{def}{=} {[G^{'} (I_{n} - H_{1}) G]}^{- 1} [G^{'} (H_{C} - H_{1}) G]$ are the canonical correlations between $G$ and $C$ . Thus, when each variant is uncorrelated with all the covariates, all the eigenvalues of $R_{G C}^{2}$ are $0$ and ${\hat{β}}_{A T R} = \hat{β}$ .

When the model contains only one variant ( $m = 1$ ), we have ${[G^{'} (I - H_{1}) G]}^{- 1} [G^{'} (I - H_{C}) G] = 1 - R^{2}$ where $R^{2}$ is the coefficient of determination obtained by regressing the variant onto the covariates.

Relationship between the score test statistic and its ATR analog

The score test-statistic for testing $H_{0} : β = 0$ is given by

S = {\frac{1}{{\tilde{σ}}^{2}} \hat{β}}^{'} G^{'} (I_{n} - H_{C}) G \hat{β}

where ${\tilde{σ}}^{2} = \frac{1}{n} Y^{'} (I_{n} - H_{C}) Y = \frac{1}{n} Y_{r}^{'} Y_{r}$ is the maximum likelihood estimator (MLE) for $σ^{2}$ under the null (Vandaele 1981).

Note that ${\tilde{σ}}_{A T R}^{2} = \frac{1}{n} Y_{r} (I_{n} - H_{1}) Y_{r} = \frac{1}{n} Y_{r}^{'} Y_{r} = {\tilde{σ}}^{2}$ since $(I_{n} - H_{1}) Y_{r} = Y_{r} - {\bar{Y}}_{r} = Y_{r}$ . Thus, we have

S_{A T R} = \frac{1}{{\tilde{σ}}^{2}} {\hat{β}}_{A T R}^{'} G^{'} (I_{n} - H_{1}) G {\hat{β}}_{A T R} = \frac{1}{{\tilde{σ}}^{2}} {\hat{β}}^{'} [G^{'} (I_{n} - H_{C}) G] {[G^{'} (I_{n} - H_{1}) G]}^{- 1} [G^{'} (I_{n} - H_{C}) G] \hat{β} = \frac{1}{{\tilde{σ}}^{2}} {\hat{β}}^{'} [G^{'} (I_{n} - H_{C}) G] [I_{m} - R_{G C}^{2}] \hat{β} = S - \frac{1}{{\tilde{σ}}^{2}} {\hat{β}}^{'} [G^{'} (I_{n} - H_{C}) G] R_{G C}^{2} \hat{β}

Equivalently, we have

\frac{S_{A T R}}{S} = \frac{{\hat{β}}^{'} [G^{'} (I_{n} - H_{C}) G] [I_{m} - R_{G C}^{2}] \hat{β}}{{\hat{β}}^{'} G^{'} (I_{n} - H_{C}) G \hat{β}}

Recall that, for all vectors $x$ such that $x^{'} B x = c$ (for any constant $c > 0$ ) the generalized Rayleigh quotient $Q = \frac{x^{'} A x}{x^{'} B x}$ is bounded below and above by the minimum and maximum eigenvalues of $B^{- 1} A$ . Thus, we have

(1 - R_{m a x}^{2}) S \leq S_{A T R} \leq (1 - R_{m i n}^{2}) S

where $R_{m i n}^{2}$ and $R_{m a x}^{2}$ are the smallest and largest eigenvalues of $R_{G C}^{2}$ . The lower (upper) bound is attained when $\hat{β}$ is parallel to the eigenvector corresponding to maximum (minimum) eigenvalue of $R_{G C}^{2}$ . When each variant is orthogonal to each of the covariates we have $R_{m i n}^{2} = R_{m a x}^{2} = 0$ and $S_{A T R} = S$ .

When the model contains only one variant, the above relationship simplifies to the deterministic relationship $S_{A T R} = (1 - R^{2}) S$ (with $R^{2}$ as defined previously). For $m > 1$ , the relationship is not deterministic (that is, $S_{A T R}$ can take multiple values for any given value of $S$ ) unless all the variants are collinear. We can use the relationships between the score, Wald, likelihood-ratio, and F test statistics (Vandaele 1981) to derive exact expressions for the relationships between each of these tests and their ATR analogs for single variant models. We state these relationships in the main text (but omit the straightforward algebra).

Asymptotic distribution of $S_{A T R}$

Asymptotically, $S_{A T R}$ converges in distribution to the distribution of the quadratic form ${\hat{β}}^{'} A \hat{β}$ with $A = σ^{- 2} [G^{'} (I_{n} - H_{C}) G] {[G^{'} (I_{n} - H_{1}) G]}^{- 1} [G^{'} (I_{n} - H_{C}) G]$ . With suitable regularity conditions, asymptotically $\hat{β} ~ N (μ, V)$ with $V = σ^{2} {[G (I_{n} - H_{C}) G]}^{- 1}$ . Baldessari (1967) derived the distribution of quadratic forms in multivariate normal variables. Since $A$ is symmetric and $V$ positive definite, there exists an invertible matrix $M$ such that $M^{'} V^{- 1} M = I_{m}$ and $M^{'} A M = Λ$ with $Λ$ an $m \times m$ diagonal matrix. Thus, we have that $I_{m} - R_{G C}^{2} = V A = M Λ M^{- 1}$ ; that is, the columns of $M$ are the eigenvectors of $I_{m} - R_{G C}^{2}$ (and $R_{G C}^{2}$ ) and the $i^{t h}$ element of the diagonal of $Λ$ is $1 - l_{i}^{2}$ with $l_{i}^{2}$ the eigenvalue of $R_{G C}^{2}$ corresponding to the $i^{t h}$ column of $M$ . Let $R_{j}^{2}, 1 \leq j \leq p$ denote the $p \leq m$ distinct eigenvalues of $R_{G C}^{2}$ with $R_{1}^{2} > \cdot\cdot\cdot > R_{p}^{2}$ . Let $B_{j}$ be the $m \times m$ diagonal matrix which has elements $1$ where $Λ$ has elements $1 - R_{j}^{2}$ and $0$ otherwise. Then, from Baldessari (1967, Theorem 1) and some trivial algebra, $S_{A T R}$ follows the same distribution as $\sum_{j = 1}^{p} (1 - R_{j}^{2}) Z_{j}$ , where $Z_{j} ~ χ_{ν_{j}}^{2} (λ_{j}^{2})$ (that is, a non-central chi-squared distribution with $ν_{j}$ degrees of freedom and non-centrality parameter $λ_{j}^{2}), λ_{j}^{2} = {(M^{- 1} β)}^{'} B_{j} M^{- 1} β$ and $ν_{j}$ is the geometric multiplicity of $R_{j}^{2}$ .

Recall that, asymptotically, $S ~ χ_{m}^{2} (δ^{2})$ with $δ^{2} = β^{'} V^{- 1} β = {(M^{- 1} β)}^{'} M^{- 1} β$ . Thus, we have $\sum_{j = 1}^{p} λ_{j}^{2} = δ^{2}$ . When $β$ lies in the space spanned by the eigenvector(s) of $R_{G C}^{2}$ corresponding to the (distinct) eigenvalue $R_{k}^{2}, 1 \leq k \leq p$ , we have $λ_{k}^{2} = δ^{2}$ and $λ_{i}^{2} = 0, i \neq k$ . Consider the set $Δ$ of vectors $β$ that yield the same power for the score test (that is, all vectors $β$ for which $β^{'} V^{- 1} β = δ^{2}$ for a given $δ^{2}$ ). Unlike $S$ , the power of $S_{A T R}$ may differ when $β$ takes different values in this set. We use a result derived by Matthew and Nordstöm (1997) to find values in $Δ$ that lead to minimum power for $S_{A T R}$ :

Theorem 3 (Matthew and Nordstöm, 1997). Let $X_{i}$ and $Y_{i}$ be distributed, respectively, as $χ_{ν_{i}}^{2} (δ_{i}^{2})$ and $χ_{ν_{i}}^{2} (μ_{i}^{2})$ , $i = 1, \dots, n$ , with $X_{1}, \dots, X_{n}$ independent and $Y_{1}, \dots, Y_{n}$ independent. Then

\sum_{i = i}^{n} λ_{i} X_{i} \leq_{D} \sum_{i = 1}^{n} {λ_{i} Y}_{i}

holds for all nonnegative $λ_{i}$ ’s satisfying $λ_{1} \geq \dots \geq λ_{n}$ if and only if

\sum_{i = 1}^{k} δ_{i}^{2} \leq \sum_{i = 1}^{k} μ_{i}^{2} for all k = 1 \dots, n .

In the above theorem, $X \leq_{D} Y$ denotes that the random variable $Y$ stochastically dominates $X$ . From the above theorem and preceding details of the distribution of $S_{A T R}$ , it follows that distribution followed by $S_{A T R}$ when $β$ lies in the space spanned by the eigenvectors of $R_{G C}^{2}$ corresponding to the maximum eigenvalue $R_{1}^{2} = R_{m a x}^{2}$ is dominated by the distribution followed by $S_{A T R}$ when $β$ takes any other value in $Δ$ .

Footnotes

DATA AVAILABILITY STATEMENT

Data sharing not applicable – no new data generated.

CONFLICT OF INTEREST STATEMENT

The authors have no conflict of interest to declare.

REFERENCES

1000 Genomes Project Consortium. (2015). A global reference for human genetic variation. Nature, 526(7571), 68–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
Aschard H, Vilhjálmsson BJ, Joshi AD, Price AL, & Kraft P (2015). Adjusting for heritable covariates can bias effect estimates in genome-wide association studies. The American Journal of Human Genetics, 96(2), 329–339. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, ... & Cortes A (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562(7726), 203–209. [DOI] [PMC free article] [PubMed] [Google Scholar]
Che R, Motsinger-Reif AA, & Brown CC (2012). Loss of power in two-stage residual-outcome regression analysis in genetic association studies. Genetic epidemiology, 36(8), 890–894. [DOI] [PMC free article] [PubMed] [Google Scholar]
Demissie S, & Cupples LA (2011). Bias due to two-stage residual-outcome regression analysis in genetic association studies. Genetic epidemiology, 35(7), 592–596. [DOI] [PMC free article] [PubMed] [Google Scholar]
Frisch R, & Waugh FV (1933). Partial time regressions as compared with individual trends. Econometrica: Journal of the Econometric Society, 387–401. [Google Scholar]
Greenland S, Pearl J, & Robins JM (1999). Causal diagrams for epidemiologic research. Epidemiology, 37–48. [PubMed] [Google Scholar]
Kanai M, Akiyama M, Takahashi A, Matoba N, Momozawa Y, Ikeda M, ... & Kubo M (2018). Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nature genetics, 50(3), 390–400. [DOI] [PubMed] [Google Scholar]
Lee S, Abecasis GR, Boehnke M, & Lin X (2014). Rare-variant association analysis: study designs and statistical tests. The American Journal of Human Genetics, 95(1), 5–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lovell MC (2008). A simple proof of the FWL theorem. The Journal of Economic Education, 39(1), 88–91. [Google Scholar]
Mathew T, & Nordström K (1997). Inequalities for the probability content of a rotated ellipse and related stochastic domination results. The Annals of Applied Probability, 7(4), 1106–1117. [Google Scholar]
McCaw ZR, Lane JM, Saxena R, Redline S, & Lin X (2019). Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies. Biometrics. [DOI] [PMC free article] [PubMed] [Google Scholar]
de Micheaux PL, & de Micheaux MPL (2017). Package ‘CompQuadForm’. CRAN Repository. [Google Scholar]
Monsees GM, Tamimi RM, & Kraft P (2009). Genome-wide association scans for secondary traits using case-control samples. Genetic Epidemiology: The Official Publication of the International Genetic Epidemiology Society, 33(8), 717–728. [DOI] [PMC free article] [PubMed] [Google Scholar]
Munafò MR, Tilling K, Taylor AE, Evans DM, & Davey Smith G (2018). Collider scope: when selection bias can substantially influence observed associations. International journal of epidemiology, 47(1), 226–235. [DOI] [PMC free article] [PubMed] [Google Scholar]
Niarchou M, Byrne EM, Trzaskowski M, Sidorenko J, Kemper KE, McGrath JJ, ... & Wray NR (2020). Genome-wide association study of dietary intake in the UK biobank study and its associations with schizophrenia and other traits. Translational Psychiatry, 10(1), 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
Randall JC, Winkler TW, Kutalik Z, Berndt SI, Jackson AU, Monda KL, ... & Workalemahu T (2013). Sex-stratified genome-wide association studies including 270,000 individuals show sexual dimorphism in genetic loci for anthropometric traits. PLoS Genet, 9(6), e1003500. [DOI] [PMC free article] [PubMed] [Google Scholar]
Robinson LD, & Jewell NP (1991). Some surprising results about covariate adjustment in logistic regression models. International Statistical Review/Revue Internationale de Statistique, 227–240. [Google Scholar]
Sofer T, Zheng X, Gogarten SM, Laurie CA, Grinde K, Shaffer JR, ... & Lange L (2019). A fully adjusted two-stage procedure for rank-normalization in genetic association studies. Genetic epidemiology, 43(3), 263–275. [DOI] [PMC free article] [PubMed] [Google Scholar]
Styrkarsdottir U, Stefansson OA, Gunnarsdottir K, Thorleifsson G, Lund SH, Stefansdottir L, ... & Ivarsdottir EV (2019). GWAS of bone size yields twelve loci that also affect height, BMD, osteoarthritis or fractures. Nature communications, 10(1), 1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tachmazidou I, Süveges D, Min JL, Ritchie GR, Steinberg J, Walter K, ... & McCarthy S (2017). Whole-genome sequencing coupled to imputation discovers genetic signals for anthropometric traits. The American Journal of Human Genetics, 100(6), 865–884 [DOI] [PMC free article] [PubMed] [Google Scholar]
UK10K consortium. (2015). The UK10K project identifies rare variants in health and disease. Nature, 526(7571), 82–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
Vandaele W (1981). Wald, likelihood ratio, and Lagrange multiplier tests as an F test. Economics Letters, 8(4), 361–365. [Google Scholar]
Xing G, Lin CY, & Xing C (2011). A comparison of approaches to control for confounding factors by regression models. Human heredity, 72(3), 194–205. [DOI] [PubMed] [Google Scholar]

[R1] 1000 Genomes Project Consortium. (2015). A global reference for human genetic variation. Nature, 526(7571), 68–74. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] Aschard H, Vilhjálmsson BJ, Joshi AD, Price AL, & Kraft P (2015). Adjusting for heritable covariates can bias effect estimates in genome-wide association studies. The American Journal of Human Genetics, 96(2), 329–339. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, ... & Cortes A (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562(7726), 203–209. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] Che R, Motsinger-Reif AA, & Brown CC (2012). Loss of power in two-stage residual-outcome regression analysis in genetic association studies. Genetic epidemiology, 36(8), 890–894. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] Demissie S, & Cupples LA (2011). Bias due to two-stage residual-outcome regression analysis in genetic association studies. Genetic epidemiology, 35(7), 592–596. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] Frisch R, & Waugh FV (1933). Partial time regressions as compared with individual trends. Econometrica: Journal of the Econometric Society, 387–401. [Google Scholar]

[R7] Greenland S, Pearl J, & Robins JM (1999). Causal diagrams for epidemiologic research. Epidemiology, 37–48. [PubMed] [Google Scholar]

[R8] Kanai M, Akiyama M, Takahashi A, Matoba N, Momozawa Y, Ikeda M, ... & Kubo M (2018). Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nature genetics, 50(3), 390–400. [DOI] [PubMed] [Google Scholar]

[R9] Lee S, Abecasis GR, Boehnke M, & Lin X (2014). Rare-variant association analysis: study designs and statistical tests. The American Journal of Human Genetics, 95(1), 5–23. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] Lovell MC (2008). A simple proof of the FWL theorem. The Journal of Economic Education, 39(1), 88–91. [Google Scholar]

[R11] Mathew T, & Nordström K (1997). Inequalities for the probability content of a rotated ellipse and related stochastic domination results. The Annals of Applied Probability, 7(4), 1106–1117. [Google Scholar]

[R12] McCaw ZR, Lane JM, Saxena R, Redline S, & Lin X (2019). Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies. Biometrics. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] de Micheaux PL, & de Micheaux MPL (2017). Package ‘CompQuadForm’. CRAN Repository. [Google Scholar]

[R14] Monsees GM, Tamimi RM, & Kraft P (2009). Genome-wide association scans for secondary traits using case-control samples. Genetic Epidemiology: The Official Publication of the International Genetic Epidemiology Society, 33(8), 717–728. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] Munafò MR, Tilling K, Taylor AE, Evans DM, & Davey Smith G (2018). Collider scope: when selection bias can substantially influence observed associations. International journal of epidemiology, 47(1), 226–235. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] Niarchou M, Byrne EM, Trzaskowski M, Sidorenko J, Kemper KE, McGrath JJ, ... & Wray NR (2020). Genome-wide association study of dietary intake in the UK biobank study and its associations with schizophrenia and other traits. Translational Psychiatry, 10(1), 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] Randall JC, Winkler TW, Kutalik Z, Berndt SI, Jackson AU, Monda KL, ... & Workalemahu T (2013). Sex-stratified genome-wide association studies including 270,000 individuals show sexual dimorphism in genetic loci for anthropometric traits. PLoS Genet, 9(6), e1003500. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] Robinson LD, & Jewell NP (1991). Some surprising results about covariate adjustment in logistic regression models. International Statistical Review/Revue Internationale de Statistique, 227–240. [Google Scholar]

[R19] Sofer T, Zheng X, Gogarten SM, Laurie CA, Grinde K, Shaffer JR, ... & Lange L (2019). A fully adjusted two-stage procedure for rank-normalization in genetic association studies. Genetic epidemiology, 43(3), 263–275. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] Styrkarsdottir U, Stefansson OA, Gunnarsdottir K, Thorleifsson G, Lund SH, Stefansdottir L, ... & Ivarsdottir EV (2019). GWAS of bone size yields twelve loci that also affect height, BMD, osteoarthritis or fractures. Nature communications, 10(1), 1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] Tachmazidou I, Süveges D, Min JL, Ritchie GR, Steinberg J, Walter K, ... & McCarthy S (2017). Whole-genome sequencing coupled to imputation discovers genetic signals for anthropometric traits. The American Journal of Human Genetics, 100(6), 865–884 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] UK10K consortium. (2015). The UK10K project identifies rare variants in health and disease. Nature, 526(7571), 82–90. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] Vandaele W (1981). Wald, likelihood ratio, and Lagrange multiplier tests as an F test. Economics Letters, 8(4), 361–365. [Google Scholar]

[R24] Xing G, Lin CY, & Xing C (2011). A comparison of approaches to control for confounding factors by regression models. Human heredity, 72(3), 194–205. [DOI] [PubMed] [Google Scholar]

PERMALINK

Power loss due to testing association between covariate adjusted traits and genetic variants

Pranav Yajnik

Michael Boehnke

Abstract

INTRODUCTION