Abstract
Most human complex phenotypes result from multiple genetic and environmental factors and their interactions. Understanding the mechanisms by which genetic and environmental factors interact offers valuable insights into the genetic architecture of complex traits and holds great potential for advancing precision medicine. The emergence of large population biobanks has led to the development of numerous statistical methods aiming at identifying gene–environment interactions (G × E). In this review, we present state-of-the-art statistical methodologies for G × E analysis. We will survey a spectrum of approaches for single-variant G × E mapping, followed by various techniques for polygenic G × E analysis. We conclude this review with a discussion on the future directions and challenges in G × E research.
Keywords: gene–environment interaction (G × E), precision medicine, statistical genetics
This article is categorized under: Applications of Computational Statistics > Genomics/Proteomics/Genetics, Data: Types and Structure > Massive Data, Statistical Models > Linear Models
1 |. INTRODUCTION
Genetic variations, environmental factors, and their interactions jointly shape human complex phenotypes (Freeman, 1973). Understanding gene–environment interaction (G × E) can provide crucial mechanistic insights into genetic and environmental effect heterogeneity (Li et al., 2019). One recent example of G × E finding in humans is the interaction between BAP1 and asbestos exposure in mesothelioma (Carbone et al., 2020). In Cappadocia, Turkey, an asbestos-like carcinogenic fiber was found in the homes and air of villagers. However, this environmental exposure alone could not explain an epidemic in which 50% of the villagers died of mesothelioma, especially compared with only 4.6% of asbestos miners with at least 10 consecutive years of work (Carbone et al., 2007). Studies of the epidemic in Cappadocia led to the discovery that susceptibility to mesothelioma is passed down through generations in a Mendelian pattern and is caused by a G × E interaction (Carbone et al., 2020). Other notable human G × E examples include the interaction between FTO and physical activity on body mass index (Young et al., 2016), the interaction between SLC2A9 and biological sex on uric acid concentrations (Döring et al., 2008), and the interaction between the genetic risk of obesity and education on health-related outcomes (Barcellos et al., 2018). However, these are the few well-characterized G × E examples in humans. Way too often, the environment is ignored in human complex trait genetic studies. It also largely remains an open problem how such interactions should be modeled, quantified, and estimated in current study designs.
Early investigations into G × E primarily relied on a candidate gene approach (Caspi et al., 2003), which suffered from low replicability (Dick et al., 2015). The emergence of high-throughput genotype data enabled the implementation of genome-wide interaction studies (GWIS), allowing for comprehensive G × E scans across millions of single nucleotide polymorphisms (SNPs; Thomas, 2010). While GWIS improves the replicability and robustness of interaction findings, it poses challenges in terms of the burden of multiple testing, which substantially limits its statistical power (Aschard et al., 2012). To address this limitation, a two-step approach has been employed, involving an initial SNP filtering, followed by testing G × E exclusively on selected SNPs. However, it is now widely recognized that most human complex traits exhibit a highly polygenic nature (Boyle et al., 2017). Consequently, contemporary GWAS analyses have shifted focus away from individual SNPs, embracing methodologies that account for the polygenic nature of these traits (Bulik-Sullivan et al., 2015; Finucane et al., 2015; Miao et al., 2023). Similarly, G × E studies are undergoing a similar transition, placing greater emphasis on understanding how the polygenic basis of a trait varies across different environments (Arold et al., 2022; Barth et al., 2020; Bernabeu et al., 2021; Dahl et al., 2020; Martin et al., 2021; Robinson et al., 2017).
This article aims to comprehensively examine the cutting-edge statistical methods employed in G × E inference using molecular data. In the subsequent sections, we will begin by providing an overview of the fundamental concept of G × E. Then, we will review and compare statistical methodologies utilized for single-variant G × E studies as well as polygenic G × E inference. We will conclude this article by discussing the prospects and prevailing challenges of G × E analysis. Figure 1 presents a decision tree to guide the selection of an appropriate method for G × E inference. Table 1 offers the links to the methods and tools in this review.
FIGURE 1.

A decision tree for guiding the selection of appropriate methods for G × E inference. The decision node, depicted by the yellow block, serves as the starting point for determining the suitable approach. The blue blocks represent the specific methods to be employed based on the decision made at the decision node.
TABLE 1.
Software tools for G × E methods reviewed in this article.
| Category | Methods | Study design | Software |
|---|---|---|---|
| Single-variant G × E | Linear regression | Cohort study | https://www.cog-genomics.org/plink/2.0/assoc#glm |
| Logistic regression | Cohort and case–control study | https://www.cog-genomics.org/plink/2.0/assoc#glm | |
| Case-only approach | Cohort, case-only, and case–control study | https://bioconductor.org/packages/release/bioc/html/CGEN.html | |
| Empirical Bayes-type approach | Cohort and case–control study | https://bioconductor.org/packages/release/bioc/html/CGEN.html | |
| QUAIL | Cohort study | https://github.com/qlu-lab/QUAIL | |
| LEMMA | Cohort and case–control study | https://github.com/mkerin/LEMMA | |
| StructLMM | Cohort and case–control study | https://github.com/limix/struct-lmm | |
| GCI-GREML | Cohort and case–control study | https://bio.tools/mtg2 | |
| SPAGE | Cohort and case–control study | https://github.com/WenjianBI/SPAGE | |
| fastGWA-GE | Cohort and case–control study | https://yanglab.westlake.edu.cn/software/gcta/#fastGWA-GE | |
| Polygenic G × E | MRNM | Cohort and case–control study | https://bio.tools/mtg2 |
| G × EMM | Cohort and case–control study | https://github.com/andywdahl/gxemm | |
| PIGEON | GWIS summary statistics from cohort and case–control study | https://github.com/qlu-lab/PIGEON |
2 |. DEFINING G × E
G × E is commonly defined as the joint effect of genetics and the environment that deviates from their individual additive effects (Freeman, 1973). This concept is illustrated in Figure 2. Figure 2a illustrates a scenario where carrying an additional A allele produces a consistent change in the phenotype across different environments, suggesting the absence of G × E. Conversely, in Figure 2b, the presence of an additional A allele leads to varying changes in the phenotype across the environments, indicating the presence of G × E.
FIGURE 2.

An illustration of G × E. Each data point represents an individual, where the -axis denotes the genotype, and the -axis represents the phenotype value. Three different colors indicate three distinct environments. (a) In the absence of G × E, the genetic effect on the phenotype remains consistent across all three environments, as depicted by the fitted line parallel to each other. (b) In the presence of G × E, the genetic effect on the phenotype varies across the three environments. The largest effect is observed in the environment highlighted in yellow. This variation is illustrated by the fitted lines not being parallel to each other, indicating that the relationship between genotype and phenotype depends on the environment.
The estimation problem for G × E can be formulated within the following statistical framework. We consider a generalized linear model that incorporates the main effects of genetics and the environment, along with their interaction:
where subscript denotes the individual, is the link function, is the phenotype, is an environmental factor, is the additively coded genotype of a SNP with values 0, 1, or 2. Here, the interaction coefficient is the estimand of G × E analysis. Hypothesis testing conducted on this coefficient provides evidence regarding the presence of G × E. Additionally, the magnitude of quantifies the extent of G × E between SNP and environment on the phenotype .
2.1 |. Statistical methods to detect single-variant G × E
We first examine the statistical model utilized for single-variant G × E analysis, which aims to identify specific genetic variants that exhibit variability across subgroups defined by the environmental exposures. This model has been extensively employed in the GWIS design.
2.2 |. The basic model
For quantitative traits, a linear interaction model is typically used to model G × E, with the form
where the interaction coefficient measures the G × E effects. Software such as plink (Purcell et al., 2007) and Regenie (Mbatchou et al., 2021), can be used to implement computationally efficient GWIS, repeating the regression analysis for millions of SNPs in the genome.
For binary traits, several methods have been proposed to investigate G × E effects. One commonly used approach is implemented through logistic regression in case–control designs:
Testing the hypothesis in this model assesses the presence of SNP × E interaction. However, it is recognized that this method has limited statistical power (Piegorsch et al., 1994). An alternative approach known as the case-only method has been developed (Piegorsch et al., 1994). This method relies on the assumption of independence between and in the underlying population. Essentially, under this assumption, the interaction test statistic in logistic regression becomes equivalent to testing the association between and among disease cases. In addition, under G–E independence, case-only analysis is known to be more efficient for estimating G × E compared with case–control analysis which is a more robust alternative not relying on G–E independence (Gauderman et al., 2017). However, the case-only method has a notable limitation: when the independence assumption is violated, it can lead to an inflated type-I error (Albert et al., 2001). Besides, directly testing the G–E independence assumption is challenging due to constraints in statistical power. To strike a balance between robustness and statistical power, empirical Bayes-type (EB) approaches have been introduced (Mukherjee & Chatterjee, 2008). The EB approach computes a weighted average of the case–control and case-only estimators of the G × E, exploiting a trade-off between bias and efficiency. The weights are estimated from the data by a stochastic framework to account for uncertainty in the assumption of G–E independence. A main advantage of this approach is that users can bypass the need to test the G–E independence assumption. Following this work, a more general method for deriving EB estimates has been proposed for all parameters of the general logistic regression model (Chen et al., 2009). A Bayesian model averaging framework has also been introduced to weight case–control and case-only estimates using posterior probabilities (Li & Conti, 2009). In conclusion, while logistic regression remains the most widely used method for case–control studies, alternative methods have been developed to improve efficiency under a range of weak or strong assumptions.
In the case of genome-wide G × E scans, several methods have been proposed to address various challenges. SPAGE provides a computationally efficient approach to deal with unbalanced case–control ratios (Bi et al., 2019). This approach fits a genotype-independent logistic regression model only once in the genome-wide analysis to reduce computational cost and uses saddle point approximation to calibrate the test statistics for binary outcomes with unbalanced case–control ratios. In addition, fastGWA-GE is a linear mixed model-based method for efficient and accurate genome-wide G × E analysis (Zhong et al., 2023). It controls for relatedness either through pedigree information or through a sparse genetic relationship matrix, and the effects of population stratification are captured using SNP-derived principal components.
2.3 |. Two-step approaches and screening by vQTL
While the GWIS design enhances the replicability and robustness of G × E findings compared with candidate gene analysis, it also introduces a substantial burden of multiple testing, which significantly diminishes its statistical power (Aschard et al., 2012). Consequently, several “two-step” approaches have been proposed to improve the efficiency of G × E analysis while maintaining control over type I error. These approaches involve an initial step of SNP filtering, followed by testing for G × E using the selected SNPs at a modified significance level (Gauderman et al., 2017). Among the various two-step strategies, filtering by variance-associated genetic loci (vQTL) has recently been demonstrated to have higher effectiveness (Marderstein et al., 2021; Miao, Lin, et al., 2022).
A vQTL refers to a genetic locus that is associated with phenotypic variability. In the context of G × E analysis, we consider a linear model that incorporates the main effects of genetics and environments, as well as their interaction.
where is an environmental factor following a distribution with mean and variance (i.e., is the additively coded genotype of a SNP with values 0, 1, or 2, and is the error term following a distribution . We assume , and to be mutually independent. Under this model, the conditional variance of given the genotype is
This suggests that the conditional variance of may differ across genotype groups of if the genetic variant is involved in interactions. Thus, a genetic variant showing SNP × E interaction is associated with the variance of the phenotype, that is, a vQTL, and one can use vQTL to screen for candidate SNP × E effects (Miao, Lin, et al., 2022).
Various statistical methods have been proposed to detect vQTLs, including classic non-parametric methods (Wang et al., 2019), full parametric methods (Young et al., 2018), and two-stage methods (Marderstein et al., 2021). However, these methods have certain limitations, such as a lack of robustness to non-Gaussian phenotypes and potential confounding effects on both trait levels and trait variability. To address these limitations, a recent method called QUAIL, a quantile regression-based framework, has been introduced (Miao, Lin, et al., 2022). In QUAIL, if a SNP is a vQTL for trait , the conditional quantile function will exhibit different regression slopes for various quantile levels :
QUAIL defined a quantile-integrated effect.
which aggregates the differences between the regression coefficients of the upper and lower quantile levels (i.e., across all quantile levels. The choice of 0.5 in is to ensure that the quantile integral effect is equal to the difference of quantile-specific effect between all upper quantile levels and all lower quantile levels . This quantile-integrated effect represents the magnitude of the vQTL effect of a SNP. Testing the vQTL effect of a SNP is equivalent to testing the null hypothesis .
QUAIL estimates the quantile-integrated effect using various computational techniques to overcome the computational challenges associated with fitting standard quantile regression procedures that require iterative optimization for numerous quantile levels. It efficiently identifies vQTLs at the genome-wide scale, while overcoming the key limitations of traditional two-stage methods. Simulation studies have demonstrated that QUAIL exhibits greater power compared with other commonly used VQTL approaches when the phenotype is non-Gaussian distributed. In previous simulations, under a 0.4% phenotypic variance explained by G × E, QUAIL showed a 75% probability of rejecting the null hypothesis at a significance level of , while the best alternative methods had a 56% probability. When applied to UK Biobank data, QUAIL identified 11 novel vQTL for BMI (20% of the vQTL identified by all methods tested) compared with other methods (Miao, Lin, et al., 2022).
2.4 |. Modeling multiple environments
The aforementioned approaches are primarily designed for G × E analysis of G × E with a single environmental variable. An alternative approach involves jointly modeling multiple environmental variables, which can potentially enhance statistical power by capturing a combination of variables that serve as proxies for the true underlying driver of interaction. Several methods have been proposed to accommodate the modeling of multiple environments simultaneously, such as StructLMM (Moore et al., 2019) and LEMMA (Kerin & Marchini, 2020). StructLMM is a computationally efficient method that aims to identify genetic variants interacting with one or more environments. It achieves this by modeling the environmental similarity between individuals across multiple environments as a random effect and testing G × E via the variance components test. On the other hand, LEMMA is a Bayesian whole-genome approach that allows for the joint modeling of SNP marginal effects and SNP G × E interaction effects with an environmental score (ES). The ES is a linear combination of multiple environmental variables and can be employed in single-variant-by-ES G × E inference. LEMMA employs variational inference techniques to fit the model, ensuring computational tractability even with large-scale datasets such as large biobanks. By incorporating multiple environmental variables and employing advanced modeling techniques, these approaches provide alternative strategies to enhance the analysis of G × E interactions beyond single environmental variable scenarios.
2.5 |. Statistical method to detect polygenic G × E
Thus far, we primarily focused on interactions between a single genetic variant and the environment. However, it is now known that complex traits are influenced by numerous genetic variants scattered across the genome (Boyle et al., 2017). Therefore, contemporary G × E methods are increasingly emphasizing the investigation of how the polygenic architecture of a trait varies across different environments. In the following sections, we will delve into several G × E methods that have been developed to leverage polygenicity by considering the G × E effects for all SNPs across the genome.
2.6 |. Heritability and genetic correlation-based methods
This class of methods assesses whether the polygenic genetic basis of a trait varies across environments and typically examines the existence of any G × E (Bernabeu et al., 2021; Blokland et al., 2022; Martin et al., 2021). For demonstration, consider the trait under two different environments and . We can express the trait values as follows:
Here, and represent the trait values in environments and , and and denote the effect for SNP in environment and , respectively. and represent the environment-specific noise terms.
Two common approaches are used to test for polygenic G × E effects: differential heritability and imperfect genetic correlation between environments. Based on the above model, we can define the environment-stratified heritability, which represents the proportion of phenotypic variance explained by SNPs, as follows:
Testing for differential heritability is equivalent to testing the null hypothesis . In contrast, the genetic correlation between the environments, can by written as
Testing for imperfect genetic correlation is equivalent to testing the null hypothesis . To perform statistical inference on these parameters, two types of methods are commonly used: the GREML-based approach (Yang et al., 2010) and linkage disequilibrium (LD) score regression (Bulik-Sullivan et al., 2015). Both methods aim to estimate environment-stratified heritability and genetic correlation across environments. GREML provides a more efficient estimator compared with LD score regression but requires individual genotype and phenotype data as input. On the other hand, LD score regression, while less efficient than GREML, only requires environment-stratified GWAS summary statistics as input. Hypothesis testing based on these two approaches is often used to assess the existence of polygenic G × E.
Nevertheless, testing for differential heritability and imperfect genetic correlation may provide flawed results, if the goal is to examine the existence of G × E (Miao, Song, et al., 2022). For instance, heritability estimates may vary across discrete environments merely due to heteroscedasticity (i.e., differences in the residual variance components) in the absence of true G × E, leading to false positive G × E findings. Similarly, genetic correlation analysis between environments has its limitations. A perfect genetic correlation can be observed when the SNP additive effects are proportional between the environments, which is sometimes referred to as “amplification” in the G × E literature (Zhu et al., 2023). This leads to false negative results on testing G × E. Consequently, heritability and genetic correlation analyses are inadequate for proper estimation of polygenic G × E effects. Moreover, some of these approaches face technical challenges such as the inability to handle continuous environmental exposures.
2.7 |. Variance component tests
Another class of methods directly assesses polygenic G × E using variance component tests. The G × E variance, which quantifies the extent to which phenotypic variation in a population is attributable to the interaction between genetic and environmental factors, serves as the target for estimation in these methods. Several approaches have been proposed to estimate G × E variance. One such approach is GCI-GREML, an extension of the GREML approach that focuses on estimating the proportion of phenotypic variance explained by G × E effects (Robinson et al., 2017). However, GCI-GREML assumes a homogeneous residual variance across environments, which may introduce bias in the results (Ni et al., 2019). To address this limitation, the MRNM approach was proposed as an extension of GCI-GREML (Ni et al., 2019). MRNM overcomes this limitation by considering both the genotype–environment correlation (which captures the association between genetic factors and environmental factors) and the residual–environment interaction (which captures the variation in non-genetic factors across different levels of environmental factors). However, MRNM cannot be used to model binary traits and only allows for univariate environments (Dahl et al., 2020). To address these issues, G × EMM was developed (Dahl et al., 2020). G × EMM is a mixed model framework analogous to GREML and is designed to capture the aggregate polygenic contributions of G × E effects. It can be applied to binary traits and general environments, allowing for a broader range of applications compared with MRNM. G × EMM estimates the heritability specific to each discrete environment or environmental extreme in the case of continuous environments.
Recent work suggests that to accurately assess the presence or absence of polygenic G × E, researchers should focus on estimating the G × E variance component, instead of differential heritability and imperfect genetic correlation analysis (Miao, Song, et al., 2022). However, current methods in the literature still have value in certain scenarios. For example, one study highlighted that differential heritability between environments may affect the predictive accuracy of PGS in those environments, raising concerns about the universal applicability of PGS (Mostafavi et al., 2020). Imperfect genetic correlation analysis may be used to measure whether SNP effects are shared across environments (Bernabeu et al., 2021).
2.8 |. Empirical PGS × E analysis
Another method, often referred as empirical PGS × E analysis, has become increasingly popular in the field of G × E research (Biroli et al., 2022; Domingue et al., 2020; Li et al., 2019; Schmitz et al., 2022). This approach involves a two-step procedure that begins by summarizing the genetic predisposition of each individual into a polygenic score (PGS). Subsequently, it examines the interaction between the PGS and the environment using a regression-based framework. The “empirical” in the title indicates the fact that PGS used in this type of analysis is not the true PGS, but a noisy estimate derived from an external GWAS.
A true PGS is typically defined as a weighted sum of effect alleles for a collection of SNPs , where the are the true SNP effects for the SNP and indicates the number of copies of the effect alleles for the SNP (0, 1, or 2) for individual. Based on the true PGS, the data-generating model for PGS × E can be denoted as
where , and represents the main effects of the PGS , environment and the interaction between the PGS and environment, respectively. The estimand of PGS × E is .
In practice, one can only use scores estimated from GWAS (denoted by empirical PGS ) and perform empirical PGS × E analysis. For demonstration, we used the linear regression on quantitative traits as an example. The empirical PGS × E can be denoted as the following:
where is the estimation error in the PGS and . The empirical PGS × E requires no overlap between the GWAS used to construct PGS and the sample for PGS × E analysis. Under this requirement, the hypothesis testing for empirical PGS × E, denoted as is equivalent to the hypothesis testing for PGS × E, . However, this point estimation is substantially affected by the imprecision in PGS estimation due to limited sample sizes in GWAS. Neglecting the inherent uncertainty in the empirical PGS can lead to interaction estimates that are biased toward zero (i.e., . In cases where sample overlap exists between the GWAS and the PGS × E analysis, PGS will overfit the data, resulting in biased estimates of interaction and potential false discoveries in empirical PGS × E analysis (Miao, Song, et al., 2022).
The PGS × E approaches mentioned previously are based on linear regression for quantitative traits. For binary traits, the model for PGS × E analysis can be formulated by replacing the single variant in the logistic regression-based G × E analysis with the true PGS. Case-only methods have also been proposed for PGS × E analysis (Meisner et al., 2019). These methods exploit the G–E independence to enhance the power of logistic regression-based tests. In addition, a recent method simultaneously models G–E correlations and G × E using PGS in case–control studies (Wang et al., 2023). It uses a logistic-normal regression framework to quantify both disease risk and the PGS distribution in the population, and proposes joint inference using the retrospective likelihood of case–control data.
2.9 |. PIGEON
A key issue for all these aforementioned methods for polygenic G × E inference is, although all these approaches are referred to as G × E analysis in the literature, the relationship between these approaches is poorly understood. For example, it is unclear whether PGS × E and differential heritability analysis aim to estimate the same parameter. Consequently, a consistent language to describe the connections and distinctions among these methods is lacking. There is a pressing need for a comprehensive framework for quantifying polygenic G × E.
To bridge this gap, (Miao, Song, et al., 2022) proposed PIGEON, a unified statistical framework designed for estimating polygenic G × E. PIGEON is constructed based on a linear mixed model that captures both the additive effects and G × E effects for many SNPs.
Here, is the standardized phenotype with a mean of 0 and variance of 1 for the individual, is the standardized SNP, is the standardized environment, is the noise term, and quantifies the heteroskedasticity arising from the interaction between the residual variance and the environment (i.e., the variation in residual variance across different environments). Polygenic additive effects and interaction effects (i.e., and ) are modeled as random variables.
In the PIGEON framework, two main objectives are defined for G × E inference: G × E variance and covariant G × E. These two objectives aim to quantify the overall contribution of G × E to the phenotype and provide mechanistic insights into the interaction mechanisms, respectively. The G × E variance is a measure of the overall G × E contribution and is defined as the variance of the interaction effects, represented as , where is the total number of SNPs. Hypothesis testing on this quantity helps determine the evidence of any G × E. The magnitude of the G × E variance component quantifies the extent of G × E for the trait of interest. This is similar to the variance component tests we have introduced above. However, solely having a non-zero G × E variance does not provide detailed mechanistic insights into the interactions. To gain a deeper understanding of the polygenic G × E, PIGEON introduces the concept of covariant G × E, that is, . This measure captures the covariance between SNP additive effects and SNP × E interaction effects across the genome. By examining the correlation between the effects of SNPs on complex traits and their tendency to interact with the environment, the covariant G × E offers valuable insights into the underlying interaction mechanisms at a whole-genome level.
The two major objectives in the PIGEON framework serve as the foundation for quantifying evidence of polygenic G × E and interpreting the underlying mechanisms of these interactions. PIGEON also establishes the connections between existing G × E approaches, allowing us to understand their relationships, distinctions, and limitations. The objectives of differential heritability and imperfect genetic correlation are associated with the G × E variance which primarily focuses on providing evidence for the presence of polygenic G × E. On the other hand, PGS × E analysis is linked to the covariant G × E objective, which aims to interpret the mechanisms underlying polygenic G × E.
As mentioned in the section above, empirical PGS × E analysis is substantially affected by the imprecision in PGS estimation due to limited sample sizes in GWAS. Ignoring the uncertainty in empirical PGS leads to biases toward zero in the interaction coefficient estimates. In contrast, Importantly, PIGEON reveals a direct relationship between the coefficients of the oracle PGS × E analysis (which is based on the true PGS and described in “Empirical PGS × E analysis” section above) denoted as from and the covariant G × E represented by
where is the additive heritability of the outcome of interest. The oracle PGS × E represents an upper bound and infinite sample limit of empirical PGS × E, analogous to heritability being the upper bound of PGS predictive in the GWAS literature. Therefore, as a superior alternative to commonly used PGS × E analysis, estimating covariant G × E through variance component analysis provides a more reliable approach to quantifying and interpreting polygenic G × E effects.
PIGEON also introduces an estimation strategy called PIGEON LDSC which allows for the estimation of polygenic G × E effects using only GWIS and GWAS summary statistics as the input. It is unbiased, computationally efficient, and robust to sample overlap, heteroscedasticity, and gene–environment correlation. Two methods can be used to estimate the G × E variance component: PIGEON and G×Esum (Shin & Lee, 2021). Both methods utilizes the -scores for SNP × E interaction effects obtained from GWIS summary statistics. The PIGEON LDSC method calculates the expected value of the squared -score for each SNP × E interaction effect, denoted as . Regressing the squared -scores on the LD score provides the estimator of the G × E variance component accurately:
where denotes the GWIS sample size, is the G × E variance component, is the number of SNPs, is a correction factor to account for the environmental effect on -score approximation in GWIS, is the -score of environmental effect, is the LD score, and is the kurtosis of the environment.
To estimate covariant G × E, PIGEON LDSC only requires GWIS and GWAS summary statistics with arbitrary sample overlap. The expected value of the product of additive effect -scores and SNP × E effect -scores is
where is the covariant G × E, is the number of SNPs, is a correction factor described above, is the -score of environment effects, is the LD score, and represents the GWIS and GWAS sample size, is the number of overlapped samples between GWIS and GWAS analysis, is the skewness of the environment. The oracle PGS × E coefficient can be obtained by normalizing the covariant G × E by heritability.
With this approach, it has become possible to only use GWIS and GWAS summary statistics to perform G × E inference, especially PGS × E analysis. The most important feature is its robustness to sample overlap. In traditional G × E approaches, the presence of shared samples between GWAS and G × E cohorts renders PGS generation and subsequent PGS × E analysis impossible, due to the concern of inflated type-I error. With PIGEON, unbiased estimates for covariant G × E and oracle PGS × E can now be obtained, regardless of sample overlap. Another advantage of PIGEON is its ability to facilitate hypothesis-free scans for PGS × E. Unlike most studies that define PGS based on the same outcome used in G × E analysis, PIGEON offers a superior strategy. It allows researchers to perform GWIS in large samples through meta-analysis and then examine its genetic correlation with multiple published GWAS. This strategy eliminates the concerns regarding whether the GWAS and GWIS were conducted on the same or distinct samples, allowing for a comprehensive assessment of PGS × E effects across multiple PGS.
2.10 |. Gene–environment correlation in G × E research
The environments in many G × E applications can be partially endogenous, creating complex patterns of gene–environment correlations (rGE) that pose inferential challenges for identifying G × E (Jaffee & Price, 2007). In many applications of G × E, it is of great interest to ensure the exogeneity of E (Barcellos et al., 2018; Schmitz & Conley, 2017; Zhu et al., 2023). When G × E analysis is performed on observational data, some studies go to great lengths to use instrumental variables or other approaches (Barcellos et al., 2018; Schmitz & Conley, 2017). Some studies focus on the interaction between SNPs in autosomes and sex (Blokland et al., 2022; Zhu et al., 2023), while other studies ignore the potential correlation between genes and environment (Robinson et al., 2017). For single-variant G × E analysis, the rGE can lead to spurious G × E results (Dudbridge & Fletcher, 2014). Sensitivity analyses have been suggested to reduce the possibility of false-positive reports of interaction (Dudbridge & Fletcher, 2014). For polygenic G × E analysis, a much weaker condition than G–E independence, that is, zero correlation between additive SNP effects on environment and SNP × E effects on outcome, is sufficient to obtain unbiased estimates and well-controlled false-positive rates in polygenic G × E inference (Miao, Song, et al., 2022). If this weak condition is violated, a solution is also proposed to correct for biases introduced by rGE (Miao, Song, et al., 2022). In conclusion, it is important to recognize the impact of rGE on G × E inference and to select appropriate statistical methods and sensitivity analyses to reduce false-positive findings.
3 |. CONCLUSION
Despite the long-standing interest in G × E, our understanding of its contribution to human complex traits and diseases remains limited. The study of G × E interactions presents statistical challenges due to the high dimensionality of genetic information and environmental exposures, the need for large sample sizes to reliably detect G × E effects, and the scarcity of large datasets that combine genetic and environmental data. In this article, we aim to review the current state-of-the-art statistical methods on G × E inference. We discussed two main categories of methods designed for single-variant analysis and polygenic G × E analysis. These approaches represent a tradeoff between statistical power and resolution of findings. Single-variant analysis, while having lower power, offers higher resolution, making it valuable for identifying specific genetic variants involved in G × E interactions. In contrast, polygenic G × E analysis, with its higher power but lower resolution, aims to leverage genome-wide information to provide an understanding of the collective contribution and interpretation of G × E effects across all SNPs. These methods have laid the foundation for identifying and interpreting robust G × E interactions and hold significant potential for broad applications in many disciplines.
There are several important directions for future G × E methodological research. Firstly, the remarkable success of complex trait genetics, particularly through large-scale GWAS meta-analyses and the sharing of summary association statistics, has revolutionized how we approach genotype–phenotype associations. In contrast, G × E analysis still predominantly relies on small cohorts with individual-level genetic, exposure, and outcome data to date. To propel the field forward, it is crucial to meta-analyze and widely share GWIS summary statistics. Future developments in G × E methodology should prioritize approaches that leverage summary-level data, enabling more extensive collaboration and data integration efforts. Secondly, a lingering challenge is accurately quantifying the overall contribution of G × E across all relevant environments. The current G × E studies rely on single or several hypothesized environmental exposures. Understanding the cumulative impact of G × E across all possible environments remains an open question. Thirdly, unraveling the functional interpretation of G × E interactions presents another intriguing challenge. Identifying the underlying mechanisms and biological pathways through which genes and environmental factors interact to influence complex traits is essential for gaining deeper insights. Future research should strive to develop innovative methodologies that integrate genetic, environmental, and functional genomic data to elucidate the functional implications of G × E interactions. Fourth, the current gold-standard of PGS × E analysis involves performing regression within families, which effectively eliminates all bias from population stratification, environmental confounding, assortative mating, and other sources (Biroli et al., 2022). It is an open question whether the methods discussed might have biases for the reasons mentioned above when applied to population-level data, and if so, whether they can be used for within-family analyses. Fifth, it is important to recognize that the statistical methods discussed in this review all focus on G × E analyses using molecular genetic data. Before the advent of molecular genetic data made possible by advances in sequencing technologies, researchers used twin- and family-based analyses to study G × E (Dick, 2011). These methods relied on latent, unobserved indices of genetic influence to detect the presence of G × E. An example from twin studies is the analysis of monozygotic twins raised apart, which examines the effect of different environments on some traits with identical genotypes (Bergeman et al., 1988). Another example is differential heritability analysis, which compares twin-based heritability across different environments (Turkheimer et al., 2003). A valuable avenue for future research would be to compare the results and models from twin- and family-based studies with those from molecular analyses in the context of G × E to better understand the strengths and limitations of each study design.
ACKNOWLEDGMENTS
The authors gratefully acknowledge research support from National Institutes of Health (NIH) grant U01 HG012039 and support from the University of Wisconsin-Madison Office of the Chancellor and the Vice Chancellor for Research and Graduate Education with funding from the Wisconsin Alumni Research Foundation (WARF). We also acknowledge use of the facilities of the Center for Demography of Health and Aging at the University of Wisconsin-Madison, funded by NIA Center Grant P30 AG017266.
Footnotes
CONFLICT OF INTEREST STATEMENT
The authors declare no conflict of interests.
OPEN RESEARCH BADGES
This article has earned an Open Data badge for making publicly available the digitally-shareable data necessary to reproduce the reported results.
DATA AVAILABILITY STATEMENT
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
REFERENCES
- Albert PS, Ratnasinghe D, Tangrea J, & Wacholder S (2001). Limitations of the case-only Design for Identifying Gene-Environment Interactions. American Journal of Epidemiology, 154(8), 687–693. 10.1093/aje/154.8.687 [DOI] [PubMed] [Google Scholar]
- Arold BW, Hufe P, & Stoeckli M (2022). Genetic endowments, educational outcomes and the mediating influence of school investments. CESifo Working Paper No. 9841. [Google Scholar]
- Aschard H, Lutz S, Maus B, Duell EJ, Fingerlin TE, Chatterjee N, Kraft P, & van Steen K (2012). Challenges and opportunities in genome-wide environmental interaction (GWEI) studies. Human Genetics, 131(10), 1591–1613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barcellos SH, Carvalho LS, & Turley P (2018). Education can reduce health differences related to genetic risk of obesity. Proceedings of the National Academy of Sciences, 115(42), E9765–E9772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barth D, Papageorge NW, & Thom K (2020). Genetic endowments and wealth inequality. Journal of Political Economy, 128(4), 1474–1522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bergeman C, Plomin R, McClearn G, Pedersen NL, & Friberg LT (1988). Genotype€ nvironment interaction in personality development: Indentical twins reared apart. Psychology and Aging, 3(4), 399–406. [DOI] [PubMed] [Google Scholar]
- Bernabeu E, Canela-Xandri O, Rawlik K, Talenti A, Prendergast J, & Tenesa A (2021). Sex differences in genetic architecture in the UK Biobank. Nature Genetics, 53(9), 1283–1289. [DOI] [PubMed] [Google Scholar]
- Bi W, Zhao Z, Dey R, Fritsche LG, Mukherjee B, & Lee S (2019). A fast and accurate method for genome-wide scale phenome-wide G × E analysis and its application to UK biobank. The American Journal of Human Genetics, 105(6), 1182–1192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biroli P, Galama TJ, von Hinke S, van Kippersluis H, Rietveld CA, & Thom K (2022). The economics and econometrics of gene-environment interplay. arXiv Preprint. 10.48550/arXiv.2203.00729 [DOI] [Google Scholar]
- Blokland GAM, Grove J, Chen C-Y, Cotsapas C, Tobet S, Handa R, Schizophrenia Working Group of the Psychiatric Genomics Consortium, St Clair D, Lencz T, Mowry BJ, Periyasamy S, Cairns MJ, Tooney PA, Wu JQ, Kelly B, Kirov G, Sullivan PF, Corvin A, Riley BP, … Goldstein JM (2022). Sex-dependent shared and nonshared genetic architecture across mood and psychotic disorders. Biological Psychiatry, 91(1), 102–117. 10.1016/j.biopsych.2021.02.972 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boyle EA, Li YI, & Pritchard JK (2017). An expanded view of complex traits: From polygenic to Omnigenic. Cell, 169(7), 1177–1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bulik-Sullivan BK, Loh P-R, Finucane HK, Ripke S, Yang J, Patterson N, Daly MJ, Price AL, & Neale BM (2015). LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature Genetics, 47(3), 291–295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carbone M, Arron ST, Beutler B, Bononi A, Cavenee W, Cleaver JE, Croce CM, D’Andrea A, Foulkes WD, Gaudino G, Groden JL, Henske EP, Hickson ID, Hwang PM, Kolodner RD, Mak TW, Malkin D, Monnat RJ Jr., Novelli F, … Yang H (2020). Tumour predisposition and cancer syndromes as models to study gene–environment interactions. Nature Reviews Cancer, 20(9), 533–549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carbone M, Emri S, Dogan AU, Steele I, Tuncer M, Pass HI, & Baris YI (2007). A mesothelioma epidemic in Cappadocia: Scientific developments and unexpected social outcomes. Nature Reviews Cancer, 7(2), 147–154. [DOI] [PubMed] [Google Scholar]
- Caspi A, Sugden K, Moffitt TE, Taylor A, Craig IW, Harrington H, McClay J, Mill J, Martin J, Braithwaite A, & Poulton R (2003). Influence of life stress on depression: Moderation by a polymorphism in the 5-HTT gene. Science, 301(5631), 386–389. [DOI] [PubMed] [Google Scholar]
- Chen Y-H, Chatterjee N, & Carroll RJ (2009). Shrinkage estimators for robust and efficient inference in haplotype-based case-control studies. Journal of the American Statistical Association, 104(485), 220–233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dahl A, Nguyen K, Cai N, Gandal MJ, Flint J, & Zaitlen N (2020). A robust method uncovers significant context-specific heritability in diverse complex traits. The American Journal of Human Genetics, 106(1), 71–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dick DM (2011). Gene-environment interaction in psychological traits and disorders. Annual Review of Clinical Psychology, 7, 383–409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dick DM, Agrawal A, Keller MC, Adkins A, Aliev F, Monroe S, Hewitt JK, Kendler KS, & Sher KJ (2015). Candidate gene-environment interaction research: Reflections and recommendations. Perspectives on Psychological Science, 10(1), 37–59. 10.1177/1745691614556682 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Domingue BW, Trejo S, Armstrong-Carter E, & Tucker-Drob EM (2020). Interactions between polygenic scores and environments: Methodological and conceptual challenges. Sociological Science, 7, 465–486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Döring A, Gieger C, Mehta D, Gohlke H, Prokisch H, Coassin S, Fischer G, Henke K, Klopp N, Kronenberg F, Paulweber B, Pfeufer A, Rosskopf D, Völzke H, Illig T, Meitinger T, Wichmann HE, & Meisinger C (2008). SLC2A9 influences uric acid concentrations with pronounced sex-specific effects. Nature Genetics, 40(4), 430–436. [DOI] [PubMed] [Google Scholar]
- Dudbridge F, & Fletcher O (2014). Gene-environment dependence creates spurious gene-environment interaction. The American Journal of Human Genetics, 95(3), 301–307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh P-R, Anttila V, Xu H, Zang C, Farh K, Ripke S, Day FR, ReproGen Consortium; Schizophrenia Working Group of the Psychiatric Genomics Consortium; RACI Consortium, Purcell S, Stahl E, Lindstrom S, Perry JR, Okada Y, Raychaudhuri S, … Price AL (2015). Partitioning heritability by functional annotation using genome-wide association summary statistics. Nature Genetics, 47(11), 1228–1235. 10.1038/ng.3404 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Freeman G (1973). Statistical methods for the analysis of genotype-environment interactions. Heredity, 31(3), 339–354. [DOI] [PubMed] [Google Scholar]
- Gauderman WJ, Mukherjee B, Aschard H, Hsu L, Lewinger JP, Patel CJ, Witte JS, Amos C, Tai CG, Conti D, Torgerson DG, Lee S, & Chatterjee N (2017). Update on the state of the science for analytical methods for gene-environment interactions. American Journal of Epidemiology, 186(7), 762–770. 10.1093/aje/kwx228 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jaffee SR, & Price TS (2007). Gene–environment correlations: A review of the evidence and implications for prevention of mental illness. Molecular Psychiatry, 12(5), 432–442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kerin M, & Marchini J (2020). Inferring gene-by-environment interactions with a Bayesian whole-genome regression model. The American Journal of Human Genetics, 107(4), 698–713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li D, & Conti DV (2009). Detecting gene-environment interactions using a combined case-only and case-control approach. American Journal of Epidemiology, 169(4), 497–504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li J, Li X, Zhang S, & Snyder M (2019). Gene-environment interaction in the era of precision medicine. Cell, 177(1), 38–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marderstein AR, Davenport ER, Kulm S, van Hout CV, Elemento O, & Clark AG (2021). Leveraging phenotypic variability to identify genetic interactions in human phenotypes. The American Journal of Human Genetics, 108(1), 49–67. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin J, Khramtsova EA, Goleva SB, Blokland GA, Traglia M, Walters RK, Hübel C, Coleman JRI, Breen G, Børglum AD, Demontis D, Grove J, Werge T, Bralten J, Bulik CM, Lee PH, Mathews CA, Peterson RE, Winham SJ, … Davis LK (2021). Examining sex-differentiated genetic effects across neuropsychiatric and behavioral traits. Biological Psychiatry, 89(12), 1127–1137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mbatchou J, Barnard L, Backman J, Marcketta A, Kosmicki JA, Ziyatdinov A, Benner C, O’Dushlaine C, Barber M, Boutkov B, Habegger L, Ferreira M, Baras A, Reid J, Abecasis G, Maxwell E, & Marchini J (2021). Computationally efficient whole-genome regression for quantitative and binary traits. Nature Genetics, 53(7), 1097–1103. 10.1038/s41588-021-00870-7 [DOI] [PubMed] [Google Scholar]
- Meisner A, Kundu P, & Chatterjee N (2019). Case-only analysis of gene-environment interactions using polygenic risk scores. American Journal of Epidemiology, 188(11), 2013–2020. 10.1093/aje/kwz175 [DOI] [PubMed] [Google Scholar]
- Miao J, Guo H, Song G, Zhao Z, Hou L, & Lu Q (2023). Quantifying portable genetic effects and improving cross-ancestry genetic prediction with GWAS summary statistics. Nature Communications, 14(1), 832. 10.1038/s41467-023-36544-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miao J, Lin Y, Wu Y, Zheng B, Schmitz LL, Fletcher JM, & Lu Q (2022). A quantile integral linear model to quantify genetic effects on phenotypic variability. Proceedings of the National Academy of Sciences, 119(39), e2212959119. 10.1073/pnas.2212959119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miao J, Song G, Wu Y, Hu J, Wu Y, Basu S, Andrews JS, Schaumberg K, Fletcher JM, Schmitz LL, & Lu Q (2022). Reimagining gene-environment interaction analysis for human complex traits. bioRxiv Preprints. 10.1101/2022.12.11.519973 [DOI] [Google Scholar]
- Moore R, Casale FP, Jan Bonder M, Horta D, Franke L, Barroso I, & Stegle O (2019). A linear mixed-model approach to study multivariate gene–environment interactions. Nature Genetics, 51(1), 180–186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mostafavi H, Harpak A, Agarwal I, Conley D, Pritchard JK, & Przeworski M (2020). Variable prediction accuracy of polygenic scores within an ancestry group. eLife, 9, e48376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mukherjee B, & Chatterjee N (2008). Exploiting gene-environment independence for analysis of case-control studies: An empirical Bayes-type shrinkage estimator to trade-off between bias and efficiency. Biometrics, 64(3), 685–694. 10.1111/j.1541-0420.2007.00953.x [DOI] [PubMed] [Google Scholar]
- Ni G, van der Werf J, Zhou X, Hyppönen E, Wray NR, & Lee SH (2019). Genotype–covariate correlation and interaction disentangled by a whole-genome multivariate reaction norm model. Nature Communications, 10(1), 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Piegorsch WW, Weinberg CR, & Taylor JA (1994). Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies. Statistics in Medicine, 13(2), 153–162. 10.1002/sim.4780130206 [DOI] [PubMed] [Google Scholar]
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, & Sham PC (2007). PLINK: A tool set for whole-genome association and population-based linkage analyses. The American Journal of Human Genetics, 81(3), 559–575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson MR, English G, Moser G, Lloyd-Jones LR, Triplett MA, Zhu Z, Nolte IM, van Vliet-Ostaptchouk JV, Snieder H, LifeLines Cohort Study, Esko T, Milani L, Mägi R, Metspalu A, Magnusson PKE, Pedersen NL, Ingelsson E, Johannesson M, Yang J, … Visscher PM (2017). Genotype–covariate interaction effects and the heritability of adult body mass index. Nature Genetics, 49(8), 1174–1181. [DOI] [PubMed] [Google Scholar]
- Schmitz LL, & Conley D (2017). The effect of Vietnam-era conscription and genetic potential for educational attainment on schooling outcomes. Economics of Education Review, 61, 85–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmitz LL, Zhao W, Ratliff SM, Goodwin J, Miao J, Lu Q, Guo X, Taylor KD, Ding J, Liu Y, Levine M, & Smith JA (2022). The socioeconomic gradient in epigenetic ageing clocks: Evidence from the multi-ethnic study of atherosclerosis and the health and retirement study. Epigenetics, 17(6), 589–611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shin J, & Lee SH (2021). GxEsum: A novel approach to estimate the phenotypic variance explained by genome-wide GxE interaction based on GWAS summary statistics for biobank-scale data. Genome Biology, 22(1), 1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomas D (2010). Gene–environment-wide association studies: Emerging approaches. Nature Reviews Genetics, 11(4), 259–272. 10.1038/nrg2764 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Turkheimer E, Haley A, Waldron M, d’Onofrio B, & Gottesman II (2003). Socioeconomic status modifies heritability of IQ in young children. Psychological Science, 14(6), 623–628. [DOI] [PubMed] [Google Scholar]
- Wang H, Zhang F, Zeng J, Wu Y, Kemper KE, Xue A, Zhang M, Powell JE, Goddard ME, Wray NR, Visscher PM, McRae AF, & Yang J (2019). Genotype-by-environment interactions inferred from genetic effects on phenotypic variability in the UK biobank. Science Advances, 5(8), eaaw3538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Z, Shi W, Carroll RJ, & Chatterjee N (2023). Joint modeling of gene-environment correlations and interactions using polygenic risk scores in case-control studies. bioRxiv Preprints. 10.1101/2023.02.14.528572 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR, Madden PA, Heath AC, Martin NG, Montgomery GW, Goddard ME, & Visscher PM (2010). Common SNPs explain a large proportion of the heritability for human height. Nature Genetics, 42(7), 565–569. 10.1038/ng.608 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Young AI, Wauthier F, & Donnelly P (2016). Multiple novel gene-by-environment interactions modify the effect of FTO variants on body mass index. Nature Communications, 7(1), 12724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Young AI, Wauthier FL, & Donnelly P (2018). Identifying loci affecting trait variability and detecting interactions in genome-wide association studies. Nature Genetics, 50(11), 1608–1614. [DOI] [PubMed] [Google Scholar]
- Zhong W, Chhibber A, Luo L, Mehrotra DV, & Shen J (2023). A fast and powerful linear mixed model approach for genotype-environment interaction tests in large-scale GWAS. Briefings in Bioinformatics, 24(1), bbac547. [DOI] [PubMed] [Google Scholar]
- Zhu C, Ming MJ, Cole JM, Edge MD, Kirkpatrick M, & Harpak A (2023). Amplification is the primary mode of gene-by-sex interaction in complex human traits. Cell Genomics, 3(5), 100297. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
