Summary
Understanding the complex interplay of genetic and environmental factors in disease etiology and the role of gene-environment interactions (GEIs) across human development stages is important. We review the state of GEI research, including challenges in measuring environmental factors and advantages of GEI analysis in understanding disease mechanisms. We discuss the evolution of GEI studies from candidate gene-environment studies to genome-wide interaction studies (GWISs) and the role of multi-omics in mediating GEI effects. We review advancements in GEI analysis methods and the importance of large-scale datasets. We also address the translation of GEI findings into precision environmental health (PEH), showcasing real-world applications in healthcare and disease prevention. Additionally, we highlight societal considerations in GEI research, including environmental justice, the return of results to participants, and data privacy. Overall, we underscore the significance of GEI for disease prediction and prevention and advocate for integrating the exposome into PEH omics studies.
Keywords: gene-environment interactions, precision environmental health, epidemiology, genetics, environmental exposures, epigenetics, environmental justice
Graphical abstract

Motsinger-Reif et al. review the state of research investigating interactions between genetics and environmental factors, highlighting the translation of findings into precision environmental health. They discuss the need for new detection techniques due to the use of large-scale datasets comprising data from multiple sources; outline challenges in the field related to measuring environmental factors, environmental justice, and data privacy; and deliberate on the advantages of gene-environment interactions for understanding disease mechanisms.
Introduction
Multiple, often interacting, genetic and environmental risk factors contribute to the development of diseases, with an estimated 50%–80% of overall disease etiology based on environmental factors.1 Gene-environment interactions (GEIs) vary based on a person’s genetic makeup and the nature of their environmental exposures. The involvement of GEI in disease etiology also varies across time and development stages, with significantly different effects on gene regulation over a lifetime, especially prenatally and during early childhood development.1 An increasing body of evidence demonstrates the significance of GEI, with a growing number of replicated interactions, including BRCA-1 associated protein-1 (BAP1) mutations and asbestos exposure for mesothelioma,2 chromodomain helicase DNA-binding protein 8 (CHD8) and pesticide exposure for autism spectrum disorder,3 fat mass and obesity-associated gene (FTO) and physical activity for obesity,4 and dopamine receptor D4 (DRD4) and parenting style for attention-deficit/hyperactivity disorder.5 In the context of precision health, several noteworthy GEIs that are relevant in various aspects of healthcare have been identified. These include the impact of gene polymorphisms in metabolic enzymes on health outcomes,6 the relationship between genetic factors and therapeutic response in drug efficacy,7 the influence of GEI on drug-drug interactions,8 and the association of GEI and drug toxicity.9
As reviewed in Baccarelli et al.,10 precision environmental health (PEH) employs genetic, epigenetic, environmental, and system-level data to elucidate the underlying environmental causes of disease, identify biomarkers of exposure, and develop new prevention and intervention strategies. Incorporating personalized information on genetic susceptibilities and environmental exposures enables improvements in the accuracy and efficacy of risk assessment, disease prevention, and interventions, with the ultimate goal of reducing the burden of environmentally induced diseases and health disparities. Figure 1 outlines the role of omic and exposure data in PEH. Advancements in PEH are anticipated to lead to more targeted and effective public health initiatives, healthcare strategies, and environmental policies that will foster healthier populations and environments.
Figure 1.
PEH
PEH uses available biological and exposure data to assess individual risk and tailor interventions. These data consist of omic data (blue box); data on internal exposures from diet, lifestyle factors, stress, and pharmacological agents; as well as data on prototypical external exposures from home, air, water, and social determinates of health (yellow boxes). Rigorous statistical machine-learning methods are used to winnow down the features that contribute to explaining disease likelihood (green boxes). This in turn enables differential treatment based on individual omic and exposure differences.
GEIs are foundational to PEH because they highlight the intricate interplay between genetic factors and environmental exposures in determining human health and disease risk. Individual reactions to environmental factors vary: what is harmless to one person may be detrimental to another due to genetic differences. Understanding GEI helps accurately predict individual susceptibilities, thus improving risk assessments. Unveiling the underlying biological mechanisms of diseases paves the way for health recommendations and effective interventions tailored to the individual.
Additionally, scrutinizing GEI engenders a holistic view of health determinants that does not oversimplify outcomes as the result of genetics or environmental exposures alone. Taking a comprehensive view aids in developing treatment approaches and prevention strategies, especially for those with genetic vulnerabilities to environmental hazards. This is a vital step in addressing health disparities as some populations have increased susceptibility due to the disproportionate number of exposures they experience and may have different susceptibilities due to genetics. Understanding GEI comprehensively ensures a more inclusive and personalized approach to environmental health.
In this manuscript, we review the current state of the field regarding GEI. We discuss the opportunities and obstacles involved in measuring environmental data; the advantages and challenges associated with interrogating interactions; and the interplay of the environment and genomics, transcriptomics, proteomics, lipidomics, metabolomics, and epigenetics. We outline approaches to GEI analysis and recent advancements, including emerging methods and sources to incorporate biological knowledge into analyses. We review known interactions and discuss how results are being translated into clinical practice. Further, we discuss the social impacts of GEI research regarding environmental justice, the return of results to cohort participants, and concerns surrounding data privacy in light of the increasing use and availability of geospatial data. Finally, we place the reviewed material into a PEH framework.
Gene-environment interactions
For this manuscript, we use a broad, inclusive definition of GEI and acknowledge that statistical definition of GEI is an area of active debate. A GEI occurs when exposure to an environmental factor affects the risk of developing a disease due to differences in an individual’s genetic makeup. Conversely, the effect of a genetic variant on a phenotype can also be influenced by exposures. The statistical definition determines the existence or absence of a GEI based on either an additive or multiplicative scale of measurement within linear modeling frameworks. There are several considerations when determining a suitable measurement scale for GEI, such as the primary goal of the inquiry (e.g., identifying causes of disease, predicting public health trends) and the proposed pathophysiologic model. Figure 2 demonstrates a general framework for GEI.
Figure 2.
Gene-environment contributions to trait variance
Given a genetic locus of interest (x axis) and an exposure of interest (exposed = blue), the ways in which they can be related to a trait of interest (y axis) are illustrated.
(A) Neither the genetic region of interest nor the exposure explain variance in the trait of interest.
(B) The genotype contributes to trait variance while the exposure does not.
(C) The exposure contributes to trait variance while the genotype does not.
(D) In the absence of exposure information, the genotype does not contribute to trait variance, but there is a gene-by-exposure interaction.
(E) Both the genotype and exposure contribute to trait variance as well as their interaction.
(F) Both the genotype and exposure contribute to trait variance but without a gene-environment interaction.
In traditional statistical frameworks, there is intricate discussion on GEI, with some statistical geneticists expressing uncertainty about the importance of these interactions. One issue is the seemingly minor impacts of GEI compared to the impacts of genetics or environmental exposures alone. Recent research has attempted to quantify interactions, disputing the idea that the impact of GEI on phenotypic variation is insignificant.11 Ottman discusses traditional statistical frameworks in detail.12
Understanding the significance of GEI requires a broad view that emphasizes the intricate nature of biological systems and considers the combined effects of genetic and environmental influences on trait expression and disease susceptibility, even in the absence of statistically significant GEI. Further, it is essential to recognize that the occurrence of GEI does not necessarily indicate molecular interactions inside cells. While it is commonly held that a GEI is significant only if it has a direct impact on molecular mechanisms, GEI may influence phenotypic or disease outcomes without direct molecular interactions. For instance, expression quantitative trait loci (eQTLs) have similar effects across many contexts, but phenotypic results may differ due to threshold responses that are more prominent in a particular environment.
In a recent review, Westerman and Sofer described various relationships between genetics, environmental exposures, and outcome variables that could yield statistically significant GEI.13 The authors discuss intracellular molecular interactions but also highlight several statistical arrangements of genetics, environmental exposures, and outcomes that can produce GEI. These arrangements depart from the often default assumption that GEI must involve a molecular interaction and are generally less intuitive and obvious.
An underlying cause of potentially spurious detectable GEI is a non-linear relationship between a subset of modeled variables (environmental exposure and outcome or a mediator and outcome). Due to this non-linear relationship, an environmental exposure and outcome may have a different slope for different genotypes, indicating a GEI. For example, there is a non-linear relationship between body mass index (BMI) and low-density lipoprotein (LDL) cholesterol, so it is possible that a putative GEI for LDL between genotype and BMI is a product of this non-linearity and not an actual GEI. Heterogeneous variance can also be an underlying cause of detectable GEI.14 If there is additional variance in an outcome across genotypes (e.g., a variance QTL [vQTL]) or bias in the measurement error of an outcome that increases variance across genotypes (e.g., skin pigmentation and pulse-oximeter measurements from wearable technology),15 the measured effect of a constant environmental exposure could change across genotypes, which would be measured as a GEI. These underlying causes range from true instances of GEI, such as non-linearity or vQTLs, to instances due to measurement error.13
Environmental exposures affect health and illness not just at the cellular level but also via behavioral and systemic consequences. Toxic exposures can affect cellular function, while lifestyle factors can modify phenotypes by affecting brain networks or hormone levels. A diverse range of environmental factors affect biological systems from the cellular to organism level and interact with genetics in intricate ways.
Exposure measurement
As the scale of genotyping rapidly increases, opportunities to collect high-dimensional environmental data are expanding. The exposome describes the totality of environmental exposures over the course of an individual’s life, including external (e.g., pollution, radiation, social determinants of health) and internal (e.g., microbiome, metabolism, oxidative stress) factors.16 The individual factors that constitute the exposome are measured in numerous ways.
Mass spectrometry (MS) assays quantify environmental exposures in biological samples such as blood, urine, and tissues.17 Untargeted MS represents a significant advancement in exposome research and provides a robust method for the high-throughput identification and quantification of a vast array of small molecules in biological and environmental samples.18 Untargeted MS can comprehensively screen for chemical exposures without prior knowledge of the analytes present. It can detect environmental pollutants (e.g., air and water contaminants, soil pollutants), dietary exposures (e.g., food additives and natural toxins), lifestyle factors (e.g., components of tobacco smoke and alcohol metabolites), endogenous metabolites indicative of internal biological processes, and microbial metabolites from the gut microbiota.19 Untargeted MS offers extensive chemical coverage and captures a broad range of organic compounds, metals, and metabolites, including volatile and semi-volatile organic compounds; non-volatile organic compounds such as lipids, proteins, peptides, and nucleotides; and even inorganic ions and metals under certain conditions.20 The technique’s high-throughput capability enables the detection and quantification of hundreds to several thousand unique analytes in a single run, depending on the sample matrix, technology employed, and analytical conditions. However, the sheer volume and complexity of data generated pose significant challenges in data analysis and compound identification, necessitating sophisticated bioinformatics tools and comprehensive reference databases for accurate analysis.21 The sensitivity and specificity of detection also vary among classes of chemicals and are influenced by factors such as a compound’s ionization efficiency and the presence of matrix effects.21
Geographic information systems (GISs) are used in epidemiology studies for environmental exposure assessment that integrates data from diverse sources.22 GIS linkages enable mapping that provides insights into the spatial and temporal distribution of exposures and facilitates the assessment of health risks associated with environmental factors. Analysis of air pollution is a primary application of GIS.23 Particulate matter (PM2.5 and PM10) concentrations can be modeled using satellite data, data from air-quality-monitoring stations, and land-use information. The distribution of gases such as nitrogen dioxide (NO2), sulfur dioxide (SO2), and ozone (O3) can be assessed using data on traffic, industrial activities, and meteorological conditions. Water-quality assessment is another prominent application of GIS in epidemiology studies.24 GIS can assess the risk of exposure to contaminants in drinking water supplies, including heavy metals, nitrates, and microbial pathogens, by linking residential addresses to water-source data. In the context of soil and agriculture, GIS linkages are used to estimate pesticide exposure in agricultural areas and exposure risks from contaminated land due to historical industrial use, proximity to waste disposal sites, and natural deposits of harmful substances. Additionally, GIS applications are used in studies estimating the impact of socioeconomic status; access to healthcare, food, and recreational facilities; and factors of the built environment such as access to green space25 and blue space, urban heat islands, and noise pollution on health outcomes. GISs can also be used to estimate environmental factors such as ultraviolet (UV) radiation exposure and the risk of extreme weather events such as floods and hurricanes.
Smart sensors in wearable devices that provide real-time data on individual exposure levels have transformed environmental exposure monitoring and assessment. Real-time data on air quality, noise levels, UV exposure and weather conditions, and physical activity can be paired with GIS data to reveal how environmental influences on human health differ within small geographic regions and across time.26 Real-time monitoring is essential for understanding the air quality individuals are exposed to during daily activities and identifying air-pollution-related health hazards.27 Combining GIS data with data from sensors in wearable devices can enhance contextual knowledge of exposures and how they vary across contexts. Geolocating sensor data can help mitigate environmental exposure hazards by correlating exposures to particular locations.28 Wearable sensors can monitor small and coarse particulate matter (PM2.5 and PM10), volatile organic compounds (VOCs), and hazardous gases including CO and NO2.29 These sensors can also monitor noise levels, an exposure with a rapidly evolving body of evidence.30 Data on environmental sound levels are used to determine exposure to potentially dangerous noise levels in urban areas and workplaces that could cause long-term hearing impairment or stress-related health concerns. Additionally, sensors in wearable devices can measure UV exposure, which is crucial for monitoring skin cancer risk and other sun-related health issues31 and regulating sun exposure. Sensors can also measure temperature and humidity, which affect comfort and health, particularly during harsh weather. Wearable sensors can detect steps, heart rate, and calories burned to measure lifestyle factors such as physical activity that interact with environmental exposures to affect health. Related technology can identify chemical exposures including heavy metals and organic contaminants.32 Due to the intricacy of detection, consumer wearables with these features are rare, although research and development is expanding availability.
Questionnaires administered to collect detailed data on lifestyle factors, environmental exposures, and social determinants of health are a pivotal source of information in environmental health research. While comprehensive data collection is instrumental in quantifying individual-level exposures and lifestyle factors, there are significant challenges, particularly regarding the standardization of questionnaires. The use of non-standard questionnaires introduces variability into the collected data, complicating comparisons and data synthesis across research efforts.33 Subjective responses and recall bias further exacerbate these challenges, potentially skewing the data.34 Moreover, cultural and linguistic differences can influence respondents’ understanding of questions, necessitating careful adaptation and validation across diverse populations to ensure accuracy and relevance.34 Data quality and completeness are additional concerns, with the reliability of collected information dependent on respondents’ willingness and ability to provide accurate answers.34,35 Environmental exposures can change over time, presenting additional difficulties with the use of questionnaires to assess historic and future exposures. To mitigate these challenges, there are ongoing efforts to develop standardized, validated questionnaires and implement advanced statistical techniques to adjust for bias and data variability.33,36 Further, integrating questionnaire data with objective exposure measurements such as biomonitoring results and sensor data is increasingly recognized as a valuable approach.37
Challenges with standardization of the exposome
Environmental exposures are responsible for 70%–90% of disease risk. While genetic studies commonly estimate heritability, defined as the proportion of phenotypic variation due to genetic variability, estimating the proportion of variability due to the exposome is challenging because of issues with direct exposure measurements. In contrast to genomics, where the genome is measured in a structured system with well-known estimates, and there are opportunities for data sharing through comprehensive databases with reference data, there is a lack of standardization and data sharing in exposome measurement. Accordingly, the proportion of disease risk due to exposome variability is underestimated. Table 1 details current estimates of the relative contribution of established exposures, underlining the importance of exposures in understanding overall disease etiology. While there is growing evidence in the GEI literature on specific diseases, an important open question is estimating the proportion of variation explained by GEI. Estimates of disease variation due to genetics and environmental factors can exceed 100% because they are often derived from different studies using different methods, leading to overlap. For example, twin studies may provide heritability estimates, while separate environmental studies offer risk estimates. When added together, the sum can erroneously exceed 100% due to this overlap and not accounting for GEI, which could either amplify or reduce the effects of each other. This highlights the complex interplay between genetics and the environment in disease causation.
Table 1.
Estimates of disease risk due to the exposome and heritability
| Disease | Environmental factors | Exposome contribution | Heritability estimate |
|---|---|---|---|
| CVD | air pollution (PM2.5), leading to atherosclerosis, hypertension, and myocardial infarction | over 80% of CVD risk is attributable to environmental and lifestyle factors38 | 40%–60%11 |
| T2D | dietary patterns, obesity, physical inactivity, exposure to chemicals (phthalates, bisphenol A) | 70%–90% of T2D predisposition is explained by environmental exposures, lifestyle factors, and genetic predisposition39 | 25%–75%40 |
| Asthma and respiratory diseases | air pollutants (NO2, O3), indoor allergens (dust mites, mold), tobacco smoke, occupational exposures | 60%–70% of asthma incidence is attributed to environmental factors, especially early-life exposures41 | 35%–95%42 |
| Cancer | tobacco smoking, dietary factors, alcohol consumption, sun exposure, environmental pollutants | 70%–90% of most cancers are attributed to environmental factors; lifestyle also plays a dominant role43 | varies widely, from low (∼15%) to high (>80%) depending on the type44 |
| Neurodegenerative diseases (e.g., Alzheimer’s disease) | physical inactivity, diet, exposure to air pollution, occupational exposures | up to 60% of neurodegenerative disease risk is explained by environmental factors45 | 25%–80%, depending on familial or sporadic cases46 |
CVD, cardiovascular disease; T2D, type 2 diabetes.
There are several challenges involved with standardizing exposure measures across cohorts and studies, primarily due to the heterogeneity of data collection methods used. The approaches used for individual studies vary, resulting in inconsistencies in the type, quality, and granularity of data. Non-standardized data complicate comparisons and meta-analyses. Moreover, due to the dynamic nature of the exposome, which changes across time and location, it is difficult to establish standardized measures that accurately capture the variability of exposures.
The complexity of the exposome adds another layer of challenge, as the vast array of exposures requires unique measurement techniques. These challenges are further exacerbated by the differing methodologies and units of measurement used in studies. Additionally, the lack of structured data and opportunities for data sharing makes it challenging to standardize measures and compare findings across studies. Table 2 summarizes key challenges for types of environmental exposure estimates.
Table 2.
Key challenges involved in environmental exposure estimates
| Methodology | Challenge | Description |
|---|---|---|
| GIS estimates of exposure | spatial resolution variability | the resolution of GIS data can vary significantly, affecting the accuracy of exposure estimates |
| temporal dynamics | Environmental factors can change over time, making it challenging to estimate exposures accurately at specific time points | |
| lack of personalization | GIS estimates often rely on area-level data, which may not accurately reflect individual exposures | |
| High-throughput MS exposomics technologies | standardization of analytical methods | standardized protocols and reference materials are needed to ensure comparability of results across laboratories and studies |
| data interpretation and integration | interpretation of MS data can be complex, requiring sophisticated bioinformatics tools for data integration | |
| sensitivity and specificity | variations in sensitivity and specificity across compounds can affect the accuracy of exposure assessments | |
| Survey assessments of exposure | subjectivity and recall bias | self-reported data are subject to biases, which can affect the accuracy of exposure assessments |
| questionnaire variability | non-standardized questionnaire design can lead to inconsistencies in collected data | |
| cultural and language differences | surveys developed in one cultural or linguistic context may not be directly applicable in another |
Advantages of GEI analysis
For complex traits influenced by both genotypes and environmental factors, GEI analyses have several advantages over genome-only models because they incorporate environmental exposures known to influence human health. GEI analyses can provide insights into disease mechanisms, identify sources of heterogeneity across individuals, and account for what some refer to as missing heritability, which is the proportion of heritability not explained by genetics alone.47 A growing number of well-established GEIs have been replicated across studies in various populations. Examples include associations between air pollution and cardiovascular disease, alcohol and liver disease, smoking and lung cancer, diet and obesity, and pesticide exposure and Parkinson’s disease.
GEI analyses can also help identify novel pathways that influence human health.48 Compared to genome-only approaches, a joint GEI model can explain significantly more phenotypic variance and substantially improve phenotypic prediction accuracy. Including environmental exposures in an analysis using the exposome framework allows additional phenotypic variance to be captured, including additive and non-additive effects such as GEI and environment by environment interactions (ExEs).49 In addition, accounting for the influence of environmental exposures on health phenotypes can boost power.50 Polygenic scores (PGSs), which provide a measure of an individual’s risk of a disease due to their genetics, may have low predictive power when applied to populations with different genetic ancestry or socioeconomic status or those that live in different environments.51 Accordingly, including the exposome in polygenic models can improve the portability of PGSs.52
Interplay of environmental epidemiology and genetics
Epidemiology provides a framework for investigating the distribution and determinants of health and disease in populations, and genetics provides insight into the underlying biological mechanisms that contribute to disease. Genetic analysis examining the proportion of variation in a trait due to genetic factors is an increasingly important tool for estimating disease heritability and has substantially influenced study design in epidemiology. Family studies help distinguish genetic and environmental risk, and genetics help explain population heterogeneity. Because genetic markers provide an objective measure of ancestry that is not influenced by environmental factors, including this type of data can improve study design by identifying and adjusting for population stratification resulting from selection bias. This selection bias can lead to spurious associations in traditional epidemiological studies.
Epidemiology plays a crucial role in the collection of adequately phenotyped samples for genetic studies. High-quality, standardized measures of disease outcomes and environmental exposures are also necessary to interpret genetic associations. Epidemiological studies provide a framework for investigating interactions between genetic and environmental factors in disease development. Additionally, epidemiological research is essential for elucidating the clinical and therapeutic implications of genetics and establishing guidelines and policies for the appropriate use of genetic information in clinical practice and public health. Finally, including genetics in epidemiology studies enhances the ability to identify environmental effects, especially those found in subgroups that are not as apparent or strong in the overall population. In some cases, genetic effects are evident only with stratification by exposure level, or exposure effects are detected only in combination with genetic effects.
Candidate genes
While the selection of candidate genes ultimately depends on the context of a study and what is already known, some classes of genes have been established as environmentally responsive. Many genes play a direct role in an individual’s response to external factors, encompassing detoxification, DNA repair, and maintenance of cellular homeostasis. Key gene classes involved in responses to external factors include cytochrome P450 enzymes (CYPs) for metabolizing drugs and detoxifying xenobiotics, DNA repair genes for correcting damage from agents such as UV light and chemicals, and metabolic genes other than CYPs that transform substances for easier excretion. Antioxidant genes combat oxidative stress from pollutants, and heat shock proteins (HSPs) protect against environmental stressors by ensuring proper protein folding. Xenobiotic receptors regulate the metabolism and excretion of foreign compounds, inflammatory response genes mediate reactions to environmental agents, and transporter genes facilitate the movement of substances across cell membranes, including efflux pumps that export toxins. Table 3 summarizes genes that are highly responsive to environmental factors and outlines their function in the context of common exposures.
Table 3.
Gene classes highly responsive to environmental factors and their function
| Class of Genes | Function |
|---|---|
| CYPs | metabolize drugs and detoxify xenobiotics; involved in the activation and deactivation of carcinogens, drugs, and pollutants |
| DNA repair genes | repair DNA damage caused by environmental agents such as UV light, radiation, and chemical mutagens; includes the nucleotide excision repair, base excision repair, DNA mismatch repair, and DNA double-strand break repair pathways |
| Metabolic genes | involved in converting substances into water-soluble compounds for excretion; includes the glutathione transferases and uridine diphosphate-glucuronosyltransferases enzymes |
| Antioxidant genes | code for enzymes that protect cells from oxidative stress by neutralizing reactive oxygen species; includes superoxide dismutase, catalase, and glutathione peroxidase |
| HSPs | upregulated in response to environmental stress to help fold proteins correctly and prevent aggregation of misfolded proteins |
| Xenobiotic receptors | sense the presence of xenobiotics and regulate the expression of metabolism and excretion-related enzymes and transporters; includes the aryl hydrocarbon receptor and pregnane X receptor |
| Inflammatory response genes | trigger and mediate inflammation in response to environmental agents; include genes coding for cytokines and chemokines |
| Transporter genes | encode for proteins involved in the transport of substances across cellular membranes, including efflux pumps such as P-glycoprotein |
A growing number of resources catalog known or predicted genes with evidence of GEI. The Comparative Toxicogenomics Database (CTD) is a publicly available, manually curated resource that outlines interactions between chemicals and genes, chemicals and diseases, and genes and diseases53 to elucidate a broad network of these components. GeneComps and ChemComps identify genes and chemicals with shared toxicogenomic profiles and provide enriched Gene Ontology terms and Venn diagram tools that identify common and distinct characteristics among chemicals, genes, and diseases, improving gene pathway data.54 CTD is integrated into several databases, including PubChem, PharmGKB, UniProt, T3DB, GAD, ChemID, and TOXNET.55 Additionally, microarray and proteomics datasets for various species can be accessed via the Chemical Effects in Biological Systems (CEBS) knowledgebase (http://cebs.niehs.nih.gov/cebs).56
Evolution from candidate GEI studies to genome-wide interaction studies
The availability of high-throughput genotype data and large-scale phenotype and exposome data from large consortia has motivated advances in GEI methodology from candidate gene-environment interaction (cGEI) studies57 to genome-wide GEI scans for millions of single nucleotide polymorphisms (SNPs), referred to as genome-wide environment interaction studies (GWEISs)58 or genome-wide interaction studies (GWISs). cGEI studies consider pre-identified genes based on prior scientific knowledge or hypotheses, along with selected exposures, to determine the joint effects of the genotype and environment with multiplicative or additive models. cGEI studies have found several interesting GEI associations,59 but this approach is limited by prior knowledge and low replicability.57,59 Additionally, cGEI studies do not allow for hypothesis-free scans of the genome to identify novel GEIs. Further, the proportion of phenotypic variation explained by cGEI is anticipated to be low.60
Genome-wide GEI scans do not require a priori hypotheses and are agnostic in nature and thus enable researchers to find novel main and interaction effects. GWISs have been used to identify GEIs with various environmental exposures such as toxic substances, diet, and smoking.61 While GWISs allow for hypothesis-free scans, they are often severely underpowered due to an extreme multiple testing burden. Further challenges for GWISs are related to exposure assessment, data harmonization in large consortia, reliance on electronic health records for phenotyping, multiple testing due to the large numbers of gene-exposure combinations tested, genetic heterogeneity, and the need for replication in other cohorts.62
Epigenetics
Epigenetics involves the study of non-sequence modifications to DNA and modifications to chromatin that affect gene-DNA accessibility and subsequent gene regulation. The epigenome comprises the DNA base changes, chromatin protein tails, and non-coding RNAs that regulate gene expression and mRNA alterations that influence protein production. Epigenetics bridges an individual’s genetic makeup and environmental exposures to provide insights into how GEIs contribute to disease development. Cell-cell communication and tissue and cell growth depend on epigenetic markers, and environmental exposures can cause disease by altering these epigenetic markers. DNA CpG methylation was the first and easiest epigenetic signature to assess. Investigations of other epigenomic modifications, such as hetero-chromatic histone modification, are limited by technology costs and the difficulty of large-scale measurements, in part due to the requirement for high-quality antibodies. There are a growing number of replicated epigenetic-environment interactions, including smoking and DNA methylation,63 maternal stress and methylation,64 and prenatal famine and histone modification.65
Epigenome-wide association studies (EWASs) are key to understanding GEI mechanisms, and commercial, high-density BeadChips66 are making EWASs more affordable. The availability of this technology, lower per-sample cost, and the proliferation of replicated results in numerous study populations are driving EWAS adoption. Despite these advances, EWASs remain too expensive for very large population studies.
There is growing interest in studying the accumulation of epigenetic changes, known as epigenetic drift, in individuals with sustained generational environmental stressors such as stress and insufficient or improper nutrition. This is paramount for understanding the consequences of racism, poverty, and other multigenerational disparities. For example, historic and ongoing experiences of discrimination, poverty, and toxic exposures can contribute to chronic stress and other environmental exposures that can affect gene expression and increase the risk of disease. The weathering hypothesis posits that cumulative stress-related exposures due to discrimination, lower socioeconomic status, and racism result in increased disease risk in marginalized groups.67 Additionally, the allosteric load theory, when considered in the context of GEI, highlights the sophisticated mechanisms through which living organisms adapt to their environments by modulating enzyme activity. This theory suggests that allosteric regulation, which occurs when molecules bind to enzymes at sites other than the active site to change their activity, plays a pivotal role in adjusting metabolic pathways in response to environmental changes.68
Because epigenomics alone is insufficient for a comprehensive understanding of disease pathology, several approaches have been used to examine the collective genome and epigenome. One approach expands the quantitative trait analysis used in transcriptomics (e.g., eQTL analyses) to examine methylation-eQTLs (methyl-eQTLs). Methyl-eQTL analyses identify CpGs associated with gene expression and are often used to follow up top hits/findings from EWASs and genome-wide association studies (GWASs). This approach can reveal insights into relevant biological pathways. More general pathway analysis approaches can also be used.69
Possible mechanisms of epigenetic inheritance include DNA methylation, heterochromatin, polycomb proteins, and non-coding RNAs.70 Aligning with the Developmental Origins of Health and Disease (DOHaD) theory, early life, prenatal, and pre-conception settings have been shown to affect long-term health and chronic illness risk. For example, research has connected low birth weight and maternal habits including smoking and malnutrition with adult cardiovascular disease and type 2 diabetes.71 DOHaD also emphasizes the vulnerability of fetuses and neonates to environmental contaminants due to poor defensive mechanisms.72 For example, infants born to malnourished mothers and those exposed to endocrine disruptors are more likely to develop metabolic illnesses,72 and smoking during pregnancy is linked to low birth weight and obesity for offspring.72 Additionally, events such as famines provide evidence for intergenerational inheritance with long-term impacts such as elevated risk of obesity, diabetes, and mental health disorders seen in the offspring of those who experienced famine.73 The DOHaD theory highlights early-life events’ impact on long-term health and has major implications for public health efforts to prevent chronic disease.
Multi-omics approaches for PEH
Multi-omics approaches, including genomics, transcriptomics, proteomics, epigenomics, lipidomics, and metabolomics, have been independently and jointly utilized to identify molecular features associated with environmental exposures that affect disease progression. Traditional omics-environment (OxE) interaction studies often analyze omics datasets independently. For example, transcriptomics has unveiled gene-expression signatures associated with exposure to various environmental chemicals that affect pathways involved in inflammation, oxidative stress, DNA damage, and apoptosis.74 MS-based proteomics has been employed to investigate the effects of environmental exposures on protein expression and post-translational modifications such as phosphorylation, acetylation, and ubiquitination.75 Additionally, MS has been used to examine the effects of biodiesel exhaust exposure on bioactive lipid mediators in human airways using lipidomic profiling methods.76 Further, a metabolomics study77 detected trichloroethylene metabolites in human plasma, linking these metabolites to changes in endogenous metabolites involved in immunosuppression, hepatotoxicity, and nephrotoxicity.
Integrative omics integrates and simultaneously analyzes multiple omics datasets for a comprehensive understanding of complex biological systems. This approach can unravel complex interactions between genetic variations, gene-expression patterns, protein profiles, metabolic changes, and epigenetic modifications in response to environmental factors. For example, a multi-level omics (proteomics, metabolomics, lipidomics, one-carbon metabolism, and neuroinflammation) analysis in Alzheimer’s disease pathology identified interacting pathway alterations, revealing dimensions of heterogeneity and potential novel disease mechanisms.78 Another study of Alzheimer’s disease79 integrated GWAS data, multi-omics data, drug-target networks, and protein-protein interactions to identify high-confidence Alzheimer’s disease-risk genes and candidate drugs for repurposing. Additionally, to explore the mechanisms of cognitive impairment in individuals with type 2 diabetes, researchers conducted a multi-omics analysis (transcriptome, plasma metabolome, and gut microbiota) in db/db mice with cognitive decline.80 Further, to understand the etiology of coronary artery disease through an integrative approach, a large-scale study81 combined GWAS variants, eQTLs, protein quantitative trait loci (pQTLs), and cell-type-specific gene-expression datasets. This framework enabled the identification of cell-type-specific OxE effects from bulk expression data, which is confounded by single-cell heterogeneity.
Advancements and challenges in studying OxE interactions using multi-omics
Advancements in single-cell omics technologies, such as single-cell RNA sequencing (scRNA-seq)82 and single-cell assay for transposase accessible chromatin sequencing (scATAC-seq),83 have enabled the identification of associations between traits and cell types from high-resolution profiling of individual cells. Single-cell integrative omics methods such as cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq)84 merge RNA and protein measurements, providing concurrent insights into transcriptomes and cell-surface proteins. However, integrative OxE analyses face significant challenges involving data harmonization, the integration of multiple types of environmental exposure data from diverse sources, the consideration of cell type-specific responses, how to capture dynamic interactions, and sample-size limitations. To address these challenges, promising tools are being developed to increase the precision of OxE analyses. For example, scQTLbase85 and scGWAS82 facilitate the identification of loci that play regulatory roles as well as associations between traits and cell types in single-cell integrative omics analyses.
Methods for GEI analysis
In traditional quantitative genetic studies, linear additive models offer a straightforward framework for testing GEI by incorporating both genetic and environmental variables, along with their interaction terms, into a regression equation. In these models, the dependent variable typically represents a phenotype of interest, such as disease status or a quantitative trait, and the independent variables include a genetic factor (e.g., genotype or allele count), an environmental exposure, and an interaction term that represents the product of the genetic and environmental variables.1,86 This approach is based on the premise that, if the interaction term significantly improves a model’s fit to the data, it suggests the presence of a GEI. This approach has been crucial in pinpointing essential genes that interact with the environment, but it does not show how the collective effects of genes are altered in response to environmental stimuli.87 With full-genome sequences, it is now possible to conduct GWEISs, which integrate GEI analyses into GWASs.59 This is anticipated to become a standard method for mapping complex traits,88 and methodological work is rapidly advancing. In addition, there is growing appreciation for an omnigenic hypothesis that posits that complex features or illnesses are influenced by all genes, which work together in a network inside an organism.89
Machine learning and artificial intelligence (AI) methods are being increasingly used in algorithms that include random forests, support vector machines, and neural networks. Deep-learning models excel in capturing complex, non-linear interactions, enhancing the understanding of gene-environment dynamics. Additionally, AI aids in constructing interaction networks and enables cross-validation and ensemble methods for robust model evaluation. Bayesian networks facilitate causal inference, and natural language processing can extract valuable information from scientific literature and clinical records, facilitating the identification of potential GEI. While it is beyond the scope of this review to cover all study designs and methods for detecting GEI, Table 4 summarizes current and emerging approaches and their advantages and disadvantages.
Table 4.
Study designs and methods for detecting GEI
| Approach | Description | Advantages and limitations |
|---|---|---|
| Candidate gene-environment analysis57,59,90 | pre-identified genes or variants and selected exposures are used to determine joint genotype and environmental effects on a multiplicative or additive scale | specific genes and exposures can be investigated; limited by prior knowledge and low replicability; does not allow for hypothesis-free scans |
| GWISs or GWEISs91 | genome-wide scans for GEI interactions enable the identification of novel main and interaction effects without a priori hypotheses | does not require a priori hypotheses; severely underpowered due to multiple testing burden |
| Variable selection92,93 | the number of features is reduced using methods such as lasso, ridge regression, and Bayesian variable selection | enhances power for GEI scans |
| Tree-based methods94 | tree-based methods are applied to interaction models | provides easily interpretable models; may not capture complex interactions |
| Combinatorial approaches94,95 | all possible variable combinations are searched for the best predictive model; multifactor dimensionality reduction is popular for detecting gene-gene interactions and GEI | non-parametric and model-free approach; computationally intensive and may suffer from the curse of dimensionality |
| Case-only design96 | detects GEI interactions under the G-E independence assumption | greater power than traditional case-control studies when G-E independence assumption holds; increased type I error if the G-E independence assumption is violated |
| Multi-stage analysis97 | SNPs are filtered in a multi-stage process before testing GEI interactions using SNPs selected based on marginal effects from GWAS results, gene-environment correlation tests, functional annotation, or external pathway knowledge | filters and prioritizes variants; may miss important interactions |
| Filtering and prioritization87,98,99 | biological knowledge from variant annotations, molecular phenotypes, pathway-based annotations, and associations from external databases, variants, and exposures are filtered and prioritized for GEI analyses; vQTLs enriched for GEI can be used for prioritizing genetic variants | incorporates biological knowledge into analysis; quality and completeness of knowledge base affect results |
| Set-based methods for rare variants100 | burden tests or variance component tests are used to group rare variants and test interactions between the set of rare variants and an exposure | increased statistical power and reduced multiple testing burden compared to single variant tests; loss of information for individual variants because variants are grouped |
| Polygenic score by environment (PGSxE) interactions101 | interactions between individual polygenic scores and environmental exposures are tested using methods that include polygenic gene-environment interactions (PIGEONs) | reduces multiple testing burden; accounts for the polygenic nature of complex traits; quality and portability of PGS can lead to biases and limited generalizability |
| Multi-exposure analysis92,102,103 | the effects of the genome and exposome are jointly modeled using linear mixed models such as StructLMM, GxEMMs, and IGE to investigate the effects of multiple exposures on complex traits | accounts for genome-exposome correlations and correlated exposome variables; computationally intensive; difficult to interpret and determine causality |
| Mixtures analysis104,105 | risk from complex mixtures such as air pollution, diet, or cigarette smoke is assessed by analyzing effects of mixture components and examining metabolic pathways for specific exposures | realistically models exposure mixtures; increased risk of confounding factors; population-specific exposure patterns can reduce the generalizability of results |
| Longitudinal analysis106,107,108 | gene x longitudinal-exposure interactions are modeled by incorporating cumulative and time-varying exposures from longitudinal studies into functional logistic regression modeling | increased power with repeated measurements time and resource intensive |
| Machine learning109 | GEI analyses can be conducted using advanced machine-learning models such as neural networks; evolutionary algorithms such as genetic programming neural network and grammatical evolution neural network can optimize the parameters of a neural network | models complex, non-linear relationships; requires large amounts of data; requires cross-validation to avoid overfitting and assess predictive ability of the model; tends to produce black-box models, which are challenging to interpret |
G-E, gene-environment.
Other advances in the field include set-based methods that use burden or variance component tests to group rare variants, which are SNPs with minor allele frequency less than 0.01, to test for interactions between a set of variants and an exposure.110 Other methods estimate and partition heritability by modeling the genome and exposome with linear mixed models. The integrative analysis of genomic and exposomic data (IGE)49 and gene-environment mixed models (GxEMMs),102 a mixed model for GEI, are whole-genome methods for estimating additive and interaction effect heritability and variance. Structure linear mixed models (StructLMMs)111 examine SNPs with multiple environmental variables to identify GEI-interacting SNPs. vQTLs enriched for GEI can be used to identify and prioritize genetic variants associated with phenotype variance.99 Additionally, PGSs are powerful tools for assessing interactions, particularly for highly polygenic traits.112 GEIs for rare diseases or genotypes can be explored using meta-analyses of exposure data and study results.
Methods development is also focused on assessing risk from complex mixtures of exposures such as air pollution, diet, or the carcinogen mixtures in cigarette smoke or pesticides.104 Effects of the components of a complex mixture can be dissected by examining metabolic pathways in individuals with different genetic backgrounds.59 For example, studies have identified the interaction between N-acetyltransferase 2 (NAT2) and heterocyclic amines found in red meat as a potential risk factor for colorectal cancer.113 Longitudinal measures of environmental exposure, such as those in Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE),106 can be leveraged to incorporate cumulative and time-varying exposures for complex disease-risk assessments.107
As reviewed in McAllister et al.,114 detecting GEI presents many statistical challenges. High dimensionality complicates analyses and requires multiple comparison adjustments, increasing the risk of type II error. Additionally, many studies lack power to detect interactions due to small effect sizes and insufficient sample sizes due to logistics and costs. However, the availability of data from mega-sized observational studies such as the UK Biobank115 and All of Us116 is helping address these limits. Further, heterogeneity in genetic and environmental factors across populations can obscure true interactions and hinder replicability. Model complexity is another issue because complex models are needed to accurately explore the many potential interactions and confounders.
Measurement errors and misclassifications can introduce bias into estimates and reduce power to detect true interactions, complicating GEI detection. Due to confounding and collinearity, disentangling the independent effects of correlated exposures is difficult and often inconclusive. Differences in interaction scale can also affect detection and interpretation. True interactions may also be missed due to multiple testing. Additionally, the difficulty of collecting high-quality data on genetic variants and environmental exposures can bias detected interactions. Finally, ethical and privacy concerns about genetic and environmental data can limit its use. Innovative methods and interdisciplinary collaboration are needed to overcome these challenges and gain insights into disease etiology and prevention. Although machine learning and AI are emerging methods for interaction detection, generalized linear models are still widely used.92
The lack of a standardized roadmap for quantifying the exposome and harmonizing environmental measurements across studies causes data collection and measurement protocol inconsistencies in GEI analyses. Global cooperation and funding agency enforcement are needed to solve these problems. Researchers are aware of these challenges, and efforts such as those from the Human Health Exposure Analysis Resource (HHEAR) network can help expand standardized exposure analysis in human health studies.117 hhearprogram.org, led by NIEHS, was recently issued to create the Center for Exposome Research Coordination to Accelerate Precision Environmental Health, which is tasked with helping resolve the challenges associated with exposomics studies.
Methodological advances are needed to address heterogeneity in exposure measurements, variable exposure distributions across populations, confounding factors, and population stratification when integrating exposure data from diverse studies for larger sample sizes. Additionally, methods to improve GEI detection in diverse populations are needed to ensure findings are broadly applicable. Finally, large population-based studies using geospatial estimates of environmental exposures are promising but require new methods for integrating data from diverse sources to explore complex relationships between health and the environment.
Translation for PEH
In the context of PEH, GEI studies aim to identify exposure-response relationships to predict disease risk and improve health. Analyzing exposure data across the course of life reveals inter-individual differences in susceptibility and informs the development of biomarkers of exposure/disease risk. Determining which genetic variations increase susceptibility to the toxic effects of environmental pollutants can help identify individuals at risk of adverse health outcomes and inform tailored interventions.10 GEI studies provide insights into how the complex interplay between genetic and environmental factors affects disease development and outcomes and elicits the underlying biological mechanisms of disease. We highlight early exemplars of how GEI can be translated into PEH practice. We then discuss what is needed to translate basic research from model organisms and new approach methods (NAMs) into PEH practice.118
Exemplars of PEH translation
GEI research has a direct impact on individual-level healthcare and preventive medicine in a growing number of real-world scenarios. Interactions have been found between genetic variation in CYP2D6 and pesticide exposure for Parkinson’s disease, NAT2 and smoking for bladder cancer, ALDH2∗2 and alcohol intake for esophageal cancer, and AS3MT and arsenic for skin lesions.119 As recently reviewed in Kabbani et al.,120 pivotal real-world pharmacogenomics applications showcase how knowledge of GEI can be incorporated into clinical practice, enhancing personalized medical treatment. For example, personalized warfarin dosing, guided by variations in genes that include VKORC1 and CYP2C9, optimizes dosage to balance efficacy and bleeding risk.121 Further, genetic testing for the HLA-B∗57:01 allele before prescribing abacavir for HIV has become standard practice to prevent reactions due to severe hypersensitivity.122 In another example, due to severe toxicity risks at standard doses in individuals with variants in the TPMT gene, dose adjustments can be made or alternative therapies suggested when prescribing thiopurine drugs such as azathioprine and mercaptopurine.123 Additionally, CYP2D6 gene variants help guide the selection of antidepressants and dosages to optimize treatment efficacy and minimize side effects.123 Identifying individuals with BRCA1/2 mutations and HER2 expression enables personalized cancer therapy with PARP inhibitors and trastuzumab, respectively, ensuring targeted and effective treatment strategies for patients.124
In addition to drug-response outcomes, there are a growing number of translational implementations for disease prevention in PEH. An example is GEI related to susceptibility to asbestos exposure. BAP1, a tumor suppressor gene, when mutated, increases susceptibility to several cancers, notably mesothelioma, in the context of asbestos exposure.125 These mutations not only predispose individuals to disease but may also modulate clinical outcomes, suggesting varied prognoses for affected individuals. This underscores the potential for targeted screening and prevention strategies, especially for those with known BAP1 mutations and a history of asbestos exposure. Investigating how these mutations interact with asbestos to promote carcinogenesis is vital for developing targeted therapies, as this helps with understanding the affected molecular pathways.126 Table 5 lists examples of GEI applications in PEH. Baccarelli et al.10 provide additional examples and context.
Table 5.
Exemplar GEI applications in PEH
| Scenario | Genetic and environmental factors | Impact |
|---|---|---|
| Personalized air-quality alerts | genetic susceptibilities to respiratory conditions and air quality | manage symptoms of respiratory conditions and prevent exacerbation by reducing exposure to pollutants |
| Lead exposure and genetic susceptibility | genetic variations and lead exposure | implement interventions to reduce the risk of cognitive and developmental delays associated with lead exposure in susceptible children |
| BAP1 and asbestos exposure | BAP1 mutations and asbestos exposure | implement targeted surveillance and early detection strategies for improved outcomes in individuals with an elevated risk of asbestos-related diseases |
| UV exposure and skin cancer risk | MC1R gene variants and UV radiation | implement skin cancer screening for early detection and treatment of skin cancer and provide personalized sun-protection advice to reduce the incidence of melanoma in susceptible individuals |
| Personalized nutrition plans | genetic variations and nutrient intake | manage chronic conditions through dietary modifications tailored to individual genetic makeup to reduce the risk of diet-related diseases such as type 2 diabetes and cardiovascular disease |
| Pesticide exposure and Parkinson’s disease | genetic susceptibilities and pesticide exposure | implement stringent protective measures and monitoring to manage and possibly prevent the onset of Parkinson’s disease in susceptible individuals |
Translation from model organisms and NAMs for PEH
Discovery and validation of GEI in human epidemiology studies are limited by low power compared to main-effect studies, the difficulty of harmonizing environmental risk factors across consortia, the complexity of measuring environmental exposures (especially mixtures) due to temporality, genetic and phenotypic heterogeneity, linkage disequilibrium patterns and genetic modifiers that can reduce power, and challenges associated with the location of most genetic variants (SNPs) in non-coding regions. GEI studies with population-based in vivo and in vitro experimental models have advanced the understanding of population variability. Especially for GEI studies that are unethical or intractable to conduct in humans (e.g., toxicity studies), testing and genetic analysis can be conducted in model organism population models or NAM experiments.127 Studies have demonstrated the utility of population-based human in vitro cells and population-based in vivo mouse models in hazard identification, exposure-response assessment, and mechanistic understanding.128 Hypotheses arising from these studies can then be tested using human genetic data, improving detection power and enabling the analysis of subpopulations.
Several population-based model organism resources have advantages compared to (and can complement) human GEI studies. Rodent models are heavily used in the field, and each has respective advantages and disadvantages. Among these are the genetically diverse Collaborative Cross,129 Diversity Outbred lines,130 Complementary CC-RIX mice,131 Mouse Hybrid Diversity Panel,132 Rat Hybrid Diversity Panel,133 and genetically heterogeneous rat stocks. These models help identify individual quantitative trait locus (QTL) and candidate genes involved in GEI interplay with complex human diseases due to the high sequence homology between mice and humans.
In addition to rodent resources, other population-based model organism systems have potential for GEI discovery, including Drosophila,134 Caenorhabditis elegans, and fish panels.135 An advantage of these alternative resources is broad exposome coverage due to their amenability to high-throughput data generation.127 Population-level studies utilizing complementary resources of cells and derived cell lines also enable in vitro population-level studies.136
For GEI studies, NAMs have historically been used during early discovery or in a screening context because of their amenability to high-throughput scaling. High-throughput screening (HTS) assays incorporating genetic data and environmental and chemical exposures can prioritize candidate exposures for in-depth hypothesis testing, including GEI analyses.137 Tox21138 and ToxCast139 enable HTS through in vitro assays produced by inserting chemical exposure-sensitive reporters into human cell lines. Using computational methods, the toxicity and risk of chemicals are ranked from in vitro screening data for genotoxic agents,140 endocrine-disrupting chemicals,141 thyroid receptor-interfering compounds,142 and environmental, manufacturing, and agrochemical compounds in large chemical libraries.143 HTS assays were used to assess the toxicity and endocrine disruption activity of dispersants in the DeepWater Horizon oil spill144 and evaluate the toxicity of chemicals in the West Virginia Elk River chemical spill.145 High-throughput transcriptomics has recently been incorporated into screening assays146 to identify the effects of genes, signaling pathways, and biological networks on environmental and chemical exposures, identify endocrine-disrupting environmental contaminants,147 and determine hazard value estimates for long-term exposures using short-term assays.148
Adverse outcome pathways (AOPs) are a framework for organizing scientific knowledge regarding the linkage between perturbation of a specific biological target by stressors and a resultant adverse outcome relevant to risk assessment and regulatory decision making. This framework provides approaches for assessing how biological processes such as gene expression, signaling pathways, and cellular functions respond to molecular events initiated by environmental exposures. Aggregate exposure pathways (AEPs)149 have been proposed as a geospatial environmental exposure complement to AOPs. Environmental exposure assessment ends where AOPs begin, at a key biological or molecular event. Eccles et al.150 demonstrated how AEPs and AOPs can be integrated for geospatially resolved mapping of molecular perturbations. To increase the accuracy and geographic coverage of geospatial exposure models and the human relevance of AOPs, an improved NAM-based approach for characterizing GEI-related outcomes and risk is needed.
Recent advances in functional genomics tools and technologies, in vitro approaches, and biological knowledge have enabled novel approaches for functionally validating, experimentally characterizing, and interpreting both newly identified and existing GEI findings relevant to human disease outcomes and better inferring biological plausibility after the discovery stage. The use of embryonic stem cells and induced pluripotent stem cells (iPSCs) from relevant cell types is an example of such an approach. Combined with genome/epigenome editing tools, these cell populations help identify and validate genetic variants responsible for environmental sensitivity. Additionally, organoid culture models such as tissue-chip platforms and other culture systems and microfluidics are emerging. Tissue chips or “organs on chips” derived from patient iPSC lines have enabled accurate in vitro human models of human diseases. Genome/epigenome editing tools such as CRISPR-Cas9 can generate robust high-content genome-wide screens that can be used to dissect functional non-coding genetic variants that drive susceptibility to environmental exposures.
Social and societal considerations in GEI research
Environmental justice
PEH combines large-scale individual-level biological data with individual-, community-, and population-level environmental data to enhance population-level health.151 For instance, GEI analysis can identify genetic variations that increase an individual’s susceptibility to pollutants. In the context of environmental justice, these results enhance understanding of how environmental factors affect health outcomes for different populations. This enables targeted population-level interventions to reduce hazardous exposures for vulnerable populations with the goal of addressing health disparities. As climate change accelerates, the impact of such GEI is likely to magnify, and genomics is posited as one mechanism for understanding climate change response.152
PEH also plays a crucial role in addressing environmental justice issues such as unequal pollutant exposure in low-income and marginalized communities.153 Historically, genetic studies have focused on populations of European descent and thus have not accurately represented the genetic and environmental diversity of populations underrepresented in clinical and scientific studies. Including more diverse populations in studies can expand understanding of the genetic and environmental factors that contribute to health disparities and help develop more effective interventions.154 In addition to addressing health disparities, integrating historically underrepresented populations into large cohorts improves the generalizability of research findings.155
The impact of exposures on gene expression varies between individuals and populations and can contribute to differences in disease risk and health outcomes. Accordingly, it is important to consider the potential consequences of racism and discrimination on individual and intergenerational epigenetics to understand the underlying mechanisms of disease and develop interventions. This includes research to identify populations at greater risk for transgenerational epigenetic changes due to generational exposure to toxicants or stress and interventions to mitigate the harm in the current generation and prevent harm in future generations.
Return of results
As the research community begins to report findings from GEI studies to research participants, there are many ethical, legal, and social implications. Nearly all participants in genetic and environmental studies want to know the results of their chemical exposures and GEI, even if the data have uncertain health implications and may reveal risks that cannot be addressed.156 Report-back of findings to communities and individuals should be routine as a matter of reciprocity, respect, transparency, and trust.156 The concerns of communities and their members can inform and guide the report-back of results. For findings involving high environmental exposures or high genetic susceptibility to an environmental exposure, researchers can help participants access resources to help them take appropriate actions, particularly in the context of understanding how GEI findings could affect their health. Report-back of findings should include information on choices to reduce individual-level exposures. Findings can aid decision-making in clinical settings and guide policy changes. Additional community engagement, guidelines, educational resources, training, and tool development are needed to support the effective communication of findings.
Data privacy
Data privacy and human subject protection are essential when collecting and analyzing environmental exposure data. Geospatial and sensor-based exposure data lack well-developed policies and protections, unlike identifiable genomics data. The Genetic Information Non-discrimination Act (GINA) protects genetic data, but there is no equivalent for environmental data.
As geospatial data are integrated into PEH studies, data privacy must evolve with technology. To assess environmental risk factors, participants’ residential addresses during critical susceptibility periods (e.g., infancy, childhood, adolescence, adulthood) are collected. Spatial exposure data that track location can reveal routines, habits, health, and pollutant exposures, making them vulnerable to identity theft and malicious activity. Accordingly, geospatial exposure data collection and use require clear guidelines and informed consent.157 Data collection, storage, and sharing should incorporate encryption, access controls, and data minimization.
With the growing demand for data sharing, individualized exposure assessment, and expanding repositories of public and commercial data, the risk of re-identification in environmental health research is increasing. As reviewed in Onsrud,158 location-specific environmental exposure measurements from wearable sensors capturing location, exposures, and biometrics can expose individuals to risk.
Breaches in data privacy can have severe consequences, including higher insurance costs, employability impacts, stigma at the individual and community levels, effects on home prices, and legal repercussions.158 Insurers and employers subject to environmental regulations could re-identify environmental data and deny coverage or employment based on exposures. This information can potentially lower property values and stigmatize communities. Privacy breaches and re-identification can also damage public confidence in environmental health and harm litigants and research. Importantly, privacy breaches may exacerbate negative effects on vulnerable populations because environmental research often targets populations with high levels of exposures. These issues and specific examples are reviewed in Onsrud.158
Conclusion
Understanding environmental exposures and GEI is crucial for predicting and preventing diseases, and the expansion of GEI research must include the exposome. Through the integration of multiple levels of genetic and environmental data, PEH can help identify populations at high risk due to genetic variants, exposure to specific environmental stressors, and combinations of both genetic and environmental risk factors. Integrating the exposome into emerging and existing studies focused on genetics/genomics will enable precise risk assessment and enhance regulatory decision-making. In PEH, PGSs and environmental risk scores can be merged to create individual-level metrics that are more clinically translatable and accurate than PGS alone. This could allow the generation of data-driven predictive risk scores that accurately predict an individual’s risk (or resilience) based on their unique genetic makeup and lifetime history of exposure to environmental risk factors, including those related to the built environment and socioeconomic status. In this comprehensive concept, emerging knowledge about inter-individual genetic/epigenetic diversity and its influence on susceptibility to environmental exposures could be used to predict and explain disease pathology more comprehensively and accurately. Building on findings at the individual level, PEH can identify novel points of intervention at the population level. For instance, changes can be made to the built environment to increase physical activity or access to smoking cessation programs for pregnant people can be expanded. Further, PEH results can be leveraged to prioritize the implementation and enforcement of environmental regulations to reduce health disparities by identifying populations with elevated risk due to environmental exposures.
Acknowledgments
This work was funded by intramural funds from the National Institute of Environmental Health Sciences. We wish to thank Hannah Collins Cakar for her help with manuscript preparation.
Author contributions
Conceptualization, A.A.M.-R., K.A.M., and R.W; methodology, A.A.M.-R., F.S.A., J.S.H., and K.A.M.; writing – original draft, A.A.M.-R., D.M.R., F.S.A., J.S.H., and K.A.M.; writing – review and editing, A.A.M.-R., D.M.R., B.A, F.S.A., J.S.H., K.P.M., D.C.F., T.A.B., S.S.N., C.P.S., K.G.P., D.M.B., C.P.L., S.A.N., G.W.C., A.K.M., B.A.M., Y.C., Q.E.H., K.A.M., and R.W.
Declaration of interests
The authors declare no competing interests.
References
- 1.Boyce W.T., Sokolowski M.B., Robinson G.E. Genes and environments, development and time. Proc. Natl. Acad. Sci. USA. 2020;117:23235–23241. doi: 10.1073/pnas.2016710117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bononi A., Goto K., Ak G., Yoshikawa Y., Emi M., Pastorino S., Carparelli L., Ferro A., Nasu M., Kim J.H., et al. Heterozygous germline BLM mutations increase susceptibility to asbestos and mesothelioma. Proc. Natl. Acad. Sci. USA. 2020;117:33466–33473. doi: 10.1073/pnas.2019652117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Modafferi S., Zhong X., Kleensang A., Murata Y., Fagiani F., Pamies D., Hogberg H.T., Calabrese V., Lachman H., Hartung T., Smirnova L. Gene-environment interactions in developmental neurotoxicity: a case study of synergy between chlorpyrifos and CHD8 knockout in human brainspheres. Environ. Health Perspect. 2021;129:77001. doi: 10.1289/ehp8580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Rampersaud E., Mitchell B.D., Pollin T.I., Fu M., Shen H., O'Connell J.R., Ducharme J.L., Hines S., Sack P., Naglieri R., et al. Physical activity and the association of common FTO gene variants with body mass index and obesity. Arch. Intern. Med. 2008;168:1791–1797. doi: 10.1001/archinte.168.16.1791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Bakermans-Kranenburg M.J., van IJzendoorn M.H. The hidden efficacy of interventions: gene×environment experiments from a differential susceptibility perspective. Annu. Rev. Psychol. 2015;66:381–409. doi: 10.1146/annurev-psych-010814-015407. [DOI] [PubMed] [Google Scholar]
- 6.Russell L.E., Zhou Y., Almousa A.A., Sodhi J.K., Nwabufo C.K., Lauschke V.M. Pharmacogenomics in the era of next generation sequencing - from byte to bedside. Drug Metab. Rev. 2021;53:253–278. doi: 10.1080/03602532.2021.1909613. [DOI] [PubMed] [Google Scholar]
- 7.Franczyk B., Rysz J., Gluba-Brzózka A. Pharmacogenetics of drugs used in the treatment of cancers. Genes. 2022;13:311. doi: 10.3390/genes13020311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Di L. Reaction phenotyping to assess victim drug-drug interaction risks. Expert Opin. Drug Discov. 2017;12:1105–1115. doi: 10.1080/17460441.2017.1367280. [DOI] [PubMed] [Google Scholar]
- 9.Wang C.W., Preclaro I.A.C., Lin W.H., Chung W.H. An updated review of genetic associations with severe adverse drug reactions: translation and implementation of pharmacogenomic testing in clinical practice. Front. Pharmacol. 2022;13:886377. doi: 10.3389/fphar.2022.886377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Baccarelli A., Dolinoy D.C., Walker C.L. A precision environmental health approach to prevention of human disease. Nat. Commun. 2023;14:2449. doi: 10.1038/s41467-023-37626-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Drobni Z.D., Kolossvary M., Karady J., Jermendy A.L., Tarnoki A.D., Tarnoki D.L., Simon J., Szilveszter B., Littvay L., Voros S., et al. Heritability of coronary artery disease: insights from a classical twin study. Circ. Cardiovasc. Imaging. 2022;15:e013348. doi: 10.1161/circimaging.121.013348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ottman R. Gene-environment interaction: definitions and study designs. Prev. Med. 1996;25:764–770. doi: 10.1006/pmed.1996.0117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Westerman K.E., Sofer T. Many roads to a gene-environment interaction. Am. J. Hum. Genet. 2024;111:626–635. doi: 10.1016/j.ajhg.2024.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Laclaustra M., Lopez-Garcia E., Civeira F., Garcia-Esquinas E., Graciani A., Guallar-Castillon P., Banegas J.R., Rodriguez-Artalejo F. LDL cholesterol rises with BMI only in lean individuals: cross-sectional U.S. and Spanish representative data. Diabetes Care. 2018;41:2195–2201. doi: 10.2337/dc18-0372. [DOI] [PubMed] [Google Scholar]
- 15.Gottlieb E.R., Ziegler J., Morley K., Rush B., Celi L.A. Assessment of racial and ethnic differences in oxygen supplementation among patients in the intensive care unit. JAMA Intern. Med. 2022;182:849–858. doi: 10.1001/jamainternmed.2022.2587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Wild C.P. The exposome: from concept to utility. Int. J. Epidemiol. 2012;41:24–32. doi: 10.1093/ije/dyr236. [DOI] [PubMed] [Google Scholar]
- 17.Getzinger G.J., Ferguson P.L. Illuminating the exposome with high-resolution accurate-mass mass spectrometry and nontargeted analysis. Current Opinion in Environmental Science & Health. 2020;15:49–56. doi: 10.1016/j.coesh.2020.05.005. [DOI] [Google Scholar]
- 18.You L., Sun G.H., Yu D., Liu X.Y., Xu G.W. New advances in exposomics-analysis methods and research paradigms based on chromatography-mass spectrometry. Se Pu. 2024;42:109–119. doi: 10.3724/sp.J.1123.2023.12001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Rappaport S.M., Smith M.T. Epidemiology. Environment and disease risks. Science. 2010;330:460–461. doi: 10.1126/science.1192603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dennis K.K., Marder E., Balshaw D.M., Cui Y., Lynes M.A., Patti G.J., Rappaport S.M., Shaughnessy D.T., Vrijheid M., Barr D.B. Biomonitoring in the era of the exposome. Environ. Health Perspect. 2017;125:502–510. doi: 10.1289/ehp474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Patel C.J., Ioannidis J.P.A. Studying the elusive environment in large scale. JAMA. 2014;311:2173–2174. doi: 10.1001/jama.2014.4129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Nuckols J.R., Ward M.H., Jarup L. Using geographic information systems for exposure assessment in environmental epidemiology studies. Environ. Health Perspect. 2004;112:1007–1015. doi: 10.1289/ehp.6738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Cohen M.A., Adar S.D., Allen R.W., Avol E., Curl C.L., Gould T., Hardie D., Ho A., Kinney P., Larson T.V., et al. Approach to estimating participant pollutant exposures in the Multi-Ethnic Study of Atherosclerosis and Air Pollution (MESA Air) Environ. Sci. Technol. 2009;43:4687–4693. doi: 10.1021/es8030837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ward M.H., Jones R.R., Brender J.D., de Kok T.M., Weyer P.J., Nolan B.T., Villanueva C.M., van Breda S.G. Drinking water nitrate and human health: an updated review. Int. J. Environ. Res. Public Health. 2018;15:1557. doi: 10.3390/ijerph15071557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.James P., Hart J.E., Banay R.F., Laden F. Exposure to greenness and mortality in a nationwide prospective cohort study of women. Environ. Health Perspect. 2016;124:1344–1352. doi: 10.1289/ehp.1510363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kumar P., Morawska L., Martani C., Biskos G., Neophytou M., Di Sabatino S., Bell M., Norford L., Britter R. The rise of low-cost sensing for managing air pollution in cities. Environ. Int. 2015;75:199–205. doi: 10.1016/j.envint.2014.11.019. [DOI] [PubMed] [Google Scholar]
- 27.Rai A.C., Kumar P., Pilla F., Skouloudis A.N., Di Sabatino S., Ratti C., Yasar A., Rickerby D. End-user perspective of low-cost sensors for outdoor air pollution monitoring. Sci. Total Environ. 2017;607–608:691–705. doi: 10.1016/j.scitotenv.2017.06.266. [DOI] [PubMed] [Google Scholar]
- 28.Nieuwenhuijsen M.J., Donaire-Gonzalez D., Rivas I., de Castro M., Cirach M., Hoek G., Seto E., Jerrett M., Sunyer J. Variability in and agreement between modeled and personal continuously measured black carbon levels using novel smartphone and sensor technologies. Environ. Sci. Technol. 2015;49:2977–2982. doi: 10.1021/es505362x. [DOI] [PubMed] [Google Scholar]
- 29.Dons E., Int Panis L., Van Poppel M., Theunis J., Wets G. Personal exposure to black carbon in transport microenvironments. Atmos. Environ. X. 2012;55:392–398. [Google Scholar]
- 30.Leaffer D., Wolfe C., Doroff S., Gute D., Wang G., Ryan P. Wearable ultrafine particle and noise monitoring sensors jointly measure personal co-exposures in a pediatric population. Int. J. Environ. Res. Public Health. 2019;16:308. doi: 10.3390/ijerph16030308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Henning A., J Downs N., Vanos J.K. Wearable ultraviolet radiation sensors for research and personal use. Int. J. Biometeorol. 2022;66:627–640. doi: 10.1007/s00484-021-02216-8. [DOI] [PubMed] [Google Scholar]
- 32.National Academies of Sciences Engineering and Medicine . Proceedings of a Workshop—In Brief. National Academies Press; 2023. Developing Wearable Technologies to Advance Understanding of Precision Environmental Health. [PubMed] [Google Scholar]
- 33.Holmgren S.D., Boyles R.R., Cronk R.D., Duncan C.G., Kwok R.K., Lunn R.M., Osborn K.C., Thessen A.E., Schmitt C.P. Catalyzing knowledge-driven discovery in environmental health sciences through a community-driven harmonized language. Int. J. Environ. Res. Public Health. 2021;18:8985. doi: 10.3390/ijerph18178985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wise L.A., Hartge P. In: Modern Epidemiology. Lash T.L., VanderWeele T.J., Haneuse S., Rothman K.J., editors. Wolters Kluwer; 2021. Field methods; pp. 221–246. [Google Scholar]
- 35.Lash T.L., VanderWeele T.J., Rothman K.J. In: Modern Epidemiology. Lash T.L., VanderWeele T.J., Haneuse S., Rothman K.J., editors. Wolters Kluwer; 2021. Measurement and measurement error; pp. 287–314. [Google Scholar]
- 36.Hamilton C.M., Strader L.C., Pratt J.G., Maiese D., Hendershot T., Kwok R.K., Hammond J.A., Huggins W., Jackman D., Pan H., et al. The PhenX Toolkit: get the most from your measures. Am. J. Epidemiol. 2011;174:253–260. doi: 10.1093/aje/kwr193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Chung M.K., House J.S., Akhtari F.S., Makris K.C., Langston M.A., Islam K.T., Holmes P., Chadeau-Hyam M., Smirnov A.I., Du X., et al. Decoding the exposome: data science methodologies and implications in exposome-wide association studies (ExWASs) Exposome. 2024;4:osae001. doi: 10.1093/exposome/osae001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Bhatnagar A. Environmental determinants of cardiovascular disease. Circ. Res. 2017;121:162–180. doi: 10.1161/circresaha.117.306458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kolb H., Martin S. Environmental/lifestyle factors in the pathogenesis and prevention of type 2 diabetes. BMC Med. 2017;15:131. doi: 10.1186/s12916-017-0901-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Florez J.C., Udler M.S., Hanson R.L. In: Diabetes in America. Cowie C.C., Casagrande S.S., Menke A., Cissell M.A., Eberhardt M.S., Meigs J.B., Gregg E.W., Knowler W.C., Barrett-Connor E., Becker D.J., et al., editors. 2018. Genetics of type 2 diabetes. (National Institute of Diabetes and Digestive and Kidney Diseases (US))). [PubMed] [Google Scholar]
- 41.Dharmage S.C., Perret J.L., Custovic A. Epidemiology of asthma in children and adults. Front. Pediatr. 2019;7:246. doi: 10.3389/fped.2019.00246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Willis-Owen S.A.G., Cookson W.O.C., Moffatt M.F. The genetics and genomics of asthma. Annu. Rev. Genomics Hum. Genet. 2018;19:223–246. doi: 10.1146/annurev-genom-083117-021651. [DOI] [PubMed] [Google Scholar]
- 43.Tomasetti C., Vogelstein B. Cancer etiology. Variation in cancer risk among tissues can be explained by the number of stem cell divisions. Science. 2015;347:78–81. doi: 10.1126/science.1260825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Dai J., Shen W., Wen W., Chang J., Wang T., Chen H., Jin G., Ma H., Wu C., Li L., et al. Estimation of heritability for nine common cancers using data from genome-wide association studies in Chinese population. Int. J. Cancer. 2017;140:329–336. doi: 10.1002/ijc.30447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Brown R.C., Lockwood A.H., Sonawane B.R. Neurodegenerative diseases: an overview of environmental risk factors. Environ. Health Perspect. 2005;113:1250–1256. doi: 10.1289/ehp.7567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Pihlstrøm L., Wiethoff S., Houlden H. Genetics of neurodegenerative diseases: an overview. Handb. Clin. Neurol. 2017;145:309–323. doi: 10.1016/b978-0-12-802395-2.00022-5. [DOI] [PubMed] [Google Scholar]
- 47.Greene C.S., Penrod N.M., Williams S.M., Moore J.H. Failure to replicate a genetic association may provide important clues about genetic architecture. PLoS One. 2009;4:e5639. doi: 10.1371/journal.pone.0005639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Qiu W., Rogers A.J., Damask A., Raby B.A., Klanderman B.J., Duan Q.L., Tyagi S., Niu S., Anderson C., Cahir-Mcfarland E., et al. Pharmacogenomics: novel loci identification via integrating gene differential analysis and eQTL analysis. Hum. Mol. Genet. 2014;23:5017–5024. doi: 10.1093/hmg/ddu191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Zhou X., Lee S.H. An integrative analysis of genomic and exposomic data for complex traits and phenotypic prediction. Sci. Rep. 2021;11:21495. doi: 10.1038/s41598-021-00427-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Laville V., Majarian T., Sung Y.J., Schwander K., Feitosa M.F., Chasman D.I., Bentley A.R., Rotimi C.N., Cupples L.A., de Vries P.S., et al. Gene-lifestyle interactions in the genomics of human complex traits. Eur. J. Hum. Genet. 2022;30:730–739. doi: 10.1038/s41431-022-01045-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Martin A.R., Kanai M., Kamatani Y., Okada Y., Neale B.M., Daly M.J. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 2019;51:584–591. doi: 10.1038/s41588-019-0379-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Aschard H., Zaitlen N., Lindström S., Kraft P. Variation in predictive ability of common genetic variants by established strata: the example of breast cancer and age. Epidemiology. 2015;26:51–58. doi: 10.1097/ede.0000000000000195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Davis A.P., Wiegers T.C., Johnson R.J., Sciaky D., Wiegers J., Mattingly C.J. Comparative Toxicogenomics Database (CTD): update 2023. Nucleic Acids Res. 2023;51:D1257–D1262. doi: 10.1093/nar/gkac833. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Davis A.P., Murphy C.G., Saraceni-Richards C.A., Rosenstein M.C., Wiegers T.C., Hampton T.H., Mattingly C.J. GeneComps and ChemComps: a new CTD metric to identify genes and chemicals with shared toxicogenomic profiles. Bioinformation. 2009;4:173–174. doi: 10.6026/97320630004173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Davis A.P., Rosenstein M.C., Wiegers T.C., Mattingly C.J. DiseaseComps: a metric that discovers similar diseases based upon common toxicogenomic profiles at CTD. Bioinformation. 2011;7:154–156. doi: 10.6026/97320630007154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Waters M., Stasiewicz S., Merrick B.A., Tomer K., Bushel P., Paules R., Stegman N., Nehls G., Yost K.J., Johnson C.H., et al. CEBS--Chemical Effects in Biological Systems: a public data repository integrating study design and toxicity data with microarray and proteomics data. Nucleic Acids Res. 2008;36:D892–D900. doi: 10.1093/nar/gkm755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Dick D.M., Agrawal A., Keller M.C., Adkins A., Aliev F., Monroe S., Hewitt J.K., Kendler K.S., Sher K.J. Candidate gene-environment interaction research: reflections and recommendations. Perspect. Psychol. Sci. 2015;10:37–59. doi: 10.1177/1745691614556682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Arnau-Soler A., Macdonald-Dunlop E., Adams M.J., Clarke T.K., MacIntyre D.J., Milburn K., Navrady L., Generation Scotland. A.M.Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium. Hayward C., et al. Genome-wide by environment interaction studies of depressive symptoms and psychosocial stress in UK Biobank and Generation Scotland. Transl. Psychiatry. 2019;9:14. doi: 10.1038/s41398-018-0360-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Thomas D. Gene--environment-wide association studies: emerging approaches. Nat. Rev. Genet. 2010;11:259–272. doi: 10.1038/nrg2764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Reddon H., Guéant J.L., Meyre D. The importance of gene-environment interactions in human obesity. Clin. Sci. (Lond.) 2016;130:1571–1597. doi: 10.1042/cs20160221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Figueroa J.D., Han S.S., Garcia-Closas M., Baris D., Jacobs E.J., Kogevinas M., Schwenn M., Malats N., Johnson A., Purdue M.P., et al. Genome-wide interaction study of smoking and bladder cancer risk. Carcinogenesis. 2014;35:1737–1744. doi: 10.1093/carcin/bgu064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Virolainen S.J., VonHandorf A., Viel K.C.M.F., Weirauch M.T., Kottyan L.C. Gene-environment interactions and their impact on human health. Genes Immun. 2023;24:1–11. doi: 10.1038/s41435-022-00192-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Maas S.C.E., Mens M.M.J., Kühnel B., van Meurs J.B.J., Uitterlinden A.G., Peters A., Prokisch H., Herder C., Grallert H., Kunze S., et al. Smoking-related changes in DNA methylation and gene expression are associated with cardio-metabolic traits. Clin. Epigenetics. 2020;12:157. doi: 10.1186/s13148-020-00951-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Nowak A.L., Anderson C.M., Mackos A.R., Neiman E., Gillespie S.L. Stress during pregnancy and epigenetic modifications to offspring DNA: a systematic review of associations and implications for preterm birth. J. Perinat. Neonatal Nurs. 2020;34:134–145. doi: 10.1097/jpn.0000000000000471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Sales V.M., Ferguson-Smith A.C., Patti M.E. Epigenetic mechanisms of transmission of metabolic disease across generations. Cell Metab. 2017;25:559–571. doi: 10.1016/j.cmet.2017.02.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Moran S., Arribas C., Esteller M. Validation of a DNA methylation microarray for 850,000 CpG sites of the human genome enriched in enhancer sequences. Epigenomics. 2016;8:389–399. doi: 10.2217/epi.15.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Geronimus A.T. Deep integration: letting the epigenome out of the bottle without losing sight of the structural origins of population health. Am. J. Public Health. 2013;103:S56–S63. doi: 10.2105/ajph.2013.301380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Guidi J., Lucente M., Sonino N., Fava G.A. Allostatic load and its impact on health: a systematic review. Psychother. Psychosom. 2021;90:11–27. doi: 10.1159/000510696. [DOI] [PubMed] [Google Scholar]
- 69.Cirillo E., Parnell L.D., Evelo C.T. A review of pathway-based analysis tools that visualize genetic variants. Front. Genet. 2017;8:174. doi: 10.3389/fgene.2017.00174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Cavalli G., Heard E. Advances in epigenetics link genetics to the environment and disease. Nature. 2019;571:489–499. doi: 10.1038/s41586-019-1411-0. [DOI] [PubMed] [Google Scholar]
- 71.Barker D.J., Osmond C. Infant mortality, childhood nutrition, and ischaemic heart disease in England and Wales. Lancet. 1986;1:1077–1081. doi: 10.1016/s0140-6736(86)91340-1. [DOI] [PubMed] [Google Scholar]
- 72.Heindel J.J., Newbold R., Schug T.T. Endocrine disruptors and obesity. Nat. Rev. Endocrinol. 2015;11:653–661. doi: 10.1038/nrendo.2015.163. [DOI] [PubMed] [Google Scholar]
- 73.Meda P., Bruzzone R., Knodel S., Orci L. Blockage of cell-to-cell communication within pancreatic acini is associated with increased basal release of amylase. J. Cell Biol. 1986;103:475–483. doi: 10.1083/jcb.103.2.475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Harrill J.A., Everett L.J., Haggard D.E., Sheffield T., Bundy J.L., Willis C.M., Thomas R.S., Shah I., Judson R.S. High-throughput transcriptomics platform for screening environmental chemicals. Toxicol. Sci. 2021;181:68–89. doi: 10.1093/toxsci/kfab009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Niu L., Thiele M., Geyer P.E., Rasmussen D.N., Webel H.E., Santos A., Gupta R., Meier F., Strauss M., Kjaergaard M., et al. Noninvasive proteomic biomarkers for alcohol-related liver disease. Nat. Med. 2022;28:1277–1287. doi: 10.1038/s41591-022-01850-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Gouveia-Figueira S., Karimpour M., Bosson J.A., Blomberg A., Unosson J., Pourazar J., Sandström T., Behndig A.F., Nording M.L. Mass spectrometry profiling of oxylipins, endocannabinoids, and N-acylethanolamines in human lung lavage fluids reveals responsiveness of prostaglandin E2 and associated lipid metabolites to biodiesel exhaust exposure. Anal. Bioanal. Chem. 2017;409:2967–2980. doi: 10.1007/s00216-017-0243-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Walker D.I., Uppal K., Zhang L., Vermeulen R., Smith M., Hu W., Purdue M.P., Tang X., Reiss B., Kim S., et al. High-resolution metabolomics of occupational exposure to trichloroethylene. Int. J. Epidemiol. 2016;45:1517–1527. doi: 10.1093/ije/dyw218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Clark C., Dayon L., Masoodi M., Bowman G.L., Popp J. An integrative multi-omics approach reveals new central nervous system pathway alterations in Alzheimer's disease. Alzheimer's Res. Ther. 2021;13:71. doi: 10.1186/s13195-021-00814-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Fang J., Zhang P., Wang Q., Chiang C.W., Zhou Y., Hou Y., Xu J., Chen R., Zhang B., Lewis S.J., et al. Artificial intelligence framework identifies candidate targets for drug repurposing in Alzheimer's disease. Alzheimer's Res. Ther. 2022;14:7. doi: 10.1186/s13195-021-00951-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Song X., Zhu Z., Qian X., Liu X., Chen S., Tang H. Multi-omics Characterization of type 2 diabetes mellitus-induced cognitive impairment in the db/db mouse model. Molecules. 2022;27:1904. doi: 10.3390/molecules27061904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Howson J.M.M., Zhao W., Barnes D.R., Ho W.K., Young R., Paul D.S., Waite L.L., Freitag D.F., Fauman E.B., Salfati E.L., et al. Fifteen new risk loci for coronary artery disease highlight arterial-wall-specific mechanisms. Nat. Genet. 2017;49:1113–1119. doi: 10.1038/ng.3874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Jia P., Hu R., Yan F., Dai Y., Zhao Z. scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies. Genome Biol. 2022;23:220. doi: 10.1186/s13059-022-02785-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Ziffra R.S., Kim C.N., Ross J.M., Wilfert A., Turner T.N., Haeussler M., Casella A.M., Przytycki P.F., Keough K.C., Shin D., et al. Single-cell epigenomics reveals mechanisms of human cortical development. Nature. 2021;598:205–213. doi: 10.1038/s41586-021-03209-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Liu C., Martins A.J., Lau W.W., Rachmaninoff N., Chen J., Imberti L., Mostaghimi D., Fink D.L., Burbelo P.D., Dobbs K., et al. Time-resolved systems immunology reveals a late juncture linked to fatal COVID-19. Cell. 2021;184:1836–1857.e22. doi: 10.1016/j.cell.2021.02.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Ding R., Wang Q., Gong L., Zhang T., Zou X., Xiong K., Liao Q., Plass M., Li L. scQTLbase: an integrated human single-cell eQTL database. Nucleic Acids Res. 2024;52:D1010–D1017. doi: 10.1093/nar/gkad781. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.El-Soda M., Malosetti M., Zwaan B.J., Koornneef M., Aarts M.G.M. Genotype×environment interaction QTL mapping in plants: lessons from Arabidopsis. Trends Plant Sci. 2014;19:390–398. doi: 10.1016/j.tplants.2014.01.001. [DOI] [PubMed] [Google Scholar]
- 87.Wang H., Zhang F., Zeng J., Wu Y., Kemper K.E., Xue A., Zhang M., Powell J.E., Goddard M.E., Wray N.R., et al. Genotype-by-environment interactions inferred from genetic effects on phenotypic variability in the UK Biobank. Sci. Adv. 2019;5:eaaw3538. doi: 10.1126/sciadv.aaw3538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Aschard H. A perspective on interaction effects in genetic association studies. Genet. Epidemiol. 2016;40:678–688. doi: 10.1002/gepi.21989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Wang H., Ye M., Fu Y., Dong A., Zhang M., Feng L., Zhu X., Bo W., Jiang L., Griffin C.H., et al. Modeling genome-wide by environment interactions through omnigenic interactome networks. Cell Rep. 2021;35:109114. doi: 10.1016/j.celrep.2021.109114. [DOI] [PubMed] [Google Scholar]
- 90.Warrington N.M., Zhu G., Dy V., Heath A.C., Madden P.A.F., Hemani G., Kemp J.P., McMahon G., St Pourcain B., Timpson N.J., et al. Genome-wide association study of blood lead shows multiple associations near ALAD. Hum. Mol. Genet. 2015;24:3871–3879. doi: 10.1093/hmg/ddv112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Ritz B.R., Chatterjee N., Garcia-Closas M., Gauderman W.J., Pierce B.L., Kraft P., Tanner C.M., Mechanic L.E., McAllister K. Lessons learned from past gene-environment interaction successes. Am. J. Epidemiol. 2017;186:778–786. doi: 10.1093/aje/kwx230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Zhou F., Ren J., Lu X., Ma S., Wu C. Gene–environment interaction: A variable selection perspective. Methods Mol. Biol. 2021;2212:191–223. doi: 10.1007/978-1-0716-0947-7_13. [DOI] [PubMed] [Google Scholar]
- 93.Quintana M.A., Conti D.V. Integrative variable selection via Bayesian model uncertainty. Stat. Med. 2013;32:4938–4953. doi: 10.1002/sim.5888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Motsinger A.A., Ritchie M.D., Reif D.M. Novel methods for detecting epistasis in pharmacogenomics studies. Pharmacogenomics. 2007;8:1229–1241. doi: 10.2217/14622416.8.9.1229. [DOI] [PubMed] [Google Scholar]
- 95.Ritchie M.D., Motsinger A.A. Multifactor dimensionality reduction for detecting gene-gene and gene-environment interactions in pharmacogenomics studies. Pharmacogenomics. 2005;6:823–834. doi: 10.2217/14622416.6.8.823. [DOI] [PubMed] [Google Scholar]
- 96.Neslund-Dudas C., Levin A.M., Rundle A., Beebe-Dimmer J., Bock C.H., Nock N.L., Jankowski M., Datta I., Krajenta R., Dou Q.P., et al. Case-only gene–environment interaction between ALAD tagSNPs and occupational lead exposure in prostate cancer. Prostate. 2014;74:637–646. doi: 10.1002/pros.22781. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Gauderman W.J., Mukherjee B., Aschard H., Hsu L., Lewinger J.P., Patel C.J., Witte J.S., Amos C., Tai C.G., Conti D., et al. Update on the state of the science for analytical methods for gene-environment interactions. Am. J. Epidemiol. 2017;186:762–770. doi: 10.1093/aje/kwx228. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Ritchie M.D., Davis J.R., Aschard H., Battle A., Conti D., Du M., Eskin E., Fallin M.D., Hsu L., Kraft P., et al. Incorporation of biological knowledge into the study of gene-environment interactions. Am. J. Epidemiol. 2017;186:771–777. doi: 10.1093/aje/kwx229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Marderstein A.R., Davenport E.R., Kulm S., Van Hout C.V., Elemento O., Clark A.G. Leveraging phenotypic variability to identify genetic interactions in human phenotypes. Am. J. Hum. Genet. 2021;108:49–67. doi: 10.1016/j.ajhg.2020.11.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Su Y.-R., Di C.-Z., Hsu L., Genetics and Epidemiology of Colorectal Cancer Consortium A unified powerful set-based test for sequencing data analysis of GxE interactions. Biostatistics. 2017;18:119–131. doi: 10.1093/biostatistics/kxw034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Miao J., Song G., Wu Y., Hu J., Wu Y., Basu S., Andrews J.S., Schaumberg K., Fletcher J.M., Schmitz L.L. Reimagining gene-environment interaction analysis for human complex traits. bioRxiv. 2022 Preprint at. [Google Scholar]
- 102.Dahl A., Nguyen K., Cai N., Gandal M.J., Flint J., Zaitlen N. A robust method uncovers significant context-specific heritability in diverse complex traits. Am. J. Hum. Genet. 2020;106:71–91. doi: 10.1016/j.ajhg.2019.11.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Moore R., Casale F.P., Jan Bonder M., Horta D., BIOS Consortium. C.’t Hoen P.A., Barroso I., Stegle O., Jansen R., Franke L., et al. A linear mixed-model approach to study multivariate gene–environment interactions. Nat. Genet. 2019;51:180–186. doi: 10.1038/s41588-018-0271-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Heys K.A., Shore R.F., Pereira M.G., Jones K.C., Martin F.L. Risk assessment of environmental mixture effects. RSC Adv. 2016;6:47844–47857. [Google Scholar]
- 105.Teuschler L.K. Deciding which chemical mixtures risk assessment methods work best for what mixtures. Toxicol. Appl. Pharmacol. 2007;223:139–147. doi: 10.1016/j.taap.2006.07.010. [DOI] [PubMed] [Google Scholar]
- 106.Psaty B.M., O'Donnell C.J., Gudnason V., Lunetta K.L., Folsom A.R., Rotter J.I., Uitterlinden A.G., Harris T.B., Witteman J.C.M., Boerwinkle E., CHARGE Consortium Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium: design of prospective meta-analyses of genome-wide association studies from 5 cohorts. Circ. Cardiovasc. Genet. 2009;2:73–80. doi: 10.1161/circgenetics.108.829747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Wei P., Tang H., Li D. Functional logistic regression approach to detecting gene by longitudinal environmental exposure interaction in a case-control study. Genet. Epidemiol. 2014;38:638–651. doi: 10.1002/gepi.21852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Dodge H.H., Zhu J., Mattek N.C., Austin D., Kornfeld J., Kaye J.A. Use of high-frequency in-home monitoring data may reduce sample sizes needed in clinical trials. PLoS One. 2015;10:e0138095. doi: 10.1371/journal.pone.0138095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.McKinney B.A., Reif D.M., Ritchie M.D., Moore J.H. Machine learning for detecting gene-gene interactions: a review. Appl. Bioinformatics. 2006;5:77–88. doi: 10.2165/00822942-200605020-00002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Jiao S., Hsu L., Bézieau S., Brenner H., Chan A.T., Chang-Claude J., Le Marchand L., Lemire M., Newcomb P.A., Slattery M.L., Peters U. SBERIA: set-based gene-environment interaction test for rare and common variants in complex diseases. Genet. Epidemiol. 2013;37:452–464. doi: 10.1002/gepi.21735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Moore R., Casale F.P., Jan Bonder M., Horta D., BIOS Consortium. Franke L., Barroso I., Stegle O. A linear mixed-model approach to study multivariate gene-environment interactions. Nat. Genet. 2019;51:180–186. doi: 10.1038/s41588-018-0271-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Johnson R., Sotoudeh R., Conley D. Polygenic scores for plasticity: a new tool for studying gene-environment interplay. Demography. 2022;59:1045–1070. doi: 10.1215/00703370-9957418. [DOI] [PubMed] [Google Scholar]
- 113.Barrett J.H., Smith G., Waxman R., Gooderham N., Lightfoot T., Garner R.C., Augustsson K., Wolf C.R., Bishop D.T., Forman D., Colorectal Cancer Study Group Investigation of interaction between N-acetyltransferase 2 and heterocyclic amines as potential risk factors for colorectal cancer. Carcinogenesis. 2003;24:275–282. doi: 10.1093/carcin/24.2.275. [DOI] [PubMed] [Google Scholar]
- 114.McAllister K., Mechanic L.E., Amos C., Aschard H., Blair I.A., Chatterjee N., Conti D., Gauderman W.J., Hsu L., Hutter C.M., et al. Current challenges and new opportunities for gene-environment interaction studies of complex diseases. Am. J. Epidemiol. 2017;186:753–761. doi: 10.1093/aje/kwx227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Sudlow C., Gallacher J., Allen N., Beral V., Burton P., Danesh J., Downey P., Elliott P., Green J., Landray M., et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12:e1001779. doi: 10.1371/journal.pmed.1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.All of Us Research Program Genomics Investigators Genomic data in the All of Us Research Program. Nature. 2024;627:340–346. doi: 10.1038/s41586-023-06957-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Viet S.M., Falman J.C., Merrill L.S., Faustman E.M., Savitz D.A., Mervish N., Barr D.B., Peterson L.A., Wright R., Balshaw D., O'Brien B. Human Health Exposure Analysis Resource (HHEAR): a model for incorporating the exposome into health studies. Int. J. Hyg Environ. Health. 2021;235:113768. doi: 10.1016/j.ijheh.2021.113768. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.National Academies of Sciences Engineering and Medicine . The National Academies Press; 2023. Building Confidence in New Evidence Streams for Human Health Risk Assessment: Lessons Learned from Laboratory Mammalian Toxicity Tests. [DOI] [PubMed] [Google Scholar]
- 119.Jansen R.J., Argos M., Tong L., Li J., Rakibuz-Zaman M., Islam M.T., Slavkovich V., Ahmed A., Navas-Acien A., Parvez F., et al. Determinants and consequences of arsenic metabolism efficiency among 4,794 individuals: demographics, lifestyle, genetics, and toxicity. Cancer Epidemiol. Biomarkers Prev. 2016;25:381–390. doi: 10.1158/1055-9965.Epi-15-0718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Kabbani D., Akika R., Wahid A., Daly A.K., Cascorbi I., Zgheib N.K. Pharmacogenomics in practice: a review and implementation guide. Front. Pharmacol. 2023;14:1189976. doi: 10.3389/fphar.2023.1189976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.International Warfarin Pharmacogenetics Consortium. Klein T.E., Altman R.B., Eriksson N., Gage B.F., Kimmel S.E., Lee M.T.M., Limdi N.A., Page D., Roden D.M., et al. Estimation of the warfarin dose with clinical and pharmacogenetic data. N. Engl. J. Med. 2009;360:753–764. doi: 10.1056/NEJMoa0809329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Dean L. In: Medical Genetics Summaries. M. P.V., Scott S.A., Pirmohamed M., editors. National Center for Biotechnology Information; 2015. Abacavir Therapy and HLA-B∗57:01 Genotype. [PubMed] [Google Scholar]
- 123.Zhou S. Clinical pharmacogenomics of thiopurine S-methyltransferase. Curr. Clin. Pharmacol. 2006;1:119–128. doi: 10.2174/157488406784111627. [DOI] [PubMed] [Google Scholar]
- 124.Olopade O.I., Grushko T.A., Nanda R., Huo D. Advances in breast cancer: pathways to personalized medicine. Clin. Cancer Res. 2008;14:7988–7999. doi: 10.1158/1078-0432.Ccr-08-1211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Cheung M., Testa J.R. BAP1, a tumor suppressor gene driving malignant mesothelioma. Transl. Lung Cancer Res. 2017;6:270–278. doi: 10.21037/tlcr.2017.05.03. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Pilarski R., Carlo M.I., Cebulla C., Abdel-Rahman M. In: GeneReviews. Adam M.P., Feldman J., Mirzaa G.M., Pagon R.A., Wallace S.E., Bean L.J.H., Gripp K.W., Amemiya A., editors. University of Washington, Seattle; 1993. BAP1 Tumor Predisposition Syndrome. [PubMed] [Google Scholar]
- 127.Balik-Meisner M., Truong L., Scholl E.H., La Du J.K., Tanguay R.L., Reif D.M. Elucidating gene-by-environment interactions associated with differential susceptibility to chemical exposure. Environ. Health Perspect. 2018;126:67010. doi: 10.1289/ehp2662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Rusyn I., Chiu W.A., Wright F.A. Model systems and organisms for addressing inter- and intra-species variability in risk assessment. Regul. Toxicol. Pharmacol. 2022;132:105197. doi: 10.1016/j.yrtph.2022.105197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Threadgill D.W., Miller D.R., Churchill G.A., de Villena F.P.M. The collaborative cross: a recombinant inbred mouse population for the systems genetic era. ILAR J. 2011;52:24–31. doi: 10.1093/ilar.52.1.24. [DOI] [PubMed] [Google Scholar]
- 130.Bogue M.A., Churchill G.A., Chesler E.J. Collaborative Cross and Diversity Outbred data resources in the Mouse Phenome Database. Mamm. Genome. 2015;26:511–520. doi: 10.1007/s00335-015-9595-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Saul M.C., Philip V.M., Reinholdt L.G., Center for Systems Neurogenetics of Addiction. Chesler E.J. High-diversity mouse populations for complex traits. Trends Genet. 2019;35:501–514. doi: 10.1016/j.tig.2019.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Lusis A.J., Seldin M.M., Allayee H., Bennett B.J., Civelek M., Davis R.C., Eskin E., Farber C.R., Hui S., Mehrabian M., et al. The Hybrid Mouse Diversity Panel: a resource for systems genetics analyses of metabolic and cardiovascular traits. J. Lipid Res. 2016;57:925–942. doi: 10.1194/jlr.R066944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Tabakoff B., Smith H., Vanderlinden L.A., Hoffman P.L., Saba L.M. Networking in biology: The Hybrid Rat Diversity Panel. Methods Mol. Biol. 2019;2018:213–231. doi: 10.1007/978-1-4939-9581-3_10. [DOI] [PubMed] [Google Scholar]
- 134.Mackay T.F.C., Richards S., Stone E.A., Barbadilla A., Ayroles J.F., Zhu D., Casillas S., Han Y., Magwire M.M., Cridland J.M., et al. The Drosophila melanogaster Genetic Reference Panel. Nature. 2012;482:173–178. doi: 10.1038/nature10811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Le Goff A., Louvel S., Boullier H., Allard P. Toxicoepigenetics for risk assessment: bridging the gap between basic and regulatory science. Epigenet. Insights. 2022;15 doi: 10.1177/25168657221113149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Akhtari F.S., Green A.J., Small G.W., Havener T.M., House J.S., Roell K.R., Reif D.M., McLeod H.L., Wiltshire T., Motsinger-Reif A.A. High-throughput screening and genome-wide analyses of 44 anticancer drugs in the 1000 Genomes cell lines reveals an association of the NQO1 gene with the response of multiple anticancer drugs. PLoS Genet. 2021;17:e1009732. doi: 10.1371/journal.pgen.1009732. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Shin H.M., Ernstoff A., Arnot J.A., Wetmore B.A., Csiszar S.A., Fantke P., Zhang X., McKone T.E., Jolliet O., Bennett D.H. Risk-based high-throughput chemical screening and prioritization using exposure models and in vitro bioactivity assays. Environ. Sci. Technol. 2015;49:6760–6771. doi: 10.1021/acs.est.5b00498. [DOI] [PubMed] [Google Scholar]
- 138.Tice R.R., Austin C.P., Kavlock R.J., Bucher J.R. Improving the human hazard characterization of chemicals: a Tox21 update. Environ. Health Perspect. 2013;121:756–765. doi: 10.1289/ehp.1205784. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Judson R., Houck K., Martin M., Knudsen T., Thomas R.S., Sipes N., Shah I., Wambaugh J., Crofton K. In vitro and modelling approaches to risk assessment from the U.S. Environmental Protection Agency ToxCast programme. Basic Clin. Pharmacol. Toxicol. 2014;115:69–76. doi: 10.1111/bcpt.12239. [DOI] [PubMed] [Google Scholar]
- 140.Hsieh J.H., Smith-Roe S.L., Huang R., Sedykh A., Shockley K.R., Auerbach S.S., Merrick B.A., Xia M., Tice R.R., Witt K.L. Identifying compounds with genotoxicity potential using Tox21 high-throughput screening assays. Chem. Res. Toxicol. 2019;32:1384–1401. doi: 10.1021/acs.chemrestox.9b00053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 141.Vandenberg L.N. Toxicity testing and endocrine disrupting chemicals. Adv. Pharmacol. 2021;92:35–71. doi: 10.1016/bs.apha.2021.05.001. [DOI] [PubMed] [Google Scholar]
- 142.Politi R., Rusyn I., Tropsha A. Prediction of binding affinity and efficacy of thyroid hormone receptor ligands using QSAR and structure-based modeling methods. Toxicol. Appl. Pharmacol. 2014;280:177–189. doi: 10.1016/j.taap.2014.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Huang R., Xia M., Sakamuru S., Zhao J., Shahane S.A., Attene-Ramos M., Zhao T., Austin C.P., Simeonov A. Modelling the Tox21 10 K chemical profiles for in vivo toxicity prediction and mechanism characterization. Nat. Commun. 2016;7:10425. doi: 10.1038/ncomms10425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.Judson R.S., Martin M.T., Reif D.M., Houck K.A., Knudsen T.B., Rotroff D.M., Xia M., Sakamuru S., Huang R., Shinn P., et al. Analysis of eight oil spill dispersants using rapid, in vitro tests for endocrine and other biological activity. Environ. Sci. Technol. 2010;44:5979–5985. doi: 10.1021/es102150z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Gwinn W.M., Bousquet R.W., Perry C.E., Urbano N.C., Auerbach S.S. NTP Research Report on the Preliminary Evaluation of 4-Methylcyclohexylmethanol in an In Vitro Human Airway Model: Research Report 7. National Toxicology Program; 2018. NTP Research Reports. [DOI] [PubMed] [Google Scholar]
- 146.Mav D., Phadke D.P., Balik-Meisner M.R., Merrick B.A., Auerbach S., Niemeijer M., Huppelschoten S., Baze A., Parmentier C., Richert L., et al. Utility of extrapolating human S1500+ genes to the whole transcriptome: tunicamycin case study. Bioinform. Biol. Insights. 2020;14 doi: 10.1177/1177932220952742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 147.Gwinn W.M., Auerbach S.S., Parham F., Stout M.D., Waidyanatha S., Mutlu E., Collins B., Paules R.S., Merrick B.A., Ferguson S., et al. Evaluation of 5-day in vivo rat liver and kidney with high-throughput transcriptomics for estimating benchmark doses of apical outcomes. Toxicol. Sci. 2020;176:343–354. doi: 10.1093/toxsci/kfaa081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 148.Corton J.C., Kleinstreuer N.C., Judson R.S. Identification of potential endocrine disrupting chemicals using gene expression biomarkers. Toxicol. Appl. Pharmacol. 2019;380:114683. doi: 10.1016/j.taap.2019.114683. [DOI] [PubMed] [Google Scholar]
- 149.Teeguarden J.G., Tan Y.M., Edwards S.W., Leonard J.A., Anderson K.A., Corley R.A., Kile M.L., Simonich S.M., Stone D., Tanguay R.L., et al. Completing the link between exposure science and toxicology for improved environmental health decision making: the aggregate exposure pathway framework. Environ. Sci. Technol. 2016;50:4579–4586. doi: 10.1021/acs.est.5b05311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 150.Eccles K.M., Karmaus A.L., Kleinstreuer N.C., Parham F., Rider C.V., Wambaugh J.F., Messier K.P. A geospatial modeling approach to quantifying the risk of exposure to environmental chemical mixtures via a common molecular target. Sci. Total Environ. 2023;855:158905. doi: 10.1016/j.scitotenv.2022.158905. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 151.Khoury M.J., Engelgau M., Chambers D.A., Mensah G.A. Beyond public health genomics: can big data and predictive analytics deliver precision public health? Public Health Genom. 2018;21:244–250. doi: 10.1159/000501465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 152.Bernatchez L., Ferchaud A.L., Berger C.S., Venney C.J., Xuereb A. Genomics for monitoring and understanding species responses to global climate change. Nat. Rev. Genet. 2024;25:165–183. doi: 10.1038/s41576-023-00657-y. [DOI] [PubMed] [Google Scholar]
- 153.Adkins-Jackson P.B., Chantarat T., Bailey Z.D., Ponce N.A. Measuring structural racism: a guide for epidemiologists and other health researchers. Am. J. Epidemiol. 2022;191:539–547. doi: 10.1093/aje/kwab239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 154.Mulligan C.J. Systemic racism can get under our skin and into our genes. Am. J. Phys. Anthropol. 2021;175:399–405. doi: 10.1002/ajpa.24290. [DOI] [PubMed] [Google Scholar]
- 155.Senier L., Brown P., Shostak S., Hanna B. The socio-exposome: advancing exposure science and environmental justice in a post-genomic era. Environ. Sociol. 2017;3:107–121. doi: 10.1080/23251042.2016.1220848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 156.National Academies of Sciences, E., and Medicine . National Academies Press; 2018. Returning Individual Research Results to Participants: Guidance for a New Research Paradigm. [DOI] [PubMed] [Google Scholar]
- 157.Boronow K.E., Perovich L.J., Sweeney L., Yoo J.S., Rudel R.A., Brown P., Brody J.G. Privacy risks of sharing data from environmental health studies. Environ. Health Perspect. 2020;128:17008. doi: 10.1289/ehp4817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 158.Onsrud H.J. Identifying unethical conduct in the use of GIS. Cartogr. Geogr. Inf. Syst. 1995;22:90–97. [Google Scholar]


