Skip to main content
eLife logoLink to eLife
. 2021 Jun 18;10:e63852. doi: 10.7554/eLife.63852

Psychosocial experiences modulate asthma-associated genes through gene-environment interactions

Justyna A Resztak 1, Allison K Farrell 2, Henriette Mair-Meijers 1, Adnan Alazizi 1, Xiaoquan Wen 3, Derek E Wildman 4, Samuele Zilioli 5,6, Richard B Slatcher 5, Roger Pique-Regi 1,7,, Francesca Luca 1,7,
Editors: Patricia W Finn8, Mone Zaidi9
PMCID: PMC8282343  PMID: 34142656

Abstract

Social interactions and the overall psychosocial environment have a demonstrated impact on health, particularly for people living in disadvantaged urban areas. Here, we investigated the effect of psychosocial experiences on gene expression in peripheral blood immune cells of children with asthma in Metro Detroit. Using RNA-sequencing and a new machine learning approach, we identified transcriptional signatures of 19 variables including psychosocial factors, blood cell composition, and asthma symptoms. Importantly, we found 169 genes associated with asthma or allergic disease that are regulated by psychosocial factors and 344 significant gene-environment interactions for gene expression levels. These results demonstrate that immune gene expression mediates the link between negative psychosocial experiences and asthma risk.

Research organism: Human

Introduction

Psychosocial experiences have long been recognized to affect human health (Miller et al., 2009). Intrapersonal processes (e.g., emotionality [Pressman et al., 2019; Smith et al., 2004], interpersonal social relationships [Repetti et al., 2002; Robles et al., 2014]) and broader structural environments (e.g., neighborhood quality and socioeconomic status [SES]; Gallo and Matthews, 2003) are all associated with the morbidity and severity of diseases such as asthma (Harrison, 1998), cancer (Meyer and Mark, 1995), cardiovascular disease (Everson-Rose and Lewis, 2005), as well as mortality rates (Holt-Lunstad et al., 2010; Chida and Steptoe, 2008). Asthma is a chronic inflammatory disease of the respiratory tract that disproportionately affects children (Moorman et al., 2012). It is one of the costliest pediatric health conditions (Weiss et al., 2000) and a leading cause of school absenteeism (Akinbami and Centers for Disease Control and Prevention National Center for Health Statistics, 2006). Financially struggling cities, such as Detroit, are at an especially high risk for asthma morbidity and mortality (Sullivan et al., 2002). While environmental and genetic factors lead to the development of asthma and affect the health of children with asthma (von Mutius, 2000; von Mutius, 2009; Umetsu et al., 2002), psychosocial stress is a critical factor contributing to asthma severity (Wright et al., 1998; Wright et al., 2005; Shankardass et al., 2009; Chen and Miller, 2007; Sandberg et al., 2000). Understanding the biological pathways underlying these associations is crucial to strengthen the causal claims linking psychosocial experiences and health.

The growing field of social genomics investigates how various dimensions of a person's social and psychological environment influence gene expression (Cole, 2014; Slavich and Cole, 2013; Cole, 2009; Toyokawa et al., 2012; Galea et al., 2011). There is ample evidence for links between gene expression in blood and three major categories of psychosocial experiences: SES (Chen et al., 2009), social relationships (Robles et al., 2018; Stanton et al., 2017; Powell et al., 2013), and emotionality (Farrell et al., 2018; Segman et al., 2010). Beyond single-gene analyses, previous studies in this area (Slavich and Cole, 2013; Cole, 2014; Cole, 2009) identified a pattern of differentially expressed genes referred to as the conserved transcriptional response to adversity (CTRA). The CTRA is characterized by increased expression of genes involved in inflammation and decreased expression of genes involved in type I interferon antiviral responses and IgG1 antibody synthesis (Fredrickson et al., 2013). However, these studies investigated a limited set of psychosocial experiences and did not resolve whether these pathways are causally linked to health outcomes or rather a consequence of disease status.

Several approaches have been developed for investigating the role of gene expression in complex trait variation (Nica et al., 2010; Marigorta et al., 2017; Võsa et al., 2018; Nica and Dermitzakis, 2008). Recently, transcriptome-wide association studies [TWAS] and other Mendelian randomization (MR) approaches have been used to integrate genetic effects on gene expression and on complex traits to establish causal links between a gene and a phenotype (Gusev et al., 2016). MR approaches have been developed in epidemiology to examine the causal effect of a modifiable exposure on disease without conducting a randomized trial. MR designs use the genotype association with the two variables of interest to control for reverse causation and confounding. Here, we use this type of approach to connect genes with complex traits. Traditionally genes are annotated to association signals in genome-wide association studies [GWAS] based on physical proximity. TWAS test for an association between gene expression and complex traits, where gene expression is predicted based on genotypes in the GWAS study and independent expression quantitative trait locus (eQTL) data. Notably this goes beyond physical proximity of GWAS signals to genes and establishes a putative mechanism linking genetic variants to complex traits through genetic regulation of gene expression. Very few studies of genetic regulation of gene expression (expression quantitative trait loci, eQTL mapping) in humans have included comprehensive information on psychosocial exposures, and no study to date has been able to determine the likelihood of a causal relationship between psychosocial experiences, gene expression, and asthma. This study aims at filling this gap by combining genetic and well-characterized psychosocial data from a cohort of children with asthma living in Metro Detroit (Figure 1a).

Figure 1. Transcriptional signatures of psychosocial experiences and asthma symptoms.

(a) Central hypothesis. (b) Number of genes in elastic net regression models that explain at least 1% of variance. Colors represent different categories of variables. (c) Pearson’s correlations between cross-validated transcriptional signatures and measured variables for elastic net regression models that explain at least 1% of variance. (d) Forced expiratory volume in one second [FEV1] percent predicted transcriptional signature model fit (Pearson’s rho = 0.76, p<0.001). (e) MacArthur socioeconomic status transcriptional signature model fit (Pearson’s rho = 0.67, p<0.001). (f) Longitudinal change in observed neutrophils (x axis) and longitudinal change in transcriptional signature of neutrophils (y axis) (Pearson’s rho = 0.72, p<0.001, gray = identity line).

Figure 1—source data 1. GLMnet model weights for the transcriptional signatures.
The transcriptional signatures are in columns, while the genes are in rows.

Figure 1.

Figure 1—figure supplement 1. Clustered heatmap of (Pearson) correlations between all variables used in the study.

Figure 1—figure supplement 1.

Color indicates the strength and direction of correlation; white indicates p-value>0.05. Hierarchical clustering is represented on the top and side of the heatmap. Variables are grouped by colors indicating the different categories considered.

Figure 1—figure supplement 2. Scatterplot of Pearson's correlation coefficients between each pair of observed variables (x axis) and metagenes (y axis) for the 19 variables with transcriptional signatures explaining >1% of observed variance.

Figure 1—figure supplement 2.

Color represents significance of the correlation (gray: nonsignificant correlations in observed variables or metagenes; blue: significant correlation between observed variables only; green: significant correlation between transcriptional signatures only; black: correlation significant in both observed variables and metagenes).

Figure 1—figure supplement 3. Gene set enrichment analysis results on genes differentially expressed for psychosocial experiences.

Figure 1—figure supplement 3.

(a) Gene Ontology, (b) Kyoto Encyclopedia of Genes and Genomes, and (c) Reactome Pathway Database. Variable symbols: cddstf: self-disclosure; cdmcfl: child-reported conflict with mother; cdres: perceived responsiveness; pincme: parent’s income; ppeqcf: parent-reported conflict with child; psesl: subjective socioeconomic status. Down denotes downregulated genes, up denotes upregulated genes.

Figure 1—figure supplement 4. Result of identity-by-descent analysis (IBD; maximum likelihood estimation [MLE]) on DNA-derived genotypes for all 251 participants.

Figure 1—figure supplement 4.

k0: probability of sharing zero alleles IBD; k1: probability of sharing one allele IBD.

Figure 1—figure supplement 5. Self-reported ethnicity (x axis) vs. percent global African ancestry (y axis) in 119 participants for whom declared ethnicity is available.

Figure 1—figure supplement 5.

Figure 1—figure supplement 6. Proportion of reads mapping to the Y chromosome over all mapped reads separately for self-reported females and males.

Figure 1—figure supplement 6.

Each dot is one sample.

Figure 1—figure supplement 7. Sources of variation in gene expression data.

Figure 1—figure supplement 7.

(a) Proportion of global variance in gene expression explained by each covariate tested within a single linear model. (b) Proportion of variance explained by each covariate for each analyzed gene.

Figure 1—figure supplement 8. Comparison of variance explained by conserved transcriptional response to adversity (CTRA)-based (x axis) and unbiased (y axis) elastic net prediction models; color indicates type of variable (red = identity line).

Figure 1—figure supplement 8.

The Asthma in the Lives of Families Today (ALOFT) project was established in 2009 to identify the behavioral and biological pathways through which family social environments impact youth with asthma. This study started during the years leading up to Detroit filing for bankruptcy in 2013 and is still ongoing. Detroit started a marked economic recovery in 2016; yet not all population groups and geographic areas have experienced it simultaneously or to the same extent. To analyze the relationship between psychosocial experiences, asthma, and transcriptional regulation, we investigated genome-wide gene expression (RNA-seq) for 251 youth participating in the ALOFT study. For 119 participants, we also collected 53 psychosocial and biological variables (Supplementary file 1a, b, Figure 1—figure supplement 1). Measures of psychosocial experiences were grouped into three subcategories, indicating SES, social relationships, and emotionality. Psychosocial experiences were captured through subjective and objective measures (e.g., negative affect assessed from daily diaries and recorded audio, respectively), as well as global and daily measures.

Results

Psychosocial factors and asthma alter the transcriptome

To denoise and impute psychosocial effects on gene expression for the entire cohort of 251 participants, we developed a new machine learning approach based on generalized linear models with elastic net regularization (GLMnet; Friedman et al., 2010) and cross-validation. Using this approach, we derived transcriptional signatures that represent the portion of the transcriptome that correlates with each psychosocial factor. Analogous methods have been adopted to define transcriptional signatures of T-cell exhaustion in aging (Alpert et al., 2019) and survival in cancer (Asgharzadeh et al., 2006), but have not been previously used for psychosocial factors. We identified significant transcriptional signatures for 31 out of 53 variables (Figure 1b–e, Figure 1—source data 1, Supplementary file 1c). We used an independent longitudinal dataset to validate the transcriptional signatures. We considered the changes in the observed variable between two time points (≥1 year) and compared it to the longitudinal changes in the transcriptional signature. Note that the transcriptional signature is imputed for the second time point from gene expression samples that are not included in the training set. We found significant correlations in the observed and imputed changes for the majority of variables (Spearman's correlation p-value<0.05; e.g., Figure 1f, Supplementary file 1d).

Transcriptional signatures of the SES measures showed a strong overlap with each other (Figure 2), suggesting that they may have very similar molecular effects or measure the same factors. However, we also saw correlations across all three variable categories. For example, subjective SES was significantly correlated with objective maternal responsiveness, family conflict, and self-reported self-disclosure, which is the extent to which the youths talk about their thoughts and feelings (r = −0.3, p=1.1 * 10−6, r = 0.29, p=4.26 * 10−6, r = 0.66, p=3 * 10−32, respectively). Overall, correlations between transcriptional signatures reflect correlations between measured variables (Figure 1—figure supplement 2), yet they are stronger between the transcriptional signatures, highlighting the denoising effect. Measured psychosocial factors were also associated with interindividual variation in gene expression for several genes. For example, perceived responsiveness and self-disclosure were associated with changes in gene expression for 143 and 3279 genes, respectively (Supplementary file 1e, f). Genes positively associated with perceived responsiveness were enriched for biological processes relative to the response to IL18, while genes positively associated with self-disclosure were enriched for the interferon 1 pathway (Figure 1—figure supplement 3).

Figure 2. Correlation among psychosocial and clinical transcriptional signatures.

Heatmap of Pearson’s correlations between transcriptional signatures explaining at least 1% of variance. Heatmap color indicates strength and direction of correlation; white indicates p-value>0.05. Hierarchical clustering of variables is represented above the heatmap, with colors indicating categories for each variable as indicated in the legend.

Figure 2.

Figure 2—figure supplement 1. Network representation of correlations between transcriptional signatures of psychosocial experiences (edge width reflects absolute value of Pearson's correlation score, edge color reflects positive [red] or negative [blue] correlation).

Figure 2—figure supplement 1.

Figure 2—figure supplement 2. Network representation of correlations between all transcriptional signatures (edge width reflects absolute value of Pearson's correlation score, edge color reflects positive [red] or negative [blue] correlation).

Figure 2—figure supplement 2.

When we correlated transcriptional signatures of asthma severity with those for psychosocial variables, we observed overlap with SES and social relationships, but not emotionality. In particular, we found significant positive correlations between the transcriptional signatures of lung function (percent-predicted FEV1) and psychosocial measures of self-disclosure (r = 0.42, p=4.9 * 10−12) and subjective SES (r = 0.36, p=3.3 * 10−9). Unexpectedly, objective maternal responsiveness was negatively correlated with lung function (r = −0.19, p=0.002), and percent unoccupied properties in the neighborhood (r = 0.41, p=7.8 * 10−12) was positively correlated with lung function. We also found that the transcriptional signature of self-disclosure was also significantly associated with other measures of asthma, such as nightly asthma symptoms (r = 0.27, p=1.2 * 10−5) and asthma severity (r = −0.32, p=2.9 * 10−7), echoing the large body of work on the importance of self-disclosure for health (Pennebaker, 1995). These results provide a potential mechanism through gene expression changes in leukocytes for previously reported links between SES and asthma symptoms (Litonjua et al., 1999; Mielck et al., 1996). Past research has also found emotionality to be a strong predictor of asthma severity (Lehrer et al., 1993).

Notably, the transcriptional signatures of blood composition were also associated with asthma symptoms, with a positive correlation for proportion of lymphocytes and negative correlation for proportion of neutrophils (Figure 2—figure supplement 1). Given the important role of several blood cell types in asthma severity and exacerbations (Mamessier et al., 2008; Vedel-Krogh et al., 2017; Bigler et al., 2017; Ray and Kolls, 2017; Lima-Matos et al., 2018; Casciano et al., 2016; Sinz et al., 2017), it is possible that transcriptional changes associated with blood composition mediated the correlations between psychosocial experiences and asthma outcomes.

Genetic interactions with psychosocial factors affect gene regulation

To directly investigate whether transcriptional signatures associated with negative psychosocial experiences contribute to inter-individual variation in asthma risk, we used eQTL mapping combined with TWAS (Gusev et al., 2016). TWAS uses eQTLs as instrumental variables (IVs) to causally link gene expression to phenotypes. To examine local genetic effects on leukocyte gene expression, we performed cis-eQTL mapping and identified 8610 genes with at least one eQTL (eGenes, 10% false discovery rate [FDR], Figure 3—source data 1). These eGenes were enriched in Genotype-Tissue Expression Project (GTEx) whole blood eGenes (Aguet et al., 2020) (Fisher’s test OR = 3.2, p-value<2.2 * 10−16), but we also identified additional 1801 eGenes that were not detected by GTEx in whole blood.

We used the method for probabilistic TWAS analysis (PTWAS) (Zhang et al., 2020), which improves upon previous TWAS methods by ensuring only strong IVs are used, and is designed to allow for validating the causality assumption (see Materials and methods). We identified 2806 eGenes in the GTEx dataset that were causally associated with asthma and allergic diseases (hay fever, eczema, and allergic rhinitis) (5% FDR). Of these, 853 were eGenes in our dataset (Figure 3—source data 3). Here, we interrogated whether these causal genetic effects can be modulated by psychosocial factors through gene-environment interactions. To examine the genotype-by-environment effects of psychosocial experiences and blood composition on gene expression, we used the imputed transcriptional signatures for the entire cohort of 251 individuals. In addition to imputing missing data, we argue that these transcriptional signatures may better capture the environmental effects on the state of the cells at the molecular level (i.e., after denoising), compared to the observed variables. This is because observed variables have high levels of noise, and the measured values may not reflect the true biological effect. Therefore, we used the predicted values for all participants, including those for whom the variables were directly measured (denoising). This is similar to the context eQTL approach (Zhernakova et al., 2017) that uses other genes as a proxy variable for the environment, but here the ‘context’ is more easily interpretable because it is defined by a transcriptional signature associated with a specific psychosocial factor. Similarly, cell-type composition imputed from gene expression was used to map cell-type interaction QTLs for 43 cell-type-tissue combinations in the GTEx v8 dataset (Kim-Hellmuth et al., 2020).

For each of the eGenes identified in our dataset, we tested the lead eQTL for an interaction effect (see Materials and methods) with any of the transcriptional signatures. We discovered 344 significant interaction eQTLs across 134 unique genes (10% FDR; Figure 3a, Figure 3—source data 4). We found interaction eQTLs for all four blood composition signatures (proportion of lymphocytes, neutrophils, monocytes, and eosinophils with 81, 65, 27, and 25 GxE interactions, respectively), which represent cell-type-specific eQTLs (57.6% of all GxE eQTLs). 101 of the 108 blood-interacting eGenes (93.5%) were also identified as genes with interaction eQTLs with cell-type composition in GTEx whole blood (Kim-Hellmuth et al., 2020). We identified 124 GxE interaction effects on gene expression with psychosocial experiences across 77 genes, including self-disclosure (48 genes), subjective SES (40 genes), and objective maternal responsiveness (16 genes) (Figure 3—source data 5). These only partially overlapped (77%) GxE effects observed for blood composition and included interactions specific to psychosocial factors (Figure 3c). To evaluate whether the interactions with psychosocial experiences may be mediated by cell composition, we repeated the GxE mapping after correcting for blood cell composition. We observed 125 significant GxE effects (10% FDR) after removing the effect of blood composition differences (Figure 3—source data 6).

Figure 3. GxE effects on gene expression and asthma risk.

(a) Interaction expression quantitative trait locus (eQTL) results. GxE genes: number of significant GxE interactions with transcriptional signatures at 10% FDR; OR: odds ratio of enrichment of GxE genes with measured variable (p<0.01) in GxE genes with transcriptional signatures (p<0.01). (b) Network of interactions between environments and eGenes. Each node represents an eGene with an interaction eQTL (black) or a variable that modulates the genetic effect on gene expression. Only nodes with at least two interactions are labeled. Edges represent significant interaction eQTLs (10% FDR). (c, d) Causal gene-complex trait interactions identified through transcriptome-wide association studies (TWAS) are modulated by psychosocial experiences. Psychosocial variables (c) or blood composition (d) are in the left column, eGenes in the central column and complex traits in the right column. A connecting line represents either a causal link between eGene and asthma or allergic disease trait identified through TWAS (middle to right) or a significant interaction eQTL (left to middle). (e, f) Examples of genes causally associated with asthma and with GxE effects that modulate genetic risk. Both genes are causally associated with asthma in TWAS. Each dot is an individual. The same data are presented in the inset and main figure within each panel. In the main figure, the trend lines represent the best model fit between the psychosocial variable and gene expression for each genotype class. The slope of each line and the q-value for the GxE effect are also reported. The boxplot in the inset represents the same normalized gene expression data across the three tertiles of the psychosocial variable.

Figure 3—source data 1. Results of cis-expression quantitative trait locus (cis-eQTL) mapping permutation pass with FastQTL correcting for top 18 gene expression principal components [PCs] (space-delimited file).
Columns are as follows. 1: Ensembl ID of the tested gene; 2: number of variants tested in cis for this gene; 3: maximum likelihood estimation (MLE) of the shape1 parameter of the beta distribution; 4: MLE of the shape2 parameter of the beta distribution; 5: dummy; 6: ID of the best variant found for this molecular phenotypes (i.e., with the smallest p-value); 7: distance between the molecular phenotype-variant pair; 8: the nominal p-value of association that quantifies how significant from 0, the regression coefficient is; 9: the slope associated with the nominal p-value of association; 10: a first permutation p-value directly obtained from the permutations with the direct method (corrected nominal p-value that accounts for the fact that multiple variants are tested per molecular phenotype); 11: a second permutation p-value obtained via beta approximation (used to calculate FDR).
Figure 3—source data 2. Results of cis-expression quantitative trait locus (cis-eQTL) mapping permutation pass with FastQTL without correction for gene expression principal components [PCs] (space-delimited file).
Columns are as follows. 1: Ensembl ID of the tested gene; 2: number of variants tested in cis for this gene; 3: maximum likelihood estimation (MLE) of the shape1 parameter of the beta distribution; 4: MLE of the shape2 parameter of the beta distribution; 5: dummy; 6: ID of the best variant found for this molecular phenotypes (i.e., with the smallest p-value); 7: distance between the molecular phenotype-variant pair; 8: the nominal p-value of association that quantifies how significant from 0, the regression coefficient is; 9: the slope associated with the nominal p-value of association; 10: a first permutation p-value directly obtained from the permutations with the direct method (corrected nominal p-value that accounts for the fact that multiple variants are tested per molecular phenotype); 11: a second permutation p-value obtained via beta approximation (used to calculate FDR).
Figure 3—source data 3. Overlap of eGenes with psychosocial effects and significant probabilistic transcriptome-wide association studies (PTWAS) association results (5% FDR) for asthma and allergic disease.
Columns are as follows. 1: GWAS trait tested in PTWAS; 2: Ensembl ID of the tested gene; 3: most significant tissue in PTWAS; 4: multitissue PTWAS p-value; 5–24: as in Figure 3—source data 4.
Figure 3—source data 4. Results of cis-interaction-expression quantitative trait locus (eQTL) mapping (tab-delimited file).
Columns are as follows. 1: Ensembl ID of the tested gene; 2: ID of the tested variant; 3: p-value of the GxE model intercept; 4: p-value of the genotype dosage effect; 5: p-value of the transcriptional signature effect; 6: p-value of the interaction between genotype dosage and transcriptional signature effect; 7: intercept; 8: dosage effect; 9: transcriptional signature effect; 10: interaction between genotype dosage and transcriptional signature effect; 11: intercept standard error (SE); 12: genotype dosage effect SE; 13: transcriptional signature effect SE; 14: interaction between genotype dosage and transcriptional signature effect SE; 15: q-value of the GxE model intercept; 16: q-value of the genotype dosage effect; 17: q-value of the transcriptional signature effect; 18: q-value of the interaction between genotype dosage and transcriptional signature effect; 19: tested transcriptional signature; 20: permutation-corrected p-value of the interaction between genotype dosage and transcriptional signature effect; 21: q-value of the permutation-corrected p-value of the interaction between genotype dosage and transcriptional signature effect.
Figure 3—source data 5. Overlap of GxE interactions and significant probabilistic transcriptome-wide association studies (PTWAS) association results (5% FDR).
Figure 3—source data 6. Results of cis-interaction-expression quantitative trait locus (eQTL) mapping after correcting for cell composition effects (tab-delimited file).
Figure 3—source data 7. Results of cis-interaction-expression quantitative trait locus (eQTL) mapping with measured variables (tab-delimited file).

Figure 3.

Figure 3—figure supplement 1. QQplots of expression quantitative trait locus (eQTL)-transcriptional signature interaction test permutation-corrected p-values, combined for blood composition (top) and psychosocial variables (bottom).

Figure 3—figure supplement 1.

Color indicates whether the equivalent interaction test with observed variable was nominally significant (p-corrected <0.01, coral) or not significant (p-corrected >0.01, black).
Figure 3—figure supplement 2. Genetic variants interact with psychosocial environments to alter expression of genes linked to asthma and allergic disease.

Figure 3—figure supplement 2.

Scatterplots depict the following. (a) Self-disclosure interacts with expression quantitative trait locus (eQTL) rs6458333:G:C to alter expression of SRF. (b) Self-disclosure interacts with eQTL rs7203263:C:G to alter expression of NLRC5. (c) Self-disclosure interacts with eQTL rs12922757:G:A to alter expression of GAS8. (d) Self-disclosure interacts with eQTL rs5015567:G:A to alter expression of PFKFB3. (e) Self-disclosure interacts with eQTL rs200494:T:G to alter expression of RP5-874C20.3. (f) Objective maternal responsiveness interacts with eQTL rs17713729:A:C to alter expression of SH3YL1. (g) Objective maternal responsiveness interacts with eQTL rs6458333:G:C to alter expression of SRF. (h) Objective maternal responsiveness interacts with eQTL rs7203263:C:G to alter expression of NLRC5. (i) Objective maternal responsiveness interacts with eQTL rs5015567:G:A to alter expression of PFKFB3. (j) Objective maternal responsiveness interacts with eQTL rs9269774:G:A to alter expression of HLA-DRB9. (k) Socioeconomic status interacts with eQTL rs12424772:A:T to alter expression of SLC6A12. (l) Socioeconomic status interacts with eQTL rs6458333:G:C to alter expression of SRF. (m) Socioeconomic status interacts with eQTL rs7788412:A:G to alter expression of CCM2. (n) Socioeconomic status interacts with eQTL rs36123367:AT:A to alter expression of RNF185. (o) Socioeconomic status interacts with eQTL rs5015567:G:A to alter expression of PFKFB3. (p) Socioeconomic status interacts with eQTL rs12194992:G:A to alter expression of RP3-525N10.2. (q) Percent houses unoccupied interacts with eQTL rs5015567:G:A to alter expression of PFKFB3. Colors signify the genotype class. Green: homozygous reference; purple: heterozygous; orange: homozygous alternate.
Figure 3—figure supplement 3. Genetic variants interact with psychosocial environments to alter expression of genes linked to asthma and allergic disease.

Figure 3—figure supplement 3.

Boxplots depict the following. (a) Self-disclosure interacts with expression quantitative trait locus (eQTL) rs6458333:G:C to alter expression of SRF. (b) Self-disclosure interacts with eQTL rs7203263:C:G to alter expression of NLRC5. (c) Self-disclosure interacts with eQTL rs12922757:G:A to alter expression of GAS8. (d) Self-disclosure interacts with eQTL rs5015567:G:A to alter expression of PFKFB3. (e) Self-disclosure interacts with eQTL rs200494:T:G to alter expression of RP5-874C20.3. (f) Objective maternal responsiveness interacts with eQTL rs17713729:A:C to alter expression of SH3YL1. (g) Objective maternal responsiveness interacts with eQTL rs6458333:G:C to alter expression of SRF. (h) Objective maternal responsiveness interacts with eQTL rs7203263:C:G to alter expression of NLRC5. (i) Objective maternal responsiveness interacts with eQTL rs5015567:G:A to alter expression of PFKFB3. (j) Objective maternal responsiveness interacts with eQTL rs9269774:G:A to alter expression of HLA-DRB9. (k) Socioeconomic status interact with eQTL rs12424772:A:T to alter expression of SLC6A12. (l) Socioeconomic status interacts with eQTL rs6458333:G:C to alter expression of SRF. (m) Socioeconomic status interacts with eQTL rs7788412:A:G to alter expression of CCM2. (n) Socioeconomic status interacts with eQTL rs36123367:AT:A to alter expression of RNF185. (o) Socioeconomic status interacts with eQTL rs5015567:G:A to alter expression of PFKFB3. (p) Socioeconomic status interacts with eQTL rs12194992:G:A to alter expression of RP3-525N10.2. (q) Percent houses unoccupied interacts with eQTL rs5015567:G:A to alter expression of PFKFB3. X axis specifies the number of alternate alleles. Colors signify the genotype class. Green: homozygous reference; purple: heterozygous; orange: homozygous alternate.
Figure 3—figure supplement 4. Causal gene-complex trait interactions identified through transcriptome-wide association studies (TWAS) are modulated by psychosocial experiences.

Figure 3—figure supplement 4.

Psychosocial and biological variables are in the left column, eGenes in the central column, and complex traits in the right column. A connecting line represents either a causal link between eGene and trait identified through TWAS (middle to right) or a significant interaction expression quantitative trait locus (eQTL) (left to middle). Red represents blood composition variables, orange represents asthma variables, and green represents psychosocial variables.
Figure 3—figure supplement 5. QQplots of interaction expression quantitative trait locus (eQTL) mapping test p-values (black), permutation p-values (green), and permutation-corrected p-values (pink).

Figure 3—figure supplement 5.

To validate these GxE results, we expanded the sample size for all variables with a transcriptional signature. We found that the GxE eQTLs detected with the measured variables (Figure 3—source data 7) were significantly enriched for low p-values in the GxE eQTLs detected with the transcriptional signatures (Figure 3A, Figure 3—figure supplement 1), and the interaction effects were highly significantly correlated (Supplementary file 1g).

Next, we explored the overlap between the GxE genes and previously published datasets that measured interactions with different environments (N = 134 genes, see Materials and methods and Supplementary text). We found that 94.8% of our GxE genes replicated in other datasets of GxE in gene expression (p<0.05). For example, 62 interaction eGenes for psychosocial experiences overlapped with interaction eGenes in response to pathogens (Lee et al., 2014; Barreiro et al., 2012; Çalışkan et al., 2015; Nédélec et al., 2016). This result may indicate that negative psychosocial experiences lead to genotype-specific adverse health effects by influencing the same immune pathways activated by infections. Furthermore, psychosocial experiences may modify the individual response to pathogens and affect health outcomes.

Risk for asthma is modulated by GxE

We hypothesized that genes and pathways that contribute to asthma risk are also involved in asthma symptom severity through gene regulatory variation in immune cells. We investigated whether genes associated with risk for asthma were modulated through psychosocial experiences (E) and/or GxE effects. Among the genes causally linked to asthma or allergic disease risk by PTWAS, expression of 169 genes was modulated by psychosocial environments, including self-disclosure (125 genes), subjective SES (105 genes), family conflict (30 genes), percent unoccupied houses in the neighborhood (27 genes), maternal responsiveness (17 genes), objective negative affect (e.g., feeling sad or angry, 11 genes), child-reported conflict with parent (9 genes) and percent ≥fair houses in the neighborhood (2 genes) (Figure 3—source data 5). The genetic effect on gene expression is modulated by psychosocial factors through GxE for seven genes causally implicated in asthma (four genes) and allergic diseases (four genes) (Figure 3c–f, Figure 3—figure supplements 24). For example, higher expression of the Exocyst Complex Component three gene (EXOC3) is associated with an increased risk of asthma. We found that self-disclosure, which is the extent to which the youths talk about their thoughts and feelings, increases expression of this gene only for individuals carrying at least one copy of the T allele at rs5865330 (Figure 3e). The genetic effect was even more pronounced in the highest tertile of self-disclosure (Figure 3e, inset). Lower expression of the Growth Arrest Specific 8 gene (GAS8) is associated with an increased risk of asthma. The A allele at rs12922757 increases expression of this gene only in individuals with perceived high SES, thus reducing the risk of disease (Figure 3f). A similar effect, and in the same direction, is found for GAS8 and higher self-disclosure (Figure 3—figure supplements 2 and 3).

Discussion

In this study, we collected a unique dataset with genome-wide gene expression paired with extensive and accurate assessment of each participant's biological and psychosocial functioning, across a variety of domains known or likely to be relevant for asthma. We developed a new approach to denoise and impute the transcriptional signatures of asthma symptoms and psychosocial experiences in peripheral blood. Longitudinal data collected on the same individuals validated the transcriptional signatures imputed on an unobserved later time point, mirroring the changes on phenotype. This demonstrates that the molecular signature of psychosocial experiences on immune cells can track changes over time and can be used to analyze cohorts where these variables are not available.

We showed overlap between transcriptional signatures of asthma symptoms and both SES and social relationships, thereby demonstrating that molecular blood gene expression pathways exist through which psychosocial experiences can affect asthma. While some of the variables used to describe pulmonary function may not directly reflect a conventional clinical endpoint, the fact that they were associated with significant transcriptional signatures indicates that they may still have pathophysiological relevance for individuals with asthma. For example, we found a significant transcriptional signature for FEV1, which is endorsed by the National Asthma Education and Prevention Program as a means for grading asthma severity (Birnkrant, 1997). Interestingly, we detected a higher number of significant correlations across imputed variables compared to the measured variables. This observation supports the effectiveness of the denoising procedure that we used to define transcriptional signatures.

Correlations between asthma phenotypes and psychosocial variables may be partially due to changes in blood compositions. Correlations between asthma severity and blood composition are supported by previous findings from the U-BIOPRED cohort where the number of genes differentially expressed between individuals with severe asthma and healthy controls was reduced by 90% after accounting for blood cell composition (Bigler et al., 2017). This is not surprising as several partially overlapping endotypes of asthma have been described to date, distinguished by pro-inflammatory contributions from different immune cell types. The most common asthma subtype is characterized by the involvement of T helper type 2 cells (Th2) sensitized primarily to allergens and subsequent eosinophilic airway inflammation triggered by the type two cytokines (particularly Il-5) (Woodruff et al., 2009). However, in non-allergic individuals, eosinophilic inflammation may be triggered by other immune cell types (Brusselle et al., 2013). Elevated levels of neutrophils have been associated with more severe asthma and suggested as an alternative mechanism to eosinophilic inflammation (Ray and Kolls, 2017). Further work to dissect the contributions of each cell type can be accomplished in future studies with single-cell transcriptomics.

Genes sets associated with psychosocial variables were enriched for different Gene Ontology functions and pathways. For example, self-disclosure is associated with genes distinctly enriched for neutrophil-mediated immunity (Figure 1—figure supplement 3), while parent-reported conflict with child is associated with expression of genes enriched for erythrocyte differentiation. This result suggests that response to negative psychosocial experiences involves processes outside of the scope of the CTRA, which was designed to capture inflammation, antibody production, and type I interferon response. Psychosocial experiences change over time because of the children’s development as a result of broader changes in the urban environments and as a consequence of shifts in family dynamics. These changes of the psychosocial experiences are reflected longitudinally in the gene expression of immune cells and may modify the asthma symptoms and overall health. As we gain additional knowledge on the mechanisms connecting psychosocial experiences to disease, these results can be useful to support the need for social interventions that may ultimately lead to improved overall health. For example, family counseling may improve the psychosocial environment of children with asthma, ameliorating their symptoms, and reducing the impact of systemic health disparities. These social interventions may be implemented independently or together with drug treatments, and their impact could be further monitored through gene expression with larger longitudinal samples in future studies.

Pioneering work by our group and others has shown that environmental effects on gene expression and their interactions with genetic factors can play a very important role in regulating genes that are associated with disease (Moyerbrailean et al., 2016; Richards et al., 2019; Findley et al., 2019; Knowles et al., 2017; Nédélec et al., 2016; Zhernakova et al., 2017). This is also applicable to asthma genetics (Rava et al., 2015) and exposure to rhinovirus infection (Çalışkan et al., 2015) and cytokines (Thompson et al., 2020).

Our study demonstrates that many psychosocial experiences leave an impact on gene expression, including genes that are known to be associated with asthma. When considering genetic effects on gene expression, 21% of the 8610 eQTL we discovered were not found in GTEx whole blood samples. These newly identified genetic effects may be due to limited power when performing an overlap between results from different eQTL studies. An alternative explanation is that we captured context-specific genetic effects that are due to differences between our cohort and GTEx samples in cell-type composition, ancestry, age, psychosocial environment, and/or the asthma status. Indeed, our GxE analysis identifies 344 instances of eGenes with context-specific genetic regulation of gene expression, including 124 instances of GxE with psychosocial experiences.

One outstanding challenge in human complex trait genetics focuses on the portability of polygenic risk scores across population groups or environments (Mostafavi et al., 2020). The 124 instances of GxE with psychosocial experiences are particularly relevant when evaluating a polygenic risk score for asthma phenotypes. For example, the individual contribution of a gene can be modified by SES in one direction and in a different direction for a different gene.

Here, we show that these altered gene expression immune profiles may in turn exacerbate asthma symptoms in children living in inner cities, who are exposed to riskier psychosocial environments. Using human genetics tools, we established that psychosocial factors can modulate the causal genetic effects between gene expression and asthma. Importantly, our results demonstrate that psychosocial factors, such as self-disclosure and SES, modulate genetic risk of asthma and other allergic diseases through altered peripheral blood gene expression.

Materials and methods

Study participants

Participants were included from an ongoing longitudinal study, ALOFT ( recruited from November 2010 to July 2018, Wayne State University Institutional Review Board approval #0412110B3F). The ALOFT study investigates the links between family dynamics, biological changes, and asthma morbidity among youth from the Detroit metropolitan area. Participants were recruited from local area hospitals and schools (for recruitment details, see Supplemental text). To be included in the study, youth were required to be between 10 and 15 years of age at the time of recruitment and diagnosed with at least mild to persistent asthma by a physician (with diagnosis confirmed from medical records). Youth were screened for medical conditions and medications that might affect asthma and associated biological markers. Only one participant reported current oral corticosteroid use. The full sample included 297 youth and their primary caregivers (typically mothers, referred to as ‘parent’ below). However, only youth with valid gene expression data were included in this investigation. Thus, the sample comprised 251 youth (148 boys and 103 girls), whose average age was 12.89 years old (sd = 1.77 years), and at least one parent. Psychosocial and biological variables, including asthma measures, were available for a subset of 119 participants. For a subset of up to 103 participants, we have collected longitudinal data (either 1- or 2 year follow-up), which we used to validate the transcriptional signatures. For cross-sectional and longitudinal sample sizes for each variable, refer to Supplementary file 1b.

Participant recruitment and collection of psychosocial and biological variables

The parent completed a telephone screening interview to determine eligibility in the study. Written assent and consent were obtained from the participating youth and their parent, respectively.

In-lab assessments

The participating youth and parent visited the laboratory, where they completed background questionnaires on a computer and individual interviews assessing stress and asthma management. The parent reported demographics, including their annual income and education level, and completed measures of subjective SES (the McArthur ladder [Adler et al., 2000], adjusted so that 1 corresponds to the lowest status and 10 to the highest), neighborhood stress (Ewart and Suchday, 2002), conflict with their child (the Parental Environment Questionnaire; Elkins et al., 1997), and depressive symptoms (the CES-D; Radloff, 1977). The zip code for each family was also collected and used to retrieve objective measures of neighborhood quality based on census block data from 2010 and Data Driven Detroit (collected in 2009), including the percentage of houses rated as fair in quality or better, percentage of houses currently unoccupied, and the percent of people in that area living below the poverty line (Zilioli et al., 2017).

At the same time, youth reported on demographics, warmth received from their mother (Parental Behavior Inventory; Schaefer, 1965), conflict with their mother, the quality of their family environment (the Risky Families Questionnaire; Felitti et al., 1998; Taylor et al., 2004), depressive symptoms (the Child Depression Inventory; Kovacs, 2011), and the frequency and severity of their asthma symptoms (the Teen Asthma History). Youth also reported on their parents smoking inside the household. However, due to low prevalence as well as uncertainty on whether the parents were present in the household during the 4 days of data collection, we decided to not use this information in our analyses. They also completed a spirometry test using the nSpire Health KoKo PFT to obtain the following pulmonary measures: FEV1 percent predicted, FVC percent predicted, FEV1/FVC percent predicted. Also at this visit, the youth and parent were given detailed instructions regarding a 4-day daily assessment period. The laboratory visit lasted approximately 2 hr.

In-home assessments

For 4 days following the laboratory visit (two weekdays and two weekend days), youth and their parent completed daily assessments. Both youth and their parent completed daily diaries each evening about their experiences throughout their day, and sleep diaries each morning about the quality of their sleep. Daily diaries contained items assessing their positive (i.e., happy, interested, excited, and proud) and negative (i.e., sad, angry, upset, worried, distressed) affect, and how much affection and conflict they witnessed between their parents. Youth were also asked to think about the most important and meaningful conversation they had with someone that day and the extent to which they talked about their thoughts and feelings during that conversation (to measure self-disclosure), and how understanding, validating, and caring their conversation partner was (to measure perceived responsiveness). The sleep diaries contained the Pittsburgh Sleep Scale (Monk et al., 1994), which assesses sleep latency (how long to fall asleep), sleep efficiency (how much time in bed spent sleeping), the number of awakenings throughout the night, the total duration of sleep in hours, and the quality of the sleep. Through the daily and sleep diaries, the participants provided information on the following measures of asthma: severity and frequency of daily and nightly asthma symptoms (wheezing, shortness of breath, coughing, chest tightness, other) and nightly inhaler use. Description of daily diary and sleep diary items used in this investigation is included in Supplementary file 2. When youth completed the daily and sleep diaries (i.e., at awakening and before bed), they used a peak flow meter twice to measure peak flow, with the best score between the two assessments used as our measure of morning and evening peak flow. Only daily and sleep diary reports from youth are used in this investigation. Additionally, youth provided four samples of saliva daily for 4 days at wakeup, 30 min after wakeup, before dinner, and immediately before bed using passive drool methods. Sample time was recorded by participant report, time stamps, and MEMS 6 TrackCap monitors (Aardex Ltd., Switzerland). Samples were initially stored in participants’ refrigerators, but upon return to the lab, saliva samples were stored in the laboratory refrigerator at −20°C until assayed. To reduce positive skewness, we natural log transformed the cortisol values (raw cortisol +1). Hierarchical linear models were run in HLM to extract the average diurnal cortisol intercept, slope, and cortisol awakening response (CAR) for each participant. Finally, participants wore the Electronically Activated Recorder (EAR) in their front pocket or in a belt clip provided from the time they woke up until bedtime. The EAR captured 50 s of sound every 9 min (Mehl et al., 2001). EAR data were coded by trained coders using the Everyday Child Home Observation (ECHO) coding system (Tobin et al., 2015). Specifically, for this investigation, we use codes of wheezing, positive affect (i.e., happy, interested, excited), negative affect (i.e., sadness, anger, upset, worry, distress), maternal responsiveness (i.e., how much the mother expresses pride, support, and warmth towards the youth), and family conflict (i.e., whether an argument, conflict, fight, or yelling was overheard). Scores for each EAR-observed behavior reflect a mean of the total recordings in which the behavior was observed during waking hours. After completion of the in-home assessment period, the participants returned study materials and the EAR. Youth and parents were compensated for their time.

Additional details on the measures collected in-home are provided in Supplementary file 2. Descriptive and reliability statistics can be found in Supplementary file 1b. Correlations between measures can be found in Figure 1—figure supplement 1.

Biological sample collection

Following the daily assessment period, a peripheral blood draw was conducted for each youth participant. Each youth provided 16, 4, and 8 ml of peripheral blood collected into Vacutainer Cell Preparation Tubes (Becton Dickinson and Co., East Rutherford, NJ) for PBMC (FICOLL gradient vacutainers), DNA (sodium citrate vacutainer, Fisher Scientific catalog #BD-366415), and RNA (EDTA vacutainer) extraction, respectively. Peripheral blood mononuclear cells (PBMCs) were extracted from this sample, as previously described (Weckle et al., 2015). All PBMC samples were phenotyped for glucocorticoid (GC) resistance in an established in vitro assay (Marin et al., 2009) measuring the levels of IL-5, IL-13, and IFN-γ in the supernatant (Quantikine ELISA D5000B, D1300B, and DIF-50, R&D Systems, Minneapolis, MN). Specifically, PBMCs cultured in RPMI-1640 solution (Life Technologies, Carlsbad, CA) supplemented with 10% FBS (Life Technologies) and 2% HEPES (Sigma-Aldrich, St. Louis, MO) were stimulated for 48 hr with PMA + ionomycin (phorbolmyristate acetate 25 ng/ml, Fisher Scientific, Hanover, IL; ionomycin calcium salt, 1 μg/ml, Sigma-Aldrich) and treated with hydrocortisone (28 nmol/l, Sigma-Aldrich) or vehicle control. GC resistance was calculated as log-fold change of cytokine level in hydrocortisone condition over control and averaged over two replicates. DNA was extracted using DNA Blood Mini Kit (Qiagen, Germantown, MD), and RNA was extracted using LeukoLOCK Total RNA Isolation System (Thermo Fisher Scientific, Waltham, MA).

Each of the aforementioned measures were collected annually for a period of 2 years (three data collection waves) from participants who provided continued informed consent. In this study, we used both cytokine levels in stimulated PBMCs (e.g., IL5 stimulated), as well as log-fold change in cytokine levels between stimulated and stimulated + glucocorticoid-treated condition (e.g., IL13 GC resistance).

Genotype data

All individuals in this study were genotyped from low-coverage (~0.4×) whole-genome sequencing and imputed to 37.5M variants using the 1000 Genomes database by Gencove (New York, NY). The genotype accuracy at this sequencing depth has 98.22% positive agreement and 99.82% negative agreement compared to genotyping arrays (Wasik et al., 2021; Li et al., 2021). These data were also used for sample quality control (see Ancestry QC, Sex QC and Genotype QC) and to calculate the top three principal components [PCs] to use as covariates in all statistical analyses.

Genotype QC

To detect potential sample swaps that may have occurred in sample processing or library preparation, we compared genotypes of RNA and DNA samples from all individuals. We used samtools mpileup function to obtain genotypes from each individual’s RNA-seq bam files for NCBI dbSNP Build 144 variants and kept only variants with more than 40 reads coverage. We used bcftools gtcheck function to compare genotype calls across all biallelic SNPs in all DNA and RNA samples. RNA samples that failed to cluster with their respective DNA sample were repeated (library preparation and sequencing). If the discrepancy was not resolved, these samples were excluded from the analysis. A total of 251 samples passed this QC filter. Ultimately, the pairwise discordance rate between genotype calls from RNA and their respective DNA samples from the same individual ranged between 0.03 and 0.12. In contrast, the pairwise discordance rate between all the other unrelated samples ranged from 0.20 to 0.33. This rate is not a direct measure of genotype call accuracy, but it is useful to identify possible sample swaps and mismatches.

We also used the DNA-derived genotype information to confirm none of the participants were related. We performed identity-by-descent (IBD) analysis by maximum likelihood estimation (MLE) using the R package SNPRelate (version 1.16.0). As input we used random 1500 SNPs passing the following criteria: minor allele frequency (MAF) >0.05, missing rate <0.05, LD threshold <0.2 (Figure 1—figure supplement 4).

Ancestry and sex QC

For the individuals for whom the data was available, we plotted self-reported ethnicity against percent global African ancestry defined as the sum of West, East, Central, and North African global genetic ancestries calculated by Gencove (Figure 1—figure supplement 5). All samples were in agreement with self-reported ethnicity. Three participants who identified as multiracial were found to be of admixed African and European ancestry based on genotype analysis provided by Gencove. To check consistency of self-reported sex against genetic data, we plotted fraction of reads mapping to the Y chromosome for all samples. We noted a clear separation between the sexes with no outliers (Figure 1—figure supplement 6).

RNA-seq data collection and preprocessing

Total RNA was extracted using LeukoLOCK (Thermo Fisher), which captures total RNA from neutrophils, eosinophils, basophils, monocytes, and lymphocytes, and preserved at −80°C. All RNA samples had a RNA Integrity Number (RIN) of at least 6 measured on Agilent Bioanalyzer. Library preparation was performed in batches of up to 96 samples (with multiple samples from the same participant always processed within the same batch) on 1–4 μg total RNA, per standard Illumina TruSeq Stranded mRNA library preparation protocol, and sequenced on Illumina NextSeq500 to a depth of 21 million (M) to 76M reads, mean 41M reads (150 bp paired-end). HISAT2 (Kim et al., 2015) was used to align demultiplexed reads to the human genome version ‘GRCh37_snp_tran’, which considers splicing and common genetic variants. Aligned and cleaned (deduplicated) reads were counted using HTSeq and GRCh37.75 transcriptome assembly across 63,677 genes. Post-sequencing quality control included removal of samples with excess PCR duplicate rate (>60%) and genotype QC check against respective DNA sample. For all gene expression analyses, genes on sex chromosomes and genes with expression below 6 reads or 0.1 counts per million in at least 20% of samples were dropped. The final RNA-seq dataset consists of 251 unique samples and 18,904 genes.

Differential gene expression analysis

We used DESeq2 v1.22.1 (Love et al., 2014) to test for differential gene expression across the 23 psychosocial experiences using a likelihood ratio test (LRT) in 119 individuals from the first wave of data collection. To adjust for potential confounders, we included as covariates the three top PCs of a matrix of possible confounders that included RIN, site of RNA extraction, library preparation batch, percent reads mapping to exons, percent non-duplicate reads, age, sex, height, weight, top three genotype PCs, and the four transcriptional signatures of blood composition (Figure 1—figure supplement 7), except when testing for the effect of blood composition differences on gene expression where we have not included transcriptional signatures of blood composition when calculating PCs of potential confounders. Many of these confounders are very correlated, and the three top PCs explained 99.7% of their variance. Supplementary file 1h represents correlations between individual covariates and the three top PCs of the covariate matrix. In short, PC1–PC3 largely represent weight, height, and age, respectively. For each tested variable, the LRT is then used to compare between two models: GE ~ cvPC1+cvPC2+cvPC3+tested_variable (full model) and GE ~ cvPC1+cvPC2+cvPC3 (reduced model). To control for FDR, we used the default independent filtering step and multiple test correction implemented in DESeq2. Supplementary file 1e lists differentially expressed genes at 10% FDR, while Supplementary file 1f contains the full results of the analysis. Supplementary file 1i lists differentially expressed genes for blood composition measures at 10% FDR, while Supplementary file 1j contains full DESeq results for this analysis.

GO and pathway enrichment analyses

We used the R package clusterProfiler (Yu et al., 2012) to run GO, KEGG, and REACTOME enrichment analyses (hypergeometric test) across genes upregulated and downregulated compared to the background of all expressed genes (Figure 1—figure supplement 3). Enriched categories were defined at 5% FDR.

Imputation and denoising of transcriptional signatures

We assume that the observed variables have high levels of noise and the measured values do not reflect the true biological effects. Therefore, we used the predicted values for all participants, including those for whom the variables were directly measured (denoising). We developed an approach to impute and denoise a transcriptional signature for psychosocial, environmental, and other phenotypic variables based on Generalized Linear Models with Elastic-Net Regularization. First, we normalized the count data using the voom function in the limma v3.38.3 package in R (Law et al., 2014). Second, we regressed out the following confounding factors: RIN, percent reads mapping to exons, percent non-duplicate reads, site of RNA extraction, library preparation batch, sample collection wave, age, sex, height, weight, genotype PC1, genotype PC2, and genotype PC3. Third, we used the R package glmnet v2.0–16 in R/3.5.2 (Gaussian model), with a relaxed alpha = 0.1 to allow for highly co-regulated genes to be included in the prediction model.

The predicted values are imputed based on the generalized linear models with penalized maximum likelihood built using glmnet for each variable separately, according to the general model:

PhenotypeorEnvironment=intercept+β1E(gene1)+β2E(gene2)++βnE(genen), (1)

where E (genen) is normalized expression of gene n, βn is its estimated coefficient, and n is minimized via penalized maximum likelihood with elastic-net mixing parameter α set to 0.1 (0 representing ridge regression, 1 representing lasso regression).

The best fit for the model was used to predict, for both observed and unobserved samples, the biological impact on gene expression of the relevant variable. We did this because the measurement error for the observed variables will also not be uniform as some individuals may not respond accurately or truthfully (e.g., to self-disclosure questions) to all questions. If the objective was to estimate the measurement including its biases and errors, the ‘technical’ variance for the observed variable would be smaller if we were to use the observed values. However, we use the denoised/imputed values, where the ‘biological’ variance, or the error between the fitted value of the signature with respect to the true unobserved biological impact, would be the same for both the measured and non-measured individuals. Overall, our procedure should not create any biases but rather decrease the variability of the imputed/denoised variables, thus reducing the chance of false-positive GxE.

Leave-one-out cross-validation was used to evaluate the best models. We used the cross-validated mean square error (MSE) metric and its standard deviation to evaluate which signatures were more predictive. We calculated the R2 for each of the models based on the % MSE reduction from cross-validation. To compare the results that would be achievable with the CTRA-based approach, we used the same method but we limited the molecular signature to only include the 53 genes that are used to calculate the CTRA score (Fredrickson et al., 2013). 48 of the 53 genes comprising the CTRA are measurable in our sample (CTRA genes below detection: IL1A, IFIT1L, IFITM5, IFNB1, IGLL3). We compared the fraction of variance explained between the CTRA-based and unrestricted models (Figure 1—figure supplement 8).

Correlation between transcriptional signatures

We used Pearson’s correlations to evaluate overlap between transcriptional signatures of variables explaining at least 1% of variance imputed on the entire cohort of 251 participants as in Equation (1). We only considered correlations with p-value<0.05.

Longitudinal replication

We collected a second time point (approximately 1 or 2 years after the time point used in current analyses) for a subset of 13 variables – subjective SES, self-disclosure, YR parent-child conflict, stimulated IL5, IL13 GC resistance, eosinophils, lymphocytes, monocytes, neutrophils, FEV1 percent predicted, nightly asthma symptoms, nightly inhaler use, and asthma severity – to validate the transcriptional signatures. We considered the longitudinal changes in the transcriptional signatures imputed from the new gene expression data and compared them to the changes in the observed variable between the two time points. Note that the transcriptional signature is imputed for the second time point from gene expression samples that are not included in the training set. We used Spearman's correlation to compare the changes from the imputed transcriptional signature to those directly observed.

cis-eQTL mapping

We calculated gene expression residuals (as in the imputation and denoising approach) and then used FastQTL (Ongen et al., 2016) with adaptive permutations (1000-10,000). For each gene, we tested all genetic variants within 1 Mb of the transcription start site (TSS) and with cohort MAF > 0.1, for a total of 17,679 genes and 82,679,170 variant-gene pairs tested. We optimized the number of gene expression PCs in the model to maximize the number of eGenes. The model that yielded the largest number of eGenes included 18 gene expression PCs (Figure 3—source data 1).

Interaction eQTL mapping

To identify interaction eQTLs, we considered the lead eQTL for each of the eGenes identified at 10% FDR by FastQTL (without correcting for gene expression principal components). This is similar to what was done by GTEx (Kim-Hellmuth et al., 2020) and others (Alasoo et al., 2018; Kim-Hellmuth et al., 2017), and equivalent to a very conservative pruning of all SNPs in the entire cis-association region. We did not correct for gene expression principal components because some of them are correlated with cell composition and the environmental variables, thus complicating the interpretation of the linear model. To reduce impact of potential outliers, we quantile-normalized each transcriptional signature prior to GxE testing. We fit a linear model that includes both the genotype dosage and the marginal environmental effect as well as their interaction: Expression ~dosage + transcriptional signature +dosage*transcriptional signature. To fit this model, we used the lm function in R-3.5.2. We generated an empirical null distribution of 100 million permuted p-values to correct the interaction p-values (Figure 3—figure supplement 4). The empirical null distribution was obtained through multiple runs of the model for each tested transcriptional signature-gene pair while permuting the genotype dosages. Storey's q-value method to control for FDR was applied on the permutation-corrected p-values for all tests within each transcriptional signature separately.

To ensure the signal detected was not solely due to cell composition differences, we repeated the GxE eQTL mapping procedure as above, while correcting for four signatures of cell composition using the following model: Expression ~ eosinophils + leukocytes+monocytes + neutrophils+dosage + transcriptional signature +dosage*transcriptional signature. Figure 3—source data 6 contains full results of this analysis.

Interaction eQTL mapping with measured variables was performed the same way as with transcriptional signatures, except sample size was limited by the available data (Figure 3—source data 7).

Replication analysis of GxE

We calculated the enrichment of GxE genes from the measured variables (p-value<0.01) in the set of GxE genes from the transcriptional signatures (p-value<0.01). To this end, we performed Fisher's exact tests on 2 × 2 contingency tables indicating whether a gene had a GxE eQTL with the measured variable (yes/no) and with the corresponding transcriptional signature (yes/no). Additionally, we calculated the correlation of the standardized interaction effect size (z-score) for each gene obtained when considering measured variables and corresponding transcriptional signatures.

To validate our GxE results, we considered the following GxE studies for which full interaction testing results are available (Barreiro et al., 2012; Lee et al., 2014; Çalışkan et al., 2015; Nédélec et al., 2016; Moyerbrailean et al., 2016; Kim-Hellmuth et al., 2020). We show numbers of our GxE eGenes (FDR < 10%) that replicated in other studies (p-value<0.05).

TWAS analyses

To directly investigate whether discovered effects on gene expression and GxE interactions may contribute to asthma, allergic disease risk, and/or behavioral phenotypes, we used PTWAS results (Zhang et al., 2020) (5% FDR) as an independent source of evidence of causality between gene expression levels and asthma/allergic disease risk. PTWAS utilizes probabilistic eQTL annotations derived from multivariant Bayesian fine-mapping analysis conferring higher power to detect TWAS associations than existing methods. The evidence for causality from PTWAS is strong for the following reasons: (1) we use only strong IVs by combining the strength of multiple independent strong eQTLs for each gene and combining information across all tissues; (2) within the PTWAS framework, we can then validate the causality assumption for each gene-trait-tissue combination. We found that the exclusion restriction criterion was violated (heterogeneity of independent estimates across multiple strong eQTLs, I2 statistic >0.5) in only 0.36% of the gene-trait pairs for which we computed this statistic, none of which overlap our reported results. Using eQTL data across 49 tissues from GTEx v8, we used PTWAS to analyze GWAS summary statistics from several large-scale projects. Here, we specifically focused on the following asthma studies: GABRIEL-Asthma, TAGC-Asthma-EUR, UKB-20002–1111-self-reported-asthma, UKB-6152–8-diagnosed-by-doctor-Asthma, and allergic disease studies: EAGLE-Eczema, UKB-20002–1452-self-reported-eczema-or-dermatitis, UKB-6152–9-diagnosed-by-doctor-Hayfever-allergic-rhinitis-or-eczema. Additionally we considered other phenotypes that may be relevant for our cohort: chronotype (Jones-et-al-2016-Chronotype, UKB-1180-Morning-or-evening-person-chronotype), sleep duration (Jones-et-al-2016-SleepDuration, UKB-1160-Sleep-duration), and depressive symptoms (SSGAC-Depressive-Symptoms). To identify eGenes in children with asthma that are causally associated with asthma, we considered all 4943 eGenes that were used for the interaction eQTL analysis with a significant (10% FDR) marginal effect of the psychosocial experiences from the linear model that includes both the genotype dosage and the marginal environmental effect as well as their interaction: Expression ~ dosage + transcriptional signature +dosage*transcriptional signature.

Acknowledgements

We thank Luis Barreiro and Noah Snyder-Mackler for comments on an earlier version of this manuscript, and members of the Luca, Pique-Regi, Zilioli, and Slatcher labs for helpful discussions. We thank the study participants and their families for taking part in this study. We thank two anonymous reviewers for their insightful comments.

Appendix 1

Participant recruitment methods

We used the following three recruitment methods, in order of most-used to least-used:

  1. Dr. Secord, a co-I on the study, identifies patients who meet some of the basic criteria for the study who are then sent informational letters to their home address. The letter informs them of the basic information about the study and a telephone number to contact if interested.

  2. A nurse from DMC Children's Hospital approaches patients and their guardians in asthma clinic or ER and informs them about the study. If interested, patient’s guardian is later contacted to further inform them about the study and screen them for eligibility.

  3. Through informational flyer, posted in local asthma clinic, distributed to local K-12 schools with basic information about the study and eligibility criteria, and study coordinator contact information.

Basic information provided to potential participants across the three methods informs them that this is a family asthma study looking at everyday life and how health is affected, that there is a home or office/lab visit focused on surveys, a brief interview, and a 4-day at-home period and a follow-up where blood is collected.

Participant eligibility

  1. Child is between 10 and 15 years old.

  2. Child has an asthma diagnosis of at least mild to persistent asthma.

  3. At least one parent/guardian is willing to participate as well as the child.

  4. Participating parent/guardian should be living with the child consistently for the last six months.

  5. If unsure of which parent/guardian should participate, then the one that would have the most knowledge about health and day-to-day life.

  6. If multiple children in the house have asthma and are in the study age range, the family can choose which child will participate based on who would be most agreeable to the various measures taken throughout the study.

Identification of gene expression confounders

There are many sources of technical and biological noise in RNA-seq data. To accurately estimate gene expression differences between meaningful groups, one must account for additional sources of gene expression variation that are not of interest. To investigate the major sources of variation in the RNA-seq dataset, we performed principal component analysis (PCA) on normalized RNA-seq data and investigated (1) the correlation between first 15 PCs and each of the covariates (data not shown), and (2) proportion of variance explained by each covariate overall (Figure 1—figure supplement 7a), and per each gene (Figure 1—figure supplement 7b). The following covariates were significantly correlated with at least one PC: library preparation batch (confounded with library sequencing batch), RNA quality score (RIN), laboratory where the RNA sample was extracted, age, sex, height, weight, and ancestry (measured as first three principal components on the genotype matrix). These variables were also correlated with each other; therefore, we used PCA to calculate the PCs of all these covariates that explained >99.5% variance and incorporated them in the gene expression analyses as described in the Materials and methods.

Blood composition has a profound effect on gene expression

Blood composition is expected to have a strong effect on overall blood gene expression as blood is a heterogeneous tissue and different cell types may contribute different transcripts to the overall gene expression. The results of differential gene expression analysis with DESeq2 using LRT are displayed in Supplementary file 1i, j. As expected, the number of differentially expressed genes is larger for cell types that constitute a bigger fraction of the cell pool. Accordingly, we did not find genes differentially expressed for basophils, a cell type that constitutes less than 1% of total leukocytes. Based on these results, cell composition was accounted for in differential gene expression analyses of psychosocial variables and other analyses that distinguished between the two types of effects (mediation analysis).

Overlap of transcriptional signatures

We investigated whether transcriptional signatures for different variables were correlated with each other, which would suggest shared transcriptional pathways for these phenotypes and environments (Figure 2—figure supplement 1). Transcriptional signatures of the socioeconomic measures showed strong overlap: specifically, houses rated ≥fair, unoccupied houses, and subjective SES were intercorrelated. There was also some overlap in the social relationships category, with transcriptional signatures of daily youth-reported self-disclosure and objective maternal responsiveness significantly correlated. However, we also saw correlations crossing all three variable categories. For example, subjective SES was significantly correlated with objective maternal responsiveness, family conflict, and self-disclosure; objective negative affect was correlated with self-disclosure, family conflict, objective maternal responsiveness, and houses rated ≥fair.

We were able to identify transcriptional signatures that explained at least 1% of variance for four of the five blood composition variables and two of the seven glucocorticoid variables. Models predicting blood composition performed best with 47–150 genes, and models predicting glucocorticoid variables contained 17–100 genes. Notably, the transcriptional signatures explained a far greater percent of variation in blood composition than other categories of variables. Transcriptional signatures within each category were highly correlated, showing greater overlap within all glucocorticoid response variables and all blood composition variables (except for eosinophils, which were uncorrelated with all other blood composition signatures) than seen in asthma severity or social relationship categories. When we correlated glucocorticoid and blood composition transcriptional signatures with transcriptional signatures for psychosocial experiences and asthma severity, we observed some overlap for glucocorticoid response: specifically, stimulated levels of IL-5 were associated with percent-predicted FEV1 (r = −0.51, p<0.001), unoccupied houses (r = −0.27, p=<0.001), and youth-reported parent-child conflict (r = 0.18, p=0.003). There was much more overlap in blood composition transcriptional signatures with asthma severity and psychosocial experience signatures. For example, transcriptional signature of monocytes was correlated with a large number of transcriptional signatures of both asthma and psychosocial experiences, including nightly asthma symptoms (r = 0.21, p<0.001), and FEV (r = 0.45, p<0.001), self-disclosure (r = 0.49, p<0.001), unoccupied houses (r = 0.39, p=<0.001), subjective SES (r = 0.43, p<0.001), and objective negative affect (r = 0.35, p<0.001). Lymphocytes showed a very similar pattern of correlations to neutrophils, but in the opposite direction.

Targeted vs. unbiased approach to identifying effects of psychosocial experiences on gene expression

Previous research analyzing blood gene expression developed the CTRA (Fredrickson et al., 2013), which proposes expression of 53 immune genes as a composite indicator of immune system response to adverse social environments (reviewed in Cole, 2014). We compared the performance of our unbiased prediction model with a model limited to the 53 CTRA genes (Figure 1—figure supplement 8). The CTRA-based model performs better at predicting the fraction of two of the five blood cell types. This may reflect differential expression of CTRA genes in some cell types. CTRA-based and unbiased models perform similarly on psychosocial and neighborhood measures, with 9/16 traits (56%) predicted more accurately by the unbiased model. CTRA-based model performs poorly at predicting asthma and glucocorticoid phenotypes, and is outperformed by the unbiased model in 10/14 traits (71%), which is expected because it was not originally developed to capture these phenotypes. In summary, limiting the scope of the prediction model to preselected immune genes does not greatly diminish performance in predicting blood and psychosocial phenotypes, but the CTRA-based model does not appropriately reflect transcriptomic signatures of asthma and glucocorticoid phenotypes. This result suggests that genes outside the CTRA subset are important to many asthma and some psychosocial phenotypes.

Replication of GxE effects

To validate the observed GxE effects on gene expression, we explored the overlap between the GxE genes and previously published datasets that measured interactions with different environments. We found that the majority of our GxE genes replicated in other datasets of GxE in gene expression (p<0.05). Of our genes with GxE, 2 were found to have GxE in response to influenza (Lee et al., 2014), 59 were found to have GxE in response to a variety of chemical treatments (Moyerbrailean et al., 2016), 75 were found to have GxE in response to a variety of Mycobacterium tuberculosis (Barreiro et al., 2012), 87 were found to have GxE in response to rhinovirus (Çalışkan et al., 2015), 99 genes were found to have GxE in response to pathogens (Nédélec et al., 2016), and 117 had GxE with cell-type fraction in whole blood (Kim-Hellmuth et al., 2020).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Roger Pique-Regi, Email: rpique@wayne.edu.

Francesca Luca, Email: fluca@wayne.edu.

Patricia W Finn, University of Illinois at Chicago, United States.

Mone Zaidi, Icahn School of Medicine at Mount Sinai, United States.

Funding Information

This paper was supported by the following grant:

  • National Heart, Lung, and Blood Institute R01HL114097 to Samuele Zilioli, Richard B Slatcher.

Additional information

Competing interests

No competing interests declared.

Author contributions

Data curation, Formal analysis, Validation, Investigation, Visualization, Writing - original draft, Writing - review and editing.

Data curation, Formal analysis, Writing - original draft, Writing - review and editing.

Data curation.

Data curation.

Resources, Methodology, Writing - review and editing.

Resources, Data curation, Supervision, Writing - review and editing.

Resources, Data curation, Supervision, Project administration, Writing - review and editing.

Conceptualization, Resources, Data curation, Supervision, Funding acquisition, Project administration, Writing - review and editing.

Conceptualization, Resources, Data curation, Supervision, Funding acquisition, Investigation, Methodology, Writing - original draft, Project administration, Writing - review and editing.

Conceptualization, Resources, Data curation, Supervision, Funding acquisition, Investigation, Methodology, Writing - original draft, Project administration, Writing - review and editing.

Ethics

Human subjects: Participants were included from an ongoing longitudinal study, Asthma in the Lives of Families Today (ALOFT; recruited from November 2010-July 2018, Wayne State University Institutional Review Board approval #0412110B3F).

Additional files

Supplementary file 1. Supplementary tables.

(a) Basic demographic information on 119 individuals used to train the transcriptional signature models (‘training’) and the entire cohort of 251 participants (‘total’). Reported are the count (N) and percentage of non-missing values per each category. For each variable in parentheses, we reported the p-values for significant differences in variable distribution between the training group and the group not included in the model. (b) List of variables collected for the current study. DD: daily diary; SD: sleep diary; EAR: coded from Electronically Activated Recorder; YR: youth reported; PR: parent reported; CD: census data; GC: glucocorticoid; SD: standard deviation; α: Chronbach's alpha measuring reliability as the average correlation between scale items, as a function of the number of items included in the scale, N initial – sample size used for differential gene expression analysis and building transcriptional signatures (mean, SD, and α are reported for this subset), N longitudinal – number of samples with measurements from two time points available, N expanded – final sample size used for transcriptional signature and GxE validations. (c) Evaluation of transcriptional signatures derived using elastic net regression. For each variable, we report Pearson’s correlation coefficient, p-value, cross-validated percent variance explained, and sample size for the training dataset (columns 2–5), and Pearson’s correlation coefficient, p-value, and sample size for the entire cohort (columns 6–8). (d) Longitudinal replication of transcriptional signatures. For each variable, we report Spearman’s correlation coefficient between longitudinal change in observed variable and change in the transcriptional signature, p-value, coefficient of variation of longitudinal change in observed variable, and sample size. (e) Differentially expressed genes (DEGs) for all psychosocial variables (10% FDR; N: sample size). (f) Full DESeq results for differential gene expression analyses for psychosocial variables. (g) Correlations between z-scores of GxE interaction effects measured using transcriptional signatures and observed data. For each variable, we report Spearman’s correlation coefficient and its p-value, and odds ratio and p-value from Fisher’s test for enrichment of nominally significant results (permutation-corrected p-value<0.01) between GxE expression quantitative trait locus (eQTL) testing using transcriptional signatures and observed data. (h) Correlations between the top three principal components of covariate matrix and individual covariates (Pearson's product-moment correlations with numeric variables, polyserial correlations with bivariate variables; ns: correlation p-value>0.05). (i) DEGs associated with blood cell composition (10% FDR; N: sample size). (j) Full DESeq results for differential gene expression analyses for blood composition variables.

elife-63852-supp1.xlsx (48.9MB, xlsx)
Supplementary file 2. Detailed descriptions of methods for psychosocial data collection.
elife-63852-supp2.docx (41.2KB, docx)
Transparent reporting form
Reporting standard 1. STREGA reporting recommendations, extended from STROBE Statement.

Data availability

The data are available on dbGAP (accession number: phs002182.v1.p1).

The following dataset was generated:

Zilioli S, Slatcher RB, Pique-Regi R, Luca F. 2021. Asthma in the Lives of Families Today (ALOFT) dbGAP. phs002182.v1.p1

References

  1. Adler NE, Epel ES, Castellazzo G, Ickovics JR. Relationship of subjective and objective social status with psychological and physiological functioning: preliminary data in healthy white women. Health Psychology. 2000;19:586–592. doi: 10.1037/0278-6133.19.6.586. [DOI] [PubMed] [Google Scholar]
  2. Aguet F, Barbeira AN, Bonazzola R, Brown A, Castel SE, Jo B, Kasela S, GTEx Consortium The GTEx consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–1330. doi: 10.1126/science.aaz1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Akinbami L, Centers for Disease Control and Prevention National Center for Health Statistics The State of Childhood Asthma, United States, 1980-2005. Advance Data. 2006;381:1–24. [PubMed] [Google Scholar]
  4. Alasoo K, Rodrigues J, Mukhopadhyay S, Knights AJ, Mann AL, Kundu K, Hale C, Dougan G, Gaffney DJ, HIPSCI Consortium Shared genetic effects on chromatin and gene expression indicate a role for enhancer priming in immune response. Nature Genetics. 2018;50:424–431. doi: 10.1038/s41588-018-0046-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Alpert A, Pickman Y, Leipold M, Rosenberg-Hasson Y, Ji X, Gaujoux R, Rabani H, Starosvetsky E, Kveler K, Schaffert S, Furman D, Caspi O, Rosenschein U, Khatri P, Dekker CL, Maecker HT, Davis MM, Shen-Orr SS. A clinically meaningful metric of immune age derived from high-dimensional longitudinal monitoring. Nature Medicine. 2019;25:487–495. doi: 10.1038/s41591-019-0381-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Asgharzadeh S, Pique-Regi R, Sposto R, Wang H, Yang Y, Shimada H, Matthay K, Buckley J, Ortega A, Seeger RC. Prognostic significance of gene expression profiles of metastatic neuroblastomas lacking MYCN gene amplification. JNCI: Journal of the National Cancer Institute. 2006;98:1193–1203. doi: 10.1093/jnci/djj330. [DOI] [PubMed] [Google Scholar]
  7. Barreiro LB, Tailleux L, Pai AA, Gicquel B, Marioni JC, Gilad Y. Deciphering the genetic architecture of variation in the immune response to Mycobacterium tuberculosis infection. PNAS. 2012;109:1204–1209. doi: 10.1073/pnas.1115761109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bigler J, Boedigheimer M, Schofield JPR, Skipp PJ, Corfield J, Rowe A, Sousa AR, Timour M, Twehues L, Hu X, Roberts G, Welcher AA, Yu W, Lefaudeux D, Meulder B, Auffray C, Chung KF, Adcock IM, Sterk PJ, Djukanović R, U-BIOPRED Study Group A severe asthma disease signature from gene expression profiling of peripheral blood from U-BIOPRED cohorts. American Journal of Respiratory and Critical Care Medicine. 2017;195:1311–1320. doi: 10.1164/rccm.201604-0866OC. [DOI] [PubMed] [Google Scholar]
  9. Birnkrant DJ. Guidelines for the diagnosis and management of asthma. National asthma education and prevention program expert panel, NIH, February, 1997. Clinical Pediatrics. 1997;36:722–723. doi: 10.1177/000992289703601212. [DOI] [Google Scholar]
  10. Brusselle GG, Maes T, Bracke KR. Eosinophils in the spotlight: eosinophilic airway inflammation in nonallergic asthma. Nature Medicine. 2013;19:977–979. doi: 10.1038/nm.3300. [DOI] [PubMed] [Google Scholar]
  11. Çalışkan M, Baker SW, Gilad Y, Ober C. Host genetic variation influences gene expression response to rhinovirus infection. PLOS Genetics. 2015;11:e1005111. doi: 10.1371/journal.pgen.1005111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Casciano J, Krishnan JA, Small MB, Buck PO, Gopalan G, Li C, Kemp R, Dotiwala Z. Value of peripheral blood eosinophil markers to predict severity of asthma. BMC Pulmonary Medicine. 2016;16:109. doi: 10.1186/s12890-016-0271-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chen E, Miller GE, Walker HA, Arevalo JM, Sung CY, Cole SW. Genome-wide transcriptional profiling linked to social class in asthma. Thorax. 2009;64:38–43. doi: 10.1136/thx.2007.095091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Chen E, Miller GE. Stress and inflammation in exacerbations of asthma. Brain, Behavior, and Immunity. 2007;21:993–999. doi: 10.1016/j.bbi.2007.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chida Y, Steptoe A. Positive psychological well-being and mortality: a quantitative review of prospective observational studies. Psychosomatic Medicine. 2008;70:741–756. doi: 10.1097/PSY.0b013e31818105ba. [DOI] [PubMed] [Google Scholar]
  16. Cole SW. Social regulation of human gene expression. Current Directions in Psychological Science. 2009;18:132–137. doi: 10.1111/j.1467-8721.2009.01623.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Cole SW. Human social genomics. PLOS Genetics. 2014;10:e1004601. doi: 10.1371/journal.pgen.1004601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Elkins IJ, McGue M, Iacono WG. Genetic and environmental influences on parent-son relationships: evidence for increasing genetic influence during adolescence. Developmental Psychology. 1997;33:351–363. doi: 10.1037/0012-1649.33.2.351. [DOI] [PubMed] [Google Scholar]
  19. Everson-Rose SA, Lewis TT. Psychosocial factors and cardiovascular diseases. Annual Review of Public Health. 2005;26:469–500. doi: 10.1146/annurev.publhealth.26.021304.144542. [DOI] [PubMed] [Google Scholar]
  20. Ewart CK, Suchday S. Discovering how urban poverty and violence affect health: development and validation of a neighborhood stress index. Health Psychology. 2002;21:254–262. doi: 10.1037/0278-6133.21.3.254. [DOI] [PubMed] [Google Scholar]
  21. Farrell AK, Slatcher RB, Tobin ET, Imami L, Wildman DE, Luca F, Zilioli S. Socioeconomic status, family negative emotional climate, and anti-inflammatory gene expression among youth with asthma. Psychoneuroendocrinology. 2018;91:62–67. doi: 10.1016/j.psyneuen.2018.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Felitti VJ, Anda RF, Nordenberg D, Williamson DF, Spitz AM, Edwards V, Koss MP, Marks JS. Relationship of childhood abuse and household dysfunction to many of the leading causes of death in adults: the adverse childhood experiences (ACE) Study. American Journal of Preventive Medicine. 1998;14:245–258. doi: 10.1016/s0749-3797(98)00017-8. [DOI] [PubMed] [Google Scholar]
  23. Findley AS, Richards AL, Petrini C, Alazizi A, Doman E, Shanku AG, Davis GO, Hauff N, Sorokin Y, Wen X, Pique-Regi R, Luca F. Interpreting coronary artery disease risk through Gene-Environment interactions in gene regulation. Genetics. 2019;213:651–663. doi: 10.1534/genetics.119.302419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fredrickson BL, Grewen KM, Coffey KA, Algoe SB, Firestine AM, Arevalo JM, Ma J, Cole SW. A functional genomic perspective on human well-being. PNAS. 2013;110:13684–13689. doi: 10.1073/pnas.1305419110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software. 2010;33:1–22. doi: 10.18637/jss.v033.i01. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Galea S, Uddin M, Koenen K. The urban environment and mental disorders. Epigenetics. 2011;6:400–404. doi: 10.4161/epi.6.4.14944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Gallo LC, Matthews KA. Understanding the association between socioeconomic status and physical health: do negative emotions play a role? Psychological Bulletin. 2003;129:10–51. doi: 10.1037/0033-2909.129.1.10. [DOI] [PubMed] [Google Scholar]
  28. Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BW, Jansen R, de Geus EJ, Boomsma DI, Wright FA, Sullivan PF, Nikkola E, Alvarez M, Civelek M, Lusis AJ, Lehtimäki T, Raitoharju E, Kähönen M, Seppälä I, Raitakari OT, Kuusisto J, Laakso M, Price AL, Pajukanta P, Pasaniuc B. Integrative approaches for large-scale transcriptome-wide association studies. Nature Genetics. 2016;48:245–252. doi: 10.1038/ng.3506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Harrison BDW. Psychosocial aspects of asthma in adults. Thorax. 1998;53:519–525. doi: 10.1136/thx.53.6.519. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Holt-Lunstad J, Smith TB, Layton JB. Social relationships and mortality risk: a meta-analytic review. PLOS Medicine. 2010;7:e1000316. doi: 10.1371/journal.pmed.1000316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nature Methods. 2015;12:357–360. doi: 10.1038/nmeth.3317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Kim-Hellmuth S, Bechheim M, Pütz B, Mohammadi P, Nédélec Y, Giangreco N, Becker J, Kaiser V, Fricker N, Beier E, Boor P, Castel SE, Nöthen MM, Barreiro LB, Pickrell JK, Müller-Myhsok B, Lappalainen T, Schumacher J, Hornung V. Genetic regulatory effects modified by immune activation contribute to autoimmune disease associations. Nature Communications. 2017;8:266. doi: 10.1038/s41467-017-00366-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kim-Hellmuth S, Aguet F, Oliva M, Muñoz-Aguirre M, Kasela S, Wucher V, Castel SE, Hamel AR, Viñuela A, Roberts AL, Mangul S, Wen X, Wang G, Barbeira AN, Garrido-Martín D, Nadel BB, Zou Y, Bonazzola R, Quan J, Brown A, Martinez-Perez A, Soria JM, Getz G, Dermitzakis ET, Small KS, Stephens M, Xi HS, Im HK, Guigó R, Segrè AV, Stranger BE, Ardlie KG, Lappalainen T, GTEx Consortium Cell type-specific genetic regulation of gene expression across human tissues. Science. 2020;369:eaaz8528. doi: 10.1126/science.aaz8528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Knowles DA, Davis JR, Edgington H, Raj A, Favé MJ, Zhu X, Potash JB, Weissman MM, Shi J, Levinson DF, Awadalla P, Mostafavi S, Montgomery SB, Battle A. Allele-specific expression reveals interactions between genetic variation and environment. Nature Methods. 2017;14:699–702. doi: 10.1038/nmeth.4298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kovacs M. Children’s Depression Inventory (CDI2) Technical Manual; 2011. [Google Scholar]
  36. Law CW, Chen Y, Shi W, Smyth GK. Voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biology. 2014;15:R29. doi: 10.1186/gb-2014-15-2-r29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Lee MN, Ye C, Villani AC, Raj T, Li W, Eisenhaure TM, Imboywa SH, Chipendo PI, Ran FA, Slowikowski K, Ward LD, Raddassi K, McCabe C, Lee MH, Frohlich IY, Hafler DA, Kellis M, Raychaudhuri S, Zhang F, Stranger BE, Benoist CO, De Jager PL, Regev A, Hacohen N. Common genetic variants modulate pathogen-sensing responses in human dendritic cells. Science. 2014;343:1246980. doi: 10.1126/science.1246980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lehrer PM, Hochron SM, Isenberg S, Rausch L, Carr R. The asthma symptom profile: a psychophysically based scale for assessment of asthma symptoms. Journal of Psychosomatic Research. 1993;37:515–521. doi: 10.1016/0022-3999(93)90007-3. [DOI] [PubMed] [Google Scholar]
  39. Li JH, Mazur CA, Berisa T, Pickrell JK. Low-Pass sequencing increases the power of GWAS and decreases measurement error of polygenic risk scores compared to genotyping arrays. Genome Research. 2021;31:529–537. doi: 10.1101/gr.266486.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Lima-Matos A, Ponte EV, de Jesus JPV, Almeida PCA, Lima VB, Kwon N, Riley J, de Mello LM, Cruz ÁA. Eosinophilic asthma, according to a blood eosinophil criterion, is associated with disease severity and lack of control among underprivileged urban brazilians. Respiratory Medicine. 2018;145:95–100. doi: 10.1016/j.rmed.2018.10.025. [DOI] [PubMed] [Google Scholar]
  41. Litonjua AA, Carey VJ, Weiss ST, Gold DR. Race, socioeconomic factors, and area of residence are associated with asthma prevalence. Pediatric Pulmonology. 1999;28:394–401. doi: 10.1002/(SICI)1099-0496(199912)28:6&#x0003c;394::AID-PPUL2&#x0003e;3.0.CO;2-6. [DOI] [PubMed] [Google Scholar]
  42. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Mamessier E, Nieves A, Lorec AM, Dupuy P, Pinot D, Pinet C, Vervloet D, Magnan A. T-cell activation during exacerbations: a longitudinal study in refractory asthma. Allergy. 2008;63:1202–1210. doi: 10.1111/j.1398-9995.2008.01687.x. [DOI] [PubMed] [Google Scholar]
  44. Marigorta UM, Denson LA, Hyams JS, Mondal K, Prince J, Walters TD, Griffiths A, Noe JD, Crandall WV, Rosh JR, Mack DR, Kellermayer R, Heyman MB, Baker SS, Stephens MC, Baldassano RN, Markowitz JF, Kim MO, Dubinsky MC, Cho J, Aronow BJ, Kugathasan S, Gibson G. Transcriptional risk scores link GWAS to eQTLs and predict complications in crohn's disease. Nature Genetics. 2017;49:1517–1521. doi: 10.1038/ng.3936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Marin TJ, Chen E, Munch JA, Miller GE. Double-exposure to acute stress and chronic family stress is associated with immune changes in children with asthma. Psychosomatic Medicine. 2009;71:378–384. doi: 10.1097/PSY.0b013e318199dbc3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Mehl MR, Pennebaker JW, Crow DM, Dabbs J, Price JH. The electronically activated recorder (EAR): a device for sampling naturalistic daily activities and conversations. Behavior Research Methods, Instruments, & Computers. 2001;33:517–523. doi: 10.3758/BF03195410. [DOI] [PubMed] [Google Scholar]
  47. Meyer TJ, Mark MM. Effects of psychosocial interventions with adult Cancer patients: a meta-analysis of randomized experiments. Health Psychology. 1995;14:101–108. doi: 10.1037/0278-6133.14.2.101. [DOI] [PubMed] [Google Scholar]
  48. Mielck A, Reitmeir P, Wjst M. Severity of childhood asthma by socioeconomic status. International Journal of Epidemiology. 1996;25:388–393. doi: 10.1093/ije/25.2.388. [DOI] [PubMed] [Google Scholar]
  49. Miller G, Chen E, Cole SW. Health psychology: developing biologically plausible models linking the social world and physical health. Annual Review of Psychology. 2009;60:501–524. doi: 10.1146/annurev.psych.60.110707.163551. [DOI] [PubMed] [Google Scholar]
  50. Monk TH, Reynolds CF, Kupfer DJ, Buysse DJ, Coble PA, Hayes AJ, Machen MA, Petrie SR, Ritenour AM. The pittsburgh sleep diary. Journal of Sleep Research. 1994;3:111–120. doi: 10.1111/j.1365-2869.1994.tb00114.x. [DOI] [PubMed] [Google Scholar]
  51. Moorman JE, Akinbami LJ, Bailey CM, Zahran HS, King ME, Johnson CA, Liu X. National surveillance of asthma: united states, 2001-2010. Vital & Health Statistics. 2012;35:1–58. [PubMed] [Google Scholar]
  52. Mostafavi H, Harpak A, Agarwal I, Conley D, Pritchard JK, Przeworski M. Variable prediction accuracy of polygenic scores within an ancestry group. eLife. 2020;9:e48376. doi: 10.7554/eLife.48376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Moyerbrailean GA, Richards AL, Kurtz D, Kalita CA, Davis GO, Harvey CT, Alazizi A, Watza D, Sorokin Y, Hauff N, Zhou X, Wen X, Pique-Regi R, Luca F. High-throughput allele-specific expression across 250 environmental conditions. Genome Research. 2016;26:1627–1638. doi: 10.1101/gr.209759.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Nédélec Y, Sanz J, Baharian G, Szpiech ZA, Pacis A, Dumaine A, Grenier JC, Freiman A, Sams AJ, Hebert S, Pagé Sabourin A, Luca F, Blekhman R, Hernandez RD, Pique-Regi R, Tung J, Yotova V, Barreiro LB. Genetic ancestry and natural selection drive population differences in immune responses to pathogens. Cell. 2016;167:657–669. doi: 10.1016/j.cell.2016.09.025. [DOI] [PubMed] [Google Scholar]
  55. Nica AC, Montgomery SB, Dimas AS, Stranger BE, Beazley C, Barroso I, Dermitzakis ET. Candidate causal regulatory effects by integration of expression QTLs with complex trait genetic associations. PLOS Genetics. 2010;6:e1000895. doi: 10.1371/journal.pgen.1000895. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Nica AC, Dermitzakis ET. Using gene expression to investigate the genetic basis of complex disorders. Human Molecular Genetics. 2008;17:R129–R134. doi: 10.1093/hmg/ddn285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Ongen H, Buil A, Brown AA, Dermitzakis ET, Delaneau O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics. 2016;32:1479–1485. doi: 10.1093/bioinformatics/btv722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Pennebaker JW. Emotion, Disclosure, and Health.  Washington, DC: American Psychological Association; 1995. [Google Scholar]
  59. Powell ND, Sloan EK, Bailey MT, Arevalo JM, Miller GE, Chen E, Kobor MS, Reader BF, Sheridan JF, Cole SW. Social stress up-regulates inflammatory gene expression in the leukocyte transcriptome via β-adrenergic induction of myelopoiesis. PNAS. 2013;110:16574–16579. doi: 10.1073/pnas.1310655110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Pressman SD, Jenkins BN, Moskowitz JT. Positive affect and health: what do we know and where next should we go? Annual Review of Psychology. 2019;70:627–650. doi: 10.1146/annurev-psych-010418-102955. [DOI] [PubMed] [Google Scholar]
  61. Radloff LS. The CES-D scale. Applied Psychological Measurement. 1977;1:385–401. doi: 10.1177/014662167700100306. [DOI] [Google Scholar]
  62. Rava M, Smit LA, Nadif R. Gene-environment interactions in the study of asthma in the postgenomewide association studies era. Current Opinion in Allergy & Clinical Immunology. 2015;15:70–78. doi: 10.1097/ACI.0000000000000131. [DOI] [PubMed] [Google Scholar]
  63. Ray A, Kolls JK. Neutrophilic inflammation in asthma and association with disease severity. Trends in Immunology. 2017;38:942–954. doi: 10.1016/j.it.2017.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Repetti RL, Taylor SE, Seeman TE. Risky families: family social environments and the mental and physical health of offspring. Psychological Bulletin. 2002;128:330–366. doi: 10.1037/0033-2909.128.2.330. [DOI] [PubMed] [Google Scholar]
  65. Richards AL, Muehlbauer AL, Alazizi A, Burns MB, Findley A, Messina F, Gould TJ, Cascardo C, Pique-Regi R, Blekhman R, Luca F. Gut Microbiota has a widespread and modifiable effect on host gene regulation. mSystems. 2019;4:00323-18. doi: 10.1128/mSystems.00323-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Robles TF, Slatcher RB, Trombello JM, McGinn MM. Marital quality and health: a meta-analytic review. Psychological Bulletin. 2014;140:140–187. doi: 10.1037/a0031859. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Robles TF, Repetti RL, Reynolds BM, Chung PJ, Arevalo JMG, Cole SW. Family environments and leukocyte transcriptome indicators of a proinflammatory phenotype in children and parents. Development and Psychopathology. 2018;30:235–253. doi: 10.1017/S0954579417000591. [DOI] [PubMed] [Google Scholar]
  68. Sandberg S, Paton JY, Ahola S, McCann DC, McGuinness D, Hillary CR, Oja H. The role of acute and chronic stress in asthma attacks in children. The Lancet. 2000;356:982–987. doi: 10.1016/S0140-6736(00)02715-X. [DOI] [PubMed] [Google Scholar]
  69. Schaefer ES. Children's reports of parental behavior: an inventory. Child Development. 1965;36:413. doi: 10.2307/1126465. [DOI] [PubMed] [Google Scholar]
  70. Segman RH, Goltser-Dubner T, Weiner I, Canetti L, Galili-Weisstub E, Milwidsky A, Pablov V, Friedman N, Hochner-Celnikier D. Blood mononuclear cell gene expression signature of postpartum depression. Molecular Psychiatry. 2010;15:93–100. doi: 10.1038/mp.2009.65. [DOI] [PubMed] [Google Scholar]
  71. Shankardass K, McConnell R, Jerrett M, Milam J, Richardson J, Berhane K. Parental stress increases the effect of traffic-related air pollution on childhood asthma incidence. PNAS. 2009;106:12406–12411. doi: 10.1073/pnas.0812910106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Sinz H, Renz H, Skevaki C. Cellular and noncellular bloodborne biomarkers in asthma. Annals of Allergy, Asthma & Immunology. 2017;118:672–679. doi: 10.1016/j.anai.2017.04.016. [DOI] [PubMed] [Google Scholar]
  73. Slavich GM, Cole SW. The emerging field of human social genomics. Clinical Psychological Science. 2013;1:331–348. doi: 10.1177/2167702613478594. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Smith TW, Glazer K, Ruiz JM, Gallo LC. Hostility, anger, aggressiveness, and coronary heart disease: an interpersonal perspective on personality, emotion, and health. Journal of Personality. 2004;72:1217–1270. doi: 10.1111/j.1467-6494.2004.00296.x. [DOI] [PubMed] [Google Scholar]
  75. Stanton SCE, Zilioli S, Briskin JL, Imami L, Tobin ET, Wildman DE, Mair-Meijers H, Luca F, Kane HS, Slatcher RB. Mothers' Attachment is linked to their children's Anti-Inflammatory Gene Expression via Maternal Warmth. Social Psychological and Personality Science. 2017;8:796–805. doi: 10.1177/1948550616687125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Sullivan SD, Weiss KB, Lynn H, Mitchell H, Kattan M, Gergen PJ, Evans R, National Cooperative Inner-City Asthma Study (NCICAS) Investigators The cost-effectiveness of an inner-city asthma intervention for children. Journal of Allergy and Clinical Immunology. 2002;110:576–581. doi: 10.1067/mai.2002.128009. [DOI] [PubMed] [Google Scholar]
  77. Taylor SE, Lerner JS, Sage RM, Lehman BJ, Seeman TE. Early environment, emotions, responses to stress, and health. Journal of Personality. 2004;72:1365–1394. doi: 10.1111/j.1467-6494.2004.00300.x. [DOI] [PubMed] [Google Scholar]
  78. Thompson EE, Dang Q, Mitchell-Handley B, Rajendran K, Ram-Mohan S, Solway J, Ober C, Krishnan R. Cytokine-induced molecular responses in airway smooth muscle cells inform genome-wide association studies of asthma. Genome Medicine. 2020;12:64. doi: 10.1186/s13073-020-00759-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Tobin ET, Kane HS, Saleh DJ, Naar-King S, Poowuttikul P, Secord E, Pierantoni W, Simon VA, Slatcher RB. Naturalistically observed conflict and youth asthma symptoms. Health Psychology. 2015;34:622–631. doi: 10.1037/hea0000138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Toyokawa S, Uddin M, Koenen KC, Galea S. How does the social environment 'get into the mind'? Epigenetics at the intersection of social and psychiatric epidemiology. Social Science & Medicine. 2012;74:67–74. doi: 10.1016/j.socscimed.2011.09.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Umetsu DT, McIntire JJ, Akbari O, Macaubas C, DeKruyff RH. Asthma: an epidemic of dysregulated immunity. Nature Immunology. 2002;3:715–720. doi: 10.1038/ni0802-715. [DOI] [PubMed] [Google Scholar]
  82. Vedel-Krogh S, Fallgaard Nielsen S, Lange P, Vestbo J, Nordestgaard BG. Association of blood eosinophil and blood neutrophil counts with asthma exacerbations in the Copenhagen general population study. Clinical Chemistry. 2017;63:823–832. doi: 10.1373/clinchem.2016.267450. [DOI] [PubMed] [Google Scholar]
  83. von Mutius E. The environmental predictors of allergic disease. Journal of Allergy and Clinical Immunology. 2000;105:9–19. doi: 10.1016/S0091-6749(00)90171-4. [DOI] [PubMed] [Google Scholar]
  84. von Mutius E. Gene-environment interactions in asthma. Journal of Allergy and Clinical Immunology. 2009;123:3–11. doi: 10.1016/j.jaci.2008.10.046. [DOI] [PubMed] [Google Scholar]
  85. Võsa U, Claringbould A, Westra H-J, Bonder MJ, Deelen P, Zeng B, Kirsten H. Unraveling the polygenic architecture of complex traits using blood eQTL metaanalysis. bioRxiv. 2018 doi: 10.1101/447367. [DOI]
  86. Wasik K, Berisa T, Pickrell JK, Li JH, Fraser DJ, King K, Cox C. Comparing low-pass sequencing and genotyping for trait mapping in pharmacogenetics. BMC Genomics. 2021;22:197. doi: 10.1186/s12864-021-07508-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Weckle A, Aiello AE, Uddin M, Galea S, Coulborn RM, Soliven R, Meier H, Wildman DE. Rapid fractionation and isolation of whole blood components in samples obtained from a Community-based setting.  J. Vis. Exp. 2015;30:52227. doi: 10.3791/52227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Weiss KB, Sullivan SD, Lyttle CS. Trends in the cost of illness for asthma in the united states, 1985-1994. Journal of Allergy and Clinical Immunology. 2000;106:493–499. doi: 10.1067/mai.2000.109426. [DOI] [PubMed] [Google Scholar]
  89. Woodruff PG, Modrek B, Choy DF, Jia G, Abbas AR, Ellwanger A, Koth LL, Arron JR, Fahy JV. T-helper type 2-driven inflammation defines major subphenotypes of asthma. American Journal of Respiratory and Critical Care Medicine. 2009;180:388–395. doi: 10.1164/rccm.200903-0392OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Wright RJ, Rodriguez M, Cohen S. Review of psychosocial stress and asthma: an integrated biopsychosocial approach. Thorax. 1998;53:1066–1074. doi: 10.1136/thx.53.12.1066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Wright RJ, Cohen RT, Cohen S. The impact of stress on the development and expression of atopy. Current Opinion in Allergy and Clinical Immunology. 2005;5:23–29. doi: 10.1097/00130832-200502000-00006. [DOI] [PubMed] [Google Scholar]
  92. Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS: A Journal of Integrative Biology. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Zhang Y, Quick C, Yu K, Barbeira A, Luca F, Pique-Regi R, Kyung Im H, Wen X, GTEx Consortium PTWAS: investigating tissue-relevant causal molecular mechanisms of complex traits using probabilistic TWAS analysis. Genome Biology. 2020;21:232. doi: 10.1186/s13059-020-02026-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Zhernakova DV, Deelen P, Vermaat M, van Iterson M, van Galen M, Arindrarto W, van 't Hof P, Mei H, van Dijk F, Westra HJ, Bonder MJ, van Rooij J, Verkerk M, Jhamai PM, Moed M, Kielbasa SM, Bot J, Nooren I, Pool R, van Dongen J, Hottenga JJ, Stehouwer CD, van der Kallen CJ, Schalkwijk CG, Zhernakova A, Li Y, Tigchelaar EF, de Klein N, Beekman M, Deelen J, van Heemst D, van den Berg LH, Hofman A, Uitterlinden AG, van Greevenbroek MM, Veldink JH, Boomsma DI, van Duijn CM, Wijmenga C, Slagboom PE, Swertz MA, Isaacs A, van Meurs JB, Jansen R, Heijmans BT, 't Hoen PA, Franke L. Identification of context-dependent expression quantitative trait loci in whole blood. Nature Genetics. 2017;49:139–145. doi: 10.1038/ng.3737. [DOI] [PubMed] [Google Scholar]
  95. Zilioli S, Slatcher RB, Fritz H, Booza JC, Cutchin MP. Brief report: neighborhood disadvantage and hair cortisol among older urban african americans. Psychoneuroendocrinology. 2017;80:36–38. doi: 10.1016/j.psyneuen.2017.02.026. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Patricia W Finn1

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

This paper identifies transcriptional signatures of 19 variables including psychosocial factors, blood cell composition and asthma symptoms through RNA-sequencing and a new machine learning strategy. The results show that immune gene expression mediates the link to negative psychosocial experiences.

Decision letter after peer review:

Thank you for submitting your article "Psychosocial experiences modulate asthma-associated genes through gene-environment interactions" for consideration by eLife. Your article has been reviewed by 2 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Mone Zaidi as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

As the editors have judged that your manuscript is of interest, but as described below that additional experiments are required before it is published, we would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). First, because many researchers have temporarily lost access to the labs, we will give authors as much time as they need to submit revised manuscripts. We are also offering, if you choose, to post the manuscript to bioRxiv (if it is not already there) along with this decision letter and a formal designation that the manuscript is "in revision at eLife". Please let us know if you would like to pursue this option. (If your work is more suitable for medRxiv, you will need to post the preprint yourself, as the mechanisms for us to do so are still in development.)

Summary

This paper studies the interaction between genetics and psychosocial environment. RNA-seq in PBMCs is performed to assess associations between transcription, questionnaire response data, PBMC cytokine production, and pulmonary function tests. Additionally, the investigators utilized genetic data to assess gene by environment interactions. This work would be of interest to a wide viewership. However, significant issues with underlying methodology detract from interpretation of the data and make it difficult to draw any definitive conclusions. Given these underlying concerns, this paper's conclusions are not supported by the data presented and significant changes in methodology are necessary.

Essential revisions:

1. The greatest concern with this manuscript is that instead of using observed variables the use generalized linear models with elastic net regularization was employed to develop surrogate markers for observed variables within transcriptional data. This in itself is not inherently problematic if the authors were able to identify transcriptional signatures that explained a significant amount of variance in the observed variable. Unfortunately, the data suggests that transcriptional signatures only predict differences in peripheral leukocyte population. The % variance explained for neutrophils, lymphocytes, and eosinophils was >30% (not suggested as a cutoff). Notably, these three variables had significant and strong correlations at follow-up time points. This suggests that the authors indeed did identify transcriptional signatures that might reasonably serve as surrogates to these observed variables. However, the main focus of this manuscript is the psychosocial environmental exposure. The % variance explained for each these transcriptional signatures is < 5%. Further, these transcriptional signatures showed weak rho values when assessed at follow-up. These are not reasonable surrogates for observed values as these signatures could not within a reasonable margin of error predict many of these psychosocial measures. Due to this issue the interpretation of figures 2 and 3 becomes obscured. It remains usure what it means to have self-disclosure signature correlated with a FEV1 percent predicted signature when these signatures are not predictive of observed data. Do the observed values correlate? If so this might lend credence to authors proposed transcriptional signatures. Further the same issue plagues to the findings in figure 3 where these transcriptional signatures are used for the regression to test if known eQTLs have interaction with psychosocial measures. Since Figures 2 and 3 represent the major findings of the paper. Unfortunately, definitive conclusion cannot be drawn until the authors rectify this issue.

2. With regard to the mediation analysis presented in Figure 2, two issues arise. First, the authors present the mediation as if the three leukocyte populations were assessed simultaneously as mediators which does not appear to be the case from the description in the methods. Second, a sensitivity analysis should be performed with the mediation to assess if there is an observed confounder.

3. There is concern if the sequencing genomes at 0.1x coverage is sufficient. This leaves a significant amount data that is imputed. How many of the significant findings were derived from imputed values? Does the pair-wise error rate reported for genotype call between 0.03 and 0.12 refer to percent of base pairs where there is a discrepancy? Units are important, is this 3-12% or 0.03-0.12%.

4. It is unclear whether the interpretation of the eQTLs associated with asthma/atopy risk in population where 100% subjects have asthma. In these populations changes in gene expression can't correspond to risk because no individual is at risk; they already have the disease.

5. One lingering question is the clinical implication(s) of this extremely elegant work. In the big picture, how could we employ these results to tailor treatments for kids with asthma?

6. The effect of sex differences. While the authors recruited a similar number of male and females, it is not indicated their proportion in the 119. Additionally, asthma severity in females is exacerbated due to menstruation (Also, asthma is worsened during hormonal changes in females (e.g., menstruation, see Zein, J.G., Erzurum, S.C. Asthma is Different in Women. Curr Allergy Asthma Rep 15, 28 (2015). https://doi.org/10.1007/s11882-015-0528-y). According to the provided data, the authors do not correct for this effect and based on the current text, it is not address whether it is was taken into account. Additionally, what is the justification for dropping all the gene chromosomes? Would the results change if they were not removed?

7. One of the main difficulties while reading the text for me was that some of the topics are not clearly introduced and explained in the main text, while they were super easy to follow the same concepts and ideas in the supplementary information and in the appendix. Could the author modify the text to make it a little bit clearer, please?

8. The authors indicate that SES transcriptional signature "strong overlap with each other". Regarding this sentence: (a) SES is a very complex item to define and measure. Could the author specify how did they calculate it, e.g., income, education, a conglomerated index? (b) Is it a correlation between which variables, e.g., normalized RNA-seq counts, outputs from the model? Or you are talking about common present/absent transcripts? (c) Based on the heatmap, there are a total of 4 SES-houses unoccupied, subjective SES, parental income and houses rated >=fair, and only houses unoccupied and houses rated >= fair have a strong negative correlation, which one would have expected. Also, only parental income and houses rated >=fair are in the same cluster. (d) "For example, subjective SES was significantly correlated with objective parental responsiveness, family conflict, and self-reported self-disclosure, which is the extent to which the youths talk about their thoughts and feelings (r=-0.26, p=0.004, r=0.25, p=0.006, r=0.53, p=6.8*10-10, respectively)". Are those correlations between the transcriptional signatures associated with each variable? Or between the variables themselves. Could you clarify it, please?Sometimes the authors employed the term transcriptional signatures of subjective SES while other times they just referred to subjective SES. Finally, were those correlations models corrected by age? Did you look at associations with race/ethnicity? Did you correct from age, sex, race, ethnicity, SES?

9 "Using mediation analysis, we found significant (p<0.05) paths through all three blood composition signatures, such that, at the molecular level, self-disclosure association with higher pulmonary function could be partially explained by an increase in the proportions of monocytes and neutrophils and reduced proportions of lymphocytes (Figure 2b)". Were other physiological inputs (e.g., subjected SES), mediators (e.g., IL13 GC resistance) and/or outputs (e.g., asthma severity) modeled for the mediation analysis? If not, may the authors explain the reasoning of selecting the chosen variables, please?

10. "To examine the genotype-by-environmental effects… entire cohort of 251 individuals" I am concerned whether the subjects that were employed to identify the transcriptional signatures include representative of those individuals not included in the models in terms of socio-demographic characteristics. Also, the models were validated with only in a subset of psychological measurements-note that psychological measurements were indeed the worst predicted in the validation set. Therefore, I am not sure that the predicted psychological values are adequate. Is there any manner that we can verify their accuracy?

11. For the physiological analysis, why anxiety measurements were not incorporated, e.g., GAD-7 or similar, as they can trigger asthma symptoms?

eLife. 2021 Jun 18;10:e63852. doi: 10.7554/eLife.63852.sa2

Author response


Summary

This paper studies the interaction between genetics and psychosocial environment. RNA-seq in PBMCs is performed to assess associations between transcription, questionnaire response data, PBMC cytokine production, and pulmonary function tests. Additionally, the investigators utilized genetic data to assess gene by environment interactions. This work would be of interest to a wide viewership. However, significant issues with underlying methodology detract from interpretation of the data and make it difficult to draw any definitive conclusions. Given these underlying concerns, this paper's conclusions are not supported by the data presented and significant changes in methodology are necessary.

We thank the reviewers for their insightful and excellent comments. We have carefully considered them and addressed all the major points. To address the comments, we have collected additional data on 31 variables for samples for which we did not have this information curated in our earlier submission (increasing the sample size on average by 80%). We also re-ran the eQTL mapping analysis because we realized that we had mis-annotated the regulatory region for some genes; yet this did not substantially change the results. We also removed one transcriptional signature (parental income), because upon further inspection the variable needed to be recoded and the transcriptional signature was not significantly predictive.

Essential revisions:

1. The greatest concern with this manuscript is that instead of using observed variables the use generalized linear models with elastic net regularization was employed to develop surrogate markers for observed variables within transcriptional data. This in itself is not inherently problematic if the authors were able to identify transcriptional signatures that explained a significant amount of variance in the observed variable. Unfortunately, the data suggests that transcriptional signatures only predict differences in peripheral leukocyte population. The % variance explained for neutrophils, lymphocytes, and eosinophils was >30% (not suggested as a cutoff). Notably, these three variables had significant and strong correlations at follow-up time points. This suggests that the authors indeed did identify transcriptional signatures that might reasonably serve as surrogates to these observed variables. However, the main focus of this manuscript is the psychosocial environmental exposure. The % variance explained for each these transcriptional signatures is < 5%. Further, these transcriptional signatures showed weak rho values when assessed at follow-up. These are not reasonable surrogates for observed values as these signatures could not within a reasonable margin of error predict many of these psychosocial measures. Due to this issue the interpretation of figures 2 and 3 becomes obscured. It remains usure what it means to have self-disclosure signature correlated with a FEV1 percent predicted signature when these signatures are not predictive of observed data. Do the observed values correlate? If so this might lend credence to authors proposed transcriptional signatures. Further the same issue plagues to the findings in figure 3 where these transcriptional signatures are used for the regression to test if known eQTLs have interaction with psychosocial measures. Since Figures 2 and 3 represent the major findings of the paper. Unfortunately, definitive conclusion cannot be drawn until the authors rectify this issue.

We understand the reviewer's concern and thank them for the suggestions. Our objective is not simply to predict the unobserved values, but we also assume that the observed variable has high levels of noise and the measured value does not reflect the true biological effect. Therefore, we use the predicted values for all participants, including those for whom the variable was directly measured (denoising). The best fit for the model is used to predict, for both observed and unobserved samples, the biological impact on gene expression of the relevant variable. Please note that Supplementary File 1c contains cross-validated percent variance explained – the most stringent metric for evaluating model performance. Correlations between all variables and their transcriptional signatures were significant (pvalue<0.05), but we decided to use a cutoff of 1% cross-validated variance explained to only use the strongest transcriptional signatures. When we focus on these variables, the correlations between measured variables and their transcriptional signatures are significantly positive and range from 0.250.98 (mean=0.74).

We now also include correlations between measured and imputed values on a bigger subset of individuals (additional individuals not used in training the transcriptional signatures) in Supplementary File 1c. All correlations are significant (p<0.05) and range from 0.14-0.90 (mean=0.48).

The major limitation in predicting changes at follow up has been that for some variables the actual measured value did not change at all or very minimally in this short time frame.

Irrespective of being a cell-type composition variable, or a different type of variable if there was a large change and the signature was predictive, we were able to predict the change from the gene expression data.

To address the reviewer’s concern that correlations between transcriptional signatures may not reflect correlations between observed values, we have now added a Figure 1—figure supplement 2. This figure compares the correlation between the observed measurements and the correlation between the imputed/denoised variables. Some of the values for the pairwise correlations between transcriptional signatures slightly changed because we have now used the values imputed for the entire sample. Strong pairwise correlations in the measured variables are preserved in the imputed variables and detected by both approaches (Correlation of pair-wise correlation coefficients is 0.9, p-value = 6.069e-11). Additionally, pairwise correlations that are detected as significant in both the measured and the denoised/imputed variables are all in the same direction. Across all pairwise correlation coefficients, the correlation of those obtained between the measured variables or the imputed variables is 0.58, p-value < 2.2e-16. In our opinion, the fact that we detect a higher number of significant correlations in the imputed variables is due to the denoising effect.

We have added the following text to the manuscript:

Results:

"Overall, correlations between transcriptional signatures reflect correlations between measured variables (Figure 1—figure supplement 2), yet they are stronger between the transcriptional signatures, highlighting the denoising effect"

Discussion:

“Interestingly, we detected a higher number of significant correlations across imputed variables compared to the measured variables. This observation supports the effectiveness of the denoising procedure that we used to define transcriptional signatures.”

2. With regard to the mediation analysis presented in Figure 2, two issues arise. First, the authors present the mediation as if the three leukocyte populations were assessed simultaneously as mediators which does not appear to be the case from the description in the methods. Second, a sensitivity analysis should be performed with the mediation to assess if there is an observed confounder.

We agree with the reviewer that more complex scenarios may explain the observed correlations between the transcriptional signatures. Acknowledging these effects and given that the major focus of the manuscript is on the GxE effect in gene expression and asthma, we have decided to remove this section of the results as it would require additional analyses beyond the scope of this manuscript.

3. There is concern if the sequencing genomes at 0.1x coverage is sufficient. This leaves a significant amount data that is imputed. How many of the significant findings were derived from imputed values? Does the pair-wise error rate reported for genotype call between 0.03 and 0.12 refer to percent of base pairs where there is a discrepancy? Units are important, is this 3-12% or 0.03-0.12%.

We thank the reviewer for this comment. We used an external provider (Gencove, New York, NY) with a well-established genotyping-from-sequencing pipeline (PMID: 33743587, PMID: 33536225) to obtain genotype calls from high purity DNA samples for all participants. The actual coverage used was 0.4x, we have updated the methods to report this value. The genotype accuracy at this sequencing depth is 98.22% positive agreement and 99.82% negative agreement compared to genotyping arrays. In our study, we used the pairwise discordance between DNA and RNA-derived genotypes to identify sample swaps. The 3-12% DNA/RNA discordance was assessed only on the biallelic SNPs; yet only a subset of these SNPs is used for eQTL mapping (MAF>0.1 and within 1Mb of the TSS). Additionally, the reported figure represents an overestimate of the true error rate, as heterozygous genetic variants are inherently more prone to call errors and issues such as allele-specific expression may obscure genotype calls derived from RNA-seq data (known RNA editing sites were not excluded). Furthermore, we do not use discrete genotypes for eQTL mapping, and we use allelic dosage instead because they are less sensitive to genotype calling errors.

The following changes have been made in the methods section:

"Genotype data. All individuals in this study were genotyped from low-coverage (~0.4X) whole-genome sequencing and imputed to 37.5 M variants using the 1000 Genomes database by Gencove (New York, NY). […] This rate is not a direct measure of genotype call accuracy, but it is useful to identify possible sample swaps and mismatches."

4. It is unclear whether the interpretation of the eQTLs associated with asthma/atopy risk in population where 100% subjects have asthma. In these populations changes in gene expression can't correspond to risk because no individual is at risk; they already have the disease.

We agree with the reviewer that our cohort of case-only asthmatic participants would not allow us to determine which genes may predispose to higher asthma risk. That is why we used an external dataset from a Transcriptome-Wide Association Study (Zhang, 2019), which assessed disease-gene expression associations based on eQTL data from GTEx and multiple large GWAS for asthma risk in the typical case control study design. We hypothesized that the same genes and pathways that contribute to asthma risk are also involved in asthma symptom severity through gene regulatory variation in immune cells. We now clarify these concepts as reported below:

Results

"We hypothesized that genes and pathways that contribute to asthma risk are also involved in asthma symptom severity through gene regulatory variation in immune cells. We investigated whether genes associated with the risk for asthma were modulated through psychosocial experiences (E), and/or GxE effects."

Discussion

"As we gain additional knowledge on the mechanisms connecting psychosocial experiences to disease, these results can be useful to support the need for social interventions that may ultimately lead to improved overall health[…] The 124 instances of GxE with psychosocial experiences are particularly relevant when evaluating a polygenic risk score for asthma phenotypes."

5. One lingering question is the clinical implication(s) of this extremely elegant work. In the big picture, how could we employ these results to tailor treatments for kids with asthma?

We agree with the reviewer that this is indeed an extremely relevant question. Our work is at the intersection between clinical, molecular, and social studies. While genetic risk for extreme asthma phenotypes cannot be modified, our work shows that the social environment can modulate these important genetic effects on the regulation of genes relevant for asthma and immunity. We envision that this and other studies may open the road to social interventions aimed at improving the psychosocial environment of asthmatic children in order to ameliorate their symptoms and ultimately lead to better clinical outcomes. These social interventions may be implemented independently or together with drug treatments.

The following changes were made in the revised discussion:

"As we gain additional knowledge on the mechanisms connecting psychosocial experiences to disease, these results can be useful to support the need for social interventions that may ultimately lead to improved overall health. […] These social interventions may be implemented independently or together with drug treatments, and their impact could be further monitored through gene expression with larger longitudinal samples in future studies."

6. The effect of sex differences. While the authors recruited a similar number of male and females, it is not indicated their proportion in the 119. Additionally, asthma severity in females is exacerbated due to menstruation (Also, asthma is worsened during hormonal changes in females (e.g., menstruation, see Zein, J.G., Erzurum, S.C. Asthma is Different in Women. Curr Allergy Asthma Rep 15, 28 (2015). https://doi.org/10.1007/s11882-015-0528-y). According to the provided data, the authors do not correct for this effect and based on the current text, it is not addressed whether it is was taken into account. Additionally, what is the justification for dropping all the gene chromosomes? Would the results change if they were not removed?

We agree with the reviewer that sex is an important biological factor to consider. The proportion of males:females is similar in the subset of participants used for building the transcriptional signatures (68:51=1.33) as in the overall cohort (148:103=1.44). The male skew in our cohort reflects the higher predisposition of boys towards asthma compared to girls. We correct for sex effect, but not for menstrual cycle effects, because this variable was not collected in the ALOFT study. Additionally, our sample size would not allow us to correct for too many variables. To account for unmeasured confounders, which may also include the menstrual cycle, we correct for gene expression PCs in the eQTL mapping analysis.

Sex chromosome gene expression data need to be treated with caution as females may have higher expression of X chr gene escaping X inactivation. Y chr gene can of course only be assessed in males, reducing the overall sample size. For these reasons we decided to exclude sex chromosomes from our analyses. Of the 2695 asthma GWAS hits in the GWAS Catalog (ebi.ac.uk/gwas) none have been reported on the Y chromosome and only four associations (across two variants) are reported on the X chromosome.

7. One of the main difficulties while reading the text for me was that some of the topics are not clearly introduced and explained in the main text, while they were super easy to follow the same concepts and ideas in the supplementary information and in the appendix. Could the author modify the text to make it a little bit clearer, please?

We apologize for the lack of clarity. We have now moved the section "Transcriptional signatures aid in denoising the data." from the appendix to the Methods section as detailed below:

"Imputation and de-noising of transcriptional signatures. We assume that the observed variables have high levels of noise and the measured values do not reflect the true biological effects. […] Overall, our procedure should not create any biases but rather decrease the variability of the imputed/denoised variables thus reducing the chance of false positive GxE."

8. The authors indicate that SES transcriptional signature "strong overlap with each other". Regarding this sentence: (a) SES is a very complex item to define and measure. Could the author specify how did they calculate it, e.g., income, education, a conglomerated index? (b) Is it a correlation between which variables, e.g., normalized RNA-seq counts, outputs from the model? Or you are talking about common present/absent transcripts? (c) Based on the heatmap, there are a total of 4 SES-houses unoccupied, subjective SES, parental income and houses rated >=fair, and only houses unoccupied and houses rated >= fair have a strong negative correlation, which one would have expected. Also, only parental income and houses rated >=fair are in the same cluster. (d) "For example, subjective SES was significantly correlated with objective parental responsiveness, family conflict, and self-reported self-disclosure, which is the extent to which the youths talk about their thoughts and feelings (r=-0.26, p=0.004, r=0.25, p=0.006, r=0.53, p=6.8*10-10, respectively)". Are those correlations between the transcriptional signatures associated with each variable? Or between the variables themselves. Could you clarify it, please?Sometimes the authors employed the term transcriptional signatures of subjective SES while other times they just referred to subjective SES. Finally, were those correlations models corrected by age? Did you look at associations with race/ethnicity? Did you correct from age, sex, race, ethnicity, SES?

We agree with the reviewer that this section of the manuscript can be further clarified. We generally refer to correlations between transcriptional signatures, unless we specify the use of the measured variable. Importantly, the correlations between transcriptional signatures are also reflected in the measured variables, as described in our response to comment 1.

Subjective SES is self-reported by the parent using the MacArthur Socioeconomic Status Ladder (Adler et al., 2000) on a scale of 1-10 adjusted so that 1 corresponds to the lowest status and 10 to the highest. Details on this and all the other variables are reported in Supplementary file 2. We have now made it clear throughout the text whenever we refer to the individual measure of Subjective Socioeconomic status by always using the full variable name.

The following text is now included in the methods:

"The parent reported demographics, including their annual income and education level, and completed measures of subjective socioeconomic status (the McArthur ladder ((Adler et al. 2000), adjusted so that 1 corresponds to the lowest status and 10 to the highest), neighborhood stress ((Ewart and Suchday 2002), conflict with their child (the Parental Environment Questionnaire ((Elkins, McGue, and Iacono 1997)), and depressive symptoms (the CES-D ((Radloff 1977))."

The transcriptional signatures were calculated on normalized count data after regressing out the following confounding factors: RIN, percent reads mapping to exons, percent non-duplicate reads, site of RNA extraction, library preparation batch, sample collection wave, age, sex, height, weight, genotype PC1, genotype PC2, genotype PC3. These details are reported in the methods as below:

"First, we normalized the count data using the voom function in the limma v3.38.3 package in R(Law et al. 2014). Second, we regressed out the following confounding factors: RIN, percent reads mapping to exons, percent non-duplicate reads, site of RNA extraction, library preparation batch, sample collection wave, age, sex, height, weight, genotype PC1, genotype PC2, genotype PC3."

9. "Using mediation analysis, we found significant (p<0.05) paths through all three blood composition signatures, such that, at the molecular level, self-disclosure association with higher pulmonary function could be partially explained by an increase in the proportions of monocytes and neutrophils and reduced proportions of lymphocytes (Figure 2b)". Were other physiological inputs (e.g., subjected SES), mediators (e.g., IL13 GC resistance) and/or outputs (e.g., asthma severity) modeled for the mediation analysis? If not, may the authors explain the reasoning of selecting the chosen variables, please?

We agree with the reviewer that more complex scenarios may explain the observed correlations between the transcriptional signatures. Acknowledging these effects and given that the major focus of the manuscript is on the GxE effect in gene expression and asthma, we have decided to remove this section of the results as it would require additional analyses beyond the scope of this manuscript.

10. "To examine the genotype-by-environmental effects… entire cohort of 251 individuals" I am concerned whether the subjects that were employed to identify the transcriptional signatures include representative of those individuals not included in the models in terms of socio-demographic characteristics. Also, the models were validated with only in a subset of psychological measurements-note that psychological measurements were indeed the worst predicted in the validation set. Therefore, I am not sure that the predicted psychological values are adequate. Is there any manner that we can verify their accuracy?

We thank the reviewer for the suggestion. We have now included socio-demographic data separately for the sub-cohort used for training the transcriptional signature models (N=119), and for the entire cohort (N=251) in Supplementary File 1a. There are no differences in socio-demographic measures between the group used for training the transcriptional signature models and the group not included in the model for: sex ratio (chi^2=0.05, p-value=0.83), self-reported ethnicity (chi^2=2.43, p-value=0.49), parent’s income (Wilcoxon-Mann-Whitney W=2982.5, pvalue=0.98), or parent’s education (WilcoxonMann-Whitney W=3658, pvalue=0.07). These results are now included in Supplementary File 1a.

For this revised manuscript, we have collected additional data on 31 variables for samples for which we did not have this information curated in our earlier submission, and we show that GxE eQTLs detected using the transcriptional signatures are enriched for GxE eQTLs detected using the measured variables.

Note that with the measured variables we have limited power due to the smaller sample size and the inherent noise for some of these variables.

These new findings are reported in the Results section:

"To validate these GxE results we expanded the sample size for all variables with a transcriptional signature. We found that the GxE eQTLs detected with the measured variables (Figure 3-Source Data 7) were significantly enriched for low p-values in the GxE eQTLs detected with the transcriptional signatures (Figure 3—figure supplement 1, Figure 3A) and the interaction effects were highly significantly correlated (Supplementary File 1g)."

And in the methods:

"We calculated the enrichment of GxE genes from the measured variables (p-value<0.01) in the set of GxE genes from the transcriptional signatures (p-value<0.01). […] Additionally, we calculated the correlation of the standardized interaction effect size (z-score) for each gene obtained when considering measured variables and corresponding transcriptional signatures."

11. For the physiological analysis, why anxiety measurements were not incorporated, e.g., GAD-7 or similar, as they can trigger asthma symptoms?

We thank the reviewer for this comment. The ALOFT study did not collect GAD-7 or other measures of acute anxiety. Previous studies have shown that while both anxiety and depression are associated with worse asthma-related quality of life, depression is also associated with worse asthma control (Kim, 2006, PMID: 17035436). In our analysis we included child depressive symptoms (Children's Depression Inventory, Kovacs, 1992), but we did not find a significant transcriptional signature.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Zilioli S, Slatcher RB, Pique-Regi R, Luca F. 2021. Asthma in the Lives of Families Today (ALOFT) dbGAP. phs002182.v1.p1

    Supplementary Materials

    Figure 1—source data 1. GLMnet model weights for the transcriptional signatures.

    The transcriptional signatures are in columns, while the genes are in rows.

    Figure 3—source data 1. Results of cis-expression quantitative trait locus (cis-eQTL) mapping permutation pass with FastQTL correcting for top 18 gene expression principal components [PCs] (space-delimited file).

    Columns are as follows. 1: Ensembl ID of the tested gene; 2: number of variants tested in cis for this gene; 3: maximum likelihood estimation (MLE) of the shape1 parameter of the beta distribution; 4: MLE of the shape2 parameter of the beta distribution; 5: dummy; 6: ID of the best variant found for this molecular phenotypes (i.e., with the smallest p-value); 7: distance between the molecular phenotype-variant pair; 8: the nominal p-value of association that quantifies how significant from 0, the regression coefficient is; 9: the slope associated with the nominal p-value of association; 10: a first permutation p-value directly obtained from the permutations with the direct method (corrected nominal p-value that accounts for the fact that multiple variants are tested per molecular phenotype); 11: a second permutation p-value obtained via beta approximation (used to calculate FDR).

    Figure 3—source data 2. Results of cis-expression quantitative trait locus (cis-eQTL) mapping permutation pass with FastQTL without correction for gene expression principal components [PCs] (space-delimited file).

    Columns are as follows. 1: Ensembl ID of the tested gene; 2: number of variants tested in cis for this gene; 3: maximum likelihood estimation (MLE) of the shape1 parameter of the beta distribution; 4: MLE of the shape2 parameter of the beta distribution; 5: dummy; 6: ID of the best variant found for this molecular phenotypes (i.e., with the smallest p-value); 7: distance between the molecular phenotype-variant pair; 8: the nominal p-value of association that quantifies how significant from 0, the regression coefficient is; 9: the slope associated with the nominal p-value of association; 10: a first permutation p-value directly obtained from the permutations with the direct method (corrected nominal p-value that accounts for the fact that multiple variants are tested per molecular phenotype); 11: a second permutation p-value obtained via beta approximation (used to calculate FDR).

    Figure 3—source data 3. Overlap of eGenes with psychosocial effects and significant probabilistic transcriptome-wide association studies (PTWAS) association results (5% FDR) for asthma and allergic disease.

    Columns are as follows. 1: GWAS trait tested in PTWAS; 2: Ensembl ID of the tested gene; 3: most significant tissue in PTWAS; 4: multitissue PTWAS p-value; 5–24: as in Figure 3—source data 4.

    Figure 3—source data 4. Results of cis-interaction-expression quantitative trait locus (eQTL) mapping (tab-delimited file).

    Columns are as follows. 1: Ensembl ID of the tested gene; 2: ID of the tested variant; 3: p-value of the GxE model intercept; 4: p-value of the genotype dosage effect; 5: p-value of the transcriptional signature effect; 6: p-value of the interaction between genotype dosage and transcriptional signature effect; 7: intercept; 8: dosage effect; 9: transcriptional signature effect; 10: interaction between genotype dosage and transcriptional signature effect; 11: intercept standard error (SE); 12: genotype dosage effect SE; 13: transcriptional signature effect SE; 14: interaction between genotype dosage and transcriptional signature effect SE; 15: q-value of the GxE model intercept; 16: q-value of the genotype dosage effect; 17: q-value of the transcriptional signature effect; 18: q-value of the interaction between genotype dosage and transcriptional signature effect; 19: tested transcriptional signature; 20: permutation-corrected p-value of the interaction between genotype dosage and transcriptional signature effect; 21: q-value of the permutation-corrected p-value of the interaction between genotype dosage and transcriptional signature effect.

    Figure 3—source data 5. Overlap of GxE interactions and significant probabilistic transcriptome-wide association studies (PTWAS) association results (5% FDR).

    Columns as in Figure 3—source data 3.

    Figure 3—source data 6. Results of cis-interaction-expression quantitative trait locus (eQTL) mapping after correcting for cell composition effects (tab-delimited file).

    Columns as in Figure 3—source data 4.

    Figure 3—source data 7. Results of cis-interaction-expression quantitative trait locus (eQTL) mapping with measured variables (tab-delimited file).

    Columns as in Figure 3—source data 4.

    Supplementary file 1. Supplementary tables.

    (a) Basic demographic information on 119 individuals used to train the transcriptional signature models (‘training’) and the entire cohort of 251 participants (‘total’). Reported are the count (N) and percentage of non-missing values per each category. For each variable in parentheses, we reported the p-values for significant differences in variable distribution between the training group and the group not included in the model. (b) List of variables collected for the current study. DD: daily diary; SD: sleep diary; EAR: coded from Electronically Activated Recorder; YR: youth reported; PR: parent reported; CD: census data; GC: glucocorticoid; SD: standard deviation; α: Chronbach's alpha measuring reliability as the average correlation between scale items, as a function of the number of items included in the scale, N initial – sample size used for differential gene expression analysis and building transcriptional signatures (mean, SD, and α are reported for this subset), N longitudinal – number of samples with measurements from two time points available, N expanded – final sample size used for transcriptional signature and GxE validations. (c) Evaluation of transcriptional signatures derived using elastic net regression. For each variable, we report Pearson’s correlation coefficient, p-value, cross-validated percent variance explained, and sample size for the training dataset (columns 2–5), and Pearson’s correlation coefficient, p-value, and sample size for the entire cohort (columns 6–8). (d) Longitudinal replication of transcriptional signatures. For each variable, we report Spearman’s correlation coefficient between longitudinal change in observed variable and change in the transcriptional signature, p-value, coefficient of variation of longitudinal change in observed variable, and sample size. (e) Differentially expressed genes (DEGs) for all psychosocial variables (10% FDR; N: sample size). (f) Full DESeq results for differential gene expression analyses for psychosocial variables. (g) Correlations between z-scores of GxE interaction effects measured using transcriptional signatures and observed data. For each variable, we report Spearman’s correlation coefficient and its p-value, and odds ratio and p-value from Fisher’s test for enrichment of nominally significant results (permutation-corrected p-value<0.01) between GxE expression quantitative trait locus (eQTL) testing using transcriptional signatures and observed data. (h) Correlations between the top three principal components of covariate matrix and individual covariates (Pearson's product-moment correlations with numeric variables, polyserial correlations with bivariate variables; ns: correlation p-value>0.05). (i) DEGs associated with blood cell composition (10% FDR; N: sample size). (j) Full DESeq results for differential gene expression analyses for blood composition variables.

    elife-63852-supp1.xlsx (48.9MB, xlsx)
    Supplementary file 2. Detailed descriptions of methods for psychosocial data collection.
    elife-63852-supp2.docx (41.2KB, docx)
    Transparent reporting form
    Reporting standard 1. STREGA reporting recommendations, extended from STROBE Statement.

    Data Availability Statement

    The data are available on dbGAP (accession number: phs002182.v1.p1).

    The following dataset was generated:

    Zilioli S, Slatcher RB, Pique-Regi R, Luca F. 2021. Asthma in the Lives of Families Today (ALOFT) dbGAP. phs002182.v1.p1


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES