Skip to main content
Comprehensive Psychoneuroendocrinology logoLink to Comprehensive Psychoneuroendocrinology
. 2021 Mar 9;6:100045. doi: 10.1016/j.cpnec.2021.100045

Epigenetic prediction of 17β-estradiol and relationship to trauma-related outcomes in women

Laura M Hack a,b,c, Shota Nishitani a,d, Anna K Knight d, Varun Kilaru a, Stephanie A Maddox e,f, Antonia V Seligowski e,f, Tanja Jovanovic g, Kerry J Ressler a,e,f, Alicia K Smith d,a,∗,1, Vasiliki Michopoulos a,1
PMCID: PMC9216622  PMID: 35757356

Abstract

17β-estradiol (E2) levels in women correlate with multiple neuropsychiatric symptoms, including those that are stress-related. Furthermore, prior work from our group has demonstrated that E2 status influences DNA methylation (DNAm) across the genome. We developed and validated a DNAm-based predictor of E2 (one of four naturally occurring estrogens) using a training set of 183 females and a test set of 79 females from the same traumatized cohort. We showed that predicted E2 levels were highly correlated with measured E2 concentrations in our testing set (r ​= ​0.75, p ​= ​1.8e-15). We further demonstrated that predicted E2 concentrations, in combination with measured values, negatively correlated with current post-traumatic stress disorder (PTSD) (β ​= ​−0.38, p ​= ​0.01) and major depressive disorder (MDD) diagnoses (β ​= ​−0.45, p ​= ​0.02), as well as a continuous measure of PTSD symptom severity (β ​= ​−2.3, p ​= ​0.007) in females. Finally, we tested our predictor in an independent data set (n ​= ​85) also comprised of recently traumatized female subjects to determine if the predictor would generalize to a different population than the one on which it was developed. We found that the correlation between predicted and actual E2 concentrations in the external validation data set was also high (r ​= ​0.48, p ​= ​3.0e-6). While further validation is warranted, a DNAm predictor of E2 concentrations will advance our understanding of hormone-epigenetic interactions. Furthermore, such a DNAm predictor may serve as an epigenetic proxy for E2 concentrations and thus provide an important biomarker to better evaluate the contribution of E2 to current and potentially future psychiatric symptoms in samples for which E2 is not measured.

Keywords: DNA methylation, Estrogen, Random forests (RFs), Machine learning, Biomarker, Trauma

Highlights

  • Neuropsychiatric symptoms correlate with estradiol (E2) levels in females.

  • We developed a DNA methylation-based E2 predictor using machine learning approach.

  • Our predictor performed well in an external validation cohort of traumatized women.

  • Predicted E2 concentrations correlated with stress-related phenotypes.

  • Our model may serve as an epigenetic biomarker of E2 status in women.

1. Introduction

17β-estradiol (E2) concentrations in women are associated with greater risk for various neuropsychiatric symptoms, including depression [1] and changes in memory [2]. Although the mechanisms by which E2 influences vulnerability to psychiatric symptoms are not fully understood, E2 has effects on multiple neurotransmitter systems, and brain circuits involved in mood and memory regulation are sensitive to variation in E2 concentrations [1]. Studies generally suggest that lower concentrations of E2 contribute to higher symptomatology and vulnerability to neuropsychiatric disease, supporting a role for E2 as a neurotrophic, neuroprotective, and psychoprotective steroid hormone [3]. Furthermore, administration of synthetic estrogen to women with low endogenous E2 levels, including those who are in perimenopause, post-menopause, or postpartum, improves affective and cognitive symptoms [4].

While E2 is synthesized and secreted by both sexes, peripheral concentrations are higher overall and fluctuate to a greater extent over the course of the menstrual cycle in reproductively-mature women [5]. This fluctuation in E2 is implicated in the higher prevalence, severity, and burden of affective and stressor-related disorders in women as compared to men [3,6]. The relationship between E2 concentrations and stress-related sequelae is a well characterized example of the importance of E2 status on psychiatric symptoms and their underlying neurobiological and molecular substrates. There is evidence that healthy, normally cycling females exposed to a psychosocial stress paradigm have higher subjective distress during periods of low E2 as compared to high E2 [7]. Consistent with this finding, female rats demonstrate higher fear and anxiety-related behaviors during diestrus or metestrous (low E2 phases of their estrous cycle) [8,9], and chronic replacement of E2 in aged, ovariectomized female rats improves anxiety- and depression-like behavior [10]. Similarly, replacement of E2 in ovariectomized female rhesus macaques also improves anxiety-like behavior [11]. These translational findings parallel findings in women showing that low E2 concentrations associate with deficits in fear extinction [12] and inhibition of learned fear responses [13] in both healthy women and those with post-traumatic stress disorder (PTSD). Furthermore, trauma-exposed women experience greater severity of phobic anxiety during a phase of their menstrual cycle characterized by lower E2 (early follicular) as compared to higher E2 (mid luteal) concentrations and this difference is driven by women with PTSD [14]. On a neurobiological level, stress response neural circuitry is more active in women during the low E2, early follicular phase as compared to the high E2, late follicular/midcycle phase [15]. It is important to also note that change in E2 levels has also been found to disrupt affect, although the exact mechanism by which this occurs is unknown [16].

Overall, preclinical and clinical data indicate that E2 status impacts vulnerability to and presentation of psychiatric symptoms in women and highlights the importance of assessing E2 status in studies characterizing female risk for stress- and trauma-related adverse outcomes. However, most studies assessing psychiatric symptoms do not measure E2 concentrations, but an increasing number of studies have genome-wide methylation data available. Because E2 concentrations correlate with DNA methylation (DNAm) in women measured within the same menstrual cycle [17], DNAm signatures may serve as a surrogate of endogenous E2 concentrations within a cycle and potentially across cycles. We note there is conflicting evidence regarding the degree of intra-individual stability of steroid hormones across menstrual cycles. For example [18], showed that E2 and progesterone (P4) levels derived from saliva were stable across two consecutive cycles and [19] demonstrated stability of urinary metabolites of E2 and P4, while there was less consistency in these salivary measures in a study by Ref. [20]. While the literature suggests that there is a relationship between E2 levels and stress- and trauma-related symptoms within a particular menstrual cycle and further studies are necessary to assess the degree to which E2 levels from one cycle can predict symptoms in another cycle, the goals of the present study were to develop a methylation-based predictor of E2 concentrations using epigenome-wide data and to assess the relationship of predicted E2 levels with the presence of current PTSD and MDD diagnoses, and PTSD symptom severity in women. We hypothesized that a DNAm-based prediction of low E2 status would be associated with greater risk for current PTSD and MDD diagnosis and greater PTSD symptom severity in traumatized women.

2. Materials and methods

2.1. DNAm predictor development in women (GTP)

We developed our DNAm-based E2 predictor using data collected as part of the Grady Trauma Project (GTP), a civilian study of trauma exposure in a predominately African American population from a large urban hospital in Atlanta, GA, USA [21]. Briefly, subjects were approached in the waiting rooms of primary care and OBGYN clinics as well as at the hospital outpatient pharmacy. Exclusion criteria included being under age 18, actively psychotic, or having intellectual disability. Participants underwent a screening interview based on screening forms and scales that were read aloud by a volunteer or project staff member, due to the varying literacy of the subjects. Randomly selected subjects were invited to further participate in a clinician-administered structured interview to assess the presence or absence of psychiatric diagnoses on a different day [21]. The overall project was approved by the Institutional Review Board at Emory University and the Grady Health Systems Research Oversight Committee. All participants signed informed consent prior to the start of the screening interview.

In the present study, we used a continuous measure of PTSD symptom severity obtained from the Clinician-Administered PTSD scale (CAPS) [22] for DSM-IV in a subset of participants during an earlier recruitment period and DSM-5 in a later recruitment period. The CAPS is a 30-item self-report scale with good psychometric properties across clinical populations and research settings [23]. Current PTSD diagnosis was based on the CAPS. Presence of each of the 17 DSM-IV or 20 DSM-5 diagnostic criteria for PTSD was determined using a frequency rating of 1 or higher paired with an intensity score of 2 or higher. Current MDD diagnosis was determined using the Structured Clinical Interview for DSM-IV [24].

2.1.1. Blood-based assays in GTP

Study participants provided whole blood in EDTA tubes for biological assays, and serum collected and frozen at −80 ​°C until time of hormone assay. Total E2 concentrations were assayed at Yerkes Biomarkers Core Laboratory at Emory University using a commercially available radioimmunoassay kit (KE2D1; Siemens Healthcare Diagnostics) as previously described [17]. E2 concentrations were natural log transformed due to non-normal distributions.

DNA methylation was assessed using the HumanMethylation450 BeadChip (Illumina) as described in previous work [25]. Briefly, 1 ​μg of DNA underwent bisulfite treatment and the >485,000 probes on the array were interrogated for methylation status. Beta values were generated with BeadStudio and set to missing (no call) if detection p-values exceeded 0.001. CpGassoc [26] was used to exclude samples with probe detection call rates <95% and those with an average intensity value of either <50% of the experiment-wide sample mean or <2,000 arbitrary units. In addition, CpG sites with missing data for >10% of samples and probes that cross-hybridize between autosomes and sex chromosomes were removed [27]. Beta Mixture Quantile dilation was utilized to normalize each dataset [28]. The method described by Houseman and colleagues was used to estimate the proportion of granulocytes and lymphocytes in our whole blood DNA samples [29,30].

After quality control steps, we limited probes to transcripts expressed in the blood to correspond with our previous analysis assessing the relationship between E2 concentrations and epigenetic data [17], leaving 87,388 probes corresponding to 15,877 transcripts. To maintain our predictor’s compatibility with the Infinium MethylationEPIC BeadChip (Illumina), we further limited the number of probes to those that are overlapping between both arrays (n ​= ​81,637). Additionally, due to variability of correlation of probes between the two chips, we chose to restrict the analysis to a subset of CpGs that correlated at r ​> ​0.2 [31] in order to have a sufficient number of starting probes that were at least weakly correlated on both chips (n ​= ​27,335). After excluding probes that had any missingness for 262 female subjects with E2 data, we were left with 23,209 probes to enter into the analysis. Finally, we randomly divided the subjects into training (n ​= ​183, 80%) and testing (n ​= ​79, 20%) sets.

2.1.2. Predictor selection and model building

To select a parsimonious set of predictors, we entered age, cellular proportions (CD4, CD8, CD14, CD19, CD56), and CpGs (n ​= ​23,209) from female participants in our training set into an elastic net regression algorithm using log-transformed E2 concentrations as the outcome. We set the elastic net mixing parameter, alpha, to 0.5 to allow for equal contribution of the ridge and LASSO methods [32], as this has performed well previously for developing DNAm-based predictors of age in adults [33] and gestational age in neonates [34]. We then used the features identified by the elastic net algorithm to train a random forest (RF) model, as RFs have demonstrated good performance in high-dimensional data in which there are many more predictors than respondents, and they can model non-linear relationships between variables [35]. RFs are a recursive partitioning method in which predictors are divided into decision trees through splits from a parent to child nodes using a random subset of predictors [36]. We used 500 trees and predictors selected at each split. While RFs can be used for feature selection, we elected to use elastic net for this task due to the computational intensity of using RFs for feature selection. The use of a regularized regression method for feature selection and a tree-based algorithm for predictive model building has demonstrated good performance in a prior study in a psychiatric sample [37]. Analyses were performed using R with the packages glmnet and randomForest.

2.2. External DNAm validation sample

We used data from the prospective Grady Predictive Biomarkers Emergency Department (ED) study to externally validate our DNAm-based E2 predictor in women (n ​= ​85). Subjects presenting to the emergency room at Grady Memorial Hospital were recruited to participate if they had experienced a trauma within the past 72 hours and met DSM-IV diagnostic criterion A for PTSD [38]. Exclusion criteria included being <18, having a current episode or past history of mania, schizophrenia, or other psychoses, endorsing prominent suicidal ideation within the past month (having frequent and/or intense thoughts about killing oneself), and experiencing a loss of consciousness for more than five minutes as a result of the traumatic event. Individuals were also excluded if they were currently intoxicated, not alert, oriented, or coherent, or were in active labor, respiratory distress, or hemodynamically compromised. Overall, this ED sample was more ethnically diverse than the GTP cohort [39]. The overall project was also approved by the Institutional Review Board at Emory University and the Grady Health Systems Research Oversight Committee.

2.2.1. Blood-based assays in ED sample

Venous blood samples were collected from participants in the ED following trauma exposure by medical staff using standard techniques [40]. Within six hours of collection, EDTA tubes were centrifuged at 4 ​°C and plasma was frozen at −80 ​°C until time of hormone assay. Total E2 concentrations were measured using a commercially available radioimmunoassay kit (DiaMetra) with an inter-assay coefficient of variance (CV) of 2.8% and intra-assay CV of <9%. To help determine how specific the predictor is to E2 concentrations and not other commonly-assessed steroid hormones, we assessed the correlations of the DNAm-based values to the total measured values of three other steroid hormones, including progesterone (P4), cortisol, and cortisone. Total progesterone, cortisol, and cortisone concentrations were assayed using liquid-chromatography-mass-spectrometry (LC-MS) using previously validated protocols [41].

DNA methylation data was generated for 48 female subjects with E2 data using the HumanMethylation450 BeadChip (ED 450K), while the same data was generated for 37 additional subjects with E2 data using the Infinium MethylationEPIC BeadChip (ED EPIC) with the protocol described above. Methylation data for these 85 subjects were combined. Missing CpGs were imputed using k-nearest neighbors in the R package impute using the average values of CpGs in the GTP training cohort as the gold standard.

2.3. Correlation of E2 DNAm with stress-related outcomes

We determined whether the DNAm-based E2 concentrations correlated with stress-related phenotypes in the GTP cohort of women using regression analyses. We assessed the relationship between predicted E2 levels and stress-related outcomes, including current PTSD and MDD diagnoses and PTSD symptom severity, collected on the same day as blood sample collection for the methylation data. In addition to the 79 subjects from the testing set, we also generated DNAm-based E2 levels for an additional 40 subjects with DNA methylation data but no measured estradiol levels (total n ​= ​119). Of all subjects with predicted E2 concentrations, 99 had stress-related outcome measures, including current PTSD and MDD diagnoses and CAPS severity score. We entered age and whether the subjects had measured or predicted E2 levels as covariates. Supplementary Figure 1 shows a CONSORT diagram illustrating the number of GTP subjects in the different analyses.

3. Results

3.1. Predictor development sample (GTP)

3.1.1. Descriptive results for GTP females

Table 1 shows the sociodemographic and hormonal characteristics for GTP females. Age for the total sample of 262 females ranged from 18 to 77 with a mean of 40.3 years old. The range of E2 concentrations in the total GTP sample was 2.85–546 ​pg/mL with a geometric mean of 29.6 ​pg/mL, representing women during childbearing years, pregnancy, and menopause. The geometric mean values for measured E2 in the training and testing sets were 31.9 ​pg/mL and 24.8 ​pg/mL, respectively. Supplementary Figure 2 shows the distributions of measured E2 in the training (A) and testing (B) data sets, which were comparable (p ​= ​0.22). Of the 262 total women in the GTP sample, 215 had data for stress-related outcomes, including 150 females in the training set and 65 females in the testing set. 34 (23%) and 17 (11%) subjects in the training set met criteria for current PTSD and MDD, while 13 (20%) and 10 (15%) in the testing set met criteria for these diagnoses, respectively (p’s ​> ​0.05).

Table 1.

Sociodemographic and hormonal characteristics in the Grady Trauma Project (GTP).

Total female GTP sample Training Testing
Number of subjects 262 183 79
Age (years), mean (SD), range 40.3 (13.3), 18-77 40.1 (13.4), 18-77 40.7 (13.1), 18-70
Measured E2 (pg/mL), geometric mean, range 29.6, 2.85–546 31.9, 2.85–546 24.8, 2.85–546
Measured Childbearing E2, n (%), 30–400 ​pg/mL 96 (37%) 71 (39%) 25 (32%)
Measured Pregnancy E2, n (%), >400 ​pg/mL 26 (10%) 18 (10%) 8 (10%)
Measured Postmenopausal E2, n (%), <30 ​pg/mL 140 (53%) 94 (51%) 46 (58%)
Current PTSD, n (%) 47 (21%), total ​= ​215 34 (23%), total ​= ​150 13 (20%), total ​= ​65
Current MDD, n (%) 27 (13%), total ​= ​215 17 (11%), total ​= ​150 10 (15%), total ​= ​65
CAPS Severity, mean (SD), range 15.8 (14.7), 0–68.2 14.8 (14.7), 4.1–55.9 18.0 (14.7), 0–68.2

3.1.2. Model performance in GTP females

Elastic net regression selected 35 CpG sites (Supplementary Table 1) and age as predictive of E2 concentrations. The sites were distributed throughout the genome and were located in gene bodies, 5′ and 3’ UTRs, and transcriptional start sites of genes. There was no enrichment of CpGs in islands, shores, shelves, or enhancers. 86% of the CpG sites selected by the elastic net algorithm were CpGs associated with E2 concentrations (FDR< 5%) in our previous epigenome-wide study (p ​= ​1.0e-10) [17]. We entered the 35 CpG sites and age selected by elastic net into a RF algorithm to generate the predictor. The correlation between the predicted E2 concentrations generated by the RF model and the measured concentrations was 0.98 (p ​< ​2.2e-16) in the training data, indicating strong model fit (Fig. 1A). In the GTP testing data, the correlation between the measured and DNAm predicted E2 concentrations was 0.75 (p ​= ​1.8e-15; Fig. 1B), also indicating a strong fit.

Fig. 1.

Fig. 1

Correlation between predicted and measured E2 concentrations in (A) GTP training (n ​= ​183) and (B) GTP testing (n ​= ​79) data.

3.2. External validation of DNAm E2 predictor in ED study

Table 2 shows the sociodemographic and hormonal characteristics for the ED data. The range of measured E2 concentrations in the total sample of 85 females was 13.3–2511.4 ​pg/mL with a geometric mean of 54.2 ​pg/mL (distribution shown in Supplementary Figure 2C). Age for these subjects ranged from 18 to 63 years old with a mean of 34.8 years old. We found a correlation of 0.48 (p ​= ​3.0e-6) between measured E2 concentrations and those predicted from the RF model generated from GTP data (Fig. 3). The correlation may have been lower in this external validation data set as compared to the GTP testing set due to differences in when the epigenetic data was obtained and the fact that the subjects in the ED study had recently experienced a trauma. A sensitivity analysis excluding the subjects with high E2 and who were likely pregnant (E2 levels ​> ​400 ​pg/mL) remained significant, although with a lower effect size (r ​= ​0.23, p ​= ​0.04). We were also able to perform a sensitivity analysis for body mass index (BMI) in this cohort, and, using a partial Pearson correlation that corrected for BMI, noted comparable correlation between measured and predicted E2 (r ​= ​0.49, p ​= ​2.3e-06).

Table 2.

Sociodemographic and hormonal characteristics in Grady Predictive Biomarkers ED Study.

Total female ED sample ED 450K ED EPIC
Number of subjects 85 48 37
Age (years), mean (SD), range 34.8 (13.6), 18-63 32.6 (14.0), 18-63 37.5 (12.7), 19-60
Measured E2 (pg/mL), geometric mean, range 54.2, 13.3–2511.4 50.6, 13.3–2511.4 59.2, 21.2–2215.9
Childbearing E2, N (%),
30–400 ​pg/mL
34 (40%) 22 (46%) 25 (68%)
Pregnancy E2, N (%),
>400 ​pg/mL
4 (5%) 3 (6%) 1 (2%)
Postmenopausal E2, N (%),
<30 ​pg/mL
47 (55%) 23 (48%) 11 (30%)
Measured progesterone (ng/mL), geometric mean, range 0.7, 0.029–163 0.7, 0.029–163 1.2, 0.7–88.8
Measured cortisone (ng/mL), geometric mean, range 26.3, 3.5–56.0 28.9, 3.5–56.0 23.2, 8.5–44.4
Measured cortisol (ng/mL), geometric mean, range 220.8, 35.4–832.0 280.3, 35.4–832.0 159.2, 36.4–630.5

Fig. 3.

Fig. 3

Correlation between predicted and measured E2 concentrations in the ED Study. Circles show subjects (ED 450K, n ​= ​48) whose methylation data was generated using the HumanMethylation450 BeadChip and squares represent subjects (ED EPIC, n ​= ​37) that were run on the MethylationEPIC BeadChip.

Next, we assessed the correlations between model-generated values and three other measured hormones, including progesterone, cortisone, and cortisol to determine the specificity of the predictor. Table 2 shows ranges and geometric means for measured progesterone, cortisone, and cortisol in ng/mL. As shown in Table 3, the correlation between measured E2 and measured progesterone was strong and significant, whereas the correlations between measured E2 and the other measured hormones were weaker (cortisone) or non-significant (cortisol). Consistent with these correlations, the predicted E2 concentrations correlated highly with measured progesterone concentrations (r ​= ​0.52, p ​= ​4.0e-7), but not with measured cortisone (r ​= ​0.11, p ​= ​0.28) or cortisol (r ​= ​−0.03, p ​= ​0.72) concentrations.

Table 3.

Correlations between measured hormone levels in the ED Study. Correlation coefficients (r) are provided for all pairwise comparisons. Significant correlations (<2.2e-16 ​< ​p ​< ​9.8e-3) are indicated by bold text.

E2 Cortisol Cortisone Progesterone
E2 (pg/mL) 0.15 0.29 0.93
Cortisol 0.15 0.39 0.15
Cortisone 0.29 0.39 0.28
Progesterone 0.93 0.15 0.28

Number of subjects with data for each hormone level: E2 (n ​= ​85), cortisol (n ​= ​83), cortisone (n ​= ​83), progesterone (n ​= ​84).

3.3. Correlation of DNAm predictions with stress-related phenotypes in GTP women

When we examined the association between predicted E2 and stress-related outcomes in 99 GTP females with phenotypic data (24% PTSD) using age as a covariate, there were negative associations with all three stress-related outcomes but they were not significant. A power calculation using a standard power level of 0.8, an alpha of 0.05, and two predictors (E2 concentration and age) indicated that the sample of subjects with predicted E2 concentrations (n ​= ​99) would be insufficient to detect effect sizes in the small range (Cohen’s f2 ​< ​0.10) in regression analyses [42]. Thus, we assessed the combination of both predicted E2 (n ​= ​99) and measured E2 (n ​= ​150) concentrations in a total of 249 GTP females with age and type of E2 (predicted or measured) as covariates. These analyses showed significant, negative relationships between E2 values and both current PTSD (β ​= ​−0.38, p ​= ​0.01) (Fig. 2A), current MDD (β ​= ​−0.45, p ​= ​0.02) (2B) and CAPS severity (β ​= ​−2.3, p ​= ​0.007) (2C), showing that the low E2 DNAm levels are associated with greater stress-related outcomes.

Fig. 2.

Fig. 2

The combination of predicted (n ​= ​99) and measured (n ​= ​150) E2 concentrations for 249 female GTP subjects were lower in those with (A) current PTSD (p ​= ​0.01) and (B) current MDD (p ​= ​0.02) then those without those diagnoses after correction for covariates. There was also a negative relationship between (C) CAPS severity and the combination of predicted and measured E2 concentrations (p ​= ​0.007) after correction for covariates.

4. Discussion

DNA methylation has been used for prediction of several traits, including age [33], gestational age [34], cancer diagnosis and prognosis [43], and smoking status [44]. To our knowledge, the current study is the first to predict E2 concentrations using epigenome-wide methylation data and relate these E2 predictions to stress-related phenotypes in women. Our DNAm-based E2 predictor generated values with strong correlations between measured E2 concentrations in both a holdout testing set and an independent validation cohort of traumatized women. Among the 35 CpGs that can be used to impute E2 concentrations, many associate independently with E2 concentrations in a previous study [17]. For example, cg02187522 is located within arginine vasopressin, which encodes a product that is cleaved into arginine vasopressin (AVP). Our finding that higher E2 was associated with lower DNA methylation within AVP is consistent with data showing that E2 increases AVP expression [45]. Consistent with the correlation between measured levels of E2 and progesterone, our model’s predicted values were highly correlated with both E2 and progesterone. As such, our DNAm predictor may more broadly reflect menstrual phase or more general reproductive status in women, though this study did not have sufficient information to evaluate that possibility.

In the current study, we found that predicted E2 levels correlated negatively with current PTSD diagnosis and PTSD symptom severity in women. Consistent with the present results, both human and animal studies have generally found greater severity of psychological symptoms [14] or fear-related behaviors [8,9,46] during assessment if a female was in a low E2 phase of her cycle. However, one study has found that women with PTSD in the high E2 (luteal) phase had higher levels of a particular PTSD symptom - flashbacks [47]. Furthermore, low E2 concentrations are associated with deficits in fear extinction [12] and inhibition of learned fear responses in women [13]. One possible application for this E2 DNAm predictor is to determine E2 status in publicly available data sets for which epigenome-wide methylation and psychiatric measures but no E2 assay data exist. DNAm-based predictions of E2 may serve as a biomarker to further assess the relationship between hormone status and psychiatric symptoms. Future studies will be necessary to characterize the degree to which this E2 DNAm predictor changes longitudinally through different menstrual and reproductive phases.

One limitation of the current work is that we used blood-based DNAm data to develop our predictor, and we may have obtained different results if we had used methylation data from other human tissues or cell types. Thus, our predictor will need to be evaluated in different tissues and compartments to assess specificity. Future studies will need to determine whether the model accurately predicts E2 concentrations in urine and saliva, as some research groups have epigenetic data from these sources. Furthermore, our DNAm predictor was modeled on total E2 concentrations, and not free levels of E2. A further limitation is that the predicted range of E2 concentrations in both the holdout testing set and independent validation sets was narrower than the measured values. The limited range of the predicted values restricts the ability of our model to precisely characterize women according to clinical classifications such as childbearing, pregnant, or postmenopausal [40,48]. Similarly, because many of these steroid hormones are correlated, this method does not specifically discriminate between them. As additional samples and data become available, we hope that it can be further refined. An additional limitation is that the ED validation cohort used plasma samples for steroid hormone testing while GTP measures were assayed from serum. While this difference in biological compartment could introduce variability in our predictor, recent data indicates that serum and plasma concentrations of E2 in women are comparable [48]. Additionally, different radioimmunoassays (KE2D1; Siemens Healthcare Diagnostics vs DiaMetra) were used for the ED and GTP cohorts with the potential for systematic differences in the E2 measurements between the two samples. Future incorporation of additional clinical variables may help to better align the predicted values with measured E2 concentrations. However, we chose to optimize the model for use with minimal demographic data to maximize its potential for use in research studies. Furthermore, the elastic net method we employed chooses a parsimonious subset of features (i.e., CpG sites), selecting only one among highly correlated sites, which introduces an element of chance into CpG selection. Finally, our DNAm predictor will need to be validated in different ethnicities, as methylation of CpG sites can differ by race or ethnicity [49], as well as tested in subjects less than 18 years old. Despite these limitations, this methylation risk score offers the opportunity for researchers to leverage epigenetic data as a surrogate marker of E2 concentrations to evaluate the influence of E2 status on risk and symptom severity of neuropsychiatric conditions and beyond.

Author contributions

LMH conducted all analysis and wrote the initial draft with AKS and VM. TJ, VM, and KJR collected the data and provided clinical expertise for the cohorts. SN and AKS generated methylation data contributed in the quality control of the biological assays. LMH, SN, VK, and AKK conducted the statistical analysis. SAM, AVS, and VM provided critical feedback regarding hormone assays and interpretation of results. All authors reviewed and edited the final manuscript.

Role of funding sources

This research was supported by the National Institutes of Health Grants MH094757 (KJR), MH071537 (KJR), MH096764 (KJR), HD085850 (VM), MH115174 (VM), MH085806 (AKS), NARSAD YI award #19233 (AKS), and the training grant MH101079. The funding source did not participate in the study design, data collection, analysis, interpretation, or preparation of the manuscript.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

The authors thank the participants of the study and the GTP staff for their assistance with participant recruitment and data collection.

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.cpnec.2021.100045.

Appendix A. Supplementary data

The following is the Supplementary data to this article:

Multimedia component 1
mmc1.docx (3.7MB, docx)

References

  • 1.Wharton W., Gleason C.E., Olson S.R., Carlsson C.M., Asthana S. Neurobiological underpinnings of the estrogen - mood relationship. Curr. Psychiatr. Rev. 2012;8:247–256. doi: 10.2174/157340012800792957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Luine V.N. Estradiol and cognitive function: past, present and future. Horm. Behav. 2014;66:602–618. doi: 10.1016/j.yhbeh.2014.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Shors T.J., Leuner B. Estrogen-mediated effects on depression and memory formation in females. J. Affect. Disord. 2003;74:85–96. doi: 10.1016/s0165-0327(02)00428-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Fischer B., Gleason C., Asthana S. Effects of hormone therapy on cognition and mood. Fertil. Steril. 2014;101:898–904. doi: 10.1016/j.fertnstert.2014.02.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Vrtačnik P., Ostanek B., Mencej-Bedrač S., Marc J. The many faces of estrogen signaling. Biochem. Med. 2014;24:329–342. doi: 10.11613/BM.2014.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Li S.H., Graham B.M. Why are women so vulnerable to anxiety, trauma-related and stress-related disorders? The potential role of sex hormones. Lancet Psychiatry. 2017;4:73–82. doi: 10.1016/S2215-0366(16)30358-3. [DOI] [PubMed] [Google Scholar]
  • 7.Albert K., Pruessner J., Newhouse P. Estradiol levels modulate brain activity and negative responses to psychosocial stress across the menstrual cycle. Psychoneuroendocrinology. 2015;59:14–24. doi: 10.1016/j.psyneuen.2015.04.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Marcondes F.K., Miguel K.J., Melo L.L., Spadari-Bratfisch R.C. Estrous cycle influences the response of female rats in the elevated plus-maze test. Physiol. Behav. 2001;74:435–440. doi: 10.1016/s0031-9384(01)00593-5. [DOI] [PubMed] [Google Scholar]
  • 9.Mora S., Dussaubat N., Diaz-Veliz G. Effects of the estrous cycle and ovarian hormones on behavioral indices of anxiety in female rats. Psychoneuroendocrinology. 1996;21:609–620. doi: 10.1016/s0306-4530(96)00015-7. [DOI] [PubMed] [Google Scholar]
  • 10.Walf A.A., Paris J.J., Frye C.A. Chronic estradiol replacement to aged female rats reduces anxiety-like and depression-like behavior and enhances cognitive performance. Psychoneuroendocrinology. 2009;34:909–916. doi: 10.1016/j.psyneuen.2009.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Michopoulos V., Checchi M., Sharpe D., Wilson M.E. Estradiol effects on behavior and serum oxytocin are modified by social status and polymorphisms in the serotonin transporter gene in female rhesus monkeys. Horm. Behav. 2011;59:528–535. doi: 10.1016/j.yhbeh.2011.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Glover G.H., Mueller B.A., Turner J.A., van Erp T.G., Liu T.T., Greve D.N., Voyvodic J.T., Rasmussen J., Brown G.G., Keator D.B., Calhoun V.D., Lee H.J., Ford J.M., Mathalon D.H., Diaz M., O’Leary D.S., Gadde S., Preda A., Lim K.O., Wible C.G., Stern H.S., Belger A., McCarthy G., Ozyurt B., Potkin S.G. Function biomedical informatics research network recommendations for prospective multicenter functional MRI studies. J. Magn. Reson. Imag. 2012;36:39–54. doi: 10.1002/jmri.23572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Glover E.M., Jovanovic T., Norrholm S.D. Estrogen and extinction of fear memories: implications for posttraumatic stress disorder treatment. Biol. Psychiatr. 2015;78:178–185. doi: 10.1016/j.biopsych.2015.02.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Nillni Y.I., Pineles S.L., Patton S.C., Rouse M.H., Sawyer A.T., Rasmusson A.M. Menstrual cycle effects on psychological symptoms in women with PTSD. J. Trauma Stress. 2015;28:1–7. doi: 10.1002/jts.21984. [DOI] [PubMed] [Google Scholar]
  • 15.Goldstein J.M., Jerram M., Abbs B., Whitfield-Gabrieli S., Makris N. Sex differences in stress response circuitry activation dependent on female hormonal cycle. J. Neurosci. 2010;30:431–438. doi: 10.1523/JNEUROSCI.3021-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Schiller C.E., Johnson S.L., Abate A.C., Schmidt P.J., Rubinow D.R. Reproductive steroid regulation of mood and behavior. Comp. Physiol. 2016;6:1135–1160. doi: 10.1002/cphy.c150014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Maddox S.A., Kilaru V., Shin J., Jovanovic T., Almli L.M., Dias B.G., Norrholm S.D., Fani N., Michopoulos V., Ding Z., Conneely K.N., Binder E.B., Ressler K.J., Smith A.K. Estrogen-dependent association of HDAC4 with fear in female mice and women with PTSD. Mol. Psychiatr. 2018;23:658–665. doi: 10.1038/mp.2016.250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Gann P.H., Giovanazzi S., Van Horn L., Branning A., Chatterton R.T. Saliva as a medium for investigating intra- and interindividual differences in sex hormone levels in premenopausal women. Canc. Epidemiol. Biomarkers Prev. 2001;10:59–64. [PubMed] [Google Scholar]
  • 19.Ferrell R.J., O’Connor K.A., Rodríguez G., Gorrindo T., Holman D.J., Brindle E., Miller R.C., Schechter D.E., Korshalla L., Simon J.A., Mansfield P.K., Wood J.W., Weinstein M. Monitoring reproductive aging in a 5-year prospective study: aggregate and individual changes in steroid hormones and menstrual cycle lengths with age. Menopause. 2005;12:567–577. doi: 10.1097/01.gme.0000172265.40196.86. [DOI] [PubMed] [Google Scholar]
  • 20.Chatterton R.T., Mateo E.T., Hou N., Rademaker A.W., Acharya S., Jordan V.C., Morrow M. Characteristics of salivary profiles of oestradiol and progesterone in premenopausal women. J. Endocrinol. 2005;186:77–84. doi: 10.1677/joe.1.06025. [DOI] [PubMed] [Google Scholar]
  • 21.Gillespie C.F., Bradley B., Mercer K., Smith A.K., Conneely K., Gapen M., Weiss T., Schwartz A.C., Cubells J.F., Ressler K.J. Trauma exposure and stress-related disorders in inner city primary care patients. Gen. Hosp. Psychiatr. 2009;31:505–514. doi: 10.1016/j.genhosppsych.2009.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Blake D.D., Weathers F.W., Nagy L.M., Kaloupek D.G., Gusman F.D., Charney D.S., Keane T.M. The development of a clinician-administered PTSD scale. J. Trauma Stress. 1995;8:75–90. doi: 10.1007/BF02105408. [DOI] [PubMed] [Google Scholar]
  • 23.Weathers F.W., Keane T.M., Davidson J.R. Clinician-administered PTSD scale: a review of the first ten years of research. Depress. Anxiety. 2001;13:132–156. doi: 10.1002/da.1029. [DOI] [PubMed] [Google Scholar]
  • 24.First M.B., Spitzer R.L., Gibbon M., Williams J.B.W. American Psychiatric Association; Washington, D.C: 1997. Structured Clinical Interview for DSM-IV Axis I Disorders. [Google Scholar]
  • 25.Mehta D., Klengel T., Conneely K.N., Smith A.K., Altmann A., Pace T.W., Rex-Haffner M., Loeschner A., Gonik M., Mercer K.B., Bradley B., Muller-Myhsok B., Ressler K.J., Binder E.B. Childhood maltreatment is associated with distinct genomic and epigenetic profiles in posttraumatic stress disorder. Proc. Natl. Acad. Sci. U. S. A. 2013;110:8302–8307. doi: 10.1073/pnas.1217750110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Barfield R., Kilaru V., Smith A., Conneely K. Bioinformatics in Press. 2012. CpGassoc: an R function for analysis of DNA methylation microarray data. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Chen Y.A., Lemire M., Choufani S., Butcher D.T., Grafodatskaya D., Zanke B.W., Gallinger S., Hudson T.J., Weksberg R. Discovery of cross-reactive probes and polymorphic CpGs in the illumina Infinium HumanMethylation450 microarray. Epigenetics. 2013;8:203–209. doi: 10.4161/epi.23470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Teschendorff A.E., Marabita F., Lechner M., Bartlett T., Tegner J., Gomez-Cabrero D., Beck S. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinformatics. 2013;29:189–196. doi: 10.1093/bioinformatics/bts680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Houseman E.A., Accomando W.P., Koestler D.C., Christensen B.C., Marsit C.J., Nelson H.H., Wiencke J.K., Kelsey K.T. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinf. 2012;13:86. doi: 10.1186/1471-2105-13-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Reinius L.E., Acevedo N., Joerink M., Pershagen G., Dahlen S.E., Greco D., Soderhall C., Scheynius A., Kere J. Differential DNA methylation in purified human blood cells: implications for cell lineage and studies on disease susceptibility. PloS One. 2012;7 doi: 10.1371/journal.pone.0041361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Logue M.W., Smith A.K., Wolf E.J., Maniates H., Stone A., Schichman S.A., McGlinchey R.E., Milberg W., Miller M.W. The correlation of methylation levels measured using Illumina 450K and EPIC BeadChips in blood samples. Epigenomics. 2017;9:1363–1371. doi: 10.2217/epi-2017-0078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Friedman J., Hastie T., Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Software. 2010;33:1–22. [PMC free article] [PubMed] [Google Scholar]
  • 33.Horvath S. DNA methylation age of human tissues and cell types. Genome Biol. 2013;14:R115. doi: 10.1186/gb-2013-14-10-r115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Knight A.K., Craig J.M., Theda C., Bækvad-Hansen M., Bybjerg-Grauholm J., Hansen C.S., Hollegaard M.V., Hougaard D.M., Mortensen P.B., Weinsheimer S.M., Werge T.M., Brennan P.A., Cubells J.F., Newport D.J., Stowe Z.N., Cheong J.L., Dalach P., Doyle L.W., Loke Y.J., Baccarelli A.A., Just A.C., Wright R.O., Téllez-Rojo M.M., Svensson K., Trevisi L., Kennedy E.M., Binder E.B., Iurato S., Czamara D., Räikkönen K., Lahti J.M., Pesonen A.K., Kajantie E., Villa P.M., Laivuori H., Hämäläinen E., Park H.J., Bailey L.B., Parets S.E., Kilaru V., Menon R., Horvath S., Bush N.R., LeWinn K.Z., Tylavsky F.A., Conneely K.N., Smith A.K. An epigenetic clock for gestational age at birth based on blood methylation data. Genome Biol. 2016;17:206. doi: 10.1186/s13059-016-1068-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Chen X., Ishwaran H. Random forests for genomic data analysis. Genomics. 2012;99:323–329. doi: 10.1016/j.ygeno.2012.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Breiman L. Random forests. Mach. Learn. 2001;45:5–32. [Google Scholar]
  • 37.Chekroud A.M., Zotti R.J., Shehzad Z., Gueorguieva R., Johnson M.K., Trivedi M.H., Cannon T.D., Krystal J.H., Corlett P.R. Cross-trial prediction of treatment outcome in depression: a machine learning approach. Lancet Psychiatry. 2016;3:243–250. doi: 10.1016/S2215-0366(15)00471-X. [DOI] [PubMed] [Google Scholar]
  • 38.Thompson N.J., Fiorillo D., Rothbaum B.O., Ressler K.J., Michopoulos V. Coping strategies as mediators in relation to resilience and posttraumatic stress disorder. J. Affect. Disord. 2018;225:153–159. doi: 10.1016/j.jad.2017.08.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Michopoulos V., Maples-Keller J., Roger E.I., Beaudoin F.L., Sumner J.A., Rothbaum B.O., Hudak L., Gillespie C.F., Kronish I.M., McLean S.A., Ressler K.J. Nausea in the peri-traumatic period is associated with prospective risk for PTSD symptom development. Neuropsychopharmacology. 2019;44(4):668–673. doi: 10.1038/s41386-018-0276-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Michopoulos V., Beurel E., Gould F., Dhabhar F.S., Schultebraucks K., Galatzer-Levy I., Rothbaum B.O., Ressler K.J., Nemeroff C.B. Association of prospective risk for chronic PTSD symptoms with low TNFalpha and IFNgamma concentrations in the immediate aftermath of trauma exposure. Am J Psychiatry, appiajp201919010039. 2019 doi: 10.1176/appi.ajp.2019.19010039. [DOI] [PubMed] [Google Scholar]
  • 41.Franke A.A., Custer L.J., Morimoto Y., Nordt F.J., Maskarinec G. Analysis of urinary estrogens, their oxidized metabolites, and other endogenous steroids by benchtop orbitrap LCMS versus traditional quadrupole GCMS. Anal. Bioanal. Chem. 2011;401:1319–1330. doi: 10.1007/s00216-011-5164-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Soper D.S. 2018. A-priori Sample Size Calculator for Multiple Regression. [Software] [Google Scholar]
  • 43.Hao X., Luo H., Krawczyk M., Wei W., Wang W., Wang J., Flagg K., Hou J., Zhang H., Yi S., Jafari M., Lin D., Chung C., Caughey B.A., Li G., Dhar D., Shi W., Zheng L., Hou R., Zhu J., Zhao L., Fu X., Zhang E., Zhang C., Zhu J.K., Karin M., Xu R.H., Zhang K. DNA methylation markers for diagnosis and prognosis of common cancers. Proc. Natl. Acad. Sci. U. S. A. 2017;114:7414–7419. doi: 10.1073/pnas.1703577114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Alghanim H., Wu W., McCord B. DNA methylation assay based on pyrosequencing for determination of smoking status. Electrophoresis. 2018 doi: 10.1002/elps.201800098. [DOI] [PubMed] [Google Scholar]
  • 45.Roy B.N., Reid R.L., Van Vugt D.A. The effects of estrogen and progesterone on corticotropin-releasing hormone and arginine vasopressin messenger ribonucleic acid levels in the paraventricular nucleus and supraoptic nucleus of the rhesus monkey. Endocrinology. 1999;140:2191–2198. doi: 10.1210/endo.140.5.6684. [DOI] [PubMed] [Google Scholar]
  • 46.Frye C.A., Petralia S.M., Rhodes M.E. Estrous cycle and sex differences in performance on anxiety tasks coincide with increases in hippocampal progesterone and 3alpha. 5alpha-THP. Pharmacology, biochemistry, and behavior. 2000;67:587–596. doi: 10.1016/s0091-3057(00)00392-0. [DOI] [PubMed] [Google Scholar]
  • 47.Bryant R.A., Felmingham K.L., Silove D., Creamer M., O’Donnell M., McFarlane A.C. The association between menstrual cycle and traumatic memories. J. Affect. Disord. 2011;131:398–401. doi: 10.1016/j.jad.2010.10.049. [DOI] [PubMed] [Google Scholar]
  • 48.Coburn S.B., Stanczyk F.Z., Falk R.T., McGlynn K.A., Brinton L.A., Sampson J., Bradwin G., Xu X., Trabert B. Comparability of serum, plasma, and urinary estrogen and estrogen metabolite measurements by sex and menopausal status. Cancer Causes Control. 2019;30:75–86. doi: 10.1007/s10552-018-1105-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Barfield R.T., Almli L.M., Kilaru V., Smith A.K., Mercer K.B., Duncan R., Klengel T., Mehta D., Binder E.B., Epstein M.P., Ressler K.J., Conneely K.N. Accounting for population stratification in DNA methylation studies. Genet. Epidemiol. 2014;38:231–241. doi: 10.1002/gepi.21789. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Multimedia component 1
mmc1.docx (3.7MB, docx)

Articles from Comprehensive Psychoneuroendocrinology are provided here courtesy of Elsevier

RESOURCES