Abstract
DNA methylation age (DNAmAge) surpasses chronological age in its ability to predict age-related morbidities and mortality. This study analyzed data from 287 middle-aged twins in the Louisville Twin Study (mean age 51.9 years ± 7.03) to investigate the effect of DNAmAge acceleration on change in IQ (ΔIQ) between childhood and midlife, while testing childhood socioeconomic status (SES) as a moderator of the relationship. DNAmAge was estimated with five commonly used algorithms (Horvath, Horvath Skin and Blood, GrimAge, and PhenoAge). A factor analysis of these measures produced a two-factor structure which we identified as first generation and second generation measures. Results of genetically informed, quasi-causal regression models indicated that accelerated second generation DNAmAge predicted more negative ΔIQ from childhood to midlife, after accounting for genetic and environmental confounds shared by twins. The relationship between DNAmAge and ΔIQ was moderated by childhood SES, with a stronger effect observed among twins from low SES backgrounds. Second generation DNAmAge measures trained to estimate phenotypic biological age show promise in their predictive value for cognitive decline in midlife. Our genetically informed twin design suggested that DNAmAge may represent a biological pathway through which early-life socioeconomic disadvantage impacts midlife cognitive health.
Introduction
Aging is highly heterogeneous, and chronological age is not a sensitive measure of the unique physiological and developmental processes that occur across the human lifespan. Patterns of DNA methylation (DNAm) across the genome can be used to estimate DNA methylation age (DNAmAge), a family of measures strongly correlated with chronological age but better able to predict morbidities and mortality. Adjusting for chronological age, DNAmAge measures can be residualized to generate measures of acceleration 1, such that positive residuals reflect faster biological aging, while negative residuals reflect slower, and presumably healthier, aging. Since the advent of the first measures a decade ago, DNAmAge measures reliably correlate with demographic characteristics like education level and socioeconomic status (SES) as well as clinically relevant outcomes including cognitive performance in adulthood 2,3.
Predicting cognitive outcomes from DNAmAge
Individuals with accelerated DNAmAge have an increased risk of cancer, stroke, heart disease, and neurodegenerative disease 4–6. While cognitive functioning is a crucial facet of both healthy and pathological aging, the link between DNAmAge and cognitive functioning remains unclear, partly due to inconsistencies in the predictive accuracy of various DNAmAge algorithms 7. First generation DNAmAge measures, such as Horvath, Horvath Skin and Blood, and Hannum DNAmAge, were designed primarily to estimate chronological age with methylation data from cytosine-phosphate-guanine (CpG) sites sensitive to time-dependent changes 1,8,9. While two studies found that Horvath DNAmAge was associated with cognitive decline 6,10, several others have found no statistically significant associations between first generation DNAmAge measures and cognitive performance or decline 11–14.
In contrast, second generation DNAmAge measures, including PhenoAge and GrimAge, are trained to predict age-related outcomes by integrating DNAm signatures tied to clinical biomarkers, health behaviors (e.g., smoking), and demographic variables 15,16. These algorithms aim to capture phenotypic aging processes beyond chronological age, such as morbidity and mortality risk. Accelerated PhenoAge and GrimAge are consistently associated with lower cognitive performance in middle and older adulthood 11,12,17,18. However, findings on cognitive decline are mixed. Neither Hillary et al. 18 nor Maddock et al. 12 reported statistically significant results, yet Reed and colleagues 13 found that those with cognitive decline in midlife showed more accelerated GrimAge scores than those without decline. These findings suggest that second generation clocks, developed to more accurately estimate phenotypic biological age, may better index age-related changes in cognitive ability.
Early-life exposures and pace of DNAmAge
Disadvantaged socioeconomic status (SES) in childhood is an established risk factor for lower cognitive ability across the lifespan and late-life cognitive decline 19. Recent research has also begun exploring links between childhood adversity and later-life DNA methylation age (DNAmAge). In retrospective accounts of adversity (e.g., poverty, death of parent, alcohol/drug use in family, abuse), only poverty was related to GrimAge acceleration 20. Childhood SES but not adult SES predicted accelerated DNAmAge in midlife using the Horvath and Hannum clocks, suggesting the methylome, or DNA methylation modifications across the genome, may be particularly vulnerable to external stressors in childhood 21,22. However, findings remain inconsistent: McCrory et al. 23 found no association between childhood SES and Horvath, Hannum, or PhenoAge DNAmAge in adults over 50, and unexpectedly, Faul et al. 11 found that low childhood SES was associated with slower aging on the Horvath and Hannum clocks later in life. It is not yet known whether SES moderates the effects of DNAmAge on cognitive outcomes.
Genetics, Epigenetics, and Classical Twin Designs
Associations between early life exposures, mid- to late-life outcomes, and DNAmAge have primarily been correlational. One study decomposed the variance of DNAmAge into additive genetic, shared environmental, and nonshared environmental components using a traditional twin model, while two others used pedigree designs to estimate heritability 16,24,25. Our group has shown how to use a longitudinal twin design to go beyond estimating heritability while sharpening causal inference between a predictor and outcome 26. Only two studies have examined the relationship between DNAmAge and cognitive ability while controlling for between-family confounds 10,14. In the Middle-Aged Danish Twin Study, no relationship was observed between the Horvath and Hannum clocks and cognitive change over 10 years in midlife 14. Vaccarino and colleagues 10 found no associations between Horvath, Hannum, GrimAge, or PhenoAge clocks and baseline cognition in middle-aged male twins. However, there was a statistically significant within-twin pair effect of Horvath DNAmAge on cognitive decline 11 years later. The reason why only Horvath’s original measure predicted cognitive ability is unclear, as initial validation studies of PhenoAge and GrimAge reported associations with cognitive outcomes 15,16.
Using the Louisville Twin Study (LTS), we expanded on this limited amount of research examining DNAmAge and cognitive change and sought to clarify how early life environmental characteristics may influence midlife cognitive outcomes. The LTS, initiated in 1957, includes intensive, prospective data on twins’ development from infancy through adolescence and recent re-evaluations in midlife on their cognitive, physical, and psychosocial functioning as well as epigenetic aging27. Parental data collected during the twins’ childhood further enabled us to examine how early life exposures, here rearing SES, relate to twins’ midlife epigenetic and cognitive aging.
The current study used a twin design to investigate causal effects of DNAmAge on change in IQ (ΔIQ) between childhood and midlife, while testing childhood SES as a moderator of the relationship. While observational designs cannot reproduce the causal inference possible with random assignment, twins offer a quasi-experimental method for studying causal processes while controlling for family-level genetic and environmental confounds. If a monozygotic (MZ) twin has a higher DNAmAge and also a greater ΔIQ than their co-twin, the association cannot be due to genetic differences or early environmental exposures, because identical twins reared together are matched for these factors. For this reason, we refer to associations that survive twin-based controls as “quasi-causal” 26. To address variability in DNAmAge algorithms, we conducted an exploratory factor analysis of five widely used measures. We expected second generation DNAmAge measures to be sensitive to ΔIQ in midlife, as they were developed to capture phenotypic aging processes beyond chronological age. Finally, given recent findings linking early life stressors with midlife DNAmAge, we hypothesized that the relationship between midlife DNAmAge and ΔIQ would depend on differences in childhood SES.
Results
Descriptive statistics
The sample includes 287 individual twins, comprising 60 monozygotic and 41 dizygotic complete pairs and 85 single twins. All participated in the childhood and midlife phases of the LTS (Supplementary Fig. S1). Twins were assessed in person using three versions of the Wechsler Intelligence Scale for Children (WISC) and the Wechsler Adult Intelligence Scale (WAIS-IV). We used Wechsler IQ scores as an overall estimate of an individual’s level of global cognitive ability. The average age for WISC was 14.42 (1.93) years, and the average age for WAIS-IV was 51.87 (7.03). Mean childhood IQ was 102.7 (13.4), and mean midlife IQ was 105.4 (13.8) (Table 1). After regressing DNAmAge values on chronological age and adjusting for cell type, Horvath, Horvath Skin and Blood, Hannum, PhenoAge, and GrimAge 1,8,9,15,16 had means close to zero (Table 1). MZ twin correlations (0.60–0.74) were approximately double those for DZ twins (0.28–0.37) across most measures, except PhenoAge, where MZ and DZ correlations were more similar (0.56 vs. 0.47). Pairwise clock correlations are in Supplementary Table S1.
Table 1.
Sample Characteristics of Louisville Twins
Characteristic | Total n (%) | Mean (SD) | MZ Correlation1 r (SE) | DZ Correlation2 r (SE) |
---|---|---|---|---|
Sociodemographic factors | 287 | |||
Age, years | 51.87 (7.03) | |||
Age at last WISC, years | 14.42 (1.93) | |||
Female Sex | 169 (58.89) | |||
White | 262 (91.29) | |||
Non-Hispanic | 279 (99.29) | |||
Education, 16 or more years | 161 (56.09) | |||
Childhood SES3 | 46.09 (26.02) | |||
Smoker | 66 (23.00) | |||
Cognitive function | ||||
WISC Full Scale IQ Score | 282 | 102.67 (13.44) | 0.88 (0.06) | 0.58 (0.14) |
WAIS Full Scale IQ Score | 255 | 105.45 (13.84) | 0.87 (0.07) | 0.50 (0.14) |
DNAmAge 4 | 281 | |||
Horvath | 0.00 (3.37) | 0.62 (0.11) | 0.37 (0.15 | |
Horvath Skin and Blood | −0.03 (3.59) | 0.62 (0.10) | 0.32 (0.15) | |
Hannum | 0.01 (3.25) | 0.60 (0.11) | 0.28 (0.16) | |
PhenoAge | −0.09 (4.20) | 0.56 (0.11) | 0.47 (0.14) | |
GrimAge | −0.12 (3.42) | 0.74 (0.09) | 0.37 (0.15) | |
Gen 1 Composite | 0.00 (2.85) | 0.62 (0.10) | 0.32 (0.15) | |
Gen 2 Composite | 0.00 (1.78) | 0.71 (0.09) | 0.42 (0.15) |
Monozygotic Twin Pair Correlation
Dizygotic Twin Pair Correlation
Childhood socioeconomic status (SES) measured using the Duncan Socioeconomic Index
DNAmAge is controlled for chronological age and cell composition. Residuals are in years.
Results from standardized univariate twin models decomposing the variance of individual DNAmAge measures show little to no shared environmental variance, C, in Horvath, Horvath Skin and Blood, Hannum, and GrimAge clocks. Because C was estimated at a negative value in these cases initially, C was set to zero and the constrained AE models are presented (Supplementary Table S2). These four DNAmAge measures were substantially heritable, with the additive genetic effects, A, ranging from 57–73%, and non-shared environment, E, accounting for 27–43% of the variance. The ACE decomposition of PhenoAge differed in that we estimated the A at 0.23 (95% CI: [−0.35, 0.80]), C at 0.33 (95% CI: [−0.16, 0.82]), and E at 0.45 (95% CI: [0.28, 0.62]) (Supplementary Table S2).
Exploratory Factor Analysis of DNAm Clocks
We conducted an exploratory factor analysis (EFA) using the five DNAmAge measures residualized for age and cell count. Based on Eigenvalues and fit indices, we selected a two-factor model (RMSEA = 0.08; 95% CI: [0.00, 0.20]) (Supplementary Fig. S2). Results from a likelihood ratio test comparing a two-factor model to a one-factor model suggested that the two-factor model fit substantially better (χ2 = 70.07, df = 4, p < .001). First generation DNAmAge measures (Horvath, Horvath Skin and Blood, and Hannum) loaded on one factor, while second generation measures (PhenoAge and GrimAge) loaded on a second factor. We refer to these factors as “Gen 1” and “Gen 2”, respectively. Factor loadings (p < .05) are displayed in Supplementary Table S3. The factors were correlated r = 0.57. Composite scores were generated for each participant based on the EFA. Gen 1 is the sum of standardized values for Horvath, Horvath Skin and Blood, and Hannum DNAmAge (M = 0.00, SD = 2.85). Gen 2 is the sum of standardized values for GrimAge and PhenoAge DNAmAge (M = 0.00, SD = 1.78). Twin pair correlations for Gen 1 and Gen 2 are in Table 1.
Univariate Twin Models of DNAmAge Factors
We decomposed the variances of Gen 1 and Gen 2 into A, C, and E components using the classical twin model. Both factors were moderately heritable, with A accounting for 50% of the variance in Gen 1 and 60% of the variance in Gen 2. The variance attributable to the non-shared environment was 42% in Gen 1 and 31% in Gen 2, while shared environment accounted for less than 1% of the variance in both factors. Standardized estimates are in Supplementary Table S2.
Phenotypic and “Quasi-Causal” Association Models
Following the twin modeling framework outlined by Turkheimer and Harden (2014), we first estimated a “phenotypic association” model with no control for differential regressions between and within twin pairs. We then fit a “quasi-causal” regression model, using the ACE decomposition of the predictor variable to control for family-level genetic and environmental confounds in the individual-level regressions. This sequence (phenotypic association model and quasi-causal regression model) was estimated for Gen 1 then repeated for Gen 2. Figure 1 illustrates the model: adult IQ was regressed on childhood IQ, DNAmAge, and the A and C components of each; DNAmAge was regressed on childhood IQ and its A and C components; SES (a family-level variable) was included as a linear covariate for DNAmAge and IQ variables and as an interaction term with DNAmAge. By including childhood IQ in the model, the conditional effects of SES and DNAmAge can be interpreted as predicting change in IQ from childhood to midlife (ΔIQ). The phenotypic regression coefficient (bP3) is an estimate of the relationship between the DNAmAge and ΔIQ with between-family genetic and environmental confounds controlled, i.e., within a pair of identical twins reared together.
Figure 1.
Path diagram depicting the association between DNAmAge and change in IQ, modified by childhood socioeconomic status (SES). Variances of ACE components are estimated, with paths to observed variables set to 1.0. All regression coefficients are unstandardized. bA1 and bC1 are regression coefficients from the A and C components of Childhood IQ to Adult IQ. bP1 is the phenotypic regression conditional on bA1 and bC1. bA2 and bC2 are regression coefficients from the A and C components of Childhood IQ to DNAmAge. bP2 is the phenotypic regression conditional on bA2 and bC2. bA3 and bC3 are regression coefficients from the A and C components of DNAmAge to Adult IQ. bP3 is the phenotypic regression conditional on bA3 and bC3. SES is a covariate for DNAmAge and the IQ variables, and a moderator of bP3.
Predictors of DNAmAge
In the phenotypic association model, there were no statistically significant associations with Gen 1 DNAmAge. All Gen 1 model results are presented in Supplementary Table S4. Accelerated Gen 2 DNAmAge was associated with lower childhood SES (b=−0.41, SE = 0.13, p < .05) in the phenotypic association model (Table 2). Next, we used the quasi-causal regression model to estimate the effects of childhood IQ and SES on DNAmAge, using the ACE decomposition of childhood IQ to control for genetic and environmental family-level confounds. In the quasi-causal analyses, there were similarly no statistically significant predictors of Gen 1 DNAmAge. Childhood SES again predicted Gen 2 DNAmAge (b=−0.67, SE = 0.19, p < .001) in this model, while childhood IQ did not predict Gen 2 DNAmAge. Genetic and environmental confounds were not significantly different from zero. Gen 2 DNAmAge results from both models are presented in Table 2.
Table 2.
Phenotypic and Quasi-Causal Regression Model Results for Gen 2 DNAmAge
Gen 2 DNAmAge | ||||||
---|---|---|---|---|---|---|
Childhood IQ Estimate (SE) | DNAmAge Estimate (SE) | Adult IQ Estimate (SE) | ||||
Phenotypic Association Model | SES | |||||
b | 0.56 (0.09)** | b | −0.67 (0.20)** | b | 0.20 (0.11) | |
Childhood IQ | ||||||
bA2 | bA1 | |||||
bC2 | bC1 | |||||
DNAmAge | bP2 | −0.08 (0.09) | bP1 | 0.48 (0.14)** | ||
bA3 | ||||||
bC3 | ||||||
bP3 | −0.10 (0.03)* | |||||
DNAmAge X SES | ||||||
bint | 0.06 (0.03)* | |||||
Variances | ||||||
A | 1.15 (0.34) | 1.96 (0.31) | 0.50 (0.08) | |||
C | 0.15 (0.32) | 0.00 (0.00)† | 0.00 (0.00) † | |||
E | 0.22 (0.04) | 0.85 (0.16) | 0.24 (0.05) | |||
Quasi-Causal Regression Model | SES | |||||
b | 0.56 (0.09)** | b | −0.67 (0.19)* | b | 0.18 (0.12) | |
Childhood IQ | ||||||
bA2 | −0.78 (0.41) | bA1 | 0.16 (0.23) | |||
bC2 | 0.30 (1.42) | bC1 | 0.80 (0.67) | |||
bP2 | 0.38 (0.26) | bP1 | 0.48 (0.14)** | |||
DNAmAge | ||||||
bA3 | 0.13 (0.15) | |||||
bC3 | 0.00 (0.00) | |||||
bP3 | −0.18 (0.07)* | |||||
DNAmAge X SES | ||||||
bint | 0.07 (0.03)* | |||||
Variances | ||||||
A | 1.06 (0.29) | 1.73 (0.56) | 0.43 (0.12) | |||
C | 0.21 (0.28) | 0.00 (0.00)† | 0.00 (0.00)† | |||
E | 0.23 (0.04) | 0.80 (0.15) | 0.23 (0.05) |
bP, phenotypic association between predictor and outcome; bA, amount of variance attributable to additive genetic influences; bC, amount of variance attributable to shared environmental experiences; bint, estimate of the interaction effect. Data are presented as unstandardized regression coefficients with standard errors in parentheses.
C variances have been set to zero
p < .05
p ≤ .001
Predictors of Midlife Cognitive Functioning
In the phenotypic association models for both Gen 1 and Gen 2, childhood IQ statistically predicted adult IQ. The strong relationship between childhood IQ and adult IQ (r = 0.77) is also illustrated in Supplementary Figure S3, where adult IQ is regressed on childhood IQ without controlling for the differential regressions between and within twin pairs. Similarly, SES was positively associated with childhood IQ (b = 0.56, SE = 0.09, p < .001) in both phenotypic association models. Higher childhood SES predicted higher adult IQ in the Gen 1 model (b = 0.16, SE = 0.07, p < .05), though this association did not meet statistical significance in the Gen 2 model. We found a statistically significant phenotypic association between Gen 2 and ΔIQ (bP3= −0.10, SE = 0 .03, p < .05). In both Gen 1 and Gen 2 phenotypic association models, we then tested the interaction between DNAmAge and SES (bint) in the prediction of ΔIQ but found that the interaction was not statistically significant.
In quasi-causal models, we then regressed midlife IQ on childhood IQ (bP1), DNAmAge (bP3), and the A and C components of both variables. There was a statistically significant phenotypic association between childhood IQ and adult IQ after accounting for genetic and environmental confounds shared by identical twins in both Gen 1 (bP1 = 0.41, p < .01) and Gen 2 models (bP1 = 0.48, p < .001). This association explains that over 40% of the variance in adult IQ can be attributed to childhood IQ. Consistent with the phenotypic association results above, accelerated Gen 2 DNAmAge predicted more negative ΔIQ from childhood to midlife (b = − 0.18, SE = 0.07, p < .05). Genetic and environmental confounds (bA3 and bC3) were not significantly different from zero. The quasi-causal twin model implies that within a pair of MZ twins raised together, the member of the pair with accelerated DNAmAge also declines more in cognitive ability between childhood and midlife. These findings were only apparent when DNAmAge was measured with second generation (Gen 2) DNAmAge measures.
The interaction revealed that twins raised in lower SES families showed a more negative relationship between Gen 2 DNAmAge and ΔIQ (b = 0.07, SE = 0.03, p < .05) (Table 2). Figure 2 illustrates the moderating effect of childhood SES by using a median split of our sample for SES and displaying the regression of adult IQ on Gen 2 DNAmAge in both high- and low-SES groups. The effect of DNAmAge on ΔIQ is greater in twins raised in lower SES households, indicating that socioeconomic disadvantage may be an early exposure that amplifies the negative effects of epigenetic aging on cognitive trajectories in midlife.
Figure 2.
Regression of Adult IQ residuals on Gen 2 DNAmAge for high and low SES groups. IQ residuals are derived from the regression of adult IQ on childhood IQ and childhood socioeconomic status.
Discussion
DNAmAge is an effective predictor of specific aspects of biological aging, but challenges remain due to the lack of a gold standard DNAmAge measure and variations in the predictive accuracies of various measures. Most studies have focused on associations between DNAmAge and phenotypic aging outcomes, but few have tested whether such associations occur because DNAm causes biological aging as opposed to merely correlating with it. Our study first identified two factors across five DNAmAge measures and then used a longitudinal twin design to improve causal inference between early-life exposures, DNAmAge, and ΔIQ in midlife.
As hypothesized, Gen 2 DNAmAge, but not Gen 1, predicted ΔIQ in midlife. This result was consistent with several previous studies showing associations between second generation measures and cognition cross-sectionally 11,17 and longitudinally 13. PhenoAge and GrimAge algorithms both incorporate methylation signatures of clinical markers—PhenoAge from blood-based biomarkers predicting physical function, multimorbidity, and mortality 15, and GrimAge from plasma proteins and smoking pack years selected for their prediction of time-to-death 16. These second generation DNAmAge measures have already shown improvements over first generation DNAmAge in predicting physical functioning, morbidity, and lifespan 16,28. Our finding that accelerated Gen 2 DNAmAge predicted more negative ΔIQ suggests that decline in cognitive ability is interwoven with other age-related physiological processes that associate with the methylome.
The within-twin pair association between Gen 2 DNAmAge and ΔIQ that withstood controls for between family level confounds suggests that differences in environmental exposures partially explain their association. Two prior twin studies of DNAmAge and cognition had mixed findings 10,14. Our Gen 1 findings were consistent with Starnawska and colleagues 14 who did not find within-twin pair associations between Horvath and Hannum DNAmAge and cognition cross-sectionally or longitudinally. Our results differed from Vaccarino et al. 10 who found no within-pair association between PhenoAge and GrimAge and cognitive decline. Differences in study design may explain this discrepancy: while our sample included both male and female twins, Vaccarino et al. examined only male veterans. Our study also spanned a longer age range, predicting change from childhood and adolescent IQ to midlife, whereas Vaccarino et al. used two testing points within participants’ 50s and 60s. Importantly, the LTS lifespan longitudinal design allowed us to capture change from premorbid assessments of cognitive ability to midlife, a period when subtle, age-related declines may emerge.
While some prior studies found that lower childhood SES was associated with accelerated DNAmAge on the Horvath and Hannum clocks 21,22, others detected no significant effect 23,29. Our results supported the hypothesis that early-life SES would differentially affect Gen 1 and Gen 2 DNAmAge. Childhood SES predicted Gen 2 DNAmAge but not Gen 1 DNAmAge in midlife, consistent with findings from McCrory et al. 20, who found childhood poverty was associated with accelerated GrimAge in 50- to 87-year-olds. Childhood poverty has been found to influence health across the lifespan by limiting access to resources like education and healthcare 30. However, the interaction effect between DNAmAge and SES in our study offers a novel explanation for one pathway through which early-life stressors become biologically embedded. The relationship between accelerated epigenetic age and cognitive decline was amplified within twin pairs from lower-income households, suggesting that those raised in poorer families are more vulnerable to the cognitive effects associated with the broader physiological aging measured by DNAmAge.
To our knowledge, only one previous study has conducted a factor analysis on multiple DNAmAge measures. In a model predicting multimorbidity and activities of daily living, Faul and colleagues 11 found Horvath and Hannum measures loaded on a first factor and PhenoAge, GrimAge, and DunedinPACE loaded on a second factor. Our EFA results aligned with this, identifying a “Gen 1” factor with Horvath, Horvath Skin and Blood, and Hannum clocks, and a “Gen 2” factor with PhenoAge and GrimAge. Although we did not calculate DunedinPACE, a “third generation” measure that estimates rate of decline over nearly twenty years, future studies should compare its sensitivity with Gen 2 in the LTS sample 31. These results contribute to an understanding of the shared underlying patterns of measurement in “Gen 1” and “Gen 2” DNAmAge. Distinguishing between these two domains is a step towards refining the application of DNAmAge: our findings suggest second generation DNAmAge measures may have greater utility in studies of cognitive aging than first generation DNAmAge.
Strengths and Limitations
Unlike prior studies described here that relied on retrospective accounts of childhood financial hardship or parental occupation, ours is the first to use a prospective rating of childhood SES (Duncan Index) which may have helped mitigate recall bias and improve reliability. However, the Duncan Index is a crude measure of parental occupation, and adding other socioeconomic indicators could improve future research. We did not test adult SES in this study, given that previous work had suggested childhood SES, but not adult SES, was predictive of DNAmAge. While our sample covered a broad socioeconomic range, it was predominantly white and recruited from one region of the United States, limiting generalizability. Future studies should examine the relationship between second generation DNAmAge and patterns of cognitive ability in larger and more diverse samples. Finally, we used IQ score as a proxy for global cognitive ability and future work should examine relationships between DNAmAge and specific cognitive domains including memory and executive functioning.
Materials and Methods
Participants
Participants were 287 individual twins from the midlife phase of the Louisville Twin Study (LTS). The LTS began as a study of childhood development in multiple birth pairs. Twins born between 1950–1997 in the Louisville metropolitan area were followed and assessed on cognitive ability, physical development, and temperament between 3 months and 15 years of age. Parents of twins provided data on demographics, parenting factors, and home environment. Zygosity was determined through blood typing during childhood. The sample contains same-sex monozygotic (MZ) and dizygotic (DZ) twin pairs, and opposite-sex DZ pairs. The LTS closed down in 2000 after 1,770 individual twins had been recruited 32. In 2018, follow-up of LTS twins aged 30–65 was initiated 27, and the LTS reopened in 2019 to begin the first midlife data collection of the 1,770 twins. The study was approved by the University of Louisville Institutional Review Board (19.0989). Participants provided a 50cc whole blood sample for genotyping and DNA methylation assay. Genotyping confirmed zygosity for any unknown twin pairs. The present sample includes 60 MZ pairs, 41 DZ pairs, and 85 individual twins who completed a midlife study visit between 2019–2023. Supplementary Figure S1 illustrates the participant flow, attrition, and retention across study phases.
Socioeconomic Status
Childhood socioeconomic status (SES) was measured using the Duncan socioeconomic index which assigns scores of 0–100 based on the occupation of the head of the household 33. Occupational prestige was based on 1950 US Census income data and education level associated with the occupation. The SES of our sample (M = 47.3, SD = 25.7) was consistent with the overall LTS distribution (M = 46.9, SD = 26.9), evenly distributed, and representative of the Louisville, Kentucky area (Davis et al., 2019).
Cognitive Ability Measures
LTS participants were assessed using validated Wechsler batteries of cognitive functioning. Children and adolescents were administered one of three forms of the Wechsler Intelligence Scale for Children (WISC, WISC-R, WISC-III; 34–36. Adults in the midlife phase were assessed using the Wechsler Adult Intelligence Scale IV (WAIS-IV; 37. Test scores are age-standardized based on a standardization sample that is roughly representative of the United States. The Wechsler batteries generate a Full-Scale Intelligence Quotient (IQ), which is an overall estimate of an individual’s current level of global cognitive ability, with a mean of 100 and a standard deviation of 15. Children were typically administered cognitive assessments at ages 7, 8, 9, 12, and 15, though not all twins were assessed at each timepoint; this study used the most recently available WISC assessment for each participant, with over 80% collected at age 15 (see Table 1 for more detailed descriptive statistics).
DNAmAge Measures
Genomic DNA was extracted from whole-blood samples that were collected at the University of Louisville and Norton Healthcare medical campuses. Venous blood was collected in ethylenediaminetetraacetic acid (EDTA) tubes and shipped to the Norris Comprehensive Cancer Institute at the University of Southern California Keck School of Medicine. DNA was extracted with promega Maxwell 16 LEV blood DNA kits, and then treated with bisulfite reagents at the USC Molecular Genomics Core following manufacturer protocol. Methylation was assayed with the Illumina Infinium Human MethylationEPIC BeadChip (Illumina, San Diego, CA, USA) at the USC Molecular Genomics Core at the Norris Cancer Institute at the Keck School of Medicine.
Quality control was conducted in R 4.3.0 38 using the minfi package 39 to identify aberrant samples and CpG sites, remove cross-reactive probes, conduct background correction, and adjust for batch effects prior to estimating DNAmAge variables. Background correction was performed using the normal-exponential out-of-band (noob) method 40. The ComBat method was used to adjust for laboratory batches 41.
We estimated blood cell composition using the Houseman method 42 and obtained percentages of CD8 + T cells, CD4 + T cells, natural killer cells (NK), B cells, monocytes, and granulocytes. Cell proportions are highly correlated, so in the current analysis we included the first and second principal components from a principal components (PC) analysis (collectively, they explained 86.12% of the variance) in all analyses.
As replicate blood samples often have different methylation values due to noise in the CpG sites, batch effects, and sample preparation, Higgins-Chen et al. 43 developed a principal components-based method to bolster reliability in DNAmAge algorithms by training them based on PC analysis, extracting the covariance between multicollinear CpGs, including age-related covariance. We estimated DNAmAge using five PC-trained algorithms, Horvath, Horvath Skin and Blood, Hannum, PhenoAge, and GrimAge 1,8,9,15,16.
Data Analysis
Descriptive statistics for the sample’s demographic characteristics, survey data, and cognitive ability data were computed in R version 4.3.2 38. Methylation data were cleaned and prepared in R. DNAmAge was calculated as the residual difference in PC-trained values predicted from chronological age in a bivariate linear regression model. DNAmAge values were adjusted for cell composition. Age and cell-adjusted variables were then winsorized such that values more than two standard deviations from the mean DNAmAge score were replaced with a value two standard deviations above or below the mean. Pairwise correlations for the five DNAmAge measures were then estimated using maximum likelihood in Mplus Version 8 44.
We used the classical twin model to decompose the phenotypic variance of DNAmAge into additive genetic (A), shared environmental (C), and nonshared environmental (E) components. Given that MZ twins share 100% of their genetic makeup and DZ twin pairs share half, on average, the A variance is correlated 1.0 within MZ twin pairs and 0.5 within DZ twin pairs. In both MZ and DZ twin pairs, the C component includes experiences that make siblings raised together more similar, and is correlated 1.0. Nonshared environmental components are unique to individuals and by definition uncorrelated within twin pairs. We estimated a univariate ACE model for all five DNAmAge measures individually using Mplus.
Exploratory factor analysis was conducted in Mplus to identify underlying dimensions across the five DNAm clock variables (Horvath, Horvath Skin and Blood, Hannum, PhenoAge, and GrimAge). The number of factors to be rotated was determined by the scree plot of Eigenvalues (Supplementary Fig. S2) and a likelihood ratio test comparing a two-factor model to a one-factor model. EFA models were estimated using full-information maximum likelihood with an oblique (geomin) rotation. Factor loadings were evaluated for statistical significance, with a criterion of p < .05 (Supplementary Table S3). We estimated composite scores by standardizing the five DNAmAge variables and using the sum of the standardized values for the variables in each factor. Once composite scores were computed, univariate ACE models were estimated for each factor, decomposing the variance of DNAmAge composite scores into genetic, shared environmental, and nonshared environmental components.
To explore the primary aims of the study, the relationship between each DNAmAge factor and ΔIQ was modeled in two ways. Both the phenotypic association model and “quasi-causal” regression model were fit in Mplus. The phenotypic association model is equivalent to a simple phenotypic regression where the relationship between predictor and outcome is examined without controlling for genetic and shared environmental effects. This was estimated prior to all genetically informed models. Using a dataset wide by twin pair and estimated parameters set to be equal across the members of a pair, we regressed midlife IQ on childhood IQ, childhood SES, DNAmAge, and the interaction between SES and DNAmAge.
Next, we fit the quasi-causal regression model to estimate the phenotypic regression coefficients conditional on bA and bC paths. The classical twin method was used to partition childhood IQ, adult IQ, and DNAmAge into A, C, and E components. In this case, midlife IQ was regressed on childhood IQ, DNAmAge, and the A and C components of DNAmAge and childhood IQ, as illustrated in Fig. 1. Childhood SES was used as a linear covariate for DNAmAge and the IQ variables. We also estimated an interaction between childhood SES and DNAmAge to determine whether the strength of the relationship between DNAmAge and ΔIQ depended on the SES of the family in which twins were raised. A statistically significant bP3 coefficient affirmatively answers the question: Within a pair of identical twins, does the twin with the higher DNAmAge value also have the larger positive or negative ΔIQ, statistically adjusting for the effects of the A and C components of DNAmAge (bA3 and bC3).
Supplementary Material
Funding
Funding was provided by the National Institute on Aging Grant No. R01AG063949.
Footnotes
Declarations
Competing Interest Statement
No competing interests.
Additional Declarations: No competing interests reported.
Contributor Information
Sophie A. Bell, Department of Psychology, University of Virginia
Christopher R. Beam, Department of Psychology, University of Southern California
Ebrahim Zandi, Keck School of Medicine, University of Southern California.
Alyssa Kam, Department of Psychology, University of Southern California.
Emily Andrews, Department of Psychology, University of Virginia.
Jonathan Becker, Department of Family and Geriatric Medicine, University of Louisville School of Medicine.
Deborah Finkel, Institute for Gerontology, Jönköping University.
Deborah W. Davis, Department of Pediatrics, University of Louisville School of Medicine
Eric Turkheimer, Department of Psychology, University of Virginia.
Data Availability
The datasets analyzed during the current study are not publicly available but de-identified data and/or code are available from the corresponding author on reasonable request.
References
- 1.Horvath S. DNA methylation age of human tissues and cell types. Genome Biol. 14, R115 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Oblak L., van der Zaag J., Higgins-Chen A T., Levine M E. & Boks M P. A systematic review of biological, social and environmental factors associated with epigenetic clock acceleration. Ageing Res. Rev. 69, 101348 (2021). [DOI] [PubMed] [Google Scholar]
- 3.Ryan J., Wrigglesworth J., Loong J., Fransquet P. D. & Woods R. L. A Systematic Review and Meta-analysis of Environmental, Lifestyle, and Health Factors Associated With DNA Methylation Age. Journals Gerontology: Ser. A. 75, 481–494 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Fransquet P D., Wrigglesworth J., Woods R L., Ernst M E. & Rya J. The epigenetic clock as a predictor of disease and mortality risk: a systematic review and meta-analysis. Clin. Epigenetics. 11, 62 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Horvath S. & Ritz B R. Increased epigenetic age and granulocyte counts in the blood of Parkinson’s disease patients. Aging (Albany NY). 7, 1130–1142 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Levine M. E., Lu A. T., Bennett D. A. & Horvath S. Epigenetic age of the pre-frontal cortex is associated with neuritic plaques, amyloid load, and Alzheimer’s disease related cognitive functioning. Aging (Albany NY). 7, 1198–1211 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bell C. G. et al. DNA methylation aging clocks: challenges and recommendations. Genome Biol. 20, 249 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hannum G. et al. Genome-wide Methylation Profiles Reveal Quantitative Views of Human Aging Rates. Mol. Cell. 49, 359–367 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Horvath S. et al. Epigenetic clock for skin and blood cells applied to Hutchinson Gilford Progeria Syndrome and ex vivo studies. Aging (Albany NY). 10, 1758–1775 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Vaccarino V. et al. Epigenetic Age Acceleration and Cognitive Decline: A Twin Study. J. Gerontol. Biol. Sci. Med. Sci. 76, 1854–1863 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Faul J. D. et al. Epigenetic-based age acceleration in a representative sample of older Americans: Associations with aging-related morbidity and mortality. Proceedings of the National Academy of Sciences 120, e2215840120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Maddock J. et al. Childhood growth and development and DNA methylation age in mid-life. Clin. Epigenet. 13, 155 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Reed R. G., Carroll J. E., Marsland A. L. & Manuck S. B. DNA methylation-based measures of biological aging and cognitive decline over 16-years: preliminary longitudinal findings in midlife. Aging (Albany NY). 14, 9423–9444 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Starnawska A. et al. Blood DNA methylation age is not associated with cognitive functioning in middle-aged monozygotic twins. Neurobiol. Aging. 50, 60–63 (2017). [DOI] [PubMed] [Google Scholar]
- 15.Levine M. E. et al. An epigenetic biomarker of aging for lifespan and healthspan. Aging 10, 573–591 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lu A. T. et al. DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging 11, 303–327 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Graves A. J. et al. Accelerated epigenetic age is associated with whole-brain functional connectivity and impaired cognitive performance in older adults. Sci. Rep. 14, 9646 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hillary R. F. et al. An epigenetic predictor of death captures multi-modal measures of brain health. Mol. Psychiatry. 26, 3806 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Aartsen M. J. et al. Advantaged socioeconomic conditions in childhood are associated with higher cognitive functioning but stronger cognitive decline in older age. Proc. Natl. Acad. Sci. U S A. 116, 5478–5486 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.McCrory C. et al. Early life adversity and age acceleration at mid-life and older ages indexed using the next-generation GrimAge and Pace of Aging epigenetic clocks. Psychoneuroendocrinology 137, 105643 (2022). [DOI] [PubMed] [Google Scholar]
- 21.Austi M. K. et al. Early-life socioeconomic disadvantage, not current, predicts accelerated epigenetic aging of monocytes. Psychoneuroendocrinology 97, 131–134 (2018). [DOI] [PubMed] [Google Scholar]
- 22.Hughe A. et al. Socioeconomic Position and DNA Methylation Age Acceleration Across the Life Course. Am. J. Epidemiol. 187, 2346–2354 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.McCror C. et al. How does socio-economic position (SEP) get biologically embedded? A comparison of allostatic load and the epigenetic clock(s). Psychoneuroendocrinology 104, 64–73 (2019). [DOI] [PubMed] [Google Scholar]
- 24.Jylhävä J. et al. Longitudinal changes in the genetic and environmental influences on the epigenetic clocks across old age: Evidence from two twin cohorts. eBioMedicine 40, 710–716 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Marion R. E. et al. DNA methylation age of blood predicts all-cause mortality in later life. Genome Biol. 16, 25 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Turkheimer E. & Harden K. P. Behavior genetic research methods: Testing quasi-causal hypotheses using multivariate twin data. in Handbook of research methods in social and personality psychology, 2nd ed 159–187 (Cambridge University Press, New York, NY, US, (2014). [Google Scholar]
- 27.Beam C. R. et al. Midlife Study of the Louisville Twins: Connecting Cognitive Development to Biological and Cognitive Aging. Behav. Genet. 50, 73–83 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Levine M. E. Assessment of Epigenetic Clocks as Biomarkers of Aging in Basic and Population Research. Journals Gerontology: Ser. A. 75, 463–465 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Fiorit G. et al. Social adversity and epigenetic aging: a multi-cohort study on socioeconomic differences in peripheral blood DNA methylation. Sci. Rep. 7, 16266 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Barboza Solís C. et al. Mediating pathways between parental socio-economic position and allostatic load in mid-life: Findings from the 1958 British birth cohort. Soc. Sci. Med. 165, 19–27 (2016). [DOI] [PubMed] [Google Scholar]
- 31.Belsky D. W. et al. DunedinPACE, a DNA methylation biomarker of the pace of aging. eLife 11, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Davi D. W., Turkheime E., Finke D., Bea C. & Rya L. The Louisville Twin Study: Past, Present and Future. Twin Res. Hum. Genet. 22, 735–740 (2019). [DOI] [PubMed] [Google Scholar]
- 33.Duncan O. A socioeconomic index for all occupations. Occupations social status; (1961). [Google Scholar]
- 34.Wechsler D. Wechsler Intelligence Scale for Children (Psychological Corporation, 1949). [Google Scholar]
- 35.Wechsler D. Wechsler Intelligence Scale for Children-Revised (The Psychological Corporation, 1974). [Google Scholar]
- 36.Wechsler D. Wechsler Intelligence Scale for Children (The Psychological Corporation, 1991). [Google Scholar]
- 37.Wechsler D. Wechsler Adult Intelligence Scale - Fourth Edition Administration and Scoring Manual. (2008). https://psycnet.apa.org/doiLanding?doi=10.1037%2Ft15169-000 [Google Scholar]
- 38.R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2023). [Google Scholar]
- 39.Aryee M J. et al. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics 30, 1363–1369 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Triche T J., Weisenberger D J., Van Den Berg D., Laird P W. & Siegmund K D. Low-level processing of Illumina Infinium DNA Methylation BeadArrays. Nucleic Acids Res. 41, e90 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Johnso W. E., Li C. & Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8, 118–127 (2007). [DOI] [PubMed] [Google Scholar]
- 42.Houseman E. A. et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinform. 13, 86 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Higgins-Chen A. T. et al. A computational solution for bolstering reliability of epigenetic clocks: implications for clinical trials and longitudinal tracking. Nat. Aging. 2, 644–661 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Muthén L. K. & Muthén B. O. Mplus: Statistical Analysis with Latent Variables: User’s Guide (Version 8) (Muthén & Muthén, 1998). [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets analyzed during the current study are not publicly available but de-identified data and/or code are available from the corresponding author on reasonable request.