Abstract
Background
Blood-based markers of cognitive functioning might provide an accessible way to track neurodegeneration years prior to clinical manifestation of cognitive impairment and dementia.
Results
Using blood-based epigenome-wide analyses of general cognitive function, we show that individual differences in DNA methylation (DNAm) explain 35.0% of the variance in general cognitive function (g). A DNAm predictor explains ~4% of the variance, independently of a polygenic score, in two external cohorts. It also associates with circulating levels of neurology- and inflammation-related proteins, global brain imaging metrics, and regional cortical volumes.
Conclusions
As sample sizes increase, the ability to assess cognitive function from DNAm data may be informative in settings where cognitive testing is unreliable or unavailable.
Supplementary Information
The online version contains supplementary material available at 10.1186/s13059-021-02596-5.
Keywords: DNA methylation, EWAS, Cognitive ability, Prediction, Epidemiology
Background
Blood-based markers of cognitive functioning might provide an accessible way to track neurodegeneration years prior to clinical manifestation of cognitive impairment and dementia. They might also form an easy, objective, and less stressful way to assess neurodegeneration compared to pen-and-paper cognitive tests or in circumstances where biosamples alone are available. Furthermore, they could help to inform our understanding of the biological basis of brain health differences. Blood-based DNA methylation can be used to generate predictors of lifestyle factors, such as smoking, alcohol consumption, and obesity [1]—factors that are linked with poorer cognitive function and an increased risk of dementia [2]. However, blood-based DNA methylation predictors of cognitive function itself, rather than its known correlates, may index a wider range of risk factors for neurodegeneration. Despite being peripheral to the central nervous system, blood is an easily accessible tissue, and its DNA methylation patterns may enable early diagnosis and provide mechanistic insights of early phases of disease progression. Although DNA methylation in brain tissue may provide more direct insights into the biology of neurodegeneration, [3, 4] acquiring in-vivo brain tissue samples is not feasible outside of extraordinary circumstances. Recent methodological advances [5, 6] have enabled the estimation of variance that DNA methylation can account for in complex traits. Therefore, we can now quantify how well blood-based DNA methylation predicts cognitive test outcomes.
There is modest evidence for associations between individual blood-based methylation sites and cognitive functioning; six CpG probes were identified as genome-wide significant in meta-analyses of epigenome-wide association studies (EWASs) of seven cognitive traits [7]. That study was limited by heterogeneous cognitive outcomes across cohorts, which also varied in age and ethnicity (meta-analysis n ranging from 2557 to 6809). Large-scale single cohort studies with consistent cognitive phenotyping and DNA methylation typing and quality control are lacking. Here, we overcome these limitations by utilising phenotypic cognitive data and blood-based methylation data from a single large cohort of European ancestries.
Results
The Generation Scotland dataset
Blood-based DNA methylation and general cognitive ability (g) were assessed concurrently in 9162 adult participants from the Generation Scotland cohort [8, 9] (Additional file 1, Table S1; Additional file 2, Figure S1; Methods). The study cohort comprised 59% females and had a mean age of 49.8 years (SD 13.6; range 18–93). Prior to running the main analyses, the cognitive phenotypes were pre-corrected for four covariates: age, sex, BMI and an epigenetic smoking score [10]. The DNA methylation data were corrected for batch, age, sex and epigenetic smoking, which all have pervasive effects on DNAm levels. Residuals from these linear regression models were taken forward for the primary analyses.
Estimating the proportion of variance in cognitive ability explained by all CpG sites
We first explored if global patterns of DNA methylation associated with individual differences in cognitive ability. To determine the proportion of variance in g that can be explained by all CpG sites on the DNAm array and to identify individual CpGs associated with g, we conducted a Bayesian penalised regression and Gaussian mixture-based variance partitioning analysis using BayesR+ software. BayesR+ has been shown to implicitly control for white cell proportions, which are typically estimated from the DNAm data, related participants, and other unknown confounders [6]. Three mixture distributions were specified, corresponding to possible small, medium and large effect sizes for the CpGs (explaining 0.01%, 0.1% and 1% of the variance, respectively). Variance components analyses indicated that 41.6% [95% credible interval 31.0%, 53.0%] of variance in g was explained by all DNA methylation probes (Additional file 1, Table S2).
Variance component sensitivity analyses
Although BayesR+ can control for genetic relatedness as previously demonstrated in Generation Scotland analyses [6], a sensitivity analysis using a linear mixed model approach [5] with an epigenetic relationship matrix was considered. It yielded near identical results (43.4% (SE 0.03)); a sensitivity analysis using data from an unrelated subset (n=4261) of the study cohort that was processed in a single methylation batch also showed similar estimates (58.4% (SE 0.07); Additional file 1, Table S3). Of the prior distributions specified in the BayesR+ analysis, the majority of the variance explained by the DNAm array was accounted for by CpG sites assigned to the mixture corresponding to small effects (Additional file 1, Table S4).
Estimating the proportion of variance in cognitive ability explained by common SNPs
Previous studies, including those using data from Generation Scotland [11], have identified non-zero common-SNP-based heritability estimates for general cognitive ability. To assess the genetic contribution to variance in g and to see if this overlaps with the DNAm component, two additional BayesR+ models were run. The first included genetic data alone, which estimated a SNP-based heritability of 37.9% [18.3%, 52.9%], which is in line with previous GREML estimates from the cohort [11]. The second model considered the proportion of variance explained when combining the effects of genetics and DNAm, resulting in an estimate of 63.8% [50.0%, 73.5%] (Additional file 1, Table S2). Notably, the CpG contribution to the variance accounted for was largely independent of the genetic component—absolute attenuation 6.6% (relative attenuation 15.9%) to the epigenetic effect size estimate in the model that included genetics (estimate 35.0% [24.8%, 46.7%]).
Epigenome-wide association study to identify individual CpG sites associated with cognitive ability
After identifying a substantial DNAm-based variance component for g, we carried out an epigenome-wide association study in BayesR+. We investigated the associations between g and individual CpG sites, which were assigned to one of the three mixtures. We identified three unique lead DNAm sites with a posterior inclusion probability (PIP) greater than 0.80 and, after accounting for highly correlated CpG clusters, a group-based PIP>0.95 (Additional file 2, Figure S2; Additional file 1, Table S5; the entire output is available at 10.5281/zenodo.5794029 [12]). For the three lead CpG sites, we queried the EWAS catalog (accessed on April 5, 2021) for associations with other traits at a previously defined epigenome-wide significance threshold of P<3.6x10-8 (Additional file 1, Table S6) [13, 14]. They have been linked to age, sex, metabolite levels and lung function. Of 28 CpGs identified in previous blood-based EWAS analyses of cognitive ability and Alzheimer’s disease [4, 7, 15], 24 were available for lookup in the present dataset; Generation Scotland data was not included in any of these studies. There was no evidence for replication (maximum PIP of 0.03; Additional file 1, Table S7).
Epigenetic Score (EpiScore) for cognitive ability tested in two independent cohorts: the Lothian Birth Cohorts of 1921 and 1936
A weighted linear Epigenetic Score (EpiScore) for g that included all CpG sites in the EWAS was applied to two independent cohorts to determine the proportion of variance in g that can be explained by a single predictor variable. Such a predictor may improve risk prediction and patient stratification for studies of cognitive decline and dementia. The weights for each CpG were the mean posterior effect sizes from the EWAS model of g. These weights were applied to CpGs in two independent studies, The Lothian Birth Cohort 1936 (LBC1936) and The Lothian Birth Cohort 1921 (LBC1921); n=844 and n=427 with concurrently measured DNAm, cognitive scores, and cognitive polygenic scores available, respectively (Additional file 1, Table S8; Additional file 2, Figure S3; Methods). The resulting cognitive EpiScores showed very weak correlations (absolute Pearson r < 0.072; all P values ≥0.050) with measured white blood cell counts in both LBC1921 and LBC1936 (Additional file 1, Table S9). The incremental R2 upon the addition of the EpiScore to a linear regression model adjusting for age and sex was 3.4% (P=2.0x10-8) in LBC1936 and 4.5% (P=9.9x10-6) in LBC1921. The corresponding R2 for the polygenic score derived from a GWAS of 168,033 individuals in UK Biobank—a training sample approximately 18-times greater than the DNAm training sample—was 7.3% (P<2x10-16) and 6.9% (P=3.1x10-8), respectively. The additive incremental R2 from the two omics-based predictors was 10.7% in LBC1936 and 10.5% in LBC1921 (Fig. 1).
Relationship between sample size and theoretical variance explained by a cognitive EpiScore
To see how the size of the training sample might affect the proportion of variance in g we could explain, we simulated several scenarios. Based on a training sample size of 10,000 and an (arbitrary) assumption of 100,000 CpGs affecting the trait, with a DNAm variance components estimate of 41.6%, we would expect a DNAm prediction R2 value of about 4.0% (following Formula 1 from [16]). This is very similar to the estimates obtained. If the training sample increased to 20,000 or 100,000 then the expected R2 should increase to around 8% and 29%, respectively.
Comparison of the associations between measured cognitive ability and a cognitive EpiScore with health and lifestyle factors
We then tested how the EpiScore associated with health and lifestyle risk factors and how these compared to those observed for measured g. If the EpiScore yields similar effect sizes to measured g in its associations with these factors then it would strengthen its case for inclusion in prediction and risk stratification analyses for cognitive decline. In age- and sex-adjusted linear regression analyses with common risk factors of cognitive decline (smoking, years of education, BMI, lung function, walking speed, grip strength, high blood pressure, alcohol consumption, a depression questionnaire score, and an index of social deprivation), the EpiScore showed directionally consistent but weaker associations than measured g across both LBC1921 and LBC1936 (Fig. 1 and Additional file 1, Table S10). The only exception was alcohol consumption, where the EpiScore outperformed measured g. A multiple regression model in LBC1936 with all covariates as predictors of measured g is presented in Additional file 1, Table S11. Upon addition of the EpiScore to the model, the incremental R2 estimate was 0.8% (P=1.8x10-3).
Comparison of the associations between measured cognitive ability and a cognitive EpiScore with inflammatory protein levels
To see if the EpiScore and measured g metrics were comparable in terms of their associations with protein biomarkers, age- and sex-adjusted linear regression analyses were conducted with 70 Olink inflammatory protein levels in LBC1936. The EpiScore and measured g associations with the proteins were moderately concordant (r = 0.43). The EpiScore associated with 11 proteins (P<0.05/70) compared to 5 for measured g with two proteins, IL-6 and IL18.R1, overlapping both sets (Fig. 1 and Additional file 1, Table S12).
Comparison of the associations between measured cognitive ability and a cognitive EpiScore with neurology protein levels and brain MRI measures
Finally, we compared the EpiScore and g associations with brain imaging outcomes and neurology protein levels. DNA methylation, structural brain MRI, and 90 Olink neurology-related proteins were also available at a follow-up wave of LBC1936 when participants were a mean age of 73 years (n=701 with proteins and n=551 with MRI). The EpiScore—protein associations mirrored those previously reported with measured fluid cognitive ability in the same dataset [17] with similar effect size estimates (r = 0.70). Thirteen EpiScore- and 15 measured g-protein associations were statistically significant (P<0.05/90) with 7 overlapping (Fig. 1 and Additional file 1, Table S12). There were associations with brain imaging measures of global volume (total brain, grey matter and normal appearing white matter volumes—Fig. 2 and Additional file 1, Table S13; Additional file 3). Furthermore, there were widespread associations between the EpiScore and regional brain cortical volume and thickness (Fig. 2 and Additional file 2, Figures S4-S5; Additional file 3), with significant overlap in cortical loci for both measured g and EpiScore. Overall, the EpiScore findings largely mirrored the associations between measured g and neuroimaging outcomes, albeit they were slightly smaller in magnitude.
Discussion
This is the first variance component analysis of DNAm and cognitive function. We show substantial contributions to general cognitive function (g) in addition to the development of a novel epigenetic score with application to two independent test cohorts where it associated with measured cognitive ability.
We also show associations between the EpiScore and lifestyle factors and risk factors for dementia, circulating levels of neuroinflammatory proteins and brain MRI measures. As these findings reflected similar, albeit slightly weaker associations to those with measured g, a cognitive EpiScore may provide a useful tool to measure brain health in clinical settings and to aid in risk prediction of neurodegeneration.
Whereas many of the associations with measured cognitive ability in LBC1936 have been studied and discussed in detail previously (e.g., neurology proteins, brain MRI, health and lifestyle variables [17, 18]), we present for the first time the associations with inflammatory proteins. The EpiScore and measured g both associated with levels of IL-6 (a well-established correlate of cognitive ability [19]) and IL18.R1 is a subunit of the IL18 receptor that, with IL18, participates in neuroinflammatory and neurodegenerative processes [20]. The concordance of effect size estimates between the EpiScore and measured g variables with multiple outcomes highlights the utility of the former as a potential surrogate measure. The EpiScore-health findings mirrored associations between measured g and health across all assessed modalities. This included brain MRI, blood protein levels and general lifestyle outcomes. Higher EpiScores were linked to greater total brain, grey matter and white matter volumes, lower levels of neuroinflammatory proteins that have been linked to poorer brain health, and more positive lifestyle patterns (e.g. faster walking speed, greater grip strength, lower BMI and lower prevalence of hypertension).
We used a large homogeneous discovery cohort with consistent cognitive testing and DNA collection across all participants. This will minimise the biases that are an inherent problem in heterogeneous and small EWAS meta-analyses. The estimation of posterior effect sizes using BayesR+ has been shown to be robust to various potential sources of heterogeneity, including family structure [21] and population structure based on genetic or epigenetic principal components [6]. Nonetheless, replication of our findings in cohorts of different ages and backgrounds, and at different stages of neuro-development/degeneration will help to refine and generalise the estimates presented here. Whereas a near unit correlation was observed between the cognitive score residuals with and without a quadratic age term, modelling non-linear relationships may also aid locus discovery and improve external predictions.
Methylation-based predictors of cognitive function may improve longitudinal disease prediction and risk profiling of neurodegenerative health outcomes such as dementia. Here, the epigenetic variance components and prediction score were independent of genetic contributions; the EpiScore also reflected measured cognitive ability in its associations with a variety of biological and health-based traits. Unlike DNA differences which are largely fixed throughout life, DNAm differences may reflect environmental effects and phenotypic causation, directly through cognitive-related pathways or indirectly via related lifestyle and health outcomes. The EpiScore can therefore reflect consequences of processes linked to cognitive health. This is evident by the attenuation of the incremental R2 estimate from 3.4 to 0.8% in a multiple regression model that included age and sex plus 10 covariates that are established correlates/risk factors for cognitive ability and cognitive decline. The correlation with these risk factors and the independent contribution of the EpiScore from the polygenic score for prediction of measured g emphasises the environmental variance being captured by the EpiScore. Given the overwhelming contribution of CpGs with small effects to our estimates, increasing EWAS sample sizes will likely lead to both locus discovery and more accurate DNAm-based predictors of cognitive function. Future studies should investigate prospective associations of the EpiScore relative to measured g. Even if the former is capturing processes related to or downstream of cognitive function, it may still provide information for assessing risk of neurodegeneration. Whereas the associations of the current EpiScore with lifestyle and MRI variables are more modest than those observed for measured g, this gap is likely to narrow as the sample size of the training set increases. This has potential implications for studies of cognitive function across the lifespan where pen-and-paper testing is not possible or unreliable, such as during neurodevelopment or neurodegeneration. In the future, cognitive EpiScores may be help to monitor decline in brain health and to stratify individuals into risk groups years prior to a clinical diagnosis of dementia.
Conclusions
As sample sizes increase, our ability to assess cognitive function from DNAm data may be informative in settings where cognitive testing is unreliable or unavailable.
Methods
The Generation Scotland Cohort
Details of the Generation Scotland: Scottish Family Health Study (GS) have been described in detail elsewhere [9, 22]. Briefly, GS comprises over 20,000 individuals comprehensively profiled for genetic, clinical, lifestyle and sociodemographic data. Around 8000 participants aged between 35 and 65 years were initially recruited through GP surgeries from five regions across Scotland. These participants were then asked to invite family members to join the study. Recruitment took place between 2006 and 2011 and the structure included relatives from up to three generations per family (around 5600 families participating in total). The age range of the cohort at the study baseline was 18 to 99 years. A subset of 9162 individuals from GS (aged 18 to 93 years, mean=49.7, SD=13.6) had genome-wide DNA methylation measured [23]. This subset was processed in two batches, hereafter referred to as “Set 1” and “Set 2”.
Methylation preparation in Generation Scotland
Quality control was performed on Illumina HumanMethylationEPIC BeadChip DNA methylation data from blood samples of 5200 related individuals from Set 1, and 4583 genetically unrelated individuals from Set 2, also genetically unrelated to those in Set 1. Three Set 1 individuals who had answered “yes” to presence of all of 16 self-reported disease conditions in the study’s health questionnaire were excluded from the analysis. Filtering for outliers (NSet1=80; NSet2=83), sex mismatches (NSet1=19; NSet2=12), non-blood samples (NSet1=13), and poorly detected samples was performed (NSet1=18) [23]. Further filtering was then carried out to remove, non-autosomal and non-CpG sites (N=22,163), CpGs with missing values and poorly-detected CpGs (NSet1=5910; NSet2=8878). Five individuals with a self-reported diagnosis of Alzheimer’s disease were removed, along with those with missing covariate information or cognitive variables. Following filtering, 9162 complete cases remained comprising 4901 Set 1 individuals and 4261 Set 2 individuals, with 764,525 CpGs in common between the two.
Cognitive Phenotypes in Generation Scotland
Six cognitive phenotypes were assessed in this study: logical memory, digit symbol test score, verbal fluency, vocabulary, general cognitive ability, and general fluid cognitive ability. The logical memory phenotype (verbal declarative memory) was calculated from the Wechsler Logical Memory test, taking the sum of immediate and delayed recall of one oral story [24]. The digit symbol phenotype is often used as a measure of processing speed and was calculated from the Wechsler Digit Symbol Substitution test in which participants must recode digits to symbols over a 120 second period [25]. The verbal fluency phenotype is often used as a measure of executive functioning and was derived from the phonemic verbal fluency test, using the letters C, F and L, each for 1 min [26]. Vocabulary was measured using the Mill Hill Vocabulary Scale, junior and senior synonyms combined [27]. Outliers were defined as scores >3.5 standard deviations above or below the mean and were removed prior to analysis. General fluid cognitive ability (gf) was calculated from the first unrotated principal component of logical memory, verbal fluency and digit symbol tests. General cognitive ability (g) was derived from the first unrotated principal component from the same variables plus vocabulary.
The Lothian Birth Cohorts of 1921 and 1936
The mean posterior effect sizes for the EWAS model of general cognitive function, g, were used to generate an epigenetic predictor in two independent datasets, the Lothian Birth Cohort 1936 (LBC1936) and the Lothian Birth Cohort 1921 (LBC1921).
The LBC are studies of cognitive ageing in older adults from the area around Edinburgh, Scotland [28, 29]. Briefly, participants were born in either 1921 or 1936 and completed the Scottish Mental Survey of 1932 or 1947 at age 11. From age 70 (LBC1936) and age 79 (LBC1921), they were assessed triennially for a variety of health and lifestyle outcomes, with DNA collected at each visit.
Methylation preparation in the Lothian Birth Cohorts
We considered blood-based DNA methylation data from the age 70 (LBC1936) and age 79 (LBC1921) samples. Methylation was assessed on the Illumina 450k array—the predecessor of the EPIC array. Processing and quality control have been described previously [1, 30, 31]. This included steps to remove methylation samples and individuals with poor quality control measures, along with individuals who had mismatching genotypes or predicted sex information. DNA methylation was measured at three time points (set 1, set 2 and set 3) and comprised 2,195, 996 and 552 samples, respectively. Prior to quality control, each set had 485,512 CpGs. Twenty-three duplicate samples were removed from set 2. Set 1 and set 2 had 123 duplicates between them, and a sample was removed from each duplicate pair (108 from set 1, 15 from set 2). Sets 1 and 2 were then combined (set12). Ten duplicates were removed from set 3. There were also 31 duplicates between set 3 and set 12. 26 samples were removed from set 3 and five were removed from set12. The three sets were combined (set123) and comprised 3556 samples. Samples and CpGs were filtered on low call rates (CpGs with a detection p value greater than 0.01), with a threshold of 95% for both samples and CpGs. 3525 samples and 470,278 CpGs remained after this step. Finally, sex chromosome CpGs were removed, leaving a dataset comprising 459,309 CpGs and 3525 samples. The current study used a subset from this dataset comprising 381,846 CpGs (overlapping with those included in the Generation Scotland analyses) for 861 LBC1936 (436 LBC1921) individuals—34 LBC1936 individuals were excluded due to DNAm being assessed as part of a separate analysis batch.
Cognitive Epigenetic Score (EpiScore) in the Lothian Birth Cohorts of 1921 and 1936
Within each LBC study, each CpG was scaled to mean 0, variance 1 with missing values mean imputed (i.e. set to 0) prior to multiplication by the mean CpG weights (for all available CpGs) and summation to give the epigenetic score.
Cognitive Polygenic Score in the Lothian Birth Cohorts
A polygenic score for cognitive ability was derived from Z scores from all possible SNPs (GWAS P≤1) in a UK Biobank GWAS of verbal numerical reasoning (n=168,033) [32] and applied to LBC1936 and LBC1921 genotype data using default settings in the PRSice software [33, 34]. The P≤1 threshold for polygenic scores of cognitive ability has similar predictive performance as scores built using more conservative thresholds [35].
Cognitive Phenotypes in the Lothian Birth Cohorts
In LBC1936 at Wave 1 (mean age 70 years), general cognitive ability, g, was defined as the first unrotated principal component (that accounted for 52% of the variance) from a PCA of six cognitive tests from the Wechsler Adult Intelligence Scale-III UK (matrix reasoning, letter number sequencing, block design, symbol search, digit symbol and digit span backward) [25] plus a test of vocabulary (National Adult Reading Test) [36].
A similar approach was taken in LBC1921 at Wave 1 (mean age 79 years), where we considered the first unrotated principal component (that also accounted for 51% of the variance) from a PCA of four cognitive tests: Raven’s Standard Progressive Matrices [27], letter-number sequencing [25], digit symbol coding [25], and the National Adult Reading Test [36]). The cognitive tests were completed at the same visit that the blood was drawn for DNA profiling in both LBC studies.
Statistical analysis
Variance components analysis and EWAS using BayesR+
BayesR+ was used for the EWAS and to estimate the proportion of variance in cognitive traits explained by genetics and DNA methylation. BayesR+ is a software implemented in C++ for performing Bayesian penalised regression and Gaussian mixture-based variance partitioning on complex traits [6]. The joint and conditional effects of methylation sites (n=764,525) on cognitive traits were examined. Phenotypic and methylation data were scaled to mean zero and unit variance. The prior distribution comprised a series of Gaussian distributions that corresponded to effect sizes of different magnitudes (i.e. methylation sites with small, medium and large effect sizes), as well as a discrete spike at zero which allows for the omission of probes with non-identifiable effects. The prior mixture variances were set to 0.0001, 0.001 and 0.01. Phenotypes were corrected for age, sex, BMI and epigenetic smoking score [10], and the DNA methylation data were corrected for batch, age, sex and epigenetic smoking. Adding a quadratic term for age in the correction of cognitive phenotypes made negligible differences to the trait residuals (r>0.98 between linear model residuals for each trait).
Estimating the variance components and individual CpG effects
To obtain estimates of variance accounted for in cognitive traits by methylation data and individual CpG associations with the cognitive test scores, Gibbs sampling was performed to sample over the posterior distribution conditional on the input data. The Gibbs algorithm consisted of 10,000 samples and 5000 samples of burn-in after which a thinning of 5 samples was applied to reduce autocorrelation. The process was repeated over four chains, initializing a different random number seed for each chain. The last 250 iterations from each chain were combined for downstream analyses. For the EWAS, CpGs within 2.5kb and highly correlated (absolute Pearson correlation >0.5) with a lead CpG with posterior inclusion probability greater than 0.2 were grouped together. For each probe group, we calculated the proportion of iterations for which at least one probe was included in the model, yielding the group posterior inclusion probability. We then calculated the average (across the 1000 iterations) sum of the squared regression coefficients for the probe group to give the contribution of the group to the total variance. Finally, we highlighted the lead CpG for the groups where the combined posterior inclusion probability was >0.80. The variance components estimates are taken as the mean sum of squared standardised mean posterior effect sizes across the 1000 iterations with the 2.5%ile and 97.5%ile forming the 95% credible interval.
Sensitivity analyses
Sensitivity analyses were performed using OSCA to estimate the proportion of variance explained in cognitive tests by epigenetic data. OSCA is a software tool designed for the analysis of complex traits using multiple omics data types, including genome-wide DNAm data. Omics relationship matrices (ORMs) were estimated using all probes to determine inter-individual relationships, analogous to a genomic relationship matrix (GRM), which is used to estimate SNP-based heritability. ORMs were computed separately using DNAm data from Generation Scotland Set 2 (unrelated individuals) and Sets 1 and 2 combined. Each ORM was subsequently fitted to a mixed linear model to estimate the variance explained by all DNAm probes using the restricted maximum likelihood (REML) method. These analyses were undertaken to confirm that the results obtained were not a function of the analysis method used.
GWAS and combined GWAS/EWAS analyses using BayesR+
Genetic effects at 560,797 SNPs (minor allele frequency > 1%; scaled to mean 0, variance 1) from the Illumina HumanOmniExpressExome-8 v1.0 Bead Chip or Illumina HumanOmniExpressExome-8 v1.2 Bead Chip were examined [37], setting prior mixture variances to 0.00001, 0.0001 and 0.001. For the genetic analysis, phenotypes were pre-corrected for age, sex and 20 genetic PCs. To estimate the additive and independent effects of DNAm and genetic data on complex traits, a combined analysis was run—using the phenotype corrections specified for the EWAS model—setting prior mixture variances as above.
EpiScore associations with measured g in the Lothian Birth Cohorts
There were 844 LBC1936 (427 LBC1921) individuals with cognitive, epigenetic and polygenic score data. Linear regression was used to test for an association between the predicted epigenetic score (predictor) and measured general cognitive ability (outcome), in models adjusting for age, sex and the polygenic score.
EpiScore and measured g associations with common risk factors in the Lothian Birth Cohorts
Age- and sex-adjusted linear regression models were used to relate the Epigenetic Score (EpiScore) for g and the measured g score (predictors) with common risk factors (outcomes) for cognitive decline, frailty and dementia. Each outcome was modelled independently in a separate regression model. The outcomes studied were body mass index (BMI in kg/m2); years of education; self-reported smoking (ever versus never); self-reported weekly units of alcohol; self-reported high blood pressure (yes/no); lung function (forced expiratory volume in one second) adjusted for age, sex, and height; time taken to walk 6 m (seconds); socioeconomic deprivation (Scottish Index of Multiple Deprivation in LBC1936 and social grades based on highest reached occupation [38] in LBC1921); and depression (HADS-D total from the Hospital Anxiety and Depression questionnaire) [28, 39, 40]. Prior to the analyses, BMI, 6-m walk time and units of alcohol were log transformed to reduce skew—a constant of one was added to the alcohol units before the log transformation.
EpiScore and measured g associations with neuroinflammatory proteins levels in the Lothian Birth Cohort 1936
An Olink panel of inflammation-related proteins [41], measured on blood samples at age 70 years in LBC1936, were related to both EpiScore g and measured g in age- and sex-adjusted linear regression models. An additional panel of Olink neurology-related proteins [17], measured on blood samples at age 73 years in LBC1936, were related to EpiScore g, which was derived from DNAm assessed from the same sample (analysis n=701). Quality control of the DNAm was identical to the age 70 samples. Processing occurred in two sets (NSet1=256, NSet2=445) where CpG sites were independently scaled to mean zero and variance one, prior to combining into a single variable. Each of the 70 inflammatory and 90 neurology proteins were adjusted via rank-based inverse normal transformations and regressed on age, sex and four genetic ancestry components as previously described [42]. Linear regression model assumptions were visually inspected via regression diagnostic plots.
EpiScore and measured g associations with brain MRI variables in the Lothian Birth Cohort 1936
Structural and diffusion tensor (DTI) MRI acquisition and processing in LBC1936 were performed at Wave 2 (age 73 years) according to an open-access protocol [43]. Total brain, grey matter and normal-appearing white matter (NAWM) volumes were calculated using a semi-automated multi-spectral fusion method [44]. Intracranial volume was determined semi-automatically using Analyze 11.0TM. White matter microstructural parameters fractional anisotropy (FA) and mean diffusivity (MD) were derived for 12 major tracts of interest: corpus callosum genu and splenium, bilateral frontal cingulum, arcuate, uncinate and superior longitudinal fasciculi and bilateral anterior thalamic radiation. These were obtained using probabilistic neighbourhood tractography in TractoR (https://www.tractor-mri.org.uk) [45, 46] as applied to BEDPOSTX/ProbTrackX in FSL (https://fsl.fmrib.ox.ac.uk) [47]. Participants were excluded if they had self-reported history of dementia or signs of cognitive impairment (Mini Mental State Examination score < 24/30); after exclusions, a total of 590 participants had complete cognitive, epigenetic and global neuroimaging data, and of these, 551 participants had complete and vertex-wise neuroimaging data. Localised associations between cognitive measures and vertex-wise cortical volume and thickness were performed using linear regression, controlling for age, sex and intracranial volume. The SurfStat MATLAB toolbox (http://www.math.mcgill.ca/keith/surfstat) for Matrix Laboratory R2012a (The MathWorks, Inc., Natick, MA, USA) was used to carry out analyses. Statistical maps of association magnitude and valence (t-maps) and significance (q maps; p values corrected for multiple comparisons using a false discovery rate with a q value of 0.05 across all 327,684 vertices on the cortical surface) were presented.
Supplementary Information
Acknowledgements
Not applicable.
Peer review information
Stephanie McClelland and Anahita Bishop were the primary editors of this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.
Review history
The review history for this manuscript is available as Additional file 4.
Authors’ contributions
DLM, RFH, ELS and REM analysed and interpreted the data; DTB, DAG, RMW, CN, RF, AC, SMM, MCVH, MAH, MEB, JMW and SEH helped with data preparation. ADM, MEB, JMW, SHE, DJP, EMT-D, AMM, KLE, IJD and SRC were involved in data generation. DLM and REM drafted the article with substantial input from RFH, ELS, EMT-D, SRC and MRR. MRR and REM designed study. The authors read and approved the final manuscript.
Funding
GS received core support from the Chief Scientist Office of the Scottish Government Health Directorates (CZD/16/6) and the Scottish Funding Council (HR03006). Genotyping and DNA methylation profiling of the GS samples was carried out by the Genetics Core Laboratory at the Edinburgh Clinical Research Facility, Edinburgh, Scotland, and was funded by the Medical Research Council UK and the Wellcome Trust (Wellcome Trust Strategic Award STratifying Resilience and Depression Longitudinally (STRADL; Reference 104036/Z/14/Z). The DNA methylation data assayed for Generation Scotland was partially funded by a 2018 NARSAD Young Investigator Grant from the Brain & Behavior Research Foundation (Ref: 27404; awardee: Dr David M Howard) and by a JMAS SIM fellowship from the Royal College of Physicians of Edinburgh (Awardee: Dr Heather C Whalley). LBC1936 MRI brain imaging was supported by Medical Research Council (MRC) grants [G0701120], [G1001245], [MR/M013111/1] and [MR/R024065/1]. Magnetic resonance image acquisition and analyses were conducted at the Brain Research Imaging Centre, Neuroimaging Sciences, University of Edinburgh (www.bric.ed.ac.uk) which is part of SINAPSE (Scottish Imaging Network: A Platform for Scientific Excellence) collaboration (www.sinapse.ac.uk) funded by the Scottish Funding Council and the Chief Scientist Office. This work was supported by the European Union Horizon 2020 (PHC.03.15, project No 666881), SVDs@Target, the Fondation Leducq Transatlantic Network of Excellence for the Study of Perivascular Spaces in Small Vessel Disease [ref no. 16 CVD 05]. We thank the LBC1936 participants and team members who contributed to these studies. The LBC1936 is supported by Age UK (Disconnected Mind project, which supports S.E.H.), the Medical Research Council (G0701120, G1001245, MR/M013111/1, MR/R024065/1) and the University of Edinburgh. Methylation typing of LBC1936 was supported by the Centre for Cognitive Ageing and Cognitive Epidemiology (Pilot Fund award), Age UK, The Wellcome Trust Institutional Strategic Support Fund, The University of Edinburgh, and The University of Queensland. Genotyping was funded by the Biotechnology and Biological Sciences Research Council (BB/F019394/1). Proteomic analyses in LBC1936 were supported by the Age UK grant and NIH Grants R01AG054628 and R01AG05462802S1. M.V.H. is funded by the Row Fogo Charitable Trust (Grant no. BROD.FID3668413). J.M.W is supported by the UK Dementia Research Institute which receives its funding from DRI Ltd, funded by the UK Medical Research Council, Alzheimers Society and Alzheimers Research UK. R.F.H., E.L.S.C and D.A.G. are supported by funding from the Wellcome Trust 4 year PhD in Translational Neuroscience: training the next generation of basic neuroscientists to embrace clinical research [108890/Z/15/Z]. E.M.T.D. was supported by the National Institutes of Health (NIH) grants R01AG054628, R01MH120219, R01HD083613, P2CHD042849 and P30AG066614. S.R.C. was also supported by a National Institutes of Health (NIH) research grant R01AG054628 and is supported by a Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (Grant Number 221890/Z/20/Z). D.L.Mc.C. and R.E.M. are supported by Alzheimers Research UK major project grant ARUK/PG2017B/10. R.E.M. is supported by Alzheimer’s Society major project grant AS-PG-19b-010.
This research was funded in whole, or in part, by Wellcome [104036/Z/14/Z and 108890/Z/15/Z]. For the purpose of open access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Availability of data and materials
According to the terms of consent for GS participants, access to data must be reviewed by the GS Access Committee. Applications should be made to access@generationscotland.org.
Lothian Birth Cohort data are available on request from the Lothian Birth Cohort Study, University of Edinburgh (simon.cox@ed.ac.uk). Lothian Birth Cohort data are not publicly available due to them containing information that could compromise participant consent and confidentiality.
All code is available with open access at the following GitHub repository https://github.com/marioni-group/ewas_of_cognitive_function (Zenodo 10.5281/zenodo.5794175 [48]).
Declarations
Ethics approval and consent to participate
All components of GS received ethical approval from the NHS Tayside Committee on Medical Research Ethics (REC Reference Number: 05/S1401/89). GS has also been granted Research Tissue Bank status by the East of Scotland Research Ethics Service (REC Reference Number: 20-ES-0021), providing generic ethical approval for a wide range of uses within medical research.
Ethical approval for the LBC1921 and LBC1936 studies was obtained from the Multi-Centre Research Ethics Committee for Scotland (MREC/01/0/56) and the Lothian Research Ethics committee (LREC/1998/4/183; LREC/2003/2/29). In both studies, all participants provided written informed consent. These studies were performed in accordance with the Helsinki declaration.
Consent for publication
Not applicable
Competing interests
R.E.M has received a speaker fee from Illumina and is an advisor to the Epigenetic Clock Development Foundation. A.M.M has previously received speaker fees from Janssen and Illumina and research funding from The Sackler Trust. All other authors declare no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Daniel L. McCartney, Email: daniel.mccartney@ed.ac.uk
Robert F. Hillary, Email: robert.hillary@ed.ac.uk
Eleanor L. S. Conole, Email: eleanor.conole@ed.ac.uk
Daniel Trejo Banos, Email: daniel.trejobanos@uzh.ch.
Danni A. Gadd, Email: danni.gadd@ed.ac.uk
Rosie M. Walker, Email: rosie.walker@ed.ac.uk
Cliff Nangle, Email: c.nangle@ed.ac.uk.
Robin Flaig, Email: robin.flaig@ed.ac.uk.
Archie Campbell, Email: archie.campbell@ed.ac.uk.
Alison D. Murray, Email: a.d.murray@abdn.ac.uk
Susana Muñoz Maniega, Email: sarah.harris@ed.ac.uk.
Sarah E. Harris, Email: sarah.harris@ed.ac.uk
David J. Porteous, Email: david.porteous@ed.ac.uk
Elliot M. Tucker-Drob, Email: tuckerdrob@utexas.edu
Andrew M. McIntosh, Email: andrew.mcintosh@ed.ac.uk
Kathryn L. Evans, Email: kathy.evans@ed.ac.uk
Ian J. Deary, Email: simon.cox@ed.ac.uk
Simon R. Cox, Email: simon.cox@ed.ac.uk
Matthew R. Robinson, Email: matthew.robinson@ist.ac.at
Riccardo E. Marioni, Email: riccardo.marioni@ed.ac.uk
References
- 1.McCartney DL, Hillary RF, Stevenson AJ, Ritchie SJ, Walker RM, Zhang Q, et al. Epigenetic prediction of complex traits and death. Genome Biol. 2018;19(1):136. doi: 10.1186/s13059-018-1514-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Livingston G, Huntley J, Sommerlad A, Ames D, Ballard C, Banerjee S, Brayne C, Burns A, Cohen-Mansfield J, Cooper C, Costafreda SG, Dias A, Fox N, Gitlin LN, Howard R, Kales HC, Kivimäki M, Larson EB, Ogunniyi A, Orgeta V, Ritchie K, Rockwood K, Sampson EL, Samus Q, Schneider LS, Selbæk G, Teri L, Mukadam N. Dementia prevention, intervention, and care: 2020 report of the Lancet Commission. Lancet Elsevier. 2020;396(10248):413–446. doi: 10.1016/S0140-6736(20)30367-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Lunnon K, Smith R, Hannon E, De Jager PL, Srivastava G, Volta M, et al. Methylomic profiling implicates cortical deregulation of ANK1 in Alzheimer’s disease. Nat Neurosci. 2014;17(9):1164–1170. doi: 10.1038/nn.3782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lunnon K, Smith R, Hannon E, De Jager P, Srivastava G, Volta M, et al. Cross-tissue methylomic profiling strongly implicates a role for cortex-specific deregulation of ANK1 in Alzheimer’s disease neuropathology. Nat Neurosci. 2014;17(9):1164–1170. doi: 10.1038/nn.3782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zhang F, Chen W, Zhu Z, Zhang Q, Nabais MF, Qi T, Deary IJ, Wray NR, Visscher PM, McRae AF, Yang J. OSCA: a tool for omic-data-based complex trait analysis. Genome Biol. 2019;20(1):107. doi: 10.1186/s13059-019-1718-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Trejo Banos D, McCartney DL, Patxot M, Anchieri L, Battram T, Christiansen C, et al. Bayesian reassessment of the epigenetic architecture of complex traits. Nat Commun. 2020;11(1):2865. doi: 10.1038/s41467-020-16520-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Marioni RE, McRae AF, Bressler J, Colicino E, Hannon E, Li S, et al. Meta-analysis of epigenome-wide association studies of cognitive abilities. Mol Psychiatr. 2018;23(11):2133–2144. doi: 10.1038/s41380-017-0008-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.McCartney DL, Zhang F, Hillary RF, Zhang Q, Stevenson AJ, Walker RM, et al. An epigenome-wide association study of sex-specific chronological ageing. Genome Med. 2019;12(1):1. doi: 10.1186/s13073-019-0693-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Smith BH, Campbell A, Linksted P, Fitzpatrick B, Jackson C, Kerr SM, Deary IJ, MacIntyre DJ, Campbell H, McGilchrist M, Hocking LJ, Wisely L, Ford I, Lindsay RS, Morton R, Palmer CNA, Dominiczak AF, Porteous DJ, Morris AD. Cohort Profile: Generation Scotland: Scottish Family Health Study (GS:SFHS). The study, its participants and their potential for genetic research on health and illness. Int J Epidemiol. 2013;42(3):689–700. doi: 10.1093/ije/dys084. [DOI] [PubMed] [Google Scholar]
- 10.Bollepalli S, Korhonen T, Kaprio J, Anders S, Ollikainen M. EpiSmokEr: a robust classifier to determine smoking status from DNA methylation data. Epigenomics. 2019;11(13):1469–1486. doi: 10.2217/epi-2019-0206. [DOI] [PubMed] [Google Scholar]
- 11.Marioni RE, Davies G, Hayward C, Liewald D, Kerr SM, Campbell A, Luciano M, Smith BH, Padmanabhan S, Hocking LJ, Hastie ND, Wright AF, Porteous DJ, Visscher PM, Deary IJ. Molecular genetic contributions to socioeconomic status and intelligence. Intelligence. 2014;44(100):26–32. doi: 10.1016/j.intell.2014.02.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.McCartney DL, Hillary RF, Conole EL, Trejo Banos D, Gadd DA, Walker RM, et al. Blood-based epigenome-wide analyses of cognitive abilities. Zenodo. https://zenodo.org/record/5794029. 2021. [DOI] [PMC free article] [PubMed]
- 13.MRC-IEU EWAS Catalog. Available from:http://www.ewascatalog.org/. Accessed 5 April 2021.
- 14.Saffari A, Silver MJ, Zavattari P, Moi L, Columbano A, Meaburn EL, Dudbridge F. Estimation of a significance threshold for epigenome-wide association studies. Genet Epidemiol. 2018;42(1):20–33. doi: 10.1002/gepi.22086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Roubroeks JAY, Smith AR, Smith RG, Pishva E, Ibrahim Z, Sattlecker M, Hannon EJ, Kłoszewska I, Mecocci P, Soininen H, Tsolaki M, Vellas B, Wahlund LO, Aarsland D, Proitsi P, Hodges A, Lovestone S, Newhouse SJ, Dobson RJB, Mill J, van den Hove DLA, Lunnon K. An epigenome-wide association study of Alzheimer’s disease blood highlights robust DNA hypermethylation in the HOXB6 gene. Neurobiol Aging. 2020;95:26–45. doi: 10.1016/j.neurobiolaging.2020.06.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Daetwyler HD, Villanueva B, Woolliams JA. Accuracy of predicting the genetic risk of disease using a genome-wide approach. PLoS ONE. 2008;3:e3395. doi: 10.1371/journal.pone.0003395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Harris SE, Cox SR, Bell S, Marioni RE, Prins BP, Pattie A, Corley J, Muñoz Maniega S, Valdés Hernández M, Morris Z, John S, Bronson PG, Tucker-Drob EM, Starr JM, Bastin ME, Wardlaw JM, Butterworth AS, Deary IJ. Neurology-related protein biomarkers are associated with cognitive ability and brain volume in older age. Nat Commun. 2020;11(1):800. doi: 10.1038/s41467-019-14161-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Corley J, Cox SR, Deary IJ. Healthy cognitive ageing in the Lothian Birth Cohort studies: marginal gains not magic bullet. Psychol Med. 2018;48(2):187–207. doi: 10.1017/S0033291717001489. [DOI] [PubMed] [Google Scholar]
- 19.McAfoose J, Baune BT. Evidence for a cytokine model of cognitive function. Neurosci Biobehav Rev. 2009;33(3):355–366. doi: 10.1016/j.neubiorev.2008.10.005. [DOI] [PubMed] [Google Scholar]
- 20.Alboni S, Cervia D, Sugama S, Conti B. Interleukin 18 in the CNS. J Neuroinflammation. 2010;7(1):9. doi: 10.1186/1742-2094-7-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Patxot M, Banos DT, Kousathanas A, Orliac EJ, Ojavee SE, Moser G, Holloway A, Sidorenko J, Kutalik Z, Mägi R, Visscher PM, Rönnegård L, Robinson MR. Probabilistic inference of the genetic architecture underlying functional enrichment of complex traits. Nat Commun. 2021;12(1):6972. doi: 10.1038/s41467-021-27258-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Smith BH, Campbell H, Blackwood D, Connell J, Connor M, Deary IJ, Dominiczak AF, Fitzpatrick B, Ford I, Jackson C, Haddow G, Kerr S, Lindsay R, McGilchrist M, Morton R, Murray G, Palmer CNA, Pell JP, Ralston SH, St Clair D, Sullivan F, Watt G, Wolf R, Wright A, Porteous D, Morris AD. Generation Scotland: the Scottish Family Health Study; a new resource for researching genes and heritability. BMC Med Genet. 2006;7(1):74. doi: 10.1186/1471-2350-7-74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Madden RA, McCartney DL, Walker RM, Hillary RF, Bermingham ML, Rawlik K, et al. Birth weight associations with DNA methylation differences in an adult population. Epigenetics Taylor Francis. 2021;16(7):783–796. doi: 10.1080/15592294.2020.1827713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wechsler D. WMS-III: Wechsler memory scale administration and scoring manual. San Antonio, Tex: Psychological Corp.; 1997. [Google Scholar]
- 25.Wechsler D. Psychological Corporation. WAIS-III: administration and scoring manual : Wechsler Adult Intelligence Scale. 3. San Antonio, Tex: Psychological Corporation; 1997. [Google Scholar]
- 26.Lezak MD. Neuropsychological assessment. 3. New York, NY, US: Oxford University Press; 1995. pp. xviii–1026. [Google Scholar]
- 27.Raven JC. Manual for Raven’s progressive matrices and vocabulary scales. Oxford: Psychologists Press; 1972. [Google Scholar]
- 28.Deary IJ, Gow AJ, Pattie A, Starr JM. Cohort profile: the Lothian Birth Cohorts of 1921 and 1936. Int J Epidemiol. 2012;41(6):1576–1584. doi: 10.1093/ije/dyr197. [DOI] [PubMed] [Google Scholar]
- 29.Taylor AM, Pattie A, Deary IJ. Cohort Profile Update: The Lothian Birth Cohorts of 1921 and 1936. Int J Epidemiol. 2018;47:1042–1042r. doi: 10.1093/ije/dyy022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Shah S, McRae AF, Marioni RE, Harris SE, Gibson J, Henders AK, et al. Genetic and environmental exposures constrain epigenetic drift over the human life course. Genome Res. 2014;24(11):1725–1733. doi: 10.1101/gr.176933.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Marioni RE, Shah S, McRae AF, Chen BH, Colicino E, Harris SE, et al. DNA methylation age of blood predicts all-cause mortality in later life. Genome Biol. 2015;16(1):25. doi: 10.1186/s13059-015-0584-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Davies G, Lam M, Harris SE, Trampush JW, Luciano M, Hill WD, Hagenaars SP, Ritchie SJ, Marioni RE, Fawns-Ritchie C, Liewald DCM, Okely JA, Ahola-Olli AV, Barnes CLK, Bertram L, Bis JC, Burdick KE, Christoforou A, DeRosse P, Djurovic S, Espeseth T, Giakoumaki S, Giddaluru S, Gustavson DE, Hayward C, Hofer E, Ikram MA, Karlsson R, Knowles E, Lahti J, Leber M, Li S, Mather KA, Melle I, Morris D, Oldmeadow C, Palviainen T, Payton A, Pazoki R, Petrovic K, Reynolds CA, Sargurupremraj M, Scholz M, Smith JA, Smith AV, Terzikhan N, Thalamuthu A, Trompet S, van der Lee SJ, Ware EB, Windham BG, Wright MJ, Yang J, Yu J, Ames D, Amin N, Amouyel P, Andreassen OA, Armstrong NJ, Assareh AA, Attia JR, Attix D, Avramopoulos D, Bennett DA, Böhmer AC, Boyle PA, Brodaty H, Campbell H, Cannon TD, Cirulli ET, Congdon E, Conley ED, Corley J, Cox SR, Dale AM, Dehghan A, Dick D, Dickinson D, Eriksson JG, Evangelou E, Faul JD, Ford I, Freimer NA, Gao H, Giegling I, Gillespie NA, Gordon SD, Gottesman RF, Griswold ME, Gudnason V, Harris TB, Hartmann AM, Hatzimanolis A, Heiss G, Holliday EG, Joshi PK, Kähönen M, Kardia SLR, Karlsson I, Kleineidam L, Knopman DS, Kochan NA, Konte B, Kwok JB, le Hellard S, Lee T, Lehtimäki T, Li SC, Lill CM, Liu T, Koini M, London E, Longstreth WT, Jr, Lopez OL, Loukola A, Luck T, Lundervold AJ, Lundquist A, Lyytikäinen LP, Martin NG, Montgomery GW, Murray AD, Need AC, Noordam R, Nyberg L, Ollier W, Papenberg G, Pattie A, Polasek O, Poldrack RA, Psaty BM, Reppermund S, Riedel-Heller SG, Rose RJ, Rotter JI, Roussos P, Rovio SP, Saba Y, Sabb FW, Sachdev PS, Satizabal CL, Schmid M, Scott RJ, Scult MA, Simino J, Slagboom PE, Smyrnis N, Soumaré A, Stefanis NC, Stott DJ, Straub RE, Sundet K, Taylor AM, Taylor KD, Tzoulaki I, Tzourio C, Uitterlinden A, Vitart V, Voineskos AN, Kaprio J, Wagner M, Wagner H, Weinhold L, Wen KH, Widen E, Yang Q, Zhao W, Adams HHH, Arking DE, Bilder RM, Bitsios P, Boerwinkle E, Chiba-Falek O, Corvin A, de Jager PL, Debette S, Donohoe G, Elliott P, Fitzpatrick AL, Gill M, Glahn DC, Hägg S, Hansell NK, Hariri AR, Ikram MK, Jukema JW, Vuoksimaa E, Keller MC, Kremen WS, Launer L, Lindenberger U, Palotie A, Pedersen NL, Pendleton N, Porteous DJ, Räikkönen K, Raitakari OT, Ramirez A, Reinvang I, Rudan I, Dan Rujescu, Schmidt R, Schmidt H, Schofield PW, Schofield PR, Starr JM, Steen VM, Trollor JN, Turner ST, van Duijn CM, Villringer A, Weinberger DR, Weir DR, Wilson JF, Malhotra A, McIntosh AM, Gale CR, Seshadri S, Mosley TH, Jr, Bressler J, Lencz T, Deary IJ. Study of 300,486 individuals identifies 148 independent genetic loci influencing general cognitive function. Nat Commun. 2018;9(1):2098. doi: 10.1038/s41467-018-04362-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Euesden J, Lewis CM, O’Reilly PF. PRSice: Polygenic Risk Score software. Bioinforma Oxf Engl. 2015;31(9):1466–1468. doi: 10.1093/bioinformatics/btu848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ritchie SJ, Hill WD, Marioni RE, Davies G, Hagenaars SP, Harris SE, Cox SR, Taylor AM, Corley J, Pattie A, Redmond P, Starr JM, Deary IJ. Polygenic predictors of age-related decline in cognitive ability. Mol Psychiatr. 2020;25(10):2584–2598. doi: 10.1038/s41380-019-0372-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Hill WD, Marioni RE, Maghzian O, Ritchie SJ, Hagenaars SP, McIntosh AM, et al. A combined analysis of genetically correlated traits identifies 187 loci and a role for neurogenesis and myelination in intelligence. Mol Psychiatr. 2019;24(2):169–181. doi: 10.1038/s41380-017-0001-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Nelson HE, Willison J. National adult reading test (NART) NFER-Nelson: Windsor; 1991. [Google Scholar]
- 37.Howard DM, Hall LS, Hafferty JD, Zeng Y, Adams MJ, Clarke T-K, Porteous DJ, Nagy R, Hayward C, Smith BH, Murray AD, Ryan NM, Evans KL, Haley CS, Deary IJ, Thomson PA, McIntosh AM. Genome-wide haplotype-based association analysis of major depressive disorder in Generation Scotland and UK Biobank. Transl Psychiatr. 2017;7(11):1–9. doi: 10.1038/s41398-017-0010-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.General. Register Office. Census 1951: Classification of occupations. London: HMSO; 1956.
- 39.Zigmond AS, Snaith RP. The hospital anxiety and depression scale. Acta Psychiatr Scand. 1983;67(6):361–370. doi: 10.1111/j.1600-0447.1983.tb09716.x. [DOI] [PubMed] [Google Scholar]
- 40.Office of the Chief Statistician, the Scottish Government. Scottish Index of Multiple Deprivation: 2009 General report. 2009. Available from: http://www.scotland.gov.uk/Resource/Doc/933/0115249.pdf.
- 41.Hillary RF, Trejo-Banos D, Kousathanas A, McCartney DL, Harris SE, Stevenson AJ, et al. Multi-method genome- and epigenome-wide studies of inflammatory protein levels in healthy older adults. Genome Med. 2020;12(1):60. doi: 10.1186/s13073-020-00754-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Gadd DA, Hillary RF, McCartney DL, Zaghlool SB, Stevenson AJ, Nangle C, et al. Epigenetic scores for the circulating proteome as tools for disease prediction. 2021. 10.1101/2020.12.01.404681 Available from: https://www.biorxiv.org/content/10.1101/2020.12.01.404681v3. [DOI] [PMC free article] [PubMed]
- 43.Wardlaw JM, Bastin ME, Valdés Hernández MC, Maniega SM, Royle NA, Morris Z, Clayden JD, Sandeman EM, Eadie E, Murray C, Starr JM, Deary IJ. Brain aging, cognition in youth and old age and vascular disease in the Lothian Birth Cohort 1936: rationale, design and methodology of the imaging protocol. Int J Stroke Off J Int Stroke Soc. 2011;6(6):547–559. doi: 10.1111/j.1747-4949.2011.00683.x. [DOI] [PubMed] [Google Scholar]
- 44.Hernández M del CV, Ferguson KJ, Chappell FM, Wardlaw JM. New multispectral MRI data fusion technique for white matter lesion segmentation: method and comparison with thresholding in FLAIR images. Eur Radiol. 2010;20(7):1684–1691. doi: 10.1007/s00330-010-1718-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Clayden JD, Maniega SM, Storkey AJ, King MD, Bastin ME, Clark CA. TractoR: magnetic resonance imaging and tractography with R. J Stat Softw. 2011;44(8):1–18. doi: 10.18637/jss.v044.i08. [DOI] [Google Scholar]
- 46.Muñoz Maniega S, Bastin ME, Deary IJ, Wardlaw JM, Clayden JD. Reference tracts and generative models for brain white matter tractography. J Imaging. 2018;4:8. doi: 10.3390/jimaging4010008. [DOI] [Google Scholar]
- 47.Behrens TEJ, Berg HJ, Jbabdi S, Rushworth MFS, Woolrich MW. Probabilistic diffusion tractography with multiple fibre orientations: what can we gain? NeuroImage. 2007;34(1):144–155. doi: 10.1016/j.neuroimage.2006.09.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.McCartney DL, Hillary RF, Conole EL, Trejo Banos D, Gadd DA, Walker RM, et al. Blood-based epigenome-wide analyses of cognitive abilities - analysis code. Zenodo. 2021. 10.5281/zenodo.5794175. [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
According to the terms of consent for GS participants, access to data must be reviewed by the GS Access Committee. Applications should be made to access@generationscotland.org.
Lothian Birth Cohort data are available on request from the Lothian Birth Cohort Study, University of Edinburgh (simon.cox@ed.ac.uk). Lothian Birth Cohort data are not publicly available due to them containing information that could compromise participant consent and confidentiality.
All code is available with open access at the following GitHub repository https://github.com/marioni-group/ewas_of_cognitive_function (Zenodo 10.5281/zenodo.5794175 [48]).