Metabolomic sweet spot clock predicts mortality and age-related diseases in the Canadian Longitudinal Study on Aging

Olga Vishnyakova; Joosung Min; Stephen Leach; Xiaowei Song; Kenneth Rockwood; Angela Brooks-Wilson; Lloyd T Elliott

doi:10.1038/s43856-026-01375-2

. 2026 Jan 10;6:110. doi: 10.1038/s43856-026-01375-2

Metabolomic sweet spot clock predicts mortality and age-related diseases in the Canadian Longitudinal Study on Aging

Olga Vishnyakova ^1,^2,³, Joosung Min ³, Stephen Leach ¹, Xiaowei Song ^2,⁴, Kenneth Rockwood ^5,⁶, Angela Brooks-Wilson ^1,^2,^✉,^#, Lloyd T Elliott ^3,^✉,^#

PMCID: PMC12901296 PMID: 41520049

Abstract

Background

Chronological age does not capture individual health or resilience. Advances in metabolomics have enabled development of molecular aging biomarkers that capture deviations between biological and chronological age, highlighting how genetics, environment, and lifestyle shape biological aging. Despite their promise, metabolomic biomarkers face challenges such as interpretability, non-linearity, and reproducibility.

Methods

We have developed a metabolomic predictor of biological age based on untargeted metabolomic profiling of individuals aged 45–85 years from the Canadian Longitudinal Study on Aging. To enhance interpretability, we first identified metabolites related to health based on variance heterogeneity. For metabolites with identifiable optimal levels, or “sweet spots”, we modeled non-linearity using deviations from these values. A penalized regression model was trained on the Frailty Index using sweet spot deviations as predictors.

Results

Here we show that the Sweet Spot Clock built on 178 health-related metabolites is strongly associated with all-cause mortality (HR = 1.08, p = 5.8×10⁻¹², C-index=0.841) and age-related diseases. The biomarker outperforms models trained on chronological age and those using raw metabolite levels, underscoring the importance of modeling non-linearity. It remains predictive after adjusting for age, sex, lifestyle and socioeconomic factors, though its added value over standard health and demographic measures is modest. The model generalizes to an independent cohort of individuals aged 85 years and older.

Conclusions

The Sweet Spot Clock provides a reproducible and interpretable measure of biological age. By modeling deviations from optimal metabolite levels and training on health status rather than age, it offers a tool for understanding aging heterogeneity and identifying individuals at risk of health decline.

Subject terms: Predictive markers, Computational biology and bioinformatics

Vishnyakova et al. develop a metabolomic aging biomarker based on optimal metabolite levels, or “sweet spots.” They show that this biomarker predicts mortality and the onset of age-related diseases.

Plain Language Summary

Chronological age does not fully reflect a person’s health or resilience. We used data from Canadians aged 45–85 to develop a biomarker of biological age based on metabolites—small molecules in the blood that reflect body processes. We focused on 178 health-related metabolites and identified “sweet spots,” or optimal levels, for 74 of them. Our model, the Sweet Spot Clock, was strongly related to mortality and age-related diseases. These findings held up in an independent group of Canadians aged 85 and older. By focusing on health status and accounting for non-linear patterns, our approach offers a reproducible and interpretable way to measure biological aging and understand why some people age more healthfully than others.

Introduction

Aging is a universal process experienced uniquely by individuals. While chronological age is a strong proxy for biological age, it does not fully reflect individual variation in physiological and functional status. Biological age provides additional insight into health and the pace of aging among same-age individuals¹. Although chronological age is a major risk factor for age-related diseases like type 2 diabetes, cardiovascular disease, and cancer², it does not consistently predict disease susceptibility, treatment response, or variability in aging trajectories. People of the same chronological age may show diverse aging phenotypes shaped by environmental exposures, lifestyle, psychological conditions, and genetic factors^3–5. Discrepancies between chronological and biological age have led to a challenging search for reliable aging biomarkers—quantitative individual-level measures that predict biological age⁶ and capture the variability in the timing of disease onset, functional decline, and mortality between individuals. Such biomarkers can act as predictors of aging, reflect the individual level of molecular and cellular damage, and provide valuable insights into what drives heterogeneity in aging.

To operationalize accurate aging biomarkers, numerous -omics clocks have been developed over the past decade^7–9, leveraging advancements in technology and longitudinal -omics data (e.g., epigenomics, metabolomics, proteomics, transcriptomics, immuneomics, glycomics, and the gut microbiome)^10–21. These clocks typically target age deviation as a biomarker—the difference between predicted biological age and chronological age. Their weak inter-correlations indicate that they capture distinct facets of biological aging at various cellular levels, consistent with studies showing that metabolomic and epigenetic clocks often reflect complementary and non-overlapping biological processes²². Omics-based aging predictors are increasingly applied in research and clinical settings⁶. However, many models remain limited by the assumption of linearity^23,24, poor interpretability, and high dimensionality, often relying on numerous predictors with unclear relevance to aging heterogeneity²⁵. While non-linear methods such as neural networks are used to capture complex associations with aging outcomes, their black-box nature limits mechanistic insight²¹. There is a need for biomarkers that are interpretable, model non-linearity effectively, and are grounded in biologically relevant predictors. Metabolomics offers potential in this context, as metabolites reflect integrated signals from endogenous processes and external exposures, including diet, lifestyle, and acute stress^26,27. Though promising for identifying systemic markers of aging, challenges such as data reproducibility due to batch effects must be addressed²⁸.

In our previous work, we demonstrated that differences in variance between the least and the most healthy individuals at a given age can reveal phenotypes under homeostatic control, which we proposed to be closely related to health²⁹. By narrowing the focus to metabolites potentially relevant to health, we can better target measurements that explain aging heterogeneity, reduce model complexity, minimize data noise, and enhance the interpretability of metabolomic clocks⁶. Moreover, our earlier findings showed that many phenotypes under homeostatic control demonstrate complex, non-linear associations with health, a pattern also noted in recent -omics studies³⁰. Consequently, non-linear modeling or feature transformation techniques are necessary to address this complexity.

Here, we investigate the molecular bases of heterogeneity in aging by leveraging untargeted metabolomic profiling data from individuals in the Canadian Longitudinal Study on Aging³¹. We identify metabolites related to health and healthy aging and estimate their optimal levels, or “sweet spots”. We construct a metabolomic aging biomarker that explicitly models deviations from these optimal values. The resulting Sweet Spot Clock is predictive of mortality and the onset of age-related diseases and provides a reproducible and interpretable measure of biological aging that replicates in an independent cohort of older adults.

Methods

Data

We examine data from the Canadian Longitudinal Study on Aging (CLSA)³¹. These data consist of a stratified sample of Canadian males and females aged 45–85 years. Community-dwelling participants were recruited by CLSA from the ten Canadian provinces, excluding individuals unable to respond in English or French; residents of the three Canadian territories and some remote regions; individuals living on First Nation reserves and other First Nations settlements in the provinces or in nursing homes; full-time members of the armed forces; and individuals with significant cognitive impairment at recruitment. The subsample of participants selected for the Comprehensive Cohort (COM) of the CLSA underwent detailed physical assessments and provided blood and urine samples. Baseline data were collected between May 2012 and July 2015. The average follow-up period was six years, defined as the time between baseline and Follow-up 2 data collection. The onset of age-related diseases after baseline was determined using data from both Follow-up 1 and Follow-up 2 assessments. Metabolomic profiling was done for 9992 (51.5% females) out of 27,170 participants who provided blood samples at baseline. All participants of CLSA provided informed consent prior to data collection. The present study was approved by the Joint Research Ethics Board of the BC Cancer and the University of British Columbia, and harmonized with Simon Fraser University (#H22-01012).

Genomic data

Genotyping was done by CLSA for 26,622 participants using the Affymetrix UK Biobank Axiom array, with phasing and imputation using TOPMed reference panel v.r2³². CLSA performed marker-based and sample-based quality control³³. Variants were examined for discordant genotype frequency between batches, departure from HWE, discordance across control replicates, and sex genotype frequency discordance. We removed low-quality imputed genetic variants: with a minor allele frequency lower than 0.1%, an imputation quality score < 0.5, or a missing rate > 0.1. The remaining 11.7 million variants were used for GWAS. Participants were examined for relatedness, sex discordance, outliers in heterozygosity, and missing rates. Genetic ancestries were derived using principal component analysis and used to account for population structure. Sex was defined using genetically inferred sex chromosome composition as determined by CLSA³⁴. Samples with discordance between genetically inferred sex and self-reported sex were excluded from analysis.

Untargeted metabolomic data and quality control

CLSA metabolomic data includes levels of 1458 metabolites quantified in EDTA plasma samples by Metabolon using ultrahigh performance liquid chromatography-tandem mass spectroscopy (UPLC–MS/MS). A description of quality control and normalization is detailed elsewhere (https://www.clsa-elcv.ca/wp-content/uploads/2024/01/CLSA_DataSupportDoc_Metabolomics_v2.0_2023Aug03.pdf).

From an initial cohort of 9992 participants with metabolomic data, we selected a maximal subset of unrelated individuals without third-degree or closer relatives³⁵. We further excluded individuals who did not pass a sample-based quality control performed by CLSA. Specifically, we excluded samples with discrepancies between self-reported and genetic sex, as well as those exhibiting outlier values for missing genotype (>0.05%) data and heterozygosity rates (>0.215). The cohort used for analysis comprised 8887 individuals with European ancestry, 111 individuals with Asian ancestry, and 63 individuals with Black ancestry, as previously categorized by CLSA. We omitted analytes with a missing data rate exceeding 20%, resulting in 1041 retained measures. Focusing on homeostatic mechanisms, the study was confined to endogenous metabolites, excluding xenobiotics, which were identified through annotations provided to CLSA by Metabolon Inc. The final analytical dataset comprised 888 metabolites. Metabolite levels were log-transformed and scaled to ensure a mean of zero and standard deviation of one. To control for confounders, correlation analyses were performed between the metabolite levels and external factors, leading to adjustments in metabolite levels based on hours since the last meal or drink and dates of blood drawn. We applied an inverse rank-normal transformation to the metabolite ratios. Furthermore, we evaluated the metabolites for bimodality by ranking them using the bimodality index, where a bimodality index (BI) greater than 1.1 indicates potential bimodality, potentially affecting the reliability of the results³⁶. The metabolites used for analysis, and their bimodality indices, are listed in Supplementary Data 1.

Replication cohort

Data from the Super Seniors Study was used as an independent replication cohort³⁷. This study was approved by the joint Research Ethics Board of BC Cancer and the University of British Columbia, harmonized with Simon Fraser University (#H20-03117-A002). All participants gave written informed consent. Participants were determined to be Super Seniors if they met both of the following inclusion criteria: (i) 85 years or older at the date of recruitment, and (ii) self-reporting as never having had cancer (except non-melanoma skin cancer), cardiovascular disease, diabetes, dementia, or major pulmonary disease (except asthma). Controls were age-matched individuals who did not meet the inclusion criteria. Non-fasting blood samples were collected from all participants. The dataset comprised 548 Super Seniors and 119 age-matched controls. Self-reported sex was confirmed with genetically inferred sex chromosome composition, and genetic ancestry was inferred using principal component analysis.

EDTA plasma samples were sent to Metabolon for untargeted metabolomic profiling using Ultrahigh Performance Liquid Chromatography-Tandem Mass Spectroscopy (UPLC–MS/MS), yielding a total of 1679 biochemicals, including 1371 compounds of known identity. Raw data was extracted, peak-identified, and processed using Metabolon’s hardware and software. Peaks were quantified using area-under-the-curve. For each metabolite, the raw values in the experimental samples were divided by the median of those samples in each instrument batch, giving each batch and thus the metabolite a median of one. These data were also normalized to the QC samples for each batch, similar to CLSA (www.clsa-elcv.ca/wp-content/uploads/2024/01/CLSA_DataSupportDoc_Metabolomics_v2.0_2023Aug03.pdf). Consistent with CLSA preprocessing, we performed minimum value imputation, in which the minimum value detected for a given metabolite is used to replace the missing values for that same metabolite, followed by natural-log transformation and autoscaling.

Instruments for health assessment

We derived five composite instruments to quantify health status, each standardized to a unit interval where higher scores reflect greater health burden. The first instrument (I) is a deficit accumulation Frailty Index (FI), calculated as the proportion of 51 health deficits present. The second (II) captures the frequency of five major diseases—cancer (excluding non-melanoma skin cancer), cardiovascular disease, chronic pulmonary disease, dementia, and diabetes—normalized by individual-level burden. The third (III) reflects the count of chronic conditions not included in the major disease category. The fourth (IV) assesses cognitive function using the average of three domain-specific scores, each rank-normalized from CLSA cognitive tests. The fifth (V) evaluates physical function based on rank-normalized scores from five physical performance assessments. Details on phenotype definitions and scoring procedures are provided in Supplementary Table 1. The construction and methodologies used for these instruments are described elsewhere²⁹. We assigned health levels according to quartiles: healthiest, good health, fair health, and least healthy. Quartile-based cut-offs were applied to categorize participants by health levels regardless of age (Supplementary Table 2).

Statistics and reproducibility

The study workflow is summarized in Fig. 1a, b and detailed in Supplementary Fig. 1. Primary analyses focused on the largest ancestral subset (European ancestry), representing 98% of the cohort. To perform inference and evaluate the performance of the molecular biomarkers, we randomly partitioned the participants with European ancestry into a training set (85%) and a validation set (15%), resulting in sample sizes of 7579 and 1308, respectively. Sample sizes by sex and age group for the training and test sets are shown in Supplementary Table 3. The training set was used to subset metabolites, identify their optimal levels, and develop a model for estimating metabolomic age deviation. We stratified our analysis by sex, rather than adjusting for it, due to the substantial differences in metabolite concentrations and metabolic pathways observed between sexes³⁸. Health scores and metabolite levels were adjusted for age.

Identifying health-related metabolites

We selected metabolites that exhibited a variance effect on health, meaning that the variance among the healthiest group is significantly lower than among the least healthy individuals. To perform this selection, we conducted pairwise tests of equality of variance between the metabolite levels of the healthiest and least healthy groups using the Brown–Forsythe (BF) test for heteroskedasticity³⁹.

Let $x \in R^{N}$ be the metabolite level across $N$ samples, $p$ be a number of health groups, and $z_{i j} = ∣x_{i j} - {\tilde{x}}_{j}∣$ , where ${\tilde{x}}_{j}$ is the median of group j. The BF test statistic is the model F-statistic:

F = \frac{(N - p)}{(p - 1)} \frac{\sum_{j = 1}^{p} n_{j} {({\tilde{z}}_{\cdot j} - {\tilde{z}}_{\cdot \cdot})}^{2}}{\sum_{j = 1}^{p} \sum_{i = 1}^{n_{j}} {(z_{i j} - {\tilde{z}}_{\cdot j})}^{2}}

where $n_{j}$ is the number of observations in group j, ${\tilde{z}}_{\cdot j}$ are the group means of the $z_{i j}$ , and ${\tilde{z}}_{\cdot \cdot}$ is the overall mean of the $z_{i j}$ . This F-statistic follows the F-distribution $F_{p - 1, N - p}$ .

Analysis was stratified by sex. Metabolites were determined to be related to health and healthy aging if (1) the healthiest and least healthy groups had significantly different variance for at least one sex, and (2) the healthiest group had the lower variance. Bonferroni-adjustment was used to account for multiple testing (Bonferroni correction factor N = 5 instruments × 2 sexes × 888 metabolites = 8880).

Defining metabolite optimal levels

We employed piecewise regression analysis (also known as segmented regression) to identify breakpoints indicating shifts in the impact on health deficit scores⁴⁰. We defined the breakpoint as a single free parameter for which a continuous metabolite level modulates the mean response of a health deficit score through two linear segments joined at this breakpoint. For metabolite values $X$ and the response health score $Y$ , the piecewise regression equation for the expected mean $E (Y, ∣, x, w) = μ$ can be formulated as $μ_{i} = w_{i}^{T} γ + β (x_{i} - ψ) 1 (x_{i} > ψ)$ , where $i = 1, 2, . . n, ψ \in R_{X}$ is a breakpoint, $1 (\cdot) = 1$ when its argument is true, and $w_{i}^{T} γ$ includes additional linear terms, such as the model intercept, and possibly other covariates. The null hypothesis is $H_{0} : β = 0$ , where $β$ is the difference in slopes. The test statistic is a score-based statistic that follows the Normal distribution⁴¹.

We identified optimal values, i.e., breakpoints under conditions where: (1) the slopes of the linear segments exhibit opposite signs; (2) the 95% confidence intervals for both slopes exclude zero; and (3) the disparity in slope values is statistically significant, as confirmed by a two-sided score test⁴². We assumed that the breakpoints remained constant across age groups but varied by sex. To minimize age-related confounding, we adjusted health deficit scores for age prior to analysis and used the residual values for the piecewise regression. We conducted separate analyses for male and female sex to derive sex-specific estimates of the breakpoints. We used Bonferroni correction on the p-values of the piecewise regression to account for multiple comparisons (N = 5 instruments × 2 sexes × M metabolites), and deemed adjusted p-values below 0.05 to be statistically significant.

Distances from sweet spots

For each individual, the Euclidean distance (distances from sweet spots) was computed between the measured metabolite levels fitted to the piecewise regression model and the optimal metabolomic levels. This method quantifies the deviation of actual metabolite concentrations from their optimal levels and reflects homeostatic dysregulation.

Developing metabolomic clocks

Following the idea that biomarkers of aging have to reflect not only inter-individual differences in mortality risk, but also the onset of age-related disease, we trained a penalized regression model to predict health status. We trained the sex-specific ElasticNet models to predict FI, serving as a surrogate endpoint for mortality, using health-related metabolites as predictors. For measurements for which we were able to estimate optimal levels, distances from sweet spots (DSSs) were used as a transformation of metabolite levels. We refer to this model as the Sweet Spot Clock (Fig. 1c). To benchmark our approach against FI and established metabolomic clocks, we constructed comparator models that reflect key elements of prior frameworks trained on age¹⁷ or untransformed metabolite levels¹⁹, though these models were developed on targeted NMR platforms and focused on age or mortality rather than health decline. Specifically, we trained the following four ElasticNet models:

SweetSpotClock. Health-related metabolites with DSS-transformed levels were used as predictors. The model was trained to predict FI.
Baseline. This model reflects an established approach to constructing -omic clocks. Health-related metabolites were used as predictors, with metabolite levels transformed into distances from their estimated optimal values. The model was trained to predict chronological age.
ControlAge: health-related metabolites with DSS-transformed levels were used as predictors. The model was trained to predict chronological age. The model was trained to show the effect of training on FI, instead of chronological age, on the predictive ability of all-cause mortality.
ControlMb: health-related metabolites without transformation levels were used as predictors. The model was trained to predict FI. The model was trained to show the effect of DSS transformation on the predictive ability of all-cause mortality.

We applied 10-fold cross-validation to optimize the ElasticNet parameters on the training dataset. Post-training, we calibrated each model output to age units by rescaling predictions to match the mean and standard deviation of chronological age. We refer to these model outputs as metabolomic age. We then calculated metabolomic age deviation (MAD) for each participant by regressing biological age against chronological age and taking the residual. The resulting age predictors (MADs) were then compared based on their association with mortality in the test set. ElasticNet feature importance was determined using the caret R package⁴³.

Metabolic pathway enrichment analysis

We conducted pathway enrichment analysis to identify subpathways that were significantly enriched among the health-related metabolites. First, the significance ratio was calculated for each subpathway as the proportion of significant metabolites within the subpathway to the total number of measurements in that subpathway. The Fisher exact test was then used to determine significant enrichment⁴⁴, with Benjamini–Hochberg correction for multiple comparisons, controlling the false discovery rate at a significance level of 0.05.

Survival analysis

We assessed associations between a biomarker and all-cause mortality using Cox proportional hazards models. All models were adjusted for sex and chronological age to account for potential confounders. Given the evidence that hazard ratio (HR) estimates are influenced by the scale and distribution of mortality predictors⁴⁵, we did not perform direct comparisons of HR estimates. Instead, we relied on Wald’s test p-values for a scale-independent evaluation of the models and the concordance index (C-index). The predictive ability of biomarkers was assessed based on their performance on the test set.

Heritability estimate

To determine what portion of biomarker variability might be explained by the genetic component, we performed a GWAS on MAD. We used SAIGE⁴⁶ to obtain single-variant summary statistics, adjusting linear regressions for age, sex, and the first ten genetic principal components. We used LDSC (v1.0.1)⁴⁷ to estimate SNP-based heritability of MAD in CLSA COM using the LD scores from the 1000 Genomes Project phase 3. We did not include the X chromosome in the LDSC computation due to difficulties in assessing linkage disequilibrium in the X chromosome.

Association analysis

In CLSA, nutritional assessment was performed by the AB SCREEN II⁴⁸, and physical activity level was assessed by the Physical Activity Scale for the Elderly⁴⁹. Psychological distress was measured using the Kessler Psychological Distress Scale⁵⁰. Other factors were defined as follows: alcohol consumption (never, light/moderate, heavy drinker), household income (CDN < $20,000, $20,000–$50,000, $50,000–<$100,000, $100,000–$150,000, >$150,000), education level (<secondary, secondary, post-secondary diploma/certification, or university degree), smoking status (never, occasional, current smoker, current heavy smoker). For more details on the CLSA protocol, see the CLSA Cohort profile³¹.

Comparison to established epigenetic clocks

Epigenetic age was assessed using two first-generation epigenetic clocks: the Hannum clock (based on 71 CpG sites¹⁵), and the Horvath Pan-Tissue clock (based on 353 CpG sites¹⁴). These biomarkers targeted chronological age during the training process to estimate biological age. Both of these epigenetic clocks are included in the CLSA COM dataset. We also calculated PhenoAge and GrimAge using the Biolearn Python library⁵¹. While epigenetic data were available for 1307 CLSA COM samples, only 161 were included in the validation set. To increase accuracy, we used the full dataset (both test and training sets) to estimate the strength of correlation between the Sweet Spot Clock and the epigenetic clocks. This approach was feasible because epigenetic data were not utilized in the training process.

Results

Health assessment and participants characteristics

To identify health-related metabolites, we first assessed the health status of all participants. A total of 9061 individuals aged 45–86 years (51% females) from the CLSA Comprehensive cohort (CLSA COM) were included in the analysis of 1458 plasma metabolites. Samples were collected by CLSA during the period from May 2012 to July 2015, with a mean tracking period of 6 years. The participants’ mean body mass index was 28.1 (95% CI: 22.6–33.6) kg/m², and their mean fasting time before blood draw was 10 (95% CI: 6.5−13.5) h. Out of a total of 9061 individuals, 8887 were of European ancestry, 111 of Asian ancestry, and 63 of Black ancestry. We conducted a primary analysis on the ancestry that was most numerous in the sample (European) and replicated our findings within the ancestries that were less numerous (Asian, Black).

To assign health scores to each participant, we developed five health instruments to measure health decline in older adults, as detailed in the “Methods” section: I - the Frailty Index (FI); II - the presence of five major diseases: cancer, cardiovascular disease, major pulmonary disease, dementia, and diabetes; III - the count of other chronic conditions; IV - a composite cognitive score; and V - physical function. All of these scores ranged from 0 (healthiest) to 1 (least healthy). The statistics and correlations between health scores are described in Supplementary Fig. 2.

Twenty percent of metabolites were identified as related to health

A total of 888 metabolites (60% of a total of 1458 plasma metabolites) were examined for their relevance to health after excluding xenobiotics and those with more than 20% missing values (Supplementary Data 1). Among these, 686 metabolites (77%) with known identities were categorized into eight superpathways: lipid, amino acid, nucleotide, cofactor and vitamins, carbohydrate, peptide, partially characterized molecules, and energy.

We hypothesize that variance is lower in the healthiest group for metabolites related to health. Therefore, we tested each metabolite for variance heterogeneity between the most and the least healthy groups. Samples were assigned to these two groups based on health scores quantiles (“Methods” section). Of 888 metabolites tested, 178 (20%) showed statistically significant differences in variance at least for one health instrument and sex after Bonferroni correction for 8880 tests (Supplementary Data 2); 137 of them had known identity. These measurements were selected as predictors of metabolomic age.

Among the 178 metabolites with detected sweet spots, 77 measurements (43%) were significant across male and female sexes (Fig. 2a). Metabolites involved in pentose, glycine, serine, and threonine, and sterol metabolism showed differences in variance only in males. In contrast, metabolites from the pregnenolone and TCA Cycle pathways showed a variance effect on health exclusively in females. Analysis by superpathway revealed that amino acid, carbohydrate, and lipid superpathways harbored the most significantly heteroscedastic metabolites. The top markers that showed the strongest variance differences in these three superpathways were N6,N6,N6-trimethyllysine (p = 1.2 × 10⁻²⁷), glucose (p = 1.2 × 10⁻²⁰), and 1-stearoyl-2-linoleoyl-GPI (18:0/18:2) (p = 1.1 × 10⁻¹¹), respectively (Supplementary Data 3).

We tested metabolites for their relevance to five different health instruments. Out of the 178 selected measurements, 137 (80%) showed significant variance differences between the most and least frail older adults. In contrast, only 7 metabolites showed variance heterogeneity between the groups, highest and lowest in cognitive function, e.g., 1,5-anhydroglucitol (1,5-AG) (p = 0.009). Detailed information on metabolites related to each instrument and those shared between instruments is presented in Fig. 2b.

Only the androgenic steroids subpathway exhibited a statistically significant enrichment for health-related metabolites, with FDR-adjusted p-value 6.8 × 10⁻⁷ (n = 91). Other subpathways with notable significance ratios, calculated as a proportion of significant metabolites within the subpathway to the total number of measurements in that subpathway (see “Methods” section), included ascorbate and aldarate metabolism, and corticosteroids. The top significant pathways determined by the significance ratio are shown in Fig. 2c.

Most sweet spot estimates differ from population means

We identified optimal metabolite levels as breakpoints, for which values above or below these thresholds were linked to poorer health. Using piecewise regression (see “Methods” section), we determined optimal levels for 74 (42%) of the 178 selected metabolites (Supplementary Data 4) after Bonferroni-adjustment for 1780 tests. Among these 74 measurements, we detected an optimal level for 47 metabolites in older females, with the most significant effect change for gulonate (breakpoint estimate = −0.02, slopes = [−0.01, 0.04], p = 1.7 × 10⁻³⁴). In older males, we were able to estimate optimal levels for 59 metabolites; the most significant effect change was for N6,N6,N6-trimethyllysine (breakpoint estimate = −0.31, slopes = [−0.11, 0.01], p = 1.7 × 10⁻³⁴).

The different health instruments provided varying estimates for the breakpoint positions. However, the overlap of the 95% confidence intervals suggests a common range of estimates among the instruments. We selected the breakpoint estimate with the narrowest 95% confidence interval (CI) as the optimal metabolite value, i.e. sweet spot (Fig. 3). Similar to the variance analysis, a piecewise regression on the Frailty Index (I) produced a sweet spot estimate for 59 out of 74 measurements (80%), compared to only 9 out of 74 measurements (12%) for cognitive function. The 95% CIs of the optimal values overlapped between sexes for only 12 metabolites (20%), including metabolonic lactone sulfate and gulonate. Further analysis revealed that for 41 metabolites (55%), the 95% CI for the optimal value did not include zero, which is the CLSA population mean.

The forest plot represents estimated optimal levels for 74 metabolites. Optimal levels and their 95% confidence intervals (CIs) were estimated using piecewise regression. The central point of each error bar corresponds to the regression-estimated optimal level, and the bars indicate its 95% CIs. The narrowest CI for each metabolite, across all health instruments, is displayed. The heatmap represents −log10(p-values) associated with the strength of the effect of metabolite levels on health, as determined by five instruments. P-values from the two-sided score test were Bonferroni-adjusted (n = 1780). Instruments: I - the Frailty Index; II - the number of five diseases: cancer (except non-melanoma skin cancer), cardiovascular disease, major pulmonary disease, dementia, and diabetes; III - the number of other chronic conditions; IV - cognitive function; V - physical function. The source data for Fig. 3 is in Supplementary Data 4.

For each metabolite’s sweet spot, we calculated the distance between the optimal level and the observed metabolite measurement for each individual. We also investigated the correlation between distances from metabolite sweet spots and chronological age, along with health instruments (Supplementary Fig. 3). Among measurements with known identity, hydroxyasparagine** (Pearson r² = 0.48, p = 1.6 × 10⁻²¹²) and dehydroepiandrosterone sulfate (DHEA-S) (Pearson r² = 0.38, p = 5.7 × 10⁻¹⁶⁰) showed the strongest correlation between age and deviation from the metabolite’s sweet spot.

Sweet spot distance transformation enhances the mortality association of the metabolomic aging biomarker

To estimate participants’ metabolomic age, we trained the Sweet Spot Clock with a penalized regression model on the training set, using 178 identified health-related metabolites as predictors. For a subset of these, we used distances from estimated optimal values instead of the raw levels (“Methods” section). The relationship between chronological age and predicted metabolomic age, the model prediction transformed into years, is illustrated in Fig. 4a. To be consistent with other -omics-based biomarkers, metabolomic age was regressed onto age to calculate metabolomic age deviation (MAD). The wide distribution of MAD (Fig. 4b) suggests substantial inter-individual variation and a rightward shift among women relative to men. We evaluated the performance of the proposed biomarker on the test set by comparing it to a null, a baseline, and control models based on discrimination accuracy, measured by the concordance index (C-index), and the strength of the association with 6-year all-cause mortality, both derived from Cox proportional hazards models.

a Chronological age versus metabolomic age. Each dot represents a CLSA COM participant, with darker meaning >1 individual. R indicates Pearson correlation between variables; p-value is for a two-sided t-test. Values above the regression line (dashed black line) indicate faster agers; below the line—slower agers. b Distribution of metabolomic age deviation (in years). c 6-year survival proportions by metabolomic age deviation (MAD) quartiles. The Cox proportional hazard model was adjusted for age and sex. P-value was estimated by a two-sided Wald test. d Hazard ratios for various adverse outcomes derived from Cox hazard regression models adjusted for sex and age (mortality, n = 1285; diabetes, n = 942; COPD, n = 1063; stroke, n = 1108; kidney disease, n = 1096; cancer, n = 962; dementia, n = 1126). The covariate in the model is metabolomic age deviation. The central point of each error bar corresponds to the estimated hazard ratio, with horizontal lines indicating the 95% confidence interval. P-values for two-sided Wald’s tests are adjusted for seven tests using the Bonferroni correction. e Genome-wide association study of metabolomic age deviation (genomic control λ = 1.049). Association statistics were obtained from two-sided Wald tests. Gray horizontal line indicates genome-wide significance level of P < 5E-8. A single genomic region was significantly associated with the aging biomarker at the genome-wide significance level (rs11809159, p-value = 1.94 × 10⁻⁹). The source data for Fig. 4 is in Supplementary Data 6 and 7.

The Sweet Spot Clock, trained on FI using distances from metabolite sweet spots, showed the strongest association with all-cause mortality and the highest discrimination (Wald statistic = 6.9, p = 5.8 × 10⁻¹², C-index = 0.841). The baseline model (Baseline row in Table 2), an established metabolomic clock¹⁷, showed lower discrimination (C-index = 0.821) and a weaker association with all-cause mortality (Wald statistic = 4.7, p = 2.2 × 10⁻⁶); this model was trained to predict chronological age from measured metabolite levels. The null model, with only age and sex as predictors, showed the lowest discrimination (C-index = 0.809). Next, we investigated the contribution of two factors that resulted in improved performance compared to the baseline model: (1) training on FI instead of age and (2) transforming predictors by using distances from sweet spots instead of raw metabolite levels. To disentangle the contributions of each factor, we trained two additional models, each with one factor fixed: ControlAge (fixed target) and ControlMb (fixed predictors). Our results indicate that both targeting FI (Wald statistic = 6.7, p = 2.7 × 10⁻¹¹, C-index = 0.839) and using distances from sweet spots (Wald statistic = 5.0, p = 4.5 × 10⁻⁷, C-index = 0.830) contributed to the overall improvement (Table 1). The correlations between metabolomic age derived from each model and the training targets (FI and age) are shown in Supplementary Fig. 4.

Table 2.

The associations between metabolomic age deviations (MAD) and age-related diseases, as determined by Cox proportional hazards models

Outcome	Model*	C-Index	HR	95% CI	p-value**
Mortality	MAD	0.841	1.08	[1.06–1.10]	8.1 × 10⁻¹¹
N = 1285, N events = 9	Null	0.809	-***	-	-
	MAD Extended	0.863	1.09	[1.06–1.13]	2.2 × 10⁻⁷
	Null Extended	0.851	-	-	-
Diabetes	MAD	0.654	1.05	[1.04–1.07]	1.8 × 10⁻¹³
N = 942, N events = 236	Null	0.594	-	-	-
	MAD Extended	0.676	1.04	[1.03–1.06]	1.8 × 10⁻⁶
	Null Extended	0.656	-	-	-
COPD	MAD	0.692	1.05	[1.03–1.07]	6.2 × 10⁻⁷
N = 1063, N events = 108	Null	0.655	-	-	-
	MAD Extended	0.757	1.05	[1.02–1.06]	0.001
	Null Extended	0.746	-	-	-
Stroke	MAD	0.712	1.07	[1.04–1.11]	4.2 × 10⁻⁴
N = 1108, N events = 30	Null	0.649	-	-	-
	MAD Extended	0.723	1.07	[1.03–1.11]	0.02
	Null Extended	0.673	-	-	-
Kidney disease	MAD	0.641	1.05	[1.02–1.08]	0.04
N = 1096, N events = 49	Null	0.594	-	-	-
	MAD Extended	0.652	1.05	[1.01–1.08]	0.07
	Null Extended	0.618	-	-	-
Cancer	MAD	0.642	1.00	[0.98–1.02]	1
N = 962, N events = 194	Null	0.642	-	-	-
	MAD Extended	0.656	1.00	[0.98–1.02]	1
	Null Extended	0.655	-	-	-
Dementia or Alzheimer’s	MAD	0.908	0.92	[0.80–1.06]	1
disease	Null	0.896	-	-	-
N = 1126, N events = 5	MAD_Extended	0.955	0.94	[0.79–1.11]	1
	Null_Extended	0.954	-	-	-

Open in a new tab

* Each model was adjusted for sex and age. Hazard ratios (HRs) of MAD corresponding to each model are shown alongside their 95% confidence intervals (CIs). MAD are the residuals of the Sweet Spot Clock, calibrated into units of age, and then regressed onto age. The null model had age and sex as predictors. Extended models were additionally adjusted for body mass index, alcohol consumption, smoking status, physical activity level, and level of education.

** The p-values were obtained from two-sided Wald’s tests and Bonferroni-adjusted (n = 14).

*** HRs shown in this table correspond to the metabolomic biomarker only, as they quantify its added predictive value beyond the covariates. The Null and Extended Null models do not include the biomarker and therefore do not provide an HR. HRs for all covariates in all models are reported in Supplementary Data 6.

Table 1.

The associations between metabolomic age deviations (MAD) and all-cause mortality

Model*	Target	DSS	C-Index	HR	95% CI	p-value**	AUC***	Accuracy
Sweet Spot Clock	FI	+	0.841	1.08	[1.06−1.10]	5.8 × 10⁻¹²	0.824	0.951
ControlMb	FI	-	0.839	1.08	[1.06–1.11]	2.7 × 10⁻¹¹	0.822	0.950
ControlAge	Age	+	0.830	1.10	[1.06–1.14]	4.5 × 10⁻⁷	0.824	0.950
Baseline	Age	-	0.821	1.09	[1.05–1.13]	2.2 × 10⁻⁶	0.815	0.950
Frailty index	-	-	0.830	1.05	[1.02–1.07]	5.2 × 10⁻⁵	0.809	0.948
Null	-	-	0.809	-	-	-	0.795	0.949

Open in a new tab

*Association were determined by Cox proportional hazards models, on the test (1285 participants who had not withdrawn from the study by the second follow-up, with 69 death events). Each model has been adjusted for sex and age. MAD are the residuals of metabolomic clocks, calibrated into units of age, and then regressed onto age. Hazard ratios (HRs) of MAD corresponding to each model are shown alongside their 95% confidence intervals (CIs). The line corresponding to the proposed metabolomic predictor is highlighted in bold.

**The p-values were obtained from two-sided Wald tests.

***AUC and accuracy were estimated for logistic regressions on 6-year mortality status, adjusted for age and sex.

To complement the survival analyses, we also trained logistic regression models using 6-year mortality status (alive vs. dead) as a binary outcome. The ordering of biomarkers based on area-under-the-curve (AUC) and accuracy was consistent with the C-index rankings from Cox models (Table 1). We additionally compared AUCs between models using the DeLong test; although the Sweet Spot model had the highest AUC, the pairwise differences were not statistically significant (Supplementary Table 4).

Metabolomic age deviation is strongly associated with mortality and the onset of age-related diseases

Next, we evaluated the prognostic ability of the metabolomic biomarker. We tested for the association with the onset of age-related conditions: diabetes, chronic obstructive pulmonary disease (COPD), stroke, kidney disease, Alzheimer’s disease, and cancer. Besides mortality, MAD was associated with an increased risk of diabetes (HR = 1.05, p = 8.9×10⁻¹⁴, C-index = 0.654), COPD (HR = 1.05, p = 3.1 × 10⁻⁷, C-index = 0.692), stroke (HR = 1.07, p = 2.1 × 10⁻⁴, C-index = 0.712), and kidney disease (HR = 1.05, p = 0.02, C-index = 0.641) after Bonferroni correction for 7 tests (Fig. 4d). All models showed higher discrimination accuracy than corresponding null models (with only sex and age as covariates; Table 2).

To examine the extent to which MAD associates with mortality, independently from age, we divided participants into age deviation groups based on their MAD quartile. We then performed survival analysis and found that the group with a higher level of the MAD biomarker in a test set had a 79% higher risk of mortality over the 6-year period compared to the group with a lower age deviation (HR = 1.79; p = 8.7 × 10⁻⁶, C-index = 0.821). The 6-year survival proportions by MAD quartiles are shown in Fig. 4c. A complementary analysis using the 10th and 90th percentiles of MAD revealed an even stronger survival contrast (HR = 9.8; p = 6.5 × 10⁻⁵, C-index = 0.85) between health extremes (Supplementary Fig. 5).

To improve the robustness of our findings, we trained extended models, adjusting for additional covariates: body mass index (BMI), alcohol consumption, smoking status, physical activity level, and level of education (“Methods” section). MAD remained strongly associated with mortality, diabetes, COPD, and stroke. Similarly, all models showed higher discrimination accuracy than corresponding extended null models (Table 2).

Metabolomic age deviation is strongly associated with lifestyle factors

Next, we explored genetic and environmental components of MAD. We conducted a genome-wide association study (GWAS) on MAD, which identified a single genomic region significantly associated with the aging biomarker at the genome-wide significance level. The region is located on chromosome 1 (1p36.32), with a leading intergenic SNP, rs11809159, downstream of the AJAP1 gene (Fig. 4e). To assess the contribution of genetic factors to variability in MAD, we estimated SNP-based heritability using Linkage Disequilibrium Score Regression (LDSC). We found the SNP-heritability of MAD to be 0.23 (SE = 0.01). To investigate whether MAD and the Frailty Index share a common genetic component, we also performed a GWAS on the Frailty Index; however, all associations were below the genome-wide significance threshold.

We conducted an association analysis using univariate regression models on the test set to explore the relationship between MAD and health, lifestyle, and socioeconomic factors. Adjusted for sex and age, higher MAD was significantly associated with inflammatory biomarkers: High sensitivity C-reactive protein (β = 0.02, p = 4.7 × 10⁻¹⁴), Tumor necrosis factor-alpha (β = 0.04, p = 3.5 × 10⁻³⁴), and Interleukin-6 (β = 0.04, p = 2.2 × 10⁻³²). MAD was also positively associated with psychological distress (β = 0.08, p = 1.9 × 10⁻⁶), BMI (β = 0.04, p = 2.2 × 10⁻³²), Hemoglobin A1c, hyperglycemia biomarkers (β = 0.04, p = 2.5 × 10⁻³⁹). Socioeconomic variables revealed that lower household income (β = −0.03, p = 8.1 × 10⁻¹⁷) and educational attainment (β = −0.01, p = 2 × 10⁻³) were associated with higher MAD. Lifestyle factors—nutrition quality (β = −0.2, p = 1.5 × 10⁻²²), smoking (β = 0.02, p = 1.7 × 10⁻¹¹), and alcohol consumption (β = 0.03, p = 4.7 × 10⁻⁵), also demonstrated strong associations with elevated MAD. To test for the association between MAD and chronic condition load (number of conditions determined by Instrument II and Instrument III), cognitive (Instrument IV), and physical (Instrument V) function, we additionally adjusted regression models for BMI, alcohol consumption, smoking, education, and physical activity level, resulting in strong associations. All p-values were Bonferroni-adjusted for n = 15 tests (Table 3).

Table 3.

The associations between metabolomic age deviations (MAD) and multiple factors on the test

	Factor	beta	SD	p-value
Overall health	Chronic condition count	0.008	0.001	8.5 × 10⁻³⁶
	Physical function (Instrument IV)	0.003	0.001	2.8 × 10⁻⁹
	Cognitive function (Instrument V)	0.002	0.001	5.6 × 10⁻⁵
Body composition	Body Mass Index	0.037	0.003	2.2 × 10⁻³²
Inflammation	Interleukin-6	0.036	0.003	2.2 × 10⁻³²
	Tumor Necrosis Factor - Alpha	0.043	0.003	8.5 × 10⁻³⁶
	High Sensitivity C-Reactive Protein, mg/L	0.018	0.002	4.7 × 10⁻¹⁴
Hyperglycemia	Hemoglobin A1c, %	0.041	0.003	2.3 × 10⁻³⁹
Lifestyle	Nutritional Risk	-0.203	0.019	1.5 × 10⁻²²
	Smoking status	0.017	0.002	1.7 × 10⁻¹¹
	Alcohol consumption	0.032	0.007	4.7 × 10⁻⁵
	Physical activity levels	0.037	0.003	8.5 × 10⁻³⁶
	Psychological Distress	-1.200	0.225	1.8 × 10⁻⁶
Socioeconomic	Total household income	-0.030	0.003	8.1 × 10⁻¹⁷
	Level of Education	-0.011	0.003	2.0 × 10⁻³

Open in a new tab

Associations were evaluated using linear regression models adjusted for age and sex, with two-sided Wald tests of regression coefficients. P-values were Bonferroni-adjusted (n = 17). MAD are the residuals of the Sweet Spot Clock, calibrated into units of age, and then regressed onto age.

Metabolomic biomarker and mortality-predictive epigenetic biomarker share signals of aging heterogeneity

To understand the relationship with other -omics-based biomarkers, we measured the correlation between MAD and epigenetic age deviation as measured by five established epigenetic clocks. We observed weak correlations with the Hannum clock¹⁵ (r² = 0.12, p = 2.0 × 10⁻⁴), the Horvath Pan-Tissue clock¹⁴ (r² = 0.08, p = 0.005) and DNAm PhenoAge¹³ (r² = 0.17, p = 3.3 × 10⁻⁷). In contrast, the correlation with DNAm GrimAge⁴⁵ adjusted for chronological age appeared to be moderate (r² = 0.38, p = 2.3 × 10⁻⁴⁴ for GrimAge v1). For additional details on correlations between the omics clocks (constructed metabolomic clocks and established epigenetic clocks) in both the full dataset and the test set, see Supplementary Fig. 6 and Supplementary Fig. 7, respectively.

To further explore the predictive performance of the proposed metabolomic biomarker, we conducted a secondary analysis in a subset of 1242 individuals (64 mortality events) with paired metabolomic and DNA methylation data. In this subgroup, the Sweet Spot Clock demonstrated higher discrimination than established epigenetic clocks (C-index: 0.783 for Sweet Spot Clock vs. 0.751 for GrimAge v2). A combined multi-omics model incorporating both MAD and GrimAge v2 achieved the highest discrimination (C-index = 0.806; Supplementary Table 5).

Metabolite-level insights into biological age deviation

Next, we examined the metabolites’ contribution to the calculated age deviation. In total, the Frailty Index-based model for predicting MAD resulted in 126 unique predictors, 100 of which have known identity. Of these 126 metabolites, 111 and 78 were retained in the female and male models, respectively, following separate ElasticNet selections (Supplementary Data 5). In the female model, hydroxyasparagine** (β = 0.0074), mannose (β = 0.0049), and erythronate* (β = 0.0047) contributed the most to Frailty Index. For the male model, mannose (β=0.0039), (S)-3-hydroxybutyrylcarnitine (β = 0.0037), and vanillactate (β = 0.0032) were among the top contributors to MAD. As an additional control, we tested the 50 top predictive metabolites for associations with 40 known disease biomarkers, chronic conditions, and diagnostic measurements, adjusting for age and sex. All analytes showed strong associations after Bonferroni correction (n = 2000) with at least five distinct phenotypes tested (Supplementary Fig. 8).

Replication of associations of MAD with health status and mortality

To replicate the association between MAD, health decline, and mortality, we performed untargeted metabolomic profiling on 667 participants from the Super Seniors Study³⁷ using EDTA plasma samples. For each batch, the data were normalized to the same QC samples as were used by CLSA (“Methods” section), which allowed us to apply the previously identified CLSA sweet spots to the Super Seniors data. We then computed MAD scores using the original, pretrained Sweet Spot Clock without retraining. The dataset comprised 548 exceptionally healthy Super Seniors aged 85 and older (mean age = 89, SD = 3.5, 62% females) who have never been diagnosed with cancer, cardiovascular or pulmonary disease, diabetes, or dementia; and 119 age-matched controls with one or more chronic diseases (mean age = 89, SD = 3.7, 52% females); 612 (92%) were of European ancestry.

First, we calculated MAD for each participant, resulting in a mean age deviation of −0.4 years among the Super Seniors (SD = 3.1) versus 1.1 years (SD = 3.5) among controls. Logistic regression for the all-cause mortality target adjusted for age, sex, and ancestry (European/non-European), resulted in a significant association between MAD and Super-Senior/age-matched control phenotype (β = 0.15, SE = 0.03, OR = 1.16, p = 2.8 × 10⁻⁶, Supplementary Fig. 9). Next, we conducted survival analysis using 658 individuals with known status, 334 of whom died by the status date. After adjusting for age, sex, and ancestry, MAD showed a significant association with mortality (HR = 1.08, SE = 0.02, p = 3.3 × 10⁻⁵, Supplementary Fig. 10) and higher discrimination (C-index = 0.66) than the null model with only age, sex, and ancestry as covariates (C-index = 0.61).

To test generalizability, we calculated MAD for CLSA participants with Asian or Black ancestry, comprising 111 and 63 individuals with metabolomic data, respectively, who had not withdrawn from the CLSA by the second follow-up. There were no deaths among the subset with Asian ancestry, and three deaths in the subset with Black ancestry. Cox regression, adjusted for sex and age, did not reveal a significant association between the MAD and all-cause mortality among the subset with Black ancestry (HR = 1.15, SE = 0.12, p = 0.26).

Discussion

We have introduced the Sweet Spot Clock, a metabolomic predictor of biological age. Our model is restricted to metabolites related to health, and demonstrates that deviations from optimal metabolite values are better predictors of biological age than metabolite levels alone. To the best of our knowledge, this is the first study to explicitly account for non-linearity in metabolomic data. Our blood-based biomarker was strongly associated with all-cause mortality and health-related outcomes. Replication in an independent, older cohort demonstrated the generalizability of the Sweet Spot Clock, although predictive performance was attenuated, likely reflecting age-related metabolic shifts and cohort differences. These findings are consistent with previous research on the metabolome, which has shown that an older predicted metabolomic age compared to chronological age is associated with multiple risk factors for premature mortality²⁵. Our model also showed strong correlations with comorbidity burden, physical decline, and cognitive decline, while exhibiting a weak correlation with epigenetic age deviation computed by the DNAm PhenoAge, Hannum, and Horvath Pan-Tissue clocks, and a moderate correlation with the GrimAge clock that incorporates smoking status and clinical biomarkers. A preliminary multi-omics model combining MAD with GrimAge v2 achieved the highest predictive performance in the paired subset than any individual predictor alone, highlighting a promising future direction. These results further confirm that metabolomic models effectively capture differences in biological age independent of epigenetic changes²⁵.

As expected, age and sex were the strongest predictors of age-related outcomes. While the extended model incorporating lifestyle and socioeconomic factors sometimes achieved higher concordance, the Sweet Spot Clock captured distinct biological variation not explained by these variables. For stroke and kidney disease, it even outperformed conventional predictors (i.e., BMI, alcohol consumption, smoking status, physical activity, and education) combined, suggesting added value in specific domains. These findings highlight the biomarker’s potential to complement established predictors and offer mechanistic insight into why individuals age differently, even if its standalone clinical utility is modest.

To investigate why some individuals remain healthier and experience a later onset of age-related diseases compared to their peers, we focused on deficit accumulation across multiple systems, and trained a statistical model on the Frailty Index (an intermediary target) instead of on age or mortality. Many large-scale biobank studies have developed metabolomic clocks primarily trained to predict chronological age⁵². While these studies demonstrated strong associations with mortality risk, cardiovascular phenotypes, and various disease risk factors, our findings suggest that focusing on deficit accumulation results in superior performance in predicting mortality and age-related diseases. Generally speaking, models trained to accurately predict chronological age do not capture the underlying biological variability. These models focus on aligning with the chronological timeline rather than identifying the physiological changes and risk factors associated with different health outcomes. Another study employed targeted plasma metabolomics to develop metabolomic risk scores trained on all-cause mortality¹⁹. Although mortality is a clearly defined endpoint and has been used in several recent studies, including GrimAge⁴⁵, training statistical models on functional decline or deficit accumulation has a clear advantage. Elucidating dysregulated pathways associated with health decline provides an opportunity for lifestyle or medical intervention. Although deficit accumulation reflects established physiological changes and a degree of resilience loss, it currently represents a comprehensive and clinically meaningful proxy for multisystem health decline in population-based studies. By training metabolomic models on this intermediary phenotype, our approach aims to extract biological signals that precede overt clinical manifestations. This framework offers a path toward refining instruments to capture earlier stages of dysregulation along the aging trajectory.

Previous multi-omics analysis of the Frailty Index reported metabolites and genetic variants associated with frailty in female twins⁵³. While our GWAS analyses did not reveal any significant associations, we were able to replicate some metabolite-FI associations—glutamate, C-glycosyltryptophan, pseudouridine, and epiandrosterone sulfate were among the metabolomic predictors of FI. Additionally, C-glycosyltryptophan, erythronate, and androstenediol (3beta,17beta) disulfate (2) were previously associated with all-cause mortality in the Alpha-Tocopherol, Beta-Carotene Cancer Prevention Study Cohort⁵⁴.

As observed in previous work on Super Seniors⁵⁵ and validated in previous work on phenotypic measures, lower variance among the healthiest group is key to elucidating health-related measurements²⁹. Instead of selecting measurements based on correlation with age, we concentrated on the effect on health to explain differences in aging processes. Variance analysis resulted in a number of metabolic predictors, many of which were already related to diseases. For instance, N6,N6,N6-trimethyllysine (TML) and glucose demonstrated significant variance effects on health, serving as key indicators of disruptions in metabolic pathways such as carnitine synthesis and epigenetic variation⁵⁶ and glucose metabolism. Previous studies have shown an association between elevated TML and glucose levels with metabolic disorders, including diabetes and cardiovascular diseases^57–59. Aconitate has been suggested as a diagnostic marker for mitochondrial aconitase deficiency resulting in mitochondrial dysfunction⁶⁰. Levels of analytes with unknown identity also varied with health, suggesting the presence of significant yet uncharacterized metabolites that may provide additional understanding of metabolic functions and disorders. Overall, by defining health-related metabolites through the lens of variance, we were able to gain meaningful insights into homeostatic regulation and prioritize the most significant analytes for further analysis.

Relationship analysis revealed metabolites that were not previously known to have sweet spots. Notably, we sought to identify age-independent optimal metabolite levels. While previous studies indicate that age significantly influences metabolomic profiles^61,62, it is likely that the observed variability may be attributed more to differential maintenance of metabolic health rather than to the aging process itself. Notably, the majority of the estimated optimal levels deviated from the population mean, suggesting that older adults, on average, are not in optimal health. Although average population values are often used to assess dysregulation levels of individuals, they may not accurately represent health⁶³.

Higher TML levels were previously associated with elevated risk of all-cause and cardiovascular mortality⁵⁸. Our findings extend this observation by showing that levels below the optimal range are also associated with worse health. Increased metabolonic lactone sulfate levels were previously associated with cardiometabolic disease in a Mexican American population-based cohort⁶⁴, while low levels had not been associated with adverse outcomes, but our analyses imply that deviations in either direction from an optimal value are relevant to health.

This study has several limitations. Firstly, 98% of the CLSA study population analyzed consisted of individuals of European ancestry, limiting the generalizability of the findings to diverse ancestries. To address the limited generalizability, it is crucial for future research to include diverse populations. The metabolome is inherently noisy and influenced by factors such as diet and environmental exposures²⁵, making longitudinal metabolomic data necessary, though correction for batch effects remains challenging²⁸. Untargeted metabolomic profiling with relative concentrations of metabolites prevented us from calculating specific optimal levels. Although this study offers strong support for the relative locations of sweet spots, further analysis is needed to establish optimal metabolite levels in absolute concentrations for potential clinical use. Comparisons between our biomarker and established epigenetic clocks should be interpreted with caution. The epigenetic clocks were trained on external datasets that differ from CLSA in age structure, phenotypic breadth, mortality follow-up, and population characteristics. Finally, while we included comparator models reflecting the design principles of age-based (i.e., MetaboAge¹⁷) and mortality-based models (i.e., MetaboHealth¹⁹), direct benchmarking against these established clocks is limited by platform differences (targeted NMR vs. untargeted LC-MS) and distinct training endpoints. Our approach, therefore, complements rather than replicates existing metabolomic clocks, emphasizing health-related decline and non-linear transformations.

To summarize, our study identified metabolites related to health and estimated their optimal levels. We developed a sweet spot metabolomic clock that predicts biological age and examined the genetic and environmental components of age deviation. Associations between the metabolomic biomarker, health decline, and mortality were replicated in an independent cohort of individuals aged 85 years and older.

Supplementary information

Supplementary Material^{(375.9KB, pdf)}

43856_2026_1375_MOESM2_ESM.pdf^{(31.7KB, pdf)}

Description of Additional Supplementary files

Supplementary Data 1^{(122.6KB, xlsx)}

Supplementary Data 2^{(806.7KB, xlsx)}

Supplementary Data 3^{(13KB, xlsx)}

Supplementary Data 4^{(329.3KB, xlsx)}

Supplementary Data 5^{(20KB, xlsx)}

Supplementary Data 6^{(28.8KB, xlsx)}

Supplementary Data 7^{(21.7KB, xlsx)}

Acknowledgements

This research is funded by the Canadian Institutes of Health Research (grant # PAD 179760) to A Brooks-Wilson and LT Elliott. LT Elliott’s research is supported by a Michael Smith Health Research BC Scholar Award. This research was made possible using the data collected by the Canadian Longitudinal Study on Aging (CLSA). Funding for the Canadian Longitudinal Study on Aging (CLSA) is provided by the Government of Canada through the Canadian Institutes of Health Research (CIHR) under grant reference: LSA 94473 and the Canada Foundation for Innovation, as well as the following provinces: Newfoundland, Nova Scotia, Quebec, Ontario, Manitoba, Alberta, and British Columbia. This research has been conducted using the CLSA Baseline Comprehensive Dataset version 7.0, Follow-up 1 Comprehensive version 4.0, Follow-up 2 Comprehensive version 1.0, Genomic data version 3.0, Epigenetics version 1.1, Metabolomics version 2.0 under Application Number 2206033. The CLSA is led by Drs. Parminder Raina, Christina Wolfson, and Susan Kirkland. The AB SCREEN™ II assessment tool is owned by Dr. Heather Keller. Use of the AB SCREEN™ II assessment tool was made under license from the University of Guelph. The opinions expressed in this manuscript are the authors’ own and do not reflect the views of the Canadian Longitudinal Study on Aging.

Author contributions

L.T.E. and A.B.W. developed and directed the project. O.V. developed and performed the modeling pipeline and wrote the original draft. S.L. processed and aliquoted study samples and curated data. J.M., X.S., and K.R. provided critical feedback on the study and reviewed/edited the manuscript. O.V. and L.T.E. prepared the manuscript with input from all authors, and all authors approved the final manuscript.

Peer review

Peer review information

Communications Medicine thanks Sithara Vivek, Chiara Herzog, and Valentin Vetter for their contribution to the peer review of this work.

Data availability

Data are available from the Canadian Longitudinal Study on Aging (www.clsa-elcv.ca) for researchers who meet the criteria for access to de-identified CLSA data. The Super Seniors dataset is stored at Genome Sciences Center, BC Cancer, Canada. For inquiries about this dataset, or for requests to collaborate on projects involving this dataset, please contact Dr. Angela Brooks-Wilson at abrooks-wilson@bcgsc.ca. The list of metabolites used for analysis is provided in Supplementary Data 1. Source data for recreating figures are available at https://zenodo.org/records/17594106⁶⁵ and in Supplementary Data Files 2–7: The source data for Fig. 2a, b is in Supplementary Data 2, the source data for Fig. 2c is in Supplementary Data 3, the source data for Fig. 3 is in Supplementary Data 4, the source data for Fig. 4d is in Supplementary Data 6, and the source data for Fig. 4e is in Supplementary Data 7. The list of predictors included in the Sweet Spot Clock model is in Supplementary Data 5.

Code availability

All code used in this study is available at https://zenodo.org/records/17594106⁶⁵. The data analysis was performed in R v. 4.4.2.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors jointly supervised this work: Angela Brooks-Wilson, Lloyd T. Elliott.

Contributor Information

Angela Brooks-Wilson, Email: abrooks-wilson@bcgsc.ca.

Lloyd T. Elliott, Email: lloyd_elliott@sfu.ca

Supplementary information

The online version contains supplementary material available at 10.1038/s43856-026-01375-2.

References

1.Lowsky, D. J., Olshansky, S. J., Bhattacharya, J. & Goldman, D. P. Heterogeneity in healthy aging. J. Gerontol. A Biol. Sci. Med. Sci.69, 640–649 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Rae, M. J. et al. The demographic and biomedical case for late-life interventions in aging. Sci. Transl. Med.2, 1–6 (2010). [DOI] [PMC free article] [PubMed]
3.van den Berg, N. et al. Longevity defined as top 10% survivors and beyond is transmitted as a quantitative genetic trait. Nat. Commun.10, 1–12 (2019). [DOI] [PMC free article] [PubMed]
4.Chiu, M. et al. Mortality risk associated with psychological distress and major depression: A population-based cohort study. J. Affect. Disord.234, 117–123 (2018). [DOI] [PubMed] [Google Scholar]
5.Stringhini, S. et al. Socioeconomic status, non-communicable disease risk factors, and walking speed in older adults: multi-cohort population based study. BMJ360, 1–10 (2018). [DOI] [PMC free article] [PubMed]
6.Moqri, M. et al. Biomarkers of aging for the identification and evaluation of longevity interventions. Cell186, 3758–3775 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Lohman, T., Bains, G., Berk, L. & Lohman, E. Predictors of biological age: the implications for wellness and aging research. Gerontol. Geriatr. Med.7, 1–13 (2021). [DOI] [PMC free article] [PubMed]
8.Jylhävä, J., Pedersen, N. L. & Hägg, S. Biological age predictors. EBioMedicine21, 29–36 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Rutledge, J., Oh, H. & Wyss-Coray, T. Measuring biological age using omics data. Nat. Rev. Genet.23, 715–727 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Lehallier, B. et al. Undulating changes in human plasma proteome profiles across the lifespan. Nat. Med.25, 1843–1850 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Peters, M. J. et al. The transcriptional landscape of age in human peripheral blood. Nat. Commun.6, 1–14 (2015). [DOI] [PMC free article] [PubMed]
12.Tanaka, T. et al. Plasma proteomic signature of age in healthy humans. Aging Cell17, 1–13 (2018). [DOI] [PMC free article] [PubMed]
13.Levine, M. E. et al. An epigenetic biomarker of aging for lifespan and healthspan. Aging10, 573–591 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Horvath, S. DNA methylation age of human tissues and cell types. Genome Biol.14, 1–20 (2013). [DOI] [PMC free article] [PubMed]
15.Hannum, G. et al. Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol. Cell49, 359–367 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Menni, C. et al. Metabolomic markers reveal novel pathways of ageing and early development in human populations. Int. J. Epidemiol.42, 1111 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Van Den Akker, E. B. et al. Metabolic age based on the BBMRI-NL 1H-NMR metabolomics repository as biomarker of age-related disease. Circ. Genom. Precis. Med.13, 541–547 (2020). [DOI] [PubMed] [Google Scholar]
18.Robinson, O. et al. Determinants of accelerated metabolomic and epigenetic aging in a UK cohort. Aging Cell19, 1–13 (2020). [DOI] [PMC free article] [PubMed]
19.Deelen, J. et al. A metabolic profile of all-cause mortality risk identified in an observational study of 44,168 individuals. Nat. Commun.10, 1–8 (2019). [DOI] [PMC free article] [PubMed]
20.Galkin, F. et al. Human gut microbiome aging clock based on taxonomic profiling and deep learning. iScience23, 1–10 (2020). [DOI] [PMC free article] [PubMed]
21.Sayed, N. et al. An inflammatory aging clock (iAge) based on deep learning tracks multimorbidity, immunosenescence, frailty and cardiovascular aging. Nat. Aging1, 598–615 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Kuiper, L. M. et al. Epigenetic and metabolomic biomarkers for biological age: a comparative analysis of mortality and frailty risk. J. Gerontol. A Biol. Sci. Med. Sci.78, 1753–1762 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Okada, D., Cheng, J. H., Zheng, C., Kumaki, T. & Yamada, R. Data-driven identification and classification of nonlinear aging patterns reveals the landscape of associations between DNA methylation and aging. Hum. Genomics17, 1–11 (2023). [DOI] [PMC free article] [PubMed]
24.Tanaka, T. et al. Plasma proteomic biomarker signature of age predicts health and life span. eLife9, 1–24 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Robinson, O. & Lau, C. H. E. Measuring biological age using metabolomics. Aging12, 22352–22353 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Rattray, N. J. W. et al. Metabolic dysregulation in vitamin E and carnitine shuttle energy mechanisms associate with human frailty. Nat. Commun.10, 11 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Adav, S. S. & Wang, Y. Metabolomics signatures of aging: recent advances. Aging Dis.12, 646–661 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Yu, Y. et al. Correcting batch effects in large-scale multiomics studies using a reference-material-based ratio method. Genome Biol.24, 1–26 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Vishnyakova, O., Song, X., Rockwood, K., Elliott, L. T. & Brooks-Wilson, A. Physiological phenotypes have optimal values relevant to healthy aging: sweet spots deduced from the Canadian Longitudinal Study on Aging. Geroscience46, 1589–1605 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Shen, X. et al. Nonlinear dynamics of multi-omics profiles during human aging. Nat. Aging4, 1619–1634 (2024). [DOI] [PMC free article] [PubMed]
31.Raina, P. et al. Erratum: Cohort profile: the Canadian Longitudinal Study on Aging (CLSA)(International Journal of Epidemiology (2019) 2019 DOI: 10.1093/ije/dyz173). Int. J. Epidemiol.48, 2066 (2019). [DOI] [PMC free article] [PubMed]
32.Taliun, D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature590, 290–299 (2021). [DOI] [PMC free article] [PubMed]
33.Forgetta, V. et al. Cohort profile: genomic data for 26 622 individuals from the Canadian Longitudinal Study on Aging (CLSA). BMJ Open12, 1–9 (2022). [DOI] [PMC free article] [PubMed]
34.Forgetta, V. et al. The Canadian Longitudinal Study on Aging Genome-wide Genetic Data Release (version 3). CLSA Data Support Document v20201207.1851 https://www.clsa-elcv.ca/wp-content/uploads/2023/06/clsa_gwas_v3.pdf (2023).
35.Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature562, 203–209 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Wang, J., Wen, S., Fraser Symmans, W., Pusztai, L. & Coombes, K. R. The bimodality index: a criterion for discovering and ranking bimodal signatures from cancer gene expression profiling data. Cancer Inf.7, 199–216 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Halaschek-Wiener, J. et al. The super-seniors study: phenotypic characterization of a healthy 85+ population. PLoS ONE13, 1–11 (2018). [DOI] [PMC free article] [PubMed]
38.Darst, B. F., Koscik, R. L., Hogan, K. J., Johnson, S. C. & Engelman, C. D. Longitudinal plasma metabolomics of aging and sex. Aging11, 1262–1282 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Brown, M. B. & Forsythe, A. B. Robust tests for the equality of variances. J. Am. Stat. Assoc.69, 364–367 (1974). [Google Scholar]
40.Muggeo, V. M. R. Estimating regression models with unknown break-points. Stat. Med.22, 3055–3071 (2003). [DOI] [PubMed] [Google Scholar]
41.Muggeo, V. M. R. Testing with a nuisance parameter present only under the alternative: a score-based approach with application to segmented modelling. J. Stat. Comput. Simul.86, 3059–3067 (2016). [Google Scholar]
42.Muggeo, V. M. R. Interval estimation for the breakpoint in segmented regression: a smoothed score-based approach. Aust. N. Z. J. Stat.59, 311–322 (2017). [Google Scholar]
43.Kuhn, M. Building predictive models in R using the caret package. J. Stat. Softw.28, 1–26 (2008).27774042 [Google Scholar]
44.Rivals, I., Personnaz, L., Taing, L. & Potier, M. C. Enrichment or depletion of a GO category within a class of genes: which test? Bioinformatics23, 401–407 (2007). [DOI] [PubMed] [Google Scholar]
45.Lu, A. T. et al. DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging11, 303–327 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Zhou, W. et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet.50, 1335–1341 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet.47, 1228–1235 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Keller, H. H., Goy, R. & Kane, S. L. Validity and reliability of SCREEN II (Seniors In The Community: Risk Evaluation for eating and nutrition, Version II). Eur. J. Clin. Nutr.59, 1149–1157 (2005). [DOI] [PubMed] [Google Scholar]
49.Washburn, R. A., McAuley, E., Katula, J., Mihalko, S. L. & Boileau, R. A. The Physical Activity Scale for the Elderly (PASE): evidence for validity. J. Clin. Epidemiol.52, 643–651 (1999). [DOI] [PubMed] [Google Scholar]
50.Kessler, R. C. et al. Screening for serious mental illness in the general population. Arch. Gen. Psychiatry60, 184–189 (2003). [DOI] [PubMed] [Google Scholar]
51.Ying, K., Paulson, S., Eames, A. et al. A unified framework for systematic curation and evaluation of aging biomarkers. Nat. Aging5, 2323–2339 (2025). [DOI] [PubMed]
52.Moqri, M. et al. Validation of biomarkers of aging. Nat. Med.30, 360–372 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Livshits, G. et al. Multi-OMICS analyses of frailty and chronic widespread musculoskeletal pain suggest involvement of shared neurological pathways. Pain159, 2565–2572 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Huang, J. et al. Serum metabolomic profiling of all-cause mortality: a prospective analysis in the alpha-tocopherol, beta-carotene cancer prevention (ATBC) study cohort. Am. J. Epidemiol.187, 1721–1732 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Halaschek-Wiener, J. et al. Reduced telomere length variation in healthy oldest old. Mech. Ageing Dev.129, 638–641 (2008). [DOI] [PubMed] [Google Scholar]
56.Maas, M. N., Hintzen, J. C. J., Porzberg, M. R. B. & Mecinović, J. Trimethyllysine: from carnitine biosynthesis to epigenetics. Int. J. Mol. Sci.21, 1–33 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Vekic, J., Silva-Nunes, J. & Rizzo, M. Glucose metabolism disorders: challenges and opportunities for diagnosis and treatment. Metabolites12, 1–4 (2022). [DOI] [PMC free article] [PubMed]
58.Bjornestad, E. O. et al. Trimethyllysine predicts all-cause and cardiovascular mortality in community-dwelling adults and patients with coronary heart disease. Eur. Heart J. Open1, 1–10 (2021). [DOI] [PMC free article] [PubMed]
59.Li, X. S. et al. Untargeted metabolomics identifies trimethyllysine, a TMAO-producing nutrient precursor, as a predictor of incident cardiovascular disease risk. JCI Insight3, 1–18 (2018). [DOI] [PMC free article] [PubMed]
60.Abela, L. et al. Plasma metabolomics reveals a diagnostic metabolic fingerprint for mitochondrial aconitase (ACO2) deficiency. PLoS ONE12, 1–15 (2017). [DOI] [PMC free article] [PubMed]
61.Trabado, S. et al. The human plasma-metabolome: Reference values in 800 French healthy volunteers; impact of cholesterol, gender and age. PLoS ONE12, 1–17 (2017). [DOI] [PMC free article] [PubMed]
62.Johnson, L. C. et al. Amino acid and lipid associated plasma metabolomic patterns are related to healthspan indicators with ageing. Clin. Sci.132, 1765–1777 (2018). [DOI] [PubMed] [Google Scholar]
63.Cohen, A. A. et al. Statistical distance as a measure of physiological dysregulation is largely robust to variation in its biomarker composition. PLoS One10, 1–25 (2015). [DOI] [PMC free article] [PubMed]
64.Das, S. K. et al. Metabolomic architecture of obesity implicates metabolonic lactone sulfate in cardiometabolic disease. Mol. Metab.54, 1–10 (2021). [DOI] [PMC free article] [PubMed]
65.Vishnyakova, O. et al. Data for: Metabolomic sweet spot clock predicts mortality and age-related diseases in the Canadian Longitudinal Study on Aging. Zenodo. 10.5281/ZENODO.17594106. [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material^{(375.9KB, pdf)}

43856_2026_1375_MOESM2_ESM.pdf^{(31.7KB, pdf)}

Description of Additional Supplementary files

Supplementary Data 1^{(122.6KB, xlsx)}

Supplementary Data 2^{(806.7KB, xlsx)}

Supplementary Data 3^{(13KB, xlsx)}

Supplementary Data 4^{(329.3KB, xlsx)}

Supplementary Data 5^{(20KB, xlsx)}

Supplementary Data 6^{(28.8KB, xlsx)}

Supplementary Data 7^{(21.7KB, xlsx)}

Data Availability Statement

All code used in this study is available at https://zenodo.org/records/17594106⁶⁵. The data analysis was performed in R v. 4.4.2.

[CR1] 1.Lowsky, D. J., Olshansky, S. J., Bhattacharya, J. & Goldman, D. P. Heterogeneity in healthy aging. J. Gerontol. A Biol. Sci. Med. Sci.69, 640–649 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Rae, M. J. et al. The demographic and biomedical case for late-life interventions in aging. Sci. Transl. Med.2, 1–6 (2010). [DOI] [PMC free article] [PubMed]

[CR3] 3.van den Berg, N. et al. Longevity defined as top 10% survivors and beyond is transmitted as a quantitative genetic trait. Nat. Commun.10, 1–12 (2019). [DOI] [PMC free article] [PubMed]

[CR4] 4.Chiu, M. et al. Mortality risk associated with psychological distress and major depression: A population-based cohort study. J. Affect. Disord.234, 117–123 (2018). [DOI] [PubMed] [Google Scholar]

[CR5] 5.Stringhini, S. et al. Socioeconomic status, non-communicable disease risk factors, and walking speed in older adults: multi-cohort population based study. BMJ360, 1–10 (2018). [DOI] [PMC free article] [PubMed]

[CR6] 6.Moqri, M. et al. Biomarkers of aging for the identification and evaluation of longevity interventions. Cell186, 3758–3775 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Lohman, T., Bains, G., Berk, L. & Lohman, E. Predictors of biological age: the implications for wellness and aging research. Gerontol. Geriatr. Med.7, 1–13 (2021). [DOI] [PMC free article] [PubMed]

[CR8] 8.Jylhävä, J., Pedersen, N. L. & Hägg, S. Biological age predictors. EBioMedicine21, 29–36 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Rutledge, J., Oh, H. & Wyss-Coray, T. Measuring biological age using omics data. Nat. Rev. Genet.23, 715–727 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] 10.Lehallier, B. et al. Undulating changes in human plasma proteome profiles across the lifespan. Nat. Med.25, 1843–1850 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Peters, M. J. et al. The transcriptional landscape of age in human peripheral blood. Nat. Commun.6, 1–14 (2015). [DOI] [PMC free article] [PubMed]

[CR12] 12.Tanaka, T. et al. Plasma proteomic signature of age in healthy humans. Aging Cell17, 1–13 (2018). [DOI] [PMC free article] [PubMed]

[CR13] 13.Levine, M. E. et al. An epigenetic biomarker of aging for lifespan and healthspan. Aging10, 573–591 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Horvath, S. DNA methylation age of human tissues and cell types. Genome Biol.14, 1–20 (2013). [DOI] [PMC free article] [PubMed]

[CR15] 15.Hannum, G. et al. Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol. Cell49, 359–367 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Menni, C. et al. Metabolomic markers reveal novel pathways of ageing and early development in human populations. Int. J. Epidemiol.42, 1111 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Van Den Akker, E. B. et al. Metabolic age based on the BBMRI-NL 1H-NMR metabolomics repository as biomarker of age-related disease. Circ. Genom. Precis. Med.13, 541–547 (2020). [DOI] [PubMed] [Google Scholar]

[CR18] 18.Robinson, O. et al. Determinants of accelerated metabolomic and epigenetic aging in a UK cohort. Aging Cell19, 1–13 (2020). [DOI] [PMC free article] [PubMed]

[CR19] 19.Deelen, J. et al. A metabolic profile of all-cause mortality risk identified in an observational study of 44,168 individuals. Nat. Commun.10, 1–8 (2019). [DOI] [PMC free article] [PubMed]

[CR20] 20.Galkin, F. et al. Human gut microbiome aging clock based on taxonomic profiling and deep learning. iScience23, 1–10 (2020). [DOI] [PMC free article] [PubMed]

[CR21] 21.Sayed, N. et al. An inflammatory aging clock (iAge) based on deep learning tracks multimorbidity, immunosenescence, frailty and cardiovascular aging. Nat. Aging1, 598–615 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Kuiper, L. M. et al. Epigenetic and metabolomic biomarkers for biological age: a comparative analysis of mortality and frailty risk. J. Gerontol. A Biol. Sci. Med. Sci.78, 1753–1762 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Okada, D., Cheng, J. H., Zheng, C., Kumaki, T. & Yamada, R. Data-driven identification and classification of nonlinear aging patterns reveals the landscape of associations between DNA methylation and aging. Hum. Genomics17, 1–11 (2023). [DOI] [PMC free article] [PubMed]

[CR24] 24.Tanaka, T. et al. Plasma proteomic biomarker signature of age predicts health and life span. eLife9, 1–24 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Robinson, O. & Lau, C. H. E. Measuring biological age using metabolomics. Aging12, 22352–22353 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Rattray, N. J. W. et al. Metabolic dysregulation in vitamin E and carnitine shuttle energy mechanisms associate with human frailty. Nat. Commun.10, 11 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Adav, S. S. & Wang, Y. Metabolomics signatures of aging: recent advances. Aging Dis.12, 646–661 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Yu, Y. et al. Correcting batch effects in large-scale multiomics studies using a reference-material-based ratio method. Genome Biol.24, 1–26 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Vishnyakova, O., Song, X., Rockwood, K., Elliott, L. T. & Brooks-Wilson, A. Physiological phenotypes have optimal values relevant to healthy aging: sweet spots deduced from the Canadian Longitudinal Study on Aging. Geroscience46, 1589–1605 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.Shen, X. et al. Nonlinear dynamics of multi-omics profiles during human aging. Nat. Aging4, 1619–1634 (2024). [DOI] [PMC free article] [PubMed]

[CR31] 31.Raina, P. et al. Erratum: Cohort profile: the Canadian Longitudinal Study on Aging (CLSA)(International Journal of Epidemiology (2019) 2019 DOI: 10.1093/ije/dyz173). Int. J. Epidemiol.48, 2066 (2019). [DOI] [PMC free article] [PubMed]

[CR32] 32.Taliun, D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature590, 290–299 (2021). [DOI] [PMC free article] [PubMed]

[CR33] 33.Forgetta, V. et al. Cohort profile: genomic data for 26 622 individuals from the Canadian Longitudinal Study on Aging (CLSA). BMJ Open12, 1–9 (2022). [DOI] [PMC free article] [PubMed]

[CR34] 34.Forgetta, V. et al. The Canadian Longitudinal Study on Aging Genome-wide Genetic Data Release (version 3). CLSA Data Support Document v20201207.1851 https://www.clsa-elcv.ca/wp-content/uploads/2023/06/clsa_gwas_v3.pdf (2023).

[CR35] 35.Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature562, 203–209 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] 36.Wang, J., Wen, S., Fraser Symmans, W., Pusztai, L. & Coombes, K. R. The bimodality index: a criterion for discovering and ranking bimodal signatures from cancer gene expression profiling data. Cancer Inf.7, 199–216 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR37] 37.Halaschek-Wiener, J. et al. The super-seniors study: phenotypic characterization of a healthy 85+ population. PLoS ONE13, 1–11 (2018). [DOI] [PMC free article] [PubMed]

[CR38] 38.Darst, B. F., Koscik, R. L., Hogan, K. J., Johnson, S. C. & Engelman, C. D. Longitudinal plasma metabolomics of aging and sex. Aging11, 1262–1282 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR39] 39.Brown, M. B. & Forsythe, A. B. Robust tests for the equality of variances. J. Am. Stat. Assoc.69, 364–367 (1974). [Google Scholar]

[CR40] 40.Muggeo, V. M. R. Estimating regression models with unknown break-points. Stat. Med.22, 3055–3071 (2003). [DOI] [PubMed] [Google Scholar]

[CR41] 41.Muggeo, V. M. R. Testing with a nuisance parameter present only under the alternative: a score-based approach with application to segmented modelling. J. Stat. Comput. Simul.86, 3059–3067 (2016). [Google Scholar]

[CR42] 42.Muggeo, V. M. R. Interval estimation for the breakpoint in segmented regression: a smoothed score-based approach. Aust. N. Z. J. Stat.59, 311–322 (2017). [Google Scholar]

[CR43] 43.Kuhn, M. Building predictive models in R using the caret package. J. Stat. Softw.28, 1–26 (2008).27774042 [Google Scholar]

[CR44] 44.Rivals, I., Personnaz, L., Taing, L. & Potier, M. C. Enrichment or depletion of a GO category within a class of genes: which test? Bioinformatics23, 401–407 (2007). [DOI] [PubMed] [Google Scholar]

[CR45] 45.Lu, A. T. et al. DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging11, 303–327 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR46] 46.Zhou, W. et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet.50, 1335–1341 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR47] 47.Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet.47, 1228–1235 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR48] 48.Keller, H. H., Goy, R. & Kane, S. L. Validity and reliability of SCREEN II (Seniors In The Community: Risk Evaluation for eating and nutrition, Version II). Eur. J. Clin. Nutr.59, 1149–1157 (2005). [DOI] [PubMed] [Google Scholar]

[CR49] 49.Washburn, R. A., McAuley, E., Katula, J., Mihalko, S. L. & Boileau, R. A. The Physical Activity Scale for the Elderly (PASE): evidence for validity. J. Clin. Epidemiol.52, 643–651 (1999). [DOI] [PubMed] [Google Scholar]

[CR50] 50.Kessler, R. C. et al. Screening for serious mental illness in the general population. Arch. Gen. Psychiatry60, 184–189 (2003). [DOI] [PubMed] [Google Scholar]

[CR51] 51.Ying, K., Paulson, S., Eames, A. et al. A unified framework for systematic curation and evaluation of aging biomarkers. Nat. Aging5, 2323–2339 (2025). [DOI] [PubMed]

[CR52] 52.Moqri, M. et al. Validation of biomarkers of aging. Nat. Med.30, 360–372 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR53] 53.Livshits, G. et al. Multi-OMICS analyses of frailty and chronic widespread musculoskeletal pain suggest involvement of shared neurological pathways. Pain159, 2565–2572 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR54] 54.Huang, J. et al. Serum metabolomic profiling of all-cause mortality: a prospective analysis in the alpha-tocopherol, beta-carotene cancer prevention (ATBC) study cohort. Am. J. Epidemiol.187, 1721–1732 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR55] 55.Halaschek-Wiener, J. et al. Reduced telomere length variation in healthy oldest old. Mech. Ageing Dev.129, 638–641 (2008). [DOI] [PubMed] [Google Scholar]

[CR56] 56.Maas, M. N., Hintzen, J. C. J., Porzberg, M. R. B. & Mecinović, J. Trimethyllysine: from carnitine biosynthesis to epigenetics. Int. J. Mol. Sci.21, 1–33 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR57] 57.Vekic, J., Silva-Nunes, J. & Rizzo, M. Glucose metabolism disorders: challenges and opportunities for diagnosis and treatment. Metabolites12, 1–4 (2022). [DOI] [PMC free article] [PubMed]

[CR58] 58.Bjornestad, E. O. et al. Trimethyllysine predicts all-cause and cardiovascular mortality in community-dwelling adults and patients with coronary heart disease. Eur. Heart J. Open1, 1–10 (2021). [DOI] [PMC free article] [PubMed]

[CR59] 59.Li, X. S. et al. Untargeted metabolomics identifies trimethyllysine, a TMAO-producing nutrient precursor, as a predictor of incident cardiovascular disease risk. JCI Insight3, 1–18 (2018). [DOI] [PMC free article] [PubMed]

[CR60] 60.Abela, L. et al. Plasma metabolomics reveals a diagnostic metabolic fingerprint for mitochondrial aconitase (ACO2) deficiency. PLoS ONE12, 1–15 (2017). [DOI] [PMC free article] [PubMed]

[CR61] 61.Trabado, S. et al. The human plasma-metabolome: Reference values in 800 French healthy volunteers; impact of cholesterol, gender and age. PLoS ONE12, 1–17 (2017). [DOI] [PMC free article] [PubMed]

[CR62] 62.Johnson, L. C. et al. Amino acid and lipid associated plasma metabolomic patterns are related to healthspan indicators with ageing. Clin. Sci.132, 1765–1777 (2018). [DOI] [PubMed] [Google Scholar]

[CR63] 63.Cohen, A. A. et al. Statistical distance as a measure of physiological dysregulation is largely robust to variation in its biomarker composition. PLoS One10, 1–25 (2015). [DOI] [PMC free article] [PubMed]

[CR64] 64.Das, S. K. et al. Metabolomic architecture of obesity implicates metabolonic lactone sulfate in cardiometabolic disease. Mol. Metab.54, 1–10 (2021). [DOI] [PMC free article] [PubMed]

[CR65] 65.Vishnyakova, O. et al. Data for: Metabolomic sweet spot clock predicts mortality and age-related diseases in the Canadian Longitudinal Study on Aging. Zenodo. 10.5281/ZENODO.17594106. [DOI] [PMC free article] [PubMed]

PERMALINK

Metabolomic sweet spot clock predicts mortality and age-related diseases in the Canadian Longitudinal Study on Aging

Olga Vishnyakova

Joosung Min

Stephen Leach

Xiaowei Song

Kenneth Rockwood

Angela Brooks-Wilson

Lloyd T Elliott

Abstract

Background

Methods

Results

Conclusions

Plain Language Summary

Introduction

Methods

Data

Genomic data

Untargeted metabolomic data and quality control

Replication cohort

Instruments for health assessment

Statistics and reproducibility

Fig. 1. Overview of the Sweet Spot metabolomic clock framework.

Identifying health-related metabolites

Defining metabolite optimal levels

Distances from sweet spots

Developing metabolomic clocks

Metabolic pathway enrichment analysis

Survival analysis

Heritability estimate

Association analysis

Comparison to established epigenetic clocks

Results

Health assessment and participants characteristics

Twenty percent of metabolites were identified as related to health

Fig. 2. Metabolites identified as health-related.

Most sweet spot estimates differ from population means

Fig. 3. Estimated optimal levels for health-related metabolites.

Sweet spot distance transformation enhances the mortality association of the metabolomic aging biomarker

Fig. 4. Metabolomic age deviation predicts mortality and age-related diseases.

Table 2.

Table 1.

Metabolomic age deviation is strongly associated with mortality and the onset of age-related diseases

Metabolomic age deviation is strongly associated with lifestyle factors

Table 3.

Metabolomic biomarker and mortality-predictive epigenetic biomarker share signals of aging heterogeneity

Metabolite-level insights into biological age deviation

Replication of associations of MAD with health status and mortality

Discussion

Supplementary information

Acknowledgements

Author contributions

Peer review

Peer review information

Data availability

Code availability

Competing interests

Footnotes

Contributor Information

Supplementary information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases