Skip to main content
Breast Cancer Research : BCR logoLink to Breast Cancer Research : BCR
. 2025 Aug 27;27:156. doi: 10.1186/s13058-025-02102-w

Pre-diagnostic serum metabolome and breast cancer risk: a nested case-control study

Ly Trinh 1, Jaclyn Parks 2, Treena McDonald 2, Andrew Roth 3,4,5, Grace Shen-Tu 6, Jennifer Vena 6, Rachel A Murphy 1,2, Parveen Bhatti 1,2,
PMCID: PMC12382045  PMID: 40867009

Abstract

Background

Metabolomics offers a promising approach to identify biomarkers for timely intervention and enhanced screening of individuals at increased risk of developing breast cancer.

Methods

We conducted a study of 593 female breast cancer cases and 593 matched controls nested in two prospective cohort studies. Mass spectrometry, without liquid chromatography, was used to conduct untargeted metabolomics profiling of serum samples collected, on average, 5.3 years before cancer diagnosis. Logistic regression was used to estimate odds ratios (OR) for a one standard deviation increase of metabolite intensities. Partial least squares discriminant analyses were applied to those metabolites significantly associated with breast cancer to develop risk prediction models.

Results

Associations were evaluated with a total of 837 metabolites. Twenty-four metabolites were significantly associated with overall breast cancer risk, including 13 associated with decreased risk and 11 associated with increased risk. Putative identities of the metabolites included various amino acids (n = 3), dietary factors (n = 10), fatty acids (n = 2), phosplipids (n = 4), sex hormone derivatives (n = 2), and xenobiotics (n = 3). For example, a metabolite identified as acetyl tributyl citrate, a plasticizer in food wrappings, was associated with an increased risk of breast cancer (OR = 1.21; 95% CI: 1.07–1.37). Risk prediction models for overall breast cancer and the various subtypes were found to have modest levels of prediction accuracy (area under the curve ranged from 0.60 to 0.63).

Conclusions

Additional studies are needed to confirm the identities of the metabolites and validate their associations with breast cancer risk. Metabolomics should be evaluated in conjunction with other ‘omics’ technologies for creation of clinically useful risk prediction models.

Supplementary Information

The online version contains supplementary material available at 10.1186/s13058-025-02102-w.

Keywords: Breast cancer, Biomarkers, Metabolomics, Mass-spectometry, Risk prediction

Introduction

Tools to predict the future occurrence of breast cancer could significantly reduce the burden of disease by supporting targeted prevention strategies and/or enhanced screening for early detection. Current breast cancer risk prediction models based on known breast cancer risk factors (e.g., age, reproductive events, family history, history of breast diseases) have insufficient discriminatory accuracy to support targeted prevention and screening efforts [1]. Metabolomics, which involves the analysis of low-molecular-weight molecules in biological samples, is a promising approach for predicting breast cancer risk since the metabolome can serve as a functional readout of the impact of genetic and environmental factors on mechanisms that contribute to the initiation and progression of breast cancer [2, 3].

Multiple studies of metabolomics, measured in pre-diagnostic blood samples, and breast cancer risk have been recently published [413]. For example, in one of the largest studies conducted to date, including 1695 incident breast cancer cases and a subcohort of 1983 women drawn from the Cancer Prevention Study 3, associations with breast cancer were evaluated with 868 metabolites [4]. Compelling associations with various sex hormone derivatives, phospholipids, amino acids, dietary factors, and xenobiotics (e.g., environmental pollutants) were observed. Associations with these general categories of metabolites have been observed across multiple studies, though the specific metabolites identified in the studies have tended to differ. The few replicated metabolites across studies may be attributable, at least in part, to multiple studies using targeted assay platforms that assessed relatively few metabolites or using nuclear magnetic resonance (NMR) platforms which are not as sensitive as mass spectrometry (MS) platforms [14]. Only three studies, two of which used NMR platforms, reported breast cancer risk prediction results [6, 9, 12]. Two of the studies demonstrated high prediction accuracy, one of which was nested within a registry of women with a family history of breast and/or ovarian cancer [9, 12].

By pooling data from two prospective Canadian cohort studies, our primary objective was to identify metabolites, measured in pre-diagnostic blood samples using a high throughput untargeted MS-platform, that were associated with breast cancer risk, including risk of major breast cancer subtypes. We also sought to evaluate the utility of the identified metabolites for breast cancer risk prediction.

Methods

Study population

Data for the study came from two prospective longitudinal population cohort studies, the BC Generations Project (BCGP) and Alberta’s Tomorrow Project (ATP). BCGP consists of 29,788 participants from across British Columbia, aged 35 to 69 at time of recruitment, which occurred between 2009 and 2016 [15]. ATP consists of 54,922 participants from across Alberta, aged 35 to 69 at time of recruitment, which occurred in two phases (Phase I, 2000–2008; Phase II, 2009–2015) [16]. During Phase II, newly recruited participants and a portion of previously recruited participants (n = 24,075) completed questionnaires similar to those completed by BCGP participants at time of recruitment. These questionnaires collected health and reproductive history, social demographic characteristics, and behavioral factors. As both BCGP and ATP belong to the Canadian Partnership for Tomorrow’s Health (CanPath) [17], these questionnaires underwent an extensive harmonization process. Participants also provided height and weight measurements either via in-person assessments at a study center or through self-report. Participants provided blood samples, which were divided into aliquots of plasma, serum, buffy coat, and red blood cells and stored at -80 °C.

Study design

Participants were eligible to be included in the current study if they were female, completed a questionnaire (baseline questionnaire for BCGP; Phase II questionnaire for ATP), provided blood samples at time of questionnaire completion, and were cancer-free, except for nonmelanoma skin cancer, at time of questionnaire completion (Fig. 1). A total of 16,494 BCGP and 18,165 ATP participants met these eligibility criteria. Among these participants, cases were selected if they were diagnosed with incident invasive breast cancer. Incident invasive breast cancer cases were identified through annual linkage with each province’s respective Cancer Registry (International Classification of Diseases for Oncology, Third Edition site code: C50.x; morphology codes: 80103, 80323, 80413, 81403, 82003, 82113, 82553, 84013, 84803, 85003, 85033, 85043, 85073, 85203, 85223, 85233, 85243, 85403, 84413, 85753, 89833). Based on morphology codes, cases were classified into four histological subgroups: invasive ductal (85003, 85043, 85073), lobular (85203), mixed ductal/lobular (8522, 8523, 8524, 8255), and other carcinomas. In addition, cases with known hormone receptor expression status were categorized as: luminal A [estrogen receptor positive (ER+) and/or progesterone receptor positive (PR+) and human epidermal growth factor receptor 2 negative (HER2-)], HER2+, and triple-negative breast cancers.

Fig. 1.

Fig. 1

Inclusion criteria for study of serum metabolome and breast cancer risk

Each case was matched to a single control who was cancer free at the time of case diagnosis, except for nonmelanoma skin cancer, on source cohort, baseline menopause status, age at blood collection (± 2 years), and year of blood collection. Those with missing menopause status (n = 16) were classified as premenopausal if age at time of questionnaire completion was under 51 and postmenopausal if age at time of questionnaire completion was 51 – the median age at natural menopause reported in Canada – or older [18].

Health, lifestyle, and reproductive data

Body mass index (BMI), measured in kg/m2, was calculated based on height and weight data collected during in-person assessments (80% of eligible participants). For those without in-person height and weight data, self-reported measures from the harmonized questionnaire were utilized. Other factors obtained from the questionnaire included age at time of questionnaire completion (same as age at blood collection), ethnicity, annual household income, highest attained education, family history of breast cancer (first degree relatives), age at menarche, number of live births, age at first pregnancy, history of oral contraceptive use, history of hormone replacement therapy, alcohol consumption (average over last year), and smoking status. Missing covariate data were assumed to be missing at random and, as such, were imputed using the Multiple Imputation by Chained Equations (MICE) algorithm [19]. All risk factors and the outcome were used as auxiliary variables for imputation. Ten imputed datasets were generated, each with thirty iterations [20].

Metabolomics data

Non-fasting serum samples (30 µl each) were plated on 96-well plates and shipped to General Metabolics LLC (Boston, Massachusetts). For a randomly selected 5% of participants, an additional sample was included as a blind duplicate for quality control assessment. Metabolites were measured using previously described MS methods [21]. Briefly, each sample was diluted with 180 µl of 80% ethanol in water. 5 µl of the resulting solution was injected twice, consecutively, at a flow rate of 0.150 ml/minute, into an Agilent 6550 iFunnel Quadrupole time-of-flight (Q-TOF) mass spectrometer. Blanks, pooled study samples, and National Institute of Standards and Technology standard reference materials were injected after every 48 study samples for quality assurance. Metabolites were detected as peaks with specific mass-to-charge ratios (m/z), and the intensity of each peak was a measure of the relative abundance of each metabolite. All peaks with m/z between 50 and 1,000 were recorded for each sample. For those samples with missing values for a particular metabolite (i.e., no peak detected in the sample), the missing values were backfilled using the measured intensity in the sample at the recorded m/z for that metabolite.

Mass spectra data were processed by General Metabolics using MATLAB (The Mathworks, Natick, MA, USA). Metabolite annotations were identified by comparing measured m/z to theoretical m/z within 0.001 Da recorded in the Human Metabolome Database (HMDB) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) database and cross-referenced against the Chemical Entities of Biological Interest (ChEBI) database [2224]. Since no chromatographic separation was performed in tandem with mass spectrometry, it was not possible to distinguish between metabolites with identical molecular formulas.

Metabolites that were missing in over 50% of the study samples were excluded. Coefficients of variation (CVs) were calculated to assess reproducibility based on the blind duplicate samples. For each pair of duplicate samples, the CV of each metabolite was computed by dividing the standard deviation (SD) by the mean ion intensity, and median CVs across all samples were calculated. Metabolites with median CVs above 20% were excluded. Metabolite intensity measures were normalized using a cubic spline method [25].

After normalization, duplicate intensity measures for each metabolite were averaged to obtain a single measurement per participant. To account for the matched study design, the mean intensity measure of each metabolite in a matched pair was set to zero [26]. In addition, each metabolite intensity was divided by the standard deviation across all samples to assess breast cancer risk associated with one standard deviation increase in metabolite intensity.

Statistical analysis

Associations of each metabolite with breast cancer risk were estimated using unconditional logistic regression adjusted for matching factors [source cohort (BCGP or ATP), menopause status (pre- or post-), age at blood collection (continuous), year of blood collection (2009–2019, 2011–2012, or ≥ 2013)] (R version 4.3.2). Multiple testing was accounted for by applying the Benjamini-Hochberg procedure with the false discovery rate (FDR, also known as q-value) set at 0.1. For metabolites that were deemed significant after multiple testing correction, we fitted separate logistic regression models to investigate the potential for confounding effects by established breast cancer risk factors, including ethnicity (White or Non-white), annual household income (<$50,000, $50,000-<$100,000, or ≥$100,000), highest attained education (high school or less, some post-secondary, or Bachelor’s or higher), family history of breast cancer (yes or no), age at menarche (< 12 years, 12–13 years, or ≥ 14 years), number of live births (continuous), age at first pregnancy (never pregnant, < 25 years, 25–29 years, or ≥ 30 years), oral contraceptive use (ever or never), hormone replacement therapy (ever or never), BMI (< 18.5–24.9 kg/m2, 25.0–29.9 kg/m2, or ≥ 30 kg/m2), alcohol consumption (none, 1–3 times/month, 1–3 times/week, or ≥ 4 times/week), and smoking status (never, past, or current). Analyses were completed on each of the imputed data sets and the resulting odds ratio (OR) estimates were averaged. A pooled Wald-based 95% CI was generated using Rubin’s Rule [27]. Statistically significant metabolites from the overall analysis were also evaluated in the following subgroups: participants who were postmenopausal at time of blood sample collection, cases diagnosed with invasive ductal carcinomas along with their matched controls, and cases diagnosed with luminal A breast cancers and their matched controls. Numbers of other breast cancer subtypes were insufficient to conduct meaningful analyses.

Risk prediction models were developed using partial least squares discrimination analysis (PSL-DA) (R version 4.3.2) [28]. Prediction models were generated using the metabolites found to be statistically significantly associated with breast cancer in the overall logistic regression analysis, with and without established breast cancer risk factors (all models included matching factors). For each model, 80% of the data were used as a training set and 20% were used as a testing set. Pair identifiers were used to split the data to preserve the matched design. For cross-validation, training data were randomly split into 10 equal-sized subsets, each also maintaining case-control matching. Nine subsets were used to train the algorithms across a grid of hyperparameters, and the tenth subset was used to test performance. This process was repeated 10 times, and the hyperparameters that resulted in the highest cross-validated area under the curve (AUC) were used to predict outcome in the testing data set. One-hundred models were generated from 100 independent training-testing partitions. The empirical mean and 95% CI for predictive performance were calculated.

Results

A total of 593 matched cases and controls – consisting of 226 cases and controls from ATP and 367 cases and controls from BCGP – were included in the study. Distributions of variables for cases and controls, before imputation, are summarized in Table 1. The distributions of breast cancer risk factors were generally similar between cases and controls, though more controls (79.4%) than cases (73.5%) reported no family history of breast cancer. For cases, cancer diagnosis occurred within a mean (SD) of 5.3 (2.6) years after questionnaire completion/blood sample collection, with a mean (SD) age at diagnosis of 62.5 (7.9) years. Hormone receptor status data were missing for 206 (34.7%) breast cancer cases.

Table 1.

Characteristics of cases and controls

Variable Casea Controla
n = 593 n = 593
Cohort, n (%)
 BCGP 367 (61.9%) 367 (61.9%)
 ATP 226 (38.1%) 226 (38.1%)
Menopause stats, n(%)
 Premenopausal 167 (28.2%) 167 (28.2%)
 Postmenopausal 426 (71.8%) 426 (71.8%)
Age at blood collection, mean (SD) 58.1 (7.9) 58.0 (8.0)
Year of blood collection, n (%)
 2009–2010 150 (25.3%) 148 (25.0%)
 2011–2012 190 (32.0%) 201 (33.9%)
 ≥2013 253 (42.7%) 244 (41.1%)
Ethnicity, n (%)
 White 481 (81.1%) 479 (80.8%)
 Non-white 36 (6.1%) 37 (6.2%)
 Missing 76 (12.8%) 77 (13.0%)
Annual household income, n (%)
 <$50,000 135 (22.8%) 131 (22.1%)
 $50,000-<$100,000 214 (36.1%) 217 (36.6%)
 ≥$100,000 195 (32.9%) 210 (35.4%)
 Missing 49 (8.3%) 35 (5.9%)
Highest attained education, n (%)
 High school or less > 100 (> 18.6%) > 100 (> 18.7%)
 Some post-secondary 229 (38.6%) 239 (36.6%)
 Bachelor’s or higher 218 (32.9%) 214 (35.4%)
 Missing < 10 (< 1.7%) < 10 (< 1.7%)
Family history of breast cancer, n (%)
 Yes > 100 (> 18.6%) 108 (18.2%)
 No 436 (73.5%) 471 (79.4%)
 Missing < 10 (< 1.7%) 14 (2.4%)
Age at menarche, n (%)
 <12 years 99 (16.7%) 93 (15.7%)
 12–13 years 341 (57.5%) 330 (55.6%)
 ≥14 years 131 (22.1%) 150 (25.3%)
 Missing 22 (3.7%) 20 (3.4%)
Number of live births, mean (SD) 1.7 (1.2) 1.9 (1.2)
 Missing, n < 10 < 10
Age at first pregnancy, n (%)
 Never pregnant 96 (16.3%) 90 (15.3%)
 <25 years 248 (41.8%) 281 (47.4%)
 25–29 years 142 (24.1%) 132 (22.4%)
 ≥30 years 103 (17.5%) 87 (14.7%)
Oral contraceptive use, n (%)
 Ever 539 (90.9%) 520 (87.7%)
 Never > 50 (< 8.4%) > 50 (< 8.4%)
 Missing < 10 (< 1.7%) < 10 (< 1.7%)
Hormone replacement therapy use, n (%)
 Ever 232 (39.5%) 220 (37.2%)
 Never 356 (60.5%) 372 (62.5%)
Body mass index, n (%)
 <18.5–24.9 kg/m2 257 (43.3%) 271 (45.7%)
 25.0–29.9 kg/m2 175 (29.5%) 163 (27.5%)
 ≥30.0 kg/m2 115 (19.4%) 117 (19.7%)
 Missing 46 (7.8%) 42 (7.1%)
Alcohol consumption frequency/week, n (%)
 None > 50 (> 8.4%) > 50 (> 8.4%)
 1–3 times/month 208 (35.1%) 197 (33.2%)
 1–3 times/week 177 (29.8%) 180 (30.4%)
 ≥4 times/week 148 (25.0%) 144 (24.3%)
 Missing < 10 (< 1.7%) < 10 (1.7%)
Smoking status, n (%)
 Never smoker 315 (53.1%) 311 (52.4%)
 Past smoker 237 (40.0%) 237 (40.0%)
 Current smoker 28 (4.7%) 28 (4.7%)
 Missing 13 (2.2%) 17 (2.9%)
Age at breast cancer diagnosis, mean (SD) 62.5 (7.9)
Time to diagnosis (years), mean (SD) 5.3 (2.6)
Histological subtype, n (%)
 Invasive ductal 442 (74.5%)
 Lobular 73 (12.3%)
 Mixed ductal/lobular 45 (7.6%)
 Other 33 (5.6%)
Molecular subtype, n (%)
 ER + and/or PR+, HER2- (Luminal A) 300 (50.6%)
 HER2+ 48 (8.1%)
 Triple-negative 39 (6.6%)
 Missing 206 (34.7%)

Any cells in the table with less than 10 observations were labelled as ‘<10’. Adjacent cells were labelled as ‘>50’ or ‘>100’ to help obscure the low number of observations

A total of 1,676 metabolites were annotated, of which 854 (50.9%) were detected among at least 50% of study samples. Median CVs among duplicate samples ranged from 1.1 to 31.9%, with 17 metabolites (2.0%) having CVs greater than 20%. These metabolites were excluded, leaving 837 metabolites for analysis.

Associations between each of the 837 metabolites and overall breast cancer risk are presented in Supplementary Table 1. Each metabolite was assigned an identification number, and for each metabolite identification number, Supplementary Table 2 lists potential annotations. The potential annotations were not always distinct metabolites; some were alternative names for the same metabolite. After correcting for multiple comparisons, 24 metabolites were statistically significantly associated with overall breast cancer risk (Table 2). Thirteen metabolites were associated with reduced breast cancer risk, with ORs ranging from 0.63 to 0.84. Metabolite 63 [monomethyl sulfate was only identified annotation (Supplementary Table 2)] had the strongest inverse association (OR = 0.63; 95% CI: 0.51–0.78). Eleven metabolites were positively associated with breast cancer risk, with ORs ranging from 1.20 to 1.23. Metabolite 1110, annotated as multiple glycoside compounds, had the strongest positive association (OR = 1.23 95% CI: 1.09–1.39). No confounders of these associations were identified (addition of covariates listed in Table 1 did not materially impact OR estimates) so final models included only matching factors. Figure 2 displays the ORs for each of the 24 metabolites, grouped by metabolite category. Categories are based on the possible identities of the metabolites (see Discussion for details).

Table 2.

Associations of significant metabolites with overall breast cancer and breast cancer subgroups

Metabolite ID numbera Overallb Postmenopausalb Invasive ductalb Luminal Ab
OR 95% CI q OR 95% CI q OR 95% CI q OR 95% CI q
63 0.63 0.51, 0.78 0.01 0.57 0.44, 0.76 0.02 0.65 0.51, 0.84 0.05 0.82 0.60, 1.14 0.52
729 0.68 0.57, 0.81 0.01 0.61 0.49, 0.77 0.01 0.65 0.52, 0.82 0.04 0.73 0.59, 0.90 0.12
184 0.77 0.64, 0.91 0.10 0.68 0.55, 0.84 0.04 0.84 0.70, 1.00 0.34 0.65 0.51, 0.85 0.09
724 0.81 0.72, 0.92 0.10 0.75 0.64, 0.88 0.04 0.79 0.68, 0.91 0.05 0.86 0.70, 1.04 0.40
1139 0.81 0.72, 0.91 0.10 0.80 0.70, 0.93 0.10 0.84 0.74, 0.97 0.20 0.72 0.60, 0.87 0.06
809 0.82 0.73, 0.92 0.10 0.77 0.67, 0.89 0.04 0.82 0.71, 0.94 0.12 0.89 0.76, 1.04 0.44
147 0.83 0.74, 0.93 0.10 0.81 0.70, 0.94 0.12 0.81 0.71, 0.93 0.10 0.94 0.79, 1.12 0.76
227 0.83 0.74, 0.93 0.10 0.77 0.67, 0.89 0.04 0.84 0.73, 0.96 0.16 0.82 0.69, 0.97 0.21
630 0.83 0.74, 0.94 0.10 0.83 0.72. 0.95 0.13 0.81 0.71, 0.93 0.10 0.84 0.72, 0.99 0.25
1260 0.83 0.74, 0.93 0.10 0.84 0.73, 0.97 0.18 0.87 0.76, 0.99 0.31 0.81 0.68, 0.96 0.20
123 0.84 0.75, 0.94 0.10 0.80 0.69, 0.93 0.10 0.86 0.75, 0.98 0.23 0.90 0.76, 1.07 0.52
200 0.84 0.75, 0.94 0.10 0.84 0.73, 0.96 0.17 0.78 0.68, 0.90 0.04 0.85 0.71, 1.02 0.34
1370 0.84 0.75, 0.94 0.10 0.90 0.78, 1.03 0.4 0.89 0.78, 1.01 0.39 0.79 0.67, 0.93 0.14
634 1.20 1.07, 1.35 0.10 1.21 1.05, 1.40 0.14 1.18 1.03, 1.35 0.21 1.32 1.11, 1.57 0.10
689 1.20 1.07, 1.34 0.10 1.23 1.07, 1.41 0.10 1.26 1.10, 1.43 0.05 1.18 1.02, 1.38 0.24
1625 1.20 1.07, 1.34 0.10 1.12 0.97, 1.28 0.39 1.20 1.05, 1.36 0.14 1.09 0.92, 1.28 0.61
1158 1.21 1.07, 1.37 0.10 1.15 0.99, 1.34 0.31 1.18 1.02, 1.36 0.23 1.13 0.94, 1.36 0.49
1282 1.21 1.08, 1.36 0.10 1.16 1.01, 1.32 0.25 1.35 1.17, 1.56 0.01 1.09 0.92, 1.29 0.59
1377 1.21 1.08, 1.36 0.10 1.08 0.94, 1.24 0.52 1.29 1.12, 1.49 0.04 1.03 0.86, 1.23 0.87
1600 1.21 1.08, 1.36 0.10 1.23 1.07, 1.42 0.10 1.19 1.04, 1.37 0.16 1.23 1.04, 1.46 0.20
1612 1.21 1.08, 1.36 0.10 1.12 0.97, 1.28 0.38 1.33 1.16, 1.52 0.02 1.09 0.93, 1.29 0.58
1194 1.22 1.08, 1.38 0.10 1.19 1.03, 1.38 0.2 1.19 1.03, 1.37 0.22 1.15 0.96, 1.37 0.43
1624 1.22 1.09, 1.37 0.10 1.13 0.99, 1.30 0.33 1.34 1.17, 1.54 0.01 1.11 0.95, 1.31 0.50
1110 1.23 1.09, 1.39 0.10 1.19 1.02, 1.39 0.22 1.23 1.06, 1.43 0.13 1.09 0.91, 1.31 0.64

aSee Supplementary Table 2 for potential annotations

bAdjusted for source cohort (BCGP or ATP), menopause status (pre- or post-), age at blood collection (continuous), and year of blood collection (2009–2019, 2011–2012, or ≥ 2013)

Fig. 2.

Fig. 2

Statistically significant metabolite associations with overall breast cancer risk, grouped by metabolite category. Numbers correspond to metabolite identification numbers (Table 2). Categories are based on the possible identities of metabolites

Among postmenopausal participants (426 case-control pairs), statistically significant associations were observed for ten of the 24 metabolites (Table 2). Eight metabolites were associated with decreased risk of breast cancer, and two metabolites were associated with increased risk of breast cancer. As in the overall analysis, the strongest inverse association was observed with metabolite 63 (monomethyl sulfate, OR = 0.57, 95% CI: 0.44–0.76). Significant positive associations were observed with metabolite 689 (n-[4’-hydroxy-(e)-cinnamoyl]-l-aspartic acid was only identified annotation, OR = 1.23; 95% CI: 1.07–1.41) and metabolite 1600 (annotated as multiple phosphatidylethanolamines, OR = 1.23; 95% CI: 1.07–1.42).

Eleven of the 24 metabolites were statistically significantly associated with invasive ductal carcinoma of the breast (442 case-control pairs), including six associated with reduced risk and five associated with increased risk (Table 2). The strongest inverse associations were observed for metabolites 63 (monomethyl sulfate, OR = 0.65; 95% CI: 0.51–0.84) and 729 (lauroyl diethanolamide was only identified annotation, OR = 0.65; 95% CI: 0.52–0.82). The strongest positive association was observed with metabolite 1282 (multiple potential annotations, OR = 1.35; 95% CI: 1.17–1.56).

Three of the 24 metabolites were found to be statistically significantly associated with risk of luminal A breast cancer (300 case-control pairs), including two associated with reduced risk and one associated with increased risk of breast cancer. The strongest protective association was observed for metabolite 184 (multiple potential annotations, OR = 0.65; 95% CI: 0.51–0.85), while the positive association was observed for metabolite 634 (multiple potential annotations, OR = 1.32; 95% CI: 1.11–1.57).

The AUCs for the breast cancer risk prediction models are provided in Table 3. AUCs ranged from 0.60 to 0.63 for overall breast cancer and for each of the subgroups, with and without inclusion of known breast cancer risk factors.

Table 3.

Area under the curve (AUC) of risk prediction models for overall breast cancer and breast cancer subgroups

Group Predictorsa AUC (95% CI)
Overall BC Metabolites 0.62 (0.56–0.70)
Metabolites and risk factorsb 0.60 (0.52–0.69)
Postmenopausal Metabolites 0.63 (0.52–0.71)
Metabolites and risk factorsb 0.61 (0.58–0.72)
Invasive ductal Metabolites 0.62 (0.52–0.71)
Metabolites and risk factorsb 0.60 (0.56–0.68)
Luminal A Metabolites 0.62 (0.46–0.73)
Metabolites and risk factorsb 0.60 (0.57–0.71)

aAll models included matching factors (cohort, menopause status, age at blood collection, year of blood collection

bRisk factors included ethnicity, annual household income, highest attained education, family history of breast cancer, age at menarche, number of live births, age at first pregnancy, oral contraceptive use, hormone replacement therapy, BMI, alcohol consumption, and smoking status

Discussion

We observed multiple metabolites to be associated with increased and decreased risks of breast cancer in our study. No notable differences in direction and strength of associations between metabolites and breast cancer subtypes were noted. Risk prediction models, with and without breast cancer risk factors, had only modest levels of predictive performance.

The strongest overall association was a protective effect observed with metabolite 63, which was identified as monomethyl sulfate. This was counter-intuitive since monomethyl sulfate is a xenobiotic with genotoxic properties found in airborne particulate matter and as an impurity in pharmaceutical manufacturing [29, 30]. Another metabolite, 1158, was associated with an increased risk of breast cancer and was identified as acetyl tributyl citrate. This is a common plasticizer in food wrappings, children’s toys, and medical coatings [31] that has been previously associated with increased breast cancer risk [5]. Metabolite 1139 was associated with a decreased risk of breast cancer, and the only annotation identified was N-benzoyl-D-arginine-4-nitroanilide. Limited information on sources and toxicity of this compound was found, though it was previously identified in a human blood exposome database and is likely of exogenous origin [32].

Multiple metabolites with potential links to dietary intake were associated with breast cancer risk in our study. Metabolite 729, identified as lauroyl diethanolamide, is a derivative of lauric acid, which is commonly found in coconut and palm kernel oils. Consistent with our observation, a previous study linked increased breast cancer risk with lower plasma lauric acid concentrations [33]. We observed metabolites 147 and 200 to be associated with decreased risks of breast cancer. One of the annotations for metabolite 147 was ethyl maltol, a commonly used flavoring in food and e-cigarettes with possible cytotoxic properties [34]. No information on anti-carcinogenic effects of the other annotations for this metabolite was identified. Annotations for metabolite 200 included multiple compounds that, according to the Human Metabolome Database, are naturally occurring in food (e.g. 3-ethyl-5-methoxyphenol) or added to food as flavoring (e.g., 3-(5-methyl-2-furanyl)butanal) [35]. Limited information on the toxicity of these compounds was identified. The potential annotations for metabolite 184, which was associated with a decreased risk of breast cancer, included cinnamate. Cinnamate is a plant compound with multiple anticarcinogenic properties, including suppression of cell proliferation and metastasis [36]. Similarly, one of the annotations for metabolite 1260, which was also associated with a decreased risk of breast cancer, was the plant compound kurarinone, which has anti-inflammatory and anti-oxidative properties [37]. Metabolite 634 was associated with an increased risk of breast cancer; however, all potential annotations were compounds that have been associated with anticarcinogenic effects. For example, one of the annotations was celerin, a coumarin in celery, and coumarins have anti-tumour effects such as inducing apoptosis and inhibition of tumor angiogenesis [38].

Metabolites 1377 and 1194, which were both associated with increased risks of breast cancer, were annotated as Fusicoccin H and atractyloside G, respectively. Both are glycosides produced by fungi and plants; exposure to glycosides would occur primarily through our diet. While atractylosides have anti-cancer properties [39], Fusicoccins have been shown to stabilize protein-protein interactions that may promote cancer cell proliferation and migration [40]. Metabolite 1110, which was also associated with an increased risk of breast cancer, was annotated as multiple glycosides. No information on the carcinogenic properties of these compounds was identified.

Metabolite 1282 was associated with an increased risk of breast cancer. One of its possible annotations was 1a,24R,25-trihydroxyvitamin D2, a metabolite of vitamin D2; vitamin D2 is obtained through our diet. While no information about the carcinogenic impacts of 1a,24R,25-trihydroxyvitamin was found, other vitamin D2 metabolites, such as 1,24(S)-dihydroxyvitamin D2, have been observed to have antitumor activity in animal breast cancer cells [41]. No information suggesting potential carcinogenic effects of the other possible annotations for this metabolite was found.

Metabolites 123 and 689 were identified as oxalurate and n-[4’-hydroxy-(e)-cinnamoyl]-l-aspartic acid, respectively. Oxalurate, which was associated with a decreased risk of breast cancer, is an amino acid while n-[4’-hydroxy-(e)-cinnamoyl]-l-aspartic acid, which was associated with an increased risk of breast cancer, is a derivative of the amino acid aspartate. No previous studies of oxalurate and cancer were identified; however, aspartate has been previously associated with increased risk of breast cancer which is consistent with our finding [42]. Metabolite 227 was associated with a decreased risk of breast cancer. Annotations for the metabolite included various breakdown products of the amino acid tyrosine, such as succinylacetone, which has been shown to have anti-proliferative effects [43].

Metabolites 1600, 1612, 1624, and 1625 were all associated with increased breast cancer risks, and each one was annotated as multiple phosphatidylethanolamines. While previous studies have identified various phospholipids – including phosphatidylethanolamines – to be associated with breast cancer risk, unlike our findings, mostly inverse relationships between these lipids and breast cancer risk have been observed [4, 7, 8].

Metabolite 630 was associated with a decreased risk of breast cancer. One of its possible annotations was 10-hydroxymyristic acid methyl ester, a fatty alcohol of myristic acid which is abundant in milk fat [44]. Contrary to our finding, circulating plasma myristic acid levels were observed to be associated with increased postmenopausal breast cancer risk in a previous study [45]. No information on potential anti-cancer properties of the other possible annotations of this metabolite was identified. One of the annotations for metabolite 809 was arachidonate, an omega-6 polyunsaturated fatty acid. Arachidonate induces anti-inflammatory lipoxins which may suppress proliferation of blood vessels that support tumor growth [46]. Lower blood arachidonate levels have been previously associated with increased postmenopausal breast cancer risk [47]. This is consistent with findings from our study in which metabolite 809 was observed to be associated with a decreased risk of breast cancer among postmenopausal women.

Metabolite 724 was associated with a decreased risk of breast cancer and was annotated as various estrogen metabolites. One of the potential annotations was 2-hydroxyestrone, which is protective against breast cancer since it does not stimulate cell division and, by binding with estrogen receptors, can prevent the proliferative effect of other estrogen metabolites [48]. Metabolite 1370 was also associated with a decreased risk of breast cancer. One of its potential annotations was 11-oxo-androsterone glucuronide, an androgen metabolite [49]. 11-oxygenated androgens have been previously associated with increased risk of breast cancer [50]. No information on anti-carcinogenic effects of the other annotations for this metabolite were identified.

In a previous study of 419 cases and 419 controls, Bro et al. (2015) applied PLS-DA to NMR and breast cancer risk factor data to create risk prediction models [9]. In that study, a total of 129 spectral regions were identified from the NMR data and included as separate variables in the analysis (the number of metabolites represented by these data was not indicated). The resulting model had good prediction accuracy for overall breast cancer (AUC = 0.89); models for breast cancer subtypes were not generated.

In a study of 791 breast cancer cases and 791 controls, Jobard et al. (2021) also applied PLS-DA to NMR data [6]. NMR spectra were reduced to 8500 bins, each included as a separate variable in the analysis (43 metabolites were identified from the NMR data). The overall model and model focused on postmenopausal cases had poor prediction accuracies (AUC = 0.51 and 0.53, respectively), while a model focused on premenopausal cases had modest prediction accuracy (AUC = 0.61). We did not have enough premenopausal participants in our study to support meaningful analyses. The lower predictive performance reported by Jobard et al. (2021) and our study compared to Bro et al. (2015) may stem from differences in the time interval between blood collection and cancer diagnosis. Shorter intervals are more likely to capture cases with overt disease, potentially resulting in greater alterations in metabolomic profile. While two to five years passed between blood collection and cancer diagnosis for cases in Bro et al. (2015), up to 12.75 years passed between blood collection and cancer diagnosis for cases in Jobard et al. (2021). The time intervals in our study are more consistent with those of Jobard et al. (2021) (up to 12 years). Given the limited sample size, we did not conduct effect modification analyses to determine if associations differed by time interval between blood collection and cancer diagnosis in our study.

Wu et al. (2024) conducted a small-scale study of 40 cases and 70 age-matched controls from a registry of women with a family history of breast and/or ovarian cancer. Eight metabolites were found to be associated with breast cancer risk before multiple comparisons adjustment [12]. When adding these metabolites to a PLS-DA model including age and the Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm (BOADICEA) risk score, the AUC improved from 0.66 to 0.83. While the study did include cases who were diagnosed with breast cancer more than 10 years after blood sample collection, the higher predictive performance as compared to our study may be because Wu et al. (2024) focused on a high-risk population.

Strengths of our study include availability of detailed harmonized data from two well-characterized cohort studies and use of a highly efficient and cost-effective approach with which to conduct untargeted metabolomics analyses and identify promising markers of cancer risk. Ours is also one of the few studies to have used metabolomics data to develop breast cancer prediction models. Given that previous studies used PLS-DA to develop prediction models [6, 9, 12], we did the same to enhance comparability of results. There are, however, multiple classification methods that can be used to develop prediction models, each with their own strengths and weaknesses [51, 52].

Because the metabolomics assay did not incorporate chromatography, we were unable to specifically identify each of the metabolites for which statistically significant associations were observed. This made it impractical to conduct meaningful pathway analyses. Ultimately, a follow-up study incorporating chromatography will be needed to specifically identify metabolites of interest. Previous studies have shown that non-fasting samples, as used in our study, lead to increased variability in metabolite measures [53, 54], potentially masking associations with disease risk. Sample sizes for the subgroup analyses were limited, particularly when examining the luminal A subtype of breast cancer; unfortunately, we were missing hormone receptor status data for a large number of cases.

We observed multiple biologically plausible associations between metabolites and breast cancer risk. However, prediction models generated using these metabolites were found to have modest discrimination accuracy. Multi-omics approaches that incorporate metabolomics data may better capture the complex transformations associated with cancer development [55] and, may, therefore, be better suited for developing useful risk prediction models.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1 (197KB, xlsx)
Supplementary Material 2 (271.1KB, xlsx)

Acknowledgements

The data and biosamples used in this study were made available by the BC Generations Project and Alberta’s Tomorrow Project. Thank you to all of the BCGP participants for their contributions to this study, without whom this work would not have been possible. Alberta’s Tomorrow Project is only possible because of the commitment of its research participants, its staff and its funders: Alberta Health, Alberta Cancer Foundation, Canadian Partnership Against Cancer and Health Canada, and substantial in kind funding from Alberta Health Services. Cancer registry data was obtained through linkage with Surveillance & Reporting, Cancer Research & Analytics, Cancer Care Alberta. The views expressed herein represent the views of the author(s) and not of Alberta’s Tomorrow Project or any of its funders.

Abbreviations

OR

Odds ratio

CI

Confidence interval

NMR

Nuclear Magnetic Resonance

MS

Mass spectrometry

BCGP

BC Generations Project

ATP

Alberta’s Tomorrow Project

CanPath

Canadian Partnership for Tomorrow’s Health

ER+

Estrogen receptor positive

PR+

Progesterone receptor positive

HER2-

Human epidermal growth factor receptor 2 negative

HER2+

Human epidermal growth factor receptor 2 positive

BMI

Body mass index

MICE

Multiple imputation by chained equations

Q-TOF

Quadrupole time-of-flight

m/z

Mass-to-charge ratios

HMDB

Human Metabolome Database

KEGG

Kyoto Encyclopedia of Genes and Genomes

ChEBI

Chemical Entities of Biological Interest

CV

Coefficient of variation

FDR

False discovery rate

PSL-DA

Partial least squares discrimination analysis

AUC

Area under the curve

SD

Standard deviation

Author contributions

Conceptualization, A.R., G.S., J.V., R.A.M., and P.B.; methodology, L.T., J.P, T.M., A.R., G.S., J.V., R.A.M., and P.B.; formal analysis, L.T and J.P.; resources, G.S., J.V., and P.B.; data curation, J.P. and T.M.; writing—original draft preparation, L.T., and P.B.; writing—review and editing, L.T., J.P., T.M., A.R., G.S., J.V., R.A.M., and P.B.; supervision, A.R., R.A.M., and P.B.; project administration, P.B.

Funding

The study was funded through an operating grant provided by the Cancer Research Society (837695).

Data availability

Data utilized in this manuscript are available through data access applications to the BC Generations Project (https://bcgpresearch.ca/) and Alberta’s Tomorrow Project (https://myatpresearch.ca/).

Declarations

Ethics approval and consent to participate

The study was approved by the BC Cancer/University of British Columbia Research Ethics Board (H21-03002). Informed consent was obtained from all of the subjects involved in this study.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Louro J, Posso M, Boon MH, Román M, Domingo L, Castells X, et al. A systematic review and quality assessment of individualised breast cancer risk prediction models. Br J Cancer. 2019;121:76–85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Rattray NJW, Deziel NC, Wallach JD, Khan SA, Vasiliou V, Ioannidis JPA, et al. Beyond genomics: Understanding exposotypes through metabolomics. Hum Genomics. 2018;12:4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Wishart D. Metabolomics and the Multi-Omics view of cancer. Metabolites. 2022;12:154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Stevens VL, Carter BD, Jacobs EJ, McCullough ML, Teras LR, Wang Y. A prospective case–cohort analysis of plasma metabolites and breast cancer risk. Breast Cancer Res. 2023;25:5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lécuyer L, Dalle C, Lefevre-Arbogast S, Micheau P, Lyan B, Rossary A, et al. Diet-Related metabolomic signature of Long-Term breast cancer risk using penalized regression: an exploratory study in the SU.VI.MAX cohort. Cancer Epidemiol Biomarkers Prev. 2020;29:396–405. [DOI] [PubMed] [Google Scholar]
  • 6.Jobard E, Dossus L, Baglietto L, Fornili M, Lécuyer L, Mancini FR, et al. Investigation of Circulating metabolites associated with breast cancer risk by untargeted metabolomics: a case-control study nested within the French E3N cohort. Br J Cancer. 2021;124:1734–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Brantley KD, Zeleznik OA, Rosner B, Tamimi RM, Avila-Pacheco J, Clish CB, et al. Plasma metabolomics and breast cancer risk over 20 years of Follow-up among postmenopausal women in the nurses’ health study. Cancer Epidemiol Biomarkers Prev. 2022;31:839–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Moore SC, Mazzilli KM, Sampson JN, Matthews CE, Carter BD, Playdon MC, et al. A metabolomics analysis of postmenopausal breast cancer risk in the cancer prevention study II. Metabolites. 2021;11:95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Bro R, Kamstrup-Nielsen MH, Engelsen SB, Savorani F, Rasmussen MA, Hansen L, et al. Forecasting individual breast cancer risk using plasma metabolomics and biocontours. Metabolomics. 2015;11:1376–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.His M, Viallon V, Dossus L, Gicquiau A, Achaintre D, Scalbert A, et al. Prospective analysis of Circulating metabolites and breast cancer in EPIC. BMC Med. 2019;17:178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Playdon MC, Ziegler RG, Sampson JN, Stolzenberg-Solomon R, Thompson HJ, Irwin ML, et al. Nutritional metabolomics and breast cancer risk in a prospective study. Am J Clin Nutr. 2017;106:637–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wu H-C, Lai Y, Liao Y, Deyssenroth M, Miller GW, Santella RM, et al. Plasma metabolomics profiles and breast cancer risk. Breast Cancer Res. 2024;26:141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Mrowiec K, Kurczyk A, Jelonek K, Debik J, Giskeødegård GF, Bathen TF et al. Association of serum metabolome profile with the risk of breast cancer in participants of the HUNT2 study. Front Oncol [Internet]. Frontiers; 2023 [cited 2024 Aug 21];13. Available from: https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2023.1116806/full [DOI] [PMC free article] [PubMed]
  • 14.Emwas A-HM. The strengths and weaknesses of NMR spectroscopy and mass spectrometry with particular focus on metabolomics research. Methods Mol Biol. 2015;1277:161–93. [DOI] [PubMed] [Google Scholar]
  • 15.Dhalla A, McDonald TE, Gallagher RP, Spinelli JJ, Brooks-Wilson AR, Lee TK et al. Cohort profile: the British Columbia generations project (BCGP). Int J Epidemiol. 2018. [DOI] [PubMed]
  • 16.Ye M, Robson PJ, Eurich DT, Vena JE, Xu J, Johnson JA. Cohort profile: alberta’s tomorrow project. Int J Epidemiol. 2017;1097–8. [DOI] [PubMed]
  • 17.Dummer TJB, Awadalla P, Boileau C, Craig C, Fortier I, Goel V, et al. The Canadian partnership for tomorrow project: a pan-Canadian platform for research on chronic disease prevention. CMAJ. 2018;190:E710–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Costanian C, McCague H, Tamim H. Age at natural menopause and its associated factors in canada: cross-sectional analyses from the Canadian longitudinal study on aging. Menopause. 2018;25:265–72. [DOI] [PubMed] [Google Scholar]
  • 19.Azur MJ, Stuart EA, Frangakis C, Leaf PJ. Multiple imputation by chained equations: what is it and how does it work? Int J Methods Psychiatr Res. 2011;20:40–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Sterne JAC, White IR, Carlin JB, Spratt M, Royston P, Kenward MG, et al. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ. Br Med J Publishing Group. 2009;338:b2393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Fuhrer T, Heer D, Begemann B, Zamboni N. High-throughput, accurate mass metabolome profiling of cellular extracts by flow injection-time-of-flight mass spectrometry. Anal Chem. 2011;83:7074–80. [DOI] [PubMed] [Google Scholar]
  • 22.Wishart DS, Feunang YD, Marcu A, Guo AC, Liang K, Vázquez-Fresno R, et al. HMDB 4.0: the human metabolome database for 2018. Nucleic Acids Res. 2018;46:D608–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44:D457–462. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hastings J, Owen G, Dekker A, Ennis M, Kale N, Muthukrishnan V, et al. ChEBI in 2016: improved services and an expanding collection of metabolites. Nucleic Acids Res. 2016;44:D1214–1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kohl SM, Klein MS, Hochrein J, Oefner PJ, Spang R, Gronwald W. State-of-the Art data normalization methods improve NMR-based metabolomic analysis. Metabolomics. 2012;8:146–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Stanfill B, Reehl S, Bramer L, Nakayasu ES, Rich SS, Metz TO, et al. Extending classification algorithms to Case-Control studies. Biomed eng comput biol. Volume 10. SAGE Publications Ltd STM; 2019. p. 1179597219858954. [DOI] [PMC free article] [PubMed]
  • 27.Rubin DB. Multiple Imputation for Nonresponse in Surveys. 99th edition. New York: Wiley; 1987.
  • 28.Barker M, Rayens W. Partial least squares for discrimination. J Chemom. 2003;17:166–73. [Google Scholar]
  • 29.Rahman MHA, Gullick DR, Hoerner J, Bartlett MG. Determination of genotoxic impurities monomethyl sulfate and dimethyl sulfate in active pharmaceutical ingredients. Anal Methods Royal Soc Chem. 2017;9:1112–8. [Google Scholar]
  • 30.Lee ML, Later DW, Rollins DK, Eatough DJ, Hansen LD. Dimethyl and monomethyl sulfate: presence in coal fly Ash and airborne particulate matter. Volume 207. Science. American Association for the Advancement of Science; 1980. pp. 186–8. [DOI] [PubMed]
  • 31.Review of Exposure and Toxicity. Data for Phthalate Substitutes [Internet]. U.S. Consumer Product Safety Commission. [cited 2024 Dec 2]. Available from: https://www.cpsc.gov/content/Review-of-Exposure-and-Toxicity-Data-for-Phthalate-Substitutes
  • 32.Barupal DK, Fiehn O. Generating the blood exposome database using a comprehensive text mining and database fusion approach. Environ Health Perspect. 2019;127:97008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Roopashree PG, Shetty SS, Shetty VV, Nalilu SK. Medium-Chain fatty acids and breast cancer risk by receptor and pathological subtypes. Nutrients. 2022;14:5351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Hua M, Omaiye EE, Luo W, McWhirter KJ, Pankow JF, Talbot P. Identification of cytotoxic flavor chemicals in Top-Selling electronic cigarette refill fluids. Sci Rep Nat Publishing Group. 2019;9:2782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Human Metabolome Database [Internet]. [cited 2025 Apr 28]. Available from: https://hmdb.ca/
  • 36.Liu L, Hudgins WR, Shack S, Yin MQ, Samid D. Cinnamic acid: a natural product with potential use in cancer intervention. Int J Cancer. 1995;62:345–50. [DOI] [PubMed] [Google Scholar]
  • 37.Yazal T, Lee P-Y, Chen P-R, Chen I-C, Liu P-L, Chen Y-R, et al. Kurarinone exerts anti-inflammatory effect via reducing ROS production, suppressing NLRP3 inflammasome, and protecting against LPS-induced sepsis. Biomed Pharmacother. 2023;167:115619. [DOI] [PubMed] [Google Scholar]
  • 38.Wu Y, Xu J, Liu Y, Zeng Y, Wu G. A Review on Anti-Tumor Mechanisms of Coumarins. Front Oncol [Internet]. Frontiers; 2020 [cited 2024 Dec 1];10. Available from: https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2020.592853/full [DOI] [PMC free article] [PubMed]
  • 39.Jiang Y, Guo K, Wang P, Zhu Y, Huang J, Ruan S. The antitumor properties of atractylenolides: molecular mechanisms and signaling pathways. Biomed Pharmacother. 2022;155:113699. [DOI] [PubMed] [Google Scholar]
  • 40.Marra M, Camoni L, Visconti S, Fiorillo A, Evidente A. The surprising story of fusicoccin: A Wilt-Inducing phytotoxin, a tool in plant physiology and a 14-3-3-Targeted drug. Biomolecules. 2021;11:1393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Zinser GM, Tribble E, Valrance M, Urben CM, Knutson JC, Mazess RB, et al. 1,24(S)-dihydroxyvitamin D2, an endogenous vitamin D2 metabolite, inhibits growth of breast cancer cells and tumors. Anticancer Res. 2005;25:235–41. [PubMed] [Google Scholar]
  • 42.Lin Y, Yang Z, Li J, Sun Y, Zhang X, Qu Z, et al. Effects of glutamate and aspartate on prostate cancer and breast cancer: a Mendelian randomization study. BMC Genomics. 2022;23:213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Lee PJ, Woo SJ, Yoo HM, Cho N, Kim HP. Differential mechanism of ATP production occurs in response to succinylacetone in colon cancer cells. Molecules. 2019;24:3575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Verruck S, Balthazar CF, Rocha RS, Silva R, Esmerino EA, Pimentel TC, et al. Dairy foods and positive impact on the consumer’s health. Adv Food Nutr Res. 2019;89:95–164. [DOI] [PubMed] [Google Scholar]
  • 45.Matta M, Deubler E, Chajes V, Vozar B, Gunter MJ, Murphy N, et al. Circulating plasma phospholipid fatty acid levels and breast cancer risk in the cancer prevention Study-II nutrition cohort. Int J Cancer. 2022;151:2082–94. [DOI] [PubMed] [Google Scholar]
  • 46.Sala A, Proschak E, Steinhilber D, Rovati GE. Two-pronged approach to anti-inflammatory therapy through the modulation of the arachidonic acid cascade. Biochem Pharmacol. 2018;158:161–73. [DOI] [PubMed] [Google Scholar]
  • 47.Aro A, Männistö S, Salminen I, Ovaskainen ML, Kataja V, Uusitupa M. Inverse association between dietary and serum conjugated Linoleic acid and risk of breast cancer in postmenopausal women. Nutr Cancer. 2000;38:151–7. [DOI] [PubMed] [Google Scholar]
  • 48.Bradlow HL, Telang NT, Sepkovic DW, Osborne MP. 2-hydroxyestrone: the good Estrogen. J Endocrinol. 1996;150(Suppl):S259–265. [PubMed] [Google Scholar]
  • 49.Labrie F, Bélanger A, Bélanger P, Bérubé R, Martel C, Cusan L, et al. Androgen glucuronides, instead of testosterone, as the new markers of androgenic activity in women. J Steroid Biochem Mol Biol. 2006;99:182–8. [DOI] [PubMed] [Google Scholar]
  • 50.Houghton LC, Howland RE, Wei Y, Ma X, Kehm RD, Chung WK, et al. The steroid metabolome and breast cancer risk in women with a family history of breast cancer: the novel role of adrenal androgens and glucocorticoids. Cancer Epidemiol Biomarkers Prev. 2021;30:89–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Ruiz-Perez D, Guan H, Madhivanan P, Mathee K, Narasimhan G. So you think you can PLS-DA? BMC Bioinformatics. 2020;21:2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Uddin S, Khan A, Hossain ME, Moni MA. Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inf Decis Mak. 2019;19:281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Moriya T, Satomi Y, Kobayashi H. Metabolomics of postprandial plasma alterations: a comprehensive Japanese study. J Biochem. 2018;163:113–21. [DOI] [PubMed] [Google Scholar]
  • 54.Karimpour M, Surowiec I, Wu J, Gouveia-Figueira S, Pinto R, Trygg J, et al. Postprandial metabolomics: A pilot mass spectrometry and NMR study of the human plasma metabolome in response to a challenge meal. Anal Chim Acta. 2016;908:121–31. [DOI] [PubMed] [Google Scholar]
  • 55.Knox SS. From omics to complex disease: a systems biology approach to gene-environment interactions in cancer. Cancer Cell Int. 2010;10:11. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1 (197KB, xlsx)
Supplementary Material 2 (271.1KB, xlsx)

Data Availability Statement

Data utilized in this manuscript are available through data access applications to the BC Generations Project (https://bcgpresearch.ca/) and Alberta’s Tomorrow Project (https://myatpresearch.ca/).


Articles from Breast Cancer Research : BCR are provided here courtesy of BMC

RESOURCES