Abstract
This systematic review investigates the efficacy of metabolite biomarkers for risk assessment or early detection of colorectal cancer (CRC) and its precursors, focusing on pre-diagnostic biospecimens. Searches in PubMed, Web of Science, and SCOPUS through December 2023 identified relevant prospective studies. Relevant data were extracted, and the risk of bias was assessed with the QUADAS-2 tool. Among the 26 studies included, significant heterogeneity existed for case numbers, metabolite identification, and validation approaches. Thirteen studies evaluated individual metabolites, mainly lipids, while eleven studies derived metabolite panels, and two studies did both. Nine panels were internally validated, resulting in an area under the curve (AUC) ranging from 0.69 to 0.95 for CRC precursors and 0.72 to 1.0 for CRC. External validation was limited to one panel (AUC = 0.72). Metabolite panels and lipid-based biomarkers show promise for CRC risk assessment and early detection but require standardization and extensive validation for clinical use.
Subject terms: Predictive markers, Risk factors, Cancer screening
Introduction
Colorectal cancer (CRC) is the second leading cause of cancer-related death worldwide, with an estimated 1.9 million incident cases and 904,000 deaths in 20221. CRC often progresses slowly from precancerous to malignant neoplastic lesions, offering opportunities for prevention and enhanced prognosis by early detection and removal of precancerous lesions or detection and treatment of cancer at an earlier stage2. Various screening modalities have been developed for early detection of CRC and its precursors, including colonoscopy or fecal blood tests3. While colonoscopy is considered the gold standard for early detection of CRC and its precursors due to its high sensitivity and specificity, it is invasive, carries a risk of complications, and has low adherence4. Fecal blood tests are noninvasive but have limited sensitivity for early-stage CRC and precursors of CRC and are recommended every one to three years5–7. Despite the availability of these screening modalities, the development of further noninvasive methods with enhanced acceptability, accessibility, and performance would be highly desirable.
In recent years, metabolomics has emerged as a promising approach for cancer screening, including CRC. Metabolomics involves the systematic study of small molecule metabolites in biological fluids, cells, and tissues, and research on its potential application in the field of cancer biomarker discovery is rapidly expanding8,9. Previous studies using metabolomics have shown promise in differentiating individuals with and without CRC10. However, most studies have assessed metabolomics after CRC diagnosis and were carried out in clinical settings, which may limit their relevance for general population screening since it may reflect secondary changes in the metabolome after the onset of symptoms and diagnosis of CRC11. Studies conducted to identify and validate metabolite biomarkers for CRC risk based on pre-diagnostic biospecimens may help identify more effective and less invasive screening methods for CRC. Therefore, the aim of this systematic review is to evaluate the existing evidence on metabolite biomarkers for CRC or its precursors, which were identified in pre-diagnostic samples, such as in prospective cohorts or in a screening setting.
Results
Literature search result
The comprehensive literature search across the specified databases using the predefined search terms resulted in a total of 2,484 records. A detailed overview of the selection process is depicted in the PRISMA flow diagram shown in Fig. 1. After applying the eligibility criteria, 140 articles were chosen for an in-depth full-text review. Among these articles, 27 were excluded due to inadequate study design, 79 were excluded as the individuals were already diagnosed with CRC or a precursor at the time of biospecimen collection, five were excluded due to studied biospecimens others blood, urine, or stool, and three were excluded due to insufficient statistical data. The references of the studies excluded are listed in Supplementary Table 3. In the end, 26 studies focusing on the predictive performance of metabolite biomarkers, published up to December 30, 2023, were incorporated into this systematic review.
Fig. 1.
Preferred reporting items for systematic reviews and meta-analyses (PRISMA) flow diagram.
Study characteristics
Details on study characteristics are summarized in Table 1. The investigated outcomes comprised CRC in a total of 14 studies12–25, colon cancer in two studies26,27, adenomas in two studies28,29, polyps in four studies30–33, a combination of adenomas and polyps in two studies34,35, and a combination of adenomas and CRC in two studies36,37. The studies reviewed focused on individual metabolites (13 studies12–15,17–19,24,26,27,34,35,37) and metabolite panels (eleven studies20,22,23,25,28–33,36) for differentiating CRC or its precursors from controls. The studies reporting on individual metabolites utilized a variety of designs: two were screening trials35,37, six were nested case-control studies12,13,15,18,19,26, three were prospective cohort studies17,24,34, and two articles reported on both a cohort and a screening study14,27. Studies reporting on metabolite panels included nine screening studies22,23,25,29–33,36, one prospective cohort28, and one nested case-control study20. Additionally, two nested case-control studies investigated both individual metabolites and metabolite panels16,21.
Table 1.
Details of included studies reporting on the prediction of the presence or occurrence of CRC using metabolomics
| First author, Year ref. | Study type country | Study group | Time to diagnosisa (mean) | Population | Validation approach | |||
|---|---|---|---|---|---|---|---|---|
| N | Age (mean, SD) | Female (%) | IV | EV | ||||
| Individual Metabolites | ||||||||
| Cai (2006) 12 |
Nested case-control China |
CRC, CC, RC | 30 m | 150 | 60.3 (8.3) | 100 | - | - |
| Cn | - | 150 | 60.1 (8.5) | 100 | ||||
| Cross (2014) 13 |
Nested case-control USA |
CRC | 7.8 y med | 254 | 64.3 (5.1) | 44.1 | - | - |
| Cn | - | 254 | 64.3 (5.1) | 43.7 | ||||
| Kühn (2016) 15 |
Nested case-control Germany |
CRC | 6.57 y med | 163 | 55.8 (6.4) | 37.4 | - | - |
| Cn ♀ | - | 348 | 52.3 (7.1) | 100 | ||||
| Cn ♂ | - | 426 | 49.1 (8.5) | 0 | ||||
| Myte (2017) 18 |
Nested case-control Sweden |
CRC | 8.2 (4.7–11.9) y med, IQR | 613 | 59.3 (40.1–67.8) med, IQR | 59 |
Boot- strapping |
- |
| Cn | - | 1190 | 59.7 (40.0–67.8) med, IQR | 59 | ||||
| Pickens (2017) 35 |
Screening USA |
A | N/A | 37 | 58 (53–60) med, IQR | 0 | - | - |
| HPP | - | 20 | 58 (53–60) med, IQR | 0 | ||||
| Cn | - | 69 | 57 (53–61) med, IQR | 0 | ||||
| Geijsen (2019) 14 |
Prospective cohort/screening Germany and Austria |
CRC | N/R | 180 | 66.0 (58.0–73.0) med, IQR | 36.7 | - | Yes |
| Cn | - | 153 | 51.0 (42.0–63.0) med, IQR | 61.4 | ||||
| CRC (EV) | N/R | 88 | 70.0 (60.0–76.0) med, IQR | 31.8 | ||||
| Cn (EV) | - | 200 | 64.0 (57.0–74.0) med, IQR | 35.0 | ||||
| Kühn (2020) 26 |
Nested case-control Europe |
CC | N/R | 569 | 57.5 (36.7–74.3) med, range | 62.6 | - | - |
| Cn | - | 569 | 57.5 (36.7–74.3) med, range | 62.6 | ||||
| McCullough (2021) 17 |
Prospective cohort USA |
CRC | N/R | 517 | 70.2 (5.5) | 44.3 | - | - |
| Cn | - | 517 | 70.2 (5.5) | 44.3 | ||||
| Papadimitriou (2021) 27 |
Prospective cohort/screening Germany and Austria |
CC (ColoCare) | N/R | 110 | 65 (13) | 39 | - | Yes |
| Cn (ColoCare) | - | 153 | 51 (15) | 61 | ||||
| CC (CORSA) | N/R | 46 | 69 (14) | 28 | ||||
| Cn (CORSA) | - | 390 | 63 (13) | 35 | ||||
| CC (EPIC) | 6.6 (3.5) y | 456 | 56 (7.8) | N/R | ||||
| Cn (EPIC) | - | 456 | 56 (7.7) | N/R | ||||
| Tevini (2022) 37 |
Screening Austria |
CRC | N/A | 18 | 67 (12) | 38.9 | Split sampling | - |
| AA | - | 28 | 60 (10) | 50 | ||||
| Cn | - | 36 | 53 (8) | 50 | ||||
| CRC (IV) | N/A | 48 | 69 (10) | 35.4 | ||||
| Cn for CRC (IV) | - | 29 | 68 (7) | 89.7 | ||||
| AA (IV) | - | 48 | 66 (10) | 45.83 | ||||
| Cn for AA (IV) | - | 28 | 66 (5) | 0 | ||||
| Hang (2022) 34 |
Prospective cohort USA |
A | N/A | 586 | 53.6 (7.8) | 100 | ||
| Cn for A | - | 1141 | 53.8 (7.8) | 100 | ||||
| SP | N/A | 509 | 52.9 (7.5) | 100 | ||||
| Cn for SP | - | 993 | 53.1 (7.5) | 100 | ||||
| Pham (2022) 19 |
Nested case-control Europe |
CRC | 4.8 (2.7) y | 1,293 | 58.1 (7.0) | 52.7 | - | - |
| Cn | - | 1,293 | 58.1 (7.0) | 52.7 | ||||
| Vidman (2023) 24 |
Nested case-control Sweden |
CRC | 10.3 y | 902 | 56.2 (7.4) | 48.8 | Cross-validation | - |
| Cn | - | 902 | 56.2 (7.4) | 48.8 | ||||
| Metabolite panels | ||||||||
| Eisner (2013) 32 |
Screening Canada |
P | N/A | 355 | 58.9 (8.2) | 44.79 | Cross-validation | - |
| Cn | - | 633 | 56.2 (8.1) | 57.5 | ||||
| Wang (2014) 33 |
Screening Canada |
AP | N/A | 422 | 55.7 (0.4) | 41 | Split sampling | - |
| Cn | - | 162 | 59.1 (0.6) | 57 | ||||
| AP (IV) | N/A | 211 | 56.1 (0.6) | 38 | ||||
| Cn (IV) | - | 81 | 60.4 (0.8) | 58 | ||||
| Amiot (2015) 36 |
Screening France |
AA/CRC | N/A | 33 | 59.4 ( ± 6.9) med, IQR | 24 | Cross-validation | - |
| Cn | 22 | 52.0 ( ± 12.0) med, IQR | 32 | |||||
| Farshidfar (2016) 28 |
Prospective cohort Canada |
A | N/R | 31 | 59.5 (5.9) | 32 | Cross-validation | - |
| Cn | - | 31 | 60.5 (6.7) | 28 | ||||
| Deng (2017a) 30 |
Screening Canada |
AP | N/A | 155 | 59.9 (7.4) | 38.7 | Split sampling | - |
| Cn | - | 530 | 56.1 (8.2) | 58.1 | ||||
| Deng (2017b) 31 |
Screening China |
AP (EV) | N/A | 345 | 65.1 (6.6) | 43 | - | Yes |
| Cn (EV) | - | 316 | 61.8 (7.4) | 74 | ||||
| Troisi (2022) 23 |
Screening Italy |
CRC | N/A | 100 | 66.2 (11.3) | 36 | Split sampling | - |
| BCT | N/A | 50 | 62.8 (7.1) | 41 | ||||
| Cn | - | 50 | 61.6 (7.0) | 44 | ||||
| Rothwell (2022) 20 |
Nested case-control Europe |
CRC | 7.7 (4.4) y | 1,608 | 56.9 (7.5) | 45.4 | - | - |
| Cn | - | 1,608 | 56.8 (7.5) | 45.4 | ||||
| Telleria (2022) 22 |
Screening Spain |
CRC | N/A | 40 | 73.0 (11.3) | 50 | Split sampling | - |
| AA | - | 40 | 70.4 (10.8) | 50 | ||||
| Cn | - | 40 | 66.2 (14.1) | 50 | ||||
| Liu (2023) 29 |
Screening USA |
CTC | N/A | 23 | N/R | N/R | Cross-validation | - |
| Cn | - | 20 | N/R | 50 | ||||
| Xie (2023) 25 |
Screening China |
CRC | N/A | 35 | 57 (37–81) med, range | 45.7 | - | - |
| Cn | - | 30 | 45 (23‑67) med, range | 60.0 | ||||
| Individual metabolites & metabolite panels | ||||||||
| Shu (2018) 21 |
Nested case-control China |
CRC ♀ | N/R | 122 | 56.9 (8.4) | 100 | - | - |
| Cn ♀ | - | 122 | 57.0 (8.4) | 100 | ||||
| CRC ♂ | N/R | 123 | 56.2 (6.8) | 0 | ||||
| Cn ♂ | - | 123 | 56.5 (6.6) | 0 | ||||
| Loftfield (2022) 16 |
Nested case-control USA |
CRC ♀ | 10 y | 233 | 64.2 (5.3) | 100 | - | - |
| Cn ♀ | - | 233 | 64.1 (5.3) | 100 | ||||
| CRC ♂ | 10 y | 262 | 64.0 (5.0) | 0 | ||||
| Cn ♂ | - | 262 | 64.0 (5.1) | 0 | ||||
(A)A (advanced) adenoma, AP colonic adenomatous polyps, BCT benign colorectal tumor, Cn controls, CC colon cancer, CTC colonic tumor carriers, CRC colorectal cancer, SP serrated polyps, HPP hyperplastic polyps, P polyps, RC rectal cancer, SD standard deviation, med median, IQR interquartile range, y years, m months, N/A not applicable, N/R not reported, IV internal validation, EV external validation, ♀ female, ♂ male.
aonly applicable for cohort studies and for the outcome CRC/CC.
Besides four studies from China12,21,25,31, all studies were conducted in predominately white populations. Six studies were conducted in the United States13,16,17,29,34,35, four in Canada28,30,32,33, and 12 in European countries—five spanned several European countries14,19,20,26,27, and seven took place in single European countries, including Italy23, Sweden18,24, Spain22, France36, Austria37, and Germany15.
Two studies exclusively included females12,34 while one study focused solely on males35. The male to female proportion among cases varied across studies, with three reporting more female cases18,19,26, 18 reporting more male cases13–17,20,21,23–25,27,28,30–33,36,37, one reporting an equal proportion of males and females22, and one not specifying the sex distribution of participants29.
The number of CRC cases varied widely, ranging from 18 cases37 to 1608 cases20. For adenoma cases, the range was from 23 cases29 to 586 cases34, while for polyps, the range extended from of 20 cases35 to 355 cases32. Matching of cases and controls was employed in 13 studies, incorporating criteria such as age, sex, ethnicity, year of randomization, season of blood draw, recruitment time point, time period of endoscopy, fasting status, study cohort, smoking status, and menopausal status12–14,16–21,24,26–28,34.
The biospecimens utilized in the investigations included mainly blood (serum in seven studies13,16,19,23,28,29,37, plasma in ten studies14,15,17,18,21,24,26,27,34,35, combination of serum and plasma in one study20), urine in five studies12,30–33, and stool in three studies22,25,36. Technologies used for metabolomics analyses were mainly liquid chromatography–mass spectrometry (LC–MS), which was used by 9 studies12,16,17,22,24,26,27,30,34, and other mass-spectrometry (e.g., flow injection analysis–tandem mass spectrometry, isobaric labeling mass spectrometry)14,23,25,28,29, or a combination of mass spectrometry with a different technology13,15,18,20,21,37. Other techniques used were gas chromatography (GC)35, nuclear magnetic resonance (NMR)31–33,36, and ELISA assay19.
Validation techniques to address overoptimism
Validation efforts to mitigate overoptimism in model predictions were reported by 14 out of the 26 studies, with methodologies outlined in Table 1. These studies employed various validation techniques to enhance the reliability of their findings. Split-sampling method was utilized in five different studies22,23,30,33,37. More advanced techniques, including different types of cross-validation24,28,29,32,36 and bootstrapping18 were used by six studies. External validation was performed by three studies, two evaluated individual metabolites14,27, and one focused on a metabolite panel31.
Performance of individual metabolites and metabolite panels
Potential metabolite biomarkers for prediction or diagnosis of CRC were found in different biospecimen types (blood, urine, stool) and varied in their biochemical classes. Half of the included studies reported on the performance of individual metabolites (13 out of 26), eleven studies reported on a panel of metabolites, and two reported on the performance of individual metabolites as well as the performance of a panel. Table 2 shows the individual metabolite biomarkers for CRC and their precursors, identified by different analytical approaches. Six studies used an untargeted approach to discover the metabolites13,14,17,21,24,34, while the other nine studies used a targeted approach to measure predefined metabolites12,15,16,18,19,26,27,35,37. Three of the 15 studies reporting on individual metabolites did not find any significant associations between the metabolites studied and CRC13,15,19. The remaining twelve studies reported significant associations for a total of 101 metabolites (Fig. 2). Among the 59 metabolites inversely associated with CRC, two-thirds (n = 45, 76%) were lipids or lipid-like molecules. Organoheterocyclic compounds and organic acids and derivatives accounted for 10% (n = 6) and 7% (n = 4) of these inversely associated metabolites, respectively. Out of 42 identified individual metabolites positively associated with CRC, 28 (67%) were lipids and lipid-like molecules. The rest included organic acids and derivatives, organoheterocyclic compounds (each accounting for 14 and 12%, respectively). The remaining categories included nucleosides, nucleotides and their analogs, organic oxygen compounds, and benzenoids (each accounting for 2%, n = 1). While the lipids and lipid-like molecules with the positive association were mainly bile acids and fatty acylcarnitines, inverse associations were seen with alkylacyl-lysophosphatidylcholines, phosphatidylcholines, and sphingomyelins. Among the wide range of metabolites identified, only a select few appeared in more than one study. Specific plasma bile acids, including glycocholic acid, taurocholic acid, glycochenodeoxycholic acid, taurochenodeoxycholic acid, glycodeoxycholic acid, and taurodeoxycholic acid, were reported in two large cohort studies to be positively associated with CRC. These bile acids were noted by Kühn et al.26 in the EPIC cohort focusing on colon cancer, and by Loftfield et al. 16 in the PLCO cohort, with Loftfield et al.16 reporting these findings specifically in women. Similarly, amino acids such as valine and tryptophan were identified in multiple studies, though the direction of their associations with CRC varied. Tryptophan was positively associated with CRC in findings by Vidman et al.24, while two cohorts studied by Papadimitriou et al.27 showed a reverse trend. For the CRC precursors, significant inverse and positive associations were reported for four (i.e., C18:2-c linoleic acid, glycine, C36:3 phosphatidylcholine plasmalogen, and phenylacetylglutamine) and five metabolites (i.e., omega-6 polyunsaturated fatty acid, trans-fatty acid, methionine sulfoxide/methionine ratio, C18:1 sphingomyelin, and C54:8 triglyceride), respectively, of which two and four metabolites belonged to the group of lipids and lipid-like molecules. The three other metabolites belonged to the group of organic acids and derivatives.
Table 2.
Individual metabolites associated with the presence or occurrence of CRC in blood, urine, and stool samples
| First author Year | Platform | Biospecimen | Number of metabolites identified/ Metabolite identification approachc | Outcome | Associated metabolites with outcome a | |
|---|---|---|---|---|---|---|
| Inverse association | Positive association | |||||
| Screening | ||||||
| Pickens (2017) 35 | GC | Plasma |
24 fatty acids |
A ♂ | - |
ω−6 polyunsaturated fatty acid Trans-fatty acid |
| HPP ♂ | C18:2-c linoleic acid | - | ||||
| Tevini (2022) 37 | FIA and LC-MS/MS | Serum |
188 AbsoluteIDQ® p180 kit |
AA | Glycine |
methionine sulfoxide/methionine ratio SM C18:1 |
| CRC |
Glycerophospholipids (LysoPC a C17:0, LysoPC a C18:0, LysoPC a C18:2, LysoPC a C26:0, LysoPC a C28:0, LysoPC a C28:1, PC aa C28:1, PC aa C30:0, PC aa C32:2, PC aa C32:3, PC aa C34:3, PC aa C34:4, PC aa C36:2, PC aa C36:6, PC aa C38:0, PC aa C38:1, PC aa C42:6, PC ae C30:0, PC ae C34:0, PC ae C34:2, PC ae C34:3, PC ae C36:1, PC ae C36:2, PC ae C36:3, PC ae C38:0, PC ae C38:3, PC ae C40:1, PC ae C40:6) Sphingomyelins (SM (OH) C22:1,SM (OH) C22:2, SM (OH) C24:1, SM C16:1) Histidine Total AC-DC/Total AC Total PC ae Total SM (OH) Total SM (OH)/ total SM (non-OH) |
Acylcarnitines (C7-DC, C12, C12:1, C14:1, C16:2, C18:1) | ||||
| Cohorts | ||||||
| Hang (2022) 34 | LC–MS | Plasma |
207 Untargeted |
A ♀ | C36:3 phosphatidylcholine plasmalogen | |
| SP ♀ | Phenylacetylglutamine | C54:8 triglyceride | ||||
| Kühn (2020) 26 | LC–MS | Plasma |
17 Bile acids |
CC |
Glycocholic acid Taurocholic acid Glycochenodeoxycholic acid Taurochenodeoxycholic acid Glycohyocholic acid Glycodeoxycholic acid Taurodeoxycholic acid |
|
| Papadimitriou (2021) 27 | LC–MS | Plasma |
3 Tryptophan metabolites |
CC |
Tryptophan Kynurenine |
Kynurenine Serotinin Kynurenine−to−tryptophan ratio |
| Cai (2006) 12 | LC–MS | Urine |
1 Prostaglandin E2 Metabolite (PGE-M) |
CRC, CC, RC ♀ | PGE-M | |
| Cross (2014) 13 | LC–MS and GC-MS | Serum |
278 Untargeted |
CRC | - b | |
| Kühn (2016) 15 | LC-MS/MS and FIA-MS/MS | Plasma |
120 MetaDisIDQTM Kit |
CRC | - b | |
| Myte (2017) 18 | LC-MS/MS and GC-MS, Lactobacillus casei and Lactobacillus leichmannii | Plasma |
14 One-carbon metabolites |
CRC |
Riboflavin Ppyridoxal 5-phosphate |
|
| Shu (2018) 21 | UPLC-QTOFMS and GC-TOFMS | Plasma |
167 Untargeted |
CRC |
2-methyl-4-phenyl-2-butyl 2-methylpropanoate PE(20:0/18:2) PC(22:6/18:0) Ethyl 4-(methylthio)butyrate PE(p-16:0/20:4) 5,6–8,9-diepoxyergost-22-ene-3,7beta-diol |
Picolinic acid Selenocystine 2,3-epoxymenaquinone |
| Geijsen (2019) 14 | UHPLC-QTOF-MS | Plasma |
28 Untargeted |
CRC |
LysoPC(16:1) LysoPC(P-16:0) LysoPC(15:0) LysoPC(16:0) LysoPC(16:0) isomer LysoPC(17:0) LysoPC(18:0) Leucine Valine Bilirubin 1-Methylnicotinamide |
LysoPE(20:4) LysoPE(22:6) Taurine Hypoxanthine |
| McCullough (2021) 17 | LC-MS/MS | Plasma |
886 Untargeted |
CRC | 3-methylxanthine |
Guanidinoacetate Vanillylmandelate 2’-O-methylcytidine Bilirubin (E-E) N-palmitoylglycine |
| Loftfield (2022) 16 | LC-MS/MS | Serum |
21 Bile acids and short-chain fatty acids |
CRC ♀ | - |
Glycochenodeoxycholic acid Taurochenodeoxycholic acid Glycocholic acid Taurocholic acid Deoxycholic acid Glycodeoxycholic acid Taurodeoxycholic acid Glycholithocholic acid Taurolithocholic acid |
| CRC ♂ | Cholic acid | - | ||||
| Pham (2022) 19 | ELISA assays | Serum |
1 Resistin |
CRC | - b | |
| Vidman (2023) 24 | LC–MS | Plasma |
5015 Untargeted |
CRC |
Sebacic acid Pyroglutamic acid Hydroxytigecycline |
9,12,13-TriHOME 13-OxoODE Valine 3-hydroxybutyric acid l-tryptophan |
GC gas chromatography, LC-MS/MS liquid chromatography–mass spectrometry/liquid chromatography/ tandem mass spectrometry, FIA flow injection analysis, GC-MS gas chromatography–mass spectrometry, FIA-MS/MS flow injection analysis–tandem mass spectrometry, UPLC-QTOFMS ultra-performance liquid chromatography quadrupole-time-of-flight mass spectrometry, GC-TOFMS gas chromatography time-of-flight mass spectrometry, UHPLC-QTOF-MS ultra-high chromatography- quadrupole-time-of-flight mass spectrometry, AC acylcarnitine, LysoPC monoacyl-glycerophosphocholine, PC aa diacyl-glycerophosphocholine, PC ae alky-acyl-glycerophosphocholine, SM sphingomyelin, AC acylcarnitine, (A)A (advanced) adenoma, AP clonic adenomatous polyps, Cn controls, CC, colon cancer, RC rectal cancer, CRC colorectal cancer, SP serrated polyps, HPP hyperplastic polyps, ♀ female, ♂ male.
aIncludes only named metabolites.
bNo significant associations with metabolites identified (after correction for multiple testing).
cDescribes the metabolite identification method used: targeted groups, untargeted approaches, or specific commercial panels.
Fig. 2. Associations between individual metabolites and colorectal cancer risk, categorized by direction of association.
A Inversely associated metabolites with colorectal cancer risk. B Positively associated metabolites with colorectal cancer risk. Note: metabolites are grouped by Super Class from the Human Metabolome Database. Metabolites reportedas ratios are excluded.
Out of 15 studies that examined metabolites individually, only three conducted internal validation18,24,37 and two performed external validation14,27. Papadimitriou et al. 27 examined three metabolites of tryptophan in three different samples. However, they found inconsistent directions of association for two of the metabolites, tryptophan and kynurenine, and their ratio, between the three studies. Geijsen et al. 14 applied an untargeted approach and identified 15 metabolites that differed significantly between cases and controls of CRC in both their discovery and replication sets. However, whether these metabolites were of predictive or prognostic value was not identified. Except for the studies by Tevini et al. 37 and Cai et al. 12, all the other studies that investigated metabolites individually adjusted for several covariates in their analyses, such as age, smoking status, or BMI (see Supplementary Table 5).
Tables 3 and 4 present the metabolite biomarker panels developed for detection CRC and its precursors. Among the 13 studies that reported on these panels, one conducted an external validation31, while eight performed internal validations22,23,28–30,32,33,36. The remaining four studies did not conduct any form of validation16,20,21,25.
Table 3.
Performance characteristics of metabolite panels to predict the presence or occurrence of CRC in blood biospecimen
| First author Year | Biospecimen | Platform | Outcome | Metabolite panel | Performance | |||
|---|---|---|---|---|---|---|---|---|
| OR (95% CI) | AUC | Sensitivity (%) | Specificity (%) | |||||
| Screening | ||||||||
| Liu (2023) 29 | Serum | IL-MS | A | Glutamine, Threonine | 0.83d | |||
| Asparagine, Glutamine, Threonine | 0.85d | |||||||
| Arginine, Asparagine, Glutamine, Threonine | 0.87d | |||||||
| Troisi (2022) 23 | Serum | GC-MS | CRC | Acetic, Androstenedione, Aspartic, Estradiol, Fructose, Glucose, Glutamine, Guanine, Hydroxylamine, Isoleucine, Lactose, Myristic, Nicotinic, Norepinephrine, Oxalic, Oxoglutaric, Oxoproline, Propionic, Pyrocatechol, Pyruvic, Quinolinic, Tartaric, Tetra, Threonine, Urea, Valine | 1.0d | 100d | 100d | |
| Cohorts | ||||||||
| Farshidfar (2016) 28 | Serum | FIA-MS/MS | A | Decenoylcarnitine, Dodecenoylcarnitine, Hexadecadienylcarnitine, Hydroxytetradecenoylcarnitine, lysoPhosphatidylcholine acyl C17:0, Phosphatidylcholine acly-alkyl C40:2, Proline, Tetradecadienylcarnitine, Tryptophan | 0.82d | |||
| Shu (2018) 21 | Plasma | UPLC-QTOFMS and GC-TOFMS | CRC | 2,3-epoxymenaquinone, 2-methyl-4-phenyl-2-butyl 2-methylpropanoate, 5,6:8,9-diepoxyergost-22-ene-3,7beta-diol, Ethyl 4-(methylthio)butyrate, PC(22:6/18:0), PE(20:0/18:2), PE(p-16:0/20:4), Picolinic acid, Selenocystine | 0.76 | |||
| Loftfield (2022) 16 | Serum | LC-MS/MS | CRC ♀ | Chenodeoxycholic acid, Cholic acid, Deoxycholic acid, Glycochenodeoxycholic acid, Glycocholic acid, Glycodeoxycholic acid, Glycolithocholic acid, Glycoursodeoxycholic acid, Lithocholic acid, Taurochenodeoxycholic acid, Taurocholic acid, Taurodeoxycholic acid, Taurolithocholic acid, Ursodeoxycholic acid | 1.95 (1.04, 3.66)a | |||
| Acetic acid, Butyric acid, Hexanoic acid, Isobutyric acid, Isovaleric acid, Propionic acid | 0.55 (0.31, 0.98)a | |||||||
| Rothwell (2022) 20 | Serum and plasma | GC and LC-MS/MS | CRC | 2:1n-9, 15:0, 15:01, 16:00, 16:1n-7/n-9, 17:0, 18:1n-9c, 20:3n-9, 22:5n-6 | 0.51 (0.29, 0.90)b | |||
| CC | 0.53 (0.29, 0.97)b | |||||||
| CRC | Glycine, Glutamate, lysoPC a C17:0, lysoPC a C18:2, PC aa C32:1, PC aa C34:4, PC aa C36:4, PC aa C38:4, PC aa C40:4, PC ae C36:2, PC ae C38:2, PC ae C38:3, PC ae C40:6, Serine | 0.62 (0.50, 0.78)c | ||||||
| CC | 0.65 (0.50, 0.84)c | |||||||
| RC | 0.44 (0.25, 0.79)c | |||||||
LC-MS/MS liquid chromatography–mass spectrometry, GC-MS gas chromatography–mass spectrometry, FIA-MS/MS flow injection analysis–tandem mass spectrometry, IL-MS isobaric labeling mass spectrometry, GC-TOFMS gas chromatography time-of-flight mass spectrometry, UPLC-QTOFMS ultra-performance liquid chromatography quadrupole-time-of-flight mass spectrometry, OR odds ratio, CI confidence interval, AUC area under the curve, A adenoma, Cn Controls, CC colon cancer, CRC colorectal cancer, ♀ female, ♂ male.
aOR comparing highest versus lowest quartile;
bOR per unit increase;
cOR per unit change;
dInternally validated results.
Table 4.
Performance characteristics of metabolite panels to predict the presence or occurrence of CRC in stool and urine biospecimen
| First author Year | Biospecimen | Platform | Outcome | Metabolite panel | Performance | ||
|---|---|---|---|---|---|---|---|
| AUC | Sensitivity (%) | Specificity (%) | |||||
| Screening | |||||||
| Wang (2014) 33 | Urine | NMR | AP | 2-Oxoglutarate, 3-Hydroxybutyrate, 3-Hydroxyphenylacetate, 3-Hydroxymandelate, Acetone, Adipate, Asparagine, b-Alanine, Benzoate, Butyrate, Ethanol, Histidine, Methanol, p-Methylhistidine, Serine, Trigonelline, Tyrosin | - | 82.7b | 51.2b |
| Deng (2017a) 30 | Urine | LC-MS/MS | AP | Ascorbic acid, Carnitine, Succinic Acid | 0.69b | 82.4b,c | 36.0b,c |
| Deng (2017b) 31 | Urine | NMR | AP | Ascorbic acid, Carnitine, Succinic Acid | 0.72b | 82.6b | 42.4b |
| Eisner (2013) 32 | Urine | 1H-NMR | P | Acetone, Methanol, Trigonelline, Tyrosine | 0.72a | 64.0a | 65.0a |
| Telleria (2022) 22 | Stool | UPLC-MS/MS | AA | Bilirubin E,E, Glycocholenate sulfate, Lactosyl-N-palmitoyl-sphingosine, STLVT | 0.95b | 70.0b | 100b |
| Amiot (2015) 36 | Stool | 1H-NMR |
AA/ CRC |
Valerate, Butyrate, Propionate, Acetate, Glutamate, Glutamine, ß-Glucose | 0.94b | ||
| Xie (2023) 25 | Stool | UPLC‑MS/MS | CRC |
9,10‑dihydroxy‑12‑octadecenoic acid, cholesterol ester (18:2), lipoxinA4 |
0.97 | ||
NMR nuclear magnetic resonance spectrometry, LC-MS/MS liquid chromatography–mass spectrometry, 1H-NMR proton nuclear magnetic resonance, UPLC-MS/MS ultra-high-performance liquid chromatography–tandem mass spectroscopy, AUC area under the curve, AA advanced adenoma, AP colonic adenomatous polyps, Cn controls, CRC colorectal cancer, P polyps.
aExternally validated results,
bInternally validated results,
cDifferent sensitivities and specificities available according to threshold criteria in the publication.
Table 3 displays the efficacy of blood-based biomarker panels, with the most effective panel achieving an AUC of 1.0, and 100% sensitivity and specificity23. This panel consisted of 26 metabolites and used a machine-learning approach.
Table 4 shows the metabolite biomarkers from stool and urine samples. It includes three studies that analyzed stool sample panels, reporting AUCs of 0.9522, 0.9436, and 0.9725, with the latter not performing any kind of validation. The panel by Telleria et al. further reported a sensitivity of 70% and specificity of 100%, using four metabolites and levels of hemoglobin to discriminate between cases and non-cases of advanced adenoma22. For the panels of metabolites based on urine, all studies performed either internal or external validation. The study by Wang et al. 33 showed the highest AUC of 0.752, along with sensitivity of 88.9% and specificity of 50.2% for a panel composed of 18 metabolites to distinguish between polyp cases and non-cases. The internal validation confirmed these results, with a sensitivity of 82.7% and a specificity of 51.2%. Deng et al. 31 conducted an external validation of a urine-based diagnostic panel for the detection of adenomatous polyps, that was originally developed and validated using n = 1000 samples from a Canadian cohort30. The external validation in the Chinese cohort yielded an AUC of 0.72, a sensitivity of 82.6%, and a specificity of 42.4%. The panel consisted of four metabolites in combination with information on the age, sex, and smoking status of the participants.
Quality assessment of diagnostic accuracy studies
In this study, we utilized the QUADAS-2 tool to evaluate the risk of bias and applicability concerns. Detailed results are provided in Supplementary Table 4. For the “patient selection” domain, two studies were identified with a high risk of bias due to small case numbers and large significant differences between cases and controls, while 16 indicated a low risk, and eight were unclear. In the ‘index test’ domain, the risk of bias was low in ten studies, unclear in 15, and high in one. Similarly, for the ‘reference standard’ domain, the risk assessment showed 16 studies with low risk, ten with unclear risk, and none with high risk. The unclear risk assessments in the “index test” and “reference standard” domains were partly due to the absence of information in some studies about the independent execution of metabolite tests and their comparison counterparts, such as colonoscopies. In the “flow and timing” domain, eight studies were assessed as low risk, eight as high risk, and ten as unclear. Predominantly, the studies were highly applicable, a result of our focused method in selecting articles pertinent to CRC or its early predictors. Nonetheless, we observed significant applicability issues in “patient selection” for ten studies, mainly because of missing internal or external validation and a narrow demographic focus. There were no applicability concerns for the “index test” and predominantly no in “reference standard” domains, as these tests align with our review question.
Discussion
In the present systematic review, we identified 26 studies focusing on metabolite biomarkers for the prediction of the occurrence or presence of CRC or its precursors. These studies contributed valuable insights into metabolomics within the context of CRC screening trials and prospective cohort studies. Lipids and lipid-like molecules emerged as the most frequently investigated metabolites across various biospecimens, offering the potential for CRC and its precursors prediction in the context of CRC screening or risk assessment. However, the heterogeneity in data analysis methodologies and result reporting hindered a unified interpretation and precluded a meta-analytic approach. Specifically, this variability in the use of different metabolite panels, statistical models, and validation techniques limits comparability and introduces challenges in synthesizing data across studies. Therefore, a descriptive presentation of findings was conducted. Additionally, most studies showed a lack of robust validation for their biomarker panels, often only performing internal validation, which questions the generalizability of the findings. The small sample sizes in several studies further constrained the statistical power, increasing the risk of erroneous results. A notable geographical bias toward white and Asian populations was also observed, affecting the applicability of findings to other ethnic groups. While individual studies displayed advancements in metabolomics profiling, the absence of consistent validation across studies underscores the need for standardized methodological frameworks in future research.
The comparison between individual metabolites and metabolite panels reveals a notable pattern, suggesting that the latter holds superior potential as a screening tool or risk assessment tool for CRC screening. Three out of 15 studies13,15,19 based on individual metabolites did not find any significant associations after correcting for multiple testing. In contrast, studies examining metabolite panels consistently demonstrated good to very good predictive or diagnostic abilities. This observation, supported by a systematic review incorporating also post-diagnostic metabolite samples11, suggests that metabolite panels may possess the capacity to better reflect the complexity of biological systems, address disease heterogeneity, and offer synergistic insights into collective metabolic alterations associated with CRC development, unlike individual metabolites.
Notably, a range of panels have yielded exceptionally high AUC values between 0.76 and an optimal 1.0 for CRC detection or prediction, with AUCs exceeding 0.83 for early indicators of CRC, with some consisting of merely two metabolites29, while others included up to 26 metabolites23. However, these high-performance panels, in some instances, were evaluated in studies utilizing relatively small sample sizes of fewer than 50 cases25,28,29,36,37 and were only examined in a single population. While more than half of the studies implemented internal validation, predominantly using split-sampling methods for model testing, only three studies undertook external validation14,27,31. These studies revealed varied outcomes: certain metabolites displayed unreliable or minimal correlations with CRC in diverse populations, whereas others achieved results on par with current stool tests. While Gejisen et al. 14 replicated their untargeted approach findings, revealing 15 metabolites significantly associated with CRC in two European cohorts, Papadimitriou et al. 27 reported inconsistent associations between tryptophan metabolism-linked metabolites and colon cancer across cohorts. Deng et al. achieved comparable metabolite test performance in the studied Chinese cohort to the original Canadian cohort in which the metabolite panel was developed30,31. While this panel exhibited increased sensitivity, its specificity was somewhat lower compared to well-established fecal blood tests that have specificities for advanced adenomas ranging from 0.90 to 0.9538. These varied outcomes point to a significant challenge in the field of metabolite biomarker research, emphasizing the critical need for thorough independent validation39. The importance of such validation is heightened by the fact that metabolite stability can differ over time and with various sample collection methods40. Thorough independent validation is essential to mitigate the risk of overestimating predictive capabilities, often referred to as the “winner’s curse”, where models may appear highly predictive in initial derivation but fail to perform as well in subsequent applications. Internal validation helps address this by proper evaluation of the model within the same dataset (e.g., by a split sample or cross-validation approaches), reducing the likelihood of overfitting. External validation not only confirms the robustness of these findings but also identifies potential limitations in different demographic or clinical settings, ensuring that the predictive models can be reliably applied in various real-world scenarios.
Several metabolic pathways, including glycolysis, glutaminolysis, oxidative phosphorylation, and lipid metabolism41, appear to be altered during the cancer state. Notably, lipid metabolism stands out, as lipids and lipid-like molecules frequently emerge as the most altered metabolites in CRC risk prediction. Among these, two studies identified elevated levels of plasma bile acids, including glycocholic acid, taurocholic acid, glycochenodeoxycholic acid, taurochenodeoxycholic acid, glycodeoxycholic acid, and taurodeoxycholic acid, to be positively associated with CRC16,26. These bile acids may contribute to carcinogenesis through their roles in disrupting cell signaling pathways, promoting inflammation, and inducing DNA damage in colorectal epithelial cells42,43. Additionally, bile acids can activate nuclear receptors, which are involved in lipid metabolism, cellular proliferation, and apoptosis regulation42. This may reflect their vital roles in cellular functions essential for cancer development, such as cell membrane integrity, energy storage, and signaling44,45. Additionally, the prevalence of lipids in these findings could also be influenced by their prominence in commercially available metabolomics kits and the specific research focus on these molecules, which may skew the observed metabolic alterations toward lipid-related pathways. Further, the precise timing of these metabolic changes remains unclear, underscoring a significant area for future research to explore the temporality of metabolite biomarker alterations in the context of cancer progression. Research from screening trials and nested case-control studies within prospective cohorts provides a unique opportunity to investigate the temporality of metabolite biomarker performance. In nested case-control studies and prospective cohort studies, where samples are collected on average several years before diagnosis, risk-predictive biomarkers gain importance. For example, these biomarkers hold the potential for application in individuals before the starting age for screening, facilitating risk assessment, and the development of more refined risk prediction algorithms. Current risk-prediction algorithms, incorporating factors such as age, family history, genetic risk factors, and lifestyle factors, show promise but require further improvement46. Conversely, metabolite biomarkers identified in screening trials, shortly before the diagnosis of CRC or its precursors, may provide valuable insights for refining and optimizing diagnostic strategies, leveraging the screening trials’ capability to capture biomarkers indicative of the imminent occurrence of CRC.
Consideration should also be given to the temporal aspect related to the stage of colorectal carcinogenesis examined in the selected studies. Metabolite profiles may exhibit distinct patterns at various stages of CRC progression, with specific metabolites associated with aggressive tumor characteristics being more pronounced in CRC compared to adenomas or polyps47. Recognizing and leveraging these nuanced metabolic panels could enhance the accuracy of metabolite-based diagnostics, enabling more precise differentiation between CRC, adenomas, and polyps.
Metabolites, integral to the phenotype, are extractable from diverse biospecimens, including blood, urine, and stool, with blood and urine being the most common choices in the examined studies. The results based on different biospecimens are only partly comparable. Notably, negative correlations have been observed between metabolite concentrations in stool and urine samples, whereas positive correlations exist between blood and urine, as well as blood and stool metabolite concentrations48. Tumor-related detection of metabolites in blood samples, which are routinely collected in medical practice, exhibits challenges with indirect tumor analysis and potential analyte dilution from leaked cells49. Conversely, metabolites derived from urine and stool samples show promise in capturing CRC-related metabolic perturbations, potentially reflecting the tumor microenvironment50. In contrast to the complexity of blood analysis, the simplicity of urine and stool provides unique advantages. However, variations in metabolite concentrations due to circadian rhythm and diet necessitate standardizing collection time and controlling for nutrient consumption patterns51. Especially concentrations of fatty acids, lipids, and amino acids are known to show circadian variation52. Additionally, metabolite concentrations depend on whether a person is fasting or has recently eaten, with decreases in acylcarnitine and triglycerides and increases in amino acids and glucose-related metabolites after a meal52.
The inclusion of various sets of covariates adds to the complexity of comparing the performance of different individual metabolites and metabolite panels across the studies. Age, sex, and various clinical variables were included as covariates in the models, with age and sex being the most frequently integrated factors. However, many metabolites are affected by lifestyle and nutritional factors and are subject to temporal variation caused by such factors53,54. Standardized conditions of sample collection, along with careful ascertainment of potential non-tumor related determinants is crucial for establishing potential use of metabolomics in CRC risk assessment or early detection55.
Metabolite identification is subject to significant variation due to the varied use of analytical techniques, technical implementation, and the use of various techniques of data analyses across the included studies. The choice of analytical techniques, such as NMR, GC-MS, and LC–MS, introduces distinctive approaches to metabolite identification. NMR, as the most popular option, offers the possibility to detect a wide range of metabolites, while alternative methods like ELISA assays offer enhanced flexibility, demonstrating the diverse spectrum of tools available. Technical factors also play a crucial role in the variation of the metabolite identification. The time and temperature of sample collection and freezing can significantly influence outcomes. Standardizing protocols for sample collection, pre-analytical handling, and storage conditions is essential to minimize variations, ensuring reproducibility in metabolomics research55. Likewise, initiatives to standardize metabolomics analyses are crucial in this regard, as they aim to establish consistent protocols across studies55,56. These include guidelines for study design, sample processing, and data reporting, which are necessary to reduce inconsistencies and improve the comparability of results across different laboratories and studies55.
In parallel, the integration of various techniques of statistical analysis, exemplified by the LASSO algorithm and Bayesian network in the included studies18,23,30,32, introduces another layer of complexity. These techniques prove valuable in identifying metabolites that differentiate between CRC or precursor cases and controls. The combination of metabolomics and machine learning offers an alternative to traditional statistical methods, particularly for addressing the challenges presented by non-linear biological data57.
The direct comparison of the results obtained for the identified metabolite panels and for the individual metabolites is complicated by a variety of factors, such as differing analytical methods and technical considerations. The potential introduction of metabolomics testing in clinical practice should be accompanied by careful evaluation of cost-effectiveness studies. So far, cost-effectiveness studies have been very limited. One such study concluded that implementing urine-based metabolomics tests, such as those from Deng et al. 30,31, might be a cost-effective strategy in programmatic CRC screening programs58. Therefore, the translation of these findings into clinical practice is not imminent, highlighting the need for careful consideration of the complex intricacies involved.
A strength of our review is its sole focus on studies where biospecimens were collected before diagnosis of CRC or CRC precursors, differentiating it from most metabolomics research based on samples collected after diagnosis, whose relevance for early detection remains uncertain. Additionally, the review’s comprehensive approach, covering a broad spectrum of metabolite biomarkers in various biospecimens, improves our understanding of CRC metabolomics, potentially unlocking new insights into CRC prediction and risk assessment.
Limitations in the interpretation and implementation of metabolomics studies pose challenges. A major concern is the lack of standardization, with efforts from initiatives aimed at establishing standardized protocols from study design to sample collection and preparation55. This lack of standardization may hinder the comparability of studies included in this systematic review. While the review provides a narrative summary, it does not include a meta-analysis due to the heterogeneity of the studies. This decision, while justified in light of the lack of standardization, means that the review does not offer a quantitative synthesis of the data, which could potentially yield more definitive conclusions. Furthermore, this systematic review faces potential challenges beyond those inherent to the included studies, such as publication bias, and the variability and sometimes insufficient detail in the data reported by the individual study publications.
This systematic review emphasizes the significant potential of metabolite panels, particularly those that focus on lipids, in improving CRC prediction and risk assessment, outperforming the accuracy of individual metabolites. These panels, based on metabolites derived from blood, urine, and stool samples, have the potential to enhance CRC screening by enabling accurate risk assessment, thereby optimizing resource allocation, and identifying individuals at high risk. However, the variation in analytical methods and the lack of a standardized validation process underscore the need for methodological harmonization. By standardizing techniques, ensuring thorough validation, and examining metabolic variations at different CRC stages, metabolomics might have the potential to be effectively incorporated into clinical practice, potentially transforming CRC screening strategies to align with the emerging focus on personalized and precision medicine.
Methods
Our systematic review was conducted following a pre-registered study protocol with PROSPERO (registration number: CRD42023425862). Any modifications made during the review process were documented in PROSPERO to ensure transparency and consistency. Additionally, we adhered to the standardized methodology guidelines outlined in the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) (Supplementary Table 1)59.
Search strategy
Our systematic literature search aimed to identify relevant studies focusing on metabolite biomarkers in noninvasive (urine, stool) or minimally invasive (blood) biospecimens analyzed in pre-diagnostic settings, concentrating on CRC or its precursors. The search was conducted on December 30, 2023, across three electronic databases, including PubMed, Web of Science, and Scopus. The search terms employed consisted of “metabolomics”, “pre-diagnostic biomarker”, and “colorectal cancer” along with associated terms. Details regarding the employed terms for each database are available in Supplementary Table 2.
Study selection
In our selection process, we considered articles on studies conducted in a screening context that involved the measurement of metabolomics in biospecimens (blood, urine, or stool) taken before a diagnosis of CRC or its precursors. Additionally, we included articles based on prospective cohort studies in which metabolomics measurements were obtained from biospecimens collected at baseline. The primary outcome of interest encompassed CRC, its anatomic subsites (rectal or colon cancer), or precursors such as adenomas or polyps. Letters, editorials, comments, news articles, or articles published in languages other than English were not included. Records unrelated to our review question, such as those focusing on different cancer types or biospecimen collection after diagnosis, were also excluded. We furthermore excluded records that lacked sufficient statistical data or did not report on the diagnostic or predictive performance of metabolite biomarkers.
Data extraction and evaluation of study quality
Data extraction was performed independently by two authors, TS and CF. To ensure precision and reliability, any initial discrepancies were resolved through consensus after a thorough review and discussion. Information extracted from each study included publication details (e.g., first author, publication year), population characteristics (country, study design, study setting, sample size, mean or median age of participants, and proportion of female participants), sample characteristics (type of biospecimen, technique used for metabolomics analysis, and the specific metabolites evaluated), as well as effect measures, statistical methods, and study results, such as the diagnostic or predictive performance of the studied metabolite biomarkers.
The methodological quality of each record was independently assessed by two investigators, TS and CF, using the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) tool60. Initial disagreements were resolved through consensus after further review and discussion. The assessment of risk of bias included four domains: “patient selection,” “index test,” “reference standard,” and “flow and timing,” and the evaluation of applicability comprised three domains: “patient selection”, “index test”, and “reference standard”. The risk of bias and applicability assessment for each study was rated as “high risk/concern,” “low risk/concern,” or “unclear risk/concern” based on the QUADAS-2 signaling questions60.
Supplementary information
Acknowledgements
This work was supported by the German Federal Ministry of Education and Research (No. 01KD2104A). The sponsor had no role in the study design; the collection, analysis, and interpretation of data; the writing of the report; or the decision to submit the article for publication.
Author contributions
The author’s responsibilities were as contributions: H.B. designed and supervised the study; T.S. carried out the literature research and drafted the manuscript; T.S. and C.F. extracted data from eligible studies. C.F., R.C., M.B., M.H., and H.B. critically reviewed, contributed to, and approved the final manuscript.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Data availability
All data generated and analyzed during this study are included in the article and its supplementary information files.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41698-024-00732-5.
References
- 1.Bray, F. et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin.74, 229–263 (2024). [DOI] [PubMed]
- 2.Dekker, E., Tanis, P. J., Vleugels, J. L. A., Kasi, P. M. & Wallace, M. B. Colorectal cancer. Lancet394, 1467–1480 (2019). [DOI] [PubMed] [Google Scholar]
- 3.Shaukat, A. & Levin, T. R. Current and future colorectal cancer screening strategies. Nat. Rev. Gastroenterol. Hepatol.19, 521–531 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Brenner, H., Stock, C. & Hoffmeister, M. Effect of screening sigmoidoscopy and screening colonoscopy on colorectal cancer incidence and mortality: systematic review and meta-analysis of randomised controlled trials and observational studies. BMJ348, g2467 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Maida, M. et al. Screening of colorectal cancer: present and future. Expert Rev. Anticancer Ther.17, 1131–1146 (2017). [DOI] [PubMed] [Google Scholar]
- 6.Niedermaier, T., Balavarca, Y. & Brenner, H. Stage-specific sensitivity of fecal immunochemical tests for detecting colorectal cancer: systematic review and meta-analysis. Am. J. Gastroenterol.115, 56–69 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Niedermaier, T., Tikk, K., Gies, A., Bieck, S. & Brenner, H. Sensitivity of fecal immunochemical test for colorectal cancer detection differs according to stage and location. Clin. Gastroenterol. Hepatol.18, 2920–2928.e2926 (2020). [DOI] [PubMed] [Google Scholar]
- 8.Mina, P. R. in Metabolomics: Recent Advances and Future Applications. Metabolomics Approach to Identify Biomarkers of Epidemic Diseases, chapter 4(eds Soni, V. & Hartman, T. E.) pp. 93–126 (Springer International Publishing, 2023).
- 9.Schmidt, D. R. et al. Metabolomics in cancer research and emerging applications in clinical oncology. CA Cancer J. Clin.71, 333–358 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gold, A., Choueiry, F., Jin, N., Mo, X. & Zhu, J. The application of metabolomics in recent colorectal cancer studies: a state-of-the-art review. Cancers14, 725 (2022). [DOI] [PMC free article] [PubMed]
- 11.Erben, V., Bhardwaj, M., Schrotz-King, P. & Brenner, H. Metabolomics biomarkers for detection of colorectal neoplasms: a systematic review. Cancers10, 246 (2018). [DOI] [PMC free article] [PubMed]
- 12.Cai, Q. et al. Prospective study of urinary prostaglandin E2 metabolite and colorectal cancer risk. J. Clin. Oncol.24, 5010–5016 (2006). [DOI] [PubMed] [Google Scholar]
- 13.Cross, A. J. et al. A prospective study of serum metabolites and colorectal cancer risk. Cancer120, 3049–3057 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Geijsen, A. et al. Plasma metabolites associated with colorectal cancer: a discovery-replication strategy. Int. J. Cancer145, 1221–1231 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kühn, T. et al. Higher plasma levels of lysophosphatidylcholine 18:0 are related to a lower risk of common cancers in a prospective metabolomics study. BMC Med.14, 13 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Loftfield, E. et al. Prospective associations of circulating bile acids and short-chain fatty acids with incident colorectal cancer. JNCI Cancer Spectr.6, pkac027 (2022). [DOI] [PMC free article] [PubMed]
- 17.McCullough, M. L., Hodge, R. A., Campbell, P. T., Stevens, V. L. & Wang, Y. Pre-diagnostic circulating metabolites and colorectal cancer risk in the cancer prevention study-II nutrition cohort. Metabolites11, 156 (2021). [DOI] [PMC free article] [PubMed]
- 18.Myte, R. et al. Untangling the role of one-carbon metabolism in colorectal cancer risk: a comprehensive Bayesian network analysis. Sci. Rep.7, 43434 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Pham, T. T. et al. Pre-diagnostic circulating resistin concentrations are not associated with colorectal cancer risk in the European prospective investigation into cancer and nutrition study. Cancers14, 5499 (2022). [DOI] [PMC free article] [PubMed]
- 20.Rothwell, J. A. et al. Metabolic signatures of healthy lifestyle patterns and colorectal cancer risk in a European cohort. Clin. Gastroenterol. Hepatol.20, e1061–e1082 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Shu, X. et al. Prospective study of blood metabolites associated with colorectal cancer risk. Int. J. Cancer143, 527–534 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Telleria, O. et al. A comprehensive metabolomics analysis of fecal samples from advanced adenoma and colorectal cancer patients. Metabolites12, 550 (2022). [DOI] [PMC free article] [PubMed]
- 23.Troisi, J. et al. A metabolomics-based screening proposal for colorectal cancer. Metabolites12, 110 (2022). [DOI] [PMC free article] [PubMed]
- 24.Vidman, L. et al. Untargeted plasma metabolomics and risk of colorectal cancer-an analysis nested within a large-scale prospective cohort. Cancer Metab.11, 17 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Xie, Z. et al. Metabolomic analysis of gut metabolites in patients with colorectal cancer: Association with disease development and outcome. Oncol. Lett. 26, 358 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kühn, T. et al. Prediagnostic plasma bile acid levels and colon cancer risk: a prospective study. J. Natl Cancer Inst.112, 516–524 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Papadimitriou, N. et al. Circulating tryptophan metabolites and risk of colon cancer: results from case-control and prospective cohort studies. Int. J. Cancer149, 1659–1669 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Farshidfar, F. et al. A validated metabolomic signature for colorectal cancer: exploration of the clinical value of metabolomics. Br. J. Cancer115, 848–857 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Liu, Y. et al. Quantification of serum metabolites in early colorectal adenomas using isobaric labeling mass spectrometry. J. Proteome Res.22, 1483–1491 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Deng, L. et al. Development and validation of a high-throughput mass spectrometry based urine metabolomic test for the detection of colonic adenomatous polyps. Metabolites7, 32 (2017). [DOI] [PMC free article] [PubMed]
- 31.Deng, L. et al. Clinical validation of a novel urine-based metabolomic test for the detection of colonic polyps on Chinese population. Int. J. Colorectal Dis.32, 741–743 (2017). [DOI] [PubMed] [Google Scholar]
- 32.Eisner, R., Greiner, R., Tso, V., Wang, H. & Fedorak, R. N. A machine-learned predictor of colonic polyps based on urinary metabolomics. Biomed. Res. Int.2013, 303982 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wang, H., Tso, V., Wong, C., Sadowski, D. & Fedorak, R. N. Development and validation of a highly sensitive urine-based test to identify patients with colonic adenomatous polyps. Clin. Transl. Gastroenterol.5, e54 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Hang, D. et al. Plasma metabolomic profiles for colorectal cancer precursors in women. Eur. J. Epidemiol.37, 413–422 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Pickens, C. A., Albuquerque Pereira, M. F. & Fenton, J. I. Long-chain omega-6 plasma phospholipid polyunsaturated fatty acids and association with colon adenomas in adult men: a cross-sectional study. Eur. J. Cancer Prev.26, 497–505 (2017). [DOI] [PubMed] [Google Scholar]
- 36.Amiot, A. et al. (1)H NMR spectroscopy of fecal extracts enables detection of advanced colorectal neoplasia. J. Proteome Res.14, 3871–3881 (2015). [DOI] [PubMed] [Google Scholar]
- 37.Tevini, J. et al. Changing metabolic patterns along the colorectal adenoma-carcinoma sequence. J. Clin. Med.11, 721 (2022). [DOI] [PMC free article] [PubMed]
- 38.Imperiale, T. F., Gruber, R. N., Stump, T. E., Emmett, T. W. & Monahan, P. O. Performance characteristics of fecal immunochemical tests for colorectal cancer and advanced adenomatous polyps: a systematic review and meta-analysis. Ann. Intern. Med.170, 319–329 (2019). [DOI] [PubMed] [Google Scholar]
- 39.Marchand, C. R., Farshidfar, F., Rattner, J. & Bathe, O. F. A framework for development of useful metabolomic biomarkers and their effective knowledge translation. Metabolites8, 59 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Townsend, M. K. et al. Reproducibility of metabolomic profiles among men and women in 2 large cohort studies. Clin. Chem.59, 1657–1667 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hon, K. W., Zainal Abidin, S. A., Othman, I. & Naidu, R. The crosstalk between signaling pathways and cancer metabolism in colorectal cancer. Front. Pharmacol.12, 768861 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Jia, W., Xie, G. & Jia, W. Bile acid-microbiota crosstalk in gastrointestinal inflammation and carcinogenesis. Nat. Rev. Gastroenterol. Hepatol.15, 111–128 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Režen, T. et al. The role of bile acids in carcinogenesis. Cell. Mol. Life Sci.79, 243 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Yan, G., Li, L., Zhu, B. & Li, Y. Lipidome in colorectal cancer. Oncotarget7, 33429–33439 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Pakiet, A., Kobiela, J., Stepnowski, P., Sledzinski, T. & Mika, A. Changes in lipids composition and metabolism in colorectal cancer: a review. Lipids Health Dis.18, 29 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Jeon, J. et al. Determining risk of colorectal cancer and starting age of screening based on lifestyle, environmental, and genetic factors. Gastroenterology154, 2152–2164.e2119 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Yusof, H. M., Ab-Rahim, S., Suddin, L. S., Saman, M. S. A. & Mazlan, M. Metabolomics profiling on different stages of colorectal cancer: a systematic review. Malays. J. Med. Sci.25, 16–34 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Erben, V., Poschet, G., Schrotz-King, P. & Brenner, H. Comparing metabolomics profiles in various types of liquid biopsies among screening participants with and without advanced colorectal neoplasms. Diagnostics11, 561 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gupta, A. K., Brenner, D. E. & Turgeon, D. K. Early detection of colon cancer: new tests on the horizon. Mol. Diagn. Ther.12, 77–85 (2008). [DOI] [PubMed] [Google Scholar]
- 50.Ciernikova, S., Sevcikova, A., Stevurkova, V. & Mego, M. Tumor microbiome – an integral part of the tumor microenvironment. Front. Oncol.12, 1063100 (2022). [DOI] [PMC free article] [PubMed]
- 51.Issaq, H. J., Waybright, T. J. & Veenstra, T. D. Cancer biomarker discovery: opportunities and pitfalls in analytical methods. Electrophoresis32, 967–975 (2011). [DOI] [PubMed] [Google Scholar]
- 52.Li, S., Looby, N., Chandran, V. & Kulasingam, V. Challenges in the metabolomics-based biomarker validation pipeline. Metabolites14, 200 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Cross, A. J. et al. Metabolites of tobacco smoking and colorectal cancer risk. Carcinogenesis35, 1516–1522 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Qi, J. et al. Metabolomics and cancer preventive behaviors in the BC Generations Project. Sci. Rep.11, 12094 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Long, N. P. et al. Toward a standardized strategy of clinical metabolomics for the advancement of precision medicine. Metabolites10, 51 (2020). [DOI] [PMC free article] [PubMed]
- 56.Salek, R. M. et al. COordination of standards in metabOlomicS (COSMOS): facilitating integrated metabolomics data access. Metabolomics11, 1587–1597 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Galal, A., Talal, M. & Moustafa, A. Applications of machine learning in metabolomics: disease modeling and classification. Front. Genet.13, 1017340 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Barichello, S. et al. Comparative effectiveness and cost-effectiveness analysis of a urine metabolomics test vs. alternative colorectal cancer screening strategies. Int. J. Colorect. Dis.34, 1953–1962 (2019). [DOI] [PubMed] [Google Scholar]
- 59.Page, M. J. et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ372, n71 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Whiting, P. F. et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann. Intern. Med.155, 529–536 (2011). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data generated and analyzed during this study are included in the article and its supplementary information files.


