Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2021 Oct 5;11:19801. doi: 10.1038/s41598-021-99396-5

A prediction model using 2-propanol and 2-butanone in urine distinguishes breast cancer

Shoko Kure 1,✉,#, Sera Satoi 2,#, Toshihiko Kitayama 3, Yuta Nagase 3, Nobuo Nakano 3, Marina Yamada 4, Noboru Uchiyama 5, Satoshi Miyashita 6, Shinya Iida 7, Hiroyuki Takei 8, Masao Miyashita 9,10
PMCID: PMC8492640  PMID: 34611278

Abstract

Safe and noninvasive methods for breast cancer screening with improved accuracy are urgently needed. Volatile organic compounds (VOCs) in biological samples such as breath and blood have been investigated as noninvasive novel markers of cancer. We investigated volatile organic compounds in urine to assess their potential for the detection of breast cancer. One hundred and ten women with biopsy-proven breast cancer and 177 healthy volunteers were enrolled. The subjects were divided into two groups: a training set and an external validation set. Urine samples were collected and analyzed by gas chromatography and mass spectrometry. A predictive model was constructed by multivariate analysis, and the sensitivity and specificity of the model were confirmed using both a training set and an external set with reproducibility tests. The training set included 60 breast cancer patients (age 34–88 years, mean 60.3) and 60 healthy controls (age 34–81 years, mean 58.7). The external validation set included 50 breast cancer patients (age 35–85 years, mean 58.8) and 117 healthy controls (age 18–84 years, mean 51.2). One hundred and ninety-one compounds detected in at least 80% of the samples from the training set were used for further analysis. The predictive model that best-detected breast cancer at various clinical stages was constructed using a combination of two of the compounds, 2-propanol and 2-butanone. The sensitivity and specificity in the training set were 93.3% and 83.3%, respectively. Triplicated reproducibility tests were performed by randomly choosing ten samples from each group, and the results showed a matching rate of 100% for the breast cancer patient group and 90% for the healthy control group. Our prediction model using two VOCs is a useful complement to the current diagnostic tools. Further studies inclusive of benign tumors and non-breast malignancies are warranted.

Subject terms: Breast cancer, Cancer screening

Introduction

Breast cancer is the most frequent cause of death in women worldwide. In 2020, over two million new cases of breast cancer were diagnosed, and 684,996 persons died from the disease1. The early detection of breast cancer is an important step toward achieving efficient treatment. Mammography (MG), the most commonly used screening test at present, can detect breast cancers during the asymptomatic phase and reduce mortality among women of certain ages24. Yet MG screening has several drawbacks. First, MG detects benign lesions, which can lead to unnecessary testing, treatment, and anxiety5. Second, MG is less sensitive in dense breast6. Third, MG is associated with significant pain caused by the relatively strong pressure applied to the breast. An alternative to MG that can screen for breast cancer safely, painlessly, and noninvasively is therefore urgently awaited.

Volatile organic compounds (VOCs) in biological samples such as breath and blood have been investigated in connection with cancer detection for more than two decades. The potential of VOCs as non-invasive biomarkers has been supported by reports on the capabilities of sniffer dogs and sensory devices in distinguishing between healthy controls and patients with cancers of the lung710, colon or rectum11, stomach12,13, liver14, head and neck1517, ovaries18, and breast1926. The combination of multiple biomarkers has strengthened the discriminatory power of this approach, raising accuracy to rates of 0.9 or higher in multiple studies.

While the previous studies have yielded promising results, critical steps still need to be taken to standardize the sample collection and storage and handling of the data, and to validate the results in independent samples. While the advantages of urine as an alternative matrix for volatile biomarkers have been outlined in lung cancer27, the data are scanty on cancers of other organs, including the breast. In this study we sought to identify and analyze VOCs that appear specifically in the urine of breast cancer patients. We assessed the potential of VOCs to become biomarkers of breast cancer by constructing a prediction model using a training set and an external validation set with triplicated reproducibility tests.

Methods

Subjects

The subjects were divided into two groups: patients with primary breast cancer and healthy volunteers (controls). The primary breast cancer patients were who were diagnosed by either fine-needle aspiration cytology or core-needle biopsy at Nippon Medical School Chiba-Hokusoh Hospital from November 2015 to October 2019 were enrolled. Clinical stages of the breast cancer patients were classified according to the Union for International Union Cancer Control (UICC) classification. The histological subtypes were based on the 15th St. Gallen International Breast Cancer Conference 201728. Control subjects with no histories of previously diagnosed cancer of any type were recruited from the public in systemic cancer screenings at Nippon Medical School Chiba-Hokusoh Hospital and Koyukai Asakusa Clinic over the same period. All procedures performed in this study involving human participants were performed in accordance with the Declaration of Helsinki (as reserved in 2013). The study was approved by the ethics committees of Nippon Medical School Chiba Hokusoh Hospital (IRB#320). All subjects provided their signed informed consent before enrolment.

Urine samples

Urine samples were collected with paper cups (Harn cup laminate A, Nissho Sangyo, Tokyo, Japan), transferred to sterile test tubes (Sterile SP tube TD4000, Eiken Chemical Co., Tokyo, Japan), sealed with caps, and stored at − 30 ℃ until analysis in 3 ml volumes. The breast cancer patients provided the samples a few days before surgery; the controls provided them during the cancer screening tests. All of the samples were transferred to the analysis institution, RIKEN KEIKI Co., Ltd., by a refrigerated courier service.

After the urine samples were thawed in a refrigerator, more than 3 mL of each sample was filtered and sterilized (Hawatch Scientific, PES Syringe Filter: Pore Size = 0.22 μm, Diameter = 25 mm, Material = PES Gama Sterile). Next, the sterilized samples were pipetted in 3 ml volumes into vials for an HS-20, and NaCl (> 99.5%, FUJIFILM Wako Pure Chemical Corporation) was added. These vials were sealed with the aluminum-cap (Silicone/PTFE, Shimazu GLC). These protocols were performed at a maximum of 3 samples at once to avoid degradation.

Gas chromatography–mass spectrometry (GCMS)

GCMS analysis was performed with GCMS-QP2010 Ultra Gas Chromatograph Mass Spectrometer (Shimadzu Co., Kyoto, Japan) equipped with HS-20 Trap with a capillary column (Inert Cap Pure WAX, 32 m length; 0.25 mm internal diameter; 0.25 μm film thickness). A helium (99.999%) carrier gas set at a flow rate of 1.76 mL/min was used for the GCMS analysis. The GC column temperature was maintained at 30 ℃ for 5 min, raised from 30 to 250 ℃ at a rate of 10 ℃ per min, and maintained at 260 ℃ for 6 min. The mass spectrometry was performed in a scanning mode (m/z = 33.00–300.00).

Sample and data analysis

Urine samples from breast cancer patients, healthy controls, and blank controls were analyzed in the GCMS on the same day using GCMS solution and LabSolutions Insight software Version 2.0 (Shimadzu, Co., Kyoto, Japan, https://www.shimadzu.com/an/products/liquid-chromatograph-mass-spectrometry/lc-ms-software/labsolutions-insight/index.html). The peak-data were obtained from the total ion current chromatogram (TIC) of each sample. Each set of peak-data was annotated according to the NIST/EPA/NIH Mass Spectral Library (NIST11) and WILEY REGISTRY® of Mass Spectral Data 9th Edition (Wiley9), and the area was quantitated. Air contamination and error were adjusted using a stable standard and a blank and samples. The urine of the experiment staff was used as a standard sample. Urine samples were collected from the same staff over several days and stored frozen. After a certain amount of urine has been collected, thaw the frozen urine sample in a refrigerator and mix all it to make it homogenized. After that, the sample was divided into several small-volume storage containers (Eiken Chemical FT2100 sterile screw round-bottom spits) and frozen again. These samples were used as the standard urine sample. A blank sample referred to a tube that contained room air on each day of the sample analysis. These data were further adjusted by urine creatinine levels.

The cancer urine samples were divided into two groups: a training set for building a prediction model, and an external validation set. Compounds detected in fewer than 80% of the samples from the training set were excluded from each group, and the remaining compounds were assessed by the following statistical analysis. Several of the variables were selected by stepwise analysis. Next, the compounds showing breast cancer patient (BCP) < healthy control (HC) and BCP > HC were selected. The prediction model was then constructed by discriminant analysis. The discriminant factor was analyzed by an Receiver Operating Characteristic (ROC) analysis. For reproducibility, 10 randomly chosen BCP samples and 10 randomly chosen HC samples were analyzed three times on different days. The external sample set was also tested, for validation. Discrimination analysis by multiple regression analysis was performed by Microsoft Excel (Microsoft 365). All the other statistical analyses were performed using the R statistical package (www.r-project.org).

Ethics approval and consent to participate

This study was approved by the ethics committees of Nippon Medical School Chiba Hokusoh Hospital (IRB#320).

Consent for publication

A signed informed consent was obtained from each participant. The informed consent is available upon request.

Results

Subjects enrolled in the study

Two hundred and eighty-seven subjects were enrolled in this study, including 110 BCPs and 177 HCs. The BCPs and HCs were randomly allocated to a training set and an external set using software. The training set included 60 BCPs (age 34–88 years, mean 60.3) and 60 HCs (age 34–81 years, mean 58.7). The external validation set included 50 BCPs (age 35–85 years, mean 58.8) and 117 HCs (age 18–84 years, mean 51.2) (Table 1). The clinical stages and histological subtypes are summarized in Table 1.

Table 1.

Subjects enrolled in the study.

Training set External set
BCP HC BCP HC
Number of subjects 60 60 50 117
Age
Median 60 (34–88) 58.5 (34–81) 60.5 (35–85) 48 (18–84)
Mean 60.3 58.7 58.8 51.2
S.D 12.1 12.2 13.7 18.5
Clinical stage
0 0 12
I 30 20
II 30 11
III 0 6
IV 0 1
Histological subtype
Luminal A-like 18 27
Luminal B-like 18 9
Luminal HER2-like 8 4
Pure HER2-like 4 3
Triple-negative-like 11 7
NA 1 0

BCP breast cancer patient, HC healthy control, NA not applicable, S.D. standard deviation.

GCMS analysis

The GCMS analysis of the urine samples showed numerous peaks. A representative total ion chromatogram (TIC) is presented in Fig. 1. Compounds detected in fewer than 80% of the subjects were excluded in each group, and 191 compounds were assessed by an ensuing statistical analysis. The compounds identified are listed in Supplemental Table S1.

Figure 1.

Figure 1

A representative GCMS total ion chromatograms (TIC) of urine volatile organic compounds. TIC of volatile organic compounds from urine samples collected from a breast cancer patient (A) and healthy control (B). Both samples showed various peaks. TIC total ion chromatogram.

Building and validating the prediction model

Using the detected peak data, 10 compounds were selected by a stepwise backward elimination method. Next, the prediction model was built based on the P-value, standardized partial regression coefficient, tolerance, and variance inflation factor (VIF). First, discriminant models were created using the area values of each compound and the area values of each compound corrected for creatinine concentration. To create the discriminant model, we used 60 samples from BCP (stage I = 30 samples, stage II = 30 samples) and 60 samples from HC. By stepwise variable selection method, the detected compounds were narrowed down to constructing the discriminant model. After narrowing down the original compound list to a few compounds, box charts of each compound were used to compare the areas between BCP and HC. The compounds which were "BCP < HC” and "BCP > HC" were selected. A linear discriminant analysis was performed using the selected compounds. The prediction model built through this procedure used two VOCs in combination, 2-propanol and 2-butanone. As the box plots generated by this model show, 2-butanone was higher in BCP than in HC, while 2-propanol was higher in HC than in BCP (Fig. 2). The obtained discriminant equation was used as the discriminant model equation. The discriminant coefficients in the discriminant model equation were obtained by ROC analysis. The scattered plot and the area under the curve (AUC) are shown in Fig. 3. Using this AUC, 0.9442, and the cutoff value were decided by Youden index. The sensitivity was 93.3%, specificity was 83.3%, positive predictive value was 84.8%, negative predictive value was 92.6%, and accuracy was 88.3% for this model (Table 2). The performance of the constructed model for the histological subtypes were also evaluated (Table 3).

Figure 2.

Figure 2

Box charts of the peak areas of the 2-butanone and 2-propanol using the model. Box plots of the peak areas of 2-propanol and 2-butanone generated by the model. The model indicated that 2-butanone was higher in breast cancer patients than in healthy controls (A), and that 2-propanol was higher in healthy controls than in breast cancer patients (B). BCP breast cancer patient, HC healthy control.

Figure 3.

Figure 3

Scatter plots and the area under the curve (AUC) of the samples in the training set. (A) The scatter plots show the areas of 2-butanone and 2-propanol in each sample in the training set. The X-axis and Y-axis represent 2-butanone and 2-propanol, respectively. (B) The AUC, sensitivity, and specificity for this model were 0.9442, 93.3%, and 83.3%, respectively.

Table 2.

The performance of the constructed model with the training set.

True condition
Condition positive Condition negative
Inspection results
Inspection results positive 56 10 Positive predictive value (%)
84.8
Inspection results negative 4 50 Negative predictive value (%)
92.6
Sensitivity (%) Specificity (%) Accuracy (%)
93.3 83.3 88.3

Table 3.

The results for each histological subtype when applying the model to the training set.

BCP HC
Subtype True positive False negative Sensitivity (%) True negative False positive Specificity (%)
Luminal A-like 17 1 94.4 50 10 83.3
Luminal B-like 17 1 94.4
Luminal HER2-like 8 0 100.0
Pure HER2-like 4 0 100.0
Triple-negative-like 10 1 90.9

BCP breast cancer patients, HC healthy controls.

Additional reproducibility tests were performed by randomly choosing ten models from each group in the training set. The reproducibility tests were triplicated on different days, and the results showed a matching rate of 100% for the BCP group and 90% for the HC group (Table 4).

Table 4.

The results of the reproducibility tests.

Name Stage Day 1 Day 2 Day 3 Day 1 = Day 2 = Day 3
Result Result Result Coincidence
BCP Case 1 1
Case 2 1
Case 3 1
Case 4 1
Case 5 1
Case 6 2 × × ×
Case 7 2
Case 8 2
Case 9 2
Case 10 2
100%
○ and × indicate true positive and false negative, respectively
HC Case 11
Case 12 × × ×
Case 13
Case 14
Case 15 × ×
Case 16
Case 17
Case 18
Case 19 × × ×
Case 20
90%
○ and × indicate true negative and false positive, respectively

BCP breast cancer patients, F female HC, healthy controls.

Validation test using an external validation set

Next, an external validation set was used to confirm the validity of the training models. The box plots of the peak areas of 2-butanone and 2-propanol generated by the constructed model are shown in Fig. 4. The AUC was 0.9228, sensitivity was 84%, specificity was 90.5%, positive predictive value was 79.2%, negative predictive value was 92.9%, and accuracy was 88.6% (Fig. 5 and Tables 5, 6).

Figure 4.

Figure 4

Box charts of the peak areas of the 2-butanone and 2-propanol of the external validation set using the model. Box plots of the peak areas of 2-propanol and 2-butanone in the external validation set using the model. The peak areas in the external validation set were similar to those in the training set. (A) Peak areas of 2-butanone and (B) 2-propanol. BCP breast cancer patient, HC healthy control.

Figure 5.

Figure 5

Scatter plots of the samples and the area under the curve (AUC) in the external validation sets. (A) The external validation set was used to confirm the validity of the training models. (B) The area under the curve using the external validation set. The AUC, sensitivity, and specificity for this model were 0.9228, 84.0%, and 90.6%, respectively.

Table 5.

The performance of the constructed model applying to the external validation set.

True condition
Condition positive Condition negative
Inspection results
Inspection results positive 42 11 Positive predictive value (%)
79.2
Inspection results negative 8 105 Negative predictive value (%)
92.9
Sensitivity (%) Specificity (%) Accuracy (%)
84.0 90.5 88.6

BCP breast cancer patient, HC healthy control.

Table 6.

The results for each histological subtype when applying the model to the external validation set.

BCP HC
Subtype True Positive False Negative Sensitivity (%) True Negative False Positive Specificity (%)
Luminal A-like 22 5 81.5 105 11 90.5
Luminal B-like 8 1 88.9
Luminal HER2-like 3 1 75.0
Pure HER2-like 2 1 66.7
Triple-negative-like 7 0 100.0

BCP breast cancer patient, HC healthy control.

Discussion

To the best of our knowledge, our study is the first to build a prediction model to detect breast cancer using a combination of only two VOCs, with the results validated by triplicated reproducibility tests and a validation set analyzed by GCMS. Among the different types of volatile compounds analyzed, we demonstrated that a combination of 2-butanone and 2-propanol was highly effective in detecting breast cancer at various clinical stages, achieving a sensitivity of 93.3% and a specificity of 83.3%.

While cancer screening biomarkers using biological samples have been extensively for the past decade, including several with blood and urine, a model to detect specific VOCs as cancer biomarkers holds the potential for application as a rapid, noninvasive, and inexpensive cancer screening technique that reduces the burdens on the individuals screened. The science of detecting cancers through body fluids was pioneered by Linus Pauling in 197129. More recently, the development of sensor techniques and devices has led to an exponential increase in studies to detect cancer from samples of exhaled breath, blood, urine, and cell-cultured mediums of cancers of the the lung710, colon or rectum11, stomach12,13, liver14, head and neck1517, prostate30, kidney31, ovaries18, and breast1926. Most of the previous studies on VOCs in breast cancer patients have examined breath samples. A series of studies by Phillips et al.25,32 demonstrated that breath oxidative stress markers can distinguish between women with breast cancer and healthy controls. Later, experiments by the same group found that breath samples show good potential as a biomarker for breast cancer21. Compared with earlier studies that included both early and advanced stages of breast cancer, our study is unique in enrolling a large population mainly consisting of patients with early-stage breast cancer, a disease difficult to diagnose with the current screening methods. Our study design supports the utility of this screening for early-stage breast cancer.

While breath sampling is easy and non-invasive, several obstacles impede further research on the development of the technique for cancer screening33. The procedure is impracticable for routine application, the samples are unstable, and noise from background concentrations can interfere with the very small concentrations of volatiles involved. Urine samples are thus expected to offer an alternative matrix for detecting VOC biomarkers. Adapting the headspace gas of urine samples to GCMS analysis has been well established using classical and basic water analysis techniques. Further, the urine can be partitioned, dispensed, mixed, spikes, stored, and dispatched. The only previous studies to investigate VOCs in urine samples of breast cancer patients were based on smaller populations and employed neither reproducibility tests nor external validation sets3436. Our study sought to build a VOC-screening model with urine samples using a training set, an external set, and triplicated reproducublity tests.

The previous studies on the breath samples of breast cancer patients investigated analyses with cross-validation tests20,21,25 and/or external validation sets21. We designed and performed triplicated reproducibility tests by randomly choosing ten samples from each group as a training set and then confirmed the results with an external validation set. The results showed a concordance of 100% for the BCP group and 90% for the HC group, and the constructed model was confirmed by the external validation set. This is the first study to perform triplicated reproducibility tests and validation tests with external sets.

No studies to date have a identified a single VOC that can determine the presence of cancer with high reliability, and no consensus has been reached on a causative connection between specific VOCs or any type of cancer. A number candidate VOCs have the potential to become common biomarkers among cancers in general, but none are specific to any one type of cancer. A combination of two different VOCs in breath might serve as a marker of disease when one is high and the other is low22. We therefore postulated that combinations of several VOCs may be able to increase the overall sensitivity and specificity.

Several previous studies on VOCs in breast cancer patients are listed in Table 7. Previous analyses of the urine samples of breast cancer patients3436 have identified a significant decrease in dimethyl disulfide, and increases in 4-carene, 3-heptanone, phenol, 1,2,4-trimethylbenzene, and 2-methoxythiophene. In our samples, the combination of 2-butanone and 2-propanol was the best detector of breast cancer of various clinical stages, achieving a sensitivity and specificity of 93.3% and 83.3%, respectively. The VOC 2-propanol, also known as isopropyl alcohol, is a colorless liquid used in making cosmetics, perfumes, skin and hair products, and other chemicals. Elevated levels of 2-propanol were identified in an earlier breath analysis of breast cancer patients22 and a study of lung cancer cells in vivo37. On the other hand, 2-propanol was significantly decreased VOCs in urine samples of cholangiocarcinoma or pancreatic cancer38. The level of 2-propanol may change in association with the altered activity of cytochrome P450 in breast cancer26, and the discrepancy of the result may attribute to the different sample types (i.e. breath or urine) used in each study. The detailed mechanism needs to be clarified. The VOC 2-butanone, more widely known as methyl ethyl ketone, is a widely used solvent. It can be obtained by the dehydration of 2,3-butanediol, a natural metabolite produced from glucose by several microorganisms such as Escherichia coli and Klebsiella pneumoniae3941. Our preliminary test confirmed that 2-butanone was not eluted from blank tubes (data not shown). Several ketones, including 2-butanone, are elevated in the breath samples of lung cancer patients42, urine samples of prostate cancer patients43, and one study identified higher levels of 2-butanone in cultured lung cell lines than in normal cell lines in vitro37. The possible origins are endogenous production, microbiota, or environmental exposure. Fatty acid oxidation, the mechanism found to cause 2-butanone production in cancer progression, may result in elevated levels of ketones44,45. Butanoate metabolism is also reported to be highly activated in breast cancer and colon cancer patients34. Though 2-butanone has not been identified as a specific marker of breast cancer, the combination of 2-propanol and 2-butanone, applied with our prediction model constructed by multivariate analysis, proved to be extremely sensitive and specific in distinguishing breast cancer of all histological subtypes from healthy controls. Our study is the first to build a prediction model based on the P-value, standardized partial regression coefficient, and VIF. Further studies to identify the underlying biological mechanism of this combination of VOCs, and its clinical significance for daily practice, are merited.

Table 7.

Published studies on VOCs on breast cancer.

Authors Sample Methods, results
Phillips et al. (2003)25 Breath GCMS, methylated alkane contour
BC (51) vs abnormal MG (50) Sensitivity 62.7%(32/51), specificity 84.0% (42/50)
BC (51) vs healthy (42) Sensitivity 94.1% (48/51), specificity 73.8% (31/42)
Phillips et al. (2006)22 Breath (re-analysis of ref.#24) GCMS
BC (51) vs abnormal MG (50) 2-propanol, 2,3-dihydro-1-phenyl-4(1H)-quinazolinone,
BC (51) vs healthy (42) 1-phenyl-ethanone, heptanal, and isopropyl myristate
Sensitivity 93.8%, specificity 84.6%
Phillips et al. (2010)24 Breath GCMS
BC (54) vs healthy (204) Training set: Sensitivity 78.5%, specificity 88.3%
Test set: sensitivity 75.3%, specificity 84.8%
Patterson et al. (2011)48 Breath GCMS
BC (20) vs healthy (20) Sensitivity 72%, specificity 64%
Silva et al. (2012)36 Urine GCMS
BC (26) vs healthy (21) ↓dimethyl disulfide
↑4-carene, 3-heptanone, phenol,
1,2,4-trimethylbenzene, 2-methoxythiophene,
Sensitivity/Specificity NA
Mangler et al. (2012)26 Breath GCMS
BC (10) vs healthy (10) ↓3-methylhexane, decene, caryophyllene, naphthalene
↑trichlorethylene
Sensitivity/Specificity NA
Li et al. (2014)46 Breath GCMS
BC (22) vs healthy (24) Hexanal, heptanal, octanal,
vs Breast benign tumor (17) and nonanal,
Sensitivity 72.7%, specificity 91.7%
Wang et al. (2014)47 Breath GCMS
BC (39) vs healthy (45) 2,5,6-trimethyloctane,
vs cyclomastopathy (25) 1,4-dimethoxy-2,3-butanediol, cyclohexanone
vs mammary gland fibroma (21) Sensitivity/specificity NA
Barash et al. (2015)20 Breath GCMS
BC (90) vs benign (13) vs healthy (23) 23 compounds
Sensitivities 81–88%, specificities 76–96%
Silva et al. (2017)49 BC cell lines GCMS
2-Pentanone, 2-heptanone, 3-methyl-3-buten-1-ol,
ethyl acetate,
ethyl propanoate and 2-methyl butanoate
Sensitivity/Specificity NA
Phillips et al. (2017)19 Breath GCMS
BC (54) vs healthy (214) 21 compounds,
Training set: AUC = 0.79,
Test set: AUC = 0.77
Cavaco (2018)50 Saliva GCMS
BC (66) vs healthy (40) 3-methyl-pentanoic acid, 4-methyl-pentanoic acid,
phenol, acetic acid, propanic acid, 1,2-decanediol
Sensitivity/specificity NA
Porto-Figueira et al. (2018)34 Urine Needle Trap Microextraction (NTME)/GCMS
BC vs healthy 2-bromophenol, octanoic acid, phenol,
Sensitivity/specificity NA
Phillips et al. (2018)21 Breath -GCMS: test accuracy = 90%
BC (54) vs healthy (124) -GC-surface acoustic wave detection (GCSAW): test accuracy = 86%
Silva (2019)35 Urine GCMS
BC (31) vs healthy (40) 10 compounds (sulfur compounds, terpenoids and
carbonyl compounds),
Sensitivity/Specificity NA, AUC = 0.842
de Leon-Martinez et al. (2020)51 Breath “Electrical nose”, Compounds NA,
BC (262) vs healthy (181) Sensitivity 100%, specificity 100%
Zhang et al. (2020)52 Breath GCMS, combination of (S)‐1,2‐propanediol,
BC (78) vs healthy (71) cyclopentanone, ethylenecarbonate, 3‐methoxy‐1,2
vs gastric cancer (54) propanediol, 3‐methylpyridine, phenol,
and tetramethylsilane
Sensitivity 93.36%, specificity 71.6%

BC breast cancer, GCMS gas chromatography–mass spectrometry, NA not applicable, AUC area under the curve.

Our study has some limitations. First, the results were derived from an analysis of a fairly non-diverse population, and thus may not extend to a broader population. Second, the control samples were only collected from healthy individuals, and the VOCs examined were not confirmed to be breast-cancer-specific as biomarkers. Previous analyses of VOCs in breath samples20,46,47 have shown that, among the VOCs that were significantly increased in breast cancer patients versus healthy controls and benign breast tumors, only one compound was significantly altered in the breast cancer patients versus the benign tumors. While the two markers in the current study are useful in complementing the current diagnostic tools, further studies with larger populations inclusive of benign tumors and non-breast malignancies are warranted. Furthermore, since breast cancer is a heterogeneous disease, analysis including the molecular subtypes, which is an independent classification from the histological subtypes, is desirable. However, the molecular subtypes were not available for the current study. To further substantiate our results, molecular subtypes by gene expression analysis are needed.

Conclusion

Our prediction model using the combination of the VOCs 2-propanol and 2-butanone usefully complements the current diagnostic tools for early-stage breast cancer. Further studies inclusive of benign tumors and non-breast malignancies are warranted.

Supplementary Information

Supplementary Table S1. (31.3KB, docx)

Abbreviations

AUC

Area under the curve

BCP

Breast cancer patient

GCMS

Gas chromatography–mass spectrometry

HC

Healthy control

MG

Mammography

NA

Not applicable

ND

Not determined

ROC

Receiver operating characteristic

S.D.

Standard deviation

TIC

Total ion current chromatogram

VIF

Variance inflation factor

VOC

Volatile organic compound

Author contributions

M.M., N.N., H.T., and S.I. conceptualized the study. T.K., Y.N., M.Y., N.U., H.T., S.I., and M.M. collected the data. M.M., S.M., T.K., Y.N., N.N., S.S., and S.K. analyzed the data. SK and SS drafted the manuscript. All of the authors reviewed and edited the manuscript, and read and approved the final version.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author, for reasonable uses, upon request.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Shoko Kure and Sera Satoi.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-021-99396-5.

References

  • 1.Sung HG, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2020 doi: 10.3322/caac.21660. [DOI] [PubMed] [Google Scholar]
  • 2.Autier P, Boniol M. Mammography screening: A major issue in medicine. Eur. J. Cancer. 2018;90:34–62. doi: 10.1016/j.ejca.2017.11.002. [DOI] [PubMed] [Google Scholar]
  • 3.Nelson HD, et al. Screening for breast cancer: An update for the U.S. preventive services task force. Ann. Intern. Med. 2009;151:727–737. doi: 10.7326/0003-4819-151-10-200911170-00009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Njor S, et al. Breast cancer mortality in mammographic screening in Europe: A review of incidence-based mortality studies. J. Med. Screen. 2012;19(Suppl 1):33–41. doi: 10.1258/jms.2012.012080. [DOI] [PubMed] [Google Scholar]
  • 5.Brodersen J, Siersma VD. Long-term psychosocial consequences of false-positive screening mammography. Ann. Fam. Med. 2013;11:106–115. doi: 10.1370/afm.1466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Tohno E, Ueno E, Watanabe H. Ultrasound screening of breast cancer. Breast Cancer (Tokyo, Japan) 2009;16:18–22. doi: 10.1007/s12282-008-0082-8. [DOI] [PubMed] [Google Scholar]
  • 7.Gordon SM, Szidon JP, Krotoszynski BK, Gibbons RD, O'Neill HJ. Volatile organic compounds in exhaled air from patients with lung cancer. Clin. Chem. 1985;31:1278–1282. doi: 10.1093/clinchem/31.8.1278. [DOI] [PubMed] [Google Scholar]
  • 8.Westhoff M, et al. Ion mobility spectrometry for the detection of volatile organic compounds in exhaled breath of patients with lung cancer: Results of a pilot study. Thorax. 2009;64:744–748. doi: 10.1136/thx.2008.099465. [DOI] [PubMed] [Google Scholar]
  • 9.Wang Y, et al. The analysis of volatile organic compounds biomarkers for lung cancer in exhaled breath, tissues and cell lines. Cancer Biomark.: Sect. A Dis. Mark. 2012;11:129–137. doi: 10.3233/cbm-2012-00270. [DOI] [PubMed] [Google Scholar]
  • 10.D'Amico A, et al. An investigation on electronic nose diagnosis of lung cancer. Lung Cancer. 2010;68:170–176. doi: 10.1016/j.lungcan.2009.11.003. [DOI] [PubMed] [Google Scholar]
  • 11.Altomare DF, et al. Exhaled volatile organic compounds identify patients with colorectal cancer. Br. J. Surg. 2013;100:144–150. doi: 10.1002/bjs.8942. [DOI] [PubMed] [Google Scholar]
  • 12.Amal H, et al. Detection of precancerous gastric lesions and gastric cancer through exhaled breath. Gut. 2016;65:400–407. doi: 10.1136/gutjnl-2014-308536. [DOI] [PubMed] [Google Scholar]
  • 13.Xu ZQ, et al. A nanomaterial-based breath test for distinguishing gastric cancer from benign gastric conditions. Br. J. Cancer. 2013;108:941–950. doi: 10.1038/bjc.2013.44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Qin T, et al. The screening of volatile markers for hepatocellular carcinoma. Cancer Epidemiol., Biomark. Prev.: Publ. Am. Assoc. Cancer Res., Cospons. Am. Soc. Prev. Oncol. 2010;19:2247–2253. doi: 10.1158/1055-9965.EPI-10-0302. [DOI] [PubMed] [Google Scholar]
  • 15.Leunis N, et al. Application of an electronic nose in the diagnosis of head and neck cancer. Laryngoscope. 2014;124:1377–1381. doi: 10.1002/lary.24463. [DOI] [PubMed] [Google Scholar]
  • 16.Hakim M, et al. Diagnosis of head-and-neck cancer from exhaled breath. Br. J. Cancer. 2011;104:1649–1655. doi: 10.1038/bjc.2011.128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Gruber M, et al. Analysis of exhaled breath for diagnosing head and neck squamous cell carcinoma: A feasibility study. Br. J. Cancer. 2014;111:790–798. doi: 10.1038/bjc.2014.361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Amal H, et al. Assessment of ovarian cancer conditions from exhaled breath. Int. J. Cancer. 2015;136:E614–622. doi: 10.1002/ijc.29166. [DOI] [PubMed] [Google Scholar]
  • 19.Phillips M, Cataneo RN, Lebauer C, Mundada M, Saunders C. Breath mass ion biomarkers of breast cancer. J. Breath Res. 2017;11:016004. doi: 10.1088/1752-7163/aa549b. [DOI] [PubMed] [Google Scholar]
  • 20.Barash O, et al. Differentiation between genetic mutations of breast cancer by breath volatolomics. Oncotarget. 2015;6:44864–44876. doi: 10.18632/oncotarget.6269. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Phillips M, et al. Prediction of breast cancer risk with volatile biomarkers in breath. Breast Cancer Res. Treat. 2018;170:343–350. doi: 10.1007/s10549-018-4764-4. [DOI] [PubMed] [Google Scholar]
  • 22.Phillips M, et al. Prediction of breast cancer using volatile biomarkers in the breath. Breast Cancer Res. Treat. 2006;99:19–21. doi: 10.1007/s10549-006-9176-1. [DOI] [PubMed] [Google Scholar]
  • 23.Phillips M, et al. Rapid point-of-care breath test for biomarkers of breast cancer and abnormal mammograms. PLoS ONE. 2014;9:e90226. doi: 10.1371/journal.pone.0090226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Phillips M. Volatile biomarkers in the breath of women with breast cancer. J. Breath Res. 2010 doi: 10.1088/1752-7155/4/2/026003. [DOI] [PubMed] [Google Scholar]
  • 25.Phillips M, et al. Volatile markers of breast cancer in the breath. Breast J. 2003;9:184–191. doi: 10.1046/j.1524-4741.2003.09309.x. [DOI] [PubMed] [Google Scholar]
  • 26.Mangler M, et al. Volatile organic compounds (VOCs) in exhaled breath of patients with breast cancer in a clinical setting. Ginekol. Pol. 2012;83:730–736. [PubMed] [Google Scholar]
  • 27.Hanai Y, et al. Urinary volatile compounds as biomarkers for lung cancer. Biosci. Biotechnol. Biochem. 2012;76:679–684. doi: 10.1271/bbb.110760. [DOI] [PubMed] [Google Scholar]
  • 28.Curigliano G, et al. De-escalating and escalating treatments for early-stage breast cancer: The St. Gallen international expert consensus conference on the primary therapy of early breast cancer 2017. Ann. Oncol. 2017;28:1700–1712. doi: 10.1093/annonc/mdx308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Pauling L, Robinson AB, Teranishi R, Cary P. Quantitative analysis of urine vapor and breath by gas-liquid partition chromatography. Proc. Natl. Acad. Sci. U.S.A. 1971;68:2374–2376. doi: 10.1073/pnas.68.10.2374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lima AR, et al. Identification of a biomarker panel for improvement of prostate cancer diagnosis by volatile metabolic profiling of urine. Br. J. Cancer. 2019;121:857–868. doi: 10.1038/s41416-019-0585-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Monteiro M, et al. GC-MS metabolomics-based approach for the identification of a potential VOC-biomarker panel in the urine of renal cell carcinoma patients. J. Cell. Mol. Med. 2017;21:2092–2105. doi: 10.1111/jcmm.13132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Phillips M, et al. Volatile organic compounds in breath as markers of lung cancer: A cross-sectional study. Lancet (London, England) 1999;353:1930–1933. doi: 10.1016/s0140-6736(98)07552-7. [DOI] [PubMed] [Google Scholar]
  • 33.Becker R. Non-invasive cancer detection using volatile biomarkers: Is urine superior to breath? Med. Hypotheses. 2020;143:110060. doi: 10.1016/j.mehy.2020.110060. [DOI] [PubMed] [Google Scholar]
  • 34.Porto-Figueira P, Pereira JAM, Câmara JS. Exploring the potential of needle trap microextraction combined with chromatographic and statistical data to discriminate different types of cancer based on urinary volatomic biosignature. Anal. Chim. Acta. 2018;1023:53–63. doi: 10.1016/j.aca.2018.04.027. [DOI] [PubMed] [Google Scholar]
  • 35.Silva CL. Implementing a central composite design for the optimization of solid phase microextraction to establish the urinary volatomic expression: A first approach for breast cancer. Metabolomics. 2019 doi: 10.1007/s11306-019-1525-2. [DOI] [PubMed] [Google Scholar]
  • 36.Silva CL, Passos M, Câmara JS. Solid phase microextraction, mass spectrometry and metabolomic approaches for detection of potential urinary cancer biomarkers—A powerful strategy for breast cancer diagnosis. Talanta. 2012;89:360–368. doi: 10.1016/j.talanta.2011.12.041. [DOI] [PubMed] [Google Scholar]
  • 37.Schallschmidt K, et al. In vitro cultured lung cancer cells are not suitable for animal-based breath biomarker detection. J. Breath Res. 2015;9:027103. doi: 10.1088/1752-7155/9/2/027103. [DOI] [PubMed] [Google Scholar]
  • 38.Navaneethan U, et al. Volatile organic compounds in urine for noninvasive diagnosis of malignant biliary strictures: A pilot study. Dig. Dis. Sci. 2015;60:2150–2157. doi: 10.1007/s10620-015-3596-x. [DOI] [PubMed] [Google Scholar]
  • 39.Guo X, et al. Effect of the inactivation of lactate dehydrogenase, ethanol dehydrogenase, and phosphotransacetylase on 2,3-butanediol production in Klebsiella pneumoniae strain. Biotechnol. Biofuels. 2014;7:44. doi: 10.1186/1754-6834-7-44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Kim B, et al. Enhanced 2,3-butanediol production in recombinant Klebsiella pneumoniae via overexpression of synthesis-related genes. J. Microbiol. Biotechnol. 2012;22:1258–1263. doi: 10.4014/jmb.1201.01044. [DOI] [PubMed] [Google Scholar]
  • 41.Kim DK, et al. Metabolic engineering of a novel Klebsiella oxytoca strain for enhanced 2,3-butanediol production. J. Biosci. Bioeng. 2013;116:186–192. doi: 10.1016/j.jbiosc.2013.02.021. [DOI] [PubMed] [Google Scholar]
  • 42.Schallschmidt K, et al. Comparison of volatile organic compounds from lung cancer patients and healthy controls-challenges and limitations of an observational study. J. Breath Res. 2016;10:046007. doi: 10.1088/1752-7155/10/4/046007. [DOI] [PubMed] [Google Scholar]
  • 43.Jimenez-Pacheco A, et al. Furan and p-xylene as candidate biomarkers for prostate cancer. Urol. Oncol. 2018;36(243):e221–243 e227. doi: 10.1016/j.urolonc.2017.12.026. [DOI] [PubMed] [Google Scholar]
  • 44.Orywal K, Szmitkowski M. Alcohol dehydrogenase and aldehyde dehydrogenase in malignant neoplasms. Clin. Exp. Med. 2017;17:131–139. doi: 10.1007/s10238-016-0408-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Crabb DW, Matsumoto M, Chang D, You M. Overview of the role of alcohol dehydrogenase and aldehyde dehydrogenase and their variants in the genesis of alcohol-related pathology. Proc. Nutr. Soc. 2004;63:49–63. doi: 10.1079/pns2003327. [DOI] [PubMed] [Google Scholar]
  • 46.Li J, et al. Investigation of potential breath biomarkers for the early diagnosis of breast cancer using gas chromatography-mass spectrometry. Clin. Chim. Acta. 2014;436:59–67. doi: 10.1016/j.cca.2014.04.030. [DOI] [PubMed] [Google Scholar]
  • 47.Wang C, et al. Volatile organic metabolites identify patients with breast cancer, cyclomastopathy, and mammary gland fibroma. Sci. Rep. 2014;4:5383. doi: 10.1038/srep05383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Patterson SG, et al. Breath analysis by mass spectrometry: A new tool for breast cancer detection? Am. Surg. 2011;77:747–751. doi: 10.1177/000313481107700632. [DOI] [PubMed] [Google Scholar]
  • 49.Silva CL, Perestrelo R, Silva P, Tomas H, Camara JS. Volatile metabolomic signature of human breast cancer cell lines. Sci. Rep. 2017;7:43969. doi: 10.1038/srep43969. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Cavaco C. Screening of salivary volatiles for putative breast cancer discrimination: An exploratory study involving geographically distant populations. Anal. Bioanal. Chem. 2018;410:4459–4468. doi: 10.1007/s00216-018-1103-x. [DOI] [PubMed] [Google Scholar]
  • 51.de Leon-Martinez LD, et al. Identification of profiles of volatile organic compounds in exhaled breath by means of an electronic nose as a proposal for a screening method for breast cancer: A case-control study. J. Breath Res. 2020;14:046009. doi: 10.1088/1752-7163/aba83f. [DOI] [PubMed] [Google Scholar]
  • 52.Zhang Y, et al. Early diagnosis of breast cancer from exhaled breath by gas chromatography-mass spectrometry (GC/MS) analysis: A prospective cohort study. J. Clin. Lab. Anal. 2020;34:e23526. doi: 10.1002/jcla.23526. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Table S1. (31.3KB, docx)

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author, for reasonable uses, upon request.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES