Abstract
Although tumor-associated autoantibodies (TAAbs) are promising biomarkers for early cancer detection, the diagnostic performance of individual TAAbs for esophageal squamous cell carcinoma (ESCC) remains limited. In this study, we constructed machine learning (ML) models for ESCC diagnosis using the serum levels of 6 TAAbs (c-myc, NY-ESO-1, p53, p62, RalA, and Sui1) and compared their performance with that of a conventional approach. When the training and test datasets were combined, serum samples from 339 patients with ESCC and 152 healthy controls were analyzed. To address class imbalance, the training data were oversampled using the synthetic minority oversampling technique. Classification models were developed using 5 ML algorithms, among which the svmLinear (support vector machine with a linear kernel), svmRadial (support vector machines with a radial basis function kernel), and neural network models demonstrated robust performance without overfitting. At specificities exceeding 90%, these models achieved substantially higher sensitivities than those obtained using any single TAAb. Notably, the svmLinear and neural network models achieved sensitivities greater than 40% for early-stage ESCC (stage 0/I) in the test dataset. Overall, the integration of multiple TAAbs with ML substantially improved the diagnostic performance for ESCC, highlighting the clinical potential of ML-based approaches.
Graphical Abstract
Introduction
With the success of cancer immunotherapies represented by immune checkpoint inhibitors, the close relationship of cancer with the immune system from the earliest stages of its development has gained cognizance [1]. Tumor-associated autoantibodies (TAAbs) are produced as a result of the humoral immune response against cancer. In lung cancer, TAAbs can be detected in the serum up to 5 years before the definitive tumor diagnosis [2]. Moreover, TAAbs are reportedly detectable with higher sensitivity than tumor-associated antigens [3]. Serum anti-p53 antibody, a TAAb, is reimbursed under the Japanese national health insurance as an in vitro diagnostic test for esophageal, colorectal, and breast cancers and is currently used in clinical practice [4,5].
Patients with esophageal squamous cell carcinoma (ESCC) have a poor prognosis [6], despite attempts to control ESCC using multidisciplinary treatments [7]. In addition, the early detection of ESCC is an important treatment strategy for improving prognosis [8]. However, in ESCC, the sensitivity of anti-p53 antibodies alone is only 25%, and many patients with ESCC remain undiagnosed [9]. Moreover, no TAAbs with a diagnostic performance higher than that of anti-p53 antibodies have been identified. In our previous study, we evaluated the diagnostic performance of 6 TAAbs (c-myc, NY-ESO-1, p53, p62, RalA, and Sui1) using a conventional approach, which yielded sensitivities ranging from 7.8% to 30% [8], signifying no dramatic improvement in diagnostic performance. In recent ESCC-related studies, the application of machine learning (ML) techniques for early detection has increased in imaging-based research [10–12]. In contrast, studies integrating ML with TAAbs remain limited.
In this study, we constructed ML models for the diagnosis of ESCC using the serum levels of the 6 TAAbs described above in patients with ESCC and healthy controls (HCs) and compared their performance with that obtained using the conventional approach used in our previous study. Our results clarify the significance of ML approaches in the clinical diagnosis of ESCC.
Materials and Methods
Patients and sample collection
The number of patients and HCs in each cohort is shown in Table 1. This study analyzed 339 patients with histologically proven primary ESCC. The clinicopathological characteristics of patients from Chiba Cancer Center or those assigned to the test dataset have been reported in our previous studies [8,9]. Only the tumor–node–metastasis stage data were available for patients from BioBank Japan. The HCs had no history of malignant disease. The average age of the controls assigned to the training (n = 74) and test (n = 78) datasets was 37.9 and 48.2 years, and the male-to-female ratios were 35:39 and 49:29, respectively.
Table 1.
Tumor-associated autoantibodies (TAAbs) used in this study
| Dataset | Sample type | Cohort | Sample size | TAAbs | |||||
|---|---|---|---|---|---|---|---|---|---|
| c-myc | Sui1 | RalA | p62 | p53 | NY-ESO-1 | ||||
| Training | ESCC | BBJ | 91 | This study | This study | This study | This study | This study | This study |
| CCC | 85 | This study | This study | This study | This study | Hoshino et al. [17] | Hoshino et al. [17] | ||
| HC | MBL | 74 | Sumazaki et al. [26] | ||||||
| Test | ESCC | KOU | 26 (1) | Shiratori et al. [8] and this study | |||||
| NUM | 1 (1) | ||||||||
| SCC | 3 (0) | ||||||||
| SUH | 44 (8) | ||||||||
| THO | 30 (6) | ||||||||
| TJU | 4 (0) | ||||||||
| TMD | 49 (5) | ||||||||
| TWM | 6 (5) | ||||||||
| HC | BBJ | 78 | Shiratori et al. [8] | ||||||
The numbers in parentheses indicate the number of samples newly utilized for this study out of the sample size of each cohort. “This study” refers to data newly used in the present research, whereas data cited from previous studies are reanalyzed data.
ESCC, esophageal squamous cell carcinoma; HC, healthy control; BBJ, BioBank Japan; CCC, Chiba Cancer Center; MBL, Medical & Biological Laboratories Co., Ltd.; KOU, Keio University School of Medicine; NUM, Nihon University School of Medicine; SCC, Saitama Cancer Center; SUH, Showa University School of Medicine; THO, Toho University School of Medicine; TJU, Jikei University School of Medicine; TMD, Tokyo Medical and Dental University; TWM, Tokyo Women’s Medical University.
Serum samples were collected before treatment and stored at −80 °C until use. All procedures were approved by the Research Ethics Boards of Toho University School of Medicine, Toho University Omori Medical Center (approval number: M23192), as well as by each participating hospital.
Isolation, purification, and amplification of recombinant tumor-associated antigens and enzyme-linked immunosorbent assay of serum autoantibodies
Full-length cDNA of c-myc, p62, RalA, p53, Sui1, and NY-ESO-1 were amplified using the polymerase chain reaction. Briefly, the recombinant proteins were expressed in Escherichia coli BL21-CodonPlus (DE3)-RIL (Stratagene, La Jolla, CA, USA). Each TAAb extract was added to Ni Sepharose 6 Fast Flow (GE Healthcare UL, Buckinghamshire, UK), and the column was washed with 50 mM imidazole in phosphate-buffered saline. Purified recombinant tumor-associated antigens were eluted with 200 mM imidazole in phosphate-buffered saline. DNA sequencing confirmed that the correct gene had been inserted into the constructed plasmid. Serum samples collected from the patients and controls were analyzed using the enzyme-linked immunosorbent assay. As this test was conducted as a corporate clinical trial, its quality was approved by the Pharmaceuticals and Medical Devices Agency.
Clustering and dimensionality reduction
To examine whether potentially concerning batch effects were present in the patient–TAAb matrix data, Gaussian mixture model (GMM) clustering was performed using the Mclust function of the mclust package (version 5.4.9) in R (version 4.2.3). The number of components was set as the number of cohorts when performing GMM clustering on the training and testing datasets. The patient–TAAb matrix data were subjected to dimensionality reduction using multidimensional scaling (MDS) and visualized in 2 dimensions with annotations indicating the results of GMM clustering and cohort information.
ML model construction and evaluation
The workflow for constructing the ML model is shown in Fig. 1. The training data were oversampled using the synthetic minority oversampling technique (SMOTE) [13] function of the DMwR package (version 0.4.1) in R. For the oversampled samples, the values of the 6 TAAbs were standardized for each patient using Eq. (1).
| (1) |
where denotes the standardized TAAb values of the training dataset, denotes the TAAb value of the training dataset, denotes the mean value of each TAAb, and denotes the sample variance. The test dataset was standardized using the mean and variance of the training dataset, assuming practical use in real-world clinical settings. Therefore, the values of the 6 TAAbs in the test dataset were standardized for each sample using Eq. (2).
| (2) |
where denotes the standardized TAAb values of the test dataset and denotes the TAAb values in the test dataset.
Fig. 1.
Overview of machine learning (ML) model construction and validation. Data on tumor-associated autoantibodies (TAAbs) for the training and testing datasets were obtained from multiple cohorts shown in Table 1. Training data imbalance was addressed using the synthetic minority oversampling technique (SMOTE), and models were constructed using 5 algorithms through 5-fold cross-validation and a grid search. Model accuracy was evaluated using test data. ESCC, esophageal squamous cell carcinoma; HC, healthy control; GS, grid search; CV, cross-validation.
Classification models were constructed using the training function of the caret package (version 6.0-94). The training samples were split into the training and validation sets in a ratio of 8:2, and model training was performed using 5-fold cross-validation. The support vector machines with a radial basis function kernel (svmRadial), support vector machine with a linear kernel (svmLinear), eXtreme gradient boosting (xgbTree), random forest (rf), and neural network (nnet) were selected to construct the classification models. The hyperparameter settings for each ML algorithm used in the grid search are listed in Table S1. When classifying normal controls and ESCC, probability thresholds were set in increments of 0.001, and the sensitivity and specificity were calculated at each threshold. The probability threshold that achieved a specificity of >90% in the training dataset was selected as the final threshold. The area under the curve (AUC) was calculated using the confusion matrix function of the caret package.
Results
Characterization of the TAAb datasets
The serum samples used in this study were collected from 3 cohorts for the training data and 9 cohorts for the test data (Table 1). If a batch effect exists among the cohorts, the dataset cannot be directly used for training and must be corrected using an appropriate method. Therefore, we first examined the presence or absence of batch effects among the cohorts through visualization using MDS. MDS showed that clusters formed by specific cohorts were absent in the training and test datasets (Fig. 2A and B). We further examined the potential batch effects using GMM clustering and found that none of the detected clusters consisted of a single specific cohort, and no association was observed between the detected clusters and cohorts in either dataset (Fig. 2C and D). In addition, we generated violin plots for both the training and test datasets to examine the distribution within each cohort, confirming that no markedly different distributions were observed in either dataset (Fig. S1A and B). These results indicated the absence of a batch effect among the cohorts. Because the HCs differed in mean age between the training and test datasets, we performed correlation analyses between age and the serum level of each TAAb. For most TAAbs, the P values were greater than 0.05, indicating no significant correlation with age. Although a few TAAbs showed P values below 0.05, inspection of the scatter plots suggested that these findings were driven by outliers. Taken together, these results indicate that the serum levels of the 6 TAAbs evaluated in this study were not meaningfully influenced by age (Fig. S2A and B).
Fig. 2.
Multidimensional scaling (MDS) and Gaussian mixture model (GMM) clustering of the training (A) and test datasets (B). The plot colors indicate cohorts, and plot shapes represent GMM cluster assignments. The bar plots show the number of samples in each GMM-derived cluster for the training (C) and test datasets (D). BBJ, BioBank Japan; CCC, Chiba Cancer Center; MBL, Medical & Biological Laboratories Co. Ltd.; KOU, Keio University School of Medicine; NUM, Nihon University School of Medicine; SCC, Saitama Cancer Center; SUH, Showa University School of Medicine; THO, Toho University School of Medicine; TJU, Jikei University School of Medicine; TMD, Tokyo Medical and Dental University; TWM, Tokyo Women’s Medical University.
ML model constructions and performance evaluations
Biomedical datasets are often severely imbalanced, rendering most ML algorithms unsuitable [14]. To address this problem, we used SMOTE to oversample the HCs in the training data. We then constructed classification models using the 5 algorithms to classify ESCC and HC. During the learning process, a grid search was performed to identify the optimal hyperparameters, followed by evaluation via 5-fold cross-validation (Fig. S3). The AUCs in each fold ranged from approximately 0.7 to 0.9, with no substantial differences among the ML models (Fig. S4).
At the >90% specificity cutoff in the training dataset, the ML models constructed with svmLinear, svmRadial, and nnet achieved sensitivities of 58.0%, 49.4%, and 57.4%, respectively, and specificities of 90.5% each in the training dataset (Table 2). For the test dataset, the 3 ML models achieved sensitivities of 52.8%, 44.8%, and 51.5% and specificities of 92.3%, 98.7%, and 92.3%, respectively. The sensitivity and specificity of the ML models constructed with rf and xgbTree in the training dataset were 100% and 90.5%, respectively. In contrast, their specificities in the test dataset plummeted to 0% and 8.97%, respectively. In the training dataset, the AUCs of svmLinear, svmRadial, and nnet were 0.822, 0.788, and 0.825, respectively, whereas those of rf and xgbTree were 1.0 and 1.0, respectively (Fig. 3A). In the test dataset, the AUCs of svmLinear, svmRadial, and nnet were 0.819, 0.814, and 0.814, respectively, whereas those of rf and xgbTree were 0.655 and 0.596, respectively (Fig. 3B). These results indicate that the models constructed with rf and xgbTree were overfitted, resulting in low performance on the test data. In contrast, the models built using svmLinear, svmRadial, and nnet were more robust, as evidenced by their high performances on the test data. Therefore, the ML models constructed using svmLinear, svmRadial, and nnet were used for the downstream analyses.
Table 2.
Sensitivity and specificity in the training and test datasets at the optimal cutoff
| svmLinear | svmRadial | rf | xgbTree | nnet | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | |
| Sensitivity | 58.0% (50.3%–65.3%) | 52.8% (44.8%–60.6%) | 49.4% (41.8%–57.1%) | 44.8% (37.0%–52.8%) | 100% (97.9%–100%) | 100% (97.8%–100%) | 100% (97.9%–100%) | 95.7% (91.4%–98.3%) | 57.4% (49.7%–64.8%) | 51.5% (43.6%–59.4%) |
| Specificity | 90.5% (81.5%–96.1%) | 92.3% (84.0%–97.1%) | 90.5% (81.5%–96.1%) | 98.7% (93.1%–99.9%) | 90.5% (81.5%–96.1%) | 0% (0%–4.62%) | 90.5% (81.5%–96.1%) | 8.97% (3.69%–17.6%) | 90.5% (81.5%–96.1%) | 92.3% (84.0%–97.1%) |
Values in parentheses indicate the 95% confidence intervals.
Fig. 3.
Model performance of each algorithm in the training (A) and test datasets (B). AUC, area under the curve.
Differences in sensitivities among stages and cohorts
The sensitivities for each stage in the training and test datasets for the 3 ML models (svmLinear, svmRadial, and nnet) are listed in Table 3. In the training dataset, the sensitivities of svmLinear and nnet were identical at 45.5% and 59.6% for stages 0/I and III, respectively. For stage II, svmLinear yielded a sensitivity of 55.3%, whereas nnet showed a sensitivity of 70.5% for stage IV, which were the highest among the models for these respective stages (Table 3). The sensitivities of svmRadial were lower than those of the svmLinear and nnet at all stages. In the test dataset, svmLinear showed the highest sensitivity for all stages: 41.6% for stage 0/I, 62.5% for stage II, 60.5% for stage III, and 73.3% for stage IV (Table 3). In addition, the sensitivities of the ML models were higher in the advanced stages than in the early stages (Table 3). To evaluate the robustness of the models, we calculated sensitivities using leave-one-cohort-out cross-validation (LOCOCV), taking into account the limited sample sizes in some cohorts. LOCOCV demonstrated that comparable sensitivities were obtained across all combinations in which each cohort was excluded in turn (Table 4).
Table 3.
Sensitivities for each stage and each model in the training and test datasets
| Dataset | Model | Stage | |||
|---|---|---|---|---|---|
| 0/I | II | III | IV | ||
| Training | svmLinear | 45.5% (28.1%–63.6%) | 55.3% (38.3%–71.4%) | 59.6% (45.8%–72.4%) | 68.2% (52.4%–81.4%) |
| svmRadial | 33.3% (18.0%–51.8%) | 42.1% (26.3%–59.2%) | 52.6% (39.0%–66.0%) | 63.6% (47.8%–77.6%) | |
| nnet | 45.5% (28.1%–63.6%) | 50.0% (33.4%–66.6%) | 59.6% (45.8%–72.4%) | 70.5% (54.8%–83.2%) | |
| Test | svmLinear | 41.6% (30.4%–53.4%) | 62.5% (40.6%–81.2%) | 60.5% (44.4%–75.0%) | 73.3% (44.9%–92.2%) |
| svmRadial | 35.1% (24.5%–46.8%) | 58.3% (36.6%–77.9%) | 46.5% (31.2%–62.3%) | 66.7% (38.4%–88.2%) | |
| nnet | 41.6% (30.4%–53.4%) | 62.5% (40.6%–81.2%) | 58.1% (42.1%–73.0%) | 66.7% (38.4%–88.2%) | |
Values in parentheses indicate the 95% confidence intervals.
Number of samples in the training dataset: stage 0/I (n = 33), stage II (n = 38), stage III (n = 57), and stage IV (n = 44).
Number of samples in the test dataset: stage 0/I (n = 77), stage II (n = 24), stage III (n = 43), and stage IV (n = 15).
Table 4.
Sensitivities in leave-one-cohort-out cross-validation (LOCOCV)
| Model | Excluded cohort | |||||||
|---|---|---|---|---|---|---|---|---|
| THO | KOU | SUH | TWM | TMD | SCC | TJU | NUM | |
| svmLinear | 53.4% (44.5%–62.1%) | 55.5% (46.7%–64.0%) | 52.1% (42.8%–61.3%) | 51.6% (43.5%–59.6%) | 54.4% (44.8%–63.7%) | 51.9% (43.8%–59.8%) | 51.6% (43.5%–59.6%) | 52.5% (44.5%–60.4%) |
| svmRadial | 43.6% (35.0%–52.5%) | 48.2% (39.6%–56.9%) | 43.7% (34.6%–53.1%) | 43.9% (36.0%–52.1%) | 47.4% (37.9%–56.9%) | 43.8% (35.9%–51.8%) | 44.0% (36.2%–52.1%) | 44.4% (36.6%–52.4%) |
| nnet | 49.6% (40.8%–58.4%) | 54.7% (46.0%–63.3%) | 51.3% (41.9%–60.5%) | 50.3% (42.2%–58.4%) | 54.4% (44.8%–63.7%) | 50.6% (42.6%–58.6%) | 50.9% (42.9%–58.9%) | 51.2% (43.3%–59.2%) |
Values in parentheses indicate the 95% confidence intervals.
THO, Toho University School of Medicine; KOU, Keio University School of Medicine; SUH, Showa University School of Medicine; TWM, Tokyo Women’s Medical University; TMD, Tokyo Medical and Dental University; SCC, Saitama Cancer Center; TJU, Jikei University School of Medicine; NUM, Nihon University School of Medicine.
Discussion
In this study, we constructed 5 ML models to diagnose ESCC or HCs using the serum levels of 6 TAAbs as independent variables. Among these, 3 models (svmLinear, svmRadial, and nnet) achieved higher sensitivities and specificities for the training and test datasets than those in our previous study. For individual TAAbs, the maximum sensitivity without distinguishing stages was approximately 30% [8]. In contrast, the present models outperformed any individual TAAb in terms of sensitivity. In particular, the sensitivity for identifying stage 0/I in the test dataset exceeded 40% with the svmLinear and nnet models. Thus, the ML approach using TAAbs improves the diagnostic performance for ESCC. In addition, the consistent performance of our models in LOCOCV suggests their ability to maintain diagnostic accuracy across multiple medical institutions.
Li et al. [15] developed an ML model that diagnosed ESCC using 5 TAAbs (CAST, HDAC1, HSF1, PTMS, and ZPR1). The final performance in the test set included a sensitivity of 66.7% and 73.8% in the early and advanced stages, respectively, and a specificity of 87%. Xu et al. [16] constructed a model for the same purpose using 4 TAAbs (LDOA, ENO1, p53, and NY-ESO-1). The final performance in external set 2 included a sensitivity of 67.3% or 61.4% in the early stage and a specificity of 90.1% or 95.0%. Because there is generally a trade-off between sensitivity and specificity, improving one usually compromises the other. However, the ML model constructed by Xu et al. [16] showed higher sensitivity and specificity compared with the model constructed by Li et al. [15], indicating that the model developed by the former had better performance. In this study, we built ML models for the diagnosis of ESCC using 6 TAAbs (c-myc, NY-ESO-1, p53, p62, RalA, and Sui1). Therefore, p53 and NY-ESO-1 overlapped with the independent variables used in the ML model of Xu et al. [16], suggesting that autoantibodies against p53 and NY-ESO-1 are important for predicting ESCC using ML models.
The p53 gene is mutated in most solid cancers, leading to frequent mutations and abnormal expression of the p53 gene and protein in these diseases [17]. The generation of p53 autoantibodies is hypothesized to be related to p53 protein overexpression, and missense mutations in the TP53 gene lead to the loss of protein function [16,18,19]. NY-ESO-1 is another commonly studied autoantibody. A previous study suggested that the NY-ESO-1 antigen elicits humoral immune responses not only by mutant but also by wild-type epitopes [16,20]. In our previous studies, we reported the presence of p53 autoantibodies in many cancer types [17,21,22]. In addition, a high positive rate of the NY-ESO-1 autoantibody has been reported not only in esophageal cancer but also in other cancer types [17,23–25]. Furthermore, we previously showed that serum levels of these TAAbs were higher in patients with hepatocellular carcinoma (HCC) or breast cancer than in HCs [26,27]. Therefore, autoantibodies against p53 and NY-ESO-1 may be useful for improving the diagnostic performance of ML models, not only for ESCC but also for other cancer types.
Although the autoantibodies against c-myc, p62, RalA, and Sui1 did not overlap with the independent variables used in the ML models built by Li et al. [15] and Xu et al. [16], their serum levels in patients with ESCC and HCC were higher than those in HCs [8,27]. In breast cancer, the serum levels of autoantibodies against RalA and Sui1 are higher than those in HCs, whereas those of autoantibodies against c-myc and p62 are not elevated [26]. Therefore, when constructing ML models for cancer diagnosis, autoantibodies against c-myc, p62, RalA, and Sui1 may be effective for specific cancer types, rather than being broadly effective across various cancers.
It is easier to construct ML models with a high diagnostic performance for ESCC because the pathological differences between ESCC and normal tissue are relatively clear. In contrast, most patients with HCC experience progression from a normal liver to chronic hepatitis (CH) and liver cirrhosis (LC) before tumorigenesis. Therefore, diagnostic models must not only distinguish between normal liver tissue and HCC but also between HCC and CH or LC. Moreover, from a clinical perspective, distinguishing HCC from CH or LC is more meaningful than distinguishing normal from HCC. Nevertheless, Wu et al. [28] measured various TAAbs in the serum of HCs and patients with HCC or LC and constructed diagnostic ML models. The diagnostic performance was relatively high for distinguishing normal tissue from HCC but not sufficiently high for distinguishing HCC from LC. Thus, despite its clinical importance, the construction of a practical ML model that accurately distinguishes HCC from LC (and CH) is considered highly challenging. Wu et al. [28] did not use autoantibodies against c-myc, NY-ESO-1, p53, p62, RalA, or Sui1 as independent variables when constructing their ML models. Further studies are warranted because the utility of autoantibodies against these antigens in building ML models to distinguish HCC from LC or HCC from CH remains unclear.
Conclusion
In this study, we developed and evaluated ML-based diagnostic models for ESCC using the serum levels of 6 TAAbs. Compared with conventional single-marker approaches, the ML models, particularly those based on svmLinear, svmRadial, and nnet, demonstrated substantially improved diagnostic performance with robust generalizability across independent test datasets. Notably, these models achieved sensitivities exceeding 50% at high specificity cutoffs (>90%), representing a marked improvement over the limited diagnostic performance of individual TAAbs such as anti-p53 antibodies alone. Importantly, enhanced performance was also observed in early-stage ESCC, where early detection is clinically critical yet remains challenging using existing biomarkers (Fig. 4).
Fig. 4.
Overview of the study design and conclusions. TAAbs, tumor-associated autoantibodies; ESCC, esophageal squamous cell carcinoma; HCs, healthy controls; ML, machine learning.
Overall, our findings demonstrate that the ML-based integration of multiple TAAbs offers a promising and practical strategy for improving the serological diagnosis of ESCC. This approach may complement the existing diagnostic modalities and has the potential to be extended to other cancer types, warranting further validation in prospective and clinically diverse cohorts.
Acknowledgments
Funding: This study was supported by research grants from Medical & Biological Laboratories Co. Ltd. to H.S. and the Japan Agency for Medical Research and Development (AMED) under grant number JP23ck0106885 to Y.N.
Author contributions: Y.N. and H.S. conceived and designed the experiments. Y.N., F.S., T.S., S.Y., and T.T. performed the experiments and analyzed the data. Y.N. contributed to the writing of the draft manuscript. F.S., T.S., S.Y., T.T., and H.S. reviewed and edited the draft manuscript. All authors discussed the data and the manuscript. All authors have read and approved the final manuscript.
Competing interests: H.S. has received research grants from Medical & Biological Laboratories Co. Ltd., Nagoya, Japan. The other authors declare that they have no competing interests.
Data Availability
All data supporting the findings of this study are available within the paper and the Supplementary Materials. All analyses were performed using the cited software, packages, and pipelines, whose codes are publicly available. Further information about the code used in this study can be obtained from the corresponding author on reasonable request.
Supplementary Materials
Figs. S1 to S4
Table S1
References
- 1.Alam A, Siddiqui MF, Hamoudi R, Kishore U, Fernandez-Cabezudo MJ, Al-Ramadi BK. A systems biology approach to unveil shared therapeutic targets and pathological pathways across major human cancers. Comput Struct Biotechnol J. 2025;27:5459–5478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Zhong L, Coe SP, Stromberg AJ, Khattar NH, Jett JR, Hirschowitz EA. Profiling tumor-associated antibodies for early detection of non-small cell lung cancer. J Thorac Oncol. 2006;1(6):513–519. [PubMed] [Google Scholar]
- 3.Hanash S. Harnessing immunity for cancer marker discovery. Nat Biotechnol. 2003;21(1):37–38. [DOI] [PubMed] [Google Scholar]
- 4.Kikuchi Y, Shimada H, Hatanaka Y, Kinoshita I, Ikarashi D, Nakatsura T, Kitano S, Naito Y, Tanaka T, Yamashita K, et al. Clinical practice guidelines for molecular tumor markers, 2nd edition review part 1. Int J Clin Oncol. 2024;29(1):1–19. [DOI] [PubMed] [Google Scholar]
- 5.Kikuchi Y, Shimada H, Yamasaki F, Yamashita T, Araki K, Horimoto K, Yajima S, Yashiro M, Yokoi K, Cho H, et al. Clinical practice guidelines for molecular tumor marker, 2nd edition review part 2. Int J Clin Oncol. 2024;29(5):512–534. [DOI] [PubMed] [Google Scholar]
- 6.Tachimori Y, Ozawa S, Numasaki H, Fujishiro M, Matsubara H, Oyama T, Shinoda M, Toh Y, Udagawa H, Uno T, et al. Comprehensive registry of esophageal cancer in Japan, 2009. Esophagus. 2016;13(2):110–137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Watanabe M, Otake R, Kozuki R, Toihata T, Takahashi K, Okamura A, Imamura Y. Recent progress in multidisciplinary treatment for patients with esophageal cancer. Surg Today. 2020;50(1):12–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Shiratori F, Fujiwara N, Nakajima Y, Otsuka K, Murakami M, Matsuda S, Kawakubo H, Takahashi K, Tanishima Y, Oka D, et al. Prognostic significance of six autoantibodies in esophageal squamous cell carcinoma: A prospective multi-institutional study. Ann Gastroenterol Surg. 2025;9(4):658–667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Takashi S, Satoshi Y, Akihiko O, Naoya Y, Yusuke T, Kentaro M, Yu O, Yasuaki N, Koichi Y, Takashi F, et al. Clinical impact of preoperative serum p53 antibody titers in 1487 patients with surgically treated esophageal squamous cell carcinoma: A multi-institutional study. Esophagus. 2021;18(1):65–71. [DOI] [PubMed] [Google Scholar]
- 10.Tsai C-L, Mukundan A, Chung C-S, Chen Y-H, Wang Y-K, Chen T-H, Tseng YS, Huang CW, Wu IC, Wang HC. Hyperspectral imaging combined with artificial intelligence in the early detection of esophageal cancer. Cancer. 2021;13(18): Article 4593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Liao W-C, Mukundan A, Sadiaza C, Tsao Y-M, Huang C-W, Wang H-C. Systematic meta-analysis of computer-aided detection to detect early esophageal cancer using hyperspectral imaging. Biomed Opt Express. 2023;14(8):4383–4405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wu IC, Syu HY, Jen CP, Lu MY, Chen YT, Wu MT, Kuo CT, Tsai YY, Wang HC. Early identification of esophageal squamous neoplasm by hyperspectral endoscopic imaging. Sci Rep. 2018;8(1):13797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: Synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321–357. [Google Scholar]
- 14.Nojima Y, Mizuguchi K. Identification of the differences in molecular networks between idiopathic pulmonary fibrosis and lung squamous cell carcinoma using machine learning. Comput Biol Chem. 2025;119: Article 108560. [DOI] [PubMed] [Google Scholar]
- 15.Li T, Sun G, Ye H, Song C, Shen Y, Cheng Y, Zou Y, Fang Z, Shi J, Wang K, et al. ESCCPred: A machine learning model for diagnostic prediction of early esophageal squamous cell carcinoma using autoantibody profiles. Br J Cancer. 2024;131(5):883–894. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Xu YW, Peng YH, Liu CT, Chen H, Chu LY, Chen HL, Wu ZY, Wei WQ, Xu LY, Wu FC, et al. Machine learning technique-based four-autoantibody test for early detection of esophageal squamous cell carcinoma: A multicenter, retrospective study with a nested case–control study. BMC Med. 2025;23(1):235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Hoshino I, Nabeya Y, Takiguchi N, Gunji H, Ishige F, Iwatate Y, Shiratori F, Yajima S, Okada R, Shimada H. Prognostic impact of p53 and/or NY-ESO-1 autoantibody induction in patients with gastroenterological cancers. Ann Gastroenterol Surg. 2020;4(3):275–282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Brosh R, Rotter V. When mutants gain new powers: News from the mutant p53 field. Nat Rev Cancer. 2009;9(10):701–713. [DOI] [PubMed] [Google Scholar]
- 19.Butt J, Blot WJ, Visvanathan K, Le Marchand L, Wilkens LR, Chen Y, Sesso HD, Teras L, Ryser MD, Hyslop T, et al. Auto-antibodies to p53 and the subsequent development of colorectal cancer in a U.S. prospective cohort consortium. Cancer Epidemiol Biomarkers Prev. 2020;29(12):2729–2734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Chen WS, Haynes WA, Waitz R, Kamath K, Vega-Crespo A, Shrestha R, Zhang M, Foye A, Baselga Carretero I, Perez Garcilazo I, et al. Autoantibody landscape in patients with advanced prostate cancer. Clin Cancer Res. 2020;26(23):6204–6214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Shimada H, Ochiai T, Nomura F. Titration of serum p53 antibodies in 1085 patients with various types of malignant tumors. Cancer. 2003;97(3):682–689. [DOI] [PubMed] [Google Scholar]
- 22.Shimada H. p53 molecular approach to diagnosis and treatment of esophageal squamous cell carcinoma. Ann Gastroenterol Surg. 2018;2(4):266–273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Oshima Y, Shimada H, Yajima S, Nanami T, Matsushita K, Nomura F, Kainuma O, Takiguchi N, Soda H, Ueda T, et al. NY-ESO-1 autoantibody as a tumor-specific biomarker for esophageal cancer: Screening in 1969 patients with various cancers. J Gastroenterol. 2016;51(1):30–34. [DOI] [PubMed] [Google Scholar]
- 24.Healey GF, Lam S, Boyle P, Hamilton-Fairley G, Peek LJ, Robertson JFR. Signal stratification of autoantibody levels in serum samples and its application to the early detection of lung cancer. J Thorac Dis. 2013;5(5): Article 618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Yang Z, Chevolot Y, Géhin T, Solassol J, Mange A, Souteyrand E, Laurenceau E. Improvement of protein immobilization for the elaboration of tumor-associated antigen microarrays: Application to the sensitive and specific detection of tumor markers from breast cancer sera. Biosens Bioelectron. 2013;40(1):385–392. [DOI] [PubMed] [Google Scholar]
- 26.Sumazaki M, Ogata H, Nabeya Y, Kuwajima A, Hiwasa T, Shimada H. Multipanel assay of 17 tumor-associated antibodies for serological detection of stage 0/I breast cancer. Cancer Sci. 2021;112(5):1955–1962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Okada R, Otsuka Y, Yokosuka O, Kato N, Imazaki F, Hoshino I, Sugiura N, Mizumoto H, Azemoto R, Kato K, et al. Six autoantibodies as potential differential biomarkers of hepatocellular carcinoma vs. liver cirrhosis and chronic hepatitis: A prospective multi-institutional study. Oncol Lett. 2022;24(4): Article 367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Wu J, Wang P, Han Z, Li T, Yi C, Qiu C, Yang Q, Sun G, Dai L, Shi J, et al. A novel immunodiagnosis panel for hepatocellular carcinoma based on bioinformatics and the autoantibody-antigen system. Cancer Sci. 2022;113(2):411–422. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figs. S1 to S4
Table S1
Data Availability Statement
All data supporting the findings of this study are available within the paper and the Supplementary Materials. All analyses were performed using the cited software, packages, and pipelines, whose codes are publicly available. Further information about the code used in this study can be obtained from the corresponding author on reasonable request.





