A novel transfer-learning based physician-level general and subtype classifier for non-small cell lung cancer

Bingzhang Qiao; Kawuli Jumai; Julaiti Ainiwaer; Madinyat Niyaz; Yingxin Zhang; Yuqing Ma; Liwei Zhang; Wesley Luh; Ilyar Sheyhidin

Heliyon. 2022 Nov 29;8(12):e11981. doi: 10.1016/j.heliyon.2022.e11981

PMCID: PMC9727670 PMID: 36506384

Abstract

Confirming the histologic patterns of lung carcinoma, and in particular the subtypes of lung adenocarcinoma, is important for determining a patient's prognosis and treatment options. The task is challenging and often requires the input of experienced pathologists, among whom interobserver concordance is itself low. Computer-aided diagnosis holds the potential to shorten the time to diagnosis. Because many adenocarcinoma tissue samples contain multiple histologic patterns, accurate computer-aided diagnosis requires annotations manually labeled by pathologists. We propose a method that combines weakly supervised learning and integrated learning through transfer learning, using two datasets, The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC), to reduce the need for manual annotation by a pathologist while maintaining accuracy. Whole-slide images (WSIs) are first classified as either adenocarcinoma or squamous cell carcinoma. The subtypes are then identified by generating weak classifiers for each subtype and using integrated learning to combine them into a strong classifier.

Our model was evaluated on independent data from the CPTAC dataset and on a dataset from a private hospital. It achieved AUC values of 0.86, 0.91, 0.82, 0.77, 0.96, and 0.98 for the acinar, lepidic (LPA), micropapillary, papillary, solid, and normal classes, respectively.

Keywords: Lung adenocarcinoma, Adenocarcinoma subtype classification, Transfer learning, Weak supervised learning, The cancer genome atlas, Squamous cell carcinoma, NSCLC

1. Introduction

Lung cancer is the most common malignant tumor in the world and a leading cause of death among cancer patients [1, 2]. Lung adenocarcinoma and squamous cell carcinoma are the primary types of non-small cell lung cancer (NSCLC), of which adenocarcinoma accounts for almost half of all cases [3]. The most serious invasive lung adenocarcinomas typically consist of complex mixtures of multiple patterns [4]. In 2015, the World Health Organization released guidelines with five subtype patterns: lepidic, acinar, papillary, micropapillary, and solid. These patterns differ in prognosis, which may help identify candidates for adjunctive therapy [4, 5, 6].

Identifying subtle histopathological patterns in complex tissue images under a microscope is a time-consuming and subjective process [7, 8]. Because the classification is complex and subjective, concordance between pathologists is low regardless of the image source, whether microscope or WSI. Notably, in a study classifying classical and difficult images of lung adenocarcinoma subtypes, the Cohen's kappa (κ) among 26 lung cancer pathologists was 0.77 ± 0.07 for the classical images and 0.38 ± 0.14 for the difficult examples [10].

Recent studies have shown that advanced deep learning algorithms can enhance pathological image analysis across a multitude of tasks, such as discriminating cancer subtypes [12], identifying tumor regions [13], semantic segmentation [14], detecting tumor metastasis [15], and mitotic counting [16]. Deep learning has also been applied across different types of cancer [17, 18, 19]. With regard to lung cancer, studies from 2016 and 2017 [20, 21] have shown that morphological features of WSIs can be used to predict the prognosis of lung cancer. We believe that artificial intelligence can assist the pathologist, working alongside rather than replacing them. Wei et al., in their 2019 study, first attempted to automatically classify histological subtypes of lung cancer on stained sections using emerging deep learning techniques. They demonstrated that deep learning models could potentially help pathologists improve the classification of lung adenocarcinoma subtypes by automatically screening and highlighting cancer areas, with objective, physician-level results [22].

This study expands the dataset to include The Cancer Genome Atlas (TCGA) [23], one of the largest publicly available pathological image datasets, and the Clinical Proteomic Tumor Analysis Consortium (CPTAC) [24]. Images are selected for each cancer type, and weakly supervised learning and integrated learning are fused with transfer learning to create a model that uses a voting strategy to determine the classification.

2. Methods

2.1. Overview

The experimental flow of our lung adenocarcinoma subtype classifier is shown in Figure 1.

Figure 1. Experimental process of the 6-class ASC model. **Data processing:** The WSIs are first sliced according to the doctors' labels to obtain the tiles of each subtype. The patients are then divided into training patients and test patients, and the tiles are assigned to the training or test set according to the patient they come from. A resampling strategy then selects the training patients of each sub-model from the pool of training patients; the remaining patients serve as validation patients, which are used to control the training time of the sub-model (A). **Training:** Each sub-model is trained with its own training and validation sets, and its result is fed into the bagging model as input (B). **Testing:** The test tiles are passed through the trained sub-models to obtain the subtype predicted by each sub-model, and the sub-models then vote to produce the merged result.

First, images for adenocarcinoma and squamous cell carcinoma from the TCGA dataset are used to create a lung carcinoma classifier.

The adenocarcinoma subtype classifier (ASC) is then created using transfer learning from the lung carcinoma classifier model. The ASC consists of 10 different learners trained with the bagging strategy, which vote to determine the classification among six different classes. The data for the ASC come from the CPTAC dataset and were manually labelled by pathologists from the First Affiliated Hospital of Xinjiang Medical University.

2.2. Data collection

Two open datasets, the Clinical Proteomic Tumor Analysis Consortium (CPTAC) and the Cancer Genome Atlas (TCGA) are the data sources for this experiment. To reduce the amount of labelling needed to be done by doctors, TCGA lung cancer data is used to establish a classifier for lung adenocarcinoma and squamous cell carcinoma. The pre-training dataset is composed of 822 Lung Adenocarcinoma (LUAD), 751 Lung Squamous Cell Carcinoma (LUSC), and 591 Normal WSIs. This provides pre-training weights for the ASC.

Four pathologists from the First Affiliated Hospital of Xinjiang Medical University labelled specific areas of the CPTAC WSIs for lung cancer subtypes, which were subsequently used for both the training and testing sets of the models. The CPTAC dataset was chosen over the TCGA dataset because it contains multiple subtypes of adenocarcinoma. All adenocarcinoma images were annotated with the following subtypes: acinar, lepidic, papillary, micropapillary, and solid. A small number of adenocarcinoma WSIs were added to balance the model data, for a total of 113 WSIs from 65 patients. 14 WSIs from 8 patients containing all subtypes were selected as testing patients to assess the ability of the model, and the 99 WSIs from the remaining 56 patients were used for training the model.

2.3. Data pre-processing

For the carcinoma classifier model, TCGA WSIs were sliced into 512 × 512 pixel sized tiles at 5x magnification. Then, each tile was given the same label as the WSI that it came from.

For the ASC, the WSIs from the CPTAC dataset were manually labelled by three doctors of thoracic surgery from the First Affiliated Hospital of Xinjiang Medical University. Each doctor annotated regions of a WSI with the subtypes they judged appropriate.

First, the annotated areas were sliced into 512 × 512 pixel tiles at 5x magnification. Boundary tiles and tiles that were primarily background were removed. To balance the amount of data in each subtype, tiles were sliced with overlap, so some pixels may appear in multiple tiles. The number of tiles for each type was normalized to the number of tiles labelled “Solid”, the most abundant type. Tiles sliced from outside the labelled area were used for the “Normal” label.
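
The slicing step described above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes a rectangular region, and the `stride`, `white_level`, and `max_fraction` values are hypothetical, since the text states neither the amount of overlap nor the background criterion.

```python
def tile_origins(width, height, tile=512, stride=256):
    """Return top-left (x, y) corners of tiles lying fully inside the region.

    A stride smaller than the tile size yields overlapping tiles, so some
    pixels appear in more than one tile. Tiles that would cross the region
    boundary are never generated. The stride value is an assumption.
    """
    if tile <= 0 or stride <= 0:
        raise ValueError("tile and stride must be positive")
    xs = range(0, width - tile + 1, stride)
    ys = range(0, height - tile + 1, stride)
    return [(x, y) for y in ys for x in xs]


def is_mostly_background(pixels, white_level=220, max_fraction=0.5):
    """Flag a tile as background when too many grayscale pixels are near white.

    `pixels` is a flat list of grayscale values (0-255). Both thresholds are
    hypothetical and would need tuning on real slides.
    """
    bright = sum(1 for p in pixels if p >= white_level)
    return bright / len(pixels) > max_fraction


overlapping = tile_origins(1024, 512, tile=512, stride=256)
non_overlapping = tile_origins(1024, 512, tile=512, stride=512)
```

Halving the stride in this sketch yields three tiles instead of two along the same 1024-pixel strip, which is how overlap can raise the tile count of an under-represented subtype.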

Table 1 contains the number of tiles that are used for training and testing of the ASC. We utilize the Bagging method, which involves creating multiple models. The mean and standard deviation represent the distribution of tiles between the ten different models created under bagging. Bagging is described further below.

Table 1.

Mean (μ) and standard deviation (σ) of the number of WSIs/tiles in the training and validation sets used by all learners for each subtype, and the number of WSIs/tiles in the test set for each subtype.

Class	WSI μ_train	WSI σ_train	WSI μ_valid	WSI σ_valid	WSI test	Tiles μ_train	Tiles σ_train	Tiles μ_valid	Tiles σ_valid	Tiles test
Acinar	22	2	12	2	5	773	180	452	180	624
Lepidic	5	0	3	0	2	528	114	261	114	562
Micropapillary	13	2	9	2	4	501	147	333	147	431
Papillary	28	2	15	2	4	823	161	494	161	447
Solid	25	3	14	3	6	959	247	495	247	591
Background	36	3	18	3	14	960	134	403	133	770
Total	99				14	-				3425

2.4. Classifier

DeepPATH (DP) code adapted from Coudray et al. [12] and Deepslide (DS) code adapted from Wei et al. [22] are used as the lung carcinoma classifier and the adenocarcinoma subtype classifier (ASC). The DeepPATH model is based on the Inception v3 architecture, with 5 initial convolution nodes combined with 2 max pooling operations and followed by 11 stacks of inception modules. It ends with a fully connected layer and then a softmax output layer. The Deepslide model is based on an 18-layer ResNet with a multi-class cross-entropy loss function.

2.5. Bootstrap aggregating (bagging)

This study uses bootstrap aggregating (bagging), an integrated (ensemble) learning method, to combine weak learners into a strong one. Bagging was proposed by Breiman to reduce the variance of learning algorithms [25]. Given a model, bagging draws samples with replacement from the training population several times, uses each drawn sample to build a separate model (a weak learner), and finally aggregates the results of the weak learners by the mean (for regression problems) or by majority voting (for classification problems) to obtain a strong learner [25, 26]. In this study, we use majority voting.
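
The two ingredients of bagging, sampling with replacement and majority voting, can be illustrated with a short sketch. The labels, the number of items, and the tie-breaking rule below are assumptions made for the example and are not taken from the study.

```python
import random
from collections import Counter


def bootstrap_sample(items, rng):
    """Draw len(items) elements from items with replacement."""
    return [rng.choice(items) for _ in items]


def majority_vote(predictions):
    """Return the label predicted by the most weak learners.

    Ties go to the tied label that appears first in `predictions`;
    this tie-breaking rule is an assumption.
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


rng = random.Random(0)
train_ids = list(range(10))
# One bootstrap sample per weak learner (10 learners, as in the ASC).
samples = [bootstrap_sample(train_ids, rng) for _ in range(10)]

# Hypothetical predictions of five weak learners for a single tile.
votes = ["Solid", "Acinar", "Solid", "Papillary", "Solid"]
final_label = majority_vote(votes)
```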

2.6. Data partitioning

Although overlapping is used when slicing, the number of tiles is still unbalanced. The number of tiles contributed by different patients in each subtype varies greatly, so to ensure diversity of data, an upper limit is set on the number of tiles that any single patient can provide. Within each subtype, a resampling strategy is used to select patients for the training and validation sets, so that each weak learner sees a similar number of patients but a different set of patients. The mean and standard deviation of the numbers of WSIs and tiles in the training and validation sets used by the learners for each subtype are shown in Table 1.
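
A minimal sketch of this partitioning is given below. The patient identifiers, the 2/3 training fraction, and the cap of 10 tiles are hypothetical values chosen for illustration, and the random split shown here is only one possible form of the resampling strategy.

```python
import random


def split_patients(patient_ids, train_fraction, rng):
    """Randomly pick training patients; the rest become validation patients."""
    shuffled = list(patient_ids)
    rng.shuffle(shuffled)
    n_train = round(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]


def cap_tiles(tiles_by_patient, patients, max_per_patient, rng):
    """Collect tiles for the given patients, keeping at most max_per_patient each."""
    selected = []
    for pid in patients:
        tiles = tiles_by_patient[pid]
        if len(tiles) > max_per_patient:
            tiles = rng.sample(tiles, max_per_patient)
        selected.extend(tiles)
    return selected


rng = random.Random(42)
# Six hypothetical patients contributing 5, 10, ..., 30 tiles of one subtype.
tiles_by_patient = {
    f"P{i}": [f"P{i}_tile{j}" for j in range(5 * (i + 1))] for i in range(6)
}
train_patients, valid_patients = split_patients(tiles_by_patient, 2 / 3, rng)
train_tiles = cap_tiles(tiles_by_patient, train_patients, 10, rng)
```

Calling `split_patients` again with a different seed gives another weak learner a different set of patients of the same size. Splitting by patient rather than by tile keeps tiles from one patient out of both sets at once.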

2.7. Model training using transfer learning

Previous research [12] has shown that the Inception v3 [27] architecture distinguishes lung adenocarcinoma from squamous cell carcinoma well. In this experiment, the parameters obtained from the pre-training model are used as the initialization weights of the six-class classification model, except for the last layer. The pre-training model was trained for 230,000 steps, and the ASC model uses early stopping. The loss function is the cross entropy between the predicted probability and the true class label, and the optimization algorithm is RMSPropOptimizer with an initial learning rate of 0.001, a decay of 0.9, a momentum of 0.9, and an epsilon of 1.0.
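
The weight initialization described above can be sketched with plain dictionaries standing in for model parameters. The layer names (`conv1/w`, `logits/w`) and values are hypothetical, and real frameworks provide their own checkpoint-loading mechanisms; the sketch only shows which parameters are copied and which are left freshly initialized.

```python
def init_from_pretrained(pretrained, fresh, skip_prefix):
    """Copy pretrained parameters into a new model, except the final layer.

    Parameters are dicts mapping names to lists of floats. Any name starting
    with `skip_prefix` (the last layer) keeps its fresh initialization,
    because the number of output classes differs between the two models.
    """
    initialised = {}
    for name, value in fresh.items():
        if name.startswith(skip_prefix) or name not in pretrained:
            initialised[name] = list(value)
        else:
            initialised[name] = list(pretrained[name])
    return initialised


# Pre-training model: 3 output classes (LUAD, LUSC, Normal).
pretrained = {"conv1/w": [0.5, -0.2], "logits/w": [0.1, 0.2, 0.3]}
# ASC model: 6 output classes, freshly initialized.
fresh = {"conv1/w": [0.0, 0.0], "logits/w": [0.0] * 6}

asc_params = init_from_pretrained(pretrained, fresh, skip_prefix="logits")
```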

2.8. Test process and visualization of results

A comprehensive assessment of the model is carried out at both the WSI and patch levels. Since the model is trained at the patch level, testing is also conducted on patches cut from the WSIs. For each patch, the predicted probability of each subtype is computed, and the category with the maximum predicted probability is taken as the prediction.

Because each test patch has a true subtype label from the pathologist, this study uses the confusion matrix to evaluate the model's predictions across all subtypes. We use ROC curves to reflect how well the model's subtype predictions generalize. At the WSI level, the predictions for all patches on the same WSI are aggregated. To account for both quantity and probability, we sum the probabilities of all patches for their given subtype and use the result as the criterion for predicting the predominant and secondary components, which are then compared with the results provided by doctors.
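
One reading of this WSI-level rule is sketched below: each patch contributes the probability of its predicted (maximum-probability) class to that class's total, and the classes are ranked by total. Excluding the "Normal" class from the ranking is an assumption, as are the example probabilities.

```python
def rank_subtypes(patch_probs, ignore=("Normal",)):
    """Rank subtypes on one WSI by the summed probability of their patches.

    `patch_probs` is a list of dicts mapping class name to probability.
    Each patch adds the probability of its predicted class to that class.
    Patches predicted as a class in `ignore` are skipped (an assumption).
    """
    totals = {}
    for probs in patch_probs:
        label = max(probs, key=probs.get)  # predicted class of this patch
        if label in ignore:
            continue
        totals[label] = totals.get(label, 0.0) + probs[label]
    return sorted(totals, key=totals.get, reverse=True)


# Hypothetical softmax outputs for four patches of one WSI.
patches = [
    {"Acinar": 0.7, "Solid": 0.2, "Normal": 0.1},
    {"Acinar": 0.6, "Solid": 0.3, "Normal": 0.1},
    {"Acinar": 0.1, "Solid": 0.8, "Normal": 0.1},
    {"Acinar": 0.2, "Papillary": 0.1, "Normal": 0.7},
]
ranking = rank_subtypes(patches)
predominant, secondary = ranking[0], ranking[1]
```

In this example two confident acinar patches outweigh one very confident solid patch, so the sum reflects both how many patches a subtype has and how confident the model is in them.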

We visualize the histological patterns of lung adenocarcinoma detected on whole-slide images. The tiles of a WSI are covered with color blocks representing the predicted categories, and an overlay is then generated on the original WSI image. This visualization displays the model's predictions directly and provides an easy-to-understand reference for doctors. If the doctor's annotation is provided, the model's predictions can be evaluated at the same time, as shown in Figure 2.

Figure 2. Heatmaps generated from the lung carcinoma classifier (A.i-iv) and the ASC model (B.i-iv). The latter also show the doctor's annotated curve for comparison with the model results.
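
The color-block step of such a visualization can be sketched as a grid with one cell per tile. The palette in `CLASS_COLORS` is hypothetical, tiles are assumed not to overlap, and blending the grid with the slide image is left out.

```python
# Hypothetical RGB palette, one color per predicted class.
CLASS_COLORS = {
    "Acinar": (255, 0, 0),
    "Lepidic": (0, 255, 0),
    "Micropapillary": (0, 0, 255),
    "Papillary": (255, 255, 0),
    "Solid": (255, 0, 255),
    "Normal": (200, 200, 200),
}


def prediction_grid(tile_predictions, n_cols, n_rows, tile=512,
                    blank=(255, 255, 255)):
    """Build a row-major grid of RGB colors, one cell per tile position.

    `tile_predictions` maps the top-left pixel (x, y) of a tile to its
    predicted class. Positions without a prediction keep the blank color.
    """
    grid = [[blank] * n_cols for _ in range(n_rows)]
    for (x, y), label in tile_predictions.items():
        grid[y // tile][x // tile] = CLASS_COLORS[label]
    return grid


predictions = {(0, 0): "Solid", (512, 0): "Acinar", (0, 512): "Normal"}
grid = prediction_grid(predictions, n_cols=2, n_rows=2)
```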

3. Results

3.1. Accurate prediction of subtype regions

This experiment demonstrates that the proposed method can directly predict the subtype and location of adenocarcinoma in a WSI. First, our pre-training model judges whether a WSI contains adenocarcinoma or squamous cell carcinoma. If it is adenocarcinoma, our ASC subtype model then predicts the subtype categories and their locations. The experimental results show that the ASC subtype model can accurately predict the location of each subtype and the primary subtype category, as shown in Figure 2.

At the WSI level, the number of tiles and the predictions for the tiles on a WSI are used to determine the predominant and minor subtypes. The model reaches 75% accuracy for the predominant subtype and 67% accuracy for the secondary subtype.

3.2. Transfer learning improves model accuracy

Because the training data of the ASC subtype model must be annotated by pathologists to provide ground truth labels, preparing it is highly time-consuming and laborious. In this experiment, the pre-training model, which needs no additional annotated data, provides pre-training weights for the ASC subtype model. This improves the accuracy of the model and reduces the amount of detailed annotated data required. Table 2 shows that the strong learners (bagging rows) achieve better results than the average weak learner (mean rows). The DP rows use the DeepPATH code adapted from Coudray et al. [12], while the DS rows use the Deepslide code adapted from Wei et al. [22]. Furthermore, with the pre-training model parameters as initial weights, the average evaluation index of each model is 5%–20% higher than that of the model without pre-training.

Table 2.

Precision, recall, F1 score, and AUC value of each model trained as the ASC subtype classifier. DP represents the DeepPATH model and DS the Deepslide model. Rows with mean are the average over the weak learners, and rows with bagging represent the strong learner. Pretrain marks models that use the pre-training weights.

Model	Precision	Recall	F1	AUC
DP_mean	0.22	0.29	0.22	0.68
DP_bagging	0.28	0.34	0.26	0.75
DP_pretrain_mean	0.51	0.51	0.48	0.82
DP_pretrain_bagging	0.59	0.55	0.52	0.88
DS_mean	0.55	0.52	0.49	0.84
DS_bagging	0.63	0.56	0.53	0.89
DS_pretrain_mean	0.57	0.55	0.51	0.86
DS_pretrain_bagging	0.66	0.58	0.54	0.91
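
Metrics of the kind reported in Table 2 can be computed from true and predicted tile labels as sketched below. The text does not say how per-class values were averaged; the macro average (an unweighted mean over classes) used here is an assumption, and the labels are made up for the example.

```python
def macro_scores(true_labels, pred_labels):
    """Return macro-averaged (precision, recall, F1) over classes in true_labels."""
    pairs = list(zip(true_labels, pred_labels))
    classes = sorted(set(true_labels))
    precisions, recalls, f1s = [], [], []
    for c in classes:
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            f1 = 2 * precision * recall / (precision + recall)
        else:
            f1 = 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)
    n = len(classes)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n


y_true = ["Solid", "Solid", "Acinar", "Acinar"]
y_pred = ["Solid", "Acinar", "Acinar", "Acinar"]
precision, recall, f1 = macro_scores(y_true, y_pred)
```

Under macro averaging the F1 score is the mean of per-class F1 values, so it can fall below both the averaged precision and the averaged recall.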

Figure 3 compares the ROC curves of the model with and without pre-training weights for the different subtypes. For nearly all subtypes the model with pre-training weights performs better, and the improvement is particularly marked for lepidic-predominant adenocarcinoma. Testing was also carried out on 256 × 256 px tiles at 20x magnification, but the results were worse than those for 512 × 512 px tiles at 5x magnification.

Figure 3. ROC curves of our model on the test set with and without pre-training weights. AUC values are shown in the legend.
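
For a multi-class model, a per-subtype AUC can be obtained by treating one class as positive and all others as negative. The sketch below computes it as the probability that a positive tile scores higher than a negative one, which equals the area under the ROC curve. The scores and labels are invented, and the one-vs-rest setup is an assumption about how the per-subtype curves were produced.

```python
def auc_one_vs_rest(scores, labels, positive):
    """Area under the ROC curve for one class against all others.

    Computed as the fraction of (positive, negative) pairs in which the
    positive example has the higher score; ties count one half.
    """
    pos = [s for s, l in zip(scores, labels) if l == positive]
    neg = [s for s, l in zip(scores, labels) if l != positive]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical predicted probability of "Solid" for five tiles.
solid_scores = [0.9, 0.8, 0.4, 0.3, 0.2]
true_labels = ["Solid", "Solid", "Acinar", "Solid", "Papillary"]
auc = auc_one_vs_rest(solid_scores, true_labels, "Solid")
```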

3.3. Inconsistent annotations

Wei et al. [22] report that annotation consistency among a group of three doctors reaches a kappa of only 0.4. The kappa between our model and a professional's annotations was 0.43, comparable to the value found by Wei et al.
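
Cohen's kappa compares the observed agreement p_o between two raters with the agreement p_e expected by chance from their label frequencies: κ = (p_o − p_e) / (1 − p_e). A small sketch with invented labels:

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa, (p_o - p_e) / (1 - p_e), for two label sequences."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: fraction of items given the same label.
    p_o = sum(1 for a, b in zip(rater_a, rater_b) if a == b) / n
    # Chance agreement from each rater's label frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    p_e = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (p_o - p_e) / (1 - p_e)


# Hypothetical labels for six tiles from the model and from one doctor.
model = ["Solid", "Solid", "Acinar", "Acinar", "Lepidic", "Solid"]
doctor = ["Solid", "Acinar", "Acinar", "Acinar", "Lepidic", "Lepidic"]
kappa = cohen_kappa(model, doctor)
```

Here the two raters agree on 4 of 6 tiles (p_o = 24/36) while chance agreement is p_e = 11/36, giving κ = 13/25 = 0.52.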

To examine the effect of different doctors' labeling on the experimental results, we ran the experiment separately with one doctor's and with two doctors' labeling data. Figure 4 shows the confusion matrix between the predicted and true labels of each subtype in the test set. In Figure 4A, the results with two doctors' labels are consistent with those found by Wei et al. Figure 4B shows that the results with one doctor's labels are more concentrated than those with two doctors' labels and that the average accuracy is higher, although the testing sets are not the same and the one-doctor experiment has less training data.

Figure 4. Confusion matrix between the predicted and true labels of each subtype in the test set under two doctors' annotation (A) and under one doctor's annotation (B).

4. Discussion

The study demonstrates that transfer learning can help reduce the annotation burden, and that the bagging strategy can be applied to deep learning models for lung cancer classification and adenocarcinoma subtype recognition. The results suggest that the better the weak classifiers, the better the strong learner. Both the present study and Wei et al. [22] show that micropapillary has the worst results of all invasive adenocarcinoma subtypes. Yet micropapillary and solid adenocarcinoma are the pathological subtypes most likely to recur, whereas lepidic-predominant adenocarcinoma has a lower risk of recurrence and acinar and papillary adenocarcinoma have a moderate risk. It is therefore necessary to improve the accuracy of micropapillary prediction. As shown in Figure 4B, the accuracy for micropapillary improves under a single doctor's annotation, demonstrating the importance of consistent annotation, as adding more pathologists when measuring concordance harms the metric.

Our study extends the number of identifiable classes to the five subtypes of invasive adenocarcinoma identified by the IASLC [3], plus background material. Previous studies using machine learning to classify adenocarcinoma slides have generally identified fewer classes. As shown in Table 3, this study is on par with the studies that classify the largest number of classes in the tissue image. Furthermore, this study and Wei et al. [22] are the only two studies we could find that classify all adenocarcinoma subtypes, an important factor in determining a patient's prognosis.

Table 3.

Abbreviations: ACC, adenocarcinoma; SCC, squamous cell carcinoma; LP, lepidic; AC, acinar; PA, papillary; MP, micropapillary; SO, solid.

Researchers	Year	Objective	ACC Subtype Identified	Method
Gertych, et al. [28]	2019	5-class ACC subtype classification: AC, MP, SO, Cribriform, Non-tumorous	AC, MP, SO, cribriform	Fine-tuned and de-novo CNN
Nishio, et al. [29]	2021	5-class lung tissue classification: normal, emphysema, atypical adenomatous hyperplasia, lepidic pattern of ACC, and invasive ACC	LP	Homology-based
Yang, et al. [30]	2021	6-class lung tissue classification: ACC, SCC, small-cell lung cancer, pulmonary tuberculosis, organizing pneumonia, normal lung	None	DNN
Wei, et al. [22]	2019	6-class ACC WSI primary subtype classification; LP, AC, PA, MP, SO, benign/imperfect sample	LP, AC, PA, MP, SO	DNN
This Paper	–	6-class ACC WSI primary subtype classification; LP, AC, PA, MP, SO, background	LP, AC, PA, MP, SO	Transfer Learning + DNN

We note that the consistency between different doctors' labeling results is very low, which may signal that human labelers cannot always determine the common characteristics of a particular subtype. This arises because each adenocarcinoma subtype can itself be divided histologically into different types, whose morphological characteristics differ. For example, acinar adenocarcinoma can be divided into simple acinar, complex acinar, glandular fusion, and sieve arrangement, while the pathological manifestations of papillary adenocarcinoma can be divided into pseudo-clay, moderate papillary size, and different papillary size; the prognosis of each type also differs. We could therefore try to predict these finer categories directly. Even with less data for each type, the consistency within a type and the difference between types would be higher.

One challenge facing adenocarcinoma subtype classification is achieving high concordance with professionals, as well as establishing a gold standard. Evaluating the trained model against one versus two pathologists changes the kappa significantly. Furthermore, even among doctors, concordance is low. Within the dataset used in this study, we found that the variance between tiles marked Background and Acinar was higher than for the other classes. This led to worse performance for these classes, demonstrating the need for higher concordance among pathologists themselves, not only between pathologists and AI models. Thus, we believe that our study ultimately reinforces the need for a new method to compare classification accuracies.

Declarations

Author contribution statement

All authors listed have significantly contributed to the development and the writing of this article.

Funding statement

Dr. Ilyar Sheyhidin was supported by National Key Research and Development Program of China [2017YFC0909903].

Data availability statement

The authors do not have permission to share data.

Declaration of interest's statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.

Acknowledgements

We thank Doctors Yuqing Ma, Wenli Ji and Xiaomei Ma from the First Affiliated Hospital of Xinjiang Medical University for providing the data annotations.

References

1. Torre L.A., Siegel R.L., Jemal A. Lung cancer statistics. Adv. Exp. Med. Biol. 2016;893:1–19. doi: 10.1007/978-3-319-24223-1_1.
2. Malhotra J., Malvezzi M., Negri E. Risk factors for lung cancer worldwide. Eur. Respir. J. 2016;48:889–902. doi: 10.1183/13993003.00359-2016.
3. Travis W.D., Brambilla E., Noguchi M., et al. International association for the study of lung cancer/American thoracic society/European respiratory society international multidisciplinary classification of lung adenocarcinoma. J. Thorac. Oncol. 2011;6:244–285. doi: 10.1097/JTO.0b013e318206a221.
4. Travis W.D., Brambilla E., Nicholson A.G., et al. The 2015 World Health Organization classification of lung tumors. J. Thorac. Oncol. 2015;9:1243–1260. doi: 10.1097/JTO.0000000000000630.
5. Yoshizawa A., Motoi N., Riely G.J., et al. Impact of proposed IASLC/ATS/ERS classification of lung adenocarcinoma: prognostic subgroups and implications for further revision of staging based on analysis of 514 stage I cases. Mod. Pathol. 2011;24:653–664. doi: 10.1038/modpathol.2010.232.
6. Warth A., Muley T., Meister M., et al. The novel histologic International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society classification system of lung adenocarcinoma is a stage-independent predictor of survival. J. Clin. Oncol. 2012;30:1438–1446. doi: 10.1200/JCO.2011.37.2185.
7. den B.V., Martin J. Interobserver variation of the histopathological diagnosis in clinical trials on glioma: a clinician’s perspective. Acta Neuropathol. 2010;120:297–304. doi: 10.1007/s00401-010-0725-7.
8. Cooper L.A., Kong J., Gutman D.A., et al. Novel genotype-phenotype associations in human cancers enabled by advanced molecular platforms and computational analysis of whole slide images. Lab. Invest. 2015;95:366–376. doi: 10.1038/labinvest.2014.153.
10. Thunnissen E., Beasley M.B., Borczuk A.C., et al. Reproducibility of histopathological subtypes and invasion in pulmonary adenocarcinoma. An international interobserver study. Mod. Pathol. 2012;25:1574–1583. doi: 10.1038/modpathol.2012.106.
12. Coudray N., Ocampo P.S., Sakellaropoulos T., et al. Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning. Nat. Med. 2018;24:1559–1567. doi: 10.1038/s41591-018-0177-5.
13. Cruz-Roa A., Gilmore H., Basavanhally A., et al. High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: application to invasive breast cancer detection. PLoS One. 2018;13. doi: 10.1371/journal.pone.0196828.
14. Mehta S., Mercan E., Bartlett J., et al. WACV; 2018. Learning to Segment Breast Biopsy Whole Slide Images; pp. 663–672.
15. Zheng Q., Yang L., Zeng B., et al. Artificial intelligence performance in detecting tumor metastasis from medical radiology imaging: a systematic review and meta-analysis. EClinicalMedicine. 2021;31:2589–5370. doi: 10.1016/j.eclinm.2020.100669.
16. Balkenhol M.C.A., Tellez D., Vreuls W., et al. Deep learning assisted mitotic counting for breast cancer. Lab. Invest. 2019;99:1596–1606. doi: 10.1038/s41374-019-0275-0.
17. Ker J., Bai Y., Lee H.Y., Rao Jai, et al. Automated brain histology classification using machine learning. J. Clin. Neurosci. 2019;66:239–245. doi: 10.1016/j.jocn.2019.05.019.
18. Iizuka O., Kanavati F., Kato K., et al. Deep learning models for histopathological classification of gastric and colonic epithelial tumours. Sci. Rep. 2020;10:1504. doi: 10.1038/s41598-020-58467-9.
19. Thomsen K., Iversen L., Titlestad T.L., et al. Systematic review of machine learning for diagnosis and prognosis in dermatology. J. Dermatol. Treat. 2020;31:496–510. doi: 10.1080/09546634.2019.1682500.
20. Yu K.H., Zhang C., Berry G.J., et al. Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features. Nat. Commun. 2016;7. doi: 10.1038/ncomms12474.
21. Luo X., Zang X., Yang L., et al. Comprehensive computational pathological image analysis predicts lung cancer prognosis. J. Thorac. Oncol. 2017;12:501–509. doi: 10.1016/j.jtho.2016.10.017.
22. Wei J.W., Tafe L.J., Linnik Y.A., et al. Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks. Sci. Rep. 2019;9:3358. doi: 10.1038/s41598-019-40041-7.
23. Tomczak K., Czerwińska P., Wiznerowicz M. The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge. Contemp. Oncol. 2015;19:A68–77. doi: 10.5114/wo.2014.47136.
24. Rudnick P.A., Markey S.P., Roth J., et al. A description of the clinical proteomic tumor analysis Consortium (CPTAC) common data analysis pipeline. J. Proteome Res. 2016;15:1023–1032. doi: 10.1021/acs.jproteome.5b01091.
25. Breiman L. Bagging predictors. Mach. Learn. 1996;24:123–140.
26. Pino-Mejías R., Jiménez-Gamero M.D., Cubiles-de-la-Vega M.D., et al. Reduced bootstrap aggregating of learning algorithms. Pattern Recogn. Lett. 2008;29:265–271.
27. Szegedy C., Vanhoucke V., Ioffe S., et al. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015. Rethinking the inception architecture for computer vision; pp. 2818–2826.
28. Gertych A., Swiderska-Chadaj Z., Ma Z., et al. Convolutional neural networks can accurately distinguish four histologic growth patterns of lung adenocarcinoma in digital slides. Sci. Rep. 2019;9:1483. doi: 10.1038/s41598-018-37638-9.
29. Nishio M., Nishio M., Jimbo N., Nakane K. Homology-based image processing for automatic classification of histopathological images of lung tissue. Cancers. 2021;13:1192. doi: 10.3390/cancers13061192.
30. Yang H., Chen L., Cheng Z., et al. Deep learning-based six-type classifier for lung cancer and mimics from histopathological whole slide images: a retrospective study. BMC Med. 2021;19:80. doi: 10.1186/s12916-021-01953-2.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The authors do not have permission to share data.
