Abstract
Early identification and classification of pulmonary nodules are essential for improving the survival rates of individuals with lung cancer and are considered to be key requirements for computer-assisted diagnosis. To address this topic, the present study proposed a method for predicting the malignant phenotype of pulmonary nodules based on weighted voting rules. This method used the pulmonary nodule regions of interest as the input data and extracted the features of the pulmonary nodules using the Denoising Auto Encoder, ResNet-18. Moreover, the software also modifies texture and shape features to assess the malignant phenotype of the pulmonary nodules. Based on their classification accuracy (Acc), the different classifiers were assigned to different weights. Finally, an integrated classifier was obtained to score the malignant phenotype of the pulmonary nodules. The present study included training and testing experiments conducted by extracting the corresponding lung nodule image data from the Lung Image Database Consortium-Image Database Resource Initiative. The results of the present study indicated a final classification Acc of 93.10±2.4%, demonstrating the feasibility and effectiveness of the proposed method. This method includes the powerful feature extraction ability of deep learning combined with the ability to use traditional features in image representation.
Keywords: pulmonary nodules, lung cancer, malignancy level classification, deep learning, ensemble classification
Introduction
Lung cancer is one of the most common types of malignant tumor globally. According to cancer statistics, 2,093,000 new lung cancer cases were reported in 2018 (1), accounting for 12.22% of total cases cancer worldwide. A total of 1,761,000 patients died due to lung cancer in 2018, which represents 19.78% of total mortality due to cancer in the same period (1). In general, >80% of cases of lung cancer are attributed to non-small cell lung cancer (2). Early-stage non-small cell cancer usually occurs in the form of pulmonary nodules, which are not readily detected, and are often asymptomatic prior to excessive proliferation. This leads to frequent misdiagnoses. Lung database computed tomography (LDCT) is an effective screening method for pulmonary nodules due to its low radiation and price. Automatic detection and diagnosis of pulmonary nodules from chest CT images usually includes segmentation of the pulmonary parenchyma from the CT images, detection of suspected nodules in the parenchyma of the lung, extraction of the characteristics of pulmonary nodules and classification of the pulmonary nodules, which is the key step in supplying supplementary suggestions for diagnosis.
When making a diagnosis using medical imaging, physicians rate the characteristics (texture, margin, lobulation and calcification) of pulmonary nodules using empirical and subjective methods to determine their malignant phenotype (3–7). This method is subjective and highly dependent on the physician's experience. Concomitantly, physical examination and imaging of lung nodules is becoming increasingly onerous and presents a major challenge for physicians, affecting the diagnostic classification accuracy (Acc) of lung nodules. The continuous development of machine learning has enabled the application of advanced learning techniques in the research and diagnosis of a number of diseases (8–17). The information derived from lung nodule image data can be combined with machine learning in order to investigate the association between lung cancer incidence and clinicopathological features (18). Supervised machine learning uses the correspondence between data and labels to derive a mapping association between them, whereas unsupervised machine learning is used in cases where samples cannot be effectively classified, such as in the absence of sufficient prior labels. The automated rating of lung nodules using machine learning can improve the efficiency of inspections while reducing human error (19).
The present study proposed a method based on ensemble learning designed to classify the malignant levels of pulmonary nodules, using features such as morphological texture features (TF) and deep semantic features. These approaches are more suitable than previous computer-aided diagnosis (9–18) for clinical practice and replace a single identification method that can only distinguish between benign and malignant nodule states. As the precision of a single classifier (9–13) is not high and does not meet the clinical diagnosis requirements, the present study used the ensemble learning method to integrate three single classifiers according to certain strategies, namely comprehensive analysis of the characterization information of lung nodules and automatic assignment of the lung nodule malignancy, in order to improve the Acc. The training and testing protocols used in the present study included datasets from the Lung Image Database Consortium Image Database Resource Initiative (LIDC-IDRI). Radiologists assigned the corresponding features of pulmonary nodule lesions according to the image files of each study example (20,21).
The results of the present study demonstrated that the specific characteristics of CT images and pulmonary nodules can be used to quantitatively evaluate lung nodule features based on weighted voting, which differs from the previous classification of the benign and malignant pulmonary nodule algorithm. This process automatically scores the malignant phenotype levels of the lung nodules. In addition, the computer tomographic image features and the different semantic features are matched. The correspondence between different modalities can be used to design a more precise personalized treatment plan. Furthermore, a scheme is proposed for ensemble learning of different classifiers by training multiple classifiers, combining them according to the determined integration strategy and comprehensively assessing the final result. The feasibility of the proposed method was demonstrated in the LIDC-IDRI dataset and was compared with state-of-the-art methods for pulmonary nodule diagnosis and assessment.
Materials and methods
The materials and methods section describes the proposed method for classification of pulmonary nodules using ensemble learning. Characteristics of pulmonary nodules were extracted using techniques such as the Convolutional Neural Networks (CNN) features of supervised machine learning methods, the Denoising Auto Encoder (DAE) features of unsupervised machine learning methods and the Texture Feature (TF) and Shape Feature (SF) of lung nodules. CNN, autoencoder and TF and SF techniques were combined with the weighted voting model to predict the semantic feature scores of the lung nodules. According to the classification error rate of the sub-classifier, the weights of different classifiers were determined, and the integrated model lung nodule classification was obtained to determine malignant phenotype level (Fig. 1).
Lung nodule data
The lung CT datasets selected in the present study were obtained from LIDC-IDRI (22) (https://wiki.cancerimagingarchive.net/display/Public/LIDC-IDRI). The LIDC-IDRI dataset contained a total of 1,018 CT images of patients with relevant clinical information. These CT images were marked by four physicians to indicate the location of the lung nodules, the edge contour information, the degree of benign and malignant characteristics and the quantitation of different signs. Pulmonary nodules with different malignant phenotypes exhibited a number of morphological characteristics. In the LIDC-IDRI dataset, the malignant phenotype of the lung nodules was quantified by the physician using specific numbers and the quantification range was set to 1–5, according to the definition of dataset (14,16). The probability of malignancy was indicated as follows: i) Malignancy 1, high probability of being benign; ii) malignancy 2, moderate probability of being benign; iii) malignancy 3, indeterminate probability being benign; iv) malignancy 4, moderate probability of being malignant; and v) malignancy 5, high probability of being malignant.
As nodules in the lung parenchyma are generally small in diameter, the remaining parts of the CT image may affect the classification results; therefore, the lung nodule images were extracted according to the required annotations. The annotation file records comprised the edge information used by the doctor to mark the position of the nodule. Based on the edge information, the center position of the nodule was determined and a 64×64 pixel2 image located at the center of the lung nodule was obtained as experimental data. The computer-aided diagnostic system automatically extracts the characteristics of the pulmonary nodules and assesses the malignant phenotype of the nodules, which improves the prediction efficiency. In the present study, the LIDC-IDRI dataset was used for model training. Initially, the 64×64 pixel2 regions of interest (ROI) containing the pulmonary nodules were extracted according to the annotation file. The extracted ROI images of the pulmonary nodules were used as the input of the three models, and the corresponding pulmonary malignant phenotype of the nodule was extracted.
Pulmonary nodule feature extraction using unsupervised learning
The autoencoder (23) is an unsupervised learning method that automatically maps input data into the hidden layers and reconstructs the output of the hidden layers to the same shape as the raw input data. It locates hidden features from specific inputs and extracts them to represent the original input. The process from the input to the hidden layers is known as encoding, whereas the process of reconstruction from the hidden layers is known as decoding. The difference between the raw and the reconstruction input data is the reconstruction error. The autoencoder assumes that the distribution representation of the hidden layers can capture the main factors of change within the data.
Following lung nodule extraction using the DAE, the Softmax function was used to classify the nodular malignancy, assuming the training sample {(x1,y1), (x2,y2), …, (xn,yn)}, where xi is the lung nodule image data, and yi is the corresponding malignancy score. The classification of a lung nodule by the Softmax function requires estimation of the probability corresponding to each malignancy score. The formula used to calculate the probability was:
(1) |
where ω is a parameter used in the model. The degree of malignant phenotype corresponding to the probability value was selected as the predicted malignancy of the lung nodule image xi. The process and feature extraction by Denoising Autoencoder (Fig. S1) are further described in Data S1.
Pulmonary nodule features extraction using supervised learning
CNN is a feedforward deep neural network, which consists of convolution operation. CNN calculates matching levels between the images and labels by extracting the feature representation of images (24–27). The deeper of network layers is, the stronger CNN representation is; however, it has been shown that the network degenerates as CNN increase in depth, and this increase results in a decrease in Acc. ResNet network is another typical CNN that adds a shortcut connection and an identity map to the network using residual learning (27). The extraction capability of network features is enhanced, and the network performance gradually improves as the network deepens (25).
In the present study, the sum of the cross-entropy loss function and the regularization loss function were used as the loss function of the residual network:
(2) |
where yi is the true label of the pulmonary nodule, ŷi is the prediction label for the pulmonary nodule, p is the regularization factor and θ, the network model parameter. The features extracted using ResNet-18 were also classified using the Softmax function. The process of feature extraction by ResNet-18 (Fig. S2) and the hyper-parameters of ResNet-18 (Table SI) are further described in Data S2.
Pulmonary nodule classification by handcrafted features
The handcrafted features of the pulmonary nodules were used to classify the lung nodules. Due to the particularity of the medical images, only SF and TF were used to classify the lung nodules (28,29). The geometric parameter method was used to determine the shape of the lung nodules, and the Gray Level Co-occurrence Matrix was used to determine the texture of the nodules.
Following extraction of TF and SF, the extracted features were concatenated using a set of feature vectors. The multi-class machine learning method was selected to classify the features of the lung nodules. In the present study, the K-Nearest Neighbor (KNN) method was selected to classify the extracted handcrafted features. KNN is a method for classifying targets based on feature space training examples, and consists of two components, learning and classification. In the present study, five grades were used for the classification of the malignant phenotype and an additional five categories were employed as vectors. All candidate feature vectors were classified by KNN and divided into five categories, representing the five malignancy grades. The process and some typical handcrafted features (Fig. S3) are further described in Data S3
Weighted voting method based on classification error rate
The three feature methods of unsupervised learning, supervised learning and handcrafted feature combination exhibited different classification abilities for the classification task, and different classification performances. If a single classifier is used alone, the generalization ability of the classifier may not be strong. Three classifiers were combined by certain rules and the combined model could make full use of the features extracted by the three methods. This approach may improve the Acc and generalization ability of the model and could decrease the risk of the model leading to local minimum points in the learning task during the training process. Fusion of the multi-classifiers resulted in cascade and parallel forms. The parallel mode adjusts the base classifiers into a parallel action. Therefore, in the present study, the three classifiers were combined in parallel.
Using parallel fusion, weighted voting was based on the error rate (30–32). The ensemble classification model can utilize the features of each classifier and further ensure flexibility between the different feature coefficients of each classifier (Table I).
Table I.
Algorithm 1 Weighted voting algorithm based on classification error rate |
---|
Evaluation criteria
In the present study, models were assessed based on accuracy (Acc), precision (Pre) and sensitivity (Sen). Acc is the correct proportion of the total sample, indicating the classification ability of the models. Pre is the positive predictive value, representing the proportion of true positives in the positive samples. Sen is the true positive rate, which is the proportion required to make a true positive prediction. Larger values indicate a better performance of classification. The calculation formula used were as follows:
(3) |
(4) |
(5) |
The pulmonary nodule classification was defined by the following parameters: TPi (true positive) indicated the probability that the malignancy i was classified as I; FNi (false negative) indicated the probability that the malignancy i was not classified as I; FPi (false positive) indicated the probability that a malignancy that was not i was not classified as I; and TNi (true negative) indicated the probability that a malignancy that was not i was classified as i (i=1, 2, 3, 4 and 5). The detailed data description and some nodule samples (Fig. S4) are shown in Data S4. In order to determine sensitivity and accuracy trends, receiver operating characteristic (ROC) curves and area under the curve (AUC) values were generated (Fig. 2). The ROC curves comprehensively demonstrate the association between precision and sensitivity, while the AUC value is the area under ROC curves. The larger the AUC value, the better the classifier performance (32).
Results
Classification results of different classifiers
The proposed method exhibited an average Acc of 93.10%, a Pre of 83.85% and a Sen of 81.75% for identification of the malignant phenotype of the lung nodules in the test set (Table II).
Table II.
Malignancy level | Accuracy, % | Precision, % | Sensitivity, % |
---|---|---|---|
1 | 93.70 | 88.92 | 76.25 |
2 | 92.70 | 72.11 | 82.37 |
3 | 93.60 | 84.30 | 84.71 |
4 | 92.22 | 81.42 | 81.82 |
5 | 93.33 | 89.18 | 83.06 |
Totals | 93.10 | 83.85 | 81.75 |
During the training phase, 10-fold cross-validation was used to obtain the Acc of the three classifiers. ResNet-18 exhibited high Acc, whereas DAE exhibited notably stable Acc (Fig. 3). Although the Acc of the handcrafted features was relatively low, it could describe the specific morphological and TF of the pulmonary nodules. In order to validate these findings, an ablation experiment was performed by removing the single methods (Table III).
Table III.
Method | Accuracy, % | Precision, % | Sensitivity, % |
---|---|---|---|
ResNet + Denoising Auto Encoder | 80.27 | 82.10 | 73.45 |
ResNet + KNN | 79.87 | 69.73 | 77.95 |
Denoising Auto Encoder + KNN | 82.59 | 75.11 | 80.80 |
KNN, K-nearest neighbor.
Comparative experiment
The present study further compared the performance of the proposed method based on the weighted voting classification method and similar classification methods (13,33–36) used under the same conditions (Table IV).
Table IV.
The results indicated that the Acc, Sen and AUC of the classification of the malignant phenotype of the pulmonary nodules were optimal in the methods used in the present study. The proposed method exhibited a higher classification performance regarding the pulmonary nodules, which could be used for their accurate assessment, thereby supplying an auxiliary suggestion for the judgment of the medical practitioner. The Pre of the method used in the present study was lower compared with the Multi-Crop CNN (MC-CNN) method proposed by Shen et al (34), as MC-CNN captures more prominent features of the nodule via multiple cropping strategies. However, MC-CNN reduces Sen to ensure Pre and does not improve the classification Acc of pulmonary nodules. Therefore, complex convolution networks may result in longer time periods.
Different CNN models
In the present study, a number of common supervised CNN models were selected for comparison, namely, GoogleNet, VGGNet and SENet (24–26). The CNN models were compared using different pre-training processes with the ResNet-18 under the same conditions (Table V).
Table V.
Convolutional neural network model | Accuracy, % | Precision, % | Sensitivity, % |
---|---|---|---|
Plain-18 | 66.75 | 65.30 | 66.00 |
ResNet-18 | 87.15 | 84.10 | 85.65 |
ResNet-50 | 85.75 | 69.75 | 69.25 |
GoogleNet | 86.20 | 79.00 | 79.50 |
VGGNet-16 | 86.30 | 85.25 | 80.90 |
SENet | 87.00 | 83.35 | 84.80 |
Discussion
Magnetic Resonance Imaging (MRI) uses a magnetic field to obtain electromagnetic signals from the body and reconstruct these signals into images. As lung tissue is rich in gas, the effectiveness of lung MRI is poor. Positron Emission CT (PET-CT) uses the Compton effect in order to reconstruct images. However, its use of radiation increases the risk of lung cancer. CT uses precise and collimated X-ray beams to scan the body and conduct tomography. CT is suitable for screening human respiratory diseases due to its high-density resolution (37). The radiation dose of LDCT is only 26% of that of conventional CT, and LDCT can therefore decrease the incidence of side effects, compared with MRI and PET-CT (38). It is therefore suitable for screening patients with lung cancer, especially non-small cell lung cancer.
For each characteristic of the pulmonary nodules, the appearance was different. Direct observation of the location of the pulmonary nodules or analysis of their malignant phenotype from the image is considered a difficult task. The key features of the vectors were extracted, in order to represent nodules for classification; however, the characteristics of the nodules varied, thus making the task difficult. With regards to nodule images, the supervised learning approaches can automatically extract different features of nodules according to the nodule's labels (benign and malignant). The advantage is that it can identify different categories of nodules according to given labels, without manual intervention. On the other hand, DAE also can extract effective features of nodules through back-propagation algorithm and gradient descent algorithm (23). The autoencoder can find the specific latent vectors from sample sets and extract it for classification. DAE have the ability to preserve the local and global structure of highly nonlinear networks, thus it can be better applied to nodules classification tasks. The handcrafted features, TF and SF reflect the information of the surface and appearance of the nodules, respectively; however, they are unable to completely reflect the essential attributes of the nodules in classification alone, thus these features need to be used in combination (39).
The present study proposed a method for classifying the malignant phenotype of pulmonary nodules on chest CT images. Initially, ResNet-18 and DAE were used to classify lung nodules and KNN was used to classify the SF and TF of these nodules. To get better classification results, we ensemble single classifier according to the classification error rate of the three classifiers, the ensemble model was integrated with three classifiers using weighted voting. A total of 4,578 lung nodule images were extracted from the LIDC-IDRI dataset to verify the validity of the method. Following data balancing and data augmentation, data were obtained from 20,000 images. In the final model, Acc, Pre and Sen reached 93.10, 83.85 and 81.75%, respectively. The overall performance was higher than that of state-of-the-art methods (13,29–32). In the present study, these data were compared with the different CNN models and ResNet-18. It was demonstrated that the classification performance of ResNet-18 was higher than that of the other CNN models. Therefore, the proposed classification method for the malignant phenotype of pulmonary nodules decreases the time and cost of CT imaging, increases the Acc of assisted lung cancer diagnosis, offers auxiliary support during diagnosis and improves the efficiency of lung cancer screening in hospitals.
Lung cancer automatic judgment is important but difficult as it predominantly includes detection, segmentation and evaluation (40). The present study successfully identified the classification of multi-class nodules, which is the first step of lung cancer judgment. Prospective studies will focus on lung tumor prediction and segmentation.
Supplementary Material
Acknowledgements
Not applicable.
Funding
The present study was supported in part by the National Natural Science Foundation of China (grant no. 61872261).
Availability of data and materials
The datasets generated and/or analyzed during the current study are available in the LIDC-IDRI repository (https://wiki.cancerimagingarchive.net/display/Public/LIDC-IDRI). The implementation code used during the current study is available at https://github.com/XmaNm/nodules-classification.
Authors' contributions
NX and YQ conceived and designed the study. MBZ improved the algorithm for use in the present study. JHL collected and curated data. NX and JHL designed the experiment and analyzed the results. SHW coordinated the present study and collected background information. All authors read and approved the final manuscript.
Ethics approval and consent for publication
Not applicable.
Patient consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
References
- 1.Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394–424. doi: 10.3322/caac.21492. [DOI] [PubMed] [Google Scholar]
- 2.Rzechonek A, Grzegrzolka J, Blasiak P, Ornat M, Piotrowska A, Nowak A, Dziegiel P. Correlation of expression of tenascin C and blood vessel density in non-small cell lung cancers. Anticancer Res. 2018;38:1987–1991. doi: 10.21873/anticanres.12436. [DOI] [PubMed] [Google Scholar]
- 3.Chen S, Harmon S, Perk T, Li X, Chen M, Li Y, Jeraj R. Diagnostic classification of solitary pulmonary nodules using dual time 18F-FDG PET/CT image texture features in granuloma-endemic regions. Sci Rep. 2017;7:9370. doi: 10.1038/s41598-017-08764-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Dai M, Qi J, Zhou Z, Gao F. The classification of pulmonary nodules based on texture features over local jet transformation space. Chin J Biomed Eng. 2017;36:12–19. [Google Scholar]
- 5.Felix A, Oliveira M, Machado A, Raniery J. In: Proceedings of 29th Conference on Graphics; Patterns and Images (SIBGRAPI), Sao Paulo: 2016. Using 3D texture and margin sharpness features on classification of small pulmonary nodules; pp. 394–400. [Google Scholar]
- 6.Song J, Hui L, Geng F, Zhang C. In: Proceedings of 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress (DASC/PiCom/DataCom/CyberSciTech); Auckland: 2016. Weakly-supervised classification of pulmonary nodules based on shape characters; pp. 228–232. [Google Scholar]
- 7.Niehaus R, Raicu DS, Furst J, Armato S., III Toward understanding the size dependence of shape features for predicting spiculation in lung nodules for computer-aided diagnosis. J Digit Imaging. 2015;28:704–717. doi: 10.1007/s10278-015-9774-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Dhara AK, Mukhopadhyay S, Dutta A, Garg M, Khandelwal N. A Combination of shape and texture features for classification of pulmonary nodules in lung CT images. J Digit Imaging. 2016;29:466–475. doi: 10.1007/s10278-015-9857-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Li W, Cao P, Zhao D, Wang J. Pulmonary nodule classification with deep convolutional neural networks on computed tomography images. Comput Math Methods Med. 2016;2016:6215085. doi: 10.1155/2016/6215085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Tartar A, Akan A, Kilic N. A novel approach to malignant-benign classification of pulmonary nodules by using ensemble learning classifiers. Conf Proc IEEE Eng Med Biol Soc. 2014;2014:4651–4654. doi: 10.1109/EMBC.2014.6944661. [DOI] [PubMed] [Google Scholar]
- 11.Nibali A, Zhen H, Wollersheim D. Pulmonary nodule classification with deep residual networks. Int J Comput Assist Radiol Surg. 2017;12:1799–1808. doi: 10.1007/s11548-017-1605-6. [DOI] [PubMed] [Google Scholar]
- 12.Shen W, Zhou M, Yang F, Yang C, Tian J. Multi-scale convolutional neural networks for lung nodule classification. Inf Process Med Imaging. 2015;24:588–599. doi: 10.1007/978-3-319-19992-4_46. [DOI] [PubMed] [Google Scholar]
- 13.Kumar D, Wong A, Clausi DA. In: Proceedings of the 2015 12th Conference on Computer and Robot Vision; Halifax, Canada. IEEE: 2015. Lung nodule classification using deep features in CT images; pp. 133–138. [Google Scholar]
- 14.Kaya A, Can AB. A weighted rule based method for predicting malignancy of pulmonary nodules by nodule characteristics. J Biomed Inform. 2015;56:69–79. doi: 10.1016/j.jbi.2015.05.011. [DOI] [PubMed] [Google Scholar]
- 15.Li G, Kim H, Tan JK, Ishikawa S, Hirano Y, Kido S, Tachibana R. Semantic characteristics prediction of pulmonary nodule using artificial neural networks. Conf Proc IEEE Eng Med Biol Soc. 2013;2013:5465–5468. doi: 10.1109/EMBC.2013.6610786. [DOI] [PubMed] [Google Scholar]
- 16.Chen S, Ni D, Qin J, Lei B, Wang T, Cheng JZ. Springer; Cham: 2016. Bridging computational features toward multiple semantic features with multi-task regression: A study of ct pulmonary nodules. International Conference on Medical Image Computing and Computer-Assisted Intervention; pp. 53–60. [Google Scholar]
- 17.Shewaye TN, Mekonnen AA. arXiv: Computer Vision and Pattern Recognition; arXiv:1605.08350v1 [cs.CV]: 2016. Benign-malignant lung nodule classification with geometric and appearance histogram features. [Google Scholar]
- 18.Orozco HM, Villegas OOV, de Jesús Ochoa Domínguez O, Sánchez VGC. Mexican International Conference on Artificial Intelligence. IEEE; 2014. Lung nodule classification in CT thorax images using support vector machines; pp. 277–283. [Google Scholar]
- 19.Zhao A, Qi L, Li J, Dong J, Yu H. SPIE Press; 2018. LSTM for diagnosis of neurodegenerative diseases using gait data. In: Proceedings of the 9th International Conference on Graphics and Image Processing. [Google Scholar]
- 20.Jacobs C, van Rikxoort EM, Twellmann T, Scholten ET, de Jong PA, Kuhnigk JM, Oudkerk M, de Koning HJ, Prokop M, Schaefer-Prokop C, van Ginneken B. Automatic detection of subsolid pulmonary nodules in thoracic computed tomography images. Med Image Anal. 2014;18:374–384. doi: 10.1016/j.media.2013.12.001. [DOI] [PubMed] [Google Scholar]
- 21.Ma J, Wang Q, Ren Y, Hu H, Zhao J. Medical Imaging 2016: PACS and Imaging Informatics: Next Generation and Innovations. Vol. 9789. SPIE Proceedings; 2016. Automatic lung nodule classification with radiomics approach. [Google Scholar]
- 22.Armato SG, III, McLennan G, Bidaut L, McNitt-Gray MF, Meyer CR, Reeves AP, Zhao B, Aberle DR, Henschke CI, Hoffman EA, et al. The lung image database consortium (LIDC) and image database resource initiative (IDRI): A completed reference database of lung nodules on CT scans. Med Phys. 2011;38:915–931. doi: 10.1118/1.3528204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Chen M, Weinberger KQ, Sha F, Bengio YO. Marginalized denoising auto-encoders for nonlinear representations. Proceedings of the 31st International Conference on Machine Learning. PMLR. 2014;32:1476–1484. [Google Scholar]
- 24.Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A. arXiv: Computer Vision and Pattern Recognition; arXiv:1409.4842v1 [cs.CV]: 2015. Going deeper with convolutions. [DOI] [Google Scholar]
- 25.Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. arXiv: Computer Vision and Pattern Recognition arXiv:1409.1556v6 [cs.CV] 2014 [Google Scholar]
- 26.Hu J, Shen L, Albanie S, Sun G, Wu E. arXiv: Computer Vision and Pattern Recognition; arXiv:1709.01507v4 [cs.CV]: 2017. Squeeze-and-excitation networks. [Google Scholar]
- 27.He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. IEEE Conference on Computer Vision and Pattern Recognition. 2016:770–778. [Google Scholar]
- 28.Haralick RM, Shanmugam K, Dinstein IH. IEEE Transactions on Systems; Man, and Cybernetics. Vol SMC-3. IEEE: 1973. Textural features for image classification; pp. 610–621. [Google Scholar]
- 29.Pan L, Qiang Y, Yuan J, Wu L. Rapid retrieval of lung nodule CT images based on hashing and pruning methods. Biomed Res Int. 2016;2016:3162649. doi: 10.1155/2016/3162649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Li X, Yang Y, Xiong H, Song S, Jia H. Control and Decision Conference. IEEE; 2017. Pulmonary nodules detection algorithm based on robust cascade classifier for CT images; pp. 231–235. [Google Scholar]
- 31.Zinovev D, Furst J, Raicu D. Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops. IEEE Computer Society; 2011. Building an ensemble of probabilistic classifiers for lung nodule interpretation; pp. 155–161. [Google Scholar]
- 32.Zou KH, O'Malley AJ, Mauri L. Receiver-operating characteristic analysis for evaluating diagnostic tests and predictive models. Circulation. 2007;115:654–657. doi: 10.1161/CIRCULATIONAHA.105.594929. [DOI] [PubMed] [Google Scholar]
- 33.Zinovev D, Feigenbaum J, Furst J, Raicu D. Probabilistic lung nodule classification with belief decision trees. Conf Proc IEEE Eng Med Biol Soc. 2011;2011:4493–4498. doi: 10.1109/IEMBS.2011.6091114. [DOI] [PubMed] [Google Scholar]
- 34.Shen W, Zhou M, Yang F, Yu D, Dong D, Yang C, Zang Y, Tian J. Multi-crop Convolutional Neural Networks for lung nodule malignancy suspiciousness classification. Pattern Recogn. 2017;61:663–673. doi: 10.1016/j.patcog.2016.05.029. [DOI] [Google Scholar]
- 35.Rodrigues MB, Da NóBrega RVM, Alves SSA, Filho PPR, Duarte JBF, Sangaiah AK, De Albuquerque VHC. Health of things algorithms for malignancy level classification of lung nodules. IEEE Access. 2018;6:18592–18601. doi: 10.1109/ACCESS.2018.2817614. [DOI] [Google Scholar]
- 36.Sun W, Huang X, Tseng TL, Zhang J, Qian W. Medical Imaging 2016: PACS and Imaging Informatics: Next Generation and Innovations. Vol. 9785. SPIE Proceedings; 2016. Computerized lung cancer malignancy level analysis using 3D texture features. [Google Scholar]
- 37.Seo N, Seok J, Lim S, Cho A. Radiologic diagnosis (CT, MRI, & PET-CT) Surg Gastric Cancer. 2019:67–86. doi: 10.1007/978-3-662-45583-8_4. [DOI] [Google Scholar]
- 38.Oliva MR, Saini S. Liver cancer imaging: Role of CT, MRI, US and PET. Cancer Imaging. 2004;4:S42–S46. doi: 10.1102/1470-7330.2004.0011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Muhammad MN, Raicu DS, Furst JD, Varutbangkul E. Medical Imaging 2008: PACS and Imaging Informatics. Vol. 6919. SPIE Proceedings; 2008. Texture versus shape analysis for lung nodule similarity in computed tomography studies. [Google Scholar]
- 40.Wormanns D, Fiebich M, Saidi M, Diederich S, Heindel W. Automatic detection of pulmonary nodules at spiral CT: Clinical application of a computer-aided diagnosis system. Eur Radiol. 2002;12:1052–1057. doi: 10.1007/s003300101126. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated and/or analyzed during the current study are available in the LIDC-IDRI repository (https://wiki.cancerimagingarchive.net/display/Public/LIDC-IDRI). The implementation code used during the current study is available at https://github.com/XmaNm/nodules-classification.