Skip to main content
Journal of Healthcare Engineering logoLink to Journal of Healthcare Engineering
. 2018 Nov 1;2018:9409267. doi: 10.1155/2018/9409267

A Computer-Aided Pipeline for Automatic Lung Cancer Classification on Computed Tomography Scans

Emre Dandıl 1,
PMCID: PMC6236771  PMID: 30515286

Abstract

Lung cancer is one of the most common cancer types. For the survival of the patient, early detection of lung cancer with the best treatment method is crucial. In this study, we propose a novel computer-aided pipeline on computed tomography (CT) scans for early diagnosis of lung cancer thanks to the classification of benign and malignant nodules. The proposed pipeline is composed of four stages. In preprocessing steps, CT images are enhanced, and lung volumes are extracted from the image with the help of a novel method called lung volume extraction method (LUVEM). The significance of the proposed pipeline is using LUVEM for extracting lung region. In nodule detection stage, candidate nodules are determined according to the circular Hough transform- (CHT-) based method. Then, lung nodules are segmented with self-organizing maps (SOM). In feature computation stage, intensity, shape, texture, energy, and combined features are used for feature extraction, and principal component analysis (PCA) is used for feature reduction step. In the final stage, probabilistic neural network (PNN) classifies benign and malign nodules. According to the experiments performed on our dataset, the proposed pipeline system can classify benign and malign nodules with 95.91% accuracy, 97.42% sensitivity, and 94.24% specificity. Even in cases of small-sized nodules (3–10 mm), the proposed system can determine the nodule type with 94.68% accuracy.

1. Introduction

Nowadays, lung cancer is one of the ranking first causes of mortality worldwide among men and women [1, 2]. Although there are a lot of treatment options like surgery, radiotherapy, and chemotherapy, five year survival rate for patients is quite low [3]. However, survival rate may go up to 54% in case lung cancer is identified in an early stage [4]. Therefore, early detection of lung cancer is vital to decrease lung cancer mortality.

Medical imaging techniques have been important technology in screening of lung cancer recently. CT scan becomes a standard modality for detecting and assessing lung cancer [5]. Most of the lung nodules are usually benign. However, some nodules such as calcified, swollen, and hard can also be determined as benign. Similarly, a hard nodule generally is cancerous (malignant), but it may be considered as benign case in some cases [6]. Furthermore, medical CT images are needed to be diagnosed by radiologists.

Computer-aided detection (CAD) systems have been an important field in medical image processing. CAD systems also based on machine learning methods designed to diagnosis of cancer have become common in recent years. Radiologists and physicians may use findings of CAD systems as the second opinion before making their own final decisions. Therefore, CAD systems play an important role in CT scans to help radiologists for detection of lung cancer efficiently.

2. Related Work

Computer-aided detection (CAD) systems have been active research field for the pulmonary nodule detection and malign/benign nodule classification. Until now, many CAD systems have been proposed. For example, Ozekes and Camurcu proposed a method for pulmonary nodule detection method using template matching [7]. Schilham et al. presented a CAD system which consists of image preprocessing, candidate nodule detection, feature extraction, and classification for nodule detection in chest radiographs [8]. Dehmeshki et al. detected lung nodules using shape-based genetic algorithm template matching [9]. Suarez-Cuenca et al. also designed a system which discriminates the nodules and non-nodule cases using iris filter in CT images [10]. Murphy et al. automatically performed lung nodule detection using k-nearest neighbours classifier [11]. Giger et al. realized CAD system to detect lung nodules on CT images using geometric features [12]. In addition, Hasegawa et al. proposed image processing methods for identification of lung nodules using CT scans [13]. In another study, Kanazawa et al. used a CAD system to identify pulmonary nodules with fuzzy features [14]. In 2005, Suzuki et al. proposed a method using ANN for classification of malignant and benign nodules on CT images [15]. Sun et al. compared support vector machines (SVM) with the some classification methods for detection of lung cancer on CT images [16]. Kuruvilla and Gunavathi proposed a system using ANNs for classification of lung cancer [17].

In a recent study on lung nodule detection, Javaid et al. proposed a computer aided nodule detection method for the segmentation and detection of challenging in different type nodules [18]. ur Rehman et al. presented a systematic analysis of nodules detection techniques with the current trends and future challenges [19]. Wang et al. proposed a pulmonary nodule CAD based on semisupervised extreme learning machine [20]. Xie et al. proposed an automated pulmonary nodule detection system with 2D convolutional neural network (CNN) on LUNA16 dataset [21].

In this study, we have proposed fully automated computer-aided pipeline for the detection of pulmonary nodules and classification of benign/malign nodules in early stage. The contributions of this paper are (1) to review the systematic literature review; (2) to present the state of the art detection of pulmonary nodules and classification of lung cancer; (3) to propose the novel preprocessing method (LUVEM) for the lung volume extraction; (4) to suggest a novel candidate lung nodule detection method using CHT; (5) to design a holistic pipeline for the detection of pulmonary nodules as well as lung cancer; (6) the detailed comparison of feature extraction methods for lung nodule detection; and (7) to perform the detailed performance evaluation, high true detection rate, and low false detection rate for nodule detection and classification.

3. Architecture of the Computer-Aided Pipeline

Designed pipeline consists of four main stages such as image preprocessing (Stage I), lung nodule detection (Stage II), nodule feature computation (Stage III), and nodule classification (Stage IV). The work flow of the pipeline is presented in Figure 1.

Figure 1.

Figure 1

Work flow of the designed pipeline to detect lung cancer. The system consists of four stages: Stage I—enhancement of lung CT image and a novel lung volume segmentation method (LUVEM), Stage II—candidate nodule detection using CHT and segmentation of lung nodules using SOM, Stage III—computing of lung nodule features and reduction features using PCA, and Stage IV—classification of malign and benign lung nodule using PNN.

3.1. Lung Image Preprocessing

In the first step of the image preprocessing stage, reading of CT images is performed. The CT scans obtained for the work are stored as DICOM (Digital Imaging and Communications in Medicine) files [22].

The goal of image enhancement step is to prevent misleading results that may occur in subsequent processes. Thus, we firstly implemented the median filter to remove unnecessary noises and enhance the images. Moreover, the sharpening of nodule contours is an important step for the detection of nodules. Laplacian filter was used in our study. So, nodules on lung region were able to be detected more accurately. Furthermore, histogram equalization was also used in enhancement step in order to minimize contrast differences which occur due to scanning errors and to remove unnecessary grains.

In lung volume extraction step of image preprocessing stage, extracting of the lung region from CT image is performed. There are many methods for extracting lung volume from a lung CT scan [9, 10, 2325]. However, these methods are complex, and they require more processing overhead. In some cases, these methods may lead to losses of information about lung regions or cause noise. The purpose of this step is to extract the lung region completely from the full lung CT image. Therefore, a simple but effective and novel method has been proposed in this study for lung volume extraction named as lung volume extraction method (LUVEM). The pseudocode of LUVEM is shown in Algorithm 1.

Algorithm 1.

Algorithm 1

The pseudocode of lung volume extraction method (LUVEM).

In LUVEM method, lung lobes are extracted from CT images with the help of morphological operations. LUVEM removes unrelated segments on the sides and edges of the preprocessed image and obtain the lung region successful. In the algorithm, input image is firstly converted to double-formatted image. Afterwards, 1 or 0 values are assigned to each pixel of double-formatted image according to low and high threshold values. The low and high threshold values are determined 0.25 and 0.65 in this algorithm, respectively. The method removes the bright areas on the edges of the lung CT image since their average values change between low and high values. After this process, the image is converted to binary format and performed morphological operations which are eroding, dilating, and filling, respectively. Finally, the image is again converted to gray-scale format. The segmentation examples of LUVEM can be seen in Figure 2. It is clearly seen that LUVEM can successfully extract the lung volume. In addition, quantitative evaluation of LUVEM will be reported below.

Figure 2.

Figure 2

Extraction examples of lung regions: (a) preprocessed images and (b) lung volume extraction using LUVEM.

3.2. Lung Nodule Detection

The first step of lung nodule detection is candidate nodule detection. The nodule candidates in volume should be detected before nodule segmentation. The lung volume includes vessels and nodules. Moreover, the density of nodules, vessels, and lungs is different from each other [26]. Since the lung nodules have a circular and helical structure, they can be differentiated by means of circularity determination. Many methods have been suggested for identifying the round objects. Circular Hough transform (CHT), which proposed by Duda et al. [27], is one of the most successful method [28] for detection of round objects on the images. In this study, CHT operations are used for candidate nodule detection. CHT can detect the round object in the image; moreover, it can also detect the noncircular object by means of some operations. The image dataset is divided into 3 categories according to the nodule size such as <10 mm, 10–20 mm, and >20 mm. In order to detect the nodules in different size by CHT, three minimum and maximum radiuses such as 3–12 mm, 10–20 mm, and 15–45 mm are determined. In Figure 3, it is shown that the examples of determination of candidate nodules on CT images.

Figure 3.

Figure 3

Determination of candidate lung nodule using CHT.

The second step of lung nodule detection is nodule segmentation. In this study, SOM [29] is proposed to segment nodules on CT images. SOM is an unsupervised neural network learning [30] method. It can perform on large/complex datasets [31, 32]. Furthermore, it designs data maps that can be interpreted easily. In addition to these advantages, SOM can easily segment very small nodules on the lung CT images [3]. The examples of segmented lung nodule images using SOM are shown in Figure 4.

Figure 4.

Figure 4

Examples of segmented lung nodules: (a) images of extracted lung volume and (b) segmentation of lung nodule using SOM.

3.3. Nodule Feature Computation

Generally, CAD systems segment lung nodules for the determination of nodule candidates, and then features extract from the candidate nodules. The popular features are geometric feature, gray level features, gradient features, and energy level features. Therefore, we extracted 2D significative features from lung CT images to discriminate benign or malign nodule. Firstly, we used shape-based features for analyzing nodule geometry. We used first-order statistical features to obtain global statistic about nodule region. Moreover, we utilized gray level co-occurrence matrix (GLCM) texture features for gray level statistic of nodules. Finally, we extracted wavelet decomposition transform features to obtain the energy feature of nodules. All computed features are extracted from the slice of the segmented object.

First-order statistical features (SSF) of an image are calculated from the gray level histogram values of an image [33]. In this study, 6 basic features such as standard deviation, entropy, means, skewness, kurtosis, and variance were extracted by SSF using the histogram values of a gray level lung CT image. Shape-based features (SBF) allow feature extraction from the image by using geometric parameters [34]. Shape features give some information about an image such as sharpness, circularity, and convexity. In this study, a total of 16 shape features were extracted to facilitate the determination of nodule type from CT lung images. Statistical features of a gray level image (GTF) of a texture are first derived with the help of GLCM texture features proposed by Haralick [35, 36]. GLCM method shows the relationship between pixels of different gray level and is widely used in applications of medical image processing. In this study, a total of 88 features were extracted with GLCM from 0°, 45°, 90°, and 135° angle directions in d = 2 distance. Wavelet decomposition transform can denote distribution of energy features of different regions (TEF). ROI of the CT image is divided into four subbands with 2D wavelet decomposition. Three images are created in low frequencies, and an image is created in high frequencies with wavelet decomposition transform from an image [37]. In this study, 13 energy features of an image are extracted with wavelet decomposition. The number of features extracted by each feature extraction method used in this study is presented in Table 1.

Table 1.

The number of extracted 2D features from lung CT images.

Feature extraction method Number of feature Order
SSF 6 0–6
SBF 16 7–22
GTF 22 ∗ 4 = 88 23–110
TEF 13 111–123

On lung CT images, malign nodules are generally more complex and irregular, while benign nodules are rounder with certain borders. Most of the benign nodules have small variance values. However, malign nodules show relatively higher variance values [3]. Figure 5 shows the examples of benign and malign lung nodules on CT images.

Figure 5.

Figure 5

Examples of benign (a) and malign (b) lung nodules.

Since 123 features extracted are rather large in size, they may negatively affect accuracy during classification step. Thus, selecting the most appropriate features instead of using all features will be a more efficient method. We used PCA method for dimension reduction of feature vector. PCA is used to reduce dimensionality of large dataset [38, 39]. We can select a number of features only up to one-third of the number of data (patterns) in the smaller of the two classes. Thus, for our work, the smallest class has 104 patterns (benign nodules), and since we split the data to half, one-third of 52 is around 17. Therefore, we selected with PCA the most appropriate 17 features (components) from 123 features. Figure 6 denotes principal component analysis of extracted features with cumulative variance. As can be seen from the chart in Figure 6, it is seen that the variance of the first 20 components is more selective.

Figure 6.

Figure 6

Principal component analysis of extracted features with cumulative variance.

3.4. Nodule Classification

In the proposed pipeline, we have used a probabilistic neural network (PNN) model to make automated decision about the nodule types (benign or malignant). PNN is an effective tool for many classification implementations and can easily make classifications [40, 41]. Figure 7 presents the architecture of the PNN designed for this study. Neuron number in the input layer is selected 17 according to the number of inputs.

Figure 7.

Figure 7

Probabilistic neural network architecture used in the proposed method for nodule classification.

4. Experimental Results

In this work, we realized all experiments using a PC with i7 processor, 16 GB memory, and Windows 10. Moreover, MATLAB software was used for performance evaluation of the proposed pipeline. In all experiments, leave-one-out cross validation technique was run at the level of nodule. So, all of 220 nodules (104 benign and 116 malignant) were used for both trainings and tests. Figure 8 summarizes the processing steps of the proposed pipeline.

Figure 8.

Figure 8

Processing steps of the proposed pipeline: (a) original DICOM image; (b) image preprocessing and enhancement; (c) lung volume extraction from CT scan; (d) detection of candidate nodules; (e) segmentation of nodules; (f) classification of nodules.

4.1. CT Lung Dataset

In this study, an image dataset was prepared for the proposed pipeline. CT examinations were realized by using a helical CT scanner from Sincan Nafiz Korez Hospital. Its acquisition parameters are slice collimation 1.0 mm and slice width 1 mm. Scans were acquired in 130 kV and 75 mAs. The size of the images was 512 × 512 pixels. The images were stored as DICOM format files. The database consists of 47 CT scans from 47 different patients. 35 of volunteer patients are male and remaining of them are female. Their ages are between 30 and 79 (mean 58.7 ± 10.5 years). All patients agreed that they have a legal and moral right to accurate and reliable information for the scan. These patients should be given clearly the diagnosis and prognosis with a simple language. There are a total of 9504 CT modality images in the database, and the number of CT slices per scan varies between 116 and 283. After the CT scan, the physician provided the selection of the slice where the nodule is fully visible. 1128 ROI, which includes a total of 220 nodules (104 benign and 116 malignant), were selected from 9504 CT images with the help of a lung physician and three experienced radiologists in the lung parenchyma. This process has been conducted by means of an annotation tool. The nodules were also approved by biopsy. Sizes of nodules change from 3 to 65 mm in diameter. The size distribution of the nodules is shown in Figure 9.

Figure 9.

Figure 9

Size distribution of benign and malignant nodules in the image dataset.

4.2. Validation of LUVEM with Evaluation Metrics

Proposed lung volume extraction method (LUVEM) in this study is compared with the standard manual segmentations using measurement metrics. We evaluate manual segmentations of the expert and automated segmentations of LUVEM using two popular overlap measures. We used a segmentation software tool developed by us for manual segmentation on the dataset. The software tool outlines edges automatically, presenting us to obtain contours of the nodule boundaries. The metrics evaluate the overlapping between the two sets. The first overlap metric, represents the Jaccard coefficient (union overlap), defined as intersection over manual and automatic segmentations and measures the similarity of the S1 and S2 sets [42]. Our second overlap metric, the Dice coefficient (mean overlap), gives double the weight to agreements between the two sets [43]. Jaccard and Dice metrics are denoted in the following equations:

Jaccard=S1S2S1S2,Dice=2S1S2S1+S2. (1)

We show the overlap metrics (Jaccard and Dice) that result from both LUVEM and Otsu's method. These results are the comparison of automated segmentations of LUVEM and Otsu methods with manual segmentation on 254 lung CT image in our database. The results in Table 2 showed LUVEM is higher in Jaccard overlap (0.867) and Dice overlap (0.938) than Otsu's method.

Table 2.

The Jaccard and Dice metrics measures for LUVEM.

Jaccard overlap Dice overlap
Otsu's method 0.587 ± 0.093 0.786 ± 0.088
LUVEM 0.867 ± 0.051 0.938 ± 0.032

4.3. Detection Rates

Confusion matrixes of classification results with PNN according to each feature extraction and PCA method are presented in Table 3. As shown in Table 3, the usage of PCA affects the detection performance of the pipeline positively. Moreover, the usage of combined features extraction methods with PCA gives best success rate.

Table 3.

Confusion matrixes for feature extraction methods.

FE method Classification results without PCA Classification results with PCA
TP FP FN TN TP FP FN TN
SSF 90 22 26 82 90 22 26 82
SBF 105 17 11 87 109 12 7 92
GTF 101 19 15 85 110 11 6 93
TEF 109 15 7 89 111 11 5 93
All FE methods (combined) 111 12 5 92 113 6 3 98

Table 4 presents the values of performance criteria obtained in the classification results of the proposed pipeline when feature extraction methods were used separately and together. According to the table, performance values are more successful when all feature extraction methods are used together. Accuracy (Acc) was found to be 92.27% when 123 features were used in classification without feature selection through PCA, and this rate was found to be 95.91% with the use of PCA. Similarly, more successful results were obtained in sensitivity (Sen), specificity (Spc), positive decision value (PDV), negative decision value (NDV), and F1 score criteria as presented in Table 4.

Table 4.

Overall performance results of proposed pipeline.

Performance criteria Classification results without PCA Classification results with PCA
SSF SBF GTF TEF All SSF SBF GTF TEF All
Acc 78.18 87.27 84.55 90.00 92.27 78.18 91.36 92.27 90.00 95.91
Sen 77.57 90.52 87.07 93.97 95.67 77.57 93.97 94.83 95.67 97.42
Spc 78.85 83.65 81.73 85.58 88.46 78.85 88.46 89.42 89.42 94.24
PDV 80.36 86.07 84.17 87.90 90.24 80.36 90.08 90.91 90.98 94.96
NDV 79.93 88.78 85.00 92.71 94.85 79.93 92.93 93.94 94.90 97.03
F1 0.79 0.88 0.85 0.92 0.93 0.79 0.92 0.93 0.94 0.96

Since our CT image database was divided into 3 groups according to the size of nodules such as <10, 10–20, and> 20 mm, we also realized a performance evaluation according to size group of the nodules. These experiments were realized with all together feature extraction method using PCA. Table 5 presents the result of detection performance depending upon nodule size. As shown in Table 5, the proposed pipeline can classify even small nodules with high success rates. Overall detection result of proposed pipeline according to the nodule size is 95.91%.

Table 5.

Assessment of performance measurement criteria according to nodule size.

Nodule size (mm) The number of nodule Confusion matrix Performance criteria
TP FP FN TN Acc Sen Spc PDV NDV F1
<10 75 3 4 0 68 94.67 100 94.45 42.86 100 0.60
10–20 65 43 1 1 20 96.92 97.73 95.24 97.73 95.24 0.98
>20 80 67 1 2 10 96.25 97.10 90.91 98.53 83.34 0.98
Overall 220 113 6 3 98 95.91 97.42 94.24 94.96 97.03 0.96

Receiver operator characteristic (ROC) curve is another popular performance evaluation criteria used in detection systems. Area under an ROC curve is measured according to sensitivity and specificity values of system. This area shows how the system is successful. Therefore, we also present ROC curve of our proposed detection system. Figure 10 shows ROC curve of the system obtained classification results of each lung nodule group and overall system. As seen in this graphic, area under ROC curve and true positive rate of small size nodules are lower than big size nodules. Here, as can be seen from this figure, if the nodule size is too large and too small, the success rate decreases.

Figure 10.

Figure 10

ROC curve of classification precision in proposed pipeline in different nodule diameter.

Processing time is another performance criterion that we have used for the evaluation of the proposed pipeline. Longest time is needed for nodule detection step due to the use of SOM method for segmentation. Since SOM is an ANN model, it has a lot of time-consuming mathematical operations. In average, classification of a CT image as benign or malignant takes 2–3 seconds approximately. It can be accepted as a reasonable time period when it is compared with the time it needs for a physician to make decisions.

5. Conclusions and Discussion

In this study, a fully automated pipeline was proposed to classify benign and malign lung nodules on CT images. By means of the designed pipeline, nodule detection as well as benign/malign distinction was performed with high accuracy, sensitivity and specificity rates. Moreover, it was designed a preprocessing method called LUVEM for extracting the lung volume from CT images. SOM method was used to allow successful detection of lung nodules in early stages. According to the detailed experiment performed on large dataset with combined features, the proposed pipeline can differentiate benign/malign nodules with high accuracy rates such as 94.68% (3–10 mm), 96.92% (10–20 mm), and 96.25% (>20 mm) using PNN. The proposed pipeline can be used by the physicians as a supplementary tool for benign and malign nodule classification.

We evaluated the performance of the pipeline on Lung Imaging Database Consortium-Image Database Resource Initiative (LIDC-IDRI) as well [50]. LIDC-IDRI dataset is the largest publicly available reference database for detection of lung nodules. We choose LIDC-IDRI dataset since it contains almost all the related information for lung CT including annotations on nodule sizes, locations, diagnosis results, and other information. We collected a total of 38 lung nodules from the dataset, including 26 malignant and 12 benign nodules. According to the evolution results on the proposed pipeline, accuracy obtained 84.21% using all FE methods and PCA. In this test, F1 score result was found as 0.88. The obtained performance evaluation values of proposed pipeline on LIDC-IDRI dataset are denoted in Table 6.

Table 6.

The performance evaluation of proposed pipeline on LIDC-IDRI.

The number of nodules Confusion matrix Performance criteria
TP FP FN TN Acc Sen Spc PDV NDV F1
38 22 2 4 10 84.21 84.62 83.33 91.67 71.43 0.88

There are some advantages of the proposed pipeline compared to the state-of-the-art systems. Firstly, the proposed pipeline has two diagnosis possibilities. It can perform nodule detection together with nodule classification. Second advantage of the proposed pipeline is to provide the detection of small nodules in the lung with the use of SOM method during segmentation step. This is remarkable in terms of early detection of lung cancer. Third advantage of the proposed pipeline is to have relatively high detection performance. Accuracy, sensitivity, and specificity of the system were calculated as 95.91%, 97.42%, and 94.24%, respectively. It is fairly difficult to compare formerly reported CAD systems due to different datasets, nodule types, sizes, and validation methods. We picked out some CAD systems to compare their performances. Some of them [23, 24, 4446] used the LIDC database [4749], and the other used their own databases. Table 7 denotes the comparison of the proposed pipeline with some CAD systems. When the results are analyzed, our pipeline has high sensitivity on our CT image dataset.

Table 7.

The comparison of our pipeline with previously published CADs.

CAD system CT image database Number of cases Nodule size(mm) Sensitivity (%) Average FPR
Dehmenski et al. [9] Their own database 70 3–20 90.0 14.6
Suarez-Cuenca et al. [10] Their own database 22 4–27 80.0 7.7
Opfer and Wiemeker [46] LIDC database [47, 48, 50] 93 ≥4 74.0 4
Rubin et al. [51] Their own database 20 ≥3 76 3
Sahiner et al. [49] LIDC database [47, 48, 50] 48 3–36.4 79 4.9
Messay et al. [24] LIDC database [47, 48, 50] 84 3–30 82.66 3
Suzuki et al. [52] Their own database 101 8–20 80.3 16.1
Park et al. [53] Their own database 38 Indefinite 80
Choi and Choi [23] LIDC database [47, 48, 50] 32 3–30 94.1 5.45
Choi and Choi [44] LIDC database [47, 48, 50] 58 3–30 95.28 2.27
Proposed method Our database 47 3–35 97.42 4.54

Acknowledgments

The author would like to thank representatives of Sincan Nafiz Korez Hospital for creating the dataset.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The author declares that there are no conflicts of interest regarding the publication of this paper.

References

  • 1.Siegel R., Naishadham D., Jemal A. Cancer statistics. CA: A Cancer Journal for Clinicians. 2012;62(1):10–29. doi: 10.3322/caac.20138. [DOI] [PubMed] [Google Scholar]
  • 2.Jacobs C., Van Rikxoort E. M., Twellmann T., et al. Automatic detection of subsolid pulmonary nodules in thoracic computed tomography images. Medical Image Analysis. 2014;18(2):374–84. doi: 10.1016/j.media.2013.12.001. [DOI] [PubMed] [Google Scholar]
  • 3.Dandil E., Cakiroglu M., Eksi Z., Ozkan M., Kurt O. K., Canan A. Artificial neural network-based classification system for lung nodules on computed tomography scans. Proceedings of 2014 6th International Conference of Soft Computing and Pattern Recognition (SoCPaR); August 2014; Tunis, Tunisia. pp. 382–86. [Google Scholar]
  • 4.Howlader N., Noone A., Krapcho M., et al. SEER Cancer Statistics Review, 1975-2011. Bethesda, MD, USA: National Cancer Institute; 2014. [Google Scholar]
  • 5.Han H., Li L., Han F., Song B., Moore W., Liang Z. Fast and adaptive detection of pulmonary nodules in thoracic CT images using a hierarchical vector quantization scheme. IEEE Journal of Biomedical and Health Informatics. 2015;19(2):648–659. doi: 10.1109/jbhi.2014.2328870. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Jeong Y. J., Yi C. A., Lee K. S. Solitary pulmonary nodules: detection, characterization, and guidance for further diagnostic workup and treatment. American Journal of Roentgenology. 2007;188(1):57–68. doi: 10.2214/ajr.05.2131. [DOI] [PubMed] [Google Scholar]
  • 7.Ozekes S., Camurcu A. Y. Automatic Lung Nodule Detection Using Template Matching. Berlin, Germany: Springer; 2006. [Google Scholar]
  • 8.Schilham A. M., Van Ginneken B., Loog M. A computer-aided diagnosis system for detection of lung nodules in chest radiographs with an evaluation on a public database. Medical Image Analysis. 2006;10(2):247–58. doi: 10.1016/j.media.2005.09.003. [DOI] [PubMed] [Google Scholar]
  • 9.Dehmeshki J., Ye X., Lin X., Valdivieso M., Amin H. Automated detection of lung nodules in CT images using shape-based genetic algorithm. Computerized Medical Imaging and Graphics. 2007;31(6):408–17. doi: 10.1016/j.compmedimag.2007.03.002. [DOI] [PubMed] [Google Scholar]
  • 10.Suarez-Cuenca J. J., Tahoces P. G., Souto M., et al. Application of the iris filter for automatic detection of pulmonary nodules on computed tomography images. Computers in Biology and Medicin. 2009;39(10):921–33. doi: 10.1016/j.compbiomed.2009.07.005. [DOI] [PubMed] [Google Scholar]
  • 11.Murphy K., Van Ginneken B., Schilham A. M., De Hoop B. J., Gietema H. A., Prokop M. A large-scale evaluation of automatic pulmonary nodule detection in chest CT using local image features and k-nearest-neighbour classification. Medical Image Analysis. 2009;13(5):757–70. doi: 10.1016/j.media.2009.07.001. [DOI] [PubMed] [Google Scholar]
  • 12.Giger M. L., Bae K. T., Macmahon H. Computerized detection of pulmonary nodules in computed tomography images. Investigative Radiology. 1994;29(4):459–65. doi: 10.1097/00004424-199404000-00013. [DOI] [PubMed] [Google Scholar]
  • 13.Hasegawa J., Mori K., Toriwaki J., Anno H., Katada K. Automated extraction of lung cancer lesions from multi-slice chest CT images by using three-dimensional image processing. Systems and Computers in Japan. 1994;25(11):68–77. doi: 10.1002/scj.4690251107. [DOI] [Google Scholar]
  • 14.Kanazawa K., Kawata Y., Niki N., et al. Computer-aided diagnosis for pulmonary nodules based on helical CT images. Computerized Medical Imaging and Graphics. 1998;22(2):157–67. doi: 10.1016/s0895-6111(98)00017-2. [DOI] [PubMed] [Google Scholar]
  • 15.Suzuki K., Li F., Sone S., Doi K. Computer-aided diagnostic scheme for distinction between benign and malignant nodules in thoracic low-dose CT by use of massive training artificial neural network. IEEE Transactions on Medical Imaging. 2005;24(9):1138–50. doi: 10.1109/tmi.2005.852048. [DOI] [PubMed] [Google Scholar]
  • 16.Sun T., Wang J., Li X., et al. Comparative evaluation of support vector machines for computer aided diagnosis of lung cancer in CT based on a multi-dimensional data set. Computer Methods and Programs in Biomedicine. 2013;111(2):519–24. doi: 10.1016/j.cmpb.2013.04.016. [DOI] [PubMed] [Google Scholar]
  • 17.Kuruvilla J., Gunavathi K. Lung cancer classification using neural networks for CT images. Computer Methods and Programs in Biomedicine. 2014;113(1):202–9. doi: 10.1016/j.cmpb.2013.10.011. [DOI] [PubMed] [Google Scholar]
  • 18.Javaid M., Javid M., Rehman M. Z. U., Shah S. I. A. A novel approach to CAD system for the detection of lung nodules in CT images. Computer Methods and Programs in Biomedicine. 2016;135:125–139. doi: 10.1016/j.cmpb.2016.07.031. [DOI] [PubMed] [Google Scholar]
  • 19.ur Rehman M. Z., Javaid M., Shah S. I. A., Gilani S. O., Jamil M., Butt S. I. An appraisal of nodules detection techniques for lung cancer in CT images. Biomedical Signal Processing and Control. 2018;41:140–151. doi: 10.1016/j.bspc.2017.11.017. [DOI] [Google Scholar]
  • 20.Wang Z., Xin J., Sun P., Lin Z., Yao Y., Gao X. Improved lung nodule diagnosis accuracy using lung CT images with uncertain class. Computer Methods and Programs in Biomedicine. 2018;162:197–209. doi: 10.1016/j.cmpb.2018.05.028. [DOI] [PubMed] [Google Scholar]
  • 21.Xie H., Yang D., Sun N., Chen Z., Zhang Y. Automated pulmonary nodule detection in CT images using deep convolutional neural networks. Pattern Recognition. 2019;85:109–119. doi: 10.1016/j.patcog.2018.07.031. [DOI] [Google Scholar]
  • 22. DICOM (Digital Imaging and Communications in Medicine), http://medical.nema.org.
  • 23.Choi W. J., Choi T. S. Genetic programming-based feature transform and classification for the automatic detection of pulmonary nodules on computed tomography images. Information Sciences. 2012;212:57–78. doi: 10.1016/j.ins.2012.05.008. [DOI] [Google Scholar]
  • 24.Messay T., Hardie R. C., Rogers S. K. A new computationally efficient CAD system for pulmonary nodule detection in CT imagery. Medical Image Analysis. 2010;14(3):390–406. doi: 10.1016/j.media.2010.02.004. [DOI] [PubMed] [Google Scholar]
  • 25.Ye X., Lin X., Dehmeshki J., Slabaugh G., Beddoe G. Shape-based computer-aided detection of lung nodules in thoracic CT images. IEEE Transactions on Biomedical Engineering. 2009;56(7):1810–1820. doi: 10.1109/tbme.2009.2017027. [DOI] [PubMed] [Google Scholar]
  • 26.Akram S., Javed M. Y., Hussain A., Riaz F., Usman Akram M. Intensity-based statistical features for classification of lungs CT scan nodules using artificial intelligence techniques. Journal of Experimental and Theoretical Artificial Intelligence. 2015;27(6):1–15. doi: 10.1080/0952813x.2015.1020526. [DOI] [Google Scholar]
  • 27.Duda R. O., Hart P. E. Use of Hough transformation to detect lines and curves in pictures. Communications of the ACM. 1972;15(1):11–15. doi: 10.1145/361237.361242. [DOI] [Google Scholar]
  • 28.Yazid H., Yazid H., Harun M., et al. Circular discontinuities detection in welded joints using Circular Hough Transform. NDT and E International. 2007;40(8):594–601. doi: 10.1016/j.ndteint.2007.05.004. [DOI] [Google Scholar]
  • 29.Kohonen T. Self-organizing maps of massive databases. Engineering Intelligent Systems for Electrical Engineering and Communications. 2001;9(4):179–185. [Google Scholar]
  • 30.Haykin S. Neural Networks. Hamilton, ON, Canada: A Comprehensive Foundation; 1999. [Google Scholar]
  • 31.Chi D. Self-organizing map-based color image segmentation with k-means clustering and saliency map. ISRN Signal Processing. 2011;2011:18. doi: 10.5402/2011/393891.393891 [DOI] [Google Scholar]
  • 32.Hollitt C. A convolution approach to the circle Hough transform for arbitrary radius. Machine Vision and Applications. 2013;24(4):683–694. doi: 10.1007/s00138-012-0420-x. [DOI] [Google Scholar]
  • 33.Akilandeswari U., Nithya R., Santhi B. Review on feature extraction methods in pattern classification. European Journal of Scientific Research. 2012;71(2):265–272. [Google Scholar]
  • 34.Yang M., Kpalma K., Ronsin J. A survey of shape feature extraction techniques. Pattern Recognition. 2008:43–90. [Google Scholar]
  • 35.Clausi D. A. An analysis of co-occurrence texture statistics as a function of grey level quantization. Canadian Journal of Remote Sensing. 2002;28(1):45–62. doi: 10.5589/m02-004. [DOI] [Google Scholar]
  • 36.Haralick R. M., Shanmuga K., Dinstein I. Textural features for image classification. IEEE Transactions on Systems Man and Cybernetics. 1973;SMC-3(6):610–621. doi: 10.1109/tsmc.1973.4309314. [DOI] [Google Scholar]
  • 37.Van G., Wouver P., Scheunders D., Van D. Statistical texture characterization from discrete wavelet representation. IEEE Transactions on Image Processing. 1998;8(4):592–598. doi: 10.1109/83.753747. [DOI] [PubMed] [Google Scholar]
  • 38.Camdevyren H., Demyr N., Kanik A., Keskyn S. Use of principal component scores in multiple linear regression models for prediction of Chlorophyll-a in reservoirs. Ecological Modelling. 2005;181(4):581–589. doi: 10.1016/j.ecolmodel.2004.06.043. [DOI] [Google Scholar]
  • 39.Chen L. H., Chang S. Y. An adaptive learning algorithm for principal component analysis. IEEE Transactions on Neural Networks. 1995;6(5):1255–1263. doi: 10.1109/72.410369. [DOI] [PubMed] [Google Scholar]
  • 40.Mao K. Z., Tan K. C., Ser W. Probabilistic neural-network structure determination for pattern classification. IEEE Transactions on Neural Networks. 2000;11(4):1009–1016. doi: 10.1109/72.857781. [DOI] [PubMed] [Google Scholar]
  • 41.Specht D. F. Probabilistic neural networks. Neural Networks. 1990;3(1):109–118. doi: 10.1016/0893-6080(90)90049-q. [DOI] [PubMed] [Google Scholar]
  • 42.Jaccard P. The distribution of the flora in the alpine zone. New Phytologist. 1912;11(2):37–50. doi: 10.1111/j.1469-8137.1912.tb05611.x. [DOI] [Google Scholar]
  • 43.Dice L. R. Measures of the amount of ecologic association between species. Ecology. 1945;26(3):297–302. doi: 10.2307/1932409. [DOI] [Google Scholar]
  • 44.Choi W.-J., Choi T.-S. Automated pulmonary nodule detection system in computed tomography images: a hierarchical block classification approach. Entropy. 2013;15(2):507–523. doi: 10.3390/e15020507. [DOI] [Google Scholar]
  • 45.Erasmus J. J., Connolly J. E., Mcadams H. P., Roggli V. L. Solitary pulmonary nodules: Part I. Morphologic evaluation for differentiation of benign and malignant lesions 1. Radiographics. 2000;20(1):43–58. doi: 10.1148/radiographics.20.1.g00ja0343. [DOI] [PubMed] [Google Scholar]
  • 46.Opfer R., Wiemker R. Performance analysis for computer aided lung nodule detection on LIDC data. Medical Imaging 2007: Image Perception, Observer Performance, and Technology Assessment. 2007;6515:p. C5151. doi: 10.1117/12.708210. art. no. 65151C. [DOI] [Google Scholar]
  • 47.Mcnitt-Gray M. F., Armato S. G., Meyer C. R., et al. The Lung Image Database Consortium (LIDC) data collection process for nodule detection and annotation. Academic Radiology. 2007;14(12):1464–1474. doi: 10.1016/j.acra.2007.07.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Reeves A. P., Biancardi A. M., Apanasovich T. V., et al. The Lung Image Database Consortium (LIDC): a comparison of different size metrics for pulmonary nodule measurements. Academic Radiology. 2007;14(12):1475–1485. doi: 10.1016/j.acra.2007.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Sahiner B., Hadjiiski L. M., Chan H. P., et al. Effect of CAD on radiologists’ detection of lung nodules on thoracic CT scans: observer performance study. Medical Imaging 2007: Image Perception, Observer Performance, and Technology Assessment. 2007;6515:D5151–D5151. doi: 10.1117/12.709851. art. no. 65151D. [DOI] [Google Scholar]
  • 50.Armato S. G., III, Mclennan G., Mcnitt-Gray M. F., et al. Lung image database consortium: developing a resource for the medical imaging research community 1. Radiology. 2004;232(3):739–748. doi: 10.1148/radiol.2323032035. [DOI] [PubMed] [Google Scholar]
  • 51.Rubin G. D., Lyo J. K., Paik D. S., et al. Pulmonary nodules on multi–detector row CT scans: performance comparison of radiologists and computer-aided detection 1. Radiology. 2005;234(1):274–283. doi: 10.1148/radiol.2341040589. [DOI] [PubMed] [Google Scholar]
  • 52.Suzuki K., Armato S. G., III, Li F., Sone S., Doi K. Massive training artificial neural network (MTANN) for reduction of false positives in computerized detection of lung nodules in low-dose computed tomography. Medical Physics. 2003;30(7):1602–1617. doi: 10.1118/1.1580485. [DOI] [PubMed] [Google Scholar]
  • 53.Park S. C., Tan J., Wang X., et al. Computer-aided detection of early interstitial lung diseases using low-dose CT images. Physics in Medicine and Biology. 2011;56(4):1139–53. doi: 10.1088/0031-9155/56/4/016. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.


Articles from Journal of Healthcare Engineering are provided here courtesy of Wiley

RESOURCES