Discovery radiomics via evolutionary deep radiomic sequencer discovery for pathologically proven lung cancer detection

Mohammad Javad Shafiee; Audrey G Chung; Farzad Khalvati; Masoom A Haider; Alexander Wong

doi:10.1117/1.JMI.4.4.041305

. 2017 Oct 6;4(4):041305. doi: 10.1117/1.JMI.4.4.041305

Discovery radiomics via evolutionary deep radiomic sequencer discovery for pathologically proven lung cancer detection

Mohammad Javad Shafiee ^a,^*, Audrey G Chung ^a, Farzad Khalvati ^b, Masoom A Haider ^b, Alexander Wong ^b

PMCID: PMC5629455 PMID: 29021990

Abstract.

While lung cancer is the second most diagnosed form of cancer in men and women, a sufficiently early diagnosis can be pivotal in patient survival rates. Imaging-based, or radiomics-driven, detection methods have been developed to aid diagnosticians, but largely rely on hand-crafted features that may not fully encapsulate the differences between cancerous and healthy tissue. Recently, the concept of discovery radiomics was introduced, where custom abstract features are discovered from readily available imaging data. We propose an evolutionary deep radiomic sequencer discovery approach based on evolutionary deep intelligence. Motivated by patient privacy concerns and the idea of operational artificial intelligence, the evolutionary deep radiomic sequencer discovery approach organically evolves increasingly more efficient deep radiomic sequencers that produce significantly more compact yet similarly descriptive radiomic sequences over multiple generations. As a result, this framework improves operational efficiency and enables diagnosis to be run locally at the radiologist’s computer while maintaining detection accuracy. We evaluated the evolved deep radiomic sequencer (EDRS) discovered via the proposed evolutionary deep radiomic sequencer discovery framework against state-of-the-art radiomics-driven and discovery radiomics methods using clinical lung CT data with pathologically proven diagnostic data from the LIDC-IDRI dataset. The EDRS shows improved sensitivity (93.42%), specificity (82.39%), and diagnostic accuracy (88.78%) relative to previous radiomics approaches.

Keywords: discovery radiomics, radiomic sequencing, lung cancer, evolutionary deep intelligence, evolved deep radiomic sequencer

1. Introduction

Lung cancer is the second most diagnosed form of cancer in men and women after prostate cancer and breast cancer, respectively. In 2016, lung cancer accounted for an estimated 158,080 deaths ( $\sim 27 %$ of cancer deaths) and 224,390 new cases in Americans.¹ Similarly, lung cancer accounted for an estimated 20,800 deaths ( $\sim 26 %$ of cancer deaths) and 28,400 new cases in Canadians.² Early detection of lung cancer can significantly impact the patient survival rate, making efficient and reliable lung cancer screening methods crucial.

Imaging-based cancer detection or radiomics-driven methods have recently grown in popularity to help streamline the cancer screening process and increase diagnostic consistency. Referring to the extraction and analysis of large amounts of quantitative features from medical imaging data, radiomics³ allows for the creation of a high-dimensional abstract feature space that can be utilized for cancer detection via the detailed characterization of cancer phenotypes. The prognostic potential of radiomics has previously been demonstrated in studies on lung and head-and-neck cancer patients.⁴^,⁵ Aerts et al.⁴ introduced a comprehensive study spanning over 1000 patients across seven datasets to demonstrate the application of radiomics toward differentiating between tumor phenotypes, indicating clinical and prognostic implications. In addition, radiomics has shown promise in combination with multiparametric magnetic resonance imaging for breast cancer detection⁶ and prostate cancer detection.⁷^,⁸

Radiomics-driven methods have previously been developed for malignant lung nodule detection using computed tomography (CT) images.⁹^–¹² Anirudh et al.⁹ used weakly labeled lung data from the SPIE-LUNGx dataset to train a three-dimensional convolutional neural network (CNN) and generate radiomic sequences for lung nodule detection. In contrast, Orozco et al.¹⁰ generated wavelet-based radiomic sequences and demonstrated the effectiveness of wavelet-based features using a subset of images from the early lung cancer action project and lung image database consortium (LIDC) datasets.

Shen et al.¹¹ proposed multiscale CNN, a hierarchical framework for extracting discriminative features from lung nodules. Specifically, the framework is comprised of alternating, stacked layers, and uses multiscale nodule patches to learn class-specific features. More recently, Shen et al. extended their previous work to malignancy suspiciousness classification.¹² In addition, the extension simplified the training process via a multicrop pooling architecture. An important aspect of these aforementioned radiomics-driven methods is that they leverage radiologist-driven nodule annotations for predicting the malignancy of lung nodules, rather than using pathology-proven data.

There are relatively few radiomics-driven methods that perform lung cancer detection using pathology-proven diagnostic data.¹³^,¹⁴ Kumar et al.¹³ introduced an unsupervised deep autoencoder for feature extraction with a binary decision tree classifier for lung nodule classification. Shen et al.¹⁴ proposed a domain-adaptation framework for lung nodule malignancy prediction; more specifically, Shen et al. proposed CNN-multiple instance learning (CNN-MIL) for learning transferable patient-level malignancy knowledge, which combines a CNN model with an MIL model.

Recently, the concept of discovery radiomics was introduced where notions of using predefined, hand-crafted features for cancer detection are bypassed in favor of radiomic sequencers that produce abstract imaging-based features that are discovered directly from the wealth of readily available medical imaging data. This allows for custom-tailored features to be discovered that can better characterize cancerous tissue and distinguish cancer phenotypes relative to conventional features. Discovery radiomics has shown promising results for both prostate cancer¹⁵ and lung cancer¹³^,¹⁶^,¹⁷ detection.

A number of different radiomic sequencers have been proposed within the discovery radiomics framework for the purpose of lung cancer detection. Kumar et al.¹³ introduced the notion of deep autoencoding radiomic sequencers (DARS), which are comprised of a deep autoencoder architecture. Shafiee et al.¹⁶ proposed deep radiomics sequencers based on a deep convolutional StochasticNet¹⁸ architecture, referred to as StochasticNet sequencers. More recently, Kumar et al.¹⁷ leveraged deep radiomic sequencers built upon a deep CNN architecture.

While diagnostically powerful, the discovered radiomic sequencers (DRSs) were both computationally expensive and memory intensive, which could make it difficult for on-site clinical deployment and would require the transfer of patient information to more powerful cloud computing leading to patient privacy concerns. To mitigate computational requirements and increase operating efficiency, we propose an evolutionary deep radiomic sequencer discovery framework for discovering more efficient yet powerful deep radiomic sequencers. Using the concept of evolutionary deep intelligence¹⁹^,²⁰ to mimic biological evolution mechanisms, the proposed evolutionary deep sequencer discovery process discovers progressively more efficient yet diagnostically powerful deep radiomic sequencers over multiple generations. The resulting evolved deep radiomic sequencers (EDRSs) are not only significantly more efficient, thus making them more suitable for on-site clinical deployment, but can provide improved diagnostic performance compared to existing deep radiomic sequencers.

2. Methods

In this section, we will first discuss the concepts behind discovery radiomics and evolutionary deep intelligence. We will then present the proposed evolutionary deep sequencer discovery approach in detail.

2.1. Discovery Radiomics

The idea behind discovery radiomics can be described as follows (see Fig. 1). Given past radiology data and corresponding pathology-verified radiologist tissue annotations from a medical imaging data archive (i.e., provided by Cancer Imaging Archive²¹^,²² and consisting of diagnostic and lung cancer screening thoracic CT), the radiomic sequencer discovery process learns a radiomic sequencer that can extract highly customized radiomic features (which we will refer to as a radiomic sequence) that are tailored for characterizing unique tissue phenotypes that differentiate cancerous tissue from healthy tissue. The DRS can be applied to new patient data to extract the corresponding radiomic sequence for cancer screening and diagnosis purposes.

As discussed earlier, one of the key limitations of previously proposed deep radiomic sequencers¹³^,¹⁷ for the purpose of lung cancer detection is that, while diagnostically powerful, they are both computationally expensive and memory intensive. They usually utilize very deep neural network architectures with a large number of parameters (i.e., which needs a large amount of memory to store) such that a huge set of arithmetic operations are required to generate the radiomic sequence and as a result it needs a fair amount of time to produce the results. This could make it difficult for on-site clinical deployment and would require the transfer of patient information to more powerful cloud computing leading to patient privacy concerns. To mitigate computational requirements and increase operating efficiency to enable on-site clinical deployment, we will leverage the concept of evolutionary deep intelligence¹⁹^,²⁰ to discover highly efficient deep radiomic sequencers that still provide strong diagnostic performance.

2.2. Evolutionary Deep Intelligence

Prior to describing the proposed evolutionary deep sequencer discovery approach, it is first important to discuss the idea behind evolutionary deep intelligence. First introduced by Shafiee et al.,¹⁹ the general idea is to synthesize progressively more efficient deep neural networks over multiple generations. The evolution of deep neural networks is modeled in a probabilistic manner, where the architectural traits of ancestor networks are encoded by a probabilistic DNA. The probabilistic DNA is utilized to mimic biological heredity, and new offspring networks are synthesized stochastically based on this probabilistic model. Each synaptic connectivity is modeled by a probability distribution based on the corresponding weight magnitude in the ancestor network such that the strength of the weight determines the probability of each synapse to be connected in the offspring network. To close the cycle of evolution, environmental factors are applied to the model to mimic random mutation and natural selection. The environmental factor is combined with the probabilistic DNA to enforce how the random mutation should be applied (i.e., what the rate of mutation of synaptic connectivity should be in the offspring network architecture). Loosely speaking, when the environmental factor forces the offspring network architectures to be smaller than their ancestor, this causes a decrease in the chance of each synaptic connectivity to be synthesized in the offspring network such that weaker synaptic connectivity in the ancestor network will have a lower chance of being connected in the offspring network. At each generation, the offspring network (which is more efficient than its parent) is then trained to refine its modeling capabilities and maximize its modeling accuracy.

Figure 2 shows the evolution process visually. As seen, the evolution is initialized using a known network structure as the first generation. The network is trained based on the available training data and the weights associated with each synaptic strength are computed. The underlying heredity of the network (i.e., as the parent network) is encoded by the probabilistic DNA, which is modeled based on the synaptic strengths. The environmental factors are then formulated into the model to account for the requirements needed to be satisfied by the offspring network. The offspring network is then synthesized by taking advantage of random mutation to diversify the offspring network from its ancestors. This process is repeated until all requirements are satisfied by the latest offspring network. Given its ability to produce progressively more efficient yet powerful deep neural networks, we are motivated to leverage the ideas behind evolutionary deep intelligence within the discovery radiomics framework to discover highly efficient yet diagnostically accurate deep radiomic sequencers for the purpose of lung cancer detection.

Fig. 2 — Evolutionary deep intelligence framework. The heredity is encoded by a probabilistic DNA modeling the architectural traits, which should be carried to the next generation. The environmental conditions simulate the factors that must be considered to synthesize an offspring network. The evolutionary approach is repeated over multiple generations until all conditions are satisfied by the latest generation.

2.2.1. Evolutionary Deep Radiomic Sequencer Discovery

Motivated to leverage evolutionary deep intelligence within the discovery radiomics framework, we introduce an evolutionary deep radiomic sequencer discovery process for discovering deep radiomic sequencers. As seen in Fig. 3, the evolutionary deep radiomic sequencer discovery framework discovers a more optimal deep radiomic sequencer generation by generation and, as a result, the generated radiomic sequence at each generation is more concise compared to radiomic sequences generated by previous deep radiomic sequencers in past generations.

Fig. 3 — Evolutionary deep radiomic sequencer discovery to synthesize optimized radiomic sequencer from an archive of medical images (in this study, lung nodule CT images). At each generation, the past radiomic sequencer and archive of medical images are used by the evolutionary deep radiomic sequencer discovery process to synthesize a more efficient radiomic sequencer. As shown, the size of parameters of radiomic sequencer is decreased over generations, resulting to a more concise radiomic sequence to describe the input radiology image.

The methodology behind the proposed framework can be described as follows. Inspired by Refs. 20 and 23, let the deep radiomic sequencer be modeled as $H (N, S)$ , denoting a network architecture with the set of neurons $N$ and set of synaptic connectivities $S$ . In this study, we will utilize a deep CNN architecture for the deep radiomic sequencer (see Fig. 4). The structural information of a deep radiomic sequencer at generation $g$ can be encoded by $S_{g}$ . $W_{g - 1}$ is the set of weights that encode the strength associated with each synapse in the network at generation $g - 1$ , where a synaptic weight of zero indicates that the associated synapse is not connected. It should be noted that $W_{g - 1}$ can, therefore, encode the structural information $S_{g - 1}$ of a network at generation $g - 1$ . As a result, it is possible to reformulate $P (H_{g} | H_{g - 1})$ as $P (S_{g} | W_{g - 1})$ without any loss of modeling accuracy. Thus, the probabilistic DNA of a deep radiomic sequencer at generation $g$ is formulated as $P (S_{g} | W_{g - 1})$ , such that at each generation $g$ the structure of the sequencer $S_{g}$ is synthesized given the trained weights of the sequencer of the previous generation $W_{g - 1}$ .

Fig. 4 — Deep radiomic sequencer based on deep CNN architecture. The input to the deep CNN is a suspicious region of a CT image. The architecture of the original ancestor network (i.e., generation 1 in Fig. 3) is a Lenet5 network architecture where it has 32 at $5 \times 5$ filters in the first and second layers, 64 at $5 \times 5$ filters in the third layers, and 64 at $4 \times 4$ in the last layer. The output of last layer generates the radiomic sequence with 1024 feature length.

The genetic encoding scheme (i.e., probabilistic DNA) can be formulated in different ways to favor special requirements needed to be applied when the new offspring deep radiomic sequencers are synthesized. For promoting computational efficiency and compactness, $P (S_{g} | W_{g - 1})$ is modeled such that it promotes the formation of a particular cluster of synapses while considering the synthesis of each individual synapse in the offspring deep radiomic sequencer as well²⁰

P (S_{g} | W_{g - 1}) = \prod_{c \in C} [P (S_{g}^{c} | W_{g - 1}) \cdot \prod_{i \in c} P (s_{g}^{i} | w_{g - 1}^{i})],

(1)

where $P (S_{g}^{c} | W_{g - 1})$ promotes the synthesis of a particular cluster of synapses $c$ , $S_{g}^{c} \subset S_{g}$ , given the weights of the network at generation $g - 1$ and $P (s_{g}^{i} | w_{g - 1}^{i})$ is the probability that synapse $s_{g}^{i} \in S^{c}$ will be synthesized in the offspring deep radiomic sequencer at generation $g$ .

A cluster of synapses can be defined and represented based on different factors, such as faster run time of the offspring radiomic sequencer on a parallel computing device or decreased storage requirements relative to its ancestor sequencer. However, the main advantage of Eq. (1) is that the $P (S_{g}^{c} | W_{g - 1})$ not only favors strong synapses, which are more effective in maintaining a high modeling accuracy, but it promotes the persistence of clusters of synapses in the offspring deep radiomic sequencer, which can extract more discriminative features, resulting in a sequencer that can model the problem more accurately. Here, we define a set of synapses constructing a filter in each convolutional layer as a cluster of synapses in the network structure of a deep radiomic sequencer. As shown in Fig. 4, each filter in a convolutional layer is responsible for producing one output channel of the layer. By extending this definition to all convolutional layers in the radiomic sequencer, the length of the radiomic sequence varies over the generations as the number of filters in the last layer determines the actual length of the radiomic sequence.

The probabilistic DNA $P (S_{g} | W_{g - 1})$ is combined with the environmental factor model $F (E)$ to mimic natural selection, such that the offspring deep radiomic sequencer for the next generation is comprised of stochastically selected synapses or clusters of synapses. The environmental factor simulates the conditions that the offspring networks should be adapted for. For example, if in the new environment the offspring network should be faster in computation, the environmental factors enforce the offspring network to be synthesized with fewer numbers of filters for decreasing the processing time. The environmental factors can also reflect what is the situation in terms of memory availability and the host hardware in which the computation will be done; as a result the synthesized offspring network architecture adapts itself to these environmental factors to be able to survive. The probabilistic model of the network structure $P (H_{g})$ at generation $g$ can be formulated as

P (H_{g}) = F (E) \cdot P (S_{g} | W_{g - 1}),

(2)

where $F (E)$ quantitatively encodes the environmental conditions, and the offspring deep radiomic sequencer structures must adapt to them to survive over generations. As mentioned before, the goal here is to synthesize a deep radiomic sequencer with fewer parameters while preserving the modeling accuracy; therefore, the environmental factor model $F (E)$ favors the formation of a deep radiomic sequencer with fewer parameters and increased efficiency over the generations. This property is applied via a cluster-based encoding scheme, which decreases the number of filters of different layers over generations

P (H_{g}) = \prod_{c \in C} [F_{c} (E) \cdot P (S_{g}^{c} | W_{g - 1})] .

(3)

More specifically, the environmental factor $F_{c} (E)$ is formulated such that the offspring radiomic sequencer is limited to 80% of the total number of synapses in its direct ancestor sequencer.

3. Results

3.1. Experimental Setup

The proposed evolutionary deep radiomic sequencer discovery framework was examined using the pathology-proven subset of the LIDC-IDRI²¹^,²² dataset and was compared to state-of-the-art methods. In this section, the configuration of the dataset, the underlying network architecture of the DRSs, and the competing methods are explained.

3.1.1. Lung Dataset

In this study, we used the subset of the LIDC-IDRI²¹^,²² dataset that had corresponding pathology-proven diagnostic data. The dataset is a public dataset provided by Cancer Imaging Archive²¹^,²² consisting of diagnostic and lung cancer screening thoracic CT scans with marked-up annotated lesions. The CT images were captured using a broad range of scanner models from different manufacturers by applying the following tube peak potential energies for acquiring the scans: 120 kV ( $n = 818$ ), 130 kV ( $n = 31$ ), 135 kV ( $n = 69$ ), and 140 kV ( $n = 100$ ). A subset of 93 patient cases, which have definite diagnostic results, was selected from the LIDC-IDRI. While pathology data were used to generate labels for the CT images in the LIDC-IDRI dataset, the pathology data were not available for comparison and labels were provided on a nodule basis. Using data augmentation, an enriched dataset of 42,340 lung lesions was obtained via the rotation of each malignant and benign lesion by 45-deg and 10-deg increments, respectively. The proposed method is examined by a 10-fold cross-validation approach where 9 out of 10 folds of patient cases (subset of patient cases) are used in the training while the other fold (subset of patient cases) is utilized as test samples and the results are reported based on the average performance of 10 trials.

3.1.2. Network Architecture (Lenet5)

The deep neural network architecture of the original, first generation radiomic sequencer used in this study builds upon the Lenet5 architecture.²⁴ The radiomic sequencer is comprised of three convolutional layers: $c_{1}$ : $3 \times 3$ , $c_{2}$ : $5 \times 5$ , and $c_{3}$ : $3 \times 3$ , where the first layer consists of 32 filters, the second layer has 32 filters, and the last layer has 64 filters. The radiomic sequence generated by the original, first generation radiomic sequencer has a length of $16 \times 64$ and is the input into two fully connected layers ( $f_{1}$ : 64 and $f_{2}$ : 2) to classify each input as cancerous or benign.

3.1.3. Competing Frameworks

The proposed evolutionary deep radiomic sequencer discovery was evaluated using the enriched dataset and quantitatively compared to four state-of-the-art radiomics-driven approaches.¹³^,¹⁴^,¹⁶^,¹⁷

Kumar et al.’s DARS¹³ uses a five layer denoising autoencoder trained by L-BFGS with 30 iterations and a batch size of 400, as suggested by past work;²⁵ a 200 dimension feature vector is extracted from the fourth layer and paired with a binary decision tree classifier. Shen et al.’s proposed CNN-MIL¹⁴ is composed of three concatenated convolutional layers, each with 64 convolutional kernels of size $3 \times 3$ . Each convolutional layer is followed by a rectified linear unit and a max-pooling layer ( $4 \times 4$ pooling window in the first layer and $2 \times 2$ in the subsequent layers), and two fully connected layers are used to determine nodule malignancy. Shafiee et al.’s StochasticNet radiomic sequencer (SNRS)¹⁶ is constructed using three stochastically formed convolutional layers of 32, 32, and 64 receptive fields, respectively. Each receptive field is $5 \times 5$ in size and is part of a random graph realization with a uniform neural connectivity probability of 0.5. Similarly, Kumar et al.’s DRS¹⁷ is comprised of three convolutional sequencing layers of 20, 50, and 500 receptive fields, respectively, each of size $3 \times 3$ .

3.2. Experimental Results

The proposed evolutionary deep radiomic sequencer discovery process was performed through 11 generations where in each generation, the environmental factor restricts the offspring radiomic sequencer to 80% of the total number of synapses in its direct previous network. Using this environmental factor, the number of parameters in the deep neural network of radiomic sequencer is decreased generation by generation, allowing for the generated radiomic sequences to be more compact over generations. Decreasing the number of parameters in the sequencer is important as it affects the generalizability of the sequencer such that a more generalized sequencer is less likely to be over-trained to the training data and can perform more accurately in the evaluation step.

The performance of the proposed framework is examined in a 10-fold cross validation approach where 9 out of 10 subsets of the data are used in the training step while the 10th subset is used to evaluate the model. This training and testing process is repeated over all permutations of the training and testing subsets. The cross-validation approach is combined with evolutionary deep intelligence, where in each validation step, the radiomic sequencers are synthesized generation by generation with the same training dataset and validated with the same testing data.

Table 1 shows the average performance of the proposed framework over 11 generations. As seen, by moving generation by generation, the number of filters used in the radiomic sequencer is decreased and the length of the radiomic sequence is correspondingly shortened. However, the performance of the radiomic sequencers improves over generations, which demonstrates the increase in the generalizability of the models through generations. As seen, the performance of evolved radiomic sequencers (i.e., sensitivity, specificity, and accuracy) increases after the first generation and it reaches a stable point (e.g., generation 7). However, evolving the radiomic sequencer after the seventh generation can improve the compactness of the radiomic sequences.

Table 1.

Radiomic sequence lengths and the modeling accuracies over generations. “ANF" stands for average number of filters in the sequencer and “RSL” column represents the average length of the radiomic sequence at each generation. Since the numbers are averaged over 10 folds of evaluation, they are reported with one floating point precision. As seen while the radiomic sequences become more compact over generations, the modeling accuracy, sensitivity, and specificity are increasing.

	ANF	RSL	Sensitivity	Specificity	Accuracy
Gen. 1	194.0	3104.0	0.8786	0.7570	0.8255
Gen. 2	180.4	2886.4	0.9156	0.7788	0.8590
Gen. 3	171.0	2736.0	0.9305	0.8063	0.8795
Gen. 4	161.1	2577.6	0.9276	0.8062	0.8812
Gen. 5	150.9	2414.4	0.9311	0.8109	0.8845
Gen. 6	142.6	2281.6	0.9295	0.8125	0.8834
Gen. 7	135.1	2161.6	0.9390	0.8105	0.8898
Gen. 8	125.8	2012.8	0.9341	0.8129	0.8879
Gen. 9	118.5	1896.0	0.9384	0.8169	0.8917
Gen. 10	111.2	1779.2	0.9385	0.8107	0.8901
Gen. 11	104.5	1672.0	0.9342	0.8239	0.8878

Open in a new tab

Note: The boldface values show the best accuracy.

Table 1 demonstrates that the specificity of the radiomic sequencers increases when the sequencers are evolved over generations, which is a good indication of generalizability of the final model. It is worth noting that in lung cancer classification, improving the specificity is challenging²⁶ and increasing the specificity while maintaining a reasonable sensitivity is highly desirable.

Table 1 also shows that the evolved radiomic sequencers can perform better in terms of sensitivity compared to the first generation original ancestor radiomic sequencer, resulting in a model with higher accuracy. As mentioned before, one of the important obstacles in using a deep neural network as the underlying architecture for a radiomic sequencer is the efficiency of the underlying deep neural network. As seen, the average number of filters constructing the radiomic sequencer is decreased over generations, indicating that the efficiency of the radiomic sequencer is increasing generation by generation. It is also worth noting that the number of filters of a deep neural network determines the number of parameters needed to be computed in one forward pass of the network to compute the final prediction; therefore, decreasing this number can increase the efficiency of the radiomic sequencer.

To evaluate the efficiency of the synthesized radiomic sequencers, the running time computation of the sequencers is examined at each generation with 1500 sample inputs. Figure 5 shows the running time performance of synthesized radiomic sequencers through generations. As seen, the subsequent generations perform faster than their ancestors, which shows the efficiency of the proposed evolutionary deep intelligence framework.

Fig. 5 — Running time evaluation of synthesized radiomic sequencer at each generation is evaluated by 1500 sample inputs. As seen the subsequent radiomic sequencers perform faster than their ancestors, which shows the efficiency of the synthesized sequencer discovered via the proposed evolutionary deep radiomic sequencer discovery process.

Decreasing the number of filters in the model decreases the length of the radiomic sequence. As shown in Table 1, the length of the radiomic sequence is shortened generation by generation and the length of the radiomic sequence in the last generation is about half the size of the radiomic sequence of the first generation, demonstrating that it is possible to increase the concision of the radiomic sequence while simultaneously increasing the modeling accuracy.

Figure 6 shows the sensitivity of the evolved radiomic sequencers overlaid with the standard deviation across different folds of cross validation over multiple generations. By evolving the radiomic sequencers generation by generation, the sensitivity increases while the standard deviation decreases (notice that the purple margin narrows over generations). This is another indication of generalizability of the evolved radiomic sequencers as the variance of the models in different cross validation folds of evaluation decreases over generations. This effect is more obvious in Fig. 7 as the standard deviation of the specificity measure is decreased generation by generation and as mentioned before, a more reliable specificity is highly desirable in lung cancer classification. Figure 8 shows the same behavior of the modeling accuracy over generations.

Fig. 6 — Sensitivity of the evolved radiomic sequencers over generations. The standard deviation of the models based on 10-fold cross validation is overlaid with purple margin.

Fig. 7 — Specificity of the radiomic sequencer overlaid by their modeling standard deviation over generations. As seen, the generalizability of radiomic sequencer increases generation by generation as the standard deviation of modeling decreases.

Fig. 8 — Radiomic sequencers modeling accuracy over generations.

As the last experimental result, Table 2 shows the comparison of the proposed framework (EDRS) with other state-of-the-art approaches. It should be noted that the statistics and modeling performances of other state-of-the-art frameworks are reported directly by Kumar et al.¹⁷ and Shafiee et al.¹⁸ As seen, the proposed radiomic sequencer in the discovery radiomics framework outperforms other state-of-the-art methods in sensitivity (93.42%), specificity (82.39%), and accuracy (88.78%). To demonstrate the effect of the evolutionary deep intelligence framework on discovery radiomic sequencer, the final network architecture synthesized by the evolutionary deep intelligence framework is trained from scratch. The performance of this network (so-called last generation) is compared with the result of the evolutionarily deep intelligence approach. As seen in Table 1, although the optimized network architecture synthesized by the evolutionary framework is utilized to train the last-generation approach, the sequencer could not compete with the EDRS performance and could not gain the same accuracy level.

Table 2.

Comparison with state-other-the-art methods for lung cancer classification. As seen, the proposed EDRS framework outperforms other methods in sensitivity, specificity, and accuracy.

	Sensitivity	Specificity	Accuracy
DARS¹³	0.8314	0.2018	0.7501
CNN-MIL¹⁴	—	—	0.7069
SNRS¹⁶	0.9107	0.7598	0.8449
DRS¹⁷	0.7906	0.7611	0.7752
EDRS	0.9342	0.8239	0.8878
Last generation	0.8893	0.7823	0.8355

Open in a new tab

Note: The boldface values show the best accuracy.

4. Conclusion

In this paper, we proposed an evolutionary deep radiomic sequencer discovery framework to better uncover more efficient yet powerful radiomic sequencers for the purpose of lung cancer classification. An evolutionary deep intelligence approach is incorporated within the discovery radiomics framework to evolve the underlying deep neural network architecture of the deep radiomic sequencer over multiple generations and discover a more efficient and generalized deep radiomic sequencer. The ultimate goal here is to synthesize a deep neural network as the underlying core of the radiomic sequencer with fewer numbers of parameters, which produces more concise radiomic sequences that can better capture the differences between healthy and cancerous lung tissue. Results show that by evolving and discovering more efficient radiomic sequencers, the diagnostic accuracy can be increased. Experimental results demonstrate that the EDRS discovered using the proposed evolutionary deep radiomic sequencer discovery approach can outperform other state-of-the-art radiomics-driven methods, achieving a sensitivity of 93.42%, a specificity of 82.39%, and an accuracy of 88.78%. It has been showed in the literature that there is a direct relation between the number of parameters and the need for training data. As a future work, it is suggested to study the effect of an evolutionary deep intelligence framework when limited training data are available.

Acknowledgments

This research has been supported by the Ontario Institute of Cancer Research (OICR), Canada Research Chairs Programs, Natural Sciences and Engineering Research Council of Canada (NSERC), and the Ministry of Research and Innovation of Ontario. The authors also thank Nvidia for the GPU hardware used in this study through the Nvidia Hardware Grant Program.

Biographies

Mohammad Javad Shafiee received his BSc and MSc degrees in computer science and artificial intelligence from Shiraz University, Shiraz, Iran, in 2008 and 2011, respectively; and his PhD in systems design engineering from the University of Waterloo, Waterloo, Canada in 2017. He is currently a research assistant professor in the Department of Systems Design Engineering at the University of Waterloo. His main research focus is on statistical learning and graphical models with random fields and deep learning approaches.

Audrey G. Chung received her BASc and MASc degrees in systems design engineering from the University of Waterloo, Waterloo, Canada, in 2014 and 2016, respectively. She is currently pursuing her PhD in the Department of Systems Design Engineering with her primary focus on artificial intelligence and deep learning. Her research interests include image processing and computer vision, with a specific emphasis on biomedical imaging.

Farzad Khalvati received his PhD in electrical and computer engineering from the University of Waterloo, Waterloo, Canada, in 2009. He is currently an assistant professor in the Department of Medical Imaging, University of Toronto, Toronto, Canada, and a junior scientist at Sunnybrook Research Institute, Toronto, Canada. His research interests include translational research in medical imaging, quantitative biomedical imaging (radiomics) for cancer diagnosis and prognosis, and intelligent medical image computing.

Masoom A. Haider received his MD degree from the University of Ottawa, Ottawa, Canada, in 1986. In 1994, he completed his residency in radiology from the University of Toronto, Toronto, Canada, where he is currently a professor of radiology in the Department of Medical Imaging. He is also the chief of the Medical Imaging Department at Sunnybrook Health Sciences Centre, Toronto, Canada. His research interests include abdominal and pelvic MRI with special emphasis in prostate cancer, therapeutic response assessment, and functional imaging of Cancer.

Alexander Wong is currently the Canada research chair of medical imaging systems, the codirector of the Vision and Image Processing Research Group, and an associate professor in the Department of Systems Design Engineering at University of Waterloo. His current research interests revolve around computational imaging and artificial intelligence, with a focus on integrative computational imaging systems for biomedical imaging and operational artificial intelligence. He has authored more than 400 refereed journals and conference papers, and patents.

Disclosures

The authors have no relevant financial interests in the manuscript and no other potential conflicts of interest to disclose.

References

1.American Cancer Society, “Cancer facts & figures 2016,” Atlanta: (2016). [Google Scholar]
2.Canadian Cancer Society’s Advisory Committee on Cancer Statistics, Canadian Cancer Statistics, Canadian Cancer Society, Toronto, Ontario: (2016). [Google Scholar]
3.Lambin P., et al. , “Radiomics: extracting more information from medical images using advanced feature analysis,” Eur. J. Cancer 48(4), 441–446 (2012). 10.1016/j.ejca.2011.11.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Aerts H. J. W. L., “Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach,” Nat. Commun. 5, 4006 (2014). 10.1038/ncomms5006 [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Gevaert O., et al. , “Non-small cell lung cancer: identifying prognostic imaging biomarkers by leveraging public gene expression microarray data,” Radiology 264(2), 387–396 (2012). 10.1148/radiol.12111607 [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Maforo N., et al. , “Radiomics of multi-parametric breast MRI in breast cancer diagnosis: a quantitative investigation of diffusion weighted imaging, dynamic contrast-enhanced, and t2-weighted magnetic resonance imaging,” Med. Phys. 42(6), 3213 (2015). 10.1118/1.4923882 [DOI] [Google Scholar]
7.Khalvati F., Wong A., Haider M. A., “Automated prostate cancer detection via comprehensive multi-parametric magnetic resonance imaging texture feature models,” BMC Med. Imaging 15(1), 27 (2015). 10.1186/s12880-015-0069-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Cameron A., et al. , “Multiparametric MRI prostate cancer analysis via a hybrid morphological-textural model,” in Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual Int. Conf. of the IEEE (2014). [DOI] [PubMed] [Google Scholar]
9.Anirudh R., et al. , “Lung nodule detection using 3D convolutional neural networks trained on weakly labeled data,” Proc. SPIE 9785, 978532 (2016). 10.1117/12.2214876 [DOI] [Google Scholar]
10.Orozco H. M., et al. , “Automated system for lung nodules classification based on wavelet feature descriptor and support vector machine,” Biomed. Eng. Online 14(1), 9 (2015). 10.1186/s12938-015-0003-y [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Shen W., et al. , “Multi-scale convolutional neural networks for lung nodule classification,” in Int. Conf. on Information Processing in Medical Imaging, pp. 588–599 (2015). 10.1007/978-3-319-19992-4_46 [DOI] [PubMed] [Google Scholar]
12.Shen W., et al. , “Multi-crop convolutional neural networks for lung nodule malignancy suspiciousness classification,” Pattern Recognit. 61, 663–673 (2017). 10.1016/j.patcog.2016.05.029 [DOI] [Google Scholar]
13.Kumar D., Wong A., Clausi D. A., “Lung nodule classification using deep features in CT images,” in 12th Conf. on Computer and Robot Vision (CRV), pp. 133–138 (2015). 10.1109/CRV.2015.25 [DOI] [Google Scholar]
14.Shen W., et al. , “Learning from experts: developing transferable deep features for patient-level lung cancer prediction,” in Int. Conf. on Medical Image Computing and Computer-Assisted Intervention, pp. 124–131, Springer; (2016). 10.1007/978-3-319-46723-8_15 [DOI] [Google Scholar]
15.Karimi A.-H., et al. , “Discovery radiomics via a mixture of deep convnet sequencers for multi-parametric MRI prostate cancer classification,” in Int. Conf. Image Analysis and Recognition, Springer; (2017). 10.1007/978-3-319-59876-5_6 [DOI] [Google Scholar]
16.Shafiee M. J., et al. , “Discovery radiomics via stochasticnet sequencers for cancer detection,” in NIPS Workshop on Machine Learning in Healthcare (2015). [Google Scholar]
17.Kumar D., et al. , “Discovery radiomics for pathologically-proven computed tomography lung cancer prediction,” in Int. Conf. Image Analysis and Recognition, Springer; (2017). 10.1007/978-3-319-59876-5_7 [DOI] [Google Scholar]
18.Shafiee M. J., Siva P., Wong A., “Stochasticnet: forming deep neural networks via stochastic connectivity,” IEEE Access 4, 1915–1924 (2016). 10.1109/ACCESS.2016.2551458 [DOI] [Google Scholar]
19.Shafiee M. J., Mishra A., Wong A., “Deep learning with Darwin: evolutionary synthesis of deep neural networks,” arXiv preprint arXiv:1606.04393 (2016).
20.Shafiee M. J., Barshan E., Wong A., “Evolution in groups: a deeper look at synaptic cluster driven evolution of deep neural networks,” in Future Technologies Conf., IEEE; (2017). [Google Scholar]
21.Armato S. G., III, et al. , “The lung image database consortium (LIDC) and image database resource initiative (IDRI): a completed reference database of lung nodules on CT scans,” Med. Phys. 38(2), 915–931 (2011). 10.1118/1.3528204 [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Armato S. G., III, et al. , “Lung image database consortium: developing a resource for the medical imaging research community,” Radiology 232(3), 739–748 (2004). 10.1148/radiol.2323032035 [DOI] [PubMed] [Google Scholar]
23.Shafiee M. J., Wong A., “Evolutionary synthesis of deep neural networks via synaptic cluster-driven genetic encoding,” in Advanced Neural Information Processing Workshop (NIPS) (2016). [Google Scholar]
24.LeCun Y., et al. , “Gradient-based learning applied to document recognition,” Proc. IEEE 86(11), 2278–2324 (1998). 10.1109/5.726791 [DOI] [Google Scholar]
25.Ngiam J., et al. , “On optimization methods for deep learning,” in Proc. of the 28th Int. Conf. on Machine Learning (ICML-11), pp. 265–272 (2011). [Google Scholar]
26.Toyoda Y., et al. , “Sensitivity and specificity of lung cancer screening using chest low-dose computed tomography,” Br. J. Cancer 98(10), 1602–1607 (2008). 10.1038/sj.bjc.6604351 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r1] 1.American Cancer Society, “Cancer facts & figures 2016,” Atlanta: (2016). [Google Scholar]

[r2] 2.Canadian Cancer Society’s Advisory Committee on Cancer Statistics, Canadian Cancer Statistics, Canadian Cancer Society, Toronto, Ontario: (2016). [Google Scholar]

[r3] 3.Lambin P., et al. , “Radiomics: extracting more information from medical images using advanced feature analysis,” Eur. J. Cancer 48(4), 441–446 (2012). 10.1016/j.ejca.2011.11.036 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r4] 4.Aerts H. J. W. L., “Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach,” Nat. Commun. 5, 4006 (2014). 10.1038/ncomms5006 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r5] 5.Gevaert O., et al. , “Non-small cell lung cancer: identifying prognostic imaging biomarkers by leveraging public gene expression microarray data,” Radiology 264(2), 387–396 (2012). 10.1148/radiol.12111607 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r6] 6.Maforo N., et al. , “Radiomics of multi-parametric breast MRI in breast cancer diagnosis: a quantitative investigation of diffusion weighted imaging, dynamic contrast-enhanced, and t2-weighted magnetic resonance imaging,” Med. Phys. 42(6), 3213 (2015). 10.1118/1.4923882 [DOI] [Google Scholar]

[r7] 7.Khalvati F., Wong A., Haider M. A., “Automated prostate cancer detection via comprehensive multi-parametric magnetic resonance imaging texture feature models,” BMC Med. Imaging 15(1), 27 (2015). 10.1186/s12880-015-0069-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r8] 8.Cameron A., et al. , “Multiparametric MRI prostate cancer analysis via a hybrid morphological-textural model,” in Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual Int. Conf. of the IEEE (2014). [DOI] [PubMed] [Google Scholar]

[r9] 9.Anirudh R., et al. , “Lung nodule detection using 3D convolutional neural networks trained on weakly labeled data,” Proc. SPIE 9785, 978532 (2016). 10.1117/12.2214876 [DOI] [Google Scholar]

[r10] 10.Orozco H. M., et al. , “Automated system for lung nodules classification based on wavelet feature descriptor and support vector machine,” Biomed. Eng. Online 14(1), 9 (2015). 10.1186/s12938-015-0003-y [DOI] [PMC free article] [PubMed] [Google Scholar]

[r11] 11.Shen W., et al. , “Multi-scale convolutional neural networks for lung nodule classification,” in Int. Conf. on Information Processing in Medical Imaging, pp. 588–599 (2015). 10.1007/978-3-319-19992-4_46 [DOI] [PubMed] [Google Scholar]

[r12] 12.Shen W., et al. , “Multi-crop convolutional neural networks for lung nodule malignancy suspiciousness classification,” Pattern Recognit. 61, 663–673 (2017). 10.1016/j.patcog.2016.05.029 [DOI] [Google Scholar]

[r13] 13.Kumar D., Wong A., Clausi D. A., “Lung nodule classification using deep features in CT images,” in 12th Conf. on Computer and Robot Vision (CRV), pp. 133–138 (2015). 10.1109/CRV.2015.25 [DOI] [Google Scholar]

[r14] 14.Shen W., et al. , “Learning from experts: developing transferable deep features for patient-level lung cancer prediction,” in Int. Conf. on Medical Image Computing and Computer-Assisted Intervention, pp. 124–131, Springer; (2016). 10.1007/978-3-319-46723-8_15 [DOI] [Google Scholar]

[r15] 15.Karimi A.-H., et al. , “Discovery radiomics via a mixture of deep convnet sequencers for multi-parametric MRI prostate cancer classification,” in Int. Conf. Image Analysis and Recognition, Springer; (2017). 10.1007/978-3-319-59876-5_6 [DOI] [Google Scholar]

[r16] 16.Shafiee M. J., et al. , “Discovery radiomics via stochasticnet sequencers for cancer detection,” in NIPS Workshop on Machine Learning in Healthcare (2015). [Google Scholar]

[r17] 17.Kumar D., et al. , “Discovery radiomics for pathologically-proven computed tomography lung cancer prediction,” in Int. Conf. Image Analysis and Recognition, Springer; (2017). 10.1007/978-3-319-59876-5_7 [DOI] [Google Scholar]

[r18] 18.Shafiee M. J., Siva P., Wong A., “Stochasticnet: forming deep neural networks via stochastic connectivity,” IEEE Access 4, 1915–1924 (2016). 10.1109/ACCESS.2016.2551458 [DOI] [Google Scholar]

[r19] 19.Shafiee M. J., Mishra A., Wong A., “Deep learning with Darwin: evolutionary synthesis of deep neural networks,” arXiv preprint arXiv:1606.04393 (2016).

[r20] 20.Shafiee M. J., Barshan E., Wong A., “Evolution in groups: a deeper look at synaptic cluster driven evolution of deep neural networks,” in Future Technologies Conf., IEEE; (2017). [Google Scholar]

[r21] 21.Armato S. G., III, et al. , “The lung image database consortium (LIDC) and image database resource initiative (IDRI): a completed reference database of lung nodules on CT scans,” Med. Phys. 38(2), 915–931 (2011). 10.1118/1.3528204 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r22] 22.Armato S. G., III, et al. , “Lung image database consortium: developing a resource for the medical imaging research community,” Radiology 232(3), 739–748 (2004). 10.1148/radiol.2323032035 [DOI] [PubMed] [Google Scholar]

[r23] 23.Shafiee M. J., Wong A., “Evolutionary synthesis of deep neural networks via synaptic cluster-driven genetic encoding,” in Advanced Neural Information Processing Workshop (NIPS) (2016). [Google Scholar]

[r24] 24.LeCun Y., et al. , “Gradient-based learning applied to document recognition,” Proc. IEEE 86(11), 2278–2324 (1998). 10.1109/5.726791 [DOI] [Google Scholar]

[r25] 25.Ngiam J., et al. , “On optimization methods for deep learning,” in Proc. of the 28th Int. Conf. on Machine Learning (ICML-11), pp. 265–272 (2011). [Google Scholar]

[r26] 26.Toyoda Y., et al. , “Sensitivity and specificity of lung cancer screening using chest low-dose computed tomography,” Br. J. Cancer 98(10), 1602–1607 (2008). 10.1038/sj.bjc.6604351 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Discovery radiomics via evolutionary deep radiomic sequencer discovery for pathologically proven lung cancer detection

Mohammad Javad Shafiee

Audrey G Chung

Farzad Khalvati

Masoom A Haider

Alexander Wong

Abstract.

1. Introduction