Abstract.
While lung cancer is the second most diagnosed form of cancer in men and women, a sufficiently early diagnosis can be pivotal in patient survival rates. Imaging-based, or radiomics-driven, detection methods have been developed to aid diagnosticians, but largely rely on hand-crafted features that may not fully encapsulate the differences between cancerous and healthy tissue. Recently, the concept of discovery radiomics was introduced, where custom abstract features are discovered from readily available imaging data. We propose an evolutionary deep radiomic sequencer discovery approach based on evolutionary deep intelligence. Motivated by patient privacy concerns and the idea of operational artificial intelligence, the evolutionary deep radiomic sequencer discovery approach organically evolves increasingly more efficient deep radiomic sequencers that produce significantly more compact yet similarly descriptive radiomic sequences over multiple generations. As a result, this framework improves operational efficiency and enables diagnosis to be run locally at the radiologist’s computer while maintaining detection accuracy. We evaluated the evolved deep radiomic sequencer (EDRS) discovered via the proposed evolutionary deep radiomic sequencer discovery framework against state-of-the-art radiomics-driven and discovery radiomics methods using clinical lung CT data with pathologically proven diagnostic data from the LIDC-IDRI dataset. The EDRS shows improved sensitivity (93.42%), specificity (82.39%), and diagnostic accuracy (88.78%) relative to previous radiomics approaches.
Keywords: discovery radiomics, radiomic sequencing, lung cancer, evolutionary deep intelligence, evolved deep radiomic sequencer
1. Introduction
Lung cancer is the second most diagnosed form of cancer in men and women after prostate cancer and breast cancer, respectively. In 2016, lung cancer accounted for an estimated 158,080 deaths ( of cancer deaths) and 224,390 new cases in Americans.1 Similarly, lung cancer accounted for an estimated 20,800 deaths ( of cancer deaths) and 28,400 new cases in Canadians.2 Early detection of lung cancer can significantly impact the patient survival rate, making efficient and reliable lung cancer screening methods crucial.
Imaging-based cancer detection or radiomics-driven methods have recently grown in popularity to help streamline the cancer screening process and increase diagnostic consistency. Referring to the extraction and analysis of large amounts of quantitative features from medical imaging data, radiomics3 allows for the creation of a high-dimensional abstract feature space that can be utilized for cancer detection via the detailed characterization of cancer phenotypes. The prognostic potential of radiomics has previously been demonstrated in studies on lung and head-and-neck cancer patients.4,5 Aerts et al.4 introduced a comprehensive study spanning over 1000 patients across seven datasets to demonstrate the application of radiomics toward differentiating between tumor phenotypes, indicating clinical and prognostic implications. In addition, radiomics has shown promise in combination with multiparametric magnetic resonance imaging for breast cancer detection6 and prostate cancer detection.7,8
Radiomics-driven methods have previously been developed for malignant lung nodule detection using computed tomography (CT) images.9–12 Anirudh et al.9 used weakly labeled lung data from the SPIE-LUNGx dataset to train a three-dimensional convolutional neural network (CNN) and generate radiomic sequences for lung nodule detection. In contrast, Orozco et al.10 generated wavelet-based radiomic sequences and demonstrated the effectiveness of wavelet-based features using a subset of images from the early lung cancer action project and lung image database consortium (LIDC) datasets.
Shen et al.11 proposed multiscale CNN, a hierarchical framework for extracting discriminative features from lung nodules. Specifically, the framework is comprised of alternating, stacked layers, and uses multiscale nodule patches to learn class-specific features. More recently, Shen et al. extended their previous work to malignancy suspiciousness classification.12 In addition, the extension simplified the training process via a multicrop pooling architecture. An important aspect of these aforementioned radiomics-driven methods is that they leverage radiologist-driven nodule annotations for predicting the malignancy of lung nodules, rather than using pathology-proven data.
There are relatively few radiomics-driven methods that perform lung cancer detection using pathology-proven diagnostic data.13,14 Kumar et al.13 introduced an unsupervised deep autoencoder for feature extraction with a binary decision tree classifier for lung nodule classification. Shen et al.14 proposed a domain-adaptation framework for lung nodule malignancy prediction; more specifically, Shen et al. proposed CNN-multiple instance learning (CNN-MIL) for learning transferable patient-level malignancy knowledge, which combines a CNN model with an MIL model.
Recently, the concept of discovery radiomics was introduced where notions of using predefined, hand-crafted features for cancer detection are bypassed in favor of radiomic sequencers that produce abstract imaging-based features that are discovered directly from the wealth of readily available medical imaging data. This allows for custom-tailored features to be discovered that can better characterize cancerous tissue and distinguish cancer phenotypes relative to conventional features. Discovery radiomics has shown promising results for both prostate cancer15 and lung cancer13,16,17 detection.
A number of different radiomic sequencers have been proposed within the discovery radiomics framework for the purpose of lung cancer detection. Kumar et al.13 introduced the notion of deep autoencoding radiomic sequencers (DARS), which are comprised of a deep autoencoder architecture. Shafiee et al.16 proposed deep radiomics sequencers based on a deep convolutional StochasticNet18 architecture, referred to as StochasticNet sequencers. More recently, Kumar et al.17 leveraged deep radiomic sequencers built upon a deep CNN architecture.
While diagnostically powerful, the discovered radiomic sequencers (DRSs) were both computationally expensive and memory intensive, which could make it difficult for on-site clinical deployment and would require the transfer of patient information to more powerful cloud computing leading to patient privacy concerns. To mitigate computational requirements and increase operating efficiency, we propose an evolutionary deep radiomic sequencer discovery framework for discovering more efficient yet powerful deep radiomic sequencers. Using the concept of evolutionary deep intelligence19,20 to mimic biological evolution mechanisms, the proposed evolutionary deep sequencer discovery process discovers progressively more efficient yet diagnostically powerful deep radiomic sequencers over multiple generations. The resulting evolved deep radiomic sequencers (EDRSs) are not only significantly more efficient, thus making them more suitable for on-site clinical deployment, but can provide improved diagnostic performance compared to existing deep radiomic sequencers.
2. Methods
In this section, we will first discuss the concepts behind discovery radiomics and evolutionary deep intelligence. We will then present the proposed evolutionary deep sequencer discovery approach in detail.
2.1. Discovery Radiomics
The idea behind discovery radiomics can be described as follows (see Fig. 1). Given past radiology data and corresponding pathology-verified radiologist tissue annotations from a medical imaging data archive (i.e., provided by Cancer Imaging Archive21,22 and consisting of diagnostic and lung cancer screening thoracic CT), the radiomic sequencer discovery process learns a radiomic sequencer that can extract highly customized radiomic features (which we will refer to as a radiomic sequence) that are tailored for characterizing unique tissue phenotypes that differentiate cancerous tissue from healthy tissue. The DRS can be applied to new patient data to extract the corresponding radiomic sequence for cancer screening and diagnosis purposes.
As discussed earlier, one of the key limitations of previously proposed deep radiomic sequencers13,17 for the purpose of lung cancer detection is that, while diagnostically powerful, they are both computationally expensive and memory intensive. They usually utilize very deep neural network architectures with a large number of parameters (i.e., which needs a large amount of memory to store) such that a huge set of arithmetic operations are required to generate the radiomic sequence and as a result it needs a fair amount of time to produce the results. This could make it difficult for on-site clinical deployment and would require the transfer of patient information to more powerful cloud computing leading to patient privacy concerns. To mitigate computational requirements and increase operating efficiency to enable on-site clinical deployment, we will leverage the concept of evolutionary deep intelligence19,20 to discover highly efficient deep radiomic sequencers that still provide strong diagnostic performance.
2.2. Evolutionary Deep Intelligence
Prior to describing the proposed evolutionary deep sequencer discovery approach, it is first important to discuss the idea behind evolutionary deep intelligence. First introduced by Shafiee et al.,19 the general idea is to synthesize progressively more efficient deep neural networks over multiple generations. The evolution of deep neural networks is modeled in a probabilistic manner, where the architectural traits of ancestor networks are encoded by a probabilistic DNA. The probabilistic DNA is utilized to mimic biological heredity, and new offspring networks are synthesized stochastically based on this probabilistic model. Each synaptic connectivity is modeled by a probability distribution based on the corresponding weight magnitude in the ancestor network such that the strength of the weight determines the probability of each synapse to be connected in the offspring network. To close the cycle of evolution, environmental factors are applied to the model to mimic random mutation and natural selection. The environmental factor is combined with the probabilistic DNA to enforce how the random mutation should be applied (i.e., what the rate of mutation of synaptic connectivity should be in the offspring network architecture). Loosely speaking, when the environmental factor forces the offspring network architectures to be smaller than their ancestor, this causes a decrease in the chance of each synaptic connectivity to be synthesized in the offspring network such that weaker synaptic connectivity in the ancestor network will have a lower chance of being connected in the offspring network. At each generation, the offspring network (which is more efficient than its parent) is then trained to refine its modeling capabilities and maximize its modeling accuracy.
Figure 2 shows the evolution process visually. As seen, the evolution is initialized using a known network structure as the first generation. The network is trained based on the available training data and the weights associated with each synaptic strength are computed. The underlying heredity of the network (i.e., as the parent network) is encoded by the probabilistic DNA, which is modeled based on the synaptic strengths. The environmental factors are then formulated into the model to account for the requirements needed to be satisfied by the offspring network. The offspring network is then synthesized by taking advantage of random mutation to diversify the offspring network from its ancestors. This process is repeated until all requirements are satisfied by the latest offspring network. Given its ability to produce progressively more efficient yet powerful deep neural networks, we are motivated to leverage the ideas behind evolutionary deep intelligence within the discovery radiomics framework to discover highly efficient yet diagnostically accurate deep radiomic sequencers for the purpose of lung cancer detection.
2.2.1. Evolutionary Deep Radiomic Sequencer Discovery
Motivated to leverage evolutionary deep intelligence within the discovery radiomics framework, we introduce an evolutionary deep radiomic sequencer discovery process for discovering deep radiomic sequencers. As seen in Fig. 3, the evolutionary deep radiomic sequencer discovery framework discovers a more optimal deep radiomic sequencer generation by generation and, as a result, the generated radiomic sequence at each generation is more concise compared to radiomic sequences generated by previous deep radiomic sequencers in past generations.
The methodology behind the proposed framework can be described as follows. Inspired by Refs. 20 and 23, let the deep radiomic sequencer be modeled as , denoting a network architecture with the set of neurons and set of synaptic connectivities . In this study, we will utilize a deep CNN architecture for the deep radiomic sequencer (see Fig. 4). The structural information of a deep radiomic sequencer at generation can be encoded by . is the set of weights that encode the strength associated with each synapse in the network at generation , where a synaptic weight of zero indicates that the associated synapse is not connected. It should be noted that can, therefore, encode the structural information of a network at generation . As a result, it is possible to reformulate as without any loss of modeling accuracy. Thus, the probabilistic DNA of a deep radiomic sequencer at generation is formulated as , such that at each generation the structure of the sequencer is synthesized given the trained weights of the sequencer of the previous generation .
The genetic encoding scheme (i.e., probabilistic DNA) can be formulated in different ways to favor special requirements needed to be applied when the new offspring deep radiomic sequencers are synthesized. For promoting computational efficiency and compactness, is modeled such that it promotes the formation of a particular cluster of synapses while considering the synthesis of each individual synapse in the offspring deep radiomic sequencer as well20
(1) |
where promotes the synthesis of a particular cluster of synapses , , given the weights of the network at generation and is the probability that synapse will be synthesized in the offspring deep radiomic sequencer at generation .
A cluster of synapses can be defined and represented based on different factors, such as faster run time of the offspring radiomic sequencer on a parallel computing device or decreased storage requirements relative to its ancestor sequencer. However, the main advantage of Eq. (1) is that the not only favors strong synapses, which are more effective in maintaining a high modeling accuracy, but it promotes the persistence of clusters of synapses in the offspring deep radiomic sequencer, which can extract more discriminative features, resulting in a sequencer that can model the problem more accurately. Here, we define a set of synapses constructing a filter in each convolutional layer as a cluster of synapses in the network structure of a deep radiomic sequencer. As shown in Fig. 4, each filter in a convolutional layer is responsible for producing one output channel of the layer. By extending this definition to all convolutional layers in the radiomic sequencer, the length of the radiomic sequence varies over the generations as the number of filters in the last layer determines the actual length of the radiomic sequence.
The probabilistic DNA is combined with the environmental factor model to mimic natural selection, such that the offspring deep radiomic sequencer for the next generation is comprised of stochastically selected synapses or clusters of synapses. The environmental factor simulates the conditions that the offspring networks should be adapted for. For example, if in the new environment the offspring network should be faster in computation, the environmental factors enforce the offspring network to be synthesized with fewer numbers of filters for decreasing the processing time. The environmental factors can also reflect what is the situation in terms of memory availability and the host hardware in which the computation will be done; as a result the synthesized offspring network architecture adapts itself to these environmental factors to be able to survive. The probabilistic model of the network structure at generation can be formulated as
(2) |
where quantitatively encodes the environmental conditions, and the offspring deep radiomic sequencer structures must adapt to them to survive over generations. As mentioned before, the goal here is to synthesize a deep radiomic sequencer with fewer parameters while preserving the modeling accuracy; therefore, the environmental factor model favors the formation of a deep radiomic sequencer with fewer parameters and increased efficiency over the generations. This property is applied via a cluster-based encoding scheme, which decreases the number of filters of different layers over generations
(3) |
More specifically, the environmental factor is formulated such that the offspring radiomic sequencer is limited to 80% of the total number of synapses in its direct ancestor sequencer.
3. Results
3.1. Experimental Setup
The proposed evolutionary deep radiomic sequencer discovery framework was examined using the pathology-proven subset of the LIDC-IDRI21,22 dataset and was compared to state-of-the-art methods. In this section, the configuration of the dataset, the underlying network architecture of the DRSs, and the competing methods are explained.
3.1.1. Lung Dataset
In this study, we used the subset of the LIDC-IDRI21,22 dataset that had corresponding pathology-proven diagnostic data. The dataset is a public dataset provided by Cancer Imaging Archive21,22 consisting of diagnostic and lung cancer screening thoracic CT scans with marked-up annotated lesions. The CT images were captured using a broad range of scanner models from different manufacturers by applying the following tube peak potential energies for acquiring the scans: 120 kV (), 130 kV (), 135 kV (), and 140 kV (). A subset of 93 patient cases, which have definite diagnostic results, was selected from the LIDC-IDRI. While pathology data were used to generate labels for the CT images in the LIDC-IDRI dataset, the pathology data were not available for comparison and labels were provided on a nodule basis. Using data augmentation, an enriched dataset of 42,340 lung lesions was obtained via the rotation of each malignant and benign lesion by 45-deg and 10-deg increments, respectively. The proposed method is examined by a 10-fold cross-validation approach where 9 out of 10 folds of patient cases (subset of patient cases) are used in the training while the other fold (subset of patient cases) is utilized as test samples and the results are reported based on the average performance of 10 trials.
3.1.2. Network Architecture (Lenet5)
The deep neural network architecture of the original, first generation radiomic sequencer used in this study builds upon the Lenet5 architecture.24 The radiomic sequencer is comprised of three convolutional layers: : , : , and : , where the first layer consists of 32 filters, the second layer has 32 filters, and the last layer has 64 filters. The radiomic sequence generated by the original, first generation radiomic sequencer has a length of and is the input into two fully connected layers (: 64 and : 2) to classify each input as cancerous or benign.
3.1.3. Competing Frameworks
The proposed evolutionary deep radiomic sequencer discovery was evaluated using the enriched dataset and quantitatively compared to four state-of-the-art radiomics-driven approaches.13,14,16,17
Kumar et al.’s DARS13 uses a five layer denoising autoencoder trained by L-BFGS with 30 iterations and a batch size of 400, as suggested by past work;25 a 200 dimension feature vector is extracted from the fourth layer and paired with a binary decision tree classifier. Shen et al.’s proposed CNN-MIL14 is composed of three concatenated convolutional layers, each with 64 convolutional kernels of size . Each convolutional layer is followed by a rectified linear unit and a max-pooling layer ( pooling window in the first layer and in the subsequent layers), and two fully connected layers are used to determine nodule malignancy. Shafiee et al.’s StochasticNet radiomic sequencer (SNRS)16 is constructed using three stochastically formed convolutional layers of 32, 32, and 64 receptive fields, respectively. Each receptive field is in size and is part of a random graph realization with a uniform neural connectivity probability of 0.5. Similarly, Kumar et al.’s DRS17 is comprised of three convolutional sequencing layers of 20, 50, and 500 receptive fields, respectively, each of size .
3.2. Experimental Results
The proposed evolutionary deep radiomic sequencer discovery process was performed through 11 generations where in each generation, the environmental factor restricts the offspring radiomic sequencer to 80% of the total number of synapses in its direct previous network. Using this environmental factor, the number of parameters in the deep neural network of radiomic sequencer is decreased generation by generation, allowing for the generated radiomic sequences to be more compact over generations. Decreasing the number of parameters in the sequencer is important as it affects the generalizability of the sequencer such that a more generalized sequencer is less likely to be over-trained to the training data and can perform more accurately in the evaluation step.
The performance of the proposed framework is examined in a 10-fold cross validation approach where 9 out of 10 subsets of the data are used in the training step while the 10th subset is used to evaluate the model. This training and testing process is repeated over all permutations of the training and testing subsets. The cross-validation approach is combined with evolutionary deep intelligence, where in each validation step, the radiomic sequencers are synthesized generation by generation with the same training dataset and validated with the same testing data.
Table 1 shows the average performance of the proposed framework over 11 generations. As seen, by moving generation by generation, the number of filters used in the radiomic sequencer is decreased and the length of the radiomic sequence is correspondingly shortened. However, the performance of the radiomic sequencers improves over generations, which demonstrates the increase in the generalizability of the models through generations. As seen, the performance of evolved radiomic sequencers (i.e., sensitivity, specificity, and accuracy) increases after the first generation and it reaches a stable point (e.g., generation 7). However, evolving the radiomic sequencer after the seventh generation can improve the compactness of the radiomic sequences.
Table 1.
ANF | RSL | Sensitivity | Specificity | Accuracy | |
---|---|---|---|---|---|
Gen. 1 | 194.0 | 3104.0 | 0.8786 | 0.7570 | 0.8255 |
Gen. 2 | 180.4 | 2886.4 | 0.9156 | 0.7788 | 0.8590 |
Gen. 3 | 171.0 | 2736.0 | 0.9305 | 0.8063 | 0.8795 |
Gen. 4 | 161.1 | 2577.6 | 0.9276 | 0.8062 | 0.8812 |
Gen. 5 | 150.9 | 2414.4 | 0.9311 | 0.8109 | 0.8845 |
Gen. 6 | 142.6 | 2281.6 | 0.9295 | 0.8125 | 0.8834 |
Gen. 7 | 135.1 | 2161.6 | 0.9390 | 0.8105 | 0.8898 |
Gen. 8 | 125.8 | 2012.8 | 0.9341 | 0.8129 | 0.8879 |
Gen. 9 | 118.5 | 1896.0 | 0.9384 | 0.8169 | 0.8917 |
Gen. 10 | 111.2 | 1779.2 | 0.9385 | 0.8107 | 0.8901 |
Gen. 11 | 104.5 | 1672.0 | 0.9342 | 0.8239 | 0.8878 |
Note: The boldface values show the best accuracy.
Table 1 demonstrates that the specificity of the radiomic sequencers increases when the sequencers are evolved over generations, which is a good indication of generalizability of the final model. It is worth noting that in lung cancer classification, improving the specificity is challenging26 and increasing the specificity while maintaining a reasonable sensitivity is highly desirable.
Table 1 also shows that the evolved radiomic sequencers can perform better in terms of sensitivity compared to the first generation original ancestor radiomic sequencer, resulting in a model with higher accuracy. As mentioned before, one of the important obstacles in using a deep neural network as the underlying architecture for a radiomic sequencer is the efficiency of the underlying deep neural network. As seen, the average number of filters constructing the radiomic sequencer is decreased over generations, indicating that the efficiency of the radiomic sequencer is increasing generation by generation. It is also worth noting that the number of filters of a deep neural network determines the number of parameters needed to be computed in one forward pass of the network to compute the final prediction; therefore, decreasing this number can increase the efficiency of the radiomic sequencer.
To evaluate the efficiency of the synthesized radiomic sequencers, the running time computation of the sequencers is examined at each generation with 1500 sample inputs. Figure 5 shows the running time performance of synthesized radiomic sequencers through generations. As seen, the subsequent generations perform faster than their ancestors, which shows the efficiency of the proposed evolutionary deep intelligence framework.
Decreasing the number of filters in the model decreases the length of the radiomic sequence. As shown in Table 1, the length of the radiomic sequence is shortened generation by generation and the length of the radiomic sequence in the last generation is about half the size of the radiomic sequence of the first generation, demonstrating that it is possible to increase the concision of the radiomic sequence while simultaneously increasing the modeling accuracy.
Figure 6 shows the sensitivity of the evolved radiomic sequencers overlaid with the standard deviation across different folds of cross validation over multiple generations. By evolving the radiomic sequencers generation by generation, the sensitivity increases while the standard deviation decreases (notice that the purple margin narrows over generations). This is another indication of generalizability of the evolved radiomic sequencers as the variance of the models in different cross validation folds of evaluation decreases over generations. This effect is more obvious in Fig. 7 as the standard deviation of the specificity measure is decreased generation by generation and as mentioned before, a more reliable specificity is highly desirable in lung cancer classification. Figure 8 shows the same behavior of the modeling accuracy over generations.
As the last experimental result, Table 2 shows the comparison of the proposed framework (EDRS) with other state-of-the-art approaches. It should be noted that the statistics and modeling performances of other state-of-the-art frameworks are reported directly by Kumar et al.17 and Shafiee et al.18 As seen, the proposed radiomic sequencer in the discovery radiomics framework outperforms other state-of-the-art methods in sensitivity (93.42%), specificity (82.39%), and accuracy (88.78%). To demonstrate the effect of the evolutionary deep intelligence framework on discovery radiomic sequencer, the final network architecture synthesized by the evolutionary deep intelligence framework is trained from scratch. The performance of this network (so-called last generation) is compared with the result of the evolutionarily deep intelligence approach. As seen in Table 1, although the optimized network architecture synthesized by the evolutionary framework is utilized to train the last-generation approach, the sequencer could not compete with the EDRS performance and could not gain the same accuracy level.
Table 2.
Sensitivity | Specificity | Accuracy | |
---|---|---|---|
DARS13 | 0.8314 | 0.2018 | 0.7501 |
CNN-MIL14 | — | — | 0.7069 |
SNRS16 | 0.9107 | 0.7598 | 0.8449 |
DRS17 | 0.7906 | 0.7611 | 0.7752 |
EDRS | 0.9342 | 0.8239 | 0.8878 |
Last generation | 0.8893 | 0.7823 | 0.8355 |
Note: The boldface values show the best accuracy.
4. Conclusion
In this paper, we proposed an evolutionary deep radiomic sequencer discovery framework to better uncover more efficient yet powerful radiomic sequencers for the purpose of lung cancer classification. An evolutionary deep intelligence approach is incorporated within the discovery radiomics framework to evolve the underlying deep neural network architecture of the deep radiomic sequencer over multiple generations and discover a more efficient and generalized deep radiomic sequencer. The ultimate goal here is to synthesize a deep neural network as the underlying core of the radiomic sequencer with fewer numbers of parameters, which produces more concise radiomic sequences that can better capture the differences between healthy and cancerous lung tissue. Results show that by evolving and discovering more efficient radiomic sequencers, the diagnostic accuracy can be increased. Experimental results demonstrate that the EDRS discovered using the proposed evolutionary deep radiomic sequencer discovery approach can outperform other state-of-the-art radiomics-driven methods, achieving a sensitivity of 93.42%, a specificity of 82.39%, and an accuracy of 88.78%. It has been showed in the literature that there is a direct relation between the number of parameters and the need for training data. As a future work, it is suggested to study the effect of an evolutionary deep intelligence framework when limited training data are available.
Acknowledgments
This research has been supported by the Ontario Institute of Cancer Research (OICR), Canada Research Chairs Programs, Natural Sciences and Engineering Research Council of Canada (NSERC), and the Ministry of Research and Innovation of Ontario. The authors also thank Nvidia for the GPU hardware used in this study through the Nvidia Hardware Grant Program.
Biographies
Mohammad Javad Shafiee received his BSc and MSc degrees in computer science and artificial intelligence from Shiraz University, Shiraz, Iran, in 2008 and 2011, respectively; and his PhD in systems design engineering from the University of Waterloo, Waterloo, Canada in 2017. He is currently a research assistant professor in the Department of Systems Design Engineering at the University of Waterloo. His main research focus is on statistical learning and graphical models with random fields and deep learning approaches.
Audrey G. Chung received her BASc and MASc degrees in systems design engineering from the University of Waterloo, Waterloo, Canada, in 2014 and 2016, respectively. She is currently pursuing her PhD in the Department of Systems Design Engineering with her primary focus on artificial intelligence and deep learning. Her research interests include image processing and computer vision, with a specific emphasis on biomedical imaging.
Farzad Khalvati received his PhD in electrical and computer engineering from the University of Waterloo, Waterloo, Canada, in 2009. He is currently an assistant professor in the Department of Medical Imaging, University of Toronto, Toronto, Canada, and a junior scientist at Sunnybrook Research Institute, Toronto, Canada. His research interests include translational research in medical imaging, quantitative biomedical imaging (radiomics) for cancer diagnosis and prognosis, and intelligent medical image computing.
Masoom A. Haider received his MD degree from the University of Ottawa, Ottawa, Canada, in 1986. In 1994, he completed his residency in radiology from the University of Toronto, Toronto, Canada, where he is currently a professor of radiology in the Department of Medical Imaging. He is also the chief of the Medical Imaging Department at Sunnybrook Health Sciences Centre, Toronto, Canada. His research interests include abdominal and pelvic MRI with special emphasis in prostate cancer, therapeutic response assessment, and functional imaging of Cancer.
Alexander Wong is currently the Canada research chair of medical imaging systems, the codirector of the Vision and Image Processing Research Group, and an associate professor in the Department of Systems Design Engineering at University of Waterloo. His current research interests revolve around computational imaging and artificial intelligence, with a focus on integrative computational imaging systems for biomedical imaging and operational artificial intelligence. He has authored more than 400 refereed journals and conference papers, and patents.
Disclosures
The authors have no relevant financial interests in the manuscript and no other potential conflicts of interest to disclose.
References
- 1.American Cancer Society, “Cancer facts & figures 2016,” Atlanta: (2016). [Google Scholar]
- 2.Canadian Cancer Society’s Advisory Committee on Cancer Statistics, Canadian Cancer Statistics, Canadian Cancer Society, Toronto, Ontario: (2016). [Google Scholar]
- 3.Lambin P., et al. , “Radiomics: extracting more information from medical images using advanced feature analysis,” Eur. J. Cancer 48(4), 441–446 (2012). 10.1016/j.ejca.2011.11.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Aerts H. J. W. L., “Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach,” Nat. Commun. 5, 4006 (2014). 10.1038/ncomms5006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gevaert O., et al. , “Non-small cell lung cancer: identifying prognostic imaging biomarkers by leveraging public gene expression microarray data,” Radiology 264(2), 387–396 (2012). 10.1148/radiol.12111607 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Maforo N., et al. , “Radiomics of multi-parametric breast MRI in breast cancer diagnosis: a quantitative investigation of diffusion weighted imaging, dynamic contrast-enhanced, and t2-weighted magnetic resonance imaging,” Med. Phys. 42(6), 3213 (2015). 10.1118/1.4923882 [DOI] [Google Scholar]
- 7.Khalvati F., Wong A., Haider M. A., “Automated prostate cancer detection via comprehensive multi-parametric magnetic resonance imaging texture feature models,” BMC Med. Imaging 15(1), 27 (2015). 10.1186/s12880-015-0069-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Cameron A., et al. , “Multiparametric MRI prostate cancer analysis via a hybrid morphological-textural model,” in Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual Int. Conf. of the IEEE (2014). [DOI] [PubMed] [Google Scholar]
- 9.Anirudh R., et al. , “Lung nodule detection using 3D convolutional neural networks trained on weakly labeled data,” Proc. SPIE 9785, 978532 (2016). 10.1117/12.2214876 [DOI] [Google Scholar]
- 10.Orozco H. M., et al. , “Automated system for lung nodules classification based on wavelet feature descriptor and support vector machine,” Biomed. Eng. Online 14(1), 9 (2015). 10.1186/s12938-015-0003-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Shen W., et al. , “Multi-scale convolutional neural networks for lung nodule classification,” in Int. Conf. on Information Processing in Medical Imaging, pp. 588–599 (2015). 10.1007/978-3-319-19992-4_46 [DOI] [PubMed] [Google Scholar]
- 12.Shen W., et al. , “Multi-crop convolutional neural networks for lung nodule malignancy suspiciousness classification,” Pattern Recognit. 61, 663–673 (2017). 10.1016/j.patcog.2016.05.029 [DOI] [Google Scholar]
- 13.Kumar D., Wong A., Clausi D. A., “Lung nodule classification using deep features in CT images,” in 12th Conf. on Computer and Robot Vision (CRV), pp. 133–138 (2015). 10.1109/CRV.2015.25 [DOI] [Google Scholar]
- 14.Shen W., et al. , “Learning from experts: developing transferable deep features for patient-level lung cancer prediction,” in Int. Conf. on Medical Image Computing and Computer-Assisted Intervention, pp. 124–131, Springer; (2016). 10.1007/978-3-319-46723-8_15 [DOI] [Google Scholar]
- 15.Karimi A.-H., et al. , “Discovery radiomics via a mixture of deep convnet sequencers for multi-parametric MRI prostate cancer classification,” in Int. Conf. Image Analysis and Recognition, Springer; (2017). 10.1007/978-3-319-59876-5_6 [DOI] [Google Scholar]
- 16.Shafiee M. J., et al. , “Discovery radiomics via stochasticnet sequencers for cancer detection,” in NIPS Workshop on Machine Learning in Healthcare (2015). [Google Scholar]
- 17.Kumar D., et al. , “Discovery radiomics for pathologically-proven computed tomography lung cancer prediction,” in Int. Conf. Image Analysis and Recognition, Springer; (2017). 10.1007/978-3-319-59876-5_7 [DOI] [Google Scholar]
- 18.Shafiee M. J., Siva P., Wong A., “Stochasticnet: forming deep neural networks via stochastic connectivity,” IEEE Access 4, 1915–1924 (2016). 10.1109/ACCESS.2016.2551458 [DOI] [Google Scholar]
- 19.Shafiee M. J., Mishra A., Wong A., “Deep learning with Darwin: evolutionary synthesis of deep neural networks,” arXiv preprint arXiv:1606.04393 (2016).
- 20.Shafiee M. J., Barshan E., Wong A., “Evolution in groups: a deeper look at synaptic cluster driven evolution of deep neural networks,” in Future Technologies Conf., IEEE; (2017). [Google Scholar]
- 21.Armato S. G., III, et al. , “The lung image database consortium (LIDC) and image database resource initiative (IDRI): a completed reference database of lung nodules on CT scans,” Med. Phys. 38(2), 915–931 (2011). 10.1118/1.3528204 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Armato S. G., III, et al. , “Lung image database consortium: developing a resource for the medical imaging research community,” Radiology 232(3), 739–748 (2004). 10.1148/radiol.2323032035 [DOI] [PubMed] [Google Scholar]
- 23.Shafiee M. J., Wong A., “Evolutionary synthesis of deep neural networks via synaptic cluster-driven genetic encoding,” in Advanced Neural Information Processing Workshop (NIPS) (2016). [Google Scholar]
- 24.LeCun Y., et al. , “Gradient-based learning applied to document recognition,” Proc. IEEE 86(11), 2278–2324 (1998). 10.1109/5.726791 [DOI] [Google Scholar]
- 25.Ngiam J., et al. , “On optimization methods for deep learning,” in Proc. of the 28th Int. Conf. on Machine Learning (ICML-11), pp. 265–272 (2011). [Google Scholar]
- 26.Toyoda Y., et al. , “Sensitivity and specificity of lung cancer screening using chest low-dose computed tomography,” Br. J. Cancer 98(10), 1602–1607 (2008). 10.1038/sj.bjc.6604351 [DOI] [PMC free article] [PubMed] [Google Scholar]