Abstract
Technological innovation has accelerated the pathological diagnostic process for cancer, especially in digitizing histopathology slides and incorporating deep learning-based approaches to mine the subvisual morphometric phenotypes for improving pathology diagnosis. In this perspective paper, we provide an overview on major deep learning approaches for digital pathology and discuss challenges and opportunities of such approaches to aid cancer diagnosis in digital pathology. In particular, the emerging graph neural network may further improve the performance and interpretability of deep learning in digital pathology.
Keywords: Digital pathology, microscopy imaging, AI, deep learning, cancer diagnosis, graph neural networks
Diagnostic and surgical pathology relies on human interpretations of patterns and morphology. The process is slow, error-prone, and largely unaided by digital technology of the 21st century. Coupling the rapid improvement of deep learning models, vastly enhanced computing power, and reduced storage costs with the rising burden of cancer incidence, the shrinking number of pathologists, the wide-availability of whole slice digitization techniques, and the increasing pathologist case load while reimbursements are dropping, the potential of deploying deep learning in automating the interpretation of digitized pathology slides is tremendous. Tissue or biopsy slides provide an important basis for pathological diagnosis. Many clinical tasks requiring high reproducibility, low fault tolerance, and a high degree of dependence on experienced readers are suitable for applying deep learning for classification, segmentation, or object detection.[1–4]
Before digging in the digital pathology, let us briefly review the history of deep learning from the proposal of the multilayer perception (MP) neuron mathematical model in 1943 [5] to now, where it has shined in several fields. In 1958, Rosenblatt proposed the first generation of neural network single-layer perceptron [6], which can distinguish basic shapes such as triangles and squares. But in 1969, Minsky proved that a single-layer perceptron could not solve the XOR problem and did not have a good learning mechanism.[7] In 1986, the second-generation neural network proposed by Hinton et al.[8], with the back-propagation training method and sigmoid nonlinear mapping, provided an effective solution and triggered the second wave of neural networks. In 1989, LeCun and colleagues [9] invented a convolutional neural network (CNN) and applied it to digital handwriting recognition. However, the backpropagation method has a serious gradient vanishing problem, and the development of neural networks once fell into a trough. Then, in 2006, Hinton proposed a pre-training method by the auto-encoder to suppress the vanishing gradient problem and resurrected interest in CNN [10]. In 2011, the Rectified Linear Unit (ReLU) activation function proposed by Glorot et al. effectively suppressed the vanishing gradient problem [11]. In 2012, Hinton et al. participated in the ImageNet image recognition competition and won the championship through the CNN network AlexNet [12] that far exceeded the classification performance of the second-place support vector machine (SVM) method. Methods such as ReLU and Dropout used in the AlexNet model to suppress the vanishing gradient problem and reduce overfitting can still be found in the recently developed deep learning models. Since then, deep learning has ushered in explosive growth. There are improvements made in the convergence process of deep learning, such as batch normalization [13], transfer learning [14], and skip-connection in ResNet [15–17]. Meanwhile, there are diversification of tasks, such as RNN-based time series deep learning model [18–21], used for image generation generative adversarial networks (GANs) [22–32], object detection models [33–35], natural language processing (NLP) models [36–40], and reinforcement learning models [41–43]. Deep learning systems even surpasses humans in many applications, e.g., Alphago zero [44].
Digital pathology initially referred to digitized histopathology slides using whole slide imaging technology. A large-scale comparison study on tumor specimens from over 1,992 patients concluded that the equivalent diagnostic performance between the pathologists using digitized whole-slide images (WSI) and the pathologists viewing conventional tissue slides [45]. However, studies have shown promise of integrating deep learning-based approaches with human pathologist’s diagnoses in improving diagnostic accuracy while reducing human error rate in cancer diagnosis [46–48]. The fast evolving field of digital pathology now explores the use of deep learning-based approaches for the classification and diagnosis of digitized images [4, 49–52].
For example, Nagpal et al.[53] designed a two-stage deep learning model to help pathologists differentiate between benign and malignant prostate tumors and grade the cancer quantitatively. The first stage was a deep convolutional neural network-based regional Gleason pattern classification. It was trained using 912 slides with 112 million human pathologist-annotated images. At the second stage, the classifications of human pathologists on 1,159 slides were used for training the model. The reference standard included a dataset of 331 slides from 331 patients that were reviewed by independent pathologists and genitourinary pathologists. Performance of the deep learning model vs. 29 human pathologists were compared. The mean accuracy of the deep learning model was 0.7, significantly superior to the accuracy of 0.61 (p = 0.002) by the 29 pathologists. In terms of the Gleason Grade decision thresholds, the receiver operating characteristic (ROC) curves by the deep learning model achieved 0.95–0.96 at each threshold. Especially at Gleason Grade ≥ 4, the deep learning model showed greater sensitivity and specificity than 9 out of 10 pathologists [53]. On the other hand, Mahmood et al.[54] developed a multistage mitotic-cell-detection method based on Faster region convolutional neural network (Faster R-CNN)[55], post-processing and deep convolutional neural networks (deep CNNs) for breast cancer detection. Two breast cancer mitotic cell-detection image datasets, i.e., ICPR 2012 MITOSIS[56] and ICPR 2014[57], were used to train the Faster R-CNN model. In stage 1, an image was input into the trained Faster R-CNN detector. In stage 2, post-processing was performed to reduce the false positives based on statistical, texture, shape, and color features. In stage 3, independently trained Resnet-50[15] and Densenet-201[58] scores were fused by deep CNNs, and the final classification of the mitotic and non-mitotic cells was performed in stage 4 [54]. This method obtained high detection accuracy of mitotic cells in multiple datasets including ICPR 2012, ICPR 2014, and TUPAC16 datasets, and the precision was higher than other ten different methods published previously.
At this point, we can state that CNNs have greatly improved the diagnosis based on histopathological slides in digital pathology [59]. However, in most situations the image is Euclidean while the content is usually not. Traditional CNN needs to learn the content through multiple layers. Due to cropping, however, traditional CNN cannot obtain information outside the field of view, resulting in performance degradation. Moreover, it is precisely because of this process of extracting non-Euclidean information that the learning process of CNN is difficult to comprehend. There are many research papers regarding the interpretability of deep learning [60–63], and notably the emerging development of graph neural networks (GNNs) has brought us a promising path [64, 65].
The first appearance of graphs in the field of mathematics can be traced back to the Königsberg bridges problem [66]. A graph is a kind of data structure that models a set of objects (nodes) and their relationships (edges). It can be stated that GNN is to graphs what CNN is to image processing. In the development of GNN, traditional deep learning methods have also been analogously applied to GNN, such as Attention mechanism [67], Long short-term memory (LSTM) control gate [68], etc. GNN is widely used in social networks, recommendation systems, physical systems, chemical molecular properties prediction, knowledge maps, and many other fields [64]. As a data structure with powerful performance capabilities, graphs combining with deep learning would become the next hot spot in the digital pathology.
The feature extraction ability of GNN is also stronger than CNN. For instance, Yanning et al. [69] developed automatic methods for grading colorectal cancer (CRC). The authors stated that CRC grading usually uses small image patches, and it is impossible to obtain information on the micro-architecture of the entire tissue during the grading process. The authors proposed a cell-graph convolutional neural network (CGC-Net) model that combines the Adaptive GraphSAGE graph convolution structure to obtain multi-level features[70]. In comparison with traditional CNN, the proposed model achieves much better accuracy of 97%. Under the theme of interpretability, GNN clearly shows its understanding of the relationship between objects in the graph without the support of additional models. For example, Mookund Sureka et al.[71] pointed out that the medical community needs interpretable models to conduct due diligence and improve understanding of disease mechanisms and treatment, which is difficult for CNN to do. Thus, the authors adopted an approach to model a histology tissue as a graph of nuclei and developed a GNN framework based on the attention mechanism and node occlusion for disease diagnosis. The method highlights the relative contribution of each cell nucleus in the image and fits the mental model of the pathologists.
In summary, recent development and applications of deep learning have progressively transformed digital pathology, particularly with promising GNN models. Although GNN has certain shortcomings, such as over-smoothing and scalability, the relatively easy-to-analyze structure and stronger generalization ability of GNN reveal its potential in digital pathology.
Highlights.
The article emphasizes the application of AI in digital pathology. In the past, present and future, AI will continue to change the pathology field to make it more efficient, accurate, and interpretable.
The article enumerates the application of AI in digital pathology and demonstrates the improvement of AI in this field.
The emergence of graph neural network makes it possible for AI to be interpretable while being efficient and accurate.
Acknowledgments:
The authors would like to thank Drs. Dorothy Lewis and Rebecca Danforth for proofreading the manuscript.
Funding:
This work was funded by NIH R01CA238727 to H.Z and S.T.C.W, and NIH U01CA253553, NIH R01CA251710, John S. Dunn Research Foundation, and T.T. and W.F. Chao Foundation to S.T.C.W.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Conflicts of Interest: The authors declare no conflicts of interest.
Declaration of interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References:
- 1.Wong ST, The Adoption of Artificial Intelligence in Cancer Pathology and Imaging, in Cancer and Society. 2019, Springer; p. 113–120. [Google Scholar]
- 2.Kononenko I, Machine learning for medical diagnosis: history, state of the art and perspective. Artificial Intelligence in medicine, 2001. 23(1): p. 89–109. [DOI] [PubMed] [Google Scholar]
- 3.Jiang Y, et al. , Emerging role of deep learning- based artificial intelligence in tumor pathology. Cancer Communications, 2020. 40(4): p. 154–166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bera K, et al. , Artificial intelligence in digital pathology—new tools for diagnosis and precision oncology. Nature reviews Clinical oncology, 2019. 16(11): p. 703–715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.McCulloch WS and Pitts W, A logical calculus of the ideas immanent in nervous activity. The bulletin of mathematical biophysics, 1943. 5(4): p. 115–133. [PubMed] [Google Scholar]
- 6.Rosenblatt F, The perceptron: a probabilistic model for information storage and organization in the brain. Psychological review, 1958. 65(6): p. 386. [DOI] [PubMed] [Google Scholar]
- 7.Minsky M and Papert SA, Perceptrons: An introduction to computational geometry. 2017: MIT press. [Google Scholar]
- 8.Rumelhart DE, Hinton GE, and Williams RJ, Learning representations by back-propagating errors. nature, 1986. 323(6088): p. 533–536. [Google Scholar]
- 9.LeCun Y, et al. , Backpropagation applied to handwritten zip code recognition. Neural computation, 1989. 1(4): p. 541–551. [Google Scholar]
- 10.Hinton GE and Salakhutdinov RR, Reducing the dimensionality of data with neural networks. science, 2006. 313(5786): p. 504–507. [DOI] [PubMed] [Google Scholar]
- 11.Glorot X, Bordes A, and Bengio Y. Deep sparse rectifier neural networks. in Proceedings of the fourteenth international conference on artificial intelligence and statistics 2011. [Google Scholar]
- 12.Krizhevsky A, Sutskever I, and Hinton GE. Imagenet classification with deep convolutional neural networks. in Advances in neural information processing systems. 2012. [Google Scholar]
- 13.Ioffe S and Szegedy C, Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167, 2015. [Google Scholar]
- 14.Pan SJ and Yang Q, A survey on transfer learning. IEEE Transactions on knowledge and data engineering, 2009. 22(10): p. 1345–1359. [Google Scholar]
- 15.He K, et al. Deep residual learning for image recognition. in Proceedings of the IEEE conference on computer vision and pattern recognition 2016. [Google Scholar]
- 16.Szegedy C, et al. , Inception-v4, inception-resnet and the impact of residual connections on learning. arXiv preprint arXiv:1602.07261, 2016. [Google Scholar]
- 17.He K, et al. Identity mappings in deep residual networks in European conference on computer vision. 2016. Springer. [Google Scholar]
- 18.Bojanowski P, Joulin A, and Mikolov T, Alternative structures for character-level RNNs. arXiv preprint arXiv:1511.06303, 2015. [Google Scholar]
- 19.Hochreiter S and Schmidhuber J, Long short-term memory. Neural computation, 1997. 9(8): p. 1735–1780. [DOI] [PubMed] [Google Scholar]
- 20.Schuster M and Paliwal KK, Bidirectional recurrent neural networks. IEEE transactions on Signal Processing, 1997. 45(11): p. 2673–2681. [Google Scholar]
- 21.Salehinejad H, et al. , Recent advances in recurrent neural networks. arXiv preprint arXiv:1801.01078, 2017. [Google Scholar]
- 22.Goodfellow I, et al. Generative adversarial nets. in Advances in neural information processing systems. 2014. [Google Scholar]
- 23.Mirza M and Osindero S, Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784, 2014. [Google Scholar]
- 24.Radford A, Metz L, and Chintala S, Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434, 2015. [Google Scholar]
- 25.Salimans T, et al. Improved techniques for training gans. in Advances in neural information processing systems. 2016. [Google Scholar]
- 26.Isola P, et al. Image-to-image translation with conditional adversarial networks. in Proceedings of the IEEE conference on computer vision and pattern recognition 2017. [Google Scholar]
- 27.Zhu J-Y, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks. in Proceedings of the IEEE international conference on computer vision 2017. [Google Scholar]
- 28.Karras T, et al. , Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196, 2017. [Google Scholar]
- 29.Zhang H, et al. Stackgan: Text to photo-realistic image synthesis with stacked generative adversarial networks. in Proceedings of the IEEE international conference on computer vision 2017. [Google Scholar]
- 30.Brock A, Donahue J, and Simonyan K, Large scale gan training for high fidelity natural image synthesis. arXiv preprint arXiv:1809.11096, 2018. [Google Scholar]
- 31.Karras T, Laine S, and Aila T. A style-based generator architecture for generative adversarial networks. in Proceedings of the IEEE conference on computer vision and pattern recognition 2019. [DOI] [PubMed] [Google Scholar]
- 32.Gui J, et al. , A review on generative adversarial networks: Algorithms, theory, and applications. arXiv preprint arXiv:2001.06937, 2020. [Google Scholar]
- 33.Zou Z, et al. , Object detection in 20 years: A survey. arXiv preprint arXiv:1905.05055, 2019. [Google Scholar]
- 34.Jiao L, et al. , A survey of deep learning-based object detection. IEEE Access, 2019. 7: p. 128837–128868. [Google Scholar]
- 35.Liu L, et al. , Deep learning for generic object detection: A survey. International journal of computer vision, 2020. 128(2): p. 261–318. [Google Scholar]
- 36.Torfi A, et al. , Natural Language Processing Advancements By Deep Learning: A Survey. arXiv preprint arXiv:2003.01200, 2020. [Google Scholar]
- 37.Otter DW, Medina JR, and Kalita JK, A survey of the usages of deep learning for natural language processing. IEEE Transactions on Neural Networks and Learning Systems, 2020. [DOI] [PubMed] [Google Scholar]
- 38.Qiu X, et al. , Pre-trained models for natural language processing: A survey. arXiv preprint arXiv:2003.08271, 2020. [Google Scholar]
- 39.Minaee S, et al. , Deep learning based text classification: A comprehensive review. arXiv preprint arXiv:2004.03705, 2020. [Google Scholar]
- 40.Hu D An introductory survey on attention mechanisms in NLP problems in Proceedings of SAI Intelligent Systems Conference. 2019. Springer. [Google Scholar]
- 41.Li G, et al. , Human-centered reinforcement learning: a survey. IEEE Transactions on Human-Machine Systems, 2019. 49(4): p. 337–349. [Google Scholar]
- 42.Moerland TM, Broekens J, and Jonker CM, Model-based reinforcement learning: A survey. arXiv preprint arXiv:2006.16712, 2020. [Google Scholar]
- 43.Luong NC, et al. , Applications of deep reinforcement learning in communications and networking: A survey. IEEE Communications Surveys & Tutorials, 2019. 21(4): p. 3133–3174. [Google Scholar]
- 44.Silver D, et al. , Mastering the game of go without human knowledge. nature, 2017. 550(7676): p. 354–359. [DOI] [PubMed] [Google Scholar]
- 45.Mukhopadhyay S, et al. , Whole slide imaging versus microscopy for primary diagnosis in surgical pathology: a multicenter blinded randomized noninferiority study of 1992 cases (pivotal study). The American journal of surgical pathology, 2018. 42(1): p. 39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Wang D, et al. , Deep learning for identifying metastatic breast cancer. arXiv preprint arXiv:1606.05718, 2016. [Google Scholar]
- 47.Steiner DF, et al. , Impact of deep learning assistance on the histopathologic review of lymph nodes for metastatic breast cancer. The American journal of surgical pathology, 2018. 42(12): p. 1636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Dimitriou N, Arandjelović O, and Caie PD, Deep learning for whole slide image analysis: An overview. Frontiers in Medicine, 2019. 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Niazi MKK, Parwani AV, and Gurcan MN, Digital pathology and artificial intelligence. The lancet oncology, 2019. 20(5): p. e253–e261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Janowczyk A and Madabhushi A, Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases. Journal of pathology informatics, 2016. 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Signaevsky M, et al. , Artificial intelligence in neuropathology: deep learning-based assessment of tauopathy. Laboratory Investigation, 2019. 99(7): p. 1019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Becker JU, et al. , Artificial intelligence and machine learning in nephropathology. Kidney International, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Nagpal K, et al. , Development and validation of a deep learning algorithm for improving Gleason scoring of prostate cancer. NPJ digital medicine, 2019. 2(1): p. 1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Raverot V, et al. , Age-and sex-specific TSH upper-limit reference intervals in the general French population: there is a need to adjust our actual practices. Journal of Clinical Medicine, 2020. 9(3): p. 792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Ren S, et al. Faster r-cnn: Towards real-time object detection with region proposal networks. in Advances in neural information processing systems 2015. [DOI] [PubMed] [Google Scholar]
- 56.Roux L, et al. , Mitosis detection in breast cancer histological images An ICPR 2012 contest. Journal of pathology informatics, 2013. 4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Roux L, et al. , Mitos and atypia. Image Pervasive Access Lab (IPAL), Agency Sci. Technol. & Res. Inst. Infocom Res., Singapore, Tech. Rep, 2014. [Google Scholar]
- 58.Huang G, et al. Densely connected convolutional networks. in Proceedings of the IEEE conference on computer vision and pattern recognition 2017. [Google Scholar]
- 59.Acs B, Rantalainen M, and Hartman J, Artificial intelligence as the next step towards precision pathology. Journal of Internal Medicine, 2020. [DOI] [PubMed] [Google Scholar]
- 60.Gilpin LH, et al. Explaining explanations: An overview of interpretability of machine learning in 2018 IEEE 5th International Conference on data science and advanced analytics (DSAA). 2018. IEEE. [Google Scholar]
- 61.Chakraborty S, et al. Interpretability of deep learning models: a survey of results in 2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI). 2017. IEEE. [Google Scholar]
- 62.Ahmad MA, Eckert C, and Teredesai A. Interpretable machine learning in healthcare. in Proceedings of the 2018 ACM international conference on bioinformatics, computational biology, and health informatics 2018. [Google Scholar]
- 63.Serag A, et al. , Translational AI and deep learning in diagnostic pathology. Frontiers in Medicine, 2019. 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Zhou J, et al. , Graph neural networks: A review of methods and applications. arXiv preprint arXiv:1812.08434, 2018. [Google Scholar]
- 65.Wu Z, et al. , A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems, 2020. [DOI] [PubMed] [Google Scholar]
- 66.Barnett JH, Early writings on graph theory: Euler circuits and the Königsberg bridge problem. Part II: Historical Projects in Discrete Mathematics and Computer Science of Resources for Teaching Discrete Mathematics: Classroom Project, History Modules, and Articles, MAA Notes, 2005. 74: p. 197–208. [Google Scholar]
- 67.Veličković P, et al. , Graph attention networks. arXiv preprint arXiv:1710.10903, 2017. [Google Scholar]
- 68.Si C, et al. An attention enhanced graph convolutional lstm network for skeleton-based action recognition. in Proceedings of the IEEE conference on computer vision and pattern recognition 2019. [Google Scholar]
- 69.Zhou Y, et al. Cgc-net: Cell graph convolutional network for grading of colorectal cancer histology images. in Proceedings of the IEEE International Conference on Computer Vision Workshops 2019. [Google Scholar]
- 70.Hamilton W, Ying Z, and Leskovec J. Inductive representation learning on large graphs. in Advances in neural information processing systems. 2017. [Google Scholar]
- 71.Sureka M, et al. , Visualization for Histopathology Images using Graph Convolutional Neural Networks. arXiv preprint arXiv:2006.09464, 2020. [Google Scholar]