Abstract
Objective:
To analyse the automatic classification performance of a convolutional neural network (CNN), Google Inception v3, using tomographic images of odontogenic keratocysts (OKCs) and ameloblastomas (AMs).
Methods:
For construction of the database, we selected axial multidetector CT images from patients with confirmed AM (n = 22) and OKC (n = 18) based on a conclusive histopathological report. The images (n = 350) were segmented manually and data augmentation algorithms were applied, totalling 2500 images. The k-fold × five cross-validation method (k = 2) was used to estimate the accuracy of the CNN model.
Results:
The accuracy and standard deviation (%) of cross-validation for the five iterations performed were 90.16 ± 0.95, 91.37 ± 0.57, 91.62 ± 0.19, 92.48 ± 0.16 and 91.21 ± 0.87, respectively. A higher error rate was observed for the classification of AM images.
Conclusion:
This study demonstrated a high classification accuracy of Google Inception v3 for tomographic images of OKCs and AMs. However, AMs images presented the higher error rate.
Keywords: Ameloblastoma, Odontogenic Cysts, Artificial intelligencex, Tomographyx, X-Ray computed
Introduction
Intraosseous benign odontogenic lesions comprise a group of pathological entities characterized by variable symptoms, clinical presentations and radiographic and histological features. These lesions are detected incidentally during routine radiographic examination since symptoms are uncommon at early stages. Although benign, these lesions exhibit a destructive potential associated with high rates of recurrence and can reach a large size until they become symptomatic,1 reducing the quality of life of affected patients. Excluding inflammatory pathologies and odontoma, odontogenic keratocyst (OKC) and ameloblastoma (AM) are the most prevalent benign intraosseous lesions cited in different studies.1–4 The similar imaging findings of these lesions render their distinction a challenge.5
OKC is the third most common odontogenic cyst that frequently affects younger individuals, with a peak incidence between the second and third decades of life.3,6 Radiographically, OKC appears as a well-defined uni- or multilocular radiolucency associated with minimal buccolingual expansion.6–8 This odontogenic cyst frequently involves the posterior mandible and tends to grow towards the mandibular ramus.9,10 Treatment generally consists of enucleation combined with other adjuvant therapies in order to reduce the risk of recurrence.10,11
AM is the most common epithelial odontogenic tumour,4 which frequently affects the posterior mandible. A peak incidence between the fourth and sixth decades of life has been reported.8 Radiographically, AM appears as a uni- or multilocular radiolucent lesion associated with impacted teeth, tooth displacement, root resorption, cortical bone expansion and/or perforation, and the presence of septa that confer a soap bubble or honeycomb appearance.8,12 In view of the high risk of recurrence, surgical resection is usually the therapy of choice for this tumour.13
Since different treatment approaches exist for these pathologies and their imaging features may be confused, the subjective interpretation by the examiner to identify particularities is time-consuming and relies on his/her experience for suggesting a diagnosis as there are no well-defined criteria to distinguish these tumors. Within this context, auxiliary tools such as the assessment of intralesional heterogeneity using Hounsfield unit (HU) may contribute to this distinction.14
In the field of computer science, artificial intelligence is the study of computer algorithms and models that simulate learning processes and problem-solving of humans. An interesting subdivision of this field is convolutional neural networks (CNNs), which consist of algorithms inspired by the structure of the human brain, with a function analogue to that of neuronal cells.15 It is important to highlight that artificial intelligence is not a new technology, but the advances in the processing capacity of machines in recent years have made its use possible and have increased interest and knowledge in the area.
In medicine, studies evaluating the use of neural networks as an auxiliary diagnostic tool have reported promising results regarding the detection and classification of malignant lesions.16–19 Although introduced later in dentistry, studies also suggest an excellent performance of CNNs for the assessment of radiographic images, with different applications20–24 that range from the detection of caries lesions in two-dimensional exams25 to the diagnosis of Sjögren’s syndrome using more complex methods such as CT.26
Considering the similar imaging features of OKCs and AMs and the importance of an accurate presumptive diagnosis for adequate therapy, objective tools can greatly assist in the diagnostic process. Within this context, the application of CNNs is a possibility of combining supervised learning algorithms with imaging methods in an attempt to increase diagnostic accuracy, to reduce the evaluation time and to establish the most appropriate therapeutic approach considering the high recurrence rates of OKCs and AMs.11
Understanding this problem, the aim of this study is to analyse the automatic classification performance of a CNN model using tomographic images of OKCs and AMs.
Methods and materials
This study was conducted at the Dental Radiology Service, School of Dentistry, Federal University of Bahia (UFBA), in collaboration with the Postgraduate Program in Computer Science (PGComp/UFBA) and the Federal Institute of Education, Science and Technology of Bahia (IFBaiano). The project was approved by the Ethics Committee under approval number 366.989.
A total of 48 multidetector computed tomography (MDCT) exams of patients seen at the Oral-Maxillofacial Surgery and Traumatology Service of the School of Dentistry, UFBA, and Santo Antônio Hospital (Salvador, Bahia, Brazil) between 2013 and 2019 were selected. Images with a conclusive anatomopathological report of AM or OKC provided by an experienced oral pathologist were included. Exclusion criteria were as follows: recent intervention at the lesion site, clinically evident infection and lesions > 80 mm. We selected these conditions because they could result in significant changes in content and clinical behaviour. Images with a large number of metallic artefacts involving the region of the lesion were also excluded from the data set, for a total of eight patients excluded.
All images were acquired by HiSpeed CT (GE Healthcare, Chicago, USA) in 0.6 mm with 15.8 FOV, 512 × 512 matrix. The MDCT scans were processed with the soft tissue customized window of the Horos™ DICOM Viewer software (Pixmeo, SARL, Bernex, Switzerland) in order to identify differences in the density of the intralesional tissue. The tomographic sections comprising the image compatible with the lesion were exported from the DICOM (Digital Imaging and Communications in Medicine) to TIFF format (Tagged Image File Format) and then segmented.
The images were segmented manually and individually into rectangular ROIs by a single experienced and calibrated examiner using the ImageJ software (National Institutes of Health, Bethesda, Maryland, USA), which resulted in arbitrary regions of interest of different sizes as shown in Figure 1. The images were saved in PNG (Portable Network Graphics) format, the standard format of CNN which generates compressed files without loss of information.
Figure 1.
TCMD. Axial view. Soft tissue customized window (WL 76, WW 209); b, c and d show segmented ROIs of axial sections of TCMD obtained from different patients with Am and f, g and h show ROIs after segmentation of axial sections of TCMD obtained from different patients with OKC.
Construction of the dataset
Axial tomographic images of 22 AMs and 18 OKCs were selected for construction of the dataset, resulting in a total of 350 samples of the original set of images. Only sections in which the tumour was visible and measured less than 80 mm were selected.
Data augmentation is usually applied to increase the number of images and to minimize overfitting, that is when a CNN model performs well during training but generates many classification errors when exposed to a new dataset. This technique consists of artificially augmenting a dataset using transformation methods in which new images are produced from the original images.27,28 For this purpose, rotation, mirroring, random displacement in the horizontal and vertical directions, elastic distortion and resizing were applied to the original images of the dataset. During the augmentation process, not all of these transformations were applied to the images. For each transformation, a probability to its application can be defined. Data augmentation is a common procedure in medical image datasets to deal with the usually smaller number of samples. Previous studies29,30 analysed the use of such procedures in medical image datasets and concluded that it is an important step to improve the learning process. There are recent works that propose the use of CNNs to produce more parametrized augmentation images in the medical image domain.31,32 In this study, data augmentation was applied to our 350 images dataset, using the transformations rotation (90o and 180o), flips (horizontal and vertical), elastic distortion (configure to be moderate) and resizing (to regularize the different ROIs size). The result was a 7.14 times augmentation, which means 2500 images. All images, original and augmented, were resized to the fixed resolution of 299 × 299 pixels, as required by the CNN Inception v3.
The database was then divided into two main sets: a development set, used to learn the parameters, to estimate the generalization error during or after training, and a final test set used for final validation of the network.33 For this study, 70% of the samples from the original database were selected for the development set and the last 30% of the samples were selected for the final test set. It is important to notice that these two sets, development and final test, are disjunct, that means images in one set did not appear in the other. This selection was done manually, arranging the images on different directories.
The development set was subdivided into three subsets: training, validation and test, as shown in Figure 2. In this subdivision, 10% of the samples were used to define the test set and the remaining 90% of the samples of the development set were divided equally to build the training and validation sets. It is important to notice that the test set was built once and the same set of samples was used during the training/validation steps (Figure 2), related to the cross-validation process, described afterwards.
Figure 2.
Schematic representation of the 2-fold ×5 method used to promote the cross-validation procedure during the learning phase. The development data consist of 70% of the original dataset and is selected manually. The test set corresponds to 10% of the development dataset.
Cross-validation
One method used for evaluating the generalization capability of a model is the cross-validation. It is a statistical technique that can determine the robustness of a model.34 The basic concept of this method is the subdivision of the dataset into mutual exclusive subsets, and then use some of these subsets to estimate the model parameters (training), while the other subsets are used to validate the model.35 There are many ways to produce such partitioning in the literature. In this research, we used the k-fold cross-validation, which subdivides the training set into k folds with nearly equally size subsets or folds, containing samples distributed randomly. Then, k iterations of training and validation steps are applied using a different fold of data for validation and the remaining k-1 folds for learning/training. Special care should be taken to ensure that each fold is statistically representative of the original dataset.36
More precisely, this study used the 2-fold ×5 scheme, proposed by Dietterich.37 Figure 2 depicts this scheme: the development set was divided into 2-folds that are exchanged during the cross-validation process. The cross-validation is repeated five times. Of course, the folds are generated from scratch for each one of the five cross-validation iterations. The test dataset is kept unchanged during this process. According to Dietterich,37 this scheme improves the test stability and reduces the computational cost. Also, in the context of this study, where we aim to show that CNNs can improve the differentiation of two kinds of lesions, we believe that this approach was appropriate.
Convolutional neural network architecture
The CNNs are one of the most advanced deep learning architectures.38 This learning approach allows computational models composed of multiple processing layers to learn data representations with multiple levels of abstraction. In this study, we used the Inception v3 model developed by Google LLC (Mountain View, CA, USA) (Figure 3).
Figure 3.
Schematic representation of the Google Inception v3 model architecture.
The typical architecture of a CNN comprises a series of stages. The first stage consists of two layers responsible for feature extraction: convolutional layers, which process inputs considering local receptive fields, and the pooling layer, which reduces the spatial dimensionality of the representations. The second stage consists of a fully connected layer, which acts as the classifier itself. The output of the CNN defines the probability that the image belongs to one of the classes for which the network was trained.39
Inception model
The architecture of the Inception v3 model consists of a stack of modules that look like small independent networks, divided into several parallel branches40 and is designed to extract features with a reduced number of images, thus decreasing the computational cost of accurate classification of images using the deep learning method. The aim of this model, which was introduced by Szegedy,41 is to act as a multi-level feature extractor, executing convolutions with three different kernel sizes (1 × 1, 3 × 3 and 5 × 5) and thus covering elements of interest with different sizes and positions in an image. The Inception model was run applying the RMSprop optimizer, which is effective and practical for the learning method used in this study.
All the experiments described in this study were performed on a Dell Inspiron i17-7500U notebook, with an Intel i7 2.7 GHz processor, 8 GB of RAM, NVidia GeForce MX150 GPU, running Linux/Ubuntu 20.04 operating system. The application was written in Python35,40,42 programming language and used the following packages: TensorFlow v.1.9.0 and Keras v.2.2.0 for the CNN; and Augmentor for the image augmentation process. Although the tests were made in a Linux environment, the software developed is multi-platform and can run on other operating systems such as Windows and MacOS, with minor adjustments, as well. In this machine, the mean time per training was about 42.69 h (without GPU acceleration enabled).
Results
In order to evaluate the learning process, the precision and accuracy of the model were analysed.35 At the end of the 2-fold ×5 cross-validation process, the Google Inception v3 neural network model used for the automatic classification of tomographic images of OKCs and AM reports the precision value and a 2 × 2 matrix, called confusion matrix. Table 1 shows the matrixes for each 2-fold cross-validation step in the five interactions. As our application deals with a 2-class classifier, the values on this matrix correspond to the percentage of true-positives in the main diagonal and the percentage of false-positives and false-negatives in the secondary diagonal. Based on this matrix, many measures can be defined, such as accuracy, precision, sensibility and F1-Score.35,43 In our analysis, accuracy is the most important measure, indicating the proportion of samples that produces the right classification.
Table 1.
Confusion matrix for the classification of odontogenic keratocysts and ameloblastomas images after data augmentation (n = 2500) in each training run (×5 iterations)
| Input | first | second | third | fourth | fifth | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| AM | OKC | AM | OKC | AM | OKC | AM | OKC | AM | OKC | |
| AM | 0.85 | 0.15 | 0.85 | 0.15 | 0.84 | 0.16 | 0.77 | 0.23 | 0.84 | 0.16 |
| OKC | 0.05 | 0.95 | 0.17 | 0.83 | 0.07 | 0.93 | 0.06 | 0.94 | 0.09 | 0.91 |
AM, Ameloblastoma;OKC, Odontogenic keratocyst.
Mean time per training run: 42.69 h.
In general, the confusion matrix presented in Table 1 indicates a higher error rate for AM images, suggesting a better classification capacity for axial sections of OKCs.
Table 2 shows the accuracy achieved by the CNN in the validation of each iteration. The values exceeded 90% in all five iterations, with a standard deviation less than 1%, suggesting an excellent classification capacity when new images were presented to the tested model.
Table 2.
Accuracy (%) and standard deviation of cross-validation for five iterations
| Accuracy | first | second | third | fourth | fifth |
|---|---|---|---|---|---|
| Mean ± standard deviation (%) | 90.16 ± 0.95 | 91.37 ± 0.57 | 91.26 ± 0.19 | 92.48 ± 0.16 | 91.21 ± 0.87 |
Discussion
The present study demonstrated the capacity of the Google Inception v3 CNN model to classify MDCT images of OKCs and AMs with accuracies higher than 90%. This result agrees with findings reported in dental20,25 and medical studies.44–46 To our knowledge, this is the first study to test the classification performance for two odontogenic tumours with similar characteristics rather than only the identification of pathological entities in imaging exams.
The data of Table 1 generated by cross-validation repetitions show a higher error rate for the classification of AM images, corroborating the findings of Kwon et al. (2020).47 This higher rate might be explained by the radiographic features of this tumor. As suggested by Kitisubkanchana et al. (2020),5 OKCs are generally unilocular with smooth borders and exhibit anteroposterior growth without significant cortical bone expansion, while most AMs are multilocular and have irregular borders. This disformed feature of AM, as indicated in Figure 1, may have impaired the recognition of the tomographic image patterns of this tumor by CNN, resulting in classification errors. Furthermore, the existence of multiple fluid-containing cystic areas detected in different regions of AM results in tomographic sections with areas of lower tomographic density,14 which can confuse the perception of the CNN and increase the probability of errors.
Regarding the content of the two lesions evaluated, it is known that, although not statistically significant, a difference exists between the HU values provided by MDCT.14 OKCs are composed of poorly soluble proteins such as parakeratin and collagen, which result in lower average attenuation values (HU) compared to AMs. The latter tumor consists of epithelial cells and dense connective tissue that confers higher average attenuation.14,48 MDCT images were chosen because this method allows differentiating the tissues comprising the tumor mass and content of the cystic cavity by selecting an adequate window for this purpose.49
Comparison of the diagnostic accuracy of panoramic radiography and CT for the differentiation of OKCs and AMs showed that the three-dimensional method contributes substantially to the correct diagnosis, particularly because of its capacity to identify high-density areas, number of loculations, and bone expansion.48 In the present study, these characteristics may have been detected during the step of feature extraction and used by the CNN for the distinction of the two pathological entities. Some authors have reported promising results regarding the computer-assisted diagnosis using CT images of periapical cysts and OKCs,50 radiolucent lesions51,52 and maxillofacial cysts.53,54
We highlight that this is the first study evaluating the use of CNNs for the automatic classification of benign odontogenic lesions using MDCT images. Moreover, unlike the present study which evaluated the differentiation of two complex and similar pathological entities, most studies applying CNNs to radiographic images in dentistry used two-dimensional imaging methods and aimed to detect or diagnose the presence of diseases compared to images of healthy structures, therefore with a lower level of complexity than the study described in this article.
Although the diagnostic value of CNN models is considerably high, their application is still not a reality55 since the steps of segmentation and training are time-consuming and incur high computational costs, which turns it into a limitation for its usage. The exclusion of large diameter lesions and infected lesions is also a limitation of this study since these characteristics can raise doubts about diagnosis and imitate malignant lesions. In future studies, by including the study of these images, CNN’s performance limits can be better explored. Afterwards one of main limitations of the study is a small sample size. Although the same image used in the training set was not reused for test set, there is a chance that images from the same patient could be used for both processes. Further studies with larger samples should be evaluated to ensure the accuracy and to prevent overfitting.
After successive studies and improvements in the methodological process, these systems should become an integral and essential part of the radiology workflow. The model tested here is intended to be readily available for scientific purpose. It is expected that in the medium term and after expanding the database and new training and test runs in order to ensure high rates of accuracy, this tool will be effectively integrated into the workflow of daily clinical practice.
Conclusion
According to the methodology used in the present study, the Google Inception v3 model provided a high classification accuracy for tomographic images of OKCs and AMs. A higher rate of classification errors was observed for AM images compared to OKCs. Since this is the first study that used an automatic classifier to differentiate similar odontogenic tumors based on MDCT images, further investigations are necessary to elucidate the true accuracy of CNNs for the diagnosis of these tumors and to identify the best approach to introducing artificial intelligence in the daily practice of radiologists. It is strongly suggested that studies including other types of lesions histopathological lesions be performed. An analysis of other parametrizations and other types of CNNs could be made. The new approach to generate data augmentation, based on GANs (Generative Adversarial Networks), also is an interesting future research that can improve the learning process.
Footnotes
Acknowledgements: This study was supported by CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico) (grant 458665/2014-2) and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Brazil.
Contributor Information
Mayara Simões Bispo, Email: may.simoes@hotmail.com.
Mário Lúcio Gomes de Queiroz Pierre Júnior, Email: mpierrejr@gmail.com.
Antônio Lopes Apolinário Jr, Email: alopesajr@gmail.com.
Jean Nunes dos Santos, Email: jeanunes@ufba.br.
Braulio Carneiro Junior, Email: brauliocj@gmail.com.
Frederico Sampaio Neves, Email: fredsampaio@yahoo.com.br.
Iêda Crusoé-Rebello, Email: iedacr@ufba.br.
REFERENCES
- 1.Ledesma-Montes C, Mosqueda-Taylor A, Carlos-Bregni R, de León ER, Palma-Guzmán JM, Páez-Valencia C, et al. Ameloblastomas: a regional Latin-American multicentric study. Oral Dis 2007; 13: 303–7. doi: 10.1111/j.1601-0825.2006.01284.x [DOI] [PubMed] [Google Scholar]
- 2.MacDonald D. Lesions of the jaws presenting as radiolucencies on cone-beam CT. Clin Radiol 2016; 71: 972–85. doi: 10.1016/j.crad.2016.05.018 [DOI] [PubMed] [Google Scholar]
- 3.Johnson NR, Gannon OM, Savage NW, Batstone MD. Frequency of odontogenic cysts and tumors: a systematic review. J Investig Clin Dent 2014; 5: 9–14. doi: 10.1111/jicd.12044 [DOI] [PubMed] [Google Scholar]
- 4.Siriwardena BSMS, Crane H, O'Neill N, Abdelkarim R, Brierley DJ, Franklin CD, et al. Odontogenic tumors and lesions treated in a single specialist oral and maxillofacial pathology unit in the United Kingdom in 1992-2016. Oral Surg Oral Med Oral Pathol Oral Radiol 2019; 127: 151–66. doi: 10.1016/j.oooo.2018.09.011 [DOI] [PubMed] [Google Scholar]
- 5.Kitisubkanchana J, Reduwan NH, Poomsawat S, Pornprasertsuk-Damrongsri S, Wongchuensoontorn C. Odontogenic keratocyst and ameloblastoma: radiographic evaluation. Oral Radiol 2021; 37: 55–65. doi: 10.1007/s11282-020-00425-2 [DOI] [PubMed] [Google Scholar]
- 6.Alves DBM, Tuji FM, Alves FA, Rocha AC, Santos-Silva ARD, Vargas PA, et al. Evaluation of mandibular odontogenic keratocyst and ameloblastoma by panoramic radiograph and computed tomography. Dentomaxillofac Radiol 2018; 47: 20170288. doi: 10.1259/dmfr.20170288 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ringer E, Kolokythas A. Bone margin analysis for benign odontogenic tumors. Oral Maxillofac Surg Clin North Am 2017; 29: 293–300. doi: 10.1016/j.coms.2017.03.006 [DOI] [PubMed] [Google Scholar]
- 8.El-Naggar AK, Chan JKC, Grandis JR, Takata T, Slootweg PJ. Odontogenic and maxillofacial bone tumours. In: El-Naggar AK (4ed). WHO classification of head and neck tumours. Lyon, France: IARC; 2017. . 204–60. [Google Scholar]
- 9.Gohel A, Oda M, Katkar AS, Sakai O. Multidetector row computed tomography in maxillofacial imaging. Dent Clin North Am 2018; 62: 453–65. doi: 10.1016/j.cden.2018.03.005 [DOI] [PubMed] [Google Scholar]
- 10.Rajendra Santosh AB. Odontogenic cysts. Dent Clin North Am 2020; 64: 105–19. doi: 10.1016/j.cden.2019.08.002 [DOI] [PubMed] [Google Scholar]
- 11.Chrcanovic BR, Gomez RS. Recurrence probability for keratocystic odontogenic tumors: an analysis of 6427 cases. J Craniomaxillofac Surg 2017; 45: 244–51. doi: 10.1016/j.jcms.2016.11.010 [DOI] [PubMed] [Google Scholar]
- 12.Scarfe WC, Toghyani S, Azevedo B. Imaging of benign odontogenic lesions. Radiol Clin North Am 2018; 56: 45–62. doi: 10.1016/j.rcl.2017.08.004 [DOI] [PubMed] [Google Scholar]
- 13.Hendra FN, Natsir Kalla DS, Van Cann EM, de Vet HCW, Helder MN, Forouzanfar T. Radical vs conservative treatment of Intraosseous ameloblastoma: systematic review and meta-analysis. Oral Dis 2019; 25: 1683–96. doi: 10.1111/odi.13014 [DOI] [PubMed] [Google Scholar]
- 14.Crusoé-Rebello I, Oliveira C, Campos PSF, Azevedo RA, dos Santos JN. Assessment of computerized tomography density patterns of ameloblastomas and keratocystic odontogenic tumors. Oral Surg Oral Med Oral Pathol Oral Radiol Endod 2009; 108: 604–8. doi: 10.1016/j.tripleo.2009.03.008 [DOI] [PubMed] [Google Scholar]
- 15.Mupparapu M, Wu C-W, Chen Y-C. Artificial intelligence, machine learning, neural networks, and deep learning: Futuristic concepts for new dental diagnosis. Quintessence Int 2018; 49: 687–8. doi: 10.3290/j.qi.a41107 [DOI] [PubMed] [Google Scholar]
- 16.Brinker TJ, Hekler A, Utikal JS, Grabe N, Schadendorf D, Klode J, et al. Skin cancer classification using convolutional neural networks: systematic review. J Med Internet Res 2018; 20: e11936. doi: 10.2196/11936 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Gong J, Liu J, Hao W, Nie S, Zheng B, Wang S, et al. A deep residual learning network for predicting lung adenocarcinoma manifesting as ground-glass nodule on CT images. Eur Radiol 2020; 30: 1847–55. doi: 10.1007/s00330-019-06533-w [DOI] [PubMed] [Google Scholar]
- 18.Cho B-J, Bang CS, Park SW, Yang YJ, Seo SI, Lim H, et al. Automated classification of gastric neoplasms in endoscopic images using a convolutional neural network. Endoscopy 2019; 51: 1121–9. doi: 10.1055/a-0981-6133 [DOI] [PubMed] [Google Scholar]
- 19.Sim Y, Chung MJ, Kotter E, Yune S, Kim M, Do S, et al. Deep Convolutional neural network-based software improves radiologist detection of malignant lung nodules on chest radiographs. Radiology 2020; 294: 199–209. doi: 10.1148/radiol.2019182465 [DOI] [PubMed] [Google Scholar]
- 20.Aubreville M, Knipfer C, Oetter N, Jaremenko C, Rodner E, Denzler J. Automatic classification of cancerous tissue in Laserendomicroscopy images of the oral cavity using deep learning. Sci Rep 2017; 7: 1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Murata M, Ariji Y, Ohashi Y, Kawai T, Fukuda M, Funakoshi T, et al. Deep-learning classification using convolutional neural network for evaluation of maxillary sinusitis on panoramic radiography. Oral Radiol 2019; 35: 301–7. doi: 10.1007/s11282-018-0363-7 [DOI] [PubMed] [Google Scholar]
- 22.Fukuda M, Inamoto K, Shibata N, Ariji Y, Yanashita Y, Kutsuna S, et al. Evaluation of an artificial intelligence system for detecting vertical root fracture on panoramic radiography. Oral Radiol 2020; 36: 337–43. doi: 10.1007/s11282-019-00409-x [DOI] [PubMed] [Google Scholar]
- 23.Chen H, Zhang K, Lyu P, Li H, Zhang L, Wu J, et al. A deep learning approach to automatic teeth detection and numbering based on object detection in dental periapical films. Sci Rep 2019; 9: 1–11. doi: 10.1038/s41598-019-40414-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lee J-H, Kim D-H, Jeong S-N, Choi S-H. Diagnosis and prediction of periodontally compromised teeth using a deep learning-based convolutional neural network algorithm. J Periodontal Implant Sci 2018; 48: 114–23. doi: 10.5051/jpis.2018.48.2.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Lee J-H, Kim D-H, Jeong S-N, Choi S-H. Detection and diagnosis of dental caries using a deep learning-based convolutional neural network algorithm. J Dent 2018; 77: 106–11. doi: 10.1016/j.jdent.2018.07.015 [DOI] [PubMed] [Google Scholar]
- 26.Kise Y, Ikeda H, Fujii T, Fukuda M, Ariji Y, Fujita H, et al. Preliminary study on the application of deep learning system to diagnosis of Sjögren’s syndrome on CT images. Dentomaxillofac Radiol 2019; 48: 20190019. doi: 10.1259/dmfr.20190019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. 2011. Available from: http://code.google.com/p/cuda-convnet/.
- 28.Shijie J, Ping W, Peiyi J, Siping H. Research on data augmentation for image classification based on convolution neural networks. In: Proc - 2017 Chinese Autom Congr CAC; 2017. pp. 4165–70. [Google Scholar]
- 29.Nalepa J, Marcinkiewicz M, Kawulok M. Data augmentation for Brain-Tumor segmentation: a review. Front Comput Neurosci 2019; 13: 83. doi: 10.3389/fncom.2019.00083 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Cirillo MD, Abramian D, Eklund A. What is the best data augmentation for 3D brain tumor segmentation?In: arXiv; 2020. [Google Scholar]
- 31.Xu J, Li M, Zhu Z. Automatic data augmentation for 3D medical image segmentation. In: ArXiv; 2020. [Google Scholar]
- 32.Isensee F, Jäger PF, Kohl SAA, Petersen J, Maier-Hein KH. Automated design of deep learning methods for biomedical image segmentation. Nat Methods 2019; 18: 203–11. [DOI] [PubMed] [Google Scholar]
- 33.Goodfellow I, Bengio Y, Courville A. Deep learning. 2016. Available from: https://www.deeplearningbook.org/.
- 34.Zhang G, Y. Hu M, Eddy Patuwo B, C. Indro D, MY H, Patuwo BE. Artificial neural networks in bankruptcy prediction: general framework and cross-validation analysis. Eur J Oper Res 1999; 116: 16–32. doi: 10.1016/S0377-2217(98)00051-4 [DOI] [Google Scholar]
- 35.Sarkar D, Bali R, Sharma T. Practical machine learning with python. practical machine learning with python. New York: Apress; 2018. [Google Scholar]
- 36.Kohavi R. A study of cross-validation and bootstrap for accuracy estimation and model selection. 1995. Available from: http://robotics.stanford.edu/~ronnyk.
- 37.Dietterich TG. Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput 1998; 10: 1895–923. doi: 10.1162/089976698300017197 [DOI] [PubMed] [Google Scholar]
- 38.Dumoulin V, Visin F, GEP B. A guide to convolution arithmetic for deep learning. 2018. Available from: http://ethanschoonover.com/solarized.
- 39.LeCun Y, Bengio Y, Hinton G. Deep learning. Nature 2015; 521: 436–44. doi: 10.1038/nature14539 [DOI] [PubMed] [Google Scholar]
- 40.Chollet F. Deep learning with python. Berkeley, CA: Apress; 2018. [Google Scholar]
- 41.Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D. Going deeper with convolutions. In: In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE Computer Society; 2015. pp. 1–9. [Google Scholar]
- 42.Ketkar N. Deep Learning with Python. Berkeley, CA: Apress; 2017. [Google Scholar]
- 43.Fawcett T. An introduction to ROC analysis. Pattern Recognit Lett 2006; 27: 861–74. doi: 10.1016/j.patrec.2005.10.010 [DOI] [Google Scholar]
- 44.Chang J, Yu J, Han T, Chang HJ, Park E. A method for classifying medical images using transfer learning: a pilot study on histopathology of breast cancer. In: In2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom); 2017. pp. 1–4. [Google Scholar]
- 45.Coudray N, Ocampo PS, Sakellaropoulos T, Narula N, Snuderl M, Fenyö D, et al. Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. Nat Med 2018; 24: 1559–67. doi: 10.1038/s41591-018-0177-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Mednikov Y, Nehemia S, Zheng B, Benzaquen O, Lederman D. Transfer representation learning using Inception-V3 for the detection of masses in mammography. Annu Int Conf IEEE Eng Med Biol Soc 2018; 2018: 2587–90. doi: 10.1109/EMBC.2018.8512750 [DOI] [PubMed] [Google Scholar]
- 47.Kwon O, Yong T-H, Kang S-R, Kim J-E, Huh K-H, Heo M-S, et al. Automatic diagnosis for cysts and tumors of both jaws on panoramic radiographs using a deep convolution neural network. Dentomaxillofac Radiol 2020; 49: 20200185. doi: 10.1259/dmfr.20200185 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Ariji Y, Morita M, Katsumata A, Sugita Y, Naitoh M, Goto M, et al. Imaging features contributing to the diagnosis of ameloblastomas and keratocystic odontogenic tumours: logistic regression analysis. Dentomaxillofac Radiol 2011; 40: 133–40. doi: 10.1259/dmfr/24726112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.White SC, Pharoah MJ. The evolution and application of dental maxillofacial imaging modalities. Dent Clin North Am 2008; 52: 689–705. doi: 10.1016/j.cden.2008.05.006 [DOI] [PubMed] [Google Scholar]
- 50.Yilmaz E, Kayikcioglu T, Kayipmaz S. Computer-Aided diagnosis of periapical cyst and keratocystic odontogenic tumor on cone beam computed tomography. Comput Methods Programs Biomed 2017; 146: 91–100. doi: 10.1016/j.cmpb.2017.05.012 [DOI] [PubMed] [Google Scholar]
- 51.Okada K, Rysavy S, Flores A, Linguraru MG. Noninvasive differential diagnosis of dental periapical lesions in cone-beam CT scans. Med Phys 2015; 42: 1653–65. doi: 10.1118/1.4914418 [DOI] [PubMed] [Google Scholar]
- 52.Silva VKS, Vieira WA, Bernardino Ítalo M, Travençolo BAN, Bittencourt MAV, Blumenberg C, et al. Accuracy of computer-assisted image analysis in the diagnosis of maxillofacial radiolucent lesions: a systematic review and meta-analysis. Dentomaxillofacial Radiology 2020; 49: 20190204. doi: 10.1259/dmfr.20190204 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Abdolali F, Zoroofi RA, Otake Y, Sato Y. Automated classification of maxillofacial cysts in cone beam CT images using contourlet transformation and spherical harmonics. Comput Methods Programs Biomed 2017; 139: 197–207. doi: 10.1016/j.cmpb.2016.10.024 [DOI] [PubMed] [Google Scholar]
- 54.Lee J-H, Kim D-H, Jeong S-N. Diagnosis of cystic lesions using panoramic and cone beam computed tomographic images based on deep learning neural network. Oral Dis 2020; 26: 152–8. doi: 10.1111/odi.13223 [DOI] [PubMed] [Google Scholar]
- 55.Hiraiwa T, Ariji Y, Fukuda M, Kise Y, Nakata K, Katsumata A, et al. A deep-learning artificial intelligence system for assessment of root morphology of the mandibular first molar on panoramic radiography. Dentomaxillofac Radiol 2019; 48: 1–7. doi: 10.1259/dmfr.20180218 [DOI] [PMC free article] [PubMed] [Google Scholar]



