Abstract
Dental caries detection, in the past, has been a challenging task given the amount of information got from various radiographic images. Several methods have been introduced to improve the quality of images for faster caries detection. Deep learning has become the methodology of choice when it comes to analysis of medical images. This survey gives an in-depth look into the use of deep learning for object detection, segmentation, and classification. It further looks into literature on segmentation and detection methods of dental images through deep learning. From the literature studied, we found out that methods were grouped according to the type of dental caries (proximal, enamel), type of X-ray images used (extraoral, intraoral), and segmentation method (threshold-based, cluster-based, boundary-based, and region-based). From the works reviewed, the main focus has been found to be on threshold-based segmentation methods. Most of the reviewed papers have preferred the use of intraoral X-ray images over extraoral X-ray images to perform segmentation on dental images of already isolated parts of the teeth. This paper presents an in-depth analysis of recent research in deep learning for dental caries segmentation and detection. It involves discussing the methods and algorithms used in segmenting and detecting dental caries. It also discusses various existing models used and how they compare with each other in terms of system performance and evaluation. We also discuss the limitations of these methods, as well as future perspectives on how to improve their performance.
1. Introduction
For easier segmentation and detection of dental caries, prior knowledge for several tasks is needed. There is a need to understand the various sections of the tooth and the specific position of the lesion on the tooth. An understanding of the types of dental images to be used, for instance, panoramic or bitewing radiographs, is also needed. Furthermore, the specific regions or areas of interest which are required should be clear in order to be able to choose the suitable method for segmentation and detection of caries. All this information is required in order to achieve high performance segmentation and detection of dental caries.
1.1. Tooth Anatomy
The tooth is a small white structure found in the jawbone of animals and human beings. In human beings, the number of teeth ranges from 20 primary teeth in children to 28–32 permanent teeth in adults. Further, the tooth can be broken down into three main layers: the crown, neck, and root. The crown is the visible region above the gum line. The neck connects the crown to the root, while the root is the region inside the bone socket [1]. Additionally, each tooth consists of hard tissues that protect the soft tissues in the center, as can be seen in Figure 1.
Figure 1.

Cross section of a molar tooth [2].
1.2. Dental Caries
Dental caries is a tooth infection caused by bacteria. Diet that includes fructose, sucrose, and glucose accelerates the occurrence of dental caries. The acid released from the above process leads to demineralization of the tooth surface. Caries occurs when the rate of demineralization is less than the rate of decay. There are various categories of dental caries and these can be characterized [3]. The protocol characterizes dental caries based on its location and also the affected tooth [4]. The categories are as follows:
Class I: dental caries on occlusal surfaces of posterior teeth, for instance, molars and incisors.
Class II: occurs on proximal surfaces of posterior teeth.
Class III: occurs on interproximal surfaces of anterior teeth, with no incisor edge involvement.
Class IV: occurs on interproximal surfaces of anterior teeth with incisor edge involvement.
Class V: occurs on the lingual or cervical third of the facial surface of the tooth.
Class VI: occurs on the occlusal or incisor edge, worn away due to attrition.
From positional classification, caries can also be classified based on the severity of lesions on the tooth. This is done based on the amount of dentin and enamel that has been affected by the caries.
Incipient caries are caries that have a depth of less than half of the enamel of the tooth.
Moderate caries are caries that are more than halfway through the enamel but do not touch the dentin.
Advanced caries are caries that extend to the dentin region.
Severe caries are caries that extend more than halfway through the dentin and even reach the pulp.
Identification of caries under classes I, IV, and VI can be done during clinical inspection, since the regions are visible orally. The introduction of X-rays in the medical field has greatly improved diagnosis of various ailments. In the dental field, radiography has improved the visual inspection of patient's teeth. X-rays have enabled professionals to be able to view previously unobservable regions of caries that would have gone untreated.
1.3. Dental Radiographs: X-Rays
There are varying degrees of information needed depending on the form of treatment required to diagnose a certain ailment. An X-ray or radiograph is a digital film that represents unobservable information not visible by the naked eye. Figures 2–4 show some types of radiographs. The work in [5] explains three types of X-rays that are commonly used to diagnose dental health. There are two types of radiographs, intraoral and extraoral [6].
Intraoral radiographs: X-ray film captures radiographic images while inside the mouth. This type is subdivided further into bitewing radiographs, periapical radiographs, and occlusal radiographs.
Extraoral radiographs: X-ray film captures radiographic images while outside the mouth. This type is subdivided further into panoramic radiograph, computed tomography (CT), and sialography.
Figure 2.

Periapical radiograph.
Figure 3.

Bitewing radiograph.
Figure 4.

Panoramic radiograph.
2. Deep Learning Systems Application Areas
The introduction of deep learning systems has contributed to several application areas using medical images.
2.1. Digital Microscopy and Pathology
Deep learning methods have been very popular in these areas, especially with the growing availability of tissue specimen. In this domain, deep learning techniques developed have had their main focus on three broad challenges. These are segmentation of large organs, detecting, segmenting, and classifying nuclei, and also detecting and classifying region of interest of lesions. Other areas where deep learning techniques have contributed include normalizing histopathology and color normalization in image analysis. The work in [7] introduces a method for normalizing stains on histopathology images using autoencoders. Color normalization is further demonstrated by [8], where the use of convolutional neural networkss (CNN) for tissue classification in stained images was introduced. Brain. In this area, deep neural networks (DNN) have been used for brain image analysis in several domains, most notably the classification of Alzheimer's disease. Other domains include segmentation of brain tissues and anatomical structures, for instance, hippocampus. There are also the segmentation and detection of brain tumor, lesions, and microbleeds. There are other tasks that require more anatomical information like white matter lesion segmentation, and [9] tackled such scenarios. They lowered the sampling rate of nonuniformed sampled patches to cover a larger part of the region of interest.
Chest. Deep learning also has contributed to thoracic image analysis, on both tomography and radiology. Deep neural networks have addressed the detection, characterization, and classifying nodules from radiology tests conducted. Computed tomography (CT) scans have detected several diseases, including lung diseases, from a single image. Chest radiography is the most common radiology test and therefore uses a large set of images that are used to train systems. These systems can be a combination of convolution neural networks used for image analysis and recurrent neural networks used for text analysis.
Eye. Deep learning algorithms have also been introduced in the analysis of eye images and have seen CNNs being employed to address the segmentation of anatomical structures. These networks have also addressed the detection and segmentation of retinal diseases, diagnosis, and assessment of image quality. The work in [10] shows the performance of a Google Inception v3 network for diabetic retinopathy detection and compares the results with those of seven ophthalmologists.
Musculoskeletal. In this domain, deep learning algorithms are used for identification and segmentation of joints, bones, and soft tissues in images. The method used in [11] is one of the applications that trained their system with musculoskeletal disc images and had very good performance across some radiology scoring tasks.
Breast. Research on breast imaging has resulted in [12], which shows significant advances over the state of the art, achieving performance of human readers on the region of interest (ROI). The main task also is to detect breast cancer, and this consists of several subtasks. These include detection and classification of lesions, micro classification, and cancer risk scores of images. The availability of huge amounts of image data has made mammography easier to perform and the most common method used for breast radiography.
Abdomen. Research on the abdomen is localized on the segmentation of organs, mainly pancreas, kidneys, liver (tumors), and bladder. The main radiograph used for most organs is the CT radiograph, as well as MRI only for prostate analysis. The method used by [13] introduces a hybrid neural network used to extract features that will be used further for classification. Cardiac. Cardiac image analysis has embraced deep learning in segmentation, tracking, classification, and accessing image quality. The MRI is used for radiographic testing. Reference [14] introduced the use of neural networks to segment the left ventricle using the recurrent connection of U-network architecture. Reference [15] used neural networks to perform regression for identifying some cardiac sequence on its model.
3. Factors That Lead to Dental Caries
[The biology, prevention, diagnosis and treatment of dental caries Scientific advances in the United States] show factors affecting caries, namely, saliva, bacteria, diet, and hereditary.
Bacteria. Reference [16] shows how plaques form around the surface of the tooth and then come into contact with carbohydrates to form acids that dissolve the tooth structure. This concept formed the basic foundation of dental caries. There is no specific way of how bacteria affects the tooth; thus it is hard to control the dental caries disease. In the method [17], how dental caries occurs from bacteria associating with fermented carbohydrates is explained, thus being referred to as a diet-bacteria disease. Researchers have associated the caries process with fermented carbohydrates [18] and showed a relationship between sugar and caries leading to acids formed on the tooth's surface. Rather, [19] demonstrated how frequent sweet snacks consumption is related to dental caries.
Saliva. It plays a critical part in the well-being of soft and hard tissues of teeth inside the mouth [20]. When the saliva flow rate is low, this shows risk to dental caries. Further, [21] explained how saliva flow measurement is an important risk assessment and management measure for dental caries.
3.1. Prevention of Caries
Dental caries can be prevented through various ways:
Sealants: this can be done as [22] shows how sealants are introduced over the specific carious region to stop the decay process on the tooth. This prevents food particles from collecting in the tooth pits, thus preventing caries.
Remineralization: this is done by hardening the tooth's enamel as explained by [23] to prevent dental caries from even taking place.
Fluoride: this is found from excessive fluorosis in drinking water; in [24], a practicing dentist, associated with the enamel [3], showed its effects on the enamel of the tooth. Research from [25] gave an optimal level of fluoride in water to prevent dental caries.
Risk assessment: this entails the prior determination of someone's developing dental caries during a specific period of time as [26] explained to enable easier management. Assessment is very useful in determining whether extra diagnostic measures are required. The work in [27] explained how assessment assesses effectiveness of previous caries control measures and acts as a guide to treatment planning in the future. Further research has seen that caries can be prevented by saliva when fluoride application is combined with regular removal of forming dental plaques, realized by brushing teeth. The research in [28] explained how saliva and small amounts of fluoride contributed to the hardening of the enamel and thus low risk of dental caries.
3.2. Diagnosis of Caries
Diagnosis of dental caries is done by the following:
Clinical method: this is described by [29] and is explicitly defined as the visual detection of dental caries from an oral examination. They used separators to visualize areas of interest and also used dental floss applied on these areas to detect roughness of the surfaces.
Radiographic method: the radiographs also known as X-rays are digital images and [30] introduced them in dentistry. The work in [31] further intensified the use of dental radiographs in their research to detect caries. The work in [32] generated images of higher diagnostic quality compared to those found from conventional films to detect caries in bitewing X-rays.
3.3. Treatment of Caries
Focus has shifted from surgical ways to the management of caries via restoration of tooth structure development or implants. The research work in [33] emphasized the prevention of the disease, remineralization steps, and minimizing the access to caries affected regions to avoid further decay.
4. Image Representation for Dental Segmentation and Detection
The representation of dental images is done through the segmentation of various regions of interest from the larger image to locate objects. Segmentation of images is, therefore, partitioning of an image into several segments to be used to identify objects and their edges. Image segmentation can be categorized according to similarity and discontinuity properties. Discontinuity based methods are referred to as boundary-based methods, while similarity-based methods are referred to as region-based methods [34].
Therefore, the segmentation process is based on dividing an image into groups of similar characteristics and features. Mathematically, segmentation of an image R is a finite set of regions R1 … Rs. R=∪i=1RiRi∩Rji ≠ j.
4.1. Categories of Dental Images Segmentation
Research done by [35, 36] further categorizes segmentation methods according to various characteristics such as region, entropy, shape, threshold, and pixels correlation among others. These characteristics were from thermal, X-ray images to aid analysis of specific points or regions of interest. Research studies show that dental image segmentation is classified as region-based, cluster-based, threshold-based, boundary-based, and watershed-based methods.
Region-based: it divides an image into several regions based on discontinuities of pixel intensity. Reference [37] explained how segmentation of dental panoramic X-ray images assists dentists to detect osteoporosis disease. The work in [38] also used the region-based approach in segmentation of bitewing dental X-ray images.
Threshold-based: this is done by choosing a threshold value from pixel intensities of an image. Then, pixels that exceed the threshold value are placed into a region, while those that are below the threshold value are placed in an adjacent region. Threshold-based segmentation is common in most of the reviewed articles [2, 6, 39–48].
Cluster-based: it is the automatic grouping of image data based on certain degrees of similarity between the data. The degree of similarity depends on the problem being solved. The algorithm used to perform clustering of data uses the automatically detected groups as initial parameters. Research by [49] performed jaw lesion segmentation using the fuzzy C-means method. The work in [50] used a semifuzzy supervised clustering algorithm to segment dental radiographs.
Boundary-based: it is used to find edge or point discontinuities on images. It detects color or pixel intensity discontinuities in the gray levels of the image. Active contours are used by [51–53], as one of the approaches to segment images based on their boundaries. The approach performs segmentation by outlining an object from an image and is also referred to as the snake method. Level set method (LSM) is another approach for detecting boundaries in an image. It handles segmentation by performing geometric operations to detect contours with topology changes. Examples of works on boundary-based segmentation are [54, 55], which used LSM to segment radiographic images.
Watershed-based: it is performed on a grayscale image and used mathematical morphology to segment adjacent regions in an image; watershed-based segmentation was used by [56] on bitewing dental radiographs. It was also used by [57] as a combination of the K-means clustering and the watershed method for color-based segmentation.
4.2. Diagrammatic Representation of Dental Image Segmentation
This section aims at giving pictorial representations of various segmentation methods, as can be viewed below in Figure 5. The work in [38] proposed a novel way of finding region of interests in both the gap valley and tooth isolation using edge intensity curves. It used the region growing approach [58] to detect the region of interest. It further used canny edge detection algorithm [59] to detect the edges of the isolated teeth.
Figure 5.

Left: region growing approach; central: canny edge detected image; right: tooth isolation and gap valley using binary intensity integral curves [38].
A dental classification method by [60] proposed a classification of periapical images via the fuzzy value, which takes care of dental orientation problems such as missing teeth. They also used the dental universal numbering system to categorize teeth into incisors, canine, molar, and premolar as shown in Figure 6.
Figure 6.

Dental universal numbering system [60].
Segmentation of the images is done through multiscale aggregation [61], which deals with pixel distortions in image data. Integral projection is used to detect horizontal individual teeth, and this is further classified according to one of the four incisor, molar, premolar, and canine categories. Figure 7 shows images of the above-mentioned method.
Figure 7.

(a) Horizontal integral projection of lower tooth; (b) classification result [61].
Additionally, [62] proposed the use of faster regions of a convolution neural network to detect and number periapical dental images. A filtering algorithm is used to delete overlapping boxes detected by the faster convolution network. A neural network is introduced to detect missing teeth. A rule-based teeth numbering system is used to match labels of detected teeth boxes to modify results that violate set rules. The work in [63] explained how they achieved instance segmentation through the use of a mask region-based convolution neural network. This system is an extension of [64], which includes a section of convolution networks to achieve the task of instance segmentation (Figure 8).
Figure 8.

Training process of the segmentation system [63].
Features are extracted from ResNet101, which then compose the feature pyramid network (FPN). The FPN defines anchors and extracts regions of interest (ROI). The region proposal network is formed from a combination of anchors and the feature pyramid network (FPN) (Figure 8). Finally, regions of interest are aligned to the same size and, further, each fixed size is classified as a tooth or a background (class scores). The fixed size features are localized by regression of bounding box coordinates. Finally, pixels are segmented by the full convolution network (FCN) [23] in each detected mask as seen in Figure 9.
Figure 9.

Process of separating the teeth.
Table 1 clearly shows related works grouped by various categories and methods. A diagnostic system proposed by [66] consists of Laplacian filtering, window-based adaptive threshold, morphological operations, statistical feature extraction, and back-propagation neural network. The back-propagation neural network is used to classify a tooth surface as normal or having dental caries. A study by [67] analyzes feature extraction performance of dental caries image using Gray Level Cooccurrence Matrix (GLCM) algorithm for contrasted two types of caries based on the theory of GV Black, namely, dental caries Class 3 and Class 4.
Table 1.
Related works grouped by various categories and methods.
A CNN model using a U-shaped deep CNN (U-Net) proposed by [68] was for dental caries detection on bitewing radiographs, and it was further investigated whether the model can improve clinicians' performance.
5. Feature Extraction Techniques for Dental Images
Feature extraction on an image is done by the use of various methods depending on its texture, pixels, and color intensity. The method by [69] proposed evaluating the performance of various texture feature maps to recognize demineralization of caries. It used intraoral image analysis that includes run-length matrices (RLM), first-order features (FOF), cooccurrence matrices, gray tone matrices, local binary patterns, and K-means clustering to transform images of confirmed caries cases. The performance from the different feature maps was compared to that of radiographic images by several radiologists. Feature maps are a product of extraction of features from an image by various methods and techniques that include the following.
First-order features (FOF): this is a formalised description of the detection of probability of a particular intensity within data in an image, and this shape is used to determine image parameters such as contrast, sharpness, and other objects. For instance, for an image I with spatial resolution W∗H and range intensity G, its histogram is defined as
| (1) |
Several equations can be derived from the above equation, namely, mean, variance, and entropy, and can be represented as
| (2) |
| (3) |
| (4) |
Run-length matrix is defined by changes in the illumination of pixel values on an image [70], to explain the coarse texture of pixels can be expressed by a larger section of similar color encoded as a matrix. Other works that have embraced this method include [71–73].
Gray level cooccurrence matrix (GLCM): this, according to [74], shows texture relations between adjacent pixels in the image texture. The matrix is calculated from entries that represent probabilities of coexisting gray tone pixels that are next to each other, and the distance between the pixels is a feature parameter. Other features that can be extracted by this matrix include contrast, entropy, variance, some average, and homogeneity. Related works include [75, 76].
Gray tone difference matrix (GTDM): this matrix, according to [77], is described as gray tone textural properties, which is the difference in pixel intensity level I and illumination I in a K∗K neighborhood described as
| (5) |
| (6) |
Another method proposed by [78] combines GTDM, LBP, and K-means clustering for feature extraction.
Laws' texture energy measures: according to [79], a method that utilised set masks to calculate local energy of an image was proposed. These masks further detect textural characteristics such as edges, spots, levels, and ripples. Other works that use this method include [80, 81].
Local binary patterns (LBP): it detects small structures such as edges, lines, and spots [82], on the skin, thus representing them as binary patterns. For pixel inputs (xc, yc), a neighborhood of radius R and several evenly sampled points on radius P are specified. Local binary pattern function is given by
| (7) |
Here, s[.] operator returns 1 for positive values and 0 for negative values.
Figure 10 shows a 3∗3 local binary operation concatenating all 8 bits to output a binary number, which is converted to a decimal number. This decimal number is the LBP code and is assigned to the center pixel. Other works that use LBP include [83] and [84].
Figure 10.

LBP operation given P = 1 and R = 1 [82].
Clustering: this used K-means method to cluster pixels. The intensity of pixels is taken into account; also prior information about the number of clusters should be defined. Euclidean distance is the metric used when adding individual pixels to the nearest cluster. A cluster value is set to the average value of all pixel intensities, and this procedure is iterated until a specified threshold is met [85]. Related works include [86–88]. Figure 11 shows the feature maps of the extraction techniques discussed.
Figure 11.

Intraoral radiographs texture feature maps [69].
According to [89], a textural feature system for diagnosis of dental caries in radiographs was used. This system introduced other feature extraction techniques such as the following.
Gabor filters: they were originally introduced by Gabor, D. (1946) and extracted edge-like components with very high frequency in a local region of an image. They are described as the best texture descriptor due to their use in segmentation, object recognition, tracking of motion, and image registration. Spatial domain Gabor is defined as
| (8) |
Here σx and σy are standard deviations for (x, y) axis distribution, and the sinusoidal frequency is denoted by W. Other related studies on Gabor filters include [90–92].
Local ternary patterns: this, from [93], shows how LBP is extended to a three valued code from two values. The ternary code is got from comparing neighboring pixel values with a set threshold value τ. Values that are within the threshold value are set to 0, those above it are set to +1, and those below the threshold value are set to −1.
The threshold function value is defined as
| (9) |
where τ is the set threshold, xc is the value of the central pixel, and xi are the neighbouring pixels of xc. Diagrammatically, the process can be displayed by a Figure 12 LTP code for a 3 × 3 matrix image region. Other related works include [94–96].
Figure 12.

(a–d) LTP code with τ ≥ +15, τ ≤ −15 and both positive and negative LBP codes.
Morphological gradient: the method introduced by [97] is used to increase the intensity of boundary edges of an image. This method makes it easier to observe edge boundaries and other objects clearly [98] and identify dental caries on teeth. Other works that involve the mMG method include [99–101]. Multiple morphological gradient consists of several encryption elements that aid in processing of images and include gradient, multiple values, and threshold.
Morphology gradient: it increases the intensity of the edges of objects in the image, and this is done by dilation subtracted by erosion morphology: (A ⊕ B) − (A⊖B).
Multiplier value: it is a constant value to enable the increase in pixel value on the image. This is defined as
| (10) |
Here w is the bit depth.
Threshold: this is done by separating pixel values into two classes, 0 and 1, which are dependent on a constant threshold value q. According to [99], thresholding is defined as
| (11) |
Here 0 < q ≤ Amax.
Table 2 shows various works grouped by feature extraction method.
Table 2.
Works grouped by feature extraction method.
A novel deep convolution layer network (CNN) with a Long Short-Term Memory (LSTM) model was proposed by [103] for the detection and diagnosis of dental caries on periapical dental images.
6. Deep Learning Methods and Algorithms
Research by [104] explains how machine learning algorithms are divided into unsupervised and supervised learning algorithms.
6.1. Learning Algorithms
Supervised learning: the model is represented with a dataset D={x, y}n=1N of input features x and y. They can take different forms depending on the learning task, for instance, with classification y will be a scalar representing a class label. With regression, it will take a vector of continuous variables. In dealing with a segmentation model, y can be a multidimensional label image; basically supervised training finds model parameters θ that best predict data given a loss function . This describes the output of the model obtained by feeding x to f(x; ⊖) that represents the model.
Unsupervised learning: there are algorithms that are trained to find patterns, such as data without labels, for instance, clustering and principal component analysis methods. These algorithms can be performed on models with different loss functions.
6.2. Neural Networks
These are networks that contribute to deep learning systems. Neural networks consist of neurons with some activation a and parameters θ={W, β} , where W is weight and β is bias. Activation preselects the linear combination of input x to a neutron and its parameters, followed by element on element (.) for nonlinearity, referred to as the transfer function: a=σ(wT+b). Transfer functions for traditional neural networks are sigmoid and the hyperbolic tangent function. Multilayered neural networks known as perceptrons consist of several layers of these transformations:
| (12) |
Here Wn is a matrix comprising wk rows with k as activation on the output. n is the number of current layers, and L is the final layer. Hidden layers are layers in between the input layer and the output layer. When a network contains multiple hidden layers, it is referred to as a deep neural network.
Activations done to the final layer which is the output layer of the network are mapped to a distribution over the number of classes P(y|x; θ) through a softmax function:
| (13) |
Here WiL is the weight of the output node associated with class i.
Stochastic gradient descent is the method used to fit parameters to dataset D. A small subset of the dataset is used to optimize maximum likelihood and this in return minimizes negative log-likelihood as
| (14) |
This further leads to binary cross entropy loss for two class problems and categorical cross entropy for multiclass tasks. Deep neural networks gained popularity in 2006 and, in the method in [105], it was explained how DNNs were pretrained layer by layer and their stacked network fine-tuned to produce good evaluation performance. Currently, the most used deep learning popular networks are the convolution and the recurrent networks.
6.2.1. Convolution Neural Networks
These networks have weights that are shared in a manner that the network performs convolutions on images. This means that there is no redundancy in the way the model learns separate detectors for the same object that occurs at different position on an image. It reduces the number of parameters to be used for learning. At each layer of the network, the image is convolved with a set of K kernels W=(W1 … WK) , with biases β=(b1 … bK) and each generating Xk feature map.
The features are subjected to a nonlinear transform σ(.) and the process iterated for every convolution layer l given by
| (15) |
The convolution neural network also has pooling layers that aggregate pixels and their neighbors using an invariant function, which is the maximum operation. Aggregation is done to subsequent convolution layers and, at the end of the stream, a regular network layer is added where weights are not shared. The network is trained by feeding activations to the output layer through a softmax function. Under the CNNs, there exist common architectures that are widely used in the analysis of medical images and include the following.
General classification architectures: they include networks such AlexNet [106], which, unlike its precursor LeNet [107], consists of five convolution kernel layers employed in the input and output. AlexNet incorporated rectified linear units (ReLU) as their activation function, and this has become the most common choice in CNNs. Further, there has been an interest in using smaller kernels instead of single layers of kernels with a large receptive field; this in return has less number of parameters. Research by [108] discussed the use of deeper neural networks that have small fixed size kernels in each layer of the 19-layer model, referred to as VGG19 network model. More complex building blocks have been introduced to deep networks, to improve efficiency in the training process, and also reduce the amount of parameters used. There is also [109], where GoogLeNet is a 22-layered network that uses inception blocks with a set of convolutions of different sizes. The ResNet architecture was introduced and consists of ResNet blocks which, instead of learning a function, learn mappings in each layer that is close to the identity function [64]. There has been an increase in the use of these deep architectures due to their low memory use, and this even contributed to a recent version of GoogLeNet referred to as Inception v3 [110].
Multistream architectures: they are networks that accommodate multiple inputs in form of channels towards the input layer and then are later merged at any point in the network. Image processing can be done by the use of multiscale image analysis and classification for brain lesions [111]. Multistream architecture [112] is used for segmentation of natural images. Challenges of deep learning systems in the medical imaging domain are in adapting existing architectures, for instance, with different input formats (2D or 3D) data. Volumetric data can be divided into slices and fed as different streams to a network to avoid a result of large amounts of parameters. These techniques can still be used to perform knee cartilage segmentation [113]. The multistream architectures can be used for classification in the context of medical imaging [114].
Segmentation architectures: they are specific to the task of segmentation of medical images. CNNs are used to classify individual pixels with those in their neighborhood in an image. To avoid redundancy in the classification of the pixels, fully connected layers are rewritten as convolutions and this helps the CNN to take in input images larger than what it was trained on and produce a likelihood map. The resultant fully CNN (fCNN) can then be applied to an entire volume of images. This further leads to low resolution of output compared to the input images due to pooling of layers. There is a technique that applies the FCNN to shifted versions of the input, followed by stitching the result together to obtain a full resolution of the final output [115]. The FCNN was improved by proposing the U-net architecture that has convolutions in its downsampling and later an upsampling task to increase the image size [116]. Another method added skip connections to U-net architecture to connect downsampling and upsampling of the convolution layers [115]. A similar approach was used by [117], for 3D data, and [118] incorporated residual blocks and Dice loss layer to the U-net architecture instead of the commonly used cross entropy.
6.2.2. Recurrent Neural Networks
These networks were developed for discrete sequence analysis and have varied lengths for both inputs and outputs, thus making them suitable for tasks such as machine translation. In a classification task, the model learns distribution over classes P(y|x1 … xT; θ) given sequence x1 … xT as input.
The RNN has a hidden state h at time t which is the output of a nonlinear mapping from xt and previous state ht−1: ht=σ(Ext+Rht−1+b). Here W, R are weight matrices that are shared over time. For a classification task, several fully connected layers are used and followed by a softmax to map the sequence over the classes as
| (16) |
RNNs have problems of memory shortage similar to those of other deep neural networks. Several techniques have been developed, such as the Long Short-Term Memory (LSTM) cell [119], to deal with problem. A simplification of the LSTM is the gated recurrent unit (GRU) [120]. RNNs are increasingly being adopted with promising results, for instance, [121], in the human brain challenge.
6.3. Unsupervised Models
Some unsupervised models reviewed include autoencoders, restricted Boltzmann machines, and deep belief networks.
6.3.1. Autoencoders
These are networks trained to reconstruct input x on output via a hidden layer h. They have a weight matrix Wx,h and bias bx,h from input to hidden state and Wh,x with corresponding bias bh,x from hidden state to reconstruction. The hidden activation is computed as
| (17) |
Here, h is smaller than x to prevent the model from learning the identity function. According to [122], another solution was introduced to prevent the model from learning the identity function. The model uses a denoising autoencoder that trains the model to reconstruct input from noise. Deep autoencoders are realized by placing autoencoder layers on top of each other, and in most cases the layers were trained individually. Examples of autoencoders include the following.
Variational autoencoders: with these, [123] introduced the use of conditional variational autoencoders for pathology detection in medical images.
6.3.2. Adverserial Networks
These are used for image generation tasks and include works from [124, 125].
6.3.3. Deep Belief Networks
These are a type of Markov random field (MRF) which constitute an input layer x=(x1 … xN) and a hidden layer h=(h1 … hM) having a latent feature representation. Its connections are bidirectional and thus a generative model that can be sampled for new data points. An energy function can be defined for a state (x, h) of input and hidden layers as
| (18) |
Here c and b are the bias terms. Further, the probability of the system is given by
| (19) |
This makes computing the partition function Z intractable, while conditioned inference in computing h conditioned on x is tractable and is given by
| (20) |
The use of DBNs can be to fuse medical images [48]. DBNs also are used to extract high level features from medical images and effectively classify them [126].
6.4. Software and Hardware
These are the processors and software programs used to handle the running of various deep learning techniques. They include GPU computing libraries such as CUDA and OpenCL, these being very fast processor units compared to the previously used CPUs. These GPUs work hand in hand with the available open-source software programs that provide a platform to implement various operations of neural networks such as convolutions. Most popular packages include Caffe [127], TensorFlow [128], Theano [129], and Torch [130] that provide interfaces for implementing various operations in deep learning. They are also third-party packages written on top of frameworks like Keras [131] and GitHub [132]. Figure 13 shows a representation of deep learning architectures.
Figure 13.

Representation of deep learning architectures. (a) Autoencoder. (b) Deep belief networks. (c) Recurrent neural network. (d) Convolutional neural network. (e) Multistream convolutional neural network. (f) U-net architecture [104].
7. Deep Learning Uses in Medical Images
These systems can be used in various tasks like segmentation, classification, detection, and registration.
7.1. Classification
In this approach, one has a single input or multiple inputs with a single variable as output. For instance, in a disease classification setup, one has the disease or not and diagnosis of the disease is based on a sample of the dataset. Transfer learning is therefore realized and is defined as the use of pretrained networks to cover very large datasets for deep networks training. Transfer learning can be used to fine-tune a pretrained network on medical images and also for feature extraction on image data. These processes are beneficial in saving time used to train deep networks and enable extracted features to be analyzed faster. Object classification is usually done on a small part of the image and can be divided into two or more classes for analysis. For instance, [133] used three CNNs, each taking a nodule patch as input and each feature output is concatenated to form a final feature vector. Multistream CNNs are used to classify skin lesions, with each stream working on different resolutions of the image [134].
7.2. Detection
Detection of objects and their respective regions of interest in images is an important part of diagnosis by medical practitioners. The task consists of localization and identification of small regions in a full image. These systems are designed to automatically detect regions and decrease the reading time of human experts. An example of detection is [98], which explained the use of edge detection method to detect dental caries on dental images.
7.3. Segmentation
Segmentation of various organs allows deep analysis of several parameters related to volume and shape, for example, in skin or breast cancer analysis. It is always the first step before the detection process and is defined as the identification of the pixels which make up the interior or exterior contour of the object of interest. There have been a wide variety of methods developed to segment images using deep learning medical images. Segmentation is needed to perform accurate segmentation on 3D CNNs using multistream networks with different scales [135].
7.4. Registration
Registration is a common image analysis task in which coordinate transforms are calculated from one image to another. This is done iteratively assuming parametric transformations and a predetermined metric is optimized. Deep networks can benefit from registration by estimating a similarity measure for two images to drive an iterative optimization, for instance, [136]. They can also be used to predict transformation parameters using regression, for example, [137].
8. Databases Used by Dental Images
There is a need for efforts to build or come up with a public dataset to aid in developing of algorithms to be used in the dental imaging area. In order for this to come to pass, researchers need to release data used in their papers, and this will lead to a repository that can reliably catalogue and archive publicly dental imaging data. Several examples of datasets exist and include the digital database for screening mammography (DDSM) [138], which is used for mammogram image analysis and aids in screening of breast cancer. There exist several databases that are used in dental imaging and these include the following:
PASCAL VOC 2007: this is from the results from the pascal visual object classes challenge of 2007 [139].
Caltech101: this is a dataset of images to facilitate computer vision and its techniques [140]. This dataset is also applicable in image recognition and classification and contains a total of 9,146 images split into 101 distinct object categories.
NORB: it is used for experiments in 3D object recognition shapes [141]. The dataset contains 50 toys belonging to five generic categories that include aeroplanes, cars, four legged animals, and trucks.
CIFAR-10/100: this consists of 60000 32 × 32 color images in 10 classes, with 50000 training images and 10000 test images [142].
MNIST: it is an acronym for modified national institute of standards and technology, and it is a dataset of handwritten digits commonly used for training and testing data in the machine learning field. It contains 60000 training images and 10000 testing images [143].
LabelMe: it is a dataset and web based tool used for image annotation [144] and was created by MIT computer science intelligence laboratory (CSAIL). It is applicable in computer vision and is very dynamic, free, and open to public contribution. It contains 187240 images, 62197 annotated images, and 658992 labelled objects.
ImageNet: it is a large dataset used for visual object recognition and has more than 20000 categories with each category containing several hundred images [145].
Summary: both MNIST and CIFAR-10/100 datasets are available for the public, while the other datasets can be accessed by directly contacting the researchers. Dental datasets are difficult to find and thus researchers prefer using available public datasets. ImageNet is one of the datasets that is used for evaluations by several dental imaging models like ResNets and VGGNets.
9. Evaluation Protocols for Dental Images
Deep learning techniques are used on problems having very large datasets with thousands of instances, and therefore they need a way to estimate the performance of a given data configuration and use this for comparison with performances of other configurations. One of the ways is splitting data [146], since very large datasets require long training times.
9.1. Splitting Data
The data is split into training and testing data splits; for instance, Keras library for deep learning provides two ways of handling the splitting of data. It can split your data into a validation set and evaluate the performance of your model on that validation set. This is done by setting the validation split argument on the fit function to a certain percentage of your training dataset such as 30% for validation.
9.2. Manual K-Fold Cross Validation
This is used as the standard evaluation method for machine leaning techniques. This method splits the dataset into k-subsets and trains the model on all the subsets one after the other except one subset that is left out as the validation set. Evaluation is done on the left-out subset, which is the validation set, and the performance is averaged across all models created. The cross-validation method is applicable on small deep learning models and is used with 5 or 10 folds. We also have the method in [147] as the training-testing split method.
9.3. Training-Testing Split
Data is split into two parts, training set and testing set. A model is then fitted to the training set, and then the fit model is used to make predictions on the testing set. This is further used to evaluate the skill of the predictions and thus referred to as the training-testing split. Training-testing split is used as an estimate of how well the model performs on a dataset, especially when presented with new data. This method is preferred, especially with very large data and slow model to train. This is because the skill score for the model is noisy due to randomness of data. The randomness of data makes the model flexible but makes it less stable; for instance, you get different results from training the same model. This can be controlled by introducing a random seed and repeating experiments multiple times. The use of random seed is basically just using the same randomness every time the model is being fit and evaluated. Get an average of the estimated model skill after running the experiments multiple times. We also have [148] that also gives a description of the confusion matrix.
9.4. Confusion Matrix
It is an N∗N matrix with N number of classes being predicted and is used mostly with class output models. The matrix consists of several metrics that include the following:
Accuracy: is the total number of correct predictions.
Precision or positive predictive value: is a proportion of positive correct predictions.
Negative predictive value: is a proportion of negative correct predictions.
Recall or sensitivity: is a proportion of actual positive cases correctly identified.
Specificity: is a proportion of actual negative cases correctly identified.
9.5. F1-Score
This is defined as the harmonic mean of the precision and recall values for a classification problem task.
The F1-score is given by
| (21) |
Harmonic mean is preferred over the arithmetic mean because it takes care of extreme values, for instance, those in a binary classification model. The F1-score can also be adjusted to increase effectiveness by adding β as an adjustable parameter to get
| (22) |
10. Conclusion
From this survey, various techniques, methods, and approaches have been discussed concerning the segmentation and detection of dental images. Works that stem from the industry and academia have been mentioned and discussed, which include existing algorithms, segmentation and detection methods, databases, and various protocols for evaluating performance.
There is a huge potential for use of dental radiography and especially work focused on detection of dental images. Most of the existing systems dwell much on dental segmentation and not on feature extraction (detection). There is a need to improve existing dental detection systems, and one way to do so is by the introduction of automatic blob detection technique. Blob detection has been used in other fields of medical imaging but has not seen substantive use in the field of dental imaging. The use of such image analysis techniques to determine the presence of caries aims to create a system that takes a human diagnostic approach, whereby dental caries are diagnosed based on visual interpretation of teeth.
11. Future Work
There are issues and encouraging future perspectives of study which have popped out from our discussions here and they are highlighted as follows:
Data availability and reliability: Deep learning networks require large amount of data to be able to achieve meaningful and effective performance results. Due to the nature of dental images, there is a need for hybrid datasets to aid good performance of the networks. There is a need for publicly available datasets for dental images to make deep learning in the field possible.
Data standardization: Many methods discussed here are handling the preprocessing step through manual methods, such as cropping the region of interest on an image. These methods contribute to the loss of some key details from the images. Some networks end up dividing a whole image into subregions, and this slows down the learning process that occurs one subregion after the other. There are methods like downsampling which might lead to deletion of important details and this seems to have been due to limitations in computational power. Deep learning approaches have seen increased learning on whole images rather than the need for manual manipulation of images at the preprocessing stage in order to get more general and accurate results.
Weight regularization methods: Deep learning networks can also be improved by introducing weight regularization to improve their performance. The regularization of weights involves optimization of model hyperparameters such as the learning rate and the dropout rate. Optimization of models at several key processes, such as the use of dropout at the segmentation stage, could improve the final result, and future work will look at how to implement such optimizations. Basically, weight regularization methods are introduced into networks for parameter optimization. Furthermore, introduction of early stopping regularization technique will also help in reducing overfitting, which is always a major problem with existing models.
Hybrid approaches: Deep neural networks can also be achieved by combining several models or methods to form hybrid networks that will improve overall evaluation performance. The combination can be in any stage of the model, for instance, combining two preprocessing techniques to come with a single one to enhance image quality. This combination can also be handled by joining various attributes of different models to form one hybrid model that will enhance training, extraction, detection, and classification of objects. Combination of different convolution networks to form one hybrid network will be a good area to explore. This will save the long training and testing times that come with large networks having many convolution networks.
Oral healthcare follow-ups from medical practitioners are very important for risk assessment and management of dental caries [149–153].
Data Availability
Deep learning networks require large amount of data to be able to achieve meaningful and effective performance results. Due to the nature of dental images, there is a need for hybrid datasets to aid good performance of the networks. There is a need for publicly available datasets for dental images to make deep learning in the field possible
Conflicts of Interest
The authors declare that they have no conflicts of interest.
References
- 1.Waugh A., Grant A. Ross and Wilson Anatomy and Physiology in Health and Illness E-Book . Amsterdam, Netherlands: Elsevier Health Sciences; 2014. [Google Scholar]
- 2.Wang L., Shi F., Li G., et al. Segmentation of neonatal brain MR images using patch-driven level sets. NeuroImage . 2014;84:141–158. doi: 10.1016/j.neuroimage.2013.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Black G. V. Vol. 2. Chicago, IL, USA: Medico-Dental Publishing Company; 1955. Operative dentistry. [Google Scholar]
- 4.Osterloh D., Viriri S. Unsupervised Caries Detection in Non-standardized Bitewing Dental X-Rays. Advances in Visual Computing ; Proceedings of the International Symposium on Visual Computing; 2016; Berlin, Germany. pp. 649–658. [DOI] [Google Scholar]
- 5.Newman M. G., Takei H., Klokkevold P. R., Carranza F. A. Carranza’s Clinical Periodontology . Amsterdam, Netherlands: Elsevier health sciences; 2011. [Google Scholar]
- 6.Tikhe S. V., Naik A. M., Bhide S. D., Saravanan T., Kaliyamurthie K. P. February. Algorithm to identify enamel caries and interproximal caries using dental digital radiographs. Procedings of the 2016 IEEE 6th International Conference on Advanced Computing (IACC); 2016; Bhimavaram, India. IEEE; pp. 225–228. [Google Scholar]
- 7.Janowczyk A., Basavanhally A., Madabhushi A. Stain normalization using sparse autoencoders (StaNoSA): application to digital pathology. Computerized Medical Imaging and Graphics . 2017;57:50–61. doi: 10.1016/j.compmedimag.2016.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Vahadane A., Peng T., Sethi A., et al. Structure-preserving color normalization and sparse stain separation for histological images. IEEE Transactions on Medical Imaging . 2016;35(8):1962–1971. doi: 10.1109/tmi.2016.2529665. [DOI] [PubMed] [Google Scholar]
- 9.Ghafoorian M., Karssemeijer N., Heskes T., et al. April. Non-uniform patch sampling with deep convolutional neural networks for white matter hyperintensity segmentation. Proceedings of the 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI); June 2016; Prague, Czech Republic. IEEE; pp. 1414–1417. [Google Scholar]
- 10.Gulshan V., Peng L., Coram M., et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA . 2016;316(22):2402–2410. doi: 10.1001/jama.2016.17216. [DOI] [PubMed] [Google Scholar]
- 11.Mohamed J. Mechanisms of diabetes-induced liver damage: the role of oxidative stress and inflammation. Sultan Qaboos University Medical Journal . 2016;16(2):e132–141. doi: 10.18295/squmj.2016.16.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kooi T., Gubern-Merida A., Mordang J.-J., et al. A Comparison Between a Deep Convolutional Neural Network and Radiologists for Classifying Regions of Interest in Mammography. Breast Imaging . 2016;23:51–56. doi: 10.1007/978-3-319-41546-8_7. [DOI] [Google Scholar]
- 13.Dou Q., Yu L., Chen H., et al. 3D deeply supervised network for automated segmentation of volumetric medical images. Medical Image Analysis . 2017;41:40–54. doi: 10.1016/j.media.2017.05.001. [DOI] [PubMed] [Google Scholar]
- 14.Poudel R. P., Lamata P., Montana G. Reconstruction, Segmentation, and Analysis of Medical Images . Berlin, Germany: Springer; 2016. Recurrent fully convolutional neural networks for multi-slice MRI cardiac segmentation; pp. 83–94. [Google Scholar]
- 15.Kong B., Zhan Y., Shin M., Denny T., Zhang S. Medical Image Computing and Computer-Assisted Intervention, Berlin, Germany: Springer; 2016. Recognizing End-Diastole and End-Systole Frames via Deep Temporal Regression Network. [DOI] [Google Scholar]
- 16.Miller W. D. The presence of bacterial plaques on the surface of teeth and their significance. Dent Cosmos . 1902;44:425–446. [Google Scholar]
- 17.Bowen W. H., Birkhed D., Granath L., McHugh W. D. Dental caries: dietary and microbiology factors. systemized prevention of oral disease: theory and practice. 1986.
- 18.Stephan R. M. Intra-oral hydrogen-ion concentrations associated with dental caries activity. Journal of Dental Research . 1944;23(4):257–266. doi: 10.1177/00220345440230040401. [DOI] [Google Scholar]
- 19.Weiss R. L., Trithart A. H. Between-meal eating habits and dental caries experience in preschool children. American Journal of Public Health and the Nations Health . 1960;50(8):1097–1104. doi: 10.2105/ajph.50.8.1097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Mandel ID, Wotman S. The salivary secretions in health and disease. Oral Sciences Reviews . 1976;8:25–47. [PubMed] [Google Scholar]
- 21.Navazesh M. How can oral health care providers determine if patients have dry mouth? The Journal of the American Dental Association . 2003;134(5):613–618. doi: 10.14219/jada.archive.2003.0229. [DOI] [PubMed] [Google Scholar]
- 22.Griffin S. O., Oong E., Kohn W., et al. The effectiveness of sealants in managing caries lesions. Journal of Dental Research . 2008;87(2):169–174. doi: 10.1177/154405910808700211. [DOI] [PubMed] [Google Scholar]
- 23.Hazirbas C., Ma L., Domokos C., Cremers D. Asian Conference on Computer Vision . Berlin, Germany: Springer; 2016. November. fusenet: incorporating depth into semantic segmentation via fusion-based cnn architecture; pp. 213–228. [Google Scholar]
- 24.McKay F. S. The relation of mottled enamel to caries read before the Chicago dental society, April 17, 1928. The Journal of the American Dental Association (1922) . 1928;15(8):1429–1437. doi: 10.14219/jada.archive.1928.0269. [DOI] [Google Scholar]
- 25.Dean H. T., Jay P., Knutson J. W. Studies on Mass Control of Dental Caries through Fluoridation of the Public Water Supply. Public Health Reports (1896–1970) . 1950;65(43):1403–1408. doi: 10.2307/4587515. [DOI] [PubMed] [Google Scholar]
- 26.Reich E., Lussi A., Newbrun E. Caries-risk assessment. International Dental Journal . 1999;49(1):15–26. doi: 10.1111/j.1875-595x.1999.tb00503.x. [DOI] [PubMed] [Google Scholar]
- 27.Fontana M., Zero D. T. Assessing patients’ caries risk. The Journal of the American Dental Association . 2006;137(9):1231–1239. doi: 10.14219/jada.archive.2006.0380. [DOI] [PubMed] [Google Scholar]
- 28.Koulourides T. Implications of remineralization in the treatment of dental caries. 1986. [PubMed]
- 29.Black G. V. Mottled teeth: an endemic developmental imperfection of the enamel of the teeth heretofore unknown in the literature of dentistry. Dental Cosmos . 1916;58:129–156. [Google Scholar]
- 30.Zero D. T., Fontana M., Martínez-Mier E. A., et al. The Biology, Prevention, Diagnosis and Treatment of Dental Caries. The Journal of the American Dental Association . 2009;140:25S–34S. doi: 10.14219/jada.archive.2009.0355. [DOI] [PubMed] [Google Scholar]
- 31.Raper H. R. Practical Clinical Preventive Dentistry Based Upon Periodic Roentgen-Ray ExaminationsA reply to Dr. Kells’ paper, The Limitations of the Roentgen-Ray in Dental Practice, given, in substance, in a discussion presented before the American Dental Association, Dallas, Texas, November 10–14, 1924. The Journal of the American Dental Association (1922) . 1925;12(9):1084–1100. doi: 10.14219/jada.archive.1925.0251. [DOI] [Google Scholar]
- 32.Wenzel A. Bitewing and digital bitewing radiography for detection of caries lesions. Journal of Dental Research . 2004;83:72–75. doi: 10.1177/154405910408301s14. [DOI] [PubMed] [Google Scholar]
- 33.Mirza J., Robertson G. Vital guide to Dental implants. Vital . 2008;6(1):19–22. doi: 10.1038/vital890. [DOI] [Google Scholar]
- 34.Subramanyam R. B., Prasad K. P., Anuradha B. Different image segmentation techniques for dental image extraction. International Journal of Engineering Research in Africa . 2014;4(7):173–177. [Google Scholar]
- 35.Sezgin M., Sankur B. Survey over image thresholding techniques and quantitative performance evaluation. Journal of Electronic Imaging . 2004;13(1):146–166. [Google Scholar]
- 36.Silva G., Oliveira L., Pithon M. Automatic segmenting teeth in X-ray images: trends, a novel data set, benchmarking and future perspectives. Expert Systems with Applications . 2018;107:15–31. doi: 10.1016/j.eswa.2018.04.001. [DOI] [Google Scholar]
- 37.Lurie A., Tosoni G. M., Tsimikas J., Walker F. Recursive hierarchic segmentation analysis of bone mineral density changes on digital panoramic images. Oral surgery, oral medicine, oral pathology and oral radiology . 2012;113(4):549–558. doi: 10.1016/j.oooo.2011.10.002. [DOI] [PubMed] [Google Scholar]
- 38.Modi C. K., Desai N. P. May. A simple and novel algorithm for automatic selection of ROI for dental radiograph segmentation. Proceedings of the 2011 24th Canadian Conference on Electrical and Computer Engineering (CCECE); 2011; Ontario, Canada. pp. 000504–000507. [Google Scholar]
- 39.Abaza A., Ross A., Ammar H. November. Retrieving dental radiographs for post-mortem identification. Proceedings of the 2009 16th IEEE International Conference on Image Processing (ICIP); November 2009; Cairo Egypt. IEEE; pp. 2537–2540. [Google Scholar]
- 40.Ajaz A., Kathirvelu D. April. Dental biometrics: computer aided human identification system using the dental panoramic radiographs. Proceedings of the 2013 International Conference on Communication and Signal Processing; 2013; Melmaruvathur, India. IEEE; pp. 717–721. [Google Scholar]
- 41.Azevedo A. D. C. S., Alves N. Z., Michel-Crosato E., Rocha M., Cameriere R., Biazevic M. G. H. Dental age estimation in a Brazilian adult population using Cameriere’s method. Brazilian Oral Research . 2015;29(1):1–9. doi: 10.1590/1807-3107bor-2015.vol29.0016. [DOI] [PubMed] [Google Scholar]
- 42.Wang Y., Lin C. L., Miller J. D. Improved 3D image segmentation for X-ray tomographic analysis of packed particle beds. Minerals Engineering . 2015;83:185–191. doi: 10.1016/j.mineng.2015.09.007. [DOI] [Google Scholar]
- 43.Xi Y., Jin Y., De Man B., Wang G. High-kVp assisted metal artifact reduction for X-ray computed tomography. IEEE Access . 2016;4:4769–4776. doi: 10.1109/access.2016.2602854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Amer Y. Y., Aqel M. J. An efficient segmentation algorithm for panoramic dental images. Procedia Computer Science . 2015;65:718–725. doi: 10.1016/j.procs.2015.09.016. [DOI] [Google Scholar]
- 45.Lin M., Chen J.-H., Wang X., Chan S., Chen S., Su M.-Y. Template-based automatic breast segmentation on MRI by excluding the chest region. Medical Physics . 2013;40(12):p. 122301. doi: 10.1118/1.4828837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Indraswari R., Arifin A. Z., Navastara D. A., Jawas N. September. Teeth segmentation on dental panoramic radiographs using decimation-free directional filter bank thresholding and multistage adaptive thresholding. Proceedings of the 2015 International Conference on Information Communication Technology and Systems (ICTS); 2015; Surabaya, Indonesia. IEEE; pp. 49–54. [Google Scholar]
- 47.Razali M. R. M., Ahmad N. S., Hassan R., Zaki Z. M., Ismail W. December. Sobel and canny edges segmentations for the dental age assessment. Proceedings of the 2014 International Conference on Computer Assisted System in Health; 2014; IEEE; pp. 62–66. [Google Scholar]
- 48.Kaur M., Singh D. Fusion of medical images using deep belief networks. Cluster Computing . 2019;34:1–15. [Google Scholar]
- 49.Alsmadi M. K. A hybrid Fuzzy C-Means and Neutrosophic for jaw lesions segmentation. Ain Shams Engineering Journal . 2018;9(4):697–706. doi: 10.1016/j.asej.2016.03.016. [DOI] [Google Scholar]
- 50.Tuan T. M. Dental segmentation from X-ray images using semi-supervised fuzzy clustering with spatial constraints. Engineering Applications of Artificial Intelligence . 2017;59:186–195. [Google Scholar]
- 51.Işın A., Direkoğlu C., Şah M. Review of MRI-based brain tumor image segmentation using deep learning methods. Procedia Computer Science . 2016;102:317–324. [Google Scholar]
- 52.Niroshika U. A. A., Meegama R. G. N. Active Contour Model for the Detection of Sharp Corners in Image Boundaries . 2013. [Google Scholar]
- 53.Hasan A., Meziane F., Aspin R., Jalab H. Segmentation of brain tumors in MRI images using three-dimensional active contour without edge. Symmetry . 2016;8(11):p. 132. doi: 10.3390/sym8110132. [DOI] [Google Scholar]
- 54.Rad A., Mohd Rahim M. S., Rehman A., Altameem A., Saba T. Evaluation of current dental radiographs segmentation approaches in computer-aided applications. IETE Technical Review . 2013;30(3):210–222. doi: 10.4103/0256-4602.113498. [DOI] [Google Scholar]
- 55.Kumar R., Talukdar F. A., Dey N., et al. Information Technology and Intelligent Transportation Systems, Berlin, Germany: Springer; 2017. Histogram thresholding in image segmentation: a joint level set method and lattice Boltzmann method based approach. [DOI] [Google Scholar]
- 56.Li Y., Shi H., Jiao L., Liu R. Quantum evolutionary clustering algorithm based on watershed applied to SAR image segmentation. Neurocomputing . 2012;87:90–98. doi: 10.1016/j.neucom.2012.02.008. [DOI] [Google Scholar]
- 57.Garg I., Kaur B. March. Color based segmentation using K-mean clustering and watershed segmentation. Proceedings of the 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom); 2016; Noida, India. IEEE; pp. 3165–3169. [Google Scholar]
- 58.Yau H. T., Lin Y. K., Tsou L. S., Lee C. Y. An adaptive region growing method to segment inferior alveolar nerve canal from 3D medical images for dental implant surgery. Computer-Aided Design and Applications . 2008;5(5):743–752. doi: 10.3722/cadaps.2008.743-752. [DOI] [Google Scholar]
- 59.Sharifi M., Fathy M., Mahmoudi M. T. April. A classified and comparative study of edge detection algorithms. Proceedings of the International Conference on Information Technology: Coding and Computing; 2002; Las Vegas, NV, USA. IEEE; pp. 117–120. [Google Scholar]
- 60.Tangel M. L., Fatichah C., Yan F., et al. June. Dental classification for periapical radiograph based on multiple fuzzy attribute. Proceedings of the 2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS); June 2013; Edmonton, Canada. IEEE; pp. 304–309. [Google Scholar]
- 61.Tangel M. L., Fatichah C., Fatichah C., Widyanto M. R., Dong F., Hirota K. Multiscale image aggregation for dental radiograph segmentation. Journal of Advanced Computational Intelligence and Intelligent Informatics . 2012;16(3):388–396. doi: 10.20965/jaciii.2012.p0388. [DOI] [Google Scholar]
- 62.Chen H., Zhang K., Lyu P., et al. A deep learning approach to automatic teeth detection and numbering based on object detection in dental periapical films. Scientific Reports . 2019;9(1):3840–11. doi: 10.1038/s41598-019-40414-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Jader G., Fontineli J., Ruiz M., Abdalla K., Pithon M., Oliveira L. October. Deep instance segmentation of teeth in panoramic X-ray images. Proceedings of the 2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI); 2018; IEEE; pp. 400–407. [Google Scholar]
- 64.Ren S., He K., Girshick R., Sun J. Faster r-cnn: towards real-time object detection with region proposal networks. Advances in Neural Information Processing Systems . 2015:91–99. doi: 10.1109/TPAMI.2016.2577031. [DOI] [PubMed] [Google Scholar]
- 65.Kaur G., Rani J. Mri Brain Tumor Segmentation Methods-A Review. 2016.
- 66.Geetha V., Aprameya K. S., Hinduja D. M. Dental caries diagnosis in digital radiographs using back-propagation neural network. Health Information Science and Systems . 2020;8(1):8–14. doi: 10.1007/s13755-019-0096-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Jusman Y., Tamarena R. I., Puspita S., Saleh E., Kanafiah S. N. A. M. August. Analysis of features extraction performance to differentiate of dental caries types using gray level Co-occurrence matrix algorithm. Proceedings of the 2020 10th IEEE International Conference on Control System, Computing and Engineering (ICCSCE); 2020; Penang, Malasyia. pp. 148–152. [Google Scholar]
- 68.Lee S., Oh S. I., Jo J., Kang S., Shin Y., Park J. W. Deep learning for early dental caries detection in bitewing radiographs. 2021. [DOI] [PMC free article] [PubMed]
- 69.Obuchowicz R., Nurzynska K., Obuchowicz B., Urbanik A., Piórkowski A. Caries detection enhancement using texture feature maps of intraoral radiographs. Oral Radiology . 2020;36(3):275–287. doi: 10.1007/s11282-018-0354-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Galloway M. M., Mm G. Texture Analysis Using Gray Level Run Lengths . 1975. [Google Scholar]
- 71.Eqbal S., Paliwal R. K., Yadava R. L. FCM and SVM based medical image feature extraction using wavelet transform. Tathapi with ISSN 2320-0693 is an UGC CARE Journal . 2020;19(23):24–32. [Google Scholar]
- 72.Raj J. R. F., Vijayalakshmi K., Kavi Priya S. Medical image denoising using multi-resolution transforms. Measurement . 2019;145:769–778. doi: 10.1016/j.measurement.2019.01.001. [DOI] [Google Scholar]
- 73.Deshmukh H. U., Thorat S. S. Tumor detection from brain mri image using neural network approach. 2018.
- 74.Haralick R. M., Shanmugam K., Dinstein I. H. Textural features for image classification. IEEE Transactions on systems, man, and cybernetics . 1973;6:610–621. doi: 10.1109/tsmc.1973.4309314. [DOI] [Google Scholar]
- 75.Ps S. K., Dharun V. S. Extraction of texture features using GLCM and shape features using connected regions. International Journal of Engineering and Technology . 2016 doi: 10.21817/ijet/2016/v8i6/160806254. [DOI] [Google Scholar]
- 76.Korchiyne R., Farssi S. M., Sbihi A., Touahni R., Alaoui M. T. A combined method of fractal and GLCM features for MRI and CT scan images classification. 2014.
- 77.Amadasun M., King R. Textural features corresponding to textural properties. IEEE Transactions on systems, man, and Cybernetics . 1989;19(5):1264–1274. doi: 10.1109/21.44046. [DOI] [Google Scholar]
- 78.Fekri-Ershad S. Innovative Texture Database Collecting Approach and Feature Extraction Method Based on Combination of Gray Tone Difference Matrixes, Local Binary Patterns, and K-Means Clustering . 2018. arXiv preprint arXiv:1803.04125. [Google Scholar]
- 79.Laws K. Los Angeles, CA, USA: University of Southern California; 1980. Textured image segmentation. Ph. D. Dissertation. [Google Scholar]
- 80.Orooji M., Alilou M., Rakshit S., et al. Combination of computer extracted shape and texture features enables discrimination of granulomas from adenocarcinoma on chest computed tomography. Journal of medical imaging (Bellingham, Wash.) . 2018;5(2) doi: 10.1117/1.JMI.5.2.024501.024501 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Ramapraba P. S., Chitra M. P., Prem Kumar M. Effective lesion detection of colposcopic images using active contour method. 2017.
- 82.Ojala T., Pietikäinen M., Harwood D. A comparative study of texture measures with classification based on featured distributions. Pattern Recognition . 1996;29(1):51–59. doi: 10.1016/0031-3203(95)00067-4. [DOI] [Google Scholar]
- 83.Verma M., Raman B. Center symmetric local binary co-occurrence pattern for texture, face and bio-medical image retrieval. Journal of Visual Communication and Image Representation . 2015;32:224–236. doi: 10.1016/j.jvcir.2015.08.015. [DOI] [Google Scholar]
- 84.Liu L., Lao S., Fieguth P. W., Guo Y., Wang X., Pietikäinen M. Median robust extended local binary pattern for texture classification. IEEE Transactions on Image Processing . 2016;25(3):1368–1381. doi: 10.1109/tip.2016.2522378. [DOI] [PubMed] [Google Scholar]
- 85.Min E., Guo X., Liu Q., Zhang G., Cui J., Long J. A survey of clustering with deep learning: from the perspective of network architecture. IEEE Access . 2018;6:39501–39514. doi: 10.1109/access.2018.2855437. [DOI] [Google Scholar]
- 86.Li F., Liu M. Alzheimer’s disease diagnosis based on multiple cluster dense convolutional networks. Computerized Medical Imaging and Graphics . 2018;70:101–110. doi: 10.1016/j.compmedimag.2018.09.009. [DOI] [PubMed] [Google Scholar]
- 87.Edwards S., Brown S., Lee M. Automated 3-D tissue segmentation via clustering. Journal of Biomedical Engineering and Medical Imaging . 2018;5(2):p. 08. doi: 10.14738/jbemi.52.4204. [DOI] [Google Scholar]
- 88.Zhao K., Zhou L., Qian P., et al. A transfer fuzzy clustering and neural network based tissue segmentation method during PET/MR attenuation correction. Journal of Medical Imaging and Health Informatics . 2019;9(7):1491–1497. doi: 10.1166/jmihi.2019.2749. [DOI] [Google Scholar]
- 89.Geetha V., Aprameya K. S. Textural analysis based classification of digital X-ray images for dental caries diagnosis. Int J Eng Manuf (IJEM)., . 2019;9(3):44–45. [Google Scholar]
- 90.Avinash S., Manjunath K., Kumar S. S. International Conference on Inventive Computation Technologies (ICICT), Vol. 3. IEEE; 2016. August. An improved image processing analysis for the detection of lung cancer using Gabor filters and watershed segmentation technique; pp. 1–6. [Google Scholar]
- 91.Aguirre-Ramos H., Avina-Cervantes J. G., Cruz-Aceves I., Ruiz-Pinales J., Ledesma S. Blood vessel segmentation in retinal fundus images using Gabor filters, fractional derivatives, and Expectation Maximization. Applied Mathematics and Computation . 2018;339:568–587. doi: 10.1016/j.amc.2018.07.057. [DOI] [Google Scholar]
- 92.Hegde M., Sivakumar P. A. Development and implementation of vlsi reconfigurable architecture for gabor filter in medical imaging application. International Journal of Engineering and Management Research (IJEMR) . 2018;8(3):71–76. doi: 10.31033/ijemr.8.3.10. [DOI] [Google Scholar]
- 93.Xiaoyang Tan X., Triggs B. Enhanced local texture feature sets for face recognition under difficult lighting conditions. IEEE Transactions on Image Processing . 2010;19(6):1635–1650. doi: 10.1109/tip.2010.2042645. [DOI] [PubMed] [Google Scholar]
- 94.Murala S., Jonathan Wu Q. M. Local ternary co-occurrence patterns: a new feature descriptor for MRI and CT image retrieval. Neurocomputing . 2013;119:399–412. doi: 10.1016/j.neucom.2013.03.018. [DOI] [Google Scholar]
- 95.Deep G., Kaur L., Gupta S. Local mesh ternary patterns: a new descriptor for MRI and CT biomedical image indexing and retrieval. Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization . 2018;6(2):155–169. doi: 10.1080/21681163.2016.1193447. [DOI] [Google Scholar]
- 96.Rabidas R., Midya A., Sadhu A., Chakraborty J. March. Benign-malignant mass classification in mammogram using edge weighted local texture features. Medical Imaging 2016: Computer-Aided Diagnosis . 2016;978597851X [Google Scholar]
- 97.Naam J., Harlan J., Madenda S., Wibowo E. P. Identification of the proximal caries of dental x-ray image with multiple morphology gradient method. International Journal of Advanced Science, Engineering and Information Technology . 2016;6(3):343–346. [Google Scholar]
- 98.Naam J., Harlan J., Madenda S., Wibowo E. P. The algorithm of image edge detection on panoramic dental X-ray using multiple morphological gradient (mMG) method. International Journal of Advanced Science, Engineering and Information Technology . 2016;6(6):1012–1018. [Google Scholar]
- 99.Liang Y., Sun L., Ser W., et al. Hybrid threshold optimization between global image and local regions in image segmentation for melasma severity assessment. Multidimensional Systems and Signal Processing . 2017;28(3):977–994. doi: 10.1007/s11045-015-0375-y. [DOI] [Google Scholar]
- 100.Rakhlin A., Shvets A., Iglovikov V., Kalinin A. A. Lecture Notes in Computer Science . Cham: Springer; 2018. Deep Convolutional Neural Networks for Breast Cancer Histology Image Analysis; pp. 737–744. [DOI] [Google Scholar]
- 101.Naqi S. M., Sharif M., Yasmin M., Fernandes S. L. Lung nodule detection using polygon approximation and hybrid features from CT images. Current Medical Imaging . 2018;14(1):108–117. [Google Scholar]
- 102.Gabor D. Theory of communication. Part 1: the analysis of information. Journal of the Institution of Electrical Engineers - Part III: Radio and Communication Engineering . 1946;93(26):429–441. doi: 10.1049/ji-3-2.1946.0074. [DOI] [Google Scholar]
- 103.Singh P., Sehgal P. G.V Black dental caries classification and preparation technique using optimal CNN-LSTM classifier. Multimedia Tools and Applications . 2021;80(4):5255–5272. doi: 10.1007/s11042-020-09891-6. [DOI] [Google Scholar]
- 104.Litjens G., Kooi T., Bejnordi B. E., et al. A survey on deep learning in medical image analysis. Medical Image Analysis . 2017;42:60–88. doi: 10.1016/j.media.2017.07.005. [DOI] [PubMed] [Google Scholar]
- 105.Larochelle H., Erhan D., Courville A., Bergstra J., Bengio Y. June. An empirical evaluation of deep architectures on problems with many factors of variation. Proceedings of the 24th International Conference on Machine Learning; 2007; Oregon, USA. pp. 473–480. [Google Scholar]
- 106.Krizhevsky A., Sutskever I., Hinton G. E. Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems . 2012;21:1097–1105. [Google Scholar]
- 107.LeCun Y., Bottou L., Bengio Y., Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE . 1998;86(11):2278–2324. doi: 10.1109/5.726791. [DOI] [Google Scholar]
- 108.Simonyan K., Zisserman A. Very deep convolutional networks for large-scale image recognition. 2014. https://arxiv.org/abs/1409.1556 .
- 109.Goodfellow I. J., Shlens J., Szegedy C. Explaining and harnessing adversarial examples. 2014. https://arxiv.org/abs/1412.6572 .
- 110.Khosravi P., Kazemi E., Imielinski M., Elemento O., Hajirasouliha I. Deep convolutional neural networks enable discrimination of heterogeneous digital pathology images. EBioMedicine . 2018;27:317–328. doi: 10.1016/j.ebiom.2017.12.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Kamnitsas K., Ledig C., Newcombe V. F. J., et al. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Medical Image Analysis . 2017;36:61–78. doi: 10.1016/j.media.2016.10.004. [DOI] [PubMed] [Google Scholar]
- 112.Couprie C., Farabet C., Najman L., LeCun Y. Indoor semantic segmentation using depth information. 2013. https://arxiv.org/abs/1301.3572 .
- 113.Prasoon A., Petersen K., Igel C., Lauze F., Dam E., Nielsen M. Deep Feature Learning for Knee Cartilage Segmentation Using a Triplanar Convolutional Neural Network. Advanced Information Systems Engineering ; Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention; 2013; Berlin, Heidelberg. pp. 246–253. [DOI] [PubMed] [Google Scholar]
- 114.Setio A. A. A., Ciompi F., Litjens G., et al. Pulmonary nodule detection in CT images: false positive reduction using multi-view convolutional networks. IEEE Transactions on Medical Imaging . 2016;35(5):1160–1169. doi: 10.1109/tmi.2016.2536809. [DOI] [PubMed] [Google Scholar]
- 115.Long J., Shelhamer E., Darrell T. Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; 2015; Boston, MA, USA. pp. 3431–3440. [DOI] [PubMed] [Google Scholar]
- 116.Ronneberger O., Fischer P., Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. Lecture Notes in Computer Science . 2015;24:234–241. doi: 10.1007/978-3-319-24574-4_28. [DOI] [Google Scholar]
- 117.Çiçek Ö., Abdulkadir A., Lienkamp S. S., Brox T., Ronneberger O. 3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation. Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016 . 2016;42:424–432. doi: 10.1007/978-3-319-46723-8_49. [DOI] [Google Scholar]
- 118.Milletari F., Navab N., Ahmadi S. A. 2016 fourth international conference on 3D vision, 3DV. IEEE; 2016. October. V-net: fully convolutional neural networks for volumetric medical image segmentation; pp. 565–571. [Google Scholar]
- 119.Hochreiter S., Schmidhuber J. Long short-term memory. Neural Computation . 1997;9(8):1735–1780. doi: 10.1162/neco.1997.9.8.1735. [DOI] [PubMed] [Google Scholar]
- 120.Chung J., Gulcehre C., Cho K., Bengio Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. 2014.
- 121.Stollenga M. F., Byeon W., Liwicki M., Schmidhuber J. Parallel multi-dimensional lstm, with application to fast biomedical volumetric image segmentation. Advances in Neural Information Processing Systems . 2015;43:2998–3006. [Google Scholar]
- 122.Vincent P., Larochelle H., Lajoie I., Bengio Y., Manzagol P. A., Bottou L. Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research . 2010;11(12) [Google Scholar]
- 123.Uzunova H., Schultz S., Handels H., Ehrhardt J. Unsupervised pathology detection in medical images using conditional variational autoencoders. International journal of computer assisted radiology and surgery . 2019;14(3):451–461. doi: 10.1007/s11548-018-1898-0. [DOI] [PubMed] [Google Scholar]
- 124.Zhang H., Goodfellow I., Metaxas D., Odena A. May. Self-attention generative adversarial networks. Proceedings of the International Conference on Machine Learning; 2019; Long Beach, CA, USA. pp. 7354–7363. [Google Scholar]
- 125.Korkinof D., Rijken T., O’Neill M., Yearsley J., Harvey H., Glocker B. High-resolution mammogram synthesis using progressive generative adversarial networks. 2018.
- 126.Khatami A., Khosravi A., Lim C. P., Nahavandi S. A Wavelet Deep Belief Network-Based Classifier for Medical Images. Neural Information Processing ; Proceedings of the International Conference on Neural Information Processing; 2016; Berlin, Germany. pp. 467–474. [DOI] [Google Scholar]
- 127.Jia Y., Shelhamer E., Donahue J., et al. November. Caffe: convolutional architecture for fast feature embedding. Proceedings of the 22nd ACM International Conference on Multimedia; November 2014; Orlando, FL, USA. pp. 675–678. [Google Scholar]
- 128.Abadi M., Barham P., Chen J., et al. 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16) 2016. Tensorflow: a system for large-scale machine learning; pp. 265–283. [Google Scholar]
- 129.Maclaurin D., Duvenaud D., Adams R. P. Autograd: effortless gradients in numpy. ICML 2015 AutoML workshop . 2015;238(5) [Google Scholar]
- 130.Collobert R., Kavukcuoglu K., Farabet C. Torch7: a matlab-like environment for machine learning. BigLearn, NIPS Workshop . 2011;34 [Google Scholar]
- 131. Chollet, F., 2018. Introduction to keras. March 9th.
- 132.Zagalsky A., Feliciano J., Storey M. A., Zhao Y., Wang W. February. The emergence of github as a collaborative platform for education. Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work Social Computing; 2015; Vancouver, Canada. pp. 1906–1917. [Google Scholar]
- 133.Shen W., Zhou M., Yang F., Yang C., Tian J. Multi-scale Convolutional Neural Networks for Lung Nodule Classification. Lecture Notes in Computer Science . 2015;34:588–599. doi: 10.1007/978-3-319-19992-4_46. [DOI] [PubMed] [Google Scholar]
- 134.Kawahara J., BenTaieb A., Hamarneh G. April. Deep features to classify skin lesions. Proceedings of the 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI) . 2016;32:1397–1400. [Google Scholar]
- 135.Ghafoorian M., Karssemeijer N., Heskes T., et al. Deep multi-scale location-aware 3D convolutional neural networks for automated detection of lacunes of presumed vascular origin. NeuroImage: Clinical . 2017;14:391–399. doi: 10.1016/j.nicl.2017.01.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Simonovsky M., Komodakis N. Onionnet: sharing features in cascaded deep classifiers. 2016.
- 137.Rather A. M., Agarwal A., Sastry V. N. Recurrent neural network and a hybrid model for prediction of stock returns. Expert Systems with Applications . 2015;42(6):3234–3241. doi: 10.1016/j.eswa.2014.12.003. [DOI] [Google Scholar]
- 138.Lee R. S., Gimenez F., Hoogi A., Rubin D. Vol. 8. The Cancer Imaging Archive; 2016. Curated breast imaging subset of DDSM. [Google Scholar]
- 139.Everingham M., Van Gool L., Williams C. K. I., Winn J., Zisserman A. The pascal visual object classes (voc) challenge. International Journal of Computer Vision . 2010;88(2):303–338. doi: 10.1007/s11263-009-0275-4. [DOI] [Google Scholar]
- 140.Fei-Fei L., Fergus R., Perona P. June. Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop; 2004; Washington, D.C., USA. IEEE; p. p. 178. [Google Scholar]
- 141.LeCun Y., Huang F. J., Bottou L. June. Learning methods for generic object recognition with invariance to pose and lighting. 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004 . 2004;2:II–104. [Google Scholar]
- 142.Krizhevsky A., Hinton G. Learning multiple layers of features from tiny images. 2009.
- 143.Li Deng L. The mnist database of handwritten digit images for machine learning research [best of the web] IEEE Signal Processing Magazine . 2012;29(6):141–142. doi: 10.1109/msp.2012.2211477. [DOI] [Google Scholar]
- 144.Russell B. C., Torralba A., Murphy K. P., Freeman W. T. LabelMe: a database and web-based tool for image annotation. International Journal of Computer Vision . 2008;77(1–3):157–173. doi: 10.1007/s11263-007-0090-8. [DOI] [Google Scholar]
- 145.Deng J., Dong W., Socher R., Li L. J., Li K., Fei-Fei L. June. Imagenet: a large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition; June 2009; Long Beach, CA, USA. pp. 248–255. [Google Scholar]
- 146.Adagbasa E. G., Adelabu S. A., Okello T. W. Application of deep learning with stratified K-fold for vegetation species discrimation in a protected mountainous region using Sentinel-2 image. Geocarto International . 2019;43:1–21. doi: 10.1080/10106049.2019.1704070. [DOI] [Google Scholar]
- 147.Sun X., Gossmann A., Wang Y., Bischt B. December. Variational resampling based assessment of deep neural networks under distribution shift. Proceedings of the 2019 IEEE Symposium Series on Computational Intelligence (SSCI); December 2019; Xiamen, China. IEEE; pp. 1344–1353. [Google Scholar]
- 148.Abdualgalil B., Abraham S. February. Applications of machine learning algorithms and performance comparison: a review. Proceedings of the 2020 International Conference on Emerging Trends in Information Technology and Engineering (Ic-ETITE); 2020; Vellore, India. IEEE; pp. 1–6. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Deep learning networks require large amount of data to be able to achieve meaningful and effective performance results. Due to the nature of dental images, there is a need for hybrid datasets to aid good performance of the networks. There is a need for publicly available datasets for dental images to make deep learning in the field possible
