Abstract
Aim: To describe a novel neural network based oral precancer (oral submucous fibrosis; OSF) stage detection method.
Method: The wavelet coefficients of transmission electron microscopy images of collagen fibres from normal oral submucosa and OSF tissues were used to choose the feature vector which, in turn, was used to train the artificial neural network.
Results: The trained network was able to classify normal and oral precancer stages (less advanced and advanced) after obtaining the image as an input.
Conclusions: The results obtained from this proposed technique were promising and suggest that with further optimisation this method could be used to detect and stage OSF, and could be adapted for other conditions.
Keywords: oral precancer, oral submucous fibrosis, collagen, transmission electron microscopy, image analysis, wavelet, artificial neural network
Cancers of the oral cavity are increasing at an alarming rate and have been reported to account for approximately 220 000 new cases each year for men (5% of all cancers) and about 90 000 for women (2% of all cancers).1 Two third of these cases are recorded in developing countries. A high incidence of oral cancer is seen in the Indian subcontinent as a result of the late diagnosis of potential precancerous lesions and conditions. Oral submucous fibrosis (OSF) is an insidious chronic progressive precancerous condition of the oral cavity and oropharynx with a high degree of malignant potential.2 A large proportion of these precancerous lesions convert to squamous cell carcinoma, the malignant transformation rate being in the order of 7.6%.2 OSF is prevalent in the Indian subcontinent in all age groups and across all socioeconomic strata, but the incidence exhibits regional and interregional variations (0.2–4.57%), being highest in Southern India.2 This disease is now a public health issue in many parts of the world including the UK,3 South Africa,4 and many southeast Asian countries.5,6
“The simple pathological evaluation procedure currently used for oral submucous fibrosis does not provide a quantitative analysis of the vital changes in the tissues”
The aetiology of OSF is not well understood, but maximum importance has been given to the chewing of areca nut in isolation or in combination with betel leaf or other tobacco products.7,8 Arecoline, an active alkaloid found in betel nuts, stimulates fibroblasts to increase collagen production by 150%.9 However, available data suggest that a multifactorial mechanism is involved in the pathogenesis of OSF, including areca nut chewing, the ingestion of chillies, and deficiencies of nutrients, trace metals, and vitamins, in addition to hypersensitivity to various dietary constituents and genetic and immunological predisposition. 2,8,10–12
Presently, no specific diagnostic test is available for OSF except for histopathological studies. The pathological status of OSF is mainly assessed by light microscopic studies of oral biopsies and clinical evaluation. Microscopically, the main features of OSF are less vascularised collagenous connective tissue with occasional progressive atrophy of the muscle fibres, a minimal to moderate degree of chronic inflammatory infiltration, and atrophic overlying epithelium with a variable degree of dysplastic changes.3,13
The main histopathological characteristic of OSF is the deposition of collagen in the subepithelial connective tissue leading to epithelial atrophy.14,15 It has been found that exposure of buccal mucosal fibroblasts to alkaloid may result in the accumulation of collagen.16 Reduced degradation of the α1(I) collagen trimer synthesised by OSF fibroblasts may induce alteration of the ratio of α1(I) to α2(I) chains.17 Collagenase activity has been found to be lower in OSF than in normal oral mucosa.18 These findings suggest that OSF should be considered as a collagen metabolic disorder resulting from alkaloid exposure and individual variations in collagen metabolism.
The simple pathological evaluation procedure currently used does not provide a quantitative analysis of the vital changes in the tissues—that is, epithelial dysplastic changes, subepithelial fibrosis, etc. Accordingly, histopathological grading of the disease state is also empirical. Ultrastructural studies on collagen in oral submucous fibrosis qualitatively noted variations in the width and periodicity of fibrillar collagen between diseased and normal tissue specimens.19 Therefore, our present study analyses transmission electron micrography (TEM) images of subepithelial fibrillar collagen in early and advanced stages of OSF and compares them with those obtained from normal oral mucosa by applying a novel computer aided technique. In this computer aided diagnostic (CAD) approach, a specific attempt is made to grade OSF stages in a quantitative manner, to minimise the ambiguity in the presently accepted empirical (clinical and histopathological) diagnostic procedure of OSF. This quantitative approach may finally lead to the identification of suitable biomarkers for the more specific grading of OSF.
CAD refers to a diagnostic process during which a radiologist uses computer analysis as a diagnostic aid to achieve a more accurate interpretation of the disease state. The precancer diagnostic importance of the CAD system coupled with wavelet ANN (artificial neural network) is studied by analysing collagen in OSF and normal oral mucosa. The wavelet transformation technique is ideal for obtaining information from signals that are aperiodic, noisy, intermittent, or transient. ANN is an important statistical tool20 that can improve the characterisation of pathological images especially of precancer and cancers. There are only a few reports on the successful applications of machine learning to precancer diagnosis.21–25 Thus, our study could strengthen the foundation of ANN in CAD applications.
MATERIALS AND METHODS
Selection of patients
Clinically diagnosed patients with OSF were subjected to incisional biopsy with their previous consent at the department of oral and maxillofacial pathology, R Ahmed Dental College and Hospital, Kolkata, India for histopathological evaluation. Portions of each oral mucosal biopsy from the clinically and histologically confirmed cases of early (n = 52) and advanced (n = 58) stages of OSF were taken for the analysis of fibrillar collagen ultrastructure by TEM. Normal healthy volunteers (n = 35) of similar age and food habits, but without the oral habit, were also included in our study.
Clinical classification of OSF stages
The classification/grading of OSF was carried out according to the degree of trismus, which directly correlates with the degree of fibrosis, progression of the disease, and location of the OSF lesion in the oral mucosa. Trismus, or reduction in the overall mouth opening, is one of the most reliable manifestations of OSF,11,12 and it has been a cardinal clinical feature for grading patients into different groups. The gradation of trismus and in turn the gradation of the disease process was assessed by measuring the distance between the incisal edges of the upper and lower central incisal teeth—the interincisal distance (IID)—using vernier calipers as follows: mild or moderate grade (IID, ⩾ 1.5 but < 3.5 cm) and severe grade (IID, < 1.5 cm), considering IID ⩾ 3.5 as normal.
TEM study
Biopsy samples (1 × 2 mm) were fixed in primary fixative (2.5% glutaraldehyde in 0.12M phosphate buffer) for 48 hours at 4°C. The tissues were then washed in 0.12M phosphate buffer and postfixed in 2% osmium tetroxide for two hours at room temperature with constant shaking. After postfixation, tissues were dehydrated in graded alcohol and processed for spur embedding. Ultrathin sections were cut on the Nova Ultratome (LKB, Sweden), collected on copper grids, and stained with uranyl acetate and lead citrate.26 Sections were observed under TEM (JEOL, Japan; 100CX TEM) at 60 kV.
Wavelet ANN based analysis of TEM collagen images
We have devised a novel wavelet ANN based scheme to detect and grade the stages of OSF (advanced and less advanced) by analysing TEM images of subepithelial collagen fibres from normal healthy volunteers and patients with OSF. Figure 1A–F shows representative images at a magnification of ×64 000.
Accordingly, a set of 145 random samples (64 × 64 pixels) of subepithelial collagen image data was used to make a large training set. The depth of all the images was 8. Two hundred and fifty six different shades were needed to represent all the colours from complete black to absolute white. This 64 × 64 pixel image was decomposed into four levels using the “HAAR” wavelet. Both the approximate and detail coefficients were extracted for each level. The feature vector was chosen judiciously with the combination of these approximate and detail coefficients. These feature vectors were used to train the ANN. During the test, the images were decomposed into 64 × 64 non-overlapping pixel blocks. The approximate and detail coefficients were extracted for each block. These data were then fed into the network. The output can be of three different types: advanced and less advanced stage of OSF and normal. Based on the number of blocks in each of these three classes, a decision will have been taken for the test image.
Wavelet and sub-band decomposition and feature vector extraction27,28 (http://www.mathworks.com/access/helpdesk/toolbox/wavelet/wavelet.shtml)
In continuous signal analysis, a signal g(t) is often represented by a weighted sum of basis functions:
(1) |
where ψi(t) is the basis function and ci is the coefficient.
Multiresolution analysis can be done using continuous and discrete wavelet transforms (CWT and DWT). To define CWT, a scaled and translated version of the basis function, called the mother wavelet, is used to achieve the constant Q requirement. Considering a real band pass filter with impulse response ψ(t) of zero mean:
(2) |
The CWT is defined as:
(3) |
where ψ*(t) is the complex conjugate of ψ(t), aɛR+, and bɛR.
Equation 3 can be rewritten as:
(4) |
where:
(5) |
In the case of discrete signals the theory of filter banks and their application to sub-band signal decomposition must be briefly discussed. For example, a simple two channel filter bank structure (fig 2) is used to explain the decomposition and reconstruction of a signal using DWTs.
A discrete time signal x(n) is applied to a pair of filters. A lower resolution signal is obtained by convoluting x(n) with a half band low pass filter having an impulse response h2(n). The half band signal can be made full band by down sampling with a factor of two (doubling the scale by a factor of two in the analysis). The z-transform Y2(z) of the resulting signal y2(n) can be expressed as:
(6) |
where H2(z) and X2(z) are the z-transform of h2(n) and x(n). Compared with the original signal x(n), the filtered signal y2(n) is reduced in resolution by a factor of two as a result of low pass filtering and doubled in scale as a result of down sampling. In a similar manner, it is possible to compute the added details of a signal as a high pass version of x(n) using a filter with impulse response h1(n) followed by down sampling. Wavelet analysis became popular in the multiresolution analysis of biomedical images because of its ability to obtain smooth approximation at different levels.29
The same sub-band decomposition can be carried out in the case of two dimensional signals. As mentioned in the previous section, large numbers of 64 × 64 pixel image templates were taken to extract the feature vector for training the ANN. Figure 3A–D shows four such training samples, two images of normal samples and two of advanced stage OSF. These sample images have been decomposed by HAAR wavelets into four levels. In each level, there are three sets of detail coefficients: horizontal (Hi), vertical (Vi), and diagonal (Di); i = 1, 2, 3, 4, as shown in fig 4. The dimensions of Hi, Vi, and Di are 64*2−i × 64*2−i pixels. Hi, Vi, and Di were extracted using the “detcoef2” function of MATLAB 5.1.24 The four approximation coefficients, Ai, were extracted from the four levels using “appcoef2”. The frobenious norm27 was computed for Hi, Vi, Di, and Ai and denoted as:
(7) |
The element of the feature vector (FV) is the frobenious norm of Hi, Vi, Di, and Ai.
(8) |
i = 1,2,3,4 where κ is set at 0.001.
Multilayered perceptron feed forward neural network
Figure 5 shows a representative multilayered perceptron feed forward network. Each node represented by the box is called a perceptron.
It has been proved that a three layered network can represent a non-linear function of any order.31 The numbers of layers and numbers of nodes in the hidden layer are guided by many practical aspects, such as redundancy, number of input training sets, spurious oscillations, etc. However, the most crucial part of an ANN based model is to train the network. The most widely studied and used training algorithm is the so called back propagation technique, which is robust and reliable. The main problem with neural network training is to devise a method of updating the representative weights that minimises the error. However, the updating of the weights has been done here by the Levenberg-Marquardt32 algorithm. The Levenberg-Marquardt algorithm performs much better with some knowledge of the process, so that quick convergence is obtained with a very small error.
Figure 6 shows the structure of the network. In this figure, vij and wpq denote the weights for the successive layers. The basic purpose of training a network is to optimise vij and wpq with respect to a particular set of input–output training patterns.
The responses at the hidden nodes bj = 1,2…p are calculated by evaluating the contributions from all the input nodes through a non-linear mapping function:
(9) |
where the function f(•) chosen is the “tansig” function of MATLAB 5.1.
The tansig function is given by:
(10) |
θj is the bias at the jth hidden layer node and ai is the input vector. Similarly, ck, k = 1,2,…q is calculated using:
(11) |
where τk is the bias at the kth output layer node.
The function “trainlm” in the neural network toolbox in MATLAB 5.1 was used to train the network. It uses a mixture of the Gauss-Newton method and a gradient descent technique for optimisation of the weights, wjk and vij.
The Levenberg-Marquardt weight update rule is:
(12) |
where J is the Jacobian matrix of derivatives of each error to each weight, μ is a scalar, I is the identity matrix, and e is an error vector. Training continues until the error goal is met and the minimum error gradient occurs.
Training/testing by ANN
In our present study, a three layer ANN was used. A single hidden layer with four neurones was found to be sufficient for training (fig 7). The maximum number of epochs was fixed at 400 and the sum squared error goal was kept at 10−5. The learning rate and minimum feasible gradient were chosen as 0.01 and 0.05, respectively. Detailed discussions of these parameters are available in Demuth and Beale.31
RESULTS
After successful training, several images from different patients were tested. The feature vector was extracted from each of the contiguous 64 × 64 pixel blocks by wavelet decomposition. These test feature vectors were fed into the ANN. Figs 8–10 show the ANN outputs for three different test images. The target output has three different states, denoted as −1, 0, and +1, which represent normal, less advanced stage, and advanced stage OSF, respectively. In the case of untrained test samples, the ANN output for a particular 64 × 64 pixel block may not map exactly to one of the three states—for example, a sample of advanced state OSF may produce an output of +0.9 instead of +1.0—so that bands need to be defined around these three states. These bands corresponding to normal, less advanced, and advanced stages have been designated as follows: if the output of a 64 × 64 pixel block is less than −0.5, the block is categorised as “normal”; if the output is between −0.5 and +0.5 is categorised as “less advanced”; and if the output is greater than +0.5 it is categorised as “advanced”.
In fig 8 a normal image has been tested and the target output of each 64 × 64 pixel block is set as −1. There are 60 blocks of 64 × 64 pixels. Of these 60 blocks, only nine are misclassified. Similarly, in fig 9 the test image is a less advanced stage of disease and the target output is set at 0. Of the total number of 297 blocks, 26 blocks are wrongly classified, whereas in fig 10, the test image is advanced stage of the disease and the target output is set as +1. Of 60 blocks only six do not show the correct signature.
To provide a measure of the efficiency of the proposed ANN technique, the properly classified block index (PCBI) has been computed for all the test images. PCBI has been defined as follows:
(13) |
where Np and NT are the total number of properly classified blocks and the total number of 64 × 64 pixel blocks in the test image. After classifying the blocks into three states −1, 0, and +1, the number of blocks in each state has been computed. If NP−1, NP0, and NP1 are the number of 64 × 64 pixel blocks classified as normal, less advanced, and advanced, respectively, the number of properly classified blocks is:
(14) |
Based on the magnitude of the PCBI, the final diagnosis is performed on the nature of the test images. A binary decision is taken based on the PCBI. If the PCBI is greater than 50% (fig 11), the wavelet ANN based system can identify the correct diagnosis. In table 1, 16 test images were used to check the diagnostic ability of the proposed wavelet ANN based system: all test images were diagnosed properly by the proposed system.
Table 1.
Test image | PCBI (%) | Test image | PCBI (%) |
1 | 85.00 | 9 | 87.45 |
2 | 86.54 | 10 | 89.42 |
3 | 56.82 | 11 | 86.67 |
4 | 73.33 | 12 | 95.19 |
5 | 77.14 | 13 | 95.73 |
6 | 87.33 | 14 | 91.67 |
7 | 91.25 | 15 | 79.50 |
8 | 71.43 | 16 | 75.34 |
PCBI, properly classified block index.
DISCUSSION
Our study proposes a novel ANN based CAD technique to identify the progressive stages of the oral precancerous condition OSF. TEM images of oral subepithelial collagen fibres (test images) are subdivided into 64 × 64 pixel contiguous blocks and these blocks undergo wavelet decomposition. The wavelet coefficients are used as feature vectors. In the less advanced stage of the disease, some of the blocks show the signature of the normal collagen image, whereas others have the signature of advanced stage OSF. As a result, the false detection rate is high in the less advanced stage of the disease but the PCBI is always greater than 50%. Because the final decision is taken based on the magnitude of the PCBI, it always leads to the correct diagnosis.
It should be mentioned that from these sample images 16 features were extracted using wavelet transformation. However, all 16 features may not contribute equally to classifying the image characteristics. There are some common features and some unwanted features resulting from noise that can mislead both the training process and the decision making process. Therefore, in future studies the feature set could be selectively weighted or restricted using a suitable technique,32 so that unwanted or misleading features could be deactivated or isolated.
Take home messages.
We have developed a novel neural network based method for detecting and staging oral precancer (oral submucous fibrosis; OSF)
The wavelet coefficients of transmission electron microscopy images of collagen fibres from normal oral submucosa and OSF tissues were used to choose the feature vector which, in turn, was used to train the artificial neural network
The trained network was able to classify normal and oral precancer stages (less advanced and advanced) after obtaining the image as an input
Abbreviations
ANN, artificial neural network
CAD, computer aided diagnosis
CWT, continuous wavelet transform
DWT, discrete wavelet transform
IID, interincisal distance
OSF, oral submucous fibrosis
PCBI, properly classified block index
TEM, transmission electron microscopy
REFERENCES
- 1.Parkin DM, Pisani P, Ferlay J. Estimates of the worldwide incidence of twenty five major cancers in 1990. Int J Cancer 1999;80:827–41. [DOI] [PubMed] [Google Scholar]
- 2.Aziz SR. Oral submucous fibrosis: an unusual disease. J N J Dent Assoc 1997;68:17–19. [PubMed] [Google Scholar]
- 3.Canniff JP, Harvey W, Harris M. Oral submucous fibrosis: its pathogenesis and management. Br Dent J 1986;160:429–34. [DOI] [PubMed] [Google Scholar]
- 4.Seedat HA, van Wyk CW. Betel-nut chewing and submucous fibrosis in Durban. S Afr Med J 1988;74:568–71. [PubMed] [Google Scholar]
- 5.Maher R, Lee AJ, Warnakulasuriya KA, et al. Role of areca nut in the causation of oral submucous fibrosis: a case–control study in Pakistan. J Oral Pathol Med 1994;23:65–9. [DOI] [PubMed] [Google Scholar]
- 6.Zain RB, Ikeda N, Razak IA, et al. A national epidemiological survey of oral mucosal lesions in Malaysia. Community Dent Oral Epidemiol 1997;25:377–83. [DOI] [PubMed] [Google Scholar]
- 7.Regezi JA, Sciubba JJ. Oral pathology: clinical pathologic correlations. London: WB Saunders, 1999.
- 8.Liao PH, Lee TL, Yang LC, et al. Adenomatous polyposis coli gene mutation and decreased wild-type p53 protein expression in oral submucous fibrosis: a preliminary investigation. Oral Surg Oral Med Oral Pathol Oral Radiol Endod 2001;92:202–7. [DOI] [PubMed] [Google Scholar]
- 9.Canniff JP, Harvey W. The aetiology of oral submucous fibrosis: the stimulation of collagen synthesis by extracts of areca nut. Int J Oral Surg 1981;10 (suppl 1) :163–7. [PubMed] [Google Scholar]
- 10.van Wyk CW, Grobler-Rabie AF, Martell RW, et al. HLA-antigens in oral submucous fibrosis. J Oral Pathol Med 1994;23:23–7. [DOI] [PubMed] [Google Scholar]
- 11.Paul RR, Chatterjee J, Das AK, et al. Altered elemental profile as indicator of homeostatic imbalance in pathogenesis of oral submucous fibrosis. Biol Trace Elem Res 2002;87:45–56. [DOI] [PubMed] [Google Scholar]
- 12.Paul RR, Chatterjee J, Das AK, et al. Zinc and iron as bioindicators of precancerous nature of oral submucous fibrosis. Biol Trace Elem Res 1996;54:213–30. [DOI] [PubMed] [Google Scholar]
- 13.Pindborg JJ, Sirsat SM. Oral submucous fibrosis. Oral Surg Oral Med Oral Pathol 1966;22:764–79. [DOI] [PubMed] [Google Scholar]
- 14.Hardie J. Oral submucous fibrosis. A review with case reports. J Can Dent Assoc 1987;53:389–93. [PubMed] [Google Scholar]
- 15.Huang IY, Shieh TY. Collagen content and types in oral submucous fibrosis. Gaoxiong Yi Xue Ke Xue Za Zhi 1989;5:162–71. [PubMed] [Google Scholar]
- 16.Harvey W, Scutt A, Meghji S, et al. Stimulation of human buccal mucosa fibroblasts in vitro by betel-nut alkaloids. Arch Oral Biol 1986;31:45–9. [DOI] [PubMed] [Google Scholar]
- 17.Kuo MY, Chen HM, Hahn LJ, et al. Collagen biosynthesis in human oral submucous fibrosis fibroblast cultures. J Dent Res 1995;74:1783–8. [DOI] [PubMed] [Google Scholar]
- 18.Shieh TY, Yang JF. Collagenase activity in oral submucous fibrosis. Proc Natl Sci Counc Repub China B 1992;16:106–10. [PubMed] [Google Scholar]
- 19.van Wyk CW, Seedat HA, Phillips VM. Collagen in submucous fibrosis: an electron microscopic study. J Oral Pathol Med 1990;19:182–7. [DOI] [PubMed] [Google Scholar]
- 20.Dayhoff JE, Deleo JM. Artificial neural network—opening the black box. Cancer 2001;91:1615–35. [DOI] [PubMed] [Google Scholar]
- 21.Kappen HJ, Neijt JP. Advanced ovarian cancer: neural network analysis to predict treatment outcome. Ann Oncol 1993;4 (suppl 4) :31–4. [DOI] [PubMed] [Google Scholar]
- 22.Maclin P, Dempsey J. Using an artificial neural network to diagnose hepatic masses. J Med Syst 1992;16:215–25. [DOI] [PubMed] [Google Scholar]
- 23.Ravdin PM, Clark GM. A practical application of neural network analysis for predicting outcome of individual breast cancer patients. Breast Cancer Res Treat 1992;22:285–93. [DOI] [PubMed] [Google Scholar]
- 24.Wilding P, Morgan MA, Grygotis AE, et al. Application of back propagation neural networks to diagnosis of breast and ovarian cancer. Cancer Lett 1994;74:143–53. [DOI] [PubMed] [Google Scholar]
- 25.Wu Y, Giger ML, Doi K, et al. Artificial neural networks in mammography: application to decision making in the diagnosis of breast cancer. Radiology 1993;187:81–7. [DOI] [PubMed] [Google Scholar]
- 26.Chatterjee J, De K, Basu SK, et al. Collagen, zinc and iron contents of rat skin irradiated with chronic low-dose X-ray. Indian J Med Res 1993;98:243–7. [PubMed] [Google Scholar]
- 27.Daubechies I. Ten lectures on wavelets. Philadelphia: SIAM, 1992.
- 28.Vellerli M, Herley C. Wavelet and filter banks: theory and design. IEEE Trans Signal Process 1992;40:2207–32. [Google Scholar]
- 29.Mishra A, Dutta PK, Ghosh MK. Non-rigid cardiac motion quantification from 2-D image sequences based on wavelet synthesis. Image and Vision Computing 2001;19:929–39. [Google Scholar]
- 30.Withdrawn.
- 31.Cybenko G. Approximation by superpositions of sigmoidal functions. Mathematics of Control Signals and Systems 1989;2:303–14. [Google Scholar]
- 32.Demuth H, Beale M. Neural network toolbox—for use with MATLAB users guide. Massachusetts: The Mathworks Inc, 1993.
- 33.Fiori S, Grimani F, Burrascano P. Novel neural network feature selection procedure by generalization maximization with application to automatic robot guidance. International Journal of Smart Engineering and System Design 2002;4:91–106. [Google Scholar]