Skip to main content
Evidence-based Complementary and Alternative Medicine : eCAM logoLink to Evidence-based Complementary and Alternative Medicine : eCAM
. 2015 Jul 13;2015:897580. doi: 10.1155/2015/897580

Significant Geometry Features in Tongue Image Analysis

Bob Zhang 1,*, Han Zhang 1
PMCID: PMC4515517  PMID: 26246842

Abstract

The shape of a human tongue and its relation to a patients' state, either healthy or diseased (and if diseased which disease), is quantitatively analyzed using geometry features by means of computerized methods in this paper. Thirteen geometry features based on measurements, distances, areas, and their ratios are extracted from tongue images captured by a specially designed device with color correction. Using the features, 5 tongue shapes (rectangle, acute and obtuse triangles, square, and circle) are defined based on traditional Chinese medicine (TCM). Classification of the shapes is subsequently carried out with a decision tree. A large dataset consisting of 672 images comprising of 130 healthy and 542 disease examples (labeled according to Western medical practices) are tested. Experimental results show that the extracted geometry features are effective at tongue shape classification (coarse level). Even if more than one disease class belongs to the same shape, the disease classes can still be discriminated via fine level classification using a combination of the geometry features, with an average accuracy of 76.24% for all shapes.

1. Introduction

The human tongue contains numerous features. Traditionally, medical practitioners would examine these features based on years of experience [15]. However, ambiguity and subjectivity are associated with their diagnostic results. To eliminate these qualitative aspects, tongue images can be objectively analyzed, which offers a new way to diagnose disease, one that minimizes the physical harm inflicted to patients (compared with other medical examinations).

In state-of-the-art computerized tongue image analysis, color and texture features are the most prevalent [619]. There exists little or no literature on tongue image analysis using geometry features, whereas in traditional medicines such as traditional Chinese medicine (TCM) the shape of a tongue can be used to determine a patients' illness [3, 4, 20]. The authors in [21] proposed an approach to automatically recognize tongue shapes based on geometry features. The seven geometric features included various measurements of length, area, and angle extracted from tongue images. Using a support decision tool to weight the relative influences of the geometry features, they classified an image into one of six tongue shapes, namely, hammer, rectangle, acute triangle, obtuse triangle, square, and round (based on TCM). Experimental results conducted on 362 tongue images exhibited an accuracy of 90.3% for shape classification. Nevertheless, there was little quantitative analysis between tongue shape and the relationship to its current health state.

In this paper we thoroughly examine the aforementioned problem via geometry features in tongue image analysis. The imaging device used to capture tongue images is made up of a 3-chip CCD camera with 8 bit resolution and two D65 fluorescent tubes placed symmetrically around the camera in order to produce a uniform illumination. The images captured were color corrected [22] to eliminate any noise caused by variations of illumination and device dependency. Also, the tongue image capture device ensures that the images are properly aligned. This allows consistent feature extraction and classification in the following steps. Figure 1 shows the capture device. Using this device we form a large tongue image database consisting of 672 samples. This database is composed of 130 healthy and 542 disease samples, divided into 7 classes with at least 19 examples. Every image is segmented [19] with the background removed and tongue foreground remaining. From each tongue image consisting of a tip, body, and root [20], 13 geometry features derived from measurements, distances, areas, and their ratios are extracted. Using these features we define 5 tongue shapes based on TCM, rectangle, acute and obtuse triangles, square, and circle. Coarse level classification applying a decision tree [23] was used to classify a tongue into one of the shapes. Experimental results showed that a majority of the samples in the classes from healthy and disease samples tend to be one form, where healthy versus disease samples and disease versus disease are separable using fine level classification, employing a combination of geometry features. This proves the significance of the geometry features at establishing a relationship between a tongue's state and its shape.

Figure 1.

Figure 1

Tongue image capture device.

The rest of this paper is organized as follows. Thirteen geometry features extracted from tongue images are presented in Section 2 along with the tongue shape classification performed using a decision tree. Following this, experimental results using coarse and fine level classifications are given in Section 3. Finally, concluding remarks are made in Section 4.

2. Materials and Methods

The tongue image dataset is first introduced in Section 2.1. Afterwards, a detailed description of the 13 geometry features extracted from a tongue image is given in Section 2.2. How these features are then used to classify a tongue into 5 major shapes is provided in Section 2.3.

2.1. Tongue Image Dataset

The tongue image database is composed of 672 images (one image per person) divided into 130 healthy and 542 disease samples. Seven disease classes and healthy classes were captured at Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangdong, China. Patients with diabetes mellitus were processed at the Hong Kong Foundation for Research and Development in Diabetes, Prince of Wales Hospital, Hong Kong. Healthy samples were verified through a blood test and other experiments. If indicators from the tests fall within a certain range they were deemed healthy. In the disease class, samples were collected from inpatients with illness determined by their admission note and diagnosed using western medical practices. Inpatients suffering from the same disease were grouped together into a single class. In total there were 7 disease groups (with at least 19 samples). A summary of the disease class breakdown is given in Table 1.

Table 1.

Disease class statistics listing its name and number of samples.

Disease name Number of samples
Diabetes mellitus (DM) 296
Nephritis (NR) 90
Gastritis verrucosa (GV) 67
Nephrotic syndrome (NS) 30
Erosive gastritis (EG) 20
Chronic gastritis (CG) 20
Coronary heart disease (CHD) 19

2.2. Geometry Features

In the following subsection we describe the 13 geometry features extracted from tongue images (which have been converted to binary images after segmentation [19]). These features based on measurements, distances, areas, and their ratios are used in subsequent sections to define and classify 5 tongue shapes.

2.2.1. Width

The width (w) feature (see Figure 2) is measured as the horizontal distance along the x-axis from a tongue's furthest right edge point (x max) to its furthest left edge point (x min):

w=xmaxxmin. (1)
Figure 2.

Figure 2

Illustration of features 1, 2, and 4.

2.2.2. Length

The length (l) feature (see Figure 2) is measured as the vertical distance along the y-axis from a tongue's furthest bottom edge (y max) point to its furthest top edge point (y min):

l=ymaxymin. (2)

2.2.3. Length-Width Ratio

The length-width ratio (lw) is the ratio of a tongue's length to its width:

lw=lw. (3)

2.2.4. Smaller Half Distance

Smaller half distance (z) is the half distance of l or w depending on which segment is shorter (see Figure 2):

z=minl,w2. (4)

2.2.5. Center Distance

The center distance (cd) (refer to Figure 3) is the distance from w's y-axis center point to the center point of l(y cp):

cd=maxyxmax+maxyxmin2ycp, (5)

where y cp = (y max + y min)/2.

Figure 3.

Figure 3

Illustration of feature 5.

2.2.6. Center Distance Ratio

Center distance ratio (cdr) is ratio of cd to l:

cdr=cdl. (6)

2.2.7. Area

The area (a) of a tongue is defined as the number of tongue foreground pixels.

2.2.8. Circle Area

Circle area (ca) is the area of a circle within the tongue foreground using smaller half distance z, where r = z (refer to Figure 4):

ca=πr2. (7)
Figure 4.

Figure 4

Illustration of feature 8.

2.2.9. Circle Area Ratio

Circle area ratio (car) is the ratio of ca to a:

car=caa. (8)

2.2.10. Square Area

Square area (sa) is the area of a square defined within the tongue foreground using smaller half distance z (refer to Figure 5):

sa=4z2. (9)
Figure 5.

Figure 5

Illustration of feature 10.

2.2.11. Square Area Ratio

Square area ratio (sar) is the ratio of sa to a:

sar=saa. (10)

2.2.12. Triangle Area

Triangle area (ta) is the area of a triangle defined within the tongue foreground (see Figure 6). The right point of the triangle is x max, the left point is x min, and the bottom is y max.

Figure 6.

Figure 6

Illustration of feature 12.

2.2.13. Triangle Area Ratio

Triangle area ratio (tar) is the ratio of ta to a:

tar=taa. (11)

2.3. Tongue Shape Classification

Based on TCM we define 5 tongue shapes, rectangle, acute triangle, obtuse triangle, square, and circle, which can be classified using the 13 features explained above (see Section 2.2). A rectangle tongue's vertical length is long, but its horizontal width along the tip, body, and root remains relatively constant. An acute triangle tongue's vertical length is longer than its largest horizontal width (at the root) but gradually decreases from the body down to the tip. If the tongue shape is an obtuse triangle, its horizontal width is greater than its vertical length, with the width steadily decreasing as it approaches the tip. In a square tongue shape both its horizontal width and vertical length are similar. Finally, if a tongue is circle, both the horizontal width and vertical length will be alike, but its car (8) will be closer to 1. Figure 7 depicts the 5 tongue shapes using typical samples.

Figure 7.

Figure 7

Typical samples to show the 5 tongue shapes.

To classify tongue images into its proper shape, a decision tree structure shown in Figure 8 is used. Given a tongue we first examine its length-width (lw) ratio. If this ratio is t low lwlwt high lw, the tongue shape must be square or circle (left branch), and if the ratio is t high lw < lw or t low lw > lw, the shape of the tongue can be rectangle, acute triangle, or obtuse triangle (right branch). The values of t low lw and t high lw are 0.95 and 1.05, respectively.

Figure 8.

Figure 8

Decision tree to classify the tongue shapes.

Focusing on the left branch, the average radius (r avg) of the tongue is first calculated as

ravg=l+w4 (12)

which is the average of w/2 and l/2. Next, the ratio T sc is computed as

Tsc=aravg2. (13)

If the tongue shape is approximately square, the value of T sc ≈ 4  (i.e., 4 · r avg 2/r avg 2), and if it is approximately circle, T scπ (i.e., π·r avg 2/r avg 2). Hence, the two shapes can be defined as

Square=Tscπ+ε,Circle=Tsc<π+ε, (14)

where ε is a constant equal to 0.1.

Turning our attention to the right branch, we initially calculate the ratio T rao:

Trao=al·w. (15)

If this ratio is greater than or equal to t rect, the tongue shape is rectangle:

Rectangle=Traotrect, (16)

where t rect is 0.85 and the maximum of T rao is 1. If T rao < t rect, the shape of the tongue is either acute or obtuse triangle. To determine which triangle, length-width ratio is used once again as follows:

Acute  Triangle=Trao<trectlwtao,Obtuse  Triangle=Trao<trectlw<tao, (17)

where t ao is given as 1.05. The parameter values listed above to classify a tongue image were chosen empirically.

3. Results and Discussions

The following section presents the experimental results. A coarse level classification showing the results of tongue shape classification is given in Section 3.1. Classes classified into the same shape are further differentiated in Section 3.2 through fine level classification using a combination of geometry features.

3.1. Coarse Level: Tongue Shape Classification Result

By applying the tongue shape classification algorithm (described above, see Section 2.3) to every image in the dataset, its shape can be determined. This result is listed in Table 2. In the table it can be seen that the most common shapes in healthy group are circle or square, representing 86.92% (113/130) of all samples. In DM the majority shape is obtuse triangle, accounting for 86.15% (255/296). For NR, acute triangle takes the majority with 83.33% (75/90). Having 85.07% (57/67) and 83.33% (25/30), rectangle is the most prevalent shape in GV and NS, respectively. In EG the dominant shapes are circle or square, making up 85.00% (17/20). Finally, in CG and CHD acute and obtuse triangles are the most widespread, embodying 80.00% (16/20) and 84.21% (16/19) of all images correspondingly. Figure 9 depicts three typical samples from the healthy class, while Figures 10(a)10(g) illustrate typical samples from the disease classes.

Table 2.

Tongue shape classification result for the dataset.

Rectangular Acute triangle Obtuse triangle Circle Square
Healthy 5 4 8 50 63
DM 7 22 255 2 10
NR 5 75 4 0 6
GV 57 5 4 0 1
NS 25 1 1 1 2
EG 1 1 1 5 12
CG 4 16 0 0 0
CHD 0 3 16 0 0

Figure 9.

Figure 9

Three typical samples from healthy samples.

Figure 10.

Figure 10

Three typical samples from (a) DM, (b) NR, (c) GV, (d) NS, (e) EG, (f) CG, and (g) CHD.

3.2. Fine Level: Classification Result within Each Shape

In the tongue shape classification results, there exists more than one class for each shape (see Table 2). To distinguish between classes with the same shape, a set of geometry features were selected and applied to SVM. Half the images in each class were randomly selected for training, while the other half was used as testing. To measure the performance, average accuracy was employed. The linear kernel function (dot product) was used to map the training data into kernel space, while a quadratic kernel produced similar classification results. k-NN was also tested but did not perform as well.

Utilizing a grouping of the features is logical, since not every feature can have a positive contribution to the final result, as is the case here where 13 features used for classification produced poor results. Therefore, an optimization of the features is necessary. To reduce the number of features, sequential forward selection (SFS) [23] was implemented. SFS is a feature selection method that begins with an empty set of features. It adds additional features based on maximizing some criterion function J and terminates when all features have been added. In our case J is the average accuracy of SVM. Below, each tongue shape is examined in detail.

3.2.1. Rectangle

Both GV and NS were classified into this shape. Applying SFS with SVM to separate the two classes, the highest average accuracy of 70.07% was achieved using features 9, 4, 7, 8, 2, 13, 12, and 1.

3.2.2. Acute Triangle

For this shape NR and CG were classified together. l (feature 2) attained the best average accuracy of 70.00%.

3.2.3. Obtuse Triangle

DM and CHD were assigned to obtuse triangle. Through a combination of features consisting of 1, 7, 4, 11, 13, and 2, the highest average accuracy of 76.23% was achieved.

3.2.4. Circle or Square

The maximum average accuracy of 88.65% was obtained using w (feature 1) and a (feature 7) to classify the two classes (healthy and EG) appointed to circle or square.

Table 3 summarizes the results for each tongue shape. For completeness, the average accuracy of healthy samples versus NR, GV, NS, CG, DM, and CHD is shown in Table 4. From this result, it can be seen that healthy samples are distinguishable compared to others, given their different tongue shapes.

Table 3.

Optimized classification result between various classes having identical tongue shape.

Tongue shape Class comparison Optimal feature(s) Average accuracy
Circle or square Healthy versus EG 1, 7 88.65%
Obtuse triangle DM versus. CHD 1, 7, 4, 11, 13, 2 76.23%
Acute triangle NR versus CG 2 70.00%
Rectangle GV versus NS 9, 4, 7, 8, 2, 13, 12, 1 70.07%
Table 4.

Optimized classification result between healthy versus NR, GV, NS, CG, DM, and CHD groups.

Classes compared with healthy Optimal feature(s) Average accuracy
NR 1, 7, 12, 8, 4 82.09%
GV 1, 8 86.72%
NS 7, 4, 1, 8 85.26%
CG 1 75.19%
DM 12, 1, 13, 9, 11 69.09%
CHD 8, 6, 4, 10, 13, 9, 2 80.89%

4. Conclusions

In this paper we thoroughly examined tongue shape and its relation to a patients' state (either healthy or diseased) using geometry features through computerized methods. With tongue images captured by a specially designed device that accounts for image correction and a large dataset labeled according to western medical practices, we have a solid foundation to carry out this objective study. Thirteen geometry features including measurements, distances, areas, and their ratios were extracted from each tongue image. The features helped define 5 tongue shapes rooted upon TCM and classified using a decision tree. In the experimental results, coarse level classification first showed that the tongue classes belong to different shapes. Although more than one class occupies the same shape, in fine level classification they are still distinguishable, when employing SFS with SVM (using a grouping of geometry features). This validates the significance of geometry features at shape classification, as well as healthy versus disease/disease versus disease classifications. With tongue shape and a persons' health state now established using computer-based methods, this potentially provides a new painless and efficient way to diagnose patients. A continuation of this work will investigate the fusion of all possible tongue features including color and texture in order to better determine a patients' state.

Acknowledgments

This work is supported by the University of Macau (SRG2013-00048-FST). The authors would like to thank Professor David Zhang for his contributions.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

References

  • 1.Kirschbaum B. Atlas of Chinese Tongue Diagnosis. Seattle, Wash, USA: Eastland Press; 2000. [Google Scholar]
  • 2.Kacera W. Ayurvedic Tongue Diagnosis. 2nd. Lotus Press; 2006. [Google Scholar]
  • 3.Li N. M. The Contemporary Investigations of Computerized Tongue Diagnosis, The Handbook of Chinese Tongue Diagnosis. Beijing, China: Shed-Yuan Publishing; 1994. [Google Scholar]
  • 4.Li N., Zhang D., Wang K. Tongue Diagnostics. Beijing, China: Shed-Yuan Publishing; 2006. [Google Scholar]
  • 5.Maciocia G. Tongue Diagnosis in Chinese Medicine. Seattle, Wash, USA: Eastland Press; 1995. [Google Scholar]
  • 6.Chiu C.-C. Development of a computerized tongue diagnosis system. Biomedical Engineering—Applications, Basis and Communications. 1996;8(4):342–350. [Google Scholar]
  • 7.Chiu C.-C. A novel approach based on computerized image analysis for traditional Chinese medical diagnosis of the tongue. Computer Methods and Programs in Biomedicine. 2000;61(2):77–89. doi: 10.1016/S0169-2607(99)00031-0. [DOI] [PubMed] [Google Scholar]
  • 8.Wang Y.-G., Yang J., Zhou Y., Wang Y.-Z. Region partition and feature matching based color recognition of tongue image. Pattern Recognition Letters. 2007;28(1):11–19. doi: 10.1016/j.patrec.2006.06.004. [DOI] [Google Scholar]
  • 9.Li B., Huang Q., Lu Y., Chen S., Liang R., Wang Z. A method of classifying tongue colors for traditional chinese medicine diagnosis based on the CIELAB color space. Proceedings of the International Conference on Medical Biometrics; 2007; pp. 153–159. [Google Scholar]
  • 10.Wang X., Zhang D. Statistical tongue color distribution and its application. Proceedings of the International Conference on Computer and Computational Intelligence; 2011. [Google Scholar]
  • 11.Zhang D. Automated Biometrics: Technologies and Systems. Boston, Mass, USA: Kluwer Academic Publishers; 2000. [Google Scholar]
  • 12.Zhang D., Pang B., Li N., Wang K., Zhang H. Computerized diagnosis from tongue appearance using quantitative feature classification. The American Journal of Chinese Medicine. 2005;33(6):859–866. doi: 10.1142/S0192415X05003466. [DOI] [PubMed] [Google Scholar]
  • 13.Zhang H.-Z., Wang K.-Q., Jin X.-S., Zhang D. SVR based color calibration for tongue image. Proceedings of the International Conference on Machine Learning and Cybernetics (ICMLC '05); August 2005; pp. 5065–5070. [Google Scholar]
  • 14.Zhang Y., Liang R., Wang Z., Fan Y., Li F. Analysis of the color characteristics of tongue digital images from 884 physical examination cases. Journal of Beijing University of Traditional Chinese Medicine. 2005;28:73–75. [Google Scholar]
  • 15.Li C. H., Yuen P. C. Tongue image matching using color content. Pattern Recognition. 2002;35(2):407–419. doi: 10.1016/S0031-3203(01)00021-8. [DOI] [Google Scholar]
  • 16.Su W., Xu Z. Y., Wang Z. Q., Xu J. T. Objectified study on tongue images of patients with lung cancer of different syndromes. Chinese Journal of Integrative Medicine. 2011;17(4):272–276. doi: 10.1007/s11655-011-0702-6. [DOI] [PubMed] [Google Scholar]
  • 17.Wang K. Q., Zhang D., Li N. M., Pang B. Tongue diagnosis based on biometric pattern recognition technology. In: Pal S. K., Pal A., editors. Pattern Recognition from Classical to Modern Approaches. 1st. World Scientific Publishing; 2001. pp. 575–598. [Google Scholar]
  • 18.Pang B., Zhang D., Wang K. Tongue image analysis for appendicitis diagnosis. Information Sciences. 2005;175(3):160–176. doi: 10.1016/j.ins.2005.01.010. [DOI] [Google Scholar]
  • 19.Pang B., Zhang D., Wang K. The bi-elliptical deformable contour and its application to automated tongue segmentation in chinese medicine. IEEE Transactions on Medical Imaging. 2005;24(8):946–956. doi: 10.1109/TMI.2005.850552. [DOI] [PubMed] [Google Scholar]
  • 20.Pae E. K., Lowe A. A. Tongue shape in obstructive sleep apnea patients. Angle Orthodontist. 1999;69(2):147–150. doi: 10.1043/0003-3219(1999)069<0147:TSIOSA>2.3.CO;2. [DOI] [PubMed] [Google Scholar]
  • 21.Huang B., Wu J., Zhang D., Li N. Tongue shape classification by geometric features. Information Sciences. 2010;180(2):312–324. doi: 10.1016/j.ins.2009.09.016. [DOI] [Google Scholar]
  • 22.Wang X., Zhang D. An optimized tongue image color correction scheme. IEEE Transactions on Information Technology in Biomedicine. 2010;14(6):1355–1364. doi: 10.1109/TITB.2010.2076378. [DOI] [PubMed] [Google Scholar]
  • 23.Duda R. O., Hart P. E., Stork D. G. Pattern Classification. 2nd. New York, NY, USA: Wiley-Interscience; 2000. [Google Scholar]

Articles from Evidence-based Complementary and Alternative Medicine : eCAM are provided here courtesy of Wiley

RESOURCES