Abstract
Background. Pattern identification (PI) is the basic system for diagnosis of patients in traditional Korean medicine (TKM). The purpose of this study was to identify misclassification objects in discriminant model of PI for improving the classification accuracy of PI for stroke. Methods. The study included 3306 patients with stroke who were admitted to 15 TKM hospitals from June 2006 to December 2012. We derive the four kinds of measure (D, R, S, and C score) based on the pattern of the profile graphs according to classification types. The proposed measures are applied to the data to evaluate how well those detect misclassification objects. Results. In 10–20% of the filtered data, misclassification rate of C score was highest compared to those rates of other scores (42.60%, 41.15%, resp.). In 30% of the filtered data, misclassification rate of R score was highest compared to those rates of other scores (40.32%). And, in 40–90% of the filtered data, misclassification rate of D score was highest compared to those rates of other scores. Additionally, we can derive the same result of C score from multiple regression model with two independent variables. Conclusions. The results of this study should assist the development of diagnostic standards in TKM.
1. Introduction
Due to the development of modern medicine, the average lifespan for human beings is anticipated to rise beyond 85 years of age within the following 20 years [1]. In the meantime, since the rate of aging in South Korea is expected to surge up to 35.1% by 2050, ranking 2nd in the world close to Japan (37.7%), geriatric diseases and the health of the elderly have emerged as one of the most critical social problems of improving the quality of life in the future [2]. In particular, stroke is one of the representative geriatric diseases, along with dementia. Personal and social insecurities caused by the disease have continued to grow. In addition, stroke ranks as the top mortality risk to Koreans among the single diseases and contributes to more than 70% of the in-patients at traditional Korean medical hospitals [3, 4]. In traditional Korean medicine (TKM), specific or nonspecific symptoms of patients are diagnosed by observing, listening, asking, and feeling their pulse under the diagnostic system of pattern identification (PI) in order to determine the cause, nature, treatment method, and treatment drugs of a disease [5–7]. This PI diagnosis collects specific or nonspecific symptoms of patients and classifies them into one of the hundreds of symptom classes. It is the essential core technology forming the backbone of diagnosis and treatment in oriental medicine. However, the PI diagnosis holds limited objectivity and reproducibility due to the lack of standardized measurement indices, and objectification problems have always arisen with respect to personal deviations among TKM physicians based on their knowledge and experience [6–8].
As the necessity for the standardization of diagnostic systems has recently come to the fore, studies have been underway to objectify diagnosis.
In the study titled “Fundamental Study for the Standardization and Objectification of Pattern Identification in Traditional Korean Medicine for Stroke (SOPI-Stroke),” which was conducted over 9 years from 2005 to 2013, the Korea Institute of Oriental Medicine (KIOM) proposed a standardization plan for PI/syndrome differentiation of stroke, established stroke PI diagnostic indices, built a database system relating to TKM clinical technologies by setting up a clinical index database, and founded a scientific basis for stroke and PI by discovering stroke and PI biological indices, to which the latest research methods, such as OMICS, were applied. Studies were carried out to discover biological indices that could be helpful to stroke prevention by finding out what the stroke risk factors were [9–16].
Consequently, the purpose of this study was to identify misclassification objects in discriminant model of PI for improving the classification accuracy of PI for stroke patients. Although current TKM PI diagnostic tools for stroke were developed after several years of research and prepared for public release, the tools still need corrections and modifications in many aspects [17–19]. In this study, the key topics for discussion involve appropriate statistical methods to reduce the probability of diagnostic misclassification.
2. Methods
2.1. Subjects
The study included 3306 patients with stroke who were admitted to 15 oriental medical university hospitals from June 2006 to December 2012. Each patient provided informed consent to undergo procedures that were approved by the respective institutions' Institutional Review Boards (IRB). Informed consent of all the study patients was obtained after a thorough explanation of the details. We enrolled stroke patients for enrollment within 30 days of the onset of their symptoms, provided that their diagnosis was confirmed by an imaging diagnosis such as computerized tomography (CT) or magnetic resonance imaging (MRI). Patients with traumatic stroke such as subarachnoid, subdural, and epidural hemorrhage were excluded from the study.
2.2. Measured Variables
Each patient was seen by two experts at the same department within each site. All experts who were well trained in standard operation procedures (SOPs) were participating in this study. The experts had at least three years of clinical experiences with stroke after finishing regular college education about TKM for six years. The examination parameters were extracted from parts of a case report form (CRF) for the standardization of stroke diagnosis that had been developed by an expert committee organized by the KIOM [7, 11, 12].
2.2.1. The Korean Standard PI for Stroke-3
PI process for differentiating stroke with four TKM types: the Fire-heat (FH) pattern, Dampness-phlegm (DP) pattern, Yin deficiency (YD) pattern, and Qi deficiency (QD) pattern [11, 12]. The FH pattern is characterized by any symptom of heat or fire that is contracted externally or engendered internally. The DP pattern is characterized by impeding Qi movement and its turbidity, heaviness, stickiness, and downward-flowing properties. The QD pattern is characterized by qi deficiency with diminished internal organ function, which is marked by shortness of breath, lassitude, listlessness, spontaneous sweating, a pale tongue, and a weak pulse. The YD pattern is characterized by yin deficiency with diminished moistening and the inability to restrain yang, which is usually manifested as fever [7, 9–13, 20]. The Korean Standard PI for Stroke-3 consists of 44 clinical indices and each clinical index belongs to its respective PI (Supplemental Table 1, in Supplementary Material available online at http://dx.doi.org/10.1155/2016/1912897).
2.3. Statistical Methods
After determining 12 different types of misclassification through discriminant analysis, we plotted it on the profile graphs according to types. And then we derive the four kinds of measure (D, R, S, and C score) based on the pattern analysis of the profile graphs. The proposed measures are applied to the stroke data to evaluate how well those detect misclassification objects.
2.3.1. Types of Misclassification
According to the results from the discriminant model classification, 2,209 patients posted correct classifications out of the total of 3,306 patients (66.82%) (Table 1). Out of the 3,306 patients, 1,097 were misclassified (33.2%) and the misclassification types are summarized in Table 2. To analyze the misclassification types, 44 clinical indices of the Korean Standard PI for Stroke-3 were grouped into four upper-class variables (QD, DP, YD, and FH pattern indices). In addition, the average and standard deviation of each upper-class variable was used to attain standardized scores, after which the misclassification types were analyzed (Figure 1).
Table 1.
Results using the classification of discriminant model.
| Classification result N (%) | ||||||
|---|---|---|---|---|---|---|
| QD | DP | YD | FH | Total | ||
| Physician's diagnosis | QD | 498 (66.94) | 115 (15.46) | 95 (12.77) | 36 (4.84) | 744 (22.50) |
| DP | 118 (10.61) | 783 (70.41) | 69 (6.21) | 142 (12.77) | 1112 (33.64) | |
| YD | 70 (14.64) | 55 (11.51) | 276 (57.74) | 77 (16.11) | 478 (14.46) | |
| FH | 46 (4.73) | 147 (15.12) | 127 (13.07) | 652 (67.08) | 972 (29.40) | |
| Total | 732 (22.14) | 1100 (33.27) | 567 (17.15) | 907 (27.44) | 3306 (100.00) | |
QD: Qi deficiency pattern; DP: Dampness-phlegm pattern; YD: Yin deficiency pattern; FH: Fire-heat pattern.
Table 2.
The mean values of the standardized scores for upper-class variables according to misclassification type.
| Types of misclassification | N (%) | Z QD | Z DP | Z YD | Z FH | |
|---|---|---|---|---|---|---|
| 1 | DPFH# | 142 (12.94) | −0.565 | −0.113 | −0.251 | 0.648 |
| 2 | DPQD | 118 (10.76) | 1.004 | −0.001 | −0.312 | −0.492 |
| 3 | DPYD | 69 (6.29) | 0.118 | −0.060 | 0.902 | 0.085 |
| 4 | FHDP | 147 (13.40) | −0.426 | 0.610 | −0.114 | 0.069 |
| 5 | FHQD | 46 (4.19) | 0.907 | −0.494 | −0.233 | 0.096 |
| 6 | FHYD | 127 (11.58) | −0.291 | −0.596 | 0.956 | 0.184 |
| 7 | QDDP | 115 (10.48) | 0.111 | 0.605 | −0.394 | −0.456 |
| 8 | QDFH | 36 (3.28) | 0.075 | −0.500 | −0.373 | 0.560 |
| 9 | QDYD | 95 (8.66) | 0.512 | −0.487 | 0.808 | −0.299 |
| 10 | YDDP | 55 (5.01) | −0.229 | 0.529 | −0.153 | −0.336 |
| 11 | YDFH | 77 (7.02) | −0.393 | −0.525 | 0.133 | 0.568 |
| 12 | YDQD | 70 (6.38) | 0.914 | −0.492 | 0.240 | −0.337 |
|
| ||||||
| Total | 1097 (100.00) | 0.067 | −0.063 | 0.110 | 0.017 | |
QD: Qi deficiency pattern; DP: Dampness-phlegm pattern; YD: Yin deficiency pattern; FH: Fire-heat pattern; DPFH#: physician's diagnosis- Dampness-phlegm pattern, classification result, Fire-heat pattern; Z QD: the standardized scores for upper-class variables according to Qi deficiency pattern; Z DP: the standardized scores for upper-class variables according to Dampness-phlegm pattern; Z YD: the standardized scores for upper-class variables according to Yin deficiency pattern; Z FH: the standardized scores for upper-class variables according to Fire-heat pattern.
Figure 1.
Process of grouping of explanatory variables and standardized scores generation. The mean and standard deviation of each upper-class variable were used to attain standardized scores, after which the misclassification types were analyzed. QD: Qi deficiency pattern; DP: Dampness-phlegm pattern; YD: Yin deficiency pattern; FH: Fire-heat pattern.
2.3.2. The Profile Graphs
With 12 misclassification types and 4 correct classification types categorized by the discriminant analysis, the profile graphs were drawn. Specifically, two of the 4 patterns were selected and the correct classification types and misclassification types for each pattern were collected from the TKM physicians and divided. For instance, as described in Figure 2, patients applicable to two misclassification types (FHQD and QDFH) were grouped together. Next, the upper-class variable scores of each patient were used to draw a profile plot. At this point, it was critical to arrange the pattern scores of correct classification on the edges and those of the other two pattern scores inside. The profile graphs of the misclassification types (FHQD, QDFH, etc.) and the correct classification types (e.g., FH, QD, YD, and DP) are depicted in Figures 2 –7 and the relevant statistics are in Table 3. As illustrated in Figures 2–7, two misclassification types demonstrate a U-shaped pattern and correct classification types an L-shaped or flipped-L-shaped pattern.
Figure 2.

The profiles graphs of the FH and QD. Z FH: the standardized scores for upper-class variables according to Fire-heat pattern; Z QD: the standardized scores for upper-class variables according to Qi deficiency pattern; Z DP: the standardized scores for upper-class variables according to Dampness-phlegm pattern; Z YD: the standardized scores for upper-class variables according to Yin deficiency pattern; OK: the correct classification types.
Figure 3.

The profiles graphs of the QD and YD. Z FH: the standardized scores for upper-class variables according to Fire-heat pattern; Z QD: the standardized scores for upper-class variables according to Qi deficiency pattern; Z DP: the standardized scores for upper-class variables according to Dampness-phlegm pattern; Z YD: the standardized scores for upper-class variables according to Yin deficiency pattern; OK: the correct classification types.
Figure 4.

The profiles graphs of the DP and YD. Z FH: the standardized scores for upper-class variables according to Fire-heat pattern; Z QD: the standardized scores for upper-class variables according to Qi deficiency pattern; Z DP: the standardized scores for upper-class variables according to Dampness-phlegm pattern; Z YD: the standardized scores for upper-class variables according to Yin deficiency pattern; OK: the correct classification types.
Figure 5.

The profiles graphs of the FH and YD. Z FH: the standardized scores for upper-class variables according to Fire-heat pattern; Z QD: the standardized scores for upper-class variables according to Qi deficiency pattern; Z DP: the standardized scores for upper-class variables according to Dampness-phlegm pattern; Z YD: the standardized scores for upper-class variables according to Yin deficiency pattern; OK: the correct classification types.
Figure 6.

The profiles graphs of the DP and QD. Z FH: the standardized scores for upper-class variables according to Fire-heat pattern; Z QD: the standardized scores for upper-class variables according to Qi deficiency pattern; Z DP: the standardized scores for upper-class variables according to Dampness-phlegm pattern; Z YD: the standardized scores for upper-class variables according to Yin deficiency pattern; OK: the correct classification types.
Figure 7.

The profiles graphs of the DP and FH. Z FH: the standardized scores for upper-class variables according to Fire-heat pattern; Z QD: the standardized scores for upper-class variables according to Qi deficiency pattern; Z DP: the standardized scores for upper-class variables according to Dampness-phlegm pattern; Z YD: the standardized scores for upper-class variables according to Yin deficiency pattern; OK: the correct classification types.
Table 3.
Summary of Z scores according to the profile graphs for PI classification types.
| Classification types | N | Z scores (mean ± SE) | ||||
|---|---|---|---|---|---|---|
| Z QD | Z DP | Z YD | Z FH | |||
| FH, QD classification types | FHQD | 46 | 0.907 ± 0.137 | −0.494 ± 0.110 | −0.233 ± 0.109 | 0.097 ± 0.120 |
| OK(FH) | 652 | −0.620 ± 0.025 | −0.425 ± 0.031 | 0.028 ± 0.038 | 0.919 ± 0.042 | |
| OK(QD) | 498 | 1.189 ± 0.043 | −0.372 ± 0.033 | −0.223 ± 0.035 | −0.637 ± 0.030 | |
| QDFH | 36 | 0.075 ± 0.130 | −0.500 ± 0.107 | −0.373 ± 0.118 | 0.560 ± 0.175 | |
| Total | 1232 | 0.189 ± 0.034 | −0.408 ± 0.022 | −0.095 ± 0.025 | 0.249 ± 0.034 | |
|
| ||||||
| QD, YD classification types | QDYD | 95 | 0.513 ± 0.103 | −0.487 ± 0.072 | 0.808 ± 0.099 | −0.300 ± 0.078 |
| OK(QD) | 498 | 1.189 ± 0.043 | −0.372 ± 0.033 | −0.223 ± 0.035 | −0.637 ± 0.030 | |
| OK(YD) | 276 | −0.031 ± 0.045 | −0.579 ± 0.046 | 1.159 ± 0.068 | −0.135 ± 0.048 | |
| YDQD | 70 | 0.914 ± 0.102 | −0.493 ± 0.090 | 0.240 ± 0.105 | −0.337 ± 0.085 | |
| Total | 939 | 0.742 ± 0.034 | −0.454 ± 0.024 | 0.322 ± 0.036 | −0.433 ± 0.025 | |
|
| ||||||
| DP, YD classification types | DPYD | 69 | 0.118 ± 0.097 | −0.060 ± 0.101 | 0.903 ± 0.139 | 0.085 ± 0.127 |
| OK(DP) | 783 | −0.323 ± 0.027 | 0.883 ± 0.034 | −0.443 ± 0.024 | −0.336 ± 0.026 | |
| OK(YD) | 276 | −0.031 ± 0.045 | −0.579 ± 0.046 | 1.159 ± 0.068 | −0.135 ± 0.048 | |
| YDDP | 55 | −0.229 ± 0.090 | 0.529 ± 0.116 | −0.153 ± 0.090 | −0.336 ± 0.092 | |
| Total | 1183 | −0.225 ± 0.022 | 0.471 ± 0.032 | 0.022 ± 0.032 | −0.264 ± 0.023 | |
|
| ||||||
| FH, YD classification types | FHYD | 127 | −0.291 ± 0.069 | −0.597 ± 0.063 | 0.956 ± 0.108 | 0.184 ± 0.087 |
| OK(FH) | 652 | −0.620 ± 0.025 | −0.425 ± 0.031 | 0.028 ± 0.038 | 0.919 ± 0.042 | |
| OK(YD) | 276 | −0.031 ± 0.045 | −0.579 ± 0.046 | 1.159 ± 0.068 | −0.135 ± 0.048 | |
| YDFH | 77 | −0.393 ± 0.077 | −0.525 ± 0.086 | 0.133 ± 0.095 | 0.568 ± 0.093 | |
| Total | 1132 | −0.424 ± 0.022 | −0.489 ± 0.023 | 0.415 ± 0.034 | 0.555 ± 0.032 | |
|
| ||||||
| DP, QD classification types | DPQD | 118 | 1.004 ± 0.071 | −0.001 ± 0.071 | −0.312 ± 0.070 | −0.492 ± 0.064 |
| OK(DP) | 783 | −0.323 ± 0.027 | 0.883 ± 0.034 | −0.443 ± 0.024 | −0.336 ± 0.026 | |
| OK(QD) | 498 | 1.189 ± 0.043 | −0.372 ± 0.033 | −0.223 ± 0.035 | −0.637 ± 0.030 | |
| QDDP | 115 | 0.111 ± 0.070 | 0.605 ± 0.071 | −0.395 ± 0.067 | −0.456 ± 0.069 | |
| Total | 1514 | 0.311 ± 0.028 | 0.380 ± 0.027 | −0.357 ± 0.019 | −0.456 ± 0.018 | |
|
| ||||||
| DP, FH classification types | DPFH | 142 | −0.565 ± 0.047 | −0.113 ± 0.059 | −0.251 ± 0.069 | 0.648 ± 0.076 |
| OK(DP) | 783 | −0.323 ± 0.027 | 0.883 ± 0.034 | −0.443 ± 0.024 | −0.336 ± 0.026 | |
| OK(FH) | 652 | −0.620 ± 0.025 | −0.425 ± 0.031 | 0.028 ± 0.038 | 0.919 ± 0.042 | |
| FHDP | 147 | −0.426 ± 0.054 | 0.610 ± 0.064 | −0.114 ± 0.068 | 0.069 ± 0.061 | |
| Total | 1724 | −0.464 ± 0.017 | 0.283 ± 0.026 | −0.221 ± 0.020 | 0.254 ± 0.026 | |
PI: pattern identification; QD: Qi deficiency pattern; DP: Dampness-phlegm pattern; YD: Yin deficiency pattern; FH: Fire-heat pattern; OK: the correct classification types; Z QD: the standardized scores for upper-class variables according to Qi deficiency pattern; Z DP: the standardized scores for upper-class variables according to Dampness-phlegm pattern; Z YD: the standardized scores for upper-class variables according to Yin deficiency pattern; Z FH: the standardized scores for upper-class variables according to Fire-heat pattern.
2.3.3. Derived Four Measures (D, R, S, and C Scores)
In the profile graphs, misclassification observations in most of the 6 cases displayed a bathtub or U-shaped pattern since pattern scores corresponding to actual patterns would be relatively high and the misclassification of a pattern is highly probable if relatively higher scores were observed in the other pattern. In the meantime, correct classification observations showed an L-shaped (or flipped-L-shaped) pattern. Although actual patterns are unknown due to the lack of direct diagnoses from TKM physicians, if a new patient establishes a bathtub-shaped profile simply with 4 upper-class pattern scores (obligatory two high scores and two low scores), this patient is likely to be misclassified through the future discriminant model. Criteria were designed to assess how close a pattern score profile would be to a bathtub shape through various arrangements and simple calculations of the four pattern scores and applied to already discriminated data. By doing so, comparison was conducted to investigate how much misclassification was estimated and how much discrimination rates improved when the estimated misclassification observations were eliminated beforehand.
(1) D Score. Analyzing correct classification and misclassification types with profile graphs, the D value was derived considering that a difference between the maximum value Z (1) and the second-largest value Z (2) of misclassification was smaller than that of correct classification, and classification by the value was attempted (Figure 8). Namely, under the hypothesis that the smaller the D value was, the closer the profile graph was to a bathtub shape and the higher the probability of the respective observations corresponding to misclassification was, the D values were applied to the clinical stroke data.
Figure 8.
Derived D values based on the pattern analysis of the profile graphs. Under the hypothesis that the smaller the D value was, the closer the profile graph was to a bathtub (or U) shape, and the higher the probability of the respective observations corresponding to misclassification was.
After sorting the data by the D value in descending order and investigating the frequency and rates of misclassification over 10% intervals (Figure 9), the misclassification probability of the 10% (N = 331) filtered data reached 40.79% (N m = 135, Meanm = 0.058), which was 7.61% p higher than the previously calculated misclassification probability (33.18%) of the total data. The misclassification probabilities of the data filtered from 20% to 90% were lower than that of the 10% filtered data but higher than that of the total data (33.18%). In the data filtered at 10%, 20%, 40%, and 50%, average D values of the misclassifications and correct classifications were barely different from each other, even though the average D values of the misclassifications tended to be higher than those of the correct classifications. In the other data groups, the average D values of the correct classifications were higher than those of the misclassifications (Table 4). Meanwhile, examining the frequencies and rates of the correct classifications in the data selected for D values, the misclassification probability of the correct classifications in the 90% (N = 2975) selected data recorded 67.66% (N c = 2013, % of N m = 32.34%), which was 0.86% p higher than those of the previously calculated correct classifications (66.8%) of the total data. In the 80% (N = 2645) selected data, the misclassification probabilities of correct classifications reached 68.28% (N c = 1806, % of N m = 31.72%), which was 0.62% p higher than those in the 90% selected data. In the data selected from 70% to 10%, the correct classifications gradually increased (Table 4).
Figure 9.
Data filtering and selection method. Data were ranged according to each four measures (D, R, S, and C values) in descending or ascending order by increasing data by 10% intervals.
Table 4.
Types of classifications distribution of filtered/selected data by D value.
| Filtered% | Type of classifications distribution of filtered data by D value | Selected% | Type of classifications distribution of selected data by D value | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N m (%) | N c (%) | N t (%) | Meanm | Meanc | Meant | N m (%) | N c (%) | N t (%) | Meanm | Meanc | Meant | ||
| 10% | 135 (40.79) | 196 (59.21) | 331 (100) | 0.058 | 0.053 | 0.055 | 10% | 42 (12.69) | 289 (87.31) | 331 (100) | 2.399 | 2.531 | 2.515 |
| 20% | 258 (39.03) | 403 (60.97) | 661 (100) | 0.124 | 0.125 | 0.125 | 20% | 119 (18.00) | 542 (82.00) | 661 (100) | 1.913 | 2.112 | 2.076 |
| 30% | 382 (38.51) | 610 (61.49) | 992 (100) | 0.184 | 0.184 | 0.184 | 30% | 232 (23.39) | 760 (76.61) | 992 (100) | 1.585 | 1.870 | 1.804 |
| 40% | 525 (39.71) | 797 (60.29) | 1322 (100) | 0.252 | 0.242 | 0.246 | 40% | 338 (25.57) | 984 (74.43) | 1322 (100) | 1.397 | 1.668 | 1.599 |
| 50% | 647 (39.14) | 1006 (60.86) | 1653 (100) | 0.319 | 0.319 | 0.319 | 50% | 450 (27.22) | 1203 (72.78) | 1653 (100) | 1.244 | 1.508 | 1.436 |
| 60% | 759 (38.26) | 1225 (61.74) | 1984 (100) | 0.387 | 0.402 | 0.396 | 60% | 572 (28.83) | 1412 (71.17) | 1984 (100) | 1.107 | 1.375 | 1.298 |
| 70% | 865 (37.38) | 1449 (62.62) | 2314 (100) | 0.460 | 0.492 | 0.480 | 70% | 715 (30.90) | 1599 (69.10) | 2314 (100) | 0.973 | 1.265 | 1.175 |
| 80% | 978 (36.98) | 1667 (63.02) | 2645 (100) | 0.550 | 0.593 | 0.578 | 80% | 839 (31.72) | 1806 (68.28) | 2645 (100) | 0.875 | 1.154 | 1.065 |
| 90% | 1055 (35.46) | 1920 (64.54) | 2975 (100) | 0.630 | 0.731 | 0.695 | 90% | 962 (32.34) | 2013 (67.66) | 2975 (100) | 0.788 | 1.055 | 0.969 |
N m: number of misclassification types; N c: number of correct classification types; N t: number of total classification types; Meanm: mean of misclassification type; Meanc: mean of correct classification type; Meant: mean of total classification type.
(2) R Score. Analyzing correct classification and misclassification types with profile graphs, the R value was derived considering that a difference between the maximum value Z (1) and the minimum value Z (4) of misclassification was smaller than that of correct classification, and classification by the value was attempted (Figure 10). Namely, under the hypothesis that the larger the R value was, the closer the profile graph was to an L-shaped or flipped-L-shaped pattern, and the higher the probability of the respective observations corresponding to correct classification was the R values were applied to the clinical stroke data in the same way as previously (Table 5).
Figure 10.
Derived R values based on the pattern analysis of the profile graphs. Under the hypothesis that the larger the R value was, the closer the profile graph was to an L-shaped or flipped-L-shaped pattern, the higher the probability of the respective observations corresponding to correct classification was.
Table 5.
Types of classifications distribution of filtered/selected data by R value.
| Filtered% | Type of classifications distribution of filtered data by R value | Selected% | Type of classifications distribution of selected data by R value | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N m (%) | N c (%) | N t (%) | Meanm | Meanc | Meant | N m (%) | N c (%) | N t (%) | Meanm | Meanc | Meant | ||
| 10% | 135 (40.79) | 196 (59.21) | 331 (100) | 0.674 | 0.677 | 0.676 | 10% | 65 (19.64) | 266 (80.36) | 331 (100) | 3.790 | 3.882 | 3.864 |
| 20% | 261 (39.49) | 400 (60.51) | 661 (100) | 0.847 | 0.864 | 0.858 | 20% | 160 (24.21) | 501 (75.79) | 661 (100) | 3.247 | 3.418 | 3.376 |
| 30% | 400 (40.32) | 592 (59.68) | 992 (100) | 0.991 | 0.990 | 0.990 | 30% | 254 (25.60) | 738 (74.40) | 992 (100) | 2.967 | 3.116 | 3.078 |
| 40% | 507 (38.35) | 815 (61.65) | 1322 (100) | 1.099 | 1.130 | 1.118 | 40% | 371 (28.06) | 951 (71.94) | 1322 (100) | 2.719 | 2.905 | 2.853 |
| 50% | 623 (37.69) | 1030 (62.31) | 1653 (100) | 1.212 | 1.252 | 1.234 | 50% | 474 (28.68) | 1179 (71.32) | 1653 (100) | 2.542 | 2.710 | 2.662 |
| 60% | 726 (36.59) | 1258 (63.41) | 1984 (100) | 1.310 | 1.369 | 1.347 | 60% | 590 (29.74) | 1394 (70.26) | 1984 (100) | 2.377 | 2.556 | 2.503 |
| 70% | 843 (36.43) | 1471 (63.57) | 2314 (100) | 1.431 | 1.486 | 1.466 | 70% | 697 (30.12) | 1617 (69.88) | 2314 (100) | 2.243 | 2.411 | 2.360 |
| 80% | 937 (35.43) | 1708 (64.57) | 2645 (100) | 1.537 | 1.623 | 1.593 | 80% | 836 (31.61) | 1809 (68.39) | 2645 (100) | 2.080 | 2.288 | 2.222 |
| 90% | 1032 (34.69) | 1943 (65.31) | 2975 (100) | 1.660 | 1.777 | 1.736 | 90% | 962 (32.34) | 2013 (67.66) | 2975 (100) | 1.942 | 2.162 | 2.091 |
N m: number of misclassification types; N c: number of correct classification types; N t: number of total classification types; Meanm: mean of misclassification types; Meanc: mean of correct classification types; Meant: mean of total classification types.
(3) S Score. Analyzing correct classification and misclassification types with profile graphs, the S value was derived considering that the second-largest value Z (2) of misclassification was higher than that of correct classification, and classification by the value was attempted (Figure 11). Namely, under the hypothesis that the larger the S value was, the closer the profile graph was to a bathtub (or U) shape and the higher the probability of the respective observations corresponding to misclassification was, the S values were applied to the clinical stroke data. In this case, the frequency and rates of misclassification over 10% intervals were investigated after sorting the data by the S value in ascending order (Table 6).
Figure 11.
Derived S values based on the pattern analysis of the profile graphs. Under the hypothesis that the larger the S value was, the closer the profile graph was to a bathtub (or U) shape, the higher the probability of the respective observations corresponding to misclassification was.
Table 6.
Types of classifications distribution of filtered/selected data by S value.
| Filtered% | Type of classifications distribution of filtered data by S value | Selected% | Type of classifications distribution of selected data by S value | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N m (%) | N c (%) | N t (%) | Meanm | Meanc | Meant | N m (%) | N c (%) | N t (%) | Meanm | Meanc | Meant | ||
| 10% | 120 (36.25) | 211 (63.75) | 331 (100) | 5.587 | 5.763 | 5.699 | 10% | 100 (30.21) | 231 (69.79) | 331 (100) | −1.678 | −1.625 | −1.641 |
| 20% | 234 (35.40) | 427 (64.60) | 661 (100) | 4.620 | 4.673 | 4.654 | 20% | 205 (31.01) | 456 (68.99) | 661 (100) | −1.159 | −1.162 | −1.161 |
| 30% | 333 (33.57) | 659 (66.43) | 992 (100) | 4.051 | 3.975 | 4.000 | 30% | 312 (31.45) | 680 (68.55) | 992 (100) | −0.792 | −0.804 | −0.800 |
| 40% | 435 (32.90) | 887 (67.10) | 1322 (100) | 3.580 | 3.475 | 3.509 | 40% | 431 (32.60) | 891 (67.40) | 1322 (100) | −0.469 | −0.516 | −0.501 |
| 50% | 554 (33.51) | 1099 (66.49) | 1653 (100) | 3.126 | 3.085 | 3.099 | 50% | 543 (32.85) | 1110 (67.15) | 1653 (100) | −0.167 | −0.226 | −0.207 |
| 60% | 666 (33.57) | 1318 (66.43) | 1984 (100) | 2.768 | 2.731 | 2.743 | 60% | 662 (33.37) | 1322 (66.63) | 1984 (100) | 0.127 | 0.043 | 0.071 |
| 70% | 785 (33.92) | 1529 (66.08) | 2314 (100) | 2.405 | 2.411 | 2.409 | 70% | 764 (33.02) | 1550 (66.98) | 2314 (100) | 0.382 | 0.335 | 0.351 |
| 80% | 892 (33.72) | 1753 (66.28) | 2645 (100) | 2.106 | 2.093 | 2.097 | 80% | 863 (32.63) | 1782 (67.37) | 2645 (100) | 0.649 | 0.642 | 0.644 |
| 90% | 997 (33.51) | 1978 (66.49) | 2975 (100) | 1.814 | 1.777 | 1.789 | 90% | 977 (32.84) | 1998 (67.16) | 2975 (100) | 0.993 | 0.963 | 0.973 |
N m: number of misclassification types; N c: number of correct classification types; N t: number of total classification types; Meanm: mean of misclassification type; Meanc: mean of correct classification type; Meant: mean of total classification type.
(4) C Score. Analyzing correct classification and misclassification types with profile graphs, the C value was derived considering that a difference between the sum of Z (1) and Z (2) and the sum of Z (3) and Z (4) of misclassification was larger than that of correct classification, and classification by the value was attempted (Figure 12). Namely, under the hypothesis that the larger the C value was, the closer the profile graph was to a bathtub (or U) shape, the higher the probability of the respective observations corresponding to misclassification was, the C values were applied to the clinical stroke data in the same way as previously (Table 7).
Figure 12.
Derived C values based on the pattern analysis of the profile graphs. Under the hypothesis that the larger the C value was, the closer the profile graph was to a bathtub (or U) shape, the higher the probability of the respective observations corresponding to misclassification was.
Table 7.
Types of classifications distribution of filtered/selected data by C value.
| Filtered% | Type of classifications distribution of filtered data by C value | Selected% | Type of classifications distribution of selected data by C value | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N m (%) | N c (%) | N t (%) | Meanm | Meanc | Meant | N m (%) | N c (%) | N t (%) | Meanm | Meanc | Meant | ||
| 10% | 141 (42.60) | 190 (57.40) | 331 (100) | 0.846 | 0.845 | 0.845 | 10% | 84 (25.38) | 247 (74.62) | 331 (100) | 5.037 | 5.134 | 5.110 |
| 20% | 272 (41.15) | 389 (58.85) | 661 (100) | 1.066 | 1.085 | 1.078 | 20% | 177 (26.78) | 484 (73.22) | 661 (100) | 4.345 | 4.463 | 4.431 |
| 30% | 396 (39.92) | 596 (60.08) | 992 (100) | 1.240 | 1.267 | 1.256 | 30% | 273 (27.52) | 719 (72.48) | 992 (100) | 3.928 | 4.040 | 4.009 |
| 40% | 516 (39.03) | 806 (60.97) | 1322 (100) | 1.392 | 1.426 | 1.413 | 40% | 370 (27.99) | 952 (72.01) | 1322 (100) | 3.636 | 3.737 | 3.708 |
| 50% | 619 (37.45) | 1034 (62.55) | 1653 (100) | 1.520 | 1.588 | 1.562 | 50% | 478 (28.92) | 1175 (71.08) | 1653 (100) | 3.378 | 3.498 | 3.463 |
| 60% | 727 (36.64) | 1257 (63.36) | 1984 (100) | 1.664 | 1.746 | 1.716 | 60% | 581 (29.28) | 1403 (70.72) | 1984 (100) | 3.161 | 3.281 | 3.246 |
| 70% | 824 (35.61) | 1490 (64.39) | 2314 (100) | 1.799 | 1.911 | 1.871 | 70% | 701 (30.29) | 1613 (69.71) | 2314 (100) | 2.945 | 3.098 | 3.051 |
| 80% | 920 (34.78) | 1725 (65.22) | 2645 (100) | 1.941 | 2.082 | 2.033 | 80% | 825 (31.19) | 1820 (68.81) | 2645 (100) | 2.746 | 2.928 | 2.871 |
| 90% | 1013 (34.05) | 1962 (65.95) | 2975 (100) | 2.105 | 2.285 | 2.224 | 90% | 956 (32.13) | 2019 (67.87) | 2975 (100) | 2.548 | 2.769 | 2.698 |
N m: number of misclassification types; N c: number of correct classification types; N t: number of total classification types; Meanm: mean of misclassification type; Meanc: mean of correct classification type; Meant: mean of total classification type.
3. Results
3.1. Estimated Misclassification Probability and Discrimination Rate according to Proposed Four Scores
Table 8 summarizes the misclassification probabilities after the data was sorted according to the 4 criteria and investigating the misclassification probability over 10% intervals. If the data were filtered 10–20%, the C score marked 42.60% and 41.15%, respectively, indicating the highest misclassification probability among the criteria. If the data were filtered 30%, the R score stands at 40.32% and the C score at 39.92%. If the data were filtered 40~90%, the misclassification probability of the D score was the highest.
Table 8.
Misclassification rate distribution of the filtered data according to four measures.
| Filtered% | N | D | R | S | C |
|---|---|---|---|---|---|
| 10% | 331 | 40.79 | 40.79 | 36.25 | 42.60 |
| 20% | 661 | 39.03 | 39.49 | 35.40 | 41.15 |
| 30% | 992 | 38.51 | 40.32 | 33.57 | 39.92 |
| 40% | 1322 | 39.71 | 38.35 | 32.90 | 39.03 |
| 50% | 1653 | 39.14 | 37.69 | 33.51 | 37.45 |
| 60% | 1984 | 38.26 | 36.59 | 33.57 | 36.64 |
| 70% | 2314 | 37.38 | 36.43 | 33.92 | 35.61 |
| 80% | 2645 | 36.98 | 35.43 | 33.72 | 34.78 |
| 90% | 2975 | 35.46 | 34.69 | 33.51 | 34.05 |
For the data previously selected by 4 scores (D, R, S, and C), discrimination rates were compared. Having the 4 QD, DP, YD, and FH patterns set as reaction variables for the entire clinical stroke data and 44 clinical indices of the Korean Standard PI for Stroke-3 as independent variables, the discriminant analysis was conducted to calculate the discrimination accuracy (Table 9). If the data were selected at 90%, the discrimination rate of the D score increased to 68.2%, which was the largest increase among the four scores. If the data were selected at 80%, the C score reached 69.0%, making the largest increase. If the data were selected at 70%, the R score posted 70.0%, demonstrating the largest increase in the discrimination rate among the four scores. If the data were selected at 60–10%, the D score recorded the largest increase in the discrimination rate among the four scores.
Table 9.
Discriminant rate distribution of the selected data according to four measures.
| Discriminant rate | |||||
|---|---|---|---|---|---|
| N | D | R | S | C | |
| 100% | 3306 | 66.82 | 66.82 | 66.82 | 66.82 |
| 90% | 2975 | 68.24 (+1.42) | 67.63 (+0.81) | 66.92 (+0.10) | 67.53 (+0.71) |
| 80% | 2645 | 68.62 (+0.38) | 68.47 (+0.84) | 67.15 (+0.23) | 69.04 (+1.51) |
| 70% | 2314 | 69.53 (+0.91) | 69.97 (+1.50) | 66.98 (−0.17) | 69.49 (+0.45) |
| 60% | 1984 | 71.98 (+2.45) | 70.82 (+0.85) | 66.94 (−0.04) | 71.22 (+1.73) |
| 50% | 1653 | 73.32 (+1.34) | 73.08 (+2.26) | 69.03 (+2.09) | 71.81 (+0.59) |
| 40% | 1322 | 75.34 (+2.02) | 74.28 (+1.20) | 68.68 (−0.35) | 73.75 (+1.94) |
| 30% | 992 | 77.32 (+1.98) | 76.81 (+2.53) | 70.26 (+1.58) | 75.81 (+2.06) |
| 20% | 661 | 83.36 (+6.04) | 80.94 (+4.13) | 73.83 (+3.57) | 77.61 (+1.80) |
| 10% | 331 | 89.12 (+5.76) | 87.01 (+6.07) | 75.83 (+2.00) | 82.78 (+5.17) |
3.2. Similarities between Secondary Curvature Function and C Score
3.2.1. Curvature Created by Z (1), Z (2), Z (3), and Z (4) Scores
First of all, assume four scores, Z (1), Z (2), Z (3), and Z (4), as dependent variables observed in the x values (e.g., 1, 2, 3, and 4) having equal intervals, as shown in the profile graphs. In addition, assume that Z (1) is a dependent variable when x = 1, Z (2) when x = 4, Z (3) when x = 2, and Z (4) when x = 3. This assumption is illustrated in Figure 13.
Figure 13.

Curvature created by Z scores (Z (1), Z (2), Z (3), and Z (4)). Z (1), Z (2), Z (3), and Z (4), as dependent variables observed in the x values having equal intervals. Z (1) is a dependent variable when x = 1, Z (2) when x = 4, Z (3) when x = 2, and Z (4) when x = 3.
3.2.2. Estimation of Secondary Curvature
Considering the quadratic curve regression model passing through the four points (1, Z (1)), (2, Z (3)), (3, Z (4)), and (4, Z (2)), Y = β 0 + β 1 X + β 2 X 2 + ϵ, the coefficient of β 2 is the secondary curvature value that we wanted. Namely, the larger the β 2 is, the stronger the bathtub shape becomes, boosting the misclassification probability. Assuming that the estimates of β 0, β 1, and β 2 are b 0, b 1, and b 2, these estimates satisfy the following normal equation [21]:
| (1) |
Here
| (2) |
According to Neter et al. [21], a general two-variable regression model,
| (3) |
has a normal equation
| (4) |
which is equal to
| (5) |
and the following normal equations,
| (6) |
are obtained. In this case, the equations are
| (7) |
Now, if
| (8) |
the normal equations should be equal to
| (9) |
and, ultimately, we obtain
| (10) |
Certainly, the values of b 0 and b 1 may be obtained but omitted herein because they are meaningless. In (10), Z (1) and Z (2) are symmetric, and so are Z (3) and Z (4). Namely, when the curvature creates the largest profile with the 4 points, the curvature will not have any changes even if the largest and the second largest scores were switched. This also holds true for the smallest and the second smallest scores.
In the meantime, the b 2 value equals 1/4 of the C score among the 4 criteria obtained. Namely, the previously used C score was equal to Z (3) and Z (4) was simply subtracted from the total of Z (1) and Z (2), which was the same as the secondary curvature created by the 4 scores.
4. Discussion
In TKM, a PI diagnostic system—one of the core technologies in the diagnosis and treatment of oriental medicine—is used to determine the cause and nature of a disease, treatment methods, and treatment drugs for the patients [5–7]. However, the PI diagnosis holds limited objectivity and reproducibility due to the lack of standardized measurement indices. Objectification problems have always arisen with respect to personal deviations among TKM physicians. As the demand for the reestablishment and development of TKM has increased, studies on the establishment of a scientific basis for and the standardization of PI have been actively conducted [7, 12].
In this study, the clinical data of PI diagnosis for stroke were used to analyze and quantify the profile patterns of the misclassification types by applying the proposed scores to the comparative analysis. This was intended to boost the correct classification of objects by detecting those objects with a high probability of actual misclassification and deferring discrimination. Misclassification types were discerned by a discriminant analysis on the actual clinical data of PI diagnosis for stroke and quantified by a profile pattern analysis. The proposed criteria of each standard were applied to the data already discriminated by the previous discriminant analysis in order to compare how well the misclassification had been estimated and how much the discrimination rate had improved when the estimated misclassification observations were removed in advanced. Particularly, the C score delivered the same results as those from the discrimination of misclassification observations through a secondary curvature. Going forward, the following studies must be performed. First of all, 4 criteria to estimate misclassification were proposed in this study and applied to the actual clinical data, producing the possibility of better estimation of partial misclassification. Nonetheless, it was difficult to notably enhance discrimination rates and additional research appears to be necessary. In addition, 4 pattern groups with a different sample size were used in this study. Hence, the effects of different sample sizes need to be investigated.
Supplementary Material
This summary is the Korean Standard PI for Stroke-3. It consists of 44 clinical indices and each clinical index belongs to its respective PI types (the Fire-Heat pattern, Yin-Deficiency pattern, Qi-Deficiency pattern, and Dampness-Phlegm pattern).
Acknowledgment
This research was supported by a grant from the Korea Institute of Oriental Medicine (K13130, K16111).
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
References
- 1.Social Statistics Office. 2012 Statistics on the Aged. Seoul, Republic of Korea: Korean Statistical Information Service; 2012. [Google Scholar]
- 2.Jeong H. S. Progress of Aging Population and Policy Responses in Japan. Seoul, South Korea: Research Department, The Bank of Korea; 2007 (Korean), http://public.bokeducation.or.kr/ecostudy/publishList.do?&bbsId=22&mode=view&contentId=2655&cPage=19. [Google Scholar]
- 3.Social Statistics Office. Statistics on the Cause of Death. Seoul, Republic of Korea: Korean Statistical Information Service; 2010. [Google Scholar]
- 4.Division of Statistical Analysis in the Institute of Health Insurance Policy. The Status of Health Insurance Benefits of the Inpatients Frequent Diseases on Classified Subdivision of Diseases in Traditional Korean Medicine 2009. Seoul, Republic of Korea: The National Health Insurance Corporation; 2010. [Google Scholar]
- 5.WHO Western Pacific Region. WHO International Standard Terminologies on Traditional Medicine in the Western Pacific Region. Geneva, Switzerland: World Health Organization (WHO); 2007. [Google Scholar]
- 6.Shin S. S., Choi S. M., Shin M. K., Yang K. S. A Study of Standardization of Diagnoses and Diagnostic Requirements in Traditional Korean Medicine III. Seoul, Republic of Korea: Korea Institute of Oriental Medicine; 1997. [Google Scholar]
- 7.Park T.-Y., Lee J. A., Cha M. H., et al. The fundamental study for the Standardization and Objectification of Pattern Identification in traditional Korean medicine for stroke (SOPI-Stroke): an overview of phase I. European Journal of Integrative Medicine. 2012;4(2):e125–e131. doi: 10.1016/j.eujim.2012.01.003. [DOI] [Google Scholar]
- 8.Ko M. M., Lee J. A., Yun K. J., You S. S., Lee M. S. Perception of pattern identification in traditional medicine: a survey of Korean medical practitioners. Journal of Traditional Chinese Medicine. 2014;34(3):369–372. doi: 10.1016/s0254-6272(14)60104-7. [DOI] [PubMed] [Google Scholar]
- 9.Kim J. K., Seol I. C., Lee I., Oh K., Yu B. C., Choi S. M. Report on the Korean standard differentiation of the symptoms and signs for the stoke-1. Korean Journal of Oriental Physiology & Pathology. 2006;20:229–234. [Google Scholar]
- 10.Go H. Y., Kim Y. K., Kang B. K., et al. Report on the Korean standard differentiation of the symptoms and signs for the stoke-2. Korean Journal of Oriental Physiology & Pathology. 2006;20:1789–1791. [Google Scholar]
- 11.Lee J. A., Lee J. S., Ko M. M., et al. Report on the Korean standard pattern identifications for stroke-III. The Journal of Korean Oriental Internal Medicine. 2011;32:232–242. [Google Scholar]
- 12.Lee J. A., Cha M. H., Kang B.-K., et al. Fundamental study for the standardization and objectification of pattern identification in traditional Korean medicine for stroke (SOPI-Stroke): an overview of the second and third stages. European Journal of Integrative Medicine. 2015;7(4):378–383. doi: 10.1016/j.eujim.2015.05.006. [DOI] [Google Scholar]
- 13.Kang B.-K., Moon T.-W., Lee J. A., Park T.-Y., Ko M. M., Lee M. S. The fundamental study for the standardisation and objectification of pattern identification in traditional Korean medicine for stroke (SOPI-Stroke): development and interobserver agreement of the Korean standard pattern identification for stroke (K-SPI-Stroke) tool. European Journal of Integrative Medicine. 2012;4(2):e133–e139. doi: 10.1016/j.eujim.2012.01.002. [DOI] [Google Scholar]
- 14.Kim I. C., Cha M. H., Kim D. M., et al. A functional promoter polymorphism −607G>C of WNT10B is associated with abdominal fat in Korean female subjects. Journal of Nutritional Biochemistry. 2011;22(3):252–258. doi: 10.1016/j.jnutbio.2010.02.002. [DOI] [PubMed] [Google Scholar]
- 15.Ko M. M., Park T.-Y., Lim J. H., Cha M. H., Lee M. S. WNT10B polymorphism in Korean stroke patients with Yin deficiency pattern. Evidence-Based Complementary and Alternative Medicine. 2012;2012:6. doi: 10.1155/2012/798131.798131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lim J. H., Ko M. M., Lee J. S., et al. Genetic association of SNPs located at PON1 gene with dampness and phlegm pattern identification among Korea stroke patients. The Journal of Korean Oriental Internal Medicine. 2010;31:752–762. [Google Scholar]
- 17.Kang B. K., Kang K. W., Park S. W., et al. The discrimination model for the pattern identification diagnosis of the stroke. Korean Journal of Oriental Internal Medicine. 2007;13(2):59–63. [Google Scholar]
- 18.Kang B. K., Lee J. S., Kim S. Y., et al. The discrimination model IV for syndrome differentiation diagnosis in stroke patients. Journal of the Korean Data Analysis Society. 2009;11(6):2995–3007. [Google Scholar]
- 19.Kang B. K., Ko M. M., Lee J. A., Park T. Y., Park Y. G. Discriminant modeling for pattern identification using the Korean standard PI for stroke-III. Korean Journal of Oriental Physiology & Pathology. 2011;25(6):1–6. [Google Scholar]
- 20.Lee J. A., Park T.-Y., Lee J. S., et al. Developing indicators of pattern identification in patients with stroke using traditional Korean medicine. BMC Research Notes. 2012;5, article 136 doi: 10.1186/1756-0500-5-136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Neter J., Wasserman W., Kutner M. H. Applied Linear Statistical Models. 3rd. Philadelphia, Pa, USA: CRC Press; 1990. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
This summary is the Korean Standard PI for Stroke-3. It consists of 44 clinical indices and each clinical index belongs to its respective PI types (the Fire-Heat pattern, Yin-Deficiency pattern, Qi-Deficiency pattern, and Dampness-Phlegm pattern).






