. 2007 Jun 5;3:114. doi: 10.1038/msb4100156

Table 2.

Predictive performance in fivefold cross-validation experiments

Classification scheme^a	CM or No CM	NB or No NB	CD or No CD	CMCD or No CMCD
Accuracy (%)	87±4	77±4	73±5	82±3
Significance respect to random—P(N)	1.1 × 10⁻³⁹	1.2 × 10⁻²⁴	2.4 × 10⁻¹⁶	6.4 × 10⁻²⁶
Default class	CM	No NB	No CD	CMCD
Majority class	CM	No NB	No CD	CMCD
No. of cases	533	496	513	634
Sensitivity (%)	93±4	86±4	78±7	92±3
Specificity (%)	83±3	78±2	78±3	85±2
Minority class
No. of cases	308	345	328	207
Sensitivity (%)	67±5	64±5	66±9	50±11
Specificity (%)	85±8	77±4	66±5	71±5

CM, central metabolism path compound; NB, nonbiodegradable path compound; CD, carbon dioxide path compound; CMCD, central metabolism and carbon dioxide path compounds. Accuracy is the percentage of compounds correctly classified. Sensitivity is the percentage of compounds correctly classified as belonging to a specific class, relative to the total number of cases of that particular class. Specificity is the percentage of compounds correctly classified as belonging to a specific class, relative to the total number of predictions for that particular class. Accuracy, sensitivity and specificity are indicated in the Table as the average±s.d. for the five iterations in the cross-validation experiment. The statistical significance of the observed difference in the performance of the c4.5-based system compared with a random prediction is indicated by the P(N) of a sign test.