Skip to main content
. 2017 Mar 16;10:16. doi: 10.1186/s12920-017-0253-6

Table 3.

Accuracy of predicting breast cancer risk

Dataset METABRIC discovery METABRIC validation MILLER (test)
Low risk High risk Low risk High risk Low risk High risk
Predicted low risk 157 (94%) 11 (7%) 107 (89%) 13 (11%) 68 (93%) 5 (7%)
Predicted medium risk 278 (68%) 134 (33%) 236 (70%) 99 (30%) 55 (68%) 26 (32%)
Predicted high risk 21 (35%) 39 (65%) 33 (42%) 46 (58%) 29 (62%) 18 (38%)

The confusion matrices show the performance of our decision tree on three datasets. The percentage of predicted cases with respect to the total number of predictions in each group is shown in parentheses. From a clinical standpoint, it is important to achieve a high precision (positive predictive value) for low risk cases (shown in bold) to confidently recommend a less agressive treatment regimen for a subset of patients. The probability of surviving more than 10 years is above 89% for the predicted low risk cases in all the three datasets (Additional file 1: Figure S11)