Table 1. Descriptive statistics for data sets used in this study.
Set | Discovery | Validation | Validation | Validation |
Study | TCGA [2] | Australian [14] | Japanese [1] | US [13] |
n | 503 | 240 | 260 | 134 |
GEO Array Type | GPL570* | GPL570 | GPL6480 | GPL96 |
GEO identifier | NA | GSE9891 | GSE32062 | GSE3149 |
Age (Range) | 59.7 (30–89) | 60.2 (23–80) | NA | NA |
Stage (% III, IV) | 92% | 5% | 100% | NA |
Grade (% 3,4) | 87% | 61% | 50% | NA |
Residual Disease (% None) | 23% | 27% | 40% | NA |
Neoadjuvant (% Yes) | 0% | 7% | 0% | NA |
Median Months OS | 44 (40–48) | 44 (38–57) | 60 (50–80) | 74 (35–98) |
Median Months PFS | 18 (15–19) | 15 (14–18) | 19 (18–23) | NA |
* TCGA uses 3 array types. Only the Affymetrix array was used for completeness.