Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2014 Oct 11;136(9):2091–2098. doi: 10.1002/ijc.29247

Dynamic classification using case-specific training cohorts outperforms static gene expression signatures in breast cancer

Balázs Győrffy 1,2,3,, Thomas Karn 4, Zsófia Sztupinszki 1, Boglárka Weltz 3, Volkmar Müller 5, Lajos Pusztai 6
PMCID: PMC4354298  PMID: 25274406

Abstract

The molecular diversity of breast cancer makes it impossible to identify prognostic markers that are applicable to all breast cancers. To overcome limitations of previous multigene prognostic classifiers, we propose a new dynamic predictor: instead of using a single universal training cohort and an identical list of informative genes to predict the prognosis of new cases, a case-specific predictor is developed for each test case. Gene expression data from 3,534 breast cancers with clinical annotation including relapse-free survival is analyzed. For each test case, we select a case-specific training subset including only molecularly similar cases and a case-specific predictor is generated. This method yields different training sets and different predictors for each new patient. The model performance was assessed in leave-one-out validation and also in 325 independent cases. Prognostic discrimination was high for all cases (n = 3,534, HR = 3.68, p = 1.67 E−56). The dynamic predictor showed higher overall accuracy (0.68) than genomic surrogates for Oncotype DX (0.64), Genomic Grade Index (0.61) or MammaPrint (0.47). The dynamic predictor was also effective in triple-negative cancers (n = 427, HR = 3.08, p = 0.0093) where the above classifiers all failed. Validation in independent patients yielded similar classification power (HR = 3.57). The dynamic classifier is available online at http://www.recurrenceonline.com/?q=Re_training. In summary, we developed a new method to make personalized prognostic prediction using case-specific training cohorts. The dynamic predictors outperform static models developed from single historical training cohorts and they also predict well in triple-negative cancers.

Keywords: breast cancer, gene expression, survival


What's New?

A large number of molecular variations are associated with breast cancer, which has inspired the development of a variety of multigene prognostic tests. However, because those tests are based on small, heterogenous data sets, with the intention of being applicable across breast cancers, their predictive performance is limited. This study describes a dynamic, case-specific approach for prognostic predictor discovery that is based on large, homogenous training sets of data. Training sets were selected for their molecular similarity to test cases. Prognostic accuracy was higher for the dynamic approach than for the genomic surrogates of common multigene assays.

In the last decade, numerous multigene prognostic tests have been developed for breast cancer.14 All of these assays were developed from relatively small training sets and the informative genes were selected from molecularly heterogeneous cancers. For example, the 70 genes included in MammaPrint were defined from a mixed cohort of 78 estrogen receptor (ER)-positive and -negative cases.4 The 21 genes in OncotypeDX were derived from 233 ER positive, lymph node negative patients and the 97 genes of the Genomic Grade Index (GGI) were selected from 64 ER positive tumors.3 Although the prognostic performance of these assays has been validated in independent cases, their predictive performance may not be optimal due to the small and heterogeneous training sets that were used for assay discovery.57

Twenty years after the development of the first gene expression arrays, several thousands of gene expression profiles with clinical annotation are now available for breast and other cancers.8,9 We hypothesize that by using much larger and molecularly more homogeneous training sets it should be possible to improve the accuracy of multi-gene prognostic signatures. To maximize the representativeness of the training cohort for the actual test case, we select the molecularly most similar cases from a large training case pool of >3,500 cases. We use this case-matched training cohort to develop a unique, case-specific predictor which is applied to the test case. This method defines a new training sub-cohort for each new case and selects a new set of informative genes. This dynamic classification process is conceptually similar to leave-one-out cross validation, except that predictors are not built from the entire training cohort but only from a subset with the greatest similarity to the test case. For case-specific training set selection, we did not use clinical information such as clinical stage or nodal status because the clinical variables have no strong molecular signatures associated with them and appear to be independent of molecular variables when predicting relapse-free survival.

We compared the predictive performance of the dynamic classifier to three commonly used static predictors including genomic surrogates of the 21-gene Oncotype DX recurrence score, the 70-gene MammaPrint prognostic signature and the 97-gene GGI. We also built an internet-based dynamic prognostic classification tool that could be used to classify new cases and independently validate our method.

Material and Methods

Database construction

In order to establish the largest possible pool of potential training cases for predictor building, we assembled all breast cancer gene expression datasets published in GEO (http://www.ncbi.nlm.nih.gov/geo/) that had clinical annotation and were generated using the HG-U133_A and HG-U133_Plus_2.0 arrays. We identified a total of 6,197 cases in 25 datasets. We have performed a quality check for all arrays as described previously.10 We removed duplicate samples (n = 1,418)—when multiple GEO entries for the same case existed we retained the first published copy of an array.11 The raw .CEL files were MAS5 normalized in R (http://www.r-project.org) using the affy library.12 MAS5 was used because it performed among the best normalization methods compared to RT-PCR measurements in our previous study.13 For predictor building only probe sets that were measured by both Affymetrix arrays (n = 22,277) were used. We also performed a second scaling normalization to set the average expression of each array to 1,000 to reduce batch effects [the value of 1,000 was chosen as this is close to the average of the mean expression using factory-default normalization settings for HG-U133_A (∼700) and HG-U133_Plus_2.0 arrays (∼1,400)]. Only probe sets with a maximal expression value of over 1,000 in at least one sample were retained for predictor building. For genes targeted by multiple probe sets only the JetSet best probe set14 was used. The final number of probe sets/genes included in the training database pool for each case was n = 9,886.

Selection of case-specific training subset and molecular classification

To select samples for model building (training subset), we identified cases that were most similar to the test case by computing Euclidean distance with the “dist” function in R to yield a global similarity matrix using the gene expression data for all genes. This distance is computed between the test case and each of the samples in the database. We ranked cases by this similarity metric and to study the effect of training set size on predictor performance we built predictors from the top 100/200/300/400/500 cases most similar to the test case.

Informative genes were selected for predictor model building by performing a univariate Kaplan–Meier survival analysis for relapse-free survival for each gene using the median expression values as cutoff.15 Multivariate analysis was not performed at this stage due to many missing values of clinically important variables (grade, nodal status, etc.). Genes were ranked by p value and hazard ratio and the average expression of the top 3/5/10/25/50/100/200 genes were used to make a prognostic prediction. Since some genes correlate positively with survival and have higher expression values in the good prognosis group while others show the opposite relationship, for each gene the difference to the median in the training set is used. In case the hazard ratio is <1, the expression value is inverted to a negative value.

Finally, the average expression of the informative genes in the test case is compared to the median of the average expression of these genes in the training set (“molecular classification”). The test case is predicted to have good prognosis if the average expression of the informative genes is in the expression range of the training cohort that has the better outcome. (For example, in case patients in the training set having expression range above the median have a better prognosis, and the average expression of the informative genes in the investigated patient is also above the median, then this patient is designated as one with good prognosis.)

Adjustment for clinical risk

Since selection of the training set cases is driven by molecular similarities to the test case, the resulting training cohort could have unbalanced clinical features that could skew overall prognostic prediction. For example, if the training cohort includes a large number of cases with poor clinical risk features (mostly node-positive, high grade, large cancers), the prognostic risk prediction based on molecular features alone may be erroneous when designating the sample into the good-prognosis cohort.

For this reason, the training set is compared to all the remaining patients. In this, the entire training set represents one cohort and the remaining patients represent another cohort and these are compared using a Cox regression analysis. There are three possible outcomes: the two cohorts do not differ significantly, and the training set can have better or poorer survival than the remaining patients. The result of this analysis termed “training set assessment” is used in the final prognostic classification to adjust the molecular risk that is based on molecular features alone.

Final classification rule

The final classification rule takes into account both the risk assignment from the “training set assessment” and the output from the “molecular classification.” When both predictors are concordant and assign good or poor prognosis, the decision rule follows the concordant vote. When the “molecular classification” is not significant (in other words there are no significant genes capable to define good and poor prognosis in the training set) or when the clinical prediction contradicts the molecular prediction the final output is “intermediate.”

To measure the performance of the dynamic classifier, we performed a leave-one out cross validation (LOOCV) for each sample (i.e., one case held out and the training subset selected from the remaining n − 1 samples). Statistical significance was set at p < 0.01.

Construction of online interface

We made the entire analysis possible in an online accessible software. In this, all computations are performed on a Debian linux (http://www.debian.org) central server running an Apache webserver, a (D)COM server and a background R server. After the upload of an unprocessed Affymetrix HGU133A or HGU133plus2 microarray .CEL file, the processing is performed in the R environment. The packages “affy” and “survival” are used for normalization and for drawing Kaplan–Meier plots, respectively. The homepage was set up using the open source Drupal content management system (http://www.drupal.org). The entire computational process is performed real time and the result is provided at the end of the analysis as a Kaplan–Meier survival plot showing the estimated survival.

Computation of static predictors

We compared the overall performance of our dynamic predictors to genomic surrogates of three commonly used static predictors, the 21-gene recurrence score, the 70-gene Mammaprint signature classifier and the 97-gene GGI. For computing the recurrence score, we used our previously published technique.10 The GGI and the 70-gene classifications were computed using the “genefu” Bioconductor package (http://www.bioconductor.org, http://compbio.dfci.harvard.edu) using the default parameters. The complete analysis results including individual predictions for each patient are available in Supporting Information Table 1. The performance of the classifiers was assessed by computing Cox regression and plotting a Kaplan–Meier plot for each classification algorithm separately.

To compare classification performance in terms of sensitivity, specificity and accuracy, we computed sensitivity = TP/(TP + FN) where TP = number of true positives (samples designated as bad prognosis that actually have relapsed within 5 years), and FN = number of false negatives (samples designated as good prognosis that actually have relapsed); specificity = TN/(TN + FP) where TN = number of true negatives (samples designated as good prognosis that had no recurrence) and FP = number of false positives (samples designated as bad prognosis that had no relapse) and accuracy = (TP + TN)/(TP + FN + TN + FP). In this analysis, patients designated as “intermediate” by the re-training approach were not included in the analysis for the 70-gene and the 97-gene signature. The 21-gene classifier was computed using all patients because it also has an “intermediate” cohort and therefore is more similar to the re-training approach in this context. In the analysis, the predictive power of relapse up to 5 years was compared. Patients censored before 5 years were excluded (final n = 2,801). The cutoff of 5 years was selected as it was used in previous studies, and is also the default length for hormonal therapy. Finally, positive and negative predictive values and confidence intervals were computed as described previously.16,17

Independent validation samples

We collected 325 independent validation samples of early stage breast cancers at the Departments of Gynecology and Obstetrics at the University Hospitals in Frankfurt and Hamburg, Germany. All patients participated in an IRB approved study and signed informed consent for biomarker analysis. They represent consecutive patients undergoing surgical resection up to July 2007.

Isolation of RNA and expression profiling using Affymetrix Human Genome U133A microarrays was performed according to the manufacturer's protocols. Affymetrix CEL files and clinical data have been deposited in the GEO database under the accession numbers GSE4611 (Frankfurt dataset) and GSE46184 (Hamburg dataset). The validation using these samples was performed after the re-training algorithm was completed and the online classification tool was finalized.

Results

Establishment of a master database and classification tool

The number of unique cases with relapse-free survival included in our master database was n = 3,534 from the following datasets: GSE1456, GSE4922, GSE5327, GSE6532, GSE7390, GSE9195, GSE11121, GSE12093, GSE12276, GSE2034, GSE16391, GSE16446, GSE17705, GSE17907, GSE19615, GSE2603, GSE20685, GSE20711, GSE21653, GSE25066, GSE2990, GSE31519 and GSE3494 (clinical characteristics for these patients are summarized in Table1). We must note that the definitions for relapse-free survival were slightly different in the original datasets. The exact length of the survival time (e.g., from beginning like diagnosis or therapy to the end like local, regional or distant metastasis) were not published in most of the datasets and therefore we merged any type of earliest progression of the disease into relapse-free survival.

Table 1.

Clinical characteristics of patients included in the pooled datasets

HER2 status All patients HER2− HER2+
ER status ER+ ER− (ER+ and ER−)
Adjuvant therapy No systemic therapy Adjuvant therapy Adjuvant therapy Adjuvant therapy
N 3,534 672 1,316 427 551
ER+ 2,960/3,534 (83.1%) (all) (all) (none) 372/551 (66.8%)
LN+ 992/3,220 (30.8%) 3/672 (0.4%) 564/1,083 (52.1%) 195/324 (60.2%) 147/465 (31.6%)
Grade 1 329/2,185 (15.6%) 143/528 (27.0%) 132/815 (16.1%) 10/326 (3.1%) 17/291 (5.8%)
Grade 2 842/2,185 (38.5%) 306/528 (58.0%) 355/815 (43.6%) 45/326 (13.8%) 97/291 (33.3%)
Grade 3 964/2,185 (44.1%) 78/528 (14.8%) 297/815 (36.5%) 271/326 (83.1%) 177/291 (60.8%)
Recurrence events 1,160/3,534 229/672 357/1,316 107/427 237/551
Median RFS (years) 5.85 7.85 5.42 3.44 5.51
Median age (year) 53.2 55.2 55.5 49.9 51.5
Median size (cm) 2.3 2.0 2.49 2.0 2.35

ER = estrogen receptor, RFS = recurrence free survival.

Using these patient samples, two independent classifications are executed as described in the Material and Methods section (summarized in Fig. 1). The predictor performance increased using a larger training set size up to 400 (p = 0.0005), but it significantly deteriorated when sample size was extended to 500 cases (p = 0.024). Because of this deterioration in performance and because of the substantially increasing computational time when including >500 patients in the training set we have not tested larger training set sizes. We have also assessed the effect of informative gene set sizes of 3/5/10/25/50/100/200. Although the difference between these was not significant, the nominally best classification was be achieved by using 25 genes. The Kaplan–Meier survival plot for the predictor using case-specific training set of 400 patients with top 25 most informative genes calculated over all cases is presented in Figure 2.

Figure 1.

Figure 1

Dynamic predictor development process. A large database is used to select a subset of training cases that are molecularly the most similar to the test case. This training subset is used to identify predictive features and to develop the test-case specific predictor (“molecular classification”). The training set is compared to the remaining samples (“training set assessment”) and the final classification takes into account both the “molecular classification” and the “training set assessment” results.

Figure 2.

Figure 2

Relapse-free survival curves for the dynamic classifier computed using the top 25 genes and a training set size of 400 samples and genomic surrogates of three commercially available prognostic signatures applied to the same 3,534 cases. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]

To enable the classification of new samples by any user we also established an online interface. The homepage can be accessed at http://www.recurrenceonline.com/?q=Re_training.

Comparison of the dynamic predictor to previously published static classifiers

We applied commonly used genomic surrogates of the 21-gene recurrence score, the 70-gene prognostic signature and the 97-gene GGI to our entire data set (Fig. 2). Individual prediction results are available in Supporting Information Table 1. For all patients, the dynamic prediction method yielded the highest hazard ratio (HR = 3.68) followed by the 70-gene classifier (HR = 3.40), the 21-gene recurrence score (HR = 2.55) and the 97-gene GGI (HR = 2.24).

For ER negative/HER2 negative patients, the dynamic predictor achieved significant discriminating power (HR = 3.08, p = 0.009) only. Similarly, in HER2 positive patients (including both ER positive and negative cases), the dynamic classification method (HR = 2.99, p = 4.8 E−07) was capable to achieve significance only (Fig. 2).

We also assessed the sensitivity, specificity and accuracy of each method for predicting relapse-free survival at five years. The highest sensitivity was achieved by the 70-gene signature (0.98) but it had the lowest specificity (0.14). The dynamic predictor had sensitivity of 0.84. The highest specificity (0.58) and overall accuracy (0.68) were achieved by the dynamic predictor (Table2(A)). When comparing the actual concordance of predictions between the dynamic predictor and the other classifiers, the highest overlap was observed with the 21-gene score (55%) (Table2(B)).

Table 2.

Performance comparison of the different predictors for overall sensitivity, specificity, accuracy, positive predictive value (PPV) and negative predictive value (NPV) including confidence intervals (A) and their concordance in terms of designated prediction (B)

Dynamic re-classification 21-gene signature 70-gene signature 97-gene signature
A
Sensitivity 0.84 (0.81–0.86) 0.80 (0.76–0.82) 0.98 (0.96–0.98) 0.81 (0.78–0.84)
Specificity 0.58 (0.55–0.61) 0.55 (0.53–0.58) 0.14 (0.12–0.16) 0.49 (0.46–0.52)
Accuracy 0.68 (0.66–0.70) 0.64 (0.62–0.65) 0.47 (0.46–0.47) 0.61 (0.59–0.63)
PPV 0.56 (0.54–0.58) 0.53 (0.52–0.55) 0.42 (0.42–0.43) 0.51 (0.49–0.52)
NPV 0.85 (0.82–0.87) 0.81 (0.79–0.83) 0.92 (0.86–0.95) 0.80 (0.77–0.83)
Re-training 21-gene signature 97-gene signature 70-gene signature
B
Prediction Good Intermediate Bad Good Bad Good Bad
Good 527 174 214 651 264 181 734
Intermediate 567 295 357 685 534 110 1109
Bad 100 180 1120 189 1211 7 1393
Concordance 55.0% 52.7% 44.5%

Finally, we evaluated the static classifiers by using the dynamic re-training results as a filter in which we excluded samples where the clinical and molecular classifiers were contradictory (intermediate cases). In this setting, the prediction power for each of the static classifiers increased (21-gene score: HR = 2.87; 97-gene GGI: HR = 2.52; 70-gene classifier: HR = 4.27).

The most consistently strongly prognostic genes

To identify the genes with the highest predictive potential, the prevalence of all genes included in the top 25 list from all LOOCV analyses was counted. In the 3,534 runs (one for each case), 5,038 distinct genes were associated with prognosis in at least one case and only 72 genes were present in more than 5% of classification signatures. Of these strongest genes, one is also present in the 21-gene signature (MYBL2), four are present in the 70-gene signature (CCNE2, CENPA, MCM6, PRC1) and 21 are also present in the 97-gene signature including MYBL2, CCNE2, CENPA and PRC1. All of these survival associated genes are included in Supporting Information Table 2.

Validation in independent clinical samples

Three hundred and twenty-five cases which are not included in the pooled public data database were used for independent validation of the method. We had relapse-free survival data for each patient. The average follow-up was 66 months, with 97 events; 81.1% (n = 206) of samples were ER positive; 39.4% (n = 128) were lymph node positive; 9.1% (n = 29), 58.5% (n = 186) and 32.4% (n = 103) were grade 1, 2 and 3, respectively. The samples are available using the GEO accession numbers GSE46184 and GSE4611.

In these, the dynamic predictor achieved high classification efficiency (HR = 2.96) and outperformed the 21-gene recurrence score (HR = 2.60), the 70-gene signature (HR = 1.89) and the GGI signature (HR = 1.83) (see Kaplan–Meier plots in Fig. 3). When using results of the dynamic re-training to exclude unreliable samples, prediction power for each static classifiers increased (21-gene score: HR = 3.02; 70-gene classifier: HR = 3.60; 97-gene GGI: HR = 2.73). When including the ER positive lymph node negative patients only, the dynamic predictor retained its predictive power (HR = 4.5, p = 0.01), while the 21-gene score (p = 0.13), the 70-gene signature (p = 0.27) and the GGI signature (p = 0.06) have not reached statistical significance.

Figure 3.

Figure 3

Performance of the dynamic classifier and genomic surrogates for three other prognostic signatures in 325 independent validation samples that were not included in the pool of 3,534 samples used for selection of the training set samples. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]

Discussion

High throughput genomic analysis has fundamentally changed our perception of breast cancer and the large scale heterogeneity of this disease has become widely recognized.18 Thus, searching for general prognostic markers that are applicable to all breast cancers is no longer considered appropriate.19 Yet, most currently used prognostic signatures were developed over a decade ago following the old paradigm of breast cancer as single disease. Here, we present a new approach to prognostic predictor discovery which recognizes heterogeneity of breast cancer and takes advantage of the large number of gene expression datasets that are now available for predictor discovery and training. The main idea of our method is that we define a predictor for a new case from the molecularly most similar cancers. Since each case differ from one another to various extents, the predictor and training set also differs from case to case.

We applied our method to gene expression data from 3,534 breast cancers and to a set of 325 independent cases. The dynamic classifier yielded higher average classification efficiency than genomic surrogates of three commonly used first generation prognostic signatures including the 21-gene recurrence score, the 70-gene prognostic signature and the 97-gene genomic grade index. It is very important to emphasize that our study compares different conceptual approaches to prognostic prediction rather than results from the actual commercially available prognostic tests. The hazard ratio was computed between the poor and good outcome cohorts only. We implemented this in order to be able to compare 3-class classifiers with the 2-class classifiers. However, removing from the performance calculations the cases that were assigned to the intermediate risk group by the 21-gene score or the dynamic predictors can provide a performance advantage for these classifiers. The highest concordance was observed between the dynamic classifier and the 21-gene score.

In the dynamic re-training, the classification of a sample as intermediate is fundamentally different from other approaches where the intermediate cohort represents a transition zone between good and bad prognosis scores. In our method, intermediate designation refers to patients having discordant clinical and molecular classification assignments. This category acknowledges the uncertainty about their prognosis due to opposing risks by anatomical and molecular variables. When evaluating the static classifiers with the exclusion of these ambiguous samples, prediction efficiency increased for each classifier—thus, the dynamic re-training approach was capable to identify those samples where static predictors had low reliability.

We have to emphasize that the dynamic re-training approach is not restricted to any particular sub-cohort of patients like previous classifiers. One of our most important advantages is that the dynamic classifier performed substantially better to discriminate between good and poor prognosis among ER-negative/HER negative cancers than any of the first generation gene signatures that we tested. It is well recognized that the prognostic power of the currently clinically available multi-gene prognostic assays is primarily restricted to ER positive cancers and the vast majority of ER negative cancers are assigned to poor prognosis by these tests. The classification success in HER2 positive and triple negative patients suggests that there could be sub-cohorts within these patients having good outcome who might perform well without adjuvant therapy. As we did not have sufficient patients with documented lack of adjuvant treatment within these cohorts, these findings must be validated in a future study.

Consistent with our previous expectations, most of the top ranked genes associated with survival differ from training set to training set. The three most commonly top ranked genes were CENPE, RACGAP1 and PGK1 which were included in the list of discriminatory genes in 831, 759 and 756 analyses, respectively. Thus, the most common gene reaches only a prevalence of 23.5% in all analyses. This observation illustrates the instability of gene rankings and also reflects the heterogeneity of breast cancers.20 We also compared the list of strongest genes to the three static signatures and found the largest overlap to the 97-gene signature. In addition, we identified four out of six genes overlapping between the 21-gene, the 70-gene and the 97-gene signatures (MYBL2, CCNE2, CENPA and PRC1).

Our method preserves the independence of model discovery from validation but it does not apply a single fixed predictor to each new case. A unique, case-specific predictor is developed for each new test case. In order to allow other investigators to use and validate our method we constructed a web-based dynamic prognostic predictor tool. It requires uploading of an Affymetrix HGU133A or HGU133plus2 microarray .CEL file, then it automatically performs QC assessment and normalization and performs the dynamic risk prediction as described in this study. This provides a new standardized, low cost, open source paradigm for genomic predictors.21

To our knowledge, this transcriptome-based algorithm presents the first approach where a dynamic classification tool without a predefined gene-list is presented. Conceptually, this method is applicable to any multivariate predictor which relies on high throughput data including other microarray platforms or RNA-seq data. The ultimate power of the approach lies in the future extension of the database by adding new cases with molecular data and known outcome.

Acknowledgments

B.G. was supported by the OTKA K108655, K108688, PD105361 grants and by the Predict project (Grant no. 259303 of the EU Health.2010.2.4.1.-8 call) and L.P. was supported by an award from the Breast Cancer Research Foundation, New York. B.G. has applied for a patent on the diagnostic use of the dynamic method described in the manuscript.

Supporting Information

Supplementary Information Table 1.

ijc0136-2091-sd1.pdf (284.1KB, pdf)

Supplementary Information Table 2.

ijc0136-2091-sd2.pdf (494.9KB, pdf)

References

  1. Paik S, Shak S, Tang G, et al. A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N Engl J Med. 2004;351:2817–26. doi: 10.1056/NEJMoa041588. [DOI] [PubMed] [Google Scholar]
  2. Parker JS, Mullins M, Cheang MC, et al. Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol. 2009;27:1160–7. doi: 10.1200/JCO.2008.18.1370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Sotiriou C, Wirapati P, Loi S, et al. Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. J Natl Cancer Inst. 2006;98:262–72. doi: 10.1093/jnci/djj052. [DOI] [PubMed] [Google Scholar]
  4. van't Veer LJ, Dai H, van de Vijver MJ, et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002;415:530–6. doi: 10.1038/415530a. [DOI] [PubMed] [Google Scholar]
  5. Rhodes A, Jasani B, Balaton AJ, et al. Frequency of oestrogen and progesterone receptor positivity by immunohistochemical analysis in 7016 breast carcinomas: correlation with patient age, assay sensitivity, threshold value, and mammographic screening. J Clin Pathol. 2000;53:688–96. doi: 10.1136/jcp.53.9.688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Rhodes A, Jasani B, Balaton AJ, et al. Study of interlaboratory reliability and reproducibility of estrogen and progesterone receptor assays in Europe. Documentation of poor reliability and identification of insufficient microwave antigen retrieval time as a major contributory element of unreliable assays. Am J Clin Pathol. 2001;115:44–58. doi: 10.1309/H905-HYC1-6UQQ-981P. [DOI] [PubMed] [Google Scholar]
  7. Layfield LJ, Goldstein N, Perkinson KR, et al. Interlaboratory variation in results from immunohistochemical assessment of estrogen receptor status. Breast J. 2003;9:257–9. doi: 10.1046/j.1524-4741.2003.09325.x. [DOI] [PubMed] [Google Scholar]
  8. Planey CR, Butte AJ. Database integration of 4923 publicly-available samples of breast cancer molecular and clinical data. 2013;2013:138–42. AMIA Joint Summits Transl Sci Proc. [PMC free article] [PubMed] [Google Scholar]
  9. Wray CJ, Ko TC, Tan FK. Secondary use of existing public microarray data to predict outcome for hepatocellular carcinoma. J Surg Res. 2014;188:137–42. doi: 10.1016/j.jss.2013.12.013. [DOI] [PubMed] [Google Scholar]
  10. Gyorffy B, Benke Z, Lanczky A, et al. RecurrenceOnline: an online analysis tool to determine breast cancer recurrence and hormone receptor status using microarray data. Breast Cancer Res Treat. 2012;132:1025–34. doi: 10.1007/s10549-011-1676-y. [DOI] [PubMed] [Google Scholar]
  11. Gyorffy B, Schafer R. Meta-analysis of gene expression profiles related to relapse-free survival in 1,079 breast cancer patients. Breast Cancer Res Treat. 2009;118:433–41. doi: 10.1007/s10549-008-0242-8. [DOI] [PubMed] [Google Scholar]
  12. Gautier L, Cope L, Bolstad BM, et al. Affy—analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004;20:307–15. doi: 10.1093/bioinformatics/btg405. [DOI] [PubMed] [Google Scholar]
  13. Gyorffy B, Molnar B, Lage H, et al. Evaluation of microarray preprocessing algorithms based on concordance with RT-PCR in clinical samples. PLoS One. 2009;4:e5645. doi: 10.1371/journal.pone.0005645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Li Q, Birkbak NJ, Gyorffy B, et al. Jetset: selecting the optimal microarray probe set to represent a gene. BMC Bioinformatics. 2011;12:474. doi: 10.1186/1471-2105-12-474. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gyorffy B, Lanczky A, Eklund AC, et al. An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res Treat. 2010;123:725–31. doi: 10.1007/s10549-009-0674-9. [DOI] [PubMed] [Google Scholar]
  16. Foody GM. Classification accuracy comparison: hypothesis tests and the use of confidence intervals in evaluations of difference, equivalence and non-inferiority. Remote Sensing Environ. 2009;113:5. [Google Scholar]
  17. Steinberg DM, Fine J, Chappell R. Sample size for positive and negative predictive value in diagnostic research using case–control designs. 2009;10:94–105. doi: 10.1093/biostatistics/kxn018. Biostatistics. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Weigelt B, Baehner FL, Reis-Filho JS. The contribution of gene expression profiling to breast cancer classification, prognostication and prediction: a retrospective of the last decade. J Pathol. 2010;220:263–80. doi: 10.1002/path.2648. [DOI] [PubMed] [Google Scholar]
  19. Weigelt B, Pusztai L, Ashworth A, et al. Challenges translating breast cancer gene signatures into the clinic. Nat Rev Clin Oncol. 2012;9:58–64. doi: 10.1038/nrclinonc.2011.125. [DOI] [PubMed] [Google Scholar]
  20. Curtis C, Shah SP, Chin SF, et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature. 2012;486:346–52. doi: 10.1038/nature10983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Sotiriou C, Pusztai L. Gene-expression signatures in breast cancer. N Engl J Med. 2009;360:790–800. doi: 10.1056/NEJMra0801289. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information Table 1.

ijc0136-2091-sd1.pdf (284.1KB, pdf)

Supplementary Information Table 2.

ijc0136-2091-sd2.pdf (494.9KB, pdf)

Articles from International Journal of Cancer. Journal International du Cancer are provided here courtesy of Wiley

RESOURCES