Skip to main content
Genetics, Selection, Evolution : GSE logoLink to Genetics, Selection, Evolution : GSE
. 2004 Mar 15;36(2):191–205. doi: 10.1186/1297-9686-36-2-191

Bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes

Theo HE Meuwissen 1,, Mike E Goddard 2
PMCID: PMC2697185  PMID: 15040898

Abstract

The ordinary-, penalized-, and bootstrap t-test, least squares and best linear unbiased prediction were compared for their false discovery rates (FDR), i.e. the fraction of falsely discovered genes, which was empirically estimated in a duplicate of the data set. The bootstrap-t-test yielded up to 80% lower FDRs than the alternative statistics, and its FDR was always as good as or better than any of the alternatives. Generally, the predicted FDR from the bootstrapped P-values agreed well with their empirical estimates, except when the number of mRNA samples is smaller than 16. In a cancer data set, the bootstrap-t-test discovered 200 differentially regulated genes at a FDR of 2.6%, and in a knock-out gene expression experiment 10 genes were discovered at a FDR of 3.2%. It is argued that, in the case of microarray data, control of the FDR takes sufficient account of the multiple testing, whilst being less stringent than Bonferoni-type multiple testing corrections. Extensions of the bootstrap simulations to more complicated test-statistics are discussed.

Keywords: microarray data, gene expression, non-parametric bootstrapping, t-test, false discovery rates

Full Text

The Full Text of this article is available as a PDF (135.2 KB).


Articles from Genetics, Selection, Evolution : GSE are provided here courtesy of BMC

RESOURCES