Skip to main content
. Author manuscript; available in PMC: 2011 Jun 1.
Published in final edited form as: Biol Blood Marrow Transplant. 2010 Jan 29;16(6):751–771. doi: 10.1016/j.bbmt.2010.01.012

Table 2. Most significantly enriched GO terms and KEGG pathways.

A) Tests of 507 Geneontology (GO) biological process terms with at least 10 genes present on the arrays.
Comparison direction number of probe sets selected (of 22690) number of distinct genes selected (of 13142) GO term title number of genes with GO term on the array number of those genes selected p-value
Tmsc vs Tn up in Tmsc 672 543 cell cycle 335 41 4.1E-10
immune response 263 29 1.4E-06
DNA replication 96 16 1.8E-06
mitosis 109 17 2.2E-06
cell division 173 22 2.6E-06
regulation of progression through cell cycle 185 20 8.1E-05
Tmsc vs Tn up in Tn 545 448 transcription 1125 72 1.1E-07
regulation of transcription, DNA-dependent 1486 86 4.6E-07
Te vs Tmsc up in Te 1486 1098 cell cycle 335 102 1.5E-32
mitosis 109 53 1.1E-28
cell division 173 66 1.1E-27
DNA replication 96 40 6.7E-19
apoptosis 296 54 3.1E-08
DNA replication initiation 13 9 1.0E-07
Te vs Tmsc up in Tmsc 1372 1036 ribosome biogenesis and assembly 47 21 9.6E-12
translation 259 51 7.5E-10
Te vs Tn up in Te 1817 1359 cell cycle 335 109 2.6E-29
mitosis 109 52 2.9E-23
cell division 173 67 3.5E-23
DNA replication 96 41 1.8E-16
DNA replication initiation 13 10 2.9E-08
apoptosis 296 59 5.1E-07
DNA recombination 35 13 2.4E-05
regulation of progression through cell cycle 185 38 2.6E-05
Te vs Tn up in Tn 1837 1359 ribosome biogenesis and assembly 47 18 5.3E-07
translation 259 49 3.3E-05
transcription 1125 158 6.9E-05
B) Tests of 190 KEGG pathways
Comparison direction number of probe sets selected (of 22690) number of distinct genes selected (of 13142) Pathway Title number of genes in pathway on the array number of those genes selected p-value
Tmsc vs Tn up in Tmsc 672 543 Cell cycle 102 16 4.1E-06
Pyrimidine metabolism 80 12 1.0E-04
Jak-STAT signaling pathway 122 14 5.0E-04
Tmsc vs Tn up in Tn 545 448 Wnt signaling pathway 133 11 5.8E-03

We fit a 1-way ANOVA model to the data for each probe set and used it to compare each pair of groups. We selected those probe sets that gave p-values of p<.01 and average fold differences of at least 1.5-fold as being differentially expressed. We estimated the false discovery rate (FDR) by performing an identical analysis of all data sets with permuted sample labels that did not recapitulate the actual groupings of the 3 groups of samples, and dividing the average number of probe sets selected in these permuted data sets by the actual number of probe sets selected for the original data, and obtained estimated FDRs of 7.5% for TMSC vs. TN, 3.1% for TE vs. TMSC, and 2.5% for TE vs. TN. The number of probe sets selected as being increased or decreased is shown, as well as the number of distinct genes these represented, determined using Entrez Gene identifiers. The mapping of probe sets to genes, as well as the Gene Ontology terms for each gene, were obtained from Affymetrix web site, and were dated July 11, 2007. We tested each list of distinct genes for over-representation in 507 lists of Gene Ontology terms for which there were at least 10 genes present on the array, as well as for 190 lists of pathways from the Kyoto Encyclopedia of Genes and Genomes, obtained Jun 12, 2007 (http://www.genome.jp/kegg/). We present only the most significant lists obtained in each category, and of the most significant lists present all of them (that is, none are hidden). The total number of genes in each list that were on the array is given as well as the number of those that were selected as altered in each pair-wise comparison of groups of samples. The p-values shown are from using these counts to perform one-sided Fisher's Exact tests, however, since many lists were tested, we also give the average number of lists that obtained a p-value this smaller or smaller in 100 analyses of the same data in which the gene labels were randomly permuted.