Skip to main content
. 2020 Sep 30;48(19):e113. doi: 10.1093/nar/gkaa802

Table 1.

Datasets used to identify stably expressed genes and assess their stability. Datasets are grouped based on the projects they were sourced from and are annotated for the type of dataset, number of samples, number of biological groups and the measurement used (TPM – transcripts per million, RPKM/FPKM – reads/fragments per kilobase per million or CPM – counts per million). Original studies that produced the dataset are cited along with the study where the processed version as downloaded. Processed versions of datasets marked by asterisk were downloaded from the human protein atlas www.proteinatlas.org

Project Dataset Type Measurement Number of samples Number of groups Citations
TCGA TCGA carcinomas Pan-cancer tissue FPKM 7310 13 (26,44)
TCGA other Pan-cancer tissue FPKM 1942 10 (26,44)
TCGA BRCA CPM Breast cancer tissue CPM 1077 6 (26,44)
TCGA normal Normal tissue FPKM 718 20 (26,44)
CCLE CCLE carcinomas Pan-cancer cell line RPKM 581 19 (27,30)
CCLE other Pan-cancer cell line RPKM 348 15 (27,30)
HPA HPA tissue Normal tissue TPM 43 43 (45)
HPA cell line Normal cell line TPM 64 64 (45)
HPA blood sample Blood TPM 109 19 (45)
CPTAC CPTAC TCGA colon Colon cancer tissue MS/MS intensity 95 - (46)
FANTOM FANTOM CAGE tissue* Normal tissue TPM 45 45 (47-49)
GTEx GTEx (v7) Normal tissue (post-mortem) TPM 8462 29 -
Other Daeman et al. breast cell lines Breast cancer cell line RPKM 64 4 (31,35)
GSE60424 sorted blood Blood RPKM 28 7 (38,50)
Monaco et al. blood* Blood TPM 30 30 (45,51)
Schmiedel et al. blood* Blood TPM 15 15 (32,45)