2012 Nov 26;41(Database issue):D56–D63. doi: 10.1093/nar/gks1172

Table 1.

The full complement of ENCODE data sets summarized by cell type [cell types annotated as cancer are marked with an asterisk (*)]

Cell type                 Tissue                 Description                 Data sets
                                                                             TF/His  RNA  Other  Total
Tier 1 initial
    GM12878               Blood                  Lymphoblastoid                 137   27     49    213
    K562*                 Blood                  Leukemia                       247   45     80    372
Tier 1 added in 2011
    H1-hESC               Embryonic stem         Embryonic stem                  96   14     23    133
Tier 2 initial
    HeLa-S3*              Uterine cervix         Cervical carcinoma              93   14     30    137
    HepG2*                Liver                  Liver carcinoma                118   19     26    163
    HUVEC                 Umbilical endothelium  Umbilical vein endothelial      37   13     16     66
Tier 2 added in 2011
    A549*                 Lung                   Lung carcinoma                  89   22     12    123
    CD14+                 Blood                  Monocyte                        17    4      4     25
    IMR90                 Lung                   Lung fibroblast                 11   16     10     37
    MCF-7*                Breast                 Breast carcinoma                50   15     32     97
    SK-N-SH*              Brain                  Neuroblastoma                   36   16      7     59
Tier 2 added in 2012
    CD20+                 Blood                  B cell                          11    5      4     20
    H1-neuron             Neuron                 H1ES-derived neuron              5    3      1      9
    LHCN-M2               Muscle                 Myoblast                         7    2      4     13
Human: totals
    Tier 1 + Tier 2 (14)                                                        954  215    298   1467
    Tier 3 (274)                                                                591   94    734   1419
    All (288)                                                                  1545  309   1032   2886
Mouse
    All (81)                                                                    381  102    100    583

Studies in the human genome focused on common cell types in designated ‘tiers’, with Tier 1 most intensively studied, followed by Tier 2. A total of 10 292 files referenced to the human (hg19/GRCh37) genome have been released. For mouse (mm9/NCBI37), the comparable number is 8952 files. Data are available for download from the UCSC download server; for access, see http://encodeproject.org/ENCODE/downloads.html and http://encodeproject.org/ENCODE/downloadsMouse.html. File formats are described on the ENCODE Portal File Formats page.
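
The totals in Table 1 can be cross-checked against the per-cell-type rows. The following is a minimal sketch in Python, with the counts transcribed by hand from the table. The names `TIER_1_AND_2`, `column_sums` and `rows_with_bad_totals` are illustrative and are not part of any ENCODE tooling.

```python
# Counts transcribed from Table 1 as (TF/His, RNA, Other, Total) per cell type.
# All identifiers here are illustrative; none come from ENCODE software.
TIER_1_AND_2 = {
    "GM12878": (137, 27, 49, 213),
    "K562": (247, 45, 80, 372),
    "H1-hESC": (96, 14, 23, 133),
    "HeLa-S3": (93, 14, 30, 137),
    "HepG2": (118, 19, 26, 163),
    "HUVEC": (37, 13, 16, 66),
    "A549": (89, 22, 12, 123),
    "CD14+": (17, 4, 4, 25),
    "IMR90": (11, 16, 10, 37),
    "MCF-7": (50, 15, 32, 97),
    "SK-N-SH": (36, 16, 7, 59),
    "CD20+": (11, 5, 4, 20),
    "H1-neuron": (5, 3, 1, 9),
    "LHCN-M2": (7, 2, 4, 13),
}

# Summary rows as reported under "Human: totals".
REPORTED_TIER_1_AND_2 = (954, 215, 298, 1467)
REPORTED_TIER_3 = (591, 94, 734, 1419)
REPORTED_HUMAN_ALL = (1545, 309, 1032, 2886)


def column_sums(rows):
    """Sum each of the four columns across all cell types."""
    return tuple(sum(column) for column in zip(*rows.values()))


def rows_with_bad_totals(rows):
    """Return cell types whose Total differs from TF/His + RNA + Other."""
    return [
        name
        for name, (tf_his, rna, other, total) in rows.items()
        if tf_his + rna + other != total
    ]


tier_sums = column_sums(TIER_1_AND_2)
human_all = tuple(a + b for a, b in zip(tier_sums, REPORTED_TIER_3))

print("Tier 1 + Tier 2 column sums:", tier_sums)
print("Matches reported row:", tier_sums == REPORTED_TIER_1_AND_2)
print("Human total matches:", human_all == REPORTED_HUMAN_ALL)
print("Rows with inconsistent totals:", rows_with_bad_totals(TIER_1_AND_2))
```

Keeping the counts as one tuple per cell type lets the same data drive both the row check and the column check, so a transcription error in any single cell appears in at least one of them.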