2015 Oct 29;44(Database issue):D992–D999. doi: 10.1093/nar/gkv1123

Table 2. Data and properties of cancer genes in NCG 5.0.

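| Data sets in NCG 5.0 | Property | All cancer genes (1571) | Known cancer genes (518): dominant (395) | Known cancer genes (518): recessive (112) | Candidate cancer genes (1053) | Other human genes |
|---|---|---|---|---|---|---|
| Human genes | All genes | 1525 | 382 | 112 | 1020 | 17 489 |
| Human genes | Duplicated genes (%) | 280 (18%) | 76 (20%) | 12 (11%) | 187 (18%) | 3520 (20%) |
| Orthology | All genes | 1501 | 379 | 110 | 1001 | 16 618 |
| Orthology | Pre-metazoan genes (%) | 992 (66%) | 233 (61%) | 80 (72%) | 672 (67%) | 10 516 (63%) |
| Protein–protein interactions | All nodes | 1332 | 371 | 110 | 840 | 13 262 |
| Protein–protein interactions | Hubs (%) | 558 (42%) | 213 (57%) | 78 (71%) | 257 (31%) | 2970 (22%) |
| Protein–protein interactions | All nodes in HT network | 1177 | 339 | 108 | 720 | 11 481 |
| Protein–protein interactions | Hubs in HT network (%) | 386 (33%) | 148 (44%) | 52 (48%) | 177 (25%) | 2681 (23%) |
| Protein complexes | Proteins (%) | 752 (49%) | 238 (62%) | 87 (78%) | 418 (41%) | 4917 (28%) |
| miRNA interactions | miRNA target genes (%) | 1101 (72%) | 332 (87%) | 99 (88%) | 662 (65%) | 10 643 (61%) |
| miRNA interactions | miRNAs | 324 | 247 | 163 | 250 | 438 |
| Expression in normal tissues | All genes in GTEx | 1513 | 379 | 111 | 1012 | 16 818 |
| Expression in normal tissues | Ubiquitous genes in GTEx (%) | 965 (64%) | 301 (79%) | 98 (88%) | 555 (55%) | 11 077 (66%) |
| Expression in normal tissues | Tissue-specific genes in GTEx (%) | 62 (4%) | 5 (1%) | 0 (0%) | 57 (6%) | 726 (4%) |
| Expression in normal tissues | All genes in Protein Atlas | 1517 | 378 | 112 | 1016 | 16 889 |
| Expression in normal tissues | Ubiquitous genes in Protein Atlas (%) | 831 (55%) | 278 (74%) | 95 (85%) | 447 (44%) | 9492 (56%) |
| Expression in normal tissues | Tissue-specific genes in Protein Atlas (%) | 90 (6%) | 11 (3%) | 1 (1%) | 78 (8%) | 1042 (6%) |
| Expression in cancer cell lines | Cancer cell line encyclopedia | 1426 | 367 | 106 | 942 | 15 158 |
| Expression in cancer cell lines | COSMIC Cancer Lines | 1398 | 358 | 105 | 924 | 14 788 |
| Expression in cancer cell lines | Genentech data set | 1524 | 381 | 112 | 1020 | 17 164 |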

Of the 518 known cancer genes derived from CGC, 391 are annotated as dominant (mostly oncogenes), 108 as recessive (mostly tumour-suppressors), four as both dominant and recessive, and 15 have no specified mode of action. Duplicated genes have one or more duplicated loci in the genome covering ≥60% of their length (12). Pre-metazoan genes originated in the Last Universal Common Ancestor, Eukaryotes or Opisthokonts. Ubiquitously expressed genes are expressed in ≥95% of tissues (29 tissues in GTEx and 30 tissues in Protein Atlas). HT = high throughput (publications reporting ≥100 interactions).
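
The counts and thresholds in this footnote can be checked with a little arithmetic. The sketch below is illustrative only: all variable and function names are hypothetical, and it assumes that the four genes annotated as both dominant and recessive are counted in both column headers. That is an inference from the totals (391 + 4 = 395 and 108 + 4 = 112), not something the source states explicitly.

```python
# Counts stated in the footnote for the known cancer genes from CGC.
dominant_only = 391
recessive_only = 108
both = 4
unspecified = 15

# Assumption: genes annotated as both modes appear in each column total.
dominant_column = dominant_only + both      # matches "Dominant (395)"
recessive_column = recessive_only + both    # matches "Recessive (112)"
known_total = dominant_only + recessive_only + both + unspecified


def min_tissues_for_ubiquitous(n_tissues: int, percent: int = 95) -> int:
    """Smallest tissue count that is at least `percent`% of n_tissues."""
    # Integer ceiling division avoids floating-point rounding issues.
    return -(-percent * n_tissues // 100)


def is_high_throughput(n_reported_interactions: int, cutoff: int = 100) -> bool:
    """HT publications report at least `cutoff` interactions."""
    return n_reported_interactions >= cutoff


print(dominant_column, recessive_column, known_total)
print(min_tissues_for_ubiquitous(29), min_tissues_for_ubiquitous(30))
```

Under the ≥95% rule, a gene would need to be expressed in at least 28 of the 29 GTEx tissues, or at least 29 of the 30 Protein Atlas tissues, to count as ubiquitous.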