Table 2. Data and properties of cancer genes in NCG 5.0.

| Data sets in NCG 5.0 | Property | All cancer genes (1571) | Known cancer genes (518): dominant (395) | Known cancer genes (518): recessive (112) | Candidate cancer genes (1053) | Other human genes |
|---|---|---|---|---|---|---|
| Human genes | All genes | 1525 | 382 | 112 | 1020 | 17 489 |
| | Duplicated genes (%) | 280 (18%) | 76 (20%) | 12 (11%) | 187 (18%) | 3520 (20%) |
| Orthology | All genes | 1501 | 379 | 110 | 1001 | 16 618 |
| | Pre-metazoan genes (%) | 992 (66%) | 233 (61%) | 80 (72%) | 672 (67%) | 10 516 (63%) |
| Protein–protein interactions | All nodes | 1332 | 371 | 110 | 840 | 13 262 |
| | Hubs (%) | 558 (42%) | 213 (57%) | 78 (71%) | 257 (31%) | 2970 (22%) |
| | All nodes in HT network | 1177 | 339 | 108 | 720 | 11 481 |
| | Hubs in HT network (%) | 386 (33%) | 148 (44%) | 52 (48%) | 177 (25%) | 2681 (23%) |
| Protein complexes | Proteins (%) | 752 (49%) | 238 (62%) | 87 (78%) | 418 (41%) | 4917 (28%) |
| miRNA interactions | miRNA target genes (%) | 1101 (72%) | 332 (87%) | 99 (88%) | 662 (65%) | 10 643 (61%) |
| | miRNAs | 324 | 247 | 163 | 250 | 438 |
| Expression in normal tissues | All genes in GTEx | 1513 | 379 | 111 | 1012 | 16 818 |
| | Ubiquitous genes (%) | 965 (64%) | 301 (79%) | 98 (88%) | 555 (55%) | 11 077 (66%) |
| | Tissue-specific genes (%) | 62 (4%) | 5 (1%) | 0 (0%) | 57 (6%) | 726 (4%) |
| | All genes in Protein Atlas | 1517 | 378 | 112 | 1016 | 16 889 |
| | Ubiquitous genes (%) | 831 (55%) | 278 (74%) | 95 (85%) | 447 (44%) | 9492 (56%) |
| | Tissue-specific genes (%) | 90 (6%) | 11 (3%) | 1 (1%) | 78 (8%) | 1042 (6%) |
| Expression in cancer cell lines | Cancer Cell Line Encyclopedia | 1426 | 367 | 106 | 942 | 15 158 |
| | COSMIC Cancer Lines | 1398 | 358 | 105 | 924 | 14 788 |
| | Genentech data set | 1524 | 381 | 112 | 1020 | 17 164 |

Of the 518 known cancer genes derived from CGC, 391 are annotated as dominant (mostly oncogenes), 108 as recessive (mostly tumour suppressors), four as both dominant and recessive, and 15 have no specified mode of action; the four genes annotated as both are counted in each of the dominant (395) and recessive (112) columns. Duplicated genes have one or more duplicated loci in the genome covering ≥60% of their length (12). Pre-metazoan genes originated in the Last Universal Common Ancestor, Eukaryotes or Opisthokonts. Ubiquitously expressed genes are expressed in ≥95% of tissues (29 tissues in GTEx and 30 tissues in Protein Atlas). HT = high throughput (publications reporting ≥100 interactions).
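
The cut-offs in the legend can be made concrete with a short sketch. It is illustrative only and is not the NCG pipeline: the function names are hypothetical, and it assumes that a per-gene count of tissues with detectable expression and a per-publication count of reported interactions are already available. How expression is called in each tissue is not specified here and is left out.

```python
# Thresholds quoted in the table legend.
UBIQUITOUS_PERCENT = 95      # expressed in >=95% of tissues
HT_MIN_INTERACTIONS = 100    # publication reporting >=100 interactions


def min_tissues_for_ubiquitous(n_tissues: int) -> int:
    """Smallest number of tissues that reaches the 95% cut-off (hypothetical helper)."""
    # Integer ceiling of n_tissues * 95 / 100, avoiding floating-point rounding.
    return (n_tissues * UBIQUITOUS_PERCENT + 99) // 100


def is_ubiquitous(expressed_in: int, n_tissues: int) -> bool:
    """True if a gene is expressed in at least 95% of the surveyed tissues."""
    return expressed_in * 100 >= UBIQUITOUS_PERCENT * n_tissues


def is_high_throughput(n_interactions: int) -> bool:
    """True if a publication reports enough interactions to count as HT."""
    return n_interactions >= HT_MIN_INTERACTIONS


if __name__ == "__main__":
    print(min_tissues_for_ubiquitous(29))  # GTEx, 29 tissues -> 28
    print(min_tissues_for_ubiquitous(30))  # Protein Atlas, 30 tissues -> 29
```

Under these assumptions, a gene would need expression in at least 28 of the 29 GTEx tissues, or 29 of the 30 Protein Atlas tissues, to be counted as ubiquitous.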