Table 1. Genomic locations of microsatellites found to be globally differential between the germline DNA of cancer patients and cancer-free volunteers.
Motif | Upstream | Downstream | 5′ UTR | 3′ UTR | Intron | Exon | Intergenic | Total Loci | Total RefSeq Genes | Total Cancer Genes |
---|---|---|---|---|---|---|---|---|---|---|
Sporadic Breast Cancer Patient Motifs | ||||||||||
TTA | 74 | 67 | 732 | 134 | 4,588 | 0 | 8,865 | 14,460 | 3,508 | 774 |
TATAT | 3 | 1 | 11 | 2 | 99 | 0 | 363 | 479 | 98 | 22 |
TTTAGT | 0 | 1 | 3 | 0 | 22 | 0 | 34 | 60 | 26 | 5 |
TATT | 139 | 170 | 1,609 | 233 | 9,411 | 0 | 18,063 | 29,625 | 5,919 | 1,362 |
TTTTCA | 0 | 0 | 8 | 1 | 23 | 0 | 49 | 81 | 32 | 7 |
TATTCT | 1 | 0 | 1 | 0 | 18 | 0 | 39 | 59 | 18 | 4 |
TATTTC | 0 | 0 | 2 | 0 | 18 | 0 | 50 | 70 | 18 | 3 |
TATATT | 1 | 1 | 17 | 6 | 154 | 0 | 383 | 562 | 147 | 31 |
TATCTT | 0 | 0 | 1 | 0 | 7 | 0 | 10 | 18 | 6 | 0 |
ACTTTT | 0 | 0 | 2 | 0 | 8 | 0 | 17 | 27 | 9 | 2 |
AATTT | 2 | 2 | 35 | 6 | 193 | 0 | 452 | 690 | 227 | 57 |
AATTTT | 3 | 2 | 38 | 8 | 246 | 0 | 462 | 759 | 277 | 64 |
TATTTT | 63 | 79 | 496 | 85 | 3,173 | 0 | 5,639 | 9,535 | 2,832 | 662 |
ACGGGC | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 3 | 4 | 1 |
TGGCGA | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 2 | 2 | 1 |
GCGGT | 0 | 0 | 1 | 1 | 2 | 0 | 1 | 5 | 4 | 0 |
CGGCCA | 1 | 1 | 2 | 1 | 2 | 1 | 1 | 9 | 9 | 2 |
GAGCGG | 6 | 1 | 8 | 0 | 1 | 7 | 12 | 35 | 23 | 7 |
Motifs that Differentiate Colon Cancer Only | ||||||||||
TGGGTC | 1 | 0 | 1 | 0 | 5 | 1 | 13 | 21 | 9 | 2 |
poly A | 1,080 | 1,660 | 10,215 | 1,721 | 63,241 | 0 | 91,398 | 169,315 | 13,375 | 2,864 |
AATAC | 0 | 2 | 9 | 0 | 42 | 0 | 94 | 147 | 57 | 12 |
Only genes in the RefSeq database were included. A “count” is defined as a complete tandem repeat at least 18 bp long (for 3-mers and 6-mers) or at least 20 bp long (for 1-, 2-, 4-, and 5-mers). Upstream and downstream regions were defined as extending 1,000 bp distal from the transcribed gene. There are a total of 4,230 ‘cancer’ genes and 31,118 RefSeq genes.
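
The length thresholds in the table note can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the names `MIN_TRACT_BP`, `min_copies`, and `count_loci` are hypothetical, and the sketch assumes exact, uninterrupted copies of the motif on the given strand only (no reverse-complement or cyclic-permutation matching, which the note does not specify).

```python
import math
import re

# Minimum tract length in bp, keyed by motif length, as stated in the table
# note: 18 bp for 3-mers and 6-mers, 20 bp for 1-, 2-, 4-, and 5-mers.
MIN_TRACT_BP = {1: 20, 2: 20, 3: 18, 4: 20, 5: 20, 6: 18}


def min_copies(motif: str) -> int:
    """Smallest number of complete motif copies that reaches the threshold."""
    return math.ceil(MIN_TRACT_BP[len(motif)] / len(motif))


def count_loci(sequence: str, motif: str) -> int:
    """Count non-overlapping tracts of complete tandem copies of `motif`.

    Assumption (not from the source): only exact copies on the given strand
    are matched.
    """
    motif = motif.upper()
    # Builds a pattern such as (?:TTA){6,} for the motif TTA.
    pattern = re.compile(f"(?:{re.escape(motif)}){{{min_copies(motif)},}}")
    return sum(1 for _ in pattern.finditer(sequence.upper()))


# Toy sequence: one qualifying TTA tract (6 copies, 18 bp), one TTA tract
# that is too short (5 copies, 15 bp), and one qualifying TATT tract
# (5 copies, 20 bp).
toy = "GG" + "TTA" * 6 + "CC" + "TTA" * 5 + "G" + "TATT" * 5
tta_loci = count_loci(toy, "TTA")
tatt_loci = count_loci(toy, "TATT")
```

Under these assumptions a 3-mer needs six complete copies and a 5-mer needs four, so the 15 bp TTA tract in the toy sequence is not counted.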