Abstract
Microsatellite instability (MSI) is associated with 10–15% of colorectal, endometrial, ovarian and gastric cancers, and has long been used as a diagnostic tool for hereditary nonpolyposis colorectal carcinoma-related cancers. Tumor specific length alterations within microsatellites are generally accepted to be a consequence of strand slippage events during DNA replication, which are uncorrected due to a defective postreplication mismatch repair (MMR) system. Mutations arising within microsatellites associated with critical target genes are believed to play a causative role in the evolution of MMR-defective tumors. In this review, we summarize current evidence of mutational biases within microsatellites arising as a consequence of intrinsic DNA sequence effects as well as variation in MMR efficiency. Microsatellite mutational biases are generally not considered during clinical testing; however, we suggest that such biases may be clinically significant as a factor contributing to phenotypic variation among MSI-positive tumors.
Introduction
Microsatellites are short, repetitive elements in genomic DNA consisting of 1–6 bases per repeat unit, and are nonrandomly distributed throughout the human genome (for reviews, see (1, 2)). Microsatellites located in intragenic regions (promoters, 3′-UTRs, introns) can be important regulators of gene expression by influencing transcription rate, RNA stability, splicing efficiency, and RNA-protein interactions. Because microsatellite sequences are highly polymorphic in human populations, genetic changes in some microsatellites can be important evolutionarily by affecting phenotype. Microsatellites located within intergenic regions might also have functional roles in chromatin organization and recombination.
Postreplication mismatch repair (MMR) is a critical mechanism for maintaining microsatellite stability through the correction of base substitution mismatches and insertion/deletion events (for review, see (3)). Premutational intermediates are detected and processed by heterodimers of the MutS and MutL family of proteins. The MutSα complex (MSH2 and MSH6) can recognize base mispairs as well as single base insertion/deletion loops (IDLs) in either the parental or nascent strand. MutSβ (MSH2 and MSH3) can recognize IDLs of 2–4 bases in addition to single base IDLs. MutL heterodimers are recruited by MutS complexes to the mismatch or IDL, and consist of MLH1 dimerized with either PMS2 (MutLα), PMS1 (MutLβ), or MLH3 (MutLγ). MutLα is regarded as the major MutL complex for repair of both large and small IDLs, while MutLγ and MutLβ have little or no known role in MMR, respectively.
Clinical phenotype of microsatellite mutation
Lynch syndrome, also known as hereditary non-polyposis colorectal cancer (HNPCC), is characterized by early onset tumors in the proximal colon as well as extracolonic regions (for review, see (4)). Lynch syndrome results from the germline inheritance of mutations in MMR genes and subsequent MMR-deficiency arising during neoplastic cell evolution. Such MMR deficiency can be readily diagnosed as tumor-specific microsatellite instability (MSI), using a panel of defined microsatellite markers. Patients who are homozygotes or compound heterozygotes for mutations in MSH2, MSH6, MLH1, or PMS2 genes can present with childhood hematological and brain malignancies as well as features of neurofibromatosis (e.g., cafe-au-lait spots), in addition to early-onset gastrointestinal neoplasms.
Germline mutations in patients with a history of HNPCC are most frequently seen in the MSH2 or MLH1 genes. Due to partial redundancy and differences in substrate specificity, MSH6 (MutSα-specific) alterations are less common in HNPCC, while genetic predisposition to HNPCC has not been shown for MSH3 (MutSβ-specific) mutations (5). Similarly, PMS2 (MutLα-specific) mutations are rarely observed in HNPCC patients, as compared to MLH1 mutations, even though PMS2 is the primary binding partner of MLH1. This discrepancy in the frequency of MLH1 versus PMS2 mutations may be due, in part, to redundancy between PMS2 and MLH3 activities, since MLH3 has similar biochemical properties to PMS2 (6). However, investigations of MLH3 (MutLγ-specific) and PMS1 (MutLβ-specific) have shown only a minor role in microsatellite stability (3), and it is unclear whether mutations in these genes are involved in HNPCC like phenotypes (5).
Although MMR proteins act in a single biochemical pathway, loss of individual components results in both similar and distinct clinical phenotypes. Patients carrying heterozygous mutations in either the MLH1 or MSH2 genes display classical HNPCC, while patients carrying heterozygous MSH6 and PMS2 mutations usually display a decreased penetrance and later onset of HNPCC (5). Genotypic-phenotypic correlations for HNPCC are complicated, as patients carrying different variant alleles within the same gene can have distinct clinical phenotypes. For example, a recent study described a patient with an MLH1-G67E mutation and family history of HNPCC who displayed both classic and atypical tumors, distinct from families with MLH1-G67R mutations that only show classical tumors (7). Clinical phenotypes also can vary widely among patients having the same allelic mutation. For example, many carriers with an MSH2-A636P allele displayed only colorectal carcinomas, while other patients also developed endometrial or ovarian carcinomas (8). Variable presentations of tumor types among HNPCC patients may reflect the stochastic nature of genetic alterations (mutation timing, frequency and specificity) that result from the mutator phenotype initiated by defective MMR.
Microsatellite mutational biases due to intrinsic DNA features
Mutation rates of common microsatellites found in the human genome are extremely variable. The combined mutation rate estimates from analyses of parent-child allele transmissions and experimental determinations in somatic, nontumorigenic human cells range from a low of ~10−6 to a high of ~10−2 mutations per locus per generation (for reviews, see (1, 9)). Multiple extrinsic factors influence microsatellite mutation rates, including flanking sequence, recombination rate, sex, and age. Undoubtedly, however, the vast majority of the observed variation in mutagenesis is due to features intrinsic to the repeated DNA sequence itself, including motif unit size (mono, di, tri, etc.), length (number of units), and sequence composition (1, 9, 10). In general, motif size is inversely related to microsatellite allele mutation frequency, while motif length is directly related to microsatellite mutation frequency. Motif sequence can induce mutational variation in several ways. First, the precise sequence composition of an allele affects thermostability and DNA secondary structure potential, both of which may be factors contributing to the significant differences in microsatellite allele mutation rates observed in human cells (11). Second, the purity of a microsatellite directly affects mutability, as interrupted alleles have a significantly lower mutation rate. Third, the complexity of a microsatellite locus (simple, compound, or complex) has been implicated as a factor in mutability (1). In addition to differences in absolute mutation frequency, the types of mutational events arising within microsatellite alleles are dependent upon intrinsic DNA features. Mathematical modeling using human genome data has shown that expansions are most prevalent in short microsatellites whereas long microsatellites tend to contract (1). Non-tumorigenic human cells also display distinct mutational biases for contractions versus expansion mutations within di- and tetranucleotide microsatellites due to differences in motif sequence composition (9, 11).
Mechanistically, microsatellite alleles expand or contract through strand slippage events during DNA synthesis. One potential source of the above mutational biases may be differential stabilization of slipped strand intermediates through formation of stable secondary DNA structures within microsatellite alleles. In this mechanism, both the DNA sequence content and the DNA strand location (leading versus lagging strand) of a microsatellite contribute to mutational bias. DNA repair has also been demonstrated to impart mutation biases.
Microsatellite mutational biases due to defective MMR
The repair specificity of individual MMR proteins is an especially important consideration for disease phenotype. DNA repair by each MutS heterodimer is substrate specific within mono-, di-, or tetranucleotide repeats: MutSα is expected to repair premutational intermediates only within mononucleotide repeats, while MutSβ potentially repairs premutational intermediates within mono-, di-, tri- and tetranucleotide repeats (12). In yeast and E. coli models, the efficiency of MMR within microsatellites is dependent upon motif size, length, and sequence composition of an allele (9). Mutation rate variations among GT/CA loci within the yeast genome have been attributed to differential MMR efficiencies (13). We have recently demonstrated that differential MMR efficiency by PMS2 contributes to mutation rate biases at mono-, di- and tetranucleotide alleles in nontumorigenic human cells (14). Thus, available experimental data show that MMR contributes to mutation frequency biases within microsatellites.
We and others also have observed differences in the mutational specificity of microsatellites due to defects in specific MMR genes (Table 1). In mice, deficiency of MSH2 gives rise to 78% one base-pair deletions in a mononucleotide allele; in contrast, loss of MSH3 or MSH6 results in only 61% or 47% deletion mutations within mononucleotides, respectively (15). For dinucleotide GT/CA alleles, loss of yeast MSH3 results in a bias favoring deletions, whereas loss of yeast MSH6 results in the opposite bias, favoring insertions. In MSH2 deficient yeast, the majority of mutations at [AG/TC]5 within the APC gene are insertions whereas deletions are more frequent in an [A/T]6 tract of the same gene (16). Human APC in MMR defective colorectal tumors displays a reverse trend where deletions are readily observed within the [AG/TC]5 tract, while insertions are common at [A/T]6 (16). The human APC data was obtained from a mutation database representing patients of various MMR backgrounds, and as such, mutation spectra are a combination of various MMR defects. The discrepancy between yeast and human data could also indicate selection for the tumor phenotype.
Table 1.
MMR gene deficiency | Species | STR | Del:Ins | Reference |
---|---|---|---|---|
MSH2 | mouse | (G/C)7–8 | 4:1 | (15) |
yeast | (GT/CA)16.5 | 1:1 | (12) | |
MSH6 | mouse | (G/C)7–8 | 1:1 | (15) |
yeast | (G/C)18 | 3:1 | (12) | |
(GT/CA)16.5 | 1:3.5 | (12) | ||
MSH3 | mouse | (G/C)7–8 | 2:1 | (15) |
yeast | (GT/CA)16.5 | 4:1 | (12) | |
MLH1 | mouse | (G/C)7–8 | 4:1 | (15) |
(A/T)23–27 | 19:1 | (17) | ||
(AC/TG)22–33 | 2:1 | (17) | ||
PMS2 | mouse | (G/C)7–8 | 2:1 | (15) |
(A/T)23–27 | 6:1 | (17) | ||
(AC/TG)22–33 | 1:2 | (17) | ||
human | (GT/CA)10 | 1:11 | unpublished observations | |
(TTCC/AAGG)9 | 1:21 | (14) | ||
(TTTC/AAAG)9 | 1:4 | (14) |
Loss of specific members of the MutL family also produces distinct mutational biases. For mononucleotide [G/C]7–8 alleles in mice, 80% of mutations in the absence of MLH1 were deletions, compared to only 64% with PMS2 loss (Table 1) (15). The difference between MutL members (MLH1 and PMS2) was even more dramatic for an AC/TG dinucleotide locus, in that MLH1-deficient mice displayed a bias in favor of deletions while PMS2 deficient mice displayed a bias in favor of expansions (17). In our recent publication, we demonstrate that a bias in favor of expansions within microsatellites is also observed in PMS2-deficient human cells (14). As shown in Table 1, deficiency of PMS2 increases expansion mutations as compared to deletions within the [TTCC/AAGG]9 and [TTTC/AAAG]9 tetranucleotide repeats, as well as within a [GT/CA]10 dinucleotide repeat. This bias may be significant in vivo, as loss of PMS2 allows nuclear localization of MLH3 (6), thereby potentially changing the mutation profile. We proposed that repair proteins such as MLH1-MLH3 may be able to compensate for PMS2 loss, as MMR components have very distinct effects during DNA repair. Consistent with this interpretation, knockout mice with PMS2/MLH3 deficiency display much greater MSI than PMS2 deficient mice (18). Taken together, these experimental data demonstrate that loss of PMS2 generates microsatellite mutational biases that are quite distinct from those generated by loss of MLH1.
Target genes of MSI
Three mechanisms have been described by which alterations in microsatellite allele length directly affect neoplastic progression: 1) Gene inactivation due to frameshift mutations within exonic microsatellites; 2) Gene activation due to enhanced transcription factor binding of microsatellites within promoter regions; and 3) Microsatellite allele length differences that affect gene expression levels.
Mutation of mononucleotide microsatellites within exonic regions has been extensively analyzed in MSI colorectal tumors (reviewed in (19)). Classes of genes known to be altered in MSI tumors include DNA repair, cell growth and signaling, apoptosis, and transcription factors. For example, the TGFBRII gene contains an (A)10 repeat which is mutated in 86% of MSI positive tumors, compared to just 0.6% of MSI negative tumors (20). It is also interesting to note that several of the MMR genes, i.e., MSH6, MSH3, PMS2 and MLH3, also contain mononucleotide microsatellites. This property has been proposed to allow fine tuning of mutation rates for adaptation during evolution (21) and may also play a role in the immune response (22). MMR is highly involved in class switch recombination and somatic hypermutation, and thus control of immunoglobulin diversity. Switch regions contain highly repetitive G/C rich sequences, and loss of individual MMR proteins has been shown to differentially regulate the frequency of recombination within these regions.
Intragenic microsatellites located within introns or 5′ and 3′ UTRs should also be considered as potential target genes for MSI. For example, experiments to elucidate the mechanism of gene regulation by the EWS/FLI oncogenic transcription factor found that a minimum of 4 consecutive GGAA sequences need to be present in the promoter of target genes for transcriptional activity. Importantly, transactivation increased with each additional unit of GGAA, and differences in microsatellite size and sequence have been suggested to be a factor determining susceptibility to Ewing’s sarcoma (23). Microsatellite allele length polymorphisms are known to be correlated with differential gene expression. A relevant example for neoplasia is the EGFR gene, which contains a polymorphic (CA)14–21 unit repeat within intron 1. Variation in the number of (CA) units causes differing levels of EGFR gene expression and concomitant protein production (shorter alleles produce greater protein), which may be a predictor of clinical outcome in breast cancer (24).
The mutator phenotype model of cancer, which is appropriate for the description of MMR-deficient tumors, postulates that tumors evolve as a heterogeneous collection of genetically distinct cells, each having different, but frequently overlapping, patterns of mutation (25). Direct analyses of mononucleotide alleles in MSI colorectal carcinomas have shown that strand slippage mutations are as abundant in short, intergenic microsatellites as within intragenic, coding repeats (26). Thus, in addition to microsatellite sequence changes that are phenotypically selected during tumor cell evolution, a high frequency of random (“passenger”) microsatellite changes is expected to arise throughout the genome of individual MMR-deficient tumor cells.
Perspective
Long regarded as “junk DNA”, microsatellites are emerging as important regulators of genome function and gene expression. Historically, the primary clinical use of microsatellites has been as markers of genome instability. However, a more direct function of microsatellite instability is to create genetic diversity for tumor cell evolution or immune system adaptability. Mutational biases (e.g., high versus low mutability; insertion versus deletion events) within microsatellites are caused by intrinsic DNA features and by loss of a specific MMR protein. Such biases will directly determine the complement of microsatellite-associated target genes that are affected in MSI tumors. In addition, individual germline polymorphisms in microsatellite sequence purity and length will affect the inherent mutability of a particular microsatellite allele. We postulate that somatic mutational biases, together with germline microsatellite polymorphisms, may underlie the tissue-specific and individual-specific pattern of tumor formation observed among HNPCC and other MSI patients. The unique pattern of mutational events – error type, affected unit size, and length of the microsatellite – arising within an MMR-defective tumor cell population will thus shape an individual’s tumor genome landscape, leading to a distinct clinical phenotype. Understanding the genotypic-phenotypic relationships in MMR-deficient tumors is necessary to develop better predictive models of disease progression and novel therapeutic approaches.
Acknowledgments
Grant Support: National Institutes of Health (RO1 CA100060; K. Eckert) and the Jake Gittlen Cancer Research Foundation.
References
- 1.Ellegren H. Microsatellites: simple sequences with complex evolution. Nat Rev Genet. 2004;5:435–45. doi: 10.1038/nrg1348. [DOI] [PubMed] [Google Scholar]
- 2.Li YC, Korol AB, Fahima T, Nevo E. Microsatellites within genes: structure, function, and evolution. Mol Biol Evol. 2004;21:991–1007. doi: 10.1093/molbev/msh073. [DOI] [PubMed] [Google Scholar]
- 3.Hsieh P, Yamane K. DNA mismatch repair: Molecular mechanism, cancer, and ageing. Mechanisms of Ageing and Development. 2008;129:391–407. doi: 10.1016/j.mad.2008.02.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lynch HT, Lynch PM, Lanspa SJ, Snyder CL, Lynch JF, Boland CR. Review of the Lynch syndrome: history, molecular genetics, screening, differential diagnosis, and medicolegal ramifications. Clin Genet. 2009;76:1–18. doi: 10.1111/j.1399-0004.2009.01230.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Peltomaki P. Lynch Syndrome Genes. Familial Cancer. 2005;4:227–32. doi: 10.1007/s10689-004-7993-0. [DOI] [PubMed] [Google Scholar]
- 6.Korhonen MK, Raevaara TE, Lohi H, Nystrom M. Conditional nuclear localization of hMLH3 suggests a minor activity in mismatch repair and supports its role as a low-risk gene in HNPCC. Oncol Rep. 2007;17:351–4. [PubMed] [Google Scholar]
- 7.Clyne M, Offman J, Shanley S, et al. The G67E mutation in hMLH1 is associated with an unusual presentation of Lynch syndrome. Br J Cancer. 2009;100:376–80. doi: 10.1038/sj.bjc.6604860. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Foulkes WD, Thiffault I, Gruber SB, et al. The Founder Mutation MSH2*1906G-->C Is an Important Cause of Hereditary Nonpolyposis Colorectal Cancer in the Ashkenazi Jewish Population. The American Journal of Human Genetics. 2002;71:1395–412. doi: 10.1086/345075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Eckert KA, Hile SE. Every microsatellite is different: Intrinsic DNA features dictate mutagenesis of common microsatellites present in the human genome. Molecular Carcinogenesis. 2009;48:379–88. doi: 10.1002/mc.20499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kelkar YD, Tyekucheva S, Chiaromonte F, Makova KD. The genome-wide determinants of human and chimpanzee microsatellite evolution. Genome Res. 2008;18:30–8. doi: 10.1101/gr.7113408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Eckert KA, Yan G, Hile SE. Mutation rate and specificity analysis of tetranucleotide microsatellite DNA alleles in somatic human cells. Mol Carcinog. 2002;34:140–50. doi: 10.1002/mc.10058. [DOI] [PubMed] [Google Scholar]
- 12.Sia EA, Kokoska RJ, Dominska M, Greenwell P, Petes TD. Microsatellite instability in yeast: dependence on repeat unit size and DNA mismatch repair genes. Mol Cell Biol. 1997;17:2851–8. doi: 10.1128/mcb.17.5.2851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hawk JD, Stefanovic L, Boyer JC, Petes TD, Farber RA. Variation in efficiency of DNA mismatch repair at different sites in the yeast genome. Proceedings of the National Academy of Sciences of the United States of America. 2005;102:8639–43. doi: 10.1073/pnas.0503415102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Shah SN, Eckert KA. Human postmeiotic segregation 2 exhibits biased repair at tetranucleotide microsatellite sequences. Cancer Res. 2009;69:1143–9. doi: 10.1158/0008-5472.CAN-08-3499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hegan DC, Narayanan L, Jirik FR, Edelmann W, Liskay RM, Glazer PM. Differing patterns of genetic instability in mice deficient in the mismatch repair genes Pms2, Mlh1, Msh2, Msh3 and Msh6. Carcinogenesis. 2006;27:2402–8. doi: 10.1093/carcin/bgl079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kazunori Otsuka TS, Shibata Hiroyuki, Kato Satoshi, Sakayori Masato, Shimodaira Hideki, Kanamaru Ryunosuke, Ishioka Chikashi. Analysis of the human APC mutation spectrum in a Saccharomyces cerevisiae strain with a mismatch repair defect. International Journal of Cancer. 2003;103:624–30. doi: 10.1002/ijc.10883. [DOI] [PubMed] [Google Scholar]
- 17.Yao X, Buermeyer AB, Narayanan L, et al. Different mutator phenotypes in Mlh1-versus Pms2-deficient mice. Proc Natl Acad Sci U S A. 1999;96:6850–5. doi: 10.1073/pnas.96.12.6850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Chen PC, Dudley S, Hagen W, et al. Contributions by MutL homologues Mlh3 and Pms2 to DNA mismatch repair and tumor suppression in the mouse. Cancer Res. 2005;65:8662–70. doi: 10.1158/0008-5472.CAN-05-0742. [DOI] [PubMed] [Google Scholar]
- 19.Woerner SM, Kloor M, von Knebel Doeberitz M, Gebert JF. Microsatellite instability in the development of DNA mismatch repair deficient tumors. Cancer Biomark. 2006;2:69–86. doi: 10.3233/cbm-2006-21-208. [DOI] [PubMed] [Google Scholar]
- 20.Sima Salahshor UK, Påhlman Lars, Glimelius Bengt, Lindmark Gudrun, Lindblom Annika. Colorectal cancer with and without microsatellite instability involves different genes. Genes, Chromosomes and Cancer. 1999;26:247–52. [PubMed] [Google Scholar]
- 21.Chang DK, Metzgar D, Wills C, Boland CR. Microsatellites in the Eukaryotic DNA Mismatch Repair Genes as Modulators of Evolutionary Mutation Rate. Genome Research. 2001;11:1145–6. doi: 10.1101/gr.186301. [DOI] [PubMed] [Google Scholar]
- 22.Peled JU, Kuang FL, Iglesias-Ussel MD, et al. The Biochemistry of Somatic Hypermutation. Annual Review of Immunology. 2008;26:481–511. doi: 10.1146/annurev.immunol.26.021607.090236. [DOI] [PubMed] [Google Scholar]
- 23.Gangwal K, Sankar S, Hollenhorst PC, et al. Microsatellites as EWS/FLI response elements in Ewing’s sarcoma. Proc Natl Acad Sci U S A. 2008;105:10149–54. doi: 10.1073/pnas.0801073105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Brandt B, Meyer-Staeckling S, Schmidt H, Agelopoulos K, Buerger H. Mechanisms of EGFR gene transcription modulation: relationship to cancer risk and therapy response. Clin Cancer Res. 2006;12:7252–60. doi: 10.1158/1078-0432.CCR-06-0626. [DOI] [PubMed] [Google Scholar]
- 25.Loeb LA, Bielas JH, Beckman RA. Cancers exhibit a mutator phenotype: clinical implications. Cancer Res. 2008;68:3551–7. doi: 10.1158/0008-5472.CAN-07-5835. [DOI] [PubMed] [Google Scholar]
- 26.Sammalkorpi H, Alhopuro P, Lehtonen R, et al. Background mutation frequency in microsatellite-unstable colorectal cancer. Cancer Res. 2007;67:5691–8. doi: 10.1158/0008-5472.CAN-06-4314. [DOI] [PubMed] [Google Scholar]