Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2008 Mar 24.
Published in final edited form as: Biostatistics. 2005 Apr 14;6(3):450–464. doi: 10.1093/biostatistics/kxi021

Accuracy of MSI testing in predicting germline mutations of MSH2 and MLH1: a case study in Bayesian meta-analysis of diagnostic tests without a gold standard

Sining Chen 1,, Patrice Watson 2, Giovanni Parmigiani 3
PMCID: PMC2274000  NIHMSID: NIHMS41830  PMID: 15831578

SUMMARY

Microsatellite instability (MSI) testing is a common screening procedure used to identify families that may harbor mutations of a mismatch repair (MMR) gene and therefore may be at high risk for hereditary colorectal cancer. A reliable estimate of sensitivity and specificity of MSI for detecting germline mutations of MMR genes is critical in genetic counseling and colorectal cancer prevention. Several studies published results of both MSI and mutation analysis on the same subjects. In this article we perform a meta-analysis of these studies and obtain estimates that can be directly used in counseling and screening. In particular, we estimate the sensitivity of MSI for detecting mutations of MSH2 and MLH1 to be 0.81 (0.73–0.89). Statistically, challenges arise from the following: (a) traditional mutation analysis methods used in these studies cannot be considered a gold standard for the identification of mutations; (b) studies are heterogeneous in both the design and the populations considered; and (c) studies may include different patterns of missing data resulting from partial testing of the populations sampled. We address these challenges in the context of a Bayesian meta-analytic implementation of the Hui–Walter design, tailored to account for various forms of incomplete data. Posterior inference is handled via a Gibbs sampler.

Keywords: Diagnostic test, Hereditary nonpolyposis colorectal cancer, Microsatellite instability, Sensitivity, Specificity

1. INTRODUCTION

Hereditary nonpolyposis colorectal cancer (HNPCC) (Lynch and de la Chapelle, 1999) is the most common hereditary cancer syndrome. It can be caused by a deleterious germline mutation of one of the mismatch repair (MMR) genes, mainly MSH2 and MLH1. These mutations confer substantially increased lifetime risk of cancer of the colorectum and other sites (Vasen et al., 2001; Lin et al., 1998). Microsatellite instability (MSI) is the amplification or deletion within DNA microsatellites (tandemly repeated sequences that appear throughout the genome). This phenotype is caused by a general failure of the DNA MMR system. All HNPCC cases, i.e. cases with inherited defects in the MMR genes, have a strong tendency to develop tumors with this phenotype. For a few years since MSI was discovered in association with HNPCC (Thibodeau et al., 1993), this phenomenon was not uniformly characterized. A variety of laboratories developed their own tools for measuring MSI, and related studies indicated that many markers could be used to distinguish MSI tumors from microsatellite stable (MSS) ones. In 1998, ‘The International Workshop on Microsatellite Instability and RER Phenotypes in Cancer Detection and Familial Predisposition’ was held to develop a concensus definition of MSI and to establish unifying criteria for its measurement (Boland et al., 1998). As a result, the following five markers are now used for the identification of MSI tumors: BAT26, BAT25, D5S346, D2S123, and D17S250. Instability in more than two of the five markers indicates an MSI tumor. To date, MSI is found in more than 70% of HNPCC-related cancers (Liu et al., 2000) and up to 10–20% of sporadic colorectal cancers (Salovaara et al., 2000; Lothe et al., 1993). MSI testing is aimed at detecting MMR deficiencies and has become widely used as a prescreening procedure for possible HNPCC germline mutations, thanks to the simpler and less expensive technique compared to most mutation analyses (Boland et al., 1998).

A reliable estimate of sensitivity and specificity of MSI for detecting germline mutations of MMR genes is critical in genetic counseling and colorectal cancer prevention. In particular, knowing one’s probability of carrying an inherited deleterious mutation on the MSH2 and MLH1 genes prior to genetic testing by mutation analysis is important for decision making about genetic testing, disease prophylaxis, family planning, and more. As many colorectal cancer patients undergo MSI testing before mutation analysis, pretest carrier probabilities need to incorporate information about MSI testing, as done in commonly used prediction software such as BayesMendel (Chen et al., 2004). This requires estimates of the sensitivity of MSI, defined as the probability of a subject’s tumor sample being microsatellite unstable (MSI = 1) given he/she is carrying a deleterious germline mutation of MSH2 or MLH1 (Mut = 1), and of the specificity of MSI, defined as the probability of the subject’s tumor sample being MSS (MSI = 0) given he/she is not carrying a mutation (Mut = 0). Also, accurate estimates of the test properties along with related costs are determinants in designing screening and surveillance programs for colorectal cancer (Ramsey et al., 2001; Reyes et al., 2002).

Several investigations have reported testing a set of tumors both for germline mutations and for MSI, and then comparing the results. Among these, studies using tumors from high-risk colorectal cancer cases have relatively small sample sizes. Yet accurate risk prediction and genetic counseling require a reliable estimate of MSI sensitivity and specificity. In this article we perform a meta-analysis of these studies and obtain estimates of sensitivity and specificity as defined above. Statistically, challenges arise from the following: (a) traditional mutation analysis methods used in published studies cannot be considered a gold standard for the identification of mutations (Yan et al., 2000); (b) studies are heterogeneous in both the design and the populations considered; and (c) studies may include different patterns of missing data resulting from partial testing of the populations sampled. We address these challenges using a Bayesian meta-analysis motivated by the Hui–Walter approach (Hui and Walter, 1980). This approach exploits the availability of multiple studies that sample populations with different mutation prevalences, to obtain estimates of sensitivity and specificity of two tests without requiring either one to be a gold standard. We develop specific adaptations designed to accommodate incomplete data and heterogeneity in the experimental designs of the available studies. Posterior inference is handled via a Gibbs sampler.

2. DATA AND METHODS

For our meta-analysis, we considered published studies that involved testing colorectal cancer tumor tissues for MSI and/or germline mutations. We initially identified a number of such studies by interviewing experts, searching on PUBMED, searching reference sets of previously identified papers, and searching citation indices. Not all studies with such results could be included in the analysis. The reason for exclusion and a more detailed description of each study are provided in the Appendix. We then selected studies with clearly reported test results. Examining the results of the literature search, the following issues arose.

The first issue is the absence of a gold standard. In the published studies, tumor samples were tested with a variety of mutation analysis strategies including direct sequencing alone, denaturing gradient gel electrophoresis (DGGE) alone, single-strand conformation analysis (SSCA) alone, polymerase chain reaction (PCR) + SSCA + heteroduplex analysis (HA), DGGE + reverse transcriptase-polymerase chain reaction (RT-PCR) + protein truncation test (PTT), and many others. We discuss the handling of these testing strategies in Section 4. For now, we refer to these tests collectively as ‘mutation analysis’. Studies have demonstrated that these conventional mutation testing strategies are not sensitive to certain complex mutations (Wang et al., 2001; Sophie et al., 2001), or large genomic deletions and rearrangements (Yan et al., 2000; Payne et al., 2000; Nakagawa et al., 2002; Wang et al., 2003; Renkonen et al., 2003). The mutations that are likely to be missed by conventional methods make up a significant fraction of all mutations related to colorectal cancers. Because of this limitation, in these studies, the true genotype is not always observed. Using germline testing results as the gold standard can lead to biased estimates of MSI sensitivity and specificity.

The second issue is population heterogeneity both between and within studies. Several studies we included tested families that can be categorized into different groups according to certain clinical criteria. It is likely that the prevalence of deleterious mutation differs from group to group, but the authors often chose not to report the results by groups but rather gave aggregate results. Meanwhile, other studies may focus on a single more homogeneous population, but the populations may be different across studies. Overall, this scenario is a challenge but also a strength because the existence of such strata allows us to account for the lack of a gold standard, as we will discuss later.

The third issue is incompleteness in the data. An ideal study is one where the same set of subjects are tested both for MSI and germline mutations. However, not all studies have complete data in this sense. In Terdiman et al. (2002), Wang et al. (2003), Loukola et al. (1999a), and Salovaara et al. (2000), it was expected that the yield of mutations would be low among MSI-negative tumors and only those samples that tested positive for MSI were genotyped. In Cederquist et al. (2001), only 23 out of the 35 MSI positives were genotyped. Ponz de Leon et al. (2004) only gave mutation analysis results, whereas Percesepe et al. (2001), Salahshor et al. (1999), and Dietmaier et al. (1997) only gave MSI results. While the data in these studies are incomplete, they may provide important information on MSI accuracy. Therefore, the inclusion of such data was judged to be worth the additional modeling effort required.

In our analysis, we stratified the study populations into a high-risk group and a low-risk group. The high-risk group includes families that fulfilled the Amsterdam criteria, or met any modified clinical guidelines for the purpose of identifying HNPCC families. In addition, we have included in the high-risk group all families and individual cases that can be classified as being at significantly elevated risk, based on descriptors, such as ‘with age of onset before 35’, ‘with two or more close relatives with colorectal cancer’, or ‘exhibiting familial clustering of colorectal cancer (strong and positive family history)’. The low-risk group includes cases from population-based registries. The grouping we use here attempts, within the limits imposed by the information available, to distinguish between data gathered in high-risk clinical settings, most of which is self-referred, and data gathered using population-based strategies. While our approach would have been applicable, and potentially more efficient, to a larger set of ordered categories of risk, we stopped at two because several studies tested a possibly heterogeneous sample and only reported aggregate results, and to assure that each group has a reasonable sample size. The stratified data are shown in Table 1.

Table 1.

A list of the studies included. Data are stratified into a high-risk group in the top portion and a low-risk group in the bottom portion. In each row, data are summarized in four columns by MSI test results (MSI = 0 or 1) and mutation analysis results (Mut = 0 or 1). Missing counts due to samples without one of the two tests are represented by x. and y.. For example, x3 stands for the potential number of mutation analysis positive subjects in Ponz de Leon et al. (2004) who would have been tested positive for MSI. Among the MSI-positive group of 35 in Cederquist et al. (2001), mutation analysis was performed on 23 samples among which eight were found to harbor germline mutations. No mutation analysis was performed on the other 12 samples. x1 represents the potential number of mutation positives among these 12 untested subjects

Study High risk Low risk MSI = 1, Mut = 1 MSI = 1, Mut = 0 MSI = 0, Mut = 1 MSI = 0, Mut = 0
Bapat et al. (1999) 16 1 2 20
Calistri et al. (2000) 8 8 0 9
Debniak et al. (2000) 5 4 1 15
Dieumegard et al. (2000) 7 7 0 2
Lamberti et al. (1999) 13 22 2 10
Liu et al. (2000) 16 6 1 36
Scartozzi et al. (2002) 0 1 4 22
Wahlberg et al. (2002) 14 14 0 20
Cederquist et al. (2001) 8 + x1 15 + (12 − x1) x2 43 − x2
Ponz de Leon et al. (2004) x3 x4 89 − x3 75 − x4
Terdiman et al. (2001) 21 11 x5 63 − x5
Wang et al. (2003) 92 88 x6 188 − x6
Debniak et al. (2000) 0 5 0 38
Dieumegard et al. (2000) 0 0 0 7
Loukola et al. (1999) 10 53 y1 446 − y1
Percesepe et al. (2001) y2 28 − y2 y3 308 − y3
Salahshor et al. (1999) y4 22 − y4 y5 159 − y5
Dietmaier et al. (1997) y6 18 − y6 y7 130 − y7
Salovaara et al. (2000) 18 48 y8 469 − y8

If all missing counts in Table 1 were observed, then the stratified data would fit into the framework of a Hui–Walter design (Hui and Walter, 1980). In this design, the sensitivity and specificity of MSI testing can be estimated without ascertaining the true genetic status of the subjects, by exploiting the fact that the strata that are known to have different prevalences. Data consist of a three-way tabulation of the results of two tests and the stratum indicator. The Hui–Walter approach relies on two key modeling assumptions: independence of the testing modalities and different prevalences among the high- and low-risk groups (Parmigiani, 2002; Johnson et al., 2001). Further discussion of these assumptions is in Section 4.

To write the likelihood function, we define the multinomial probabilities of an individual falling into the four cells MSI = 1 & Mut = 1, MSI = 1 & Mut = 0, MSI = 0 & Mut = 1, and MSI = 0 & Mut = 0 as a, b, c, and d, respectively. Subscripts ‘h’ and ‘l’ will denote the high-risk group and low-risk group, respectively, while subscript + and - will denote true carrier status. For example, ah+ are MSI positive, mutation analysis positive, and true carriers in the high-risk group. We denote the sensitivity and specificity of MSI testing by βM and αM, those of the mutation analysis by βT and αT, and the mutation prevalences in the high-risk and low-risk groups by πh and πl. Then, using the independence assumption for the two testing modalities, we can write for the high-risk group,

ahah++ah=πhβMβT+(1πh)(1αM)(1αT),bhbh++bh=πhβM(1βT)+(1πh)(1αM)αT,chch++ch=πh(1βM)βT+(1πh)αM(1αT),dhdh++dh=πh(1βM)+(1βT)+(1πh)αMαT.

Similar relationships apply to the low-risk group. We also use θ to denote the parameter vector (αT, αM, βT, βM, πh, πl) and O to denote the totality of observed variables in Table 1.

Using this notation, we could write the contribution from each study to the likelihood function, depending on the stratum and missing data pattern. For example, the contribution from Bapat et al. (1999) was

ah16bh1ch2dh20,

that from Terdiman et al. (2001) is

ah21bh11(ch+dh)63,

and that from Percesepe et al. (2001) is

(al+bl)28(cl+dl)308.

Assuming independence of study results conditional on θ, the likelihood is the product of the contribution from each study, that is,

L(θ|O)={ah79bh63ch10dh134}{ah8bh15(ah+bh)12(ch+dh)45}{(ah+ch)89(bh+dh)75} ×{ah21bh11(ch+dh)63}{ah92bh88(ch+dh)188}{bl5dl38}dl7{al10bl53(cl+dl)446} ×{(al+bl)28(cl+dl)308}{(al+dl)22(cl+dl)159} ×{(al+bl)18(cl+dl)130}{al18bl48(cl+dl)469}=ah200bh177ch10dh134(ah+bh)12(ch+dh)294(ah+ch)89(bh+dh)75 ×al28bl106dl45(al+bl)68(cl+dl)1512, (2.1)

where each bracketed expression corresponds to a study. Under our modeling assumptions, all six parameters βM, αM, βT, αT, πh, and πl are identified, as long as the constraint πh > πl holds.

We drew inferences about θ via a Bayesian analysis. Advantages include the possibility to derive exact probability intervals without relying on asymptotic approximations, and ease of handling both the missing data patterns and the parametric constraint required for identifiability. Within this constraint, we adopted a uniform prior on all components of θ, with the exception of the specificity αT of mutation analysis, which we assumed to be 100%, a value likely to be close to the true specificity. However, as a sensitivity analysis, we also reported results using a uniform prior on αT, the opposite extreme in term of informativeness.

From Bayes theorem, the posterior is proportional to expression (2.1) wherever the prior is nonzero. Computation was simplified by introducing two sets of auxiliary variables: the counts of test results for subjects who have not been tested by both tests, and the counts of true genotypes for all subjects. Denoting auxiliary variables by A, we implemented a Markov chain Monte Carlo algorithm to draw from the distribution of (A, θ) given O (Tanner and Wong, 1987).

Our algorithm requires an expression for the so-called complete data likelihood, that is, p(A, O|θ). This is given by

ah+na,h+·ahna,h·bh+nb,h+·bhnb,h·ch+nc,h+·chnc,h·dh+nd,h+·dhnd,h·al+na,l+·alna,l·bl+nb,l+·blnb,l·cl+nc,l+·clnc,l·dl+nd,l+·dlnd,l. (2.2)

Each exponent in (2.2) arises as the sum of contributions from studies with different missing data structures, which needed to be simulated separately, requiring additional auxiliary variables defined in Table 2. To explain the significance of the auxiliary variables, for example, we observe in the high-risk group that 200 subjects are MSI positive and mutation analysis positive (MSI+ & Mut+) and 177 are MSI+ & Mut−. This corresponds to the first two factors in the expression (2.1), ah200bh177. After we assume the carrier status known, among the 200 MSI+ & Mut+ subjects, na,h+(1) would be the true carriers and na,h(1)=200na,h+(1) the true noncarriers. Similarly, among the 177 MSI+ & Mut− subjects, nb,h+(1) would be true carriers and nb,h(1) noncarriers. Thus, the corresponding factors in the complete data likelihood were written as ah+na,h+(1)ahna,h(1)bh+nb,h+(1)bhnb,h(1) with the constraints na,h+(1)+na,h(1)=200andnb,h+(1)+nb,h(1)=177. In other words, ah200bh177 is the result of integrating out the auxiliary variables na,h+(1),na,h(1),nb,h+(1),nb,h(1)fromah+na,h+(1)ahna,h(1)bh+nb,h+(1)bhnb,h(1) within the constraints. Details of the Gibbs sampler and the various constraints placed on the components are available from the first author.

Table 2.

Notation: the number in each cell can be broken down into those with a true germline mutation (subscript ‘+’) and those without (subscript ‘−’). High risk and low risk are indicated by the letter ‘h’ and ‘l’, respectively, in the subscript. The superscripts in parentheses signify contributions from studies with different missing data structures

Mut = 1 Mut = 0
High risk
   MSI = 1 na,h+=na,h+(1)+na,h+(2)+na,h+(3) nb,h+=nb,h+(1)+nb,h+(2)+nb,h+(3)
na,h=na,h(1)+na,h(2)+na,h(3) nb,h=nb,h(1)+nd,h(2)+nb,h(3)
   MSI = 0 nc,h+=nc,h+(1)+nc,h+(2)+nc,h+(3) nd,h+=nd,h+(1)+nd,h+(2)+nd,h+(3)
nc,h=nc,h(1)+nc,h(2)+nd,h(3) nd,h=nd,h(1)+nd,h(2)+nd,h(3)
Low risk
   MSI = 1 na,l+=na,l+(1)+na,l+(2) nb,l+=nb,l+(1)+nb,l+(2)
na,l=na,l(1)+na,l(2) nb,l=nb,l(1)+nd,l(2)
   MSI = 0 nc,l+ = nc,l+ nd,l+=nd,l+(1)+nd,l+(2)
nc,l− = nc,l− nd,l=nd,l+(1)+nd,l(2)

3. RESULTS

Posterior means and 95% credible intervals (CIs) on the parameteres of interest are shown in Table 3. The estimated specificity of MSI testing was 92% (95% CI, 90–94%), indicating that MSI is a more specific test than previously suggested (Salovaara et al., 2000; Lothe et al., 1993). Combined with an estimated sensitivity of 81%, our estimates translate into a positive predictive value (PPV) of 0.93 (95% CI, 0.90–0.95) among high-risk families. However, the PPV of MSI is drastically lower among unselected colorectal cancer cases: we estimate it to be 0.34 (95% CI, 0.24–0.48). Both PPV estimates are consistent with practical clinical experiences in MSI screening. Our prevalence estimate of true carriers among unselected colorectal cancer, 3–7%, coincides with previous estimates (Aaltonen et al., 1994, 1998).

Table 3.

Summary of inferences on parameters. The left part of the table is obtained by setting αT to 1. The right part is obtained by placing a uniform prior on αT

αT = 1
αT ~ u(0, 1)
Mean 95% PI Mean 95% PI
MSI specificity αM 0.92 (0.90, 0.94) 0.92 (0.90, 0.93)
MSI sensitivity βM 0.81 (0.73, 0.89) 0.85 (0.74, 0.96)
Mutation analysis specificity αT 1 0.97 (0.91, 1.0)
Mutation analysis sensitivity βT 0.62 (0.56, 0.67) 0.62 (0.57, 0.68)
Prevalence in high-risk group πh 0.56 (0.50, 0.63) 0.53 (0.46, 0.61)
Prevalence in low-risk group πl 0.049 (0.032, 0.071) 0.042 (0.022, 0.064)

We estimated the sensitivity of mutation analysis to be 62%. While this is a modest value, our estimates are comparable to those reported for similar mutation analysis techniques for other large cancer susceptibility genes. For example, Ford et al. (1998) reported an estimated sensitivity of 63% (51–77%) for direct sequencing in detecting mutations on BRCA1. They also find that the sensitivity for other mutation screening techniques is not significantly different from that of direct sequencing. While clinical use of sequencing-based testing is still justified in view of the much higher specificity of sequencing compared to MSI testing, our results further emphasize the need for cautious interpretation of negative test results in families with a large number of colorectal cancer cases (Yan et al., 2000).

At each iteration of the Gibbs sampler, we generated a draw of all entries in Table 2. These draws can be used to make inferences on the missing counts in Table 1. For example, in the Ponz de Leon et al. (2004) study, x3 was imputed as 72 on average by the Gibbs sampler, which means that 72 out of the 89 mutation-positive subjects in the study would have been tested positive for MSI. In addition, x1 was on average 7 and x4 was on average 24. The other missing counts could be estimated up to a sum. The quantities x2 + x5 + x6, y2 + y4 + y6, and y1 + y3 + y5 + y7 + y8 had posterior means around 38, 14, and 10, respectively.

With the Gibbs sampler, in addition to inferences on missing portions of the original studies, we also obtained inferences on the true genotypes for all subjects. Computing the total of each of the rightmost four columns of Table 1, and then breaking them down using imputed carrier status results, we produced Table 4. From Table 4 we can see that although MSI is quite specific, because of the low prevalence of mutations in the low-risk population, there will still be a significant portion of false positives (an estimated 133 out of 202).

Table 4.

The data table broken down by imputed genotype. The top table is imputated under the constraint that the specificity of mutation analysis is 1, the bottom without the constraint. In each cell, the first number is the expected number of true mutation carriers of all subjects in that corresponding column in Table 1; the second number is the number of noncarriers

High risk
Low risk
Mut = 1 Mut = 0 Mut = 1 Mut = 0
αT = 1
   MSI = 1 279 + 0 171 + 35 42 + 0 27 + 133
   MSI = 0 65 + 0 41 + 400 10 + 0 7 + 1541
αT < 1
   MSI = 1 278 + 1 168 + 38 38 + 4 24+ 136
   MSI = 0 51 + 13 32 + 410 7 + 45 4 + 1500

As a sensitivity analysis, we relaxed the assumption that the mutation analysis specificity is 100%, that is, that αT ≡ 1, in favor of a uniform prior. Results are also shown in Table 3 and Table 4. The posterior mean of the mutation analysis specificity was 0.97 (95% CI, 0.91–1.0). Comparing the rest of the parameter estimates with those obtained under αT ≡ 1, we can see that this departure from the assumption had a small impact on the posterior inference. The CIs were only slightly wider, as a result of increased uncertainty. This suggests that the likelihood is informative about the parameters of interest. Also, since some of the true positives (in terms of mutation analysis) under the specificity assumption were imputed as false positives, the posterior distributions of the proportions of carriers in the two groups shifted slightly to the left. The MSI sensitivity estimate increased slightly from 0.81 (0.73–0.89) to 0.85 (0.74–0.96). Comparing the upper portion of Table 4 to the lower portion, in all the mutation-positive cells, a few were imputed as false positives in the lower portion. However, such imputation occurred more often among MSI-negative subjects than among MSI-positive ones, causing a small increase in the overall MSI sensitivity. The only sensitive figure was the posterior mean of y1 + y3 + y5 + y7 + y8, which increased from 12 to 47 when a uniform prior was assigned to αT.

To further explore the sensitivity of our inferences to the choice of priors, we considered the study by Aaltonen et al. (1994), who found 25 out of 29 HNPCC patients to be MSI+. While their definition of an ‘HNPCC patient’ is different from the gold standard used in our paper, it can still provide a plausible reference for a sensitivity analysis. Thus, we explored using a Beta(25, 4) as the prior on βM. The resulting posterior mean and 95% probability interval shifted from 0.81 (0.73, 0.89) to 0.85 (0.76, 0.93), suggesting robustness of our findings in terms of both location and spread. This suggests that the width of the probability intervals we report should adequately convey the level of uncertainty about the parameters based on currently available evidence.

4. DISCUSSION

In this paper we presented a meta-analysis of the accuracy of MSI testing in detecting MSH2 and MLH1 mutations. Our meta-analysis overcame the difficulty posed by the fact that no gold standard is available in the published literature about this issue, and provided estimates that can be directly used in genetic counseling and colorectal cancer screening. Our methodology was a Bayesian meta-analytic implementation of the Hui–Walter approach, adapted to account for various forms of incomplete data.

We found MSI to be a very sensitive and specific indicator of germline mutations. The estimates of sensitivity and specificity of MSI in detecting MSH2 and MLH1 mutations will be incorporated in the latest version of the carrier probability package BayesMendel (Chen et al., 2004) and the risk prediction model CRCAPRO. The false-positive fraction associated with MSI testing can be large among unselected colorectal cancers. We estimated the sensitivity of mutation analysis at a modest 62%. This result suggests using caution when interpreting negative genetic testing results for potential mutation carriers.

One of the limitations of our study is that we only considered the MSH2 and MLH1 genes. Other, less prevalent HNPCC genes have been found. For example, MSH6 has a mutation prevalence of about 1.7% among unselected colorectal or endometrial cancer cases (Goodfellow et al., 2003). At the present time, it would be difficult to consider other genes because data from published studies involving such genes are scarce.

A further concern is that the published studies have used a variety of mutation analysis techniques. We pooled all these as though they had the same sensitivity. The sensitivity of a particular mutation search method depends on the true spectrum of mutations, particularly the relative frequency of each type of mutation. In the context of HNPCC mutations, the pathogenic mutations discovered to date in MLH1 and MSH2 are varied in type, and the true spectrum of mutations is still unknown. Systematic studies aimed at comparing sensitivity among the mutation search methods are not presently available, and thus there is no evidence as to which methods are more sensitive over the true mutation spectrum than others. In addition, there are practical reasons why allowing for different sensitivities is unlikely to be successful in our meta-analysis: mutation screening strategies differed across studies, and in many cases were not reported in sufficient detail to allow stratified analysis. Also, if we were to assign each of the testing strategies a separate sensitivity and specificity, the sample sizes for each strategy would become too small.

To further explore this issue, we designed a simple simulation experiment where subjects are tested with genotyping approaches with different sensitivities. Our results indicated that, as long as the distributions of genotyping methods in the high-risk and low-risk populations are similar, estimates of MSI specificity and sensitivity remain unbiased. Specifically, we simulated a data set where we tested a highrisk population of 800 (prevalence, 0.6) and a low-risk population of 1700 (prevalence, 0.05) with a prescreen with sensitivity of 0.78 and specificity of 0.92. In each population, we randomly chose half of the subjects to be subsequently genotyped with sensitivity 0.60 and the other half with sensitivity 0.80 (both specificities are 1). To imitate the missing data situation in our meta-analysis, those who were prescreened negative did not receive a second test. With 100 simulations, all parameter estimates were unbiased and the mean relative errors were all of the order of magnitude 10−3 or less.

A key assumption in our analysis is that the results of MSI testing and mutation analysis are independent given the true genotype. In other words, we assume that if a specific mutation is difficult to detect by mutation analysis, the same mutation is not necessarily difficult to detect by MSI testing. As we write, there is no strong biological evidence that such dependence may arise. The errors associated with MSI with respect to the carrier status of MSH2 and MLH1 are likely to arise from links between MSI and changes in the MMR system other than a mutation on MSH2 or MLH1. On the other hand, limitations of sequencing and other forms of conventional mutation analysis arise from the complex spectrum of mutations on MSH2 and MLH1 themselves. Thus, the unknown mechanisms driving test errors are likely to be different, though further validation of this assumption would be helpful.

A second key assumption in our analysis is that the prevalence of mutations in the two groups should be different. This is not a restrictive assumption, given the way the groups were defined. However, groups were separated based on a sensible but subjective choice and it is conceivable that the results may be sensitive to this choice. To address this issue, we performed a simulation experiment. The results indicated that different groupings of the data according to risk do not change the unbiasedness of the estimates of MSI sensitivity and specificity, as long as the risk in one group is higher than the other. In this experiment, in addition to a homogeneous high-risk group of prevalence 0.6 and a low-risk group of prevalence 0.05, we simulated a group of ‘medium-high risk’, with prevalence of 0.3. The rest of the setup was similar to the simulation described above. In case 1 we pooled the medium-high-risk group with the high-risk group; in case 2 we pooled it with the low-risk group. In both cases, the posterior mean was unbiased for all parameters.

ACKNOWLEDGMENTS

The work by Sining Chen and Giovanni Parmigiani is supported by grants NCI-P50CA62924, NCI R01CA105090, and NCI-P30CA06973. The work by Patrice Watson is supported by NIH Grant #1U01 CA86389. The authors thank Hormuzd A. Kakti, Kenneth Kinzler, and Jiang-wen Zhang for helpful comments.

APPENDIX

Summary of studies

While we included as many studies as possible, we had to exclude studies where the subjects are a mixture of high-risk and low-risk cases with no breakdown given, such as Ruszkiewicz et al. (2002) and Banno et al. (2003). We disregarded all immunohistochemistry (IHC) results where reported because the literature suggests a very high correlation between MSI and IHC (Lindor et al., 2002; Debniak et al., 2000; Marcus et al., 1999), which may violate the conditional independence assumption.

In the searches for germline mutations, it is common for studies to sequence those samples that showed alterations that first underwent PTT or DGGE as a confirmation step. However, to our knowledge there was no gain of sensitivity or specificity in this extra step. A few studies used novel methods to search for germline mutations that are hard to detect by conventional methods (Renkonen et al., 2003; Nakagawa et al., 2003; Wang et al., 2003). Unfortunately, since these technologies are very different from conventional methods, pooling them with the conventional methods may introduce bias. We decided to exclude such studies or to exclude mutations found by using such technologies. However, such data can serve as independent validation of the estimated low sensitivity for conventional methods.

Table A1 lists the reasons for exclusion, while Table A2 gives the characteristics of the 19 studies included in the meta-analysis.

Table A1.

Relevant studies and their reasons for being excluded from the meta-analysis Paper Reason for exclusion

Paper Reason for exclusion
Banno et al. (2003) Endometrial tumors
Debniak et al. (2001) Use of Alu-PCR (no conventional mutation analysis)
Loukola et al. (1999b) CR adenomas
Nakagawa et al. (2003) Use of multiplex ligation-dependent probe amplification method
Renkonen et al. (2003) Use of the SNuPE technique
Terdiman et al. (2002) Insufficient presentation of the data

Table A2.

Characteristics of studies that were included in the meta-analysis. Note that the Debniak et al. (2000) study and the Dieumegard et al. (2000) study were each subdivided into a high-risk set and a low-risk set, and thus appeared in Table 1 twice

Paper Ethnic Families Mutation testing strategy Notes
Bapat et al. (1999) Mixed 14 ACa + 19 MCb RT-PCR + PTT
Calistri et al. (2000) Italian 13 AC + 7MGc + 4 fhd Exon-by-exon SSCP
Cederquist et al. (2001) Swedish Double primary tumor at colorectum or endometrium DHPLC confirmed by sequencing MSH6 results excluded
Debniak et al. (2000) Polish 11 def.hnpcce (or highly likely) + 15 sporadic late-onset CRC Sequencing IHC results discarded
Dietmaier et al. (1997) German 148 sporadic CRC None
Dieumegard et al. (2000) French 15 ICGf + 1 fh + 7 sporadic < 50 SSCP IHC results discarded
Lamberti et al. (1999) German 47 AC + 5 looser AC PCR + SSCA + HA, etc. Some data excluded b/c of poor presentation
Liu et al. (2000) Swedish ‘Highly suspicious for HNPCC’ DGGE + RT-PCR + PTT Some AC, some fh, some early CRC, no break-down given
Loukola et al. (1999) Finnish CRC registry DGGE or Sequencing on MSI+
Percesepe et al. (2001) Italian CRC registry None
Ponz de Leon et al. (2004) Italian ‘Consistent with HNPCC’ SSCP or Sequencing MSI results n.a.
Salahshor et al. (1999) Swedish Sporadic CRC None
Salovaara et al. (2000) Finnish Sporadic CRC Search for 2 founder mutation with ASO and PCR-based, then sequencing the negatives
Scartozzi et al. (2002) Italian At least one of the BGg Sequencing
Terdiman et al. (2001) US 23% met AC, 70% BG DGGE
Wahlberg et al. (2002) US 38 AC or BG + 5 MG + 5 HNPCC-like Sequencing
Wang et al. (2003) German BG SSCP, HA, DHPLC followed by sequencing Semiquantitative multiplex PCR result excluded

CRC = colorectal cancer; SSCP = single-strand conformation polymorphism; DHPLC = denaturing high-performance liquid chromatography; ASO = allele-specific oligonucleotide.

a

Amsterdam criteria.

b

Mt. Sinai Hospital criteria.

c

Modified guidelines.

d

Positive family history.

e

Definitive HNPCC.

f

International Collaborative Group criteria.

g

Bethesda guidelines.

Contributor Information

Sining Chen, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, USA schen46@jhmi.edu.

Patrice Watson, Hereditary Cancer Institute, Creighton University School of Medicine, Omaha, NE, USA.

Giovanni Parmigiani, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, USA, and Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.

REFERENCES

  1. Aaltonen LA, Salovaara R, Kristo P, Canzian F, Hemminki A, Peltomaki P, Chadwick RB, Kaariainen H, Eskelinen M, Jarvinen H, et al. Incidence of hereditary nonpolyposis cancer and the feasibility of molecular screening for the disease. The New England Journal of Medicine. 1998;338:1481–1487. doi: 10.1056/NEJM199805213382101. [DOI] [PubMed] [Google Scholar]
  2. Aaltonen LA, Sankila R, Mecklin J, Jarvinen H, Pukkala E, Peltomaki P, De La Chapelle A. A novel approach to estimate the proportion of hereditary nonpolyposis colorectal cancer of total colorectal cancer burden. Cancer Detection and Prevention. 1994;18:57–63. [PubMed] [Google Scholar]
  3. Banno K, Susumu N, Hirao T, Yanokura M, Hirasawa A, Aoki D, Udagawa Y, Sugano K, Nozawa S. Identification of germline MSH2 gene mutations in endometrial cancer not fulfilling the new clinical criteria for hereditary nonpolyposis colorectal cancer. Cancer Genetics and Cytogenetics. 2003;146:58–65. doi: 10.1016/s0165-4608(03)00157-2. [DOI] [PubMed] [Google Scholar]
  4. Bapat B, Madlensky L, Temple L, Hiruki T, Redston M, Baron D, Xia L, Marcus V, Soravia C, Mitri A, et al. Family history characteristics, tumor microsatellite instability and germline MSH2 and MLH1 mutations in hereditary colorectal cancer. Human Genetics. 1999;104:167–176. doi: 10.1007/s004390050931. [DOI] [PubMed] [Google Scholar]
  5. Boland C, Thibodeau S, Hamilton S, Sidransky D, Eshleman J, Burt R, Meltzer S, Rodriguez-Bigas M, Fodde R, Ranzani G, Srivastava S. A National Cancer Institute workshop of MSI for cancer detection and familial predisposition: development of international criteria for the determination of MSI in colorectal cancer. Cancer Research. 1998;58:5248–5257. [PubMed] [Google Scholar]
  6. Calistri D, Presciuttini S, Buonsanti G, Radice P, Gazzoli I, Pensotti V, Sala P, Eboli M, Andreola S, Russo A, et al. Microsatellite instability in colorectal-cancer patients with suspected genetic predisposition. International Journal of Cancer. 2000;89:89–91. doi: 10.1002/(sici)1097-0215(20000120)89:1<87::aid-ijc14>3.0.co;2-9. [DOI] [PubMed] [Google Scholar]
  7. Cederquist K, Golovleva I, Emanuelsson M, Stenling R, Gronberg H. A population based cohort study of patients with multiple colon and endometrial cancer: correlation of microsatellite instability (MSI) status, age at diagnosis and cancer risk. International Journal of Cancer. 2001;91:486–491. doi: 10.1002/1097-0215(200002)9999:9999<::aid-ijc1093>3.0.co;2-p. [DOI] [PubMed] [Google Scholar]
  8. Chen S, Wang W, Broman KW, Katki HA, Parmigiani G. BayesMendel: an R environment for Mendelian risk prediction. Statistical Applications in Genetics and Molecular Biology. 2004;3 doi: 10.2202/1544-6115.1063. http://www.bepress.com/sagmb/vol3/iss1/art21/ [DOI] [PMC free article] [PubMed]
  9. Debniak T, Gorski B, Cybulski C, Jakubowska A, Kurzawski G, Kladny J, Lubinski J. Comparison of Alu-PCR, microsatelite instability, and immunohistochemical analyses in finding features characteristic for hereditary nonpolyposis colorectal cancer. Journal of Cancer Research and Clinical Oncology. 2001;127:565–569. doi: 10.1007/s004320100261. [DOI] [PubMed] [Google Scholar]
  10. Debniak T, Kurzawski G, Gorski B, Kladny J, Domagala W, Lubinski J. Value of pedigree/clinical data, immunohistochemistry and microsatellite instability analyses in reducing the cost of determining hMLH1 and hMSH2 gene mutations in patients with colorectal cancer. European Journal of Cancer. 2000;36:49–54. doi: 10.1016/s0959-8049(99)00208-7. [DOI] [PubMed] [Google Scholar]
  11. Dietmaier W, Wallinger S, Bocker T, Kullmann F, Fishel R, Ruschoff J. Diagnostic microsatellite instability: definition and correlation with mismatch repair protein expression. Cancer Research. 1997;57:4749–4756. [PubMed] [Google Scholar]
  12. Dieumegard B, Grandjouan S, Sabourin JC, Le Bihan ML, Lefrere I, Bellefqih, Pignon JP, Rougier P, Lasser P, Benard J, et al. Extensive molecular screening for hereditary non-polyposis colorectal cancer. British Journal of Cancer. 2000;82:871–880. doi: 10.1054/bjoc.1999.1014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Ford D, Easton DF, Stratton M, Narod S, Goldgar D, Devilee P, Bishop DT, Weber B, Lenoir G, Chang-Claude J, et al. Genetic heterogeneity and penetrance analysis of the BRCA1 and BRCA2 genes in breast cancer families. American Journal of Human Genetics. 1998;62:676–689. doi: 10.1086/301749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Goodfellow P, Buttin B, Herzog T, Rader JS, Gibb RK, Swisher E, Look K, Walls KC, Fan MY, Mutch DG. Prevalence of defective DNA mismatch repair and MSH6 mutation in an unselected series of endometrial cancers. Proceedings of the National Academy of Sciences of the United States of America. 2003;100:5908–5913. doi: 10.1073/pnas.1030231100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Hui SL, Walter SD. Estimating the error rates of diagnostic tests. Biometrics. 1980;36:167–171. [PubMed] [Google Scholar]
  16. Johnson WO, Gastwirth JL, Pearson LM. Screening without a "Gold standard", the Hui-Walter paradigm revisited. American Journal of Epidemiology. 2001;153:921–924. doi: 10.1093/aje/153.9.921. [DOI] [PubMed] [Google Scholar]
  17. Lamberti C, Kruse R, Ruelfs C, Caspari R, Wang Y, Jungck M, Mathiak M, Malayeri H, Friedl W, Sauerbruch T, Propping P. Microsatellite instability-a useful diagnostic tool to select patients at high risk for hereditary non-polyposis colorectal cancer: a study in different groups of patients with colorectal cancer. Gut. 1999;44:839–843. doi: 10.1136/gut.44.6.839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Lin K, Shashidharan M, Thorson A, Ternent C, Blatchford G, Christensen M, Watson P, Lemon SJ, Franklin B, Karr B, et al. Cumulative incidence of colorectal and extracolonic cancers in MLH1 and MSH2 mutation carriers of hereditary colorectal cancer. Journal of Gastrointestinal Surgery. 1998;2:67–71. doi: 10.1016/s1091-255x(98)80105-4. [DOI] [PubMed] [Google Scholar]
  19. Lindor N, Burgart L, Leontovich O, Goldberg R, Cunningham J, Sargent D, Walshvockley C, Petersen G, Walsh M, Leggett B, et al. Immunohistochemistry versus micro-satellite instability testing in phenotyping colorectal tumors. Journal of Clinical Oncology. 2002;20:1043–1048. doi: 10.1200/JCO.2002.20.4.1043. [DOI] [PubMed] [Google Scholar]
  20. Liu T, Wahlberg S, Burek E, Lindblom P, Rubio C, Lindblom A. Microsatellite instability as a predictor of a mutation in a DNA mismatch repair gene in familial colorectal cancer. Genes, Chromosomes and Cancer. 2000;27:17–25. doi: 10.1002/(sici)1098-2264(200001)27:1<17::aid-gcc3>3.0.co;2-y. [DOI] [PubMed] [Google Scholar]
  21. Lothe R, Peltomäki P, Meling G, Aaltonen L, Nyström-Lahti M, Pylkkänen, Heimdal K, Andersen TI, Moller P, Rognum TO. Genomic instability in colorectal cancer: relationship to clinicopathological variables and family history. Cancer Research. 1993;53:5849–5852. [PubMed] [Google Scholar]
  22. Loukola A, De La Chapelle A, Aaltonen L. Strategies for screening for hereditary non-polyposis colorectal cancer. Journal of Medical Genetics. 1999a;36:819–822. [PMC free article] [PubMed] [Google Scholar]
  23. Loukola A, Salovaara R, Kristo P, Moisio AL, Kaariainen H, Ahtola H, Eskelinen M, Harkonen N, Julkunen R, Kangas E, et al. Microsatellite instability in adenomas as a marker for hereditary nonpolyposis colorectal cancer. American Journal of Pathology. 1999b;155:1849–1853. doi: 10.1016/S0002-9440(10)65503-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Lynch H, De La Chapelle A. Genetic susceptibility to non-polyposis colorectal cancer. Journal of Medical Genetics. 1999;36:801–818. [PMC free article] [PubMed] [Google Scholar]
  25. Marcus VA, Madlensky L, Gryfe R, Kim H, So K, Millar A, Temple LK, Hsieh E, Hiruki T, Narod S, et al. Immunohistochemistry for hMLH1 and hMSH2: a practical test for DNA mismatch repair-deficient tumors. American Journal of Surgical Pathology. 1999;23:1248–1255. doi: 10.1097/00000478-199910000-00010. [DOI] [PubMed] [Google Scholar]
  26. Nakagawa H, Hampel H, De La Chapelle A. Identification and characterization of genomic rearrangements of MSH2 and MLH1 in lynch syndrome (HNPCC) by novel techniques. Human Mutation. 2003;22:258. doi: 10.1002/humu.9171. [DOI] [PubMed] [Google Scholar]
  27. Nakagawa H, Yan H, Lockman J, Hampel H, Kinzler KW, Vogelstein B, De La Chapelle A. Allele separation facilitates interpretation of potential splicing alterations and genomic rearrangements. Cancer Research. 2002;62:4579–4582. [PubMed] [Google Scholar]
  28. Parmigiani G. Modeling in Medical Decision Making. New York: John Wiley and Sons; 2002. [Google Scholar]
  29. Payne S, Newman B, King M-C. Complex germline rearrangement of BRCA1 associated with breast and ovarian cancer. Genes, Chromosomes and Cancer. 2000:29–58. doi: 10.1002/1098-2264(2000)9999:9999<::aid-gcc1008>3.0.co;2-f. [DOI] [PubMed] [Google Scholar]
  30. Percesepe A, Borghi F, Menigatti M, Losi L, Foroni M, Di Gregorio C, Rossi G, Pedroni M, Sala E, Vaccina F, et al. Molecular screening for hereditary nonpolyposis colorectal cancer: a prospective, population-based study. Journal of Clinical Oncology. 2001;19:3944–3950. doi: 10.1200/JCO.2001.19.19.3944. [DOI] [PubMed] [Google Scholar]
  31. Ponz De Leon M, Benatti P, Di Gregorio C, Pedroni M, Losi L, Genuardi M, Viel A, Fornasarig M, Lucci-Cordisco E, Anti M, et al. Genetic testing among high-risk individuals in families with hereditary nonpolyposis colorectal cancer. British Journal of Cancer. 2004;90:882–887. doi: 10.1038/sj.bjc.6601529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Ramsey S, Clarke L, Etzioni R, Higashi M, Berry K, Urban N. Cost-effectiveness of microsatellite instability screening as a method for detecting hereditary nonpolyposis colorectal cancer. Annals of Internal Medicine. 2001;135:577–588. doi: 10.7326/0003-4819-135-8_part_1-200110160-00008. [DOI] [PubMed] [Google Scholar]
  33. Renkonen E, Zhang Y, Lohi H, Salovaara R, Abdel-Rahman WM, Nilbert M, Aittomaki K, Jarvinen HJ, Mecklin JP, Lindblom A, Peltomaki P. Altered expression of MLH1, MSH2, and MSH6 in predisposition to hereditary nonpolyposis colorectal cancer. Journal of Clinical Oncology. 2003;21:3629–3637. doi: 10.1200/JCO.2003.03.181. [DOI] [PubMed] [Google Scholar]
  34. Reyes C, Allen B, Terdiman J, Wilson L. Comparison of selection strategies for genetic testing of patients with hereditary nonpolyposis colorectal carcinoma. Cancer. 2002;95:1848–1856. doi: 10.1002/cncr.10910. [DOI] [PubMed] [Google Scholar]
  35. Ruszkiewicz A, Bennett G, Moore J, Manavis J, Rudzki B, Shen L, Suthers G. Correlation of mismatch repair genes immunohistochemistry and microsatellite instability status in HNPCC-associated tumours. Gastrointestinal Pathology. 2002;34:541–547. doi: 10.1080/0031302021000035965-2. [DOI] [PubMed] [Google Scholar]
  36. Salahshor S, Kressner U, Fischer H, Lindmark G, Glimelius B, Pahlman L, Lindblom A. Microsatellite instability in sporadic colorectal cancer is not an independent prognostic factor. British Journal of Cancer. 1999;81:190–193. doi: 10.1038/sj.bjc.6690676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Salovaara R, Loukola A, Kristo P, Kaariainen H, Ahtola H, Eskelinen M, Harkonen N, Julkunen R, Kangas E, Ojala S, Tulikoura J, et al. Population-based molecular detection of hereditary nonpolyposis colorectal cancer. Journal of Clinical Oncology. 2000;18:2193–2200. doi: 10.1200/JCO.2000.18.11.2193. [DOI] [PubMed] [Google Scholar]
  38. Scartozzi M, Bianchi F, Rosati S, Galizia E, Antolini A, Loretelli C, Piga A, Bearzi I, Cellerino R, Porfiri E. Mutations of hMLH1 and hMSH2 in patients with suspected hereditary nonpolyposis colorectal cancer: correlation with microsatellite instability and abnormalities of mismatch repair protein expression. Journal of Clinical Oncology. 2002;20:1203–1208. doi: 10.1200/JCO.2002.20.5.1203. [DOI] [PubMed] [Google Scholar]
  39. Sophie G, Alain A, Nadine P, Aline M, Catherine S, Marco M, Sabine P, Virginie C, Sylvie M, Aaron B, Dominique S-L. Color bar coding the BRCA1 gene on combed DNA: a useful strategy for detecting large gene rearrangements. Genes, Chromosomes and Cancer. 2001;31:75. doi: 10.1002/gcc.1120. [DOI] [PubMed] [Google Scholar]
  40. Tanner MA, Wong WH. The calculation of posterior distributions by data augmentation. Journal of the American Statistical Association. 1987;82:528–550. [Google Scholar]
  41. Terdiman JP, Gum JR, Jr, Conrad PG, Miller GA, Weinberg V, Crawley SC, Levin TR, Reeves C, Schmitt A, Hepburn M, et al. Efficient detection of hereditary nonpolyposis colorectal cancer gene carriers by screening for tumor microsatellite instability before germline genetic testing. Gastroenterology. 2001;120:21–30. doi: 10.1053/gast.2001.20874. [DOI] [PubMed] [Google Scholar]
  42. Terdiman J, Levin T, Allen B, Gum JJ, Fishbach A, Conrad P, Miller G, Weinberg V, Bachman R, Bergoffen J, et al. Hereditary nonpolyposis colorectal cancer in young colorectal cancer patients: high-risk clinic versus population-based registry. Gastroenterology. 2002;122:940–947. doi: 10.1053/gast.2002.32537. [DOI] [PubMed] [Google Scholar]
  43. Thibodeau S, Bren G, Schaid D. Microsatellite instability in cancer of the proximal colon. Science. 1993;260:816–819. doi: 10.1126/science.8484122. [DOI] [PubMed] [Google Scholar]
  44. Vasen HF, Stormorken A, Menko FH, Nagengast F, Kleibeuker JH, Griffioen G, Taal BG, Moller P, Wijnen JT. MSH2 mutation carriers are at higher risk of cancer than MLH1 mutation carriers: a study of hereditary nonpolyposis colorectal cancer families. Journal of Clinical Oncology. 2001;19:4074–4080. doi: 10.1200/JCO.2001.19.20.4074. [DOI] [PubMed] [Google Scholar]
  45. Wahlberg S, Schmeits J, Thomas G, Loda M, Garber J, Syngal S, Kolodner R, Fox E. Evaluation of microsatellite instability and immunohistochemistry for the prediction of germ-line MSH2 and MLH1 mutations in hereditary nonpolyposis colon cancer families. Cancer Research. 2002;62:3485–3492. [PubMed] [Google Scholar]
  46. Wang T, Lerer I, Gueta Z, Sagi M, Kadouri L, Peretz T, Abeliovich D. A deletion/insertion mutation in the BRCA2 gene in a breast cancer family: a possible role of the Alu-polyA tail in the evolution of the deletion. Genes, Chromosomes and Cancer. 2001;31:91. doi: 10.1002/gcc.1110. [DOI] [PubMed] [Google Scholar]
  47. Wang Y, Friedl W, Lamberti C, Jungck M, Mathiak M, Pagenstecher C, Propping P, Mangold E. HNPCC: frequent occurrence of large genomic deletions in MSH2 and MLH1. International Journal of Cancer. 2003;103:636–641. doi: 10.1002/ijc.10869. [DOI] [PubMed] [Google Scholar]
  48. Yan H, Papadopoulos N, Marra G, Perrera C, Jiricny J, Boland CR, Lynch HT, Chadwick RB, De La Chapelle A, Berg K, et al. Conversion of diploidy haploidy. Nature. 2000;403:723–724. doi: 10.1038/35001659. [DOI] [PubMed] [Google Scholar]

RESOURCES