Skip to main content
eLife logoLink to eLife
. 2019 Nov 7;8:e49138. doi: 10.7554/eLife.49138

Computational and cellular studies reveal structural destabilization and degradation of MLH1 variants in Lynch syndrome

Amanda B Abildgaard 1, Amelie Stein 1,, Sofie V Nielsen 1, Katrine Schultz-Knudsen 1, Elena Papaleo 1,, Amruta Shrikhande 2, Eva R Hoffmann 2, Inge Bernstein 3, Anne-Marie Gerdes 4, Masanobu Takahashi 5, Chikashi Ishioka 5, Kresten Lindorff-Larsen 1,, Rasmus Hartmann-Petersen 1,
Editors: Sarel Jacob Fleishman6, John Kuriyan7
PMCID: PMC6837844  PMID: 31697235

Abstract

Defective mismatch repair leads to increased mutation rates, and germline loss-of-function variants in the repair component MLH1 cause the hereditary cancer predisposition disorder known as Lynch syndrome. Early diagnosis is important, but complicated by many variants being of unknown significance. Here we show that a majority of the disease-linked MLH1 variants we studied are present at reduced cellular levels. We show that destabilized MLH1 variants are targeted for chaperone-assisted proteasomal degradation, resulting also in degradation of co-factors PMS1 and PMS2. In silico saturation mutagenesis and computational predictions of thermodynamic stability of MLH1 missense variants revealed a correlation between structural destabilization, reduced steady-state levels and loss-of-function. Thus, we suggest that loss of stability and cellular degradation is an important mechanism underlying many MLH1 variants in Lynch syndrome. Combined with analyses of conservation, the thermodynamic stability predictions separate disease-linked from benign MLH1 variants, and therefore hold potential for Lynch syndrome diagnostics.

Research organism: Human

Introduction

The DNA mismatch repair (MMR) pathway corrects mismatched base pairs inserted during replication. The MutSα (MSH2-MSH6) heterodimer initiates repair by detecting the mismatch after which the MutLα (MLH1-PMS2) heterodimer promotes the process by generating a nick in the newly synthesized DNA strand, thereby stimulating downstream repair proteins (Jiricny, 2006; Jun et al., 2006). The MMR pathway is phylogenetically highly conserved, emphasizing its importance as a key DNA repair mechanism of the cell (Jiricny, 2013; Sachadyn, 2010). Loss of MMR activity causes genome instability, and can result in both sporadic and inherited cancer, such as Lynch syndrome (LS) (OMIM: #609310), also known as hereditary nonpolyposis colorectal cancer (HNPCC). The predominant consequence of LS is colorectal cancer (CRC), making LS the underlying reason for around 4% of all CRC cases (Aarnio et al., 1999; Hampel et al., 2008; Sijmons and Hofstra, 2016; Vasen et al., 1996; Vasen and de Vos Tot Nederveen Cappel, 2013; Thompson et al., 2014; Møller et al., 2018). Importantly, the cumulative lifetime cancer risk varies considerably between patients and depends on the specific germline mutation in the genes encoding the key mismatch repair proteins MSH2, MSH6, MLH1, and PMS2 (Barrow et al., 2008; Dowty et al., 2013; Dunlop et al., 1997; Lynch et al., 2015; Peltomäki et al., 1993; Plaschke et al., 2004; Sijmons and Hofstra, 2016).

The majority of LS cases result from MLH1 and MSH2 mutations (Peltomäki, 2016), many of which are missense mutations (Heinen, 2010; Palomaki et al., 2009; Peltomäki and Vasen, 1997; Peltomäki, 2016). Evidently, such missense mutations may cause loss-of-function by directly perturbing protein-protein interactions or ablating enzymatic activity. Many missense mutations, however, cause loss-of-function by inducing structural destabilization of the protein (Stein et al., 2019), which in turn may trigger protein misfolding and degradation by the ubiquitin-proteasome system (UPS) (Kampmeyer et al., 2017; Nielsen et al., 2014; Kriegenburg et al., 2014). As a result, the cellular amount of a missense protein may be reduced to an insufficient level, which can ultimately cause disease (Ahner et al., 2007; Casadio et al., 2011; Matreyek et al., 2018; Nielsen et al., 2017), as we and others have previously shown for LS-linked variants of MSH2 (Gammie et al., 2007; Arlow et al., 2013; Nielsen et al., 2017).

In this study, we investigated whether this is the case for LS-linked variants of the MLH1 protein. We determined cellular abundance for 69 missense variants, and show that several destabilized LS-linked MLH1 variants are targeted for chaperone-assisted proteasomal degradation and are therefore present at reduced cellular amounts. In turn, this lower amount of MLH1 results in degradation of the MLH1-binding proteins PMS1 and PMS2. In silico saturation mutagenesis and computational prediction of the thermodynamic stability of all possible MLH1 single site missense variants revealed a correlation between the structural destabilization of MLH1, reduced steady-state levels and the loss-of-function phenotype. Accordingly, the thermodynamic stability predictions accurately separate disease-linked MLH1 missense mutations from benign MLH1 variants (area under the curve is 0.82 in a receiver-operating characteristic analysis), and therefore hold potential for classification of MLH1 variants of unknown consequence, and hence for LS diagnostics. Further, by suggesting a mechanistic origin for many LS-causing MLH1 missense variants our studies provide a starting point for development of novel therapies.

Results

In silico saturation mutagenesis and thermodynamic stability predictions

Most missense proteins are less structurally stable than the wild-type protein (Tokuriki and Tawfik, 2009), and individual missense variants may thus lead to increased degradation and insufficient amounts of protein. To comprehensively assess this effect for MLH1, we performed energy calculations based on crystal structures of MLH1 to predict the consequences of missense mutations in MLH1 on the thermodynamic stability of the MLH1 protein structure. Full-length human MLH1 is a 756 residue protein which forms two folded units, an N-terminal domain (residues 7–315) and a C-terminal domain (residues 502–756) (Mitchell et al., 2019) separated by a flexible and intrinsically disordered linker (Figure 1A). Using the structures (Wu et al., 2015) of the two domains (PDB IDs 4P7A and 3RBN) (Figure 1A), we performed in silico saturation mutagenesis, introducing all possible single site amino acid substitutions into the wild-type MLH1 sequence at the 564 structurally resolved residues. We then applied the FoldX energy function (Schymkowitz et al., 2005) to estimate the change in thermodynamic folding stability compared to the wild-type MLH1 protein (ΔΔG) (Figure 1BC). Negative values indicate mutations that are predicted to stabilize MLH1, while positive values indicate that the mutations may destabilize the protein. Thus, those variants with ΔΔG predictions > 0 kcal/mol are expected to have a larger population of fully or partially unfolded structures that, in turn, may be prone to protein quality control (PQC)-mediated degradation. Our saturation mutagenesis dataset comprises 19 (amino acids, excluding the wild-type residue) * 564 (residues resolved in the N- and C-terminal structures)=10,716 different MLH1 variants, thus covering 75% of all possible missense variants in MLH1. We illustrate a subsection as a heat map in Figure 1D (the entire dataset is included in the supplemental material, Supplementary file 1). The predictions reveal that 34% of the substitutions are expected to change the stability of MLH1 by less than 0.7 kcal/mol, which is the typical error of the predictions (Guerois et al., 2002) (Figure 1E). A comparable fraction (32%) are, however, predicted to cause a substantial destabilization (>2.5 kcal/mol) of the MLH1 protein (Figure 1E).

Figure 1. MLH1 structural stability predictions.

Figure 1.

(A) Structure of MLH1 (PDB IDs 4P7A and 3RBN). Positions of variants tested in this work are highlighted with colored spheres, indicating the predicted ΔΔG (<0.5 kcal/mol, purple,<1, cyan,<3.5 green,<7, yellow,<12, orange,>12, red). (B) Many disease-linked MLH1 missense variants (red) are structurally destabilized and therefore, compared to wild-type MLH1 (green), more likely to unfold. (C) The free energy of the folded conformation of a destabilized missense variant (red) is closer to that of the fully unfolded state. The employed stability calculations predict the difference of the free energy of unfolding (ΔΔG) between a missense variant (red) and wild-type MLH1 (green). (D) Excerpt of the in silico saturation mutagenesis map (full dataset provided in Supplementary file 1). (E) Distribution of all predicted ΔΔGs from saturation mutagenesis. The peak at 15 kcal/mol contains all variants with ΔΔG values greater than this value.

Thermodynamic stability calculations predict severely reduced MLH1 steady-state levels

To test whether the in silico stability predictions are predictive of cellular stability, abundancy, and function, we selected 69 naturally occurring MLH1 missense variants with predicted ΔΔGs spanning from −1.6 to >15 kcal/mol (Table 1). We further ensured that the selected mutations were distributed throughout the MLH1 gene, thus probing the entire structured parts of the MLH1 protein (Figure 1A). Then, the variants were introduced into MLH1-negative HCT116 cells and analyzed by automated immunofluorescence microscopy using a polyclonal antiserum to MLH1.

Table 1. Characteristics of the selected naturally occurring MLH1 variants.

Table 1—source data 1. MLH1 variants tested in this work.
DOI: 10.7554/eLife.49138.004
Variant* Steady-state level (% of WT) FoldX ΔΔG (kcal/mol) ClinVar annotation¤ DME#
E23D 90.0 0.49 VUS 3
I25T 51.3 2.40 VUS 3
A29S 109.6 2.06 (likely) pathogenic 3
M35R 72.1 3.52 (likely) pathogenic 0
I36S 63.2 4.08 VUS NA
N38D 63.5 1.61 VUS 2
S44F 9.8 >15 (likely) pathogenic 0
S44A 103.2 −1.35 VUS 3
G54E 15.3 >15 VUS 1
N64S 96.3 2.16 VUS 1
G67R 35.0 >15 (likely) pathogenic 0
G67W 14.5 >15 (likely) pathogenic 0
I68N 62.5 2.22 (likely) pathogenic 0
R69K 104.9 −0.18 VUS 3
C77Y 61.0 6.57 (likely) pathogenic 2
F80V 72.3 2.22 (likely) pathogenic 1
T82I 100.0 0.54 (likely) pathogenic 2
R100P 46.4 −1.25 (likely) pathogenic 2
E102D 97.9 0.34 (likely) pathogenic 3
A111V 68.8 4.96 (likely) pathogenic 0
T117M 32.8 7.14 (likely) pathogenic 0
T117R 46.2 12.70 (likely) pathogenic 0
A128P 62.8 2.40 (likely) pathogenic 0
D132H 110.2 −0.30 (likely) benign 3
A160V 107.3 0.38 VUS 3
R182G 87.9 2.60 (likely) pathogenic 3
S193P 100.7 2.73 VUS 0
V213M 103.9 −0.81 (likely) benign 3
R217C 74.1 1.06 VUS 2
I219V 112.5 0.66 (likely) benign 3
I219L 121.9 −0.05 (likely) benign 3
R226L 63.3 0.27 (likely) pathogenic 1
G244V 32.0 >15 VUS 0
G244D 38.8 >15 (likely) pathogenic 0
H264R 117.6 −0.60 VUS 3
R265C 57.2 0.28 (likely) pathogenic 2
R265H 81.4 0.04 VUS 3
E268G 81.1 0.81 (likely) benign 2
L272V 80.0 1.95 VUS 3
A281V 82.5 0.87 (likely) pathogenic 3
K286Q 101.8 0.28 VUS 2
S295G 88.6 0.13 (likely) pathogenic 2
H329P 54.1 5.67 (likely) pathogenic 1
V506A 62.1 2.18 VUS 2
Q542L 110.2 −1.56 VUS 3
L549P 63.7 5.17 VUS 0
I565F 65.9 9.64 VUS 0
L574P 34.4 11.97 (likely) pathogenic 0
E578G 103.2 0.45 (likely) benign 2
L582V 100.0 1.93 VUS 3
L588P 88.6 3.30 VUS 1
K618A 80.4 0.61 VUS 1
K618T 106.5 0.09 (likely) benign 0
L622H 61.1 4.97 (likely) pathogenic 0
P640T 61.1 3.78 VUS 0
L653R 66.1 3.22 (likely) pathogenic 0
I655V 89.1 1.03 (likely) benign 3
I655T 71.1 1.29 VUS 3
R659P 69.0 6.93 (likely) pathogenic 0
R659Q 84.9 2.41 VUS 2
T662P 72.7 5.23 (likely) pathogenic 0
E663G 72.0 −0.23 VUS 3
E663D 97.2 0.66 (likely) pathogenic 2
L676R 36.8 5.12 VUS 0
R687W 115.4 1.12 (likely) pathogenic 0
Q689R 71.0 −0.54 (likely) benign 3
V716M 100.1 1.41 (likely) benign 1
H718Y 73.7 0.16 (likely) benign 2
K751R 82.1 −0.23 (likely) benign 3

*:boldfaced variants studied in detail; ¤: VUS: variant of unknown significance; #DME: dominant mutator effect See also source data (Table 1—source data 1).

As expected, wild-type MLH1 localized primarily to the nucleus (Figure 2A). This localization pattern was also observed for all the MLH1 variants, and we did not detect any protein aggregates. We did, however, observe large variations in the fluorescence intensity, and consequently the steady-state protein levels, between the different MLH1 variants (Figure 2A). To quantify these differences, we first excluded the non-transfected cells using the intensity in the non-transfected control. Then we measured the total intensity of the MLH1 fluorescence in each cell and normalized to the intensity for wild-type MLH1. This analysis revealed up to 12-fold difference in intensity between the variants showing sizable differences in abundance.

Figure 2. Steady-state levels of MLH1 variants correlate with structural stability predictions.

(A) Example of the immunofluorescence imaging of HCT116 cells using antibodies to MLH1. Hoechst staining was used to mark the nucleus. Note the reduced steady-state levels of the G67R MLH1 variant compared to wild-type MLH1. (B) The total fluorescent intensity for each of the 69 different MLH1 variants was determined after excluding the non-transfected cells and normalizing the intensities to that for wild-type MLH1. The intensities were then plotted vs. the predicted ΔΔG values. Between 200 and 1,000 cells were included for each quantification. The error bars indicate the standard error of the mean (n = 5 experiments). Each variant is color-coded according to the ClinVar disease category. (C) Distribution of steady-state levels by DME category – 0 is loss-of-function in all assays by Takahashi et al. (2007), 3 represents function in all these assays (for details see the Materials and Methods). Raincloud plot visualization as described in Allen et al. (2018). Colored surface, smoothed density estimate. Gray dots represent means within each DME category, with bars for standard error. (D) Distribution of FoldX ΔΔGs across DME categories (as in (C)). (E) FoldX ΔΔGs for all variants tested in this work, indicating their position in the MLH1 sequence. As elsewhere, values above 15 kcal/mol were here set to this value.

Figure 2.

Figure 2—figure supplement 1. Solubility of wild-type MLH1 at a range of temperatures.

Figure 2—figure supplement 1.

(A) Transfected cells were lysed by sonication and incubated for 30 min. at the indicated temperatures. Then the lysates were separated into supernatant and pellet fractions by centrifugation. Western blotting using antibodies to MLH1 was employed to determine the amount of MLH1 in the fractions. Ponceau S staining and blotting with antibodies to GAPDH served as controls. (B) Quantification of blots as shown in panel (A) showing the amount of soluble protein (supernatant) normalized to total (supernatant + pellet). The total protein quantification is based on the full lanes of the Ponceau S stainings. The error bars indicate the standard deviation (n = 3).
Figure 2—figure supplement 2. Variants within the central disordered region.

Figure 2—figure supplement 2.

(A) ClinVar benign and likely benign variants (blue) are found throughout the protein, while pathogenic and likely pathogenic variants (red) are enriched in the structured domains. (B) The variants previously tested for function (Takahashi et al., 2007) are distributed throughout the MLH1 protein. Similar to the observations from ClinVar, functional variants (DME score 3) are seen throughout the protein, while loss-of-function in multiple assays (DME scores.

To examine whether these variations in cellular abundance is correlated with thermodynamic stability, we plotted the normalized values against the predicted structural stabilities (ΔΔGs). This analysis indeed reveals that those MLH1 variants that were predicted to be structurally destabilized (high ΔΔGs) also displayed reduced steady-state levels (Figure 2B), indicating that the predicted structural destabilization and low steady-state MLH1 levels go hand in hand. Almost all (30 out of 31) variants with steady state >75% have a ΔΔG < 3 kcal/mol, and similarly most (22/23) with ΔΔG > 3 kcal/mol have steady state levels < 75%. A destabilization of 3 kcal/mol is a relatively low threshold, but consistent with previous observations of other unrelated proteins (Nielsen et al., 2017; Scheller et al., 2019; Bullock et al., 2000). Such low stability thresholds may indicate that local unfolding (discussed below) plays an important role in the recognition and degradation of pathogenic variants, and/or reflect that wild-type MLH1 is a marginally stable protein. To test this, cell lysates were incubated for 30 min. at a range of temperatures. Then the lysates were separated into soluble (supernatant) and insoluble (pellet) fractions by centrifugation, and the amount of soluble MLH1 was determined by blotting (Figure 2—figure supplement 1). Comparison with abundant cellular proteins stained by Ponceau S and blotting for GAPDH revealed that wild-type MLH1 appears somewhat less thermostable than these other proteins (Figure 2—figure supplement 1), supporting that MLH1 may indeed be marginally stable.

Given that decreased levels of MLH1 protein could cause loss of MMR function, we also examined whether cellular abundancy correlated with pathogenicity. Of the 69 variants that we studied, 29 are classified as pathogenic or likely pathogenic in the ClinVar database (Landrum et al., 2018), whereas 12 are (likely) benign, and 28 are variants of unknown significance. We found that all (likely) benign variants appeared stable and had steady-state levels > 70% (Figure 2B). Conversely, 18 out of the 29 pathogenic variants (62%) had steady-state levels < 70% (Figure 2B), suggesting that protein destabilization is a common feature for more than half of the MLH1 variants linked to LS, and that predictions of stability might be useful for classifying such variants (see below).

Next, we analyzed how the measured steady-state levels and the stability predictions correlated with previously published in vivo functional data on MLH1 (Takahashi et al., 2007). In that study, MLH1 function was tested in a number of assays and ranked from 0 (no function) to 3 (full function) based on their dominant mutator effect (DME) when human MLH1 variants are expressed in yeast cells (Shimodaira et al., 1998). Our comparison revealed that variants with reduced steady-state levels and high risk of destabilization in general are less likely to be functional (Figure 2CD), which again indicates that the reduced structural stability may be linked to the observed loss-of-function phenotype. For example, while 22/23 variants with DME = 3 have steady-state levels > 70%, only five of the 23 variants with DME = 0 have this high amount of protein. These functional differences are also reflected in the correlation between loss of stability (ΔΔG) and function (Figure 2D). In particular none of the fully functional proteins (DME = 3) are predicted to be destabilized by more than 3 kcal/mol, whereas 18/23 variants with DME = 0 are predicted to be destabilized by at least this amount. The unstable and non-functional variants do not appear structurally clustered to a particular site within the protein, and are found throughout both the N- and C-terminal domains of MLH1 (Figure 2E). These affected positions are, however, closer to one another than random pairs in the respective domain (average pairwise distances of 17.3 Å vs. 24.1 Å in the N-terminal domain, and 14.7 Å vs. 21.0 Å in the C-terminal domain). In contrast, the linker region is depleted in detrimental variants but not in benign variants, hence functional (Takahashi et al., 2007) and benign (Landrum et al., 2018) variants are found both in structured and unstructured regions (Figure 2—figure supplement 2).

Proteasomal degradation causes reduced steady-state levels of destabilized MLH1 variants

Next, we analyzed why the steady-state levels of certain MLH1 variants were reduced. For this purpose, we carefully selected eight of the 69 missense MLH1 variants for further in-depth analyses (E23D, G67R, R100P, T117M, I219V, R265C, K618A, and R659P). As previously, these variants were chosen so the mutations were distributed across the MLH1 gene, and to represent a broad range of predicted structural stabilities (ΔΔGs) as well as different pathogenicity annotations from the ClinVar database (Table 1).

The variants were transiently transfected into HCT116 cells. Indeed, six of the variants (G67R, R100P, T117M, R265C, K618A, R659P) displayed reduced steady-state levels, while wild type-like levels were observed for two variants (E23D, I219V), in agreement with the fluorescence-based observations (Figure 3AB). Co-transfection with a GFP-expression vector revealed that this was not caused by differences between transfection efficiencies since the amount of GFP was unchanged (Figure 3A).

Figure 3. Many MLH1 variants are degraded by the proteasome.

(A) HCT116 cells transfected with the indicated variants were analyzed by blotting with antibodies to MLH1. Co-transfection with a plasmid expressing GFP was included to test the transfection efficiencies between the MLH1 variants. β-actin served as a loading control. (B) Quantification of blots as in (A) normalized to the steady-state level of wild-type (WT) MLH1. The error bars show the standard deviation (n = 3). (C) MLH1-transfected HCT116 cells were treated with 25 µg/mL cycloheximide (CHX) for 0, 4, 8 or 12 hr, and lysates were analyzed by blotting using antibodies to MLH1. β-actin was used as a loading control. (D) Quantification of blots as in panel (C), normalized to the steady-state levels at t = 0 hr. The error bars indicate the standard deviation (n = 3). (E) Western blotting with antibodies to MLH1 of whole cell lysates from transfected cells either untreated or treated for 16 hr with 10 µM bortezomib (BZ). Blotting for β-actin was included as a loading control.

Figure 3.

Figure 3—figure supplement 1. Solubility of selected MLH1 variants.

Figure 3—figure supplement 1.

Transfected cells, either untreated (-) or treated for 16 hr with 10 μM bortezomib (+BZ) were lysed by sonication and immediately separated into supernatant (S) and pellet (P) fractions by centrifugation. Western blotting using antibodies to MLH1 was employed to determine the amount of MLH1 in the fractions. Blotting with antibodies to GAPDH and PMCA served as loading controls for the soluble and insoluble fraction, respectively.

Next, in order to investigate if the reduced MLH1 levels were caused by degradation, we monitored the amounts of MLH1 over time in cultures treated with the translation inhibitor cycloheximide (CHX). This revealed that variants with reduced steady-state levels were indeed rapidly degraded (half-life between 3 and 12 hr), whereas wild-type MLH1 and the other variants appeared stable (estimated half-life >>12 hr) (Figure 3CD). Treating the cells with the proteasome-inhibitor bortezomib (BZ) significantly increased the steady-state levels of the unstable variants, whereas the levels of the wild-type and stable MLH1 variants were unaffected (Figure 3E). Separating cell lysates into soluble and insoluble fractions by centrifugation revealed that the destabilized MLH1 variants appeared more insoluble than the stable MLH1 variants, and bortezomib treatment mainly caused an increase in the amount of insoluble MLH1 (Figure 3—figure supplement 1). Based on these results, we conclude that certain missense MLH1 variants are structurally destabilized, which in turn leads to proteasomal degradation and reduced steady-state protein levels, and a loss-of-function phenotype as scored by the DME.

PMS1 and PMS2 are destabilized when MLH1 is degraded

In order to carry out its role in MMR, it is essential that MLH1 associates with PMS2 to form the active MutLα complex (Li and Modrich, 1995; Räschle et al., 2002; Tomer et al., 2002). Additionally, MLH1 can bind the PMS1 protein and form the MutLβ complex, the function of which remains unknown (Cannavo et al., 2007; Kondo et al., 2001; Wu et al., 2003).

As typical for ‘orphan’ proteins lacking their binding partners (Yanagitani et al., 2017; McShane et al., 2016), PMS2 has been found to be unstable in the absence of MLH1 (Hinrichsen et al., 2017; Lynch et al., 2015; Mohd et al., 2006; Perera and Bapat, 2008). To test the mechanism underlying this instability, we measured the stability of endogenous PMS1 and PMS2 in HCT116 cells with or without introducing wild-type MLH1. In cells treated with cycloheximide, the absence of MLH1 led to rapid degradation of both PMS1 and PMS2 (t½~3–5 hr). However, when wild-type MLH1 was present, PMS1 and PMS2 were dramatically stabilized (t½~12 hr) (Figure 4AB). Treating untransfected HCT116 cells with bortezomib led to an increase in the amount of endogenous PMS1 and PMS2, showing that their degradation is proteasome-dependent (Figure 4C). The stabilizing effect of MLH1 on PMS1 and PMS2 was also observed for the stable MLH1 variants (Figure 4DE). Accordingly, we found that the MLH1 levels correlated with the PMS1 and PMS2 levels (Figure 4F).

Figure 4. Stable MLH1 variants increase steady-state levels of PMS1 and PMS2.

Figure 4.

(A) The levels of endogenous PMS1 and PMS2 were determined by blotting of whole-cell lysates of HCT116 cells transfected with either empty vector or with wild-type MLH1 and treated with 25 µg/mL cycloheximide (CHX) for 0, 4, 8 or 12 hr. The antibodies used were to PMS1 and PMS2, and as a control to MLH1. β-actin served as loading control. (B) Quantification of blots as in panel (A) normalized to protein levels at 0 hr. The error bars indicate the standard deviation (n = 3). (C) The levels of endogenous MLH1, PMS1 and PMS2 were compared by blotting of cell lysates of HCT116 cells either untreated, or treated with cycloheximide (CHX) or with bortezomib (BZ) and CHX. β-actin served as loading control. (D) The levels of endogenous PMS1 and PMS2 and transfected MLH1 were compared by western blotting using antibodies to PMS1, PMS2 and MLH1. β-actin served as loading control. (E) Quantification of blots as in panel (C) normalized to the level of endogenous PMS1 (grey) or PMS2 (red) in untransfected HCT116 cells. The error bars show the standard deviation (n = 3). (F) Plotting the levels of the MLH1 variants vs. the levels of endogenous PMS1 (grey) and PMS2 (red). The error bars show the standard deviation (n = 3). (G) The levels of MLH1 and YFP-tagged PMS2 were analyzed by SDS-PAGE and blotting of whole-cell lysates of HCT116 cells transfected with the indicated expression plasmids. β-actin was included as loading control. (H) Co-transfected PMS2-YFP was immunoprecipitated (IP) using GFP-trap beads, and the precipitated material was analyzed by electrophoresis and blotting. Bortezomib was added to all cultures 16 hr prior to cell lysis to ensure ample amounts of the unstable MLH1 variants.

Collectively, these results suggest that either there is not enough MLH1 variant in the cells to form complexes with PMS1 and PMS2 or that only stable MLH1 variants are able to bind PMS1 and PMS2, and that this binding in turn protects them from proteasomal degradation. To test these possibilities, we proceeded to assess the PMS2-binding activity of the selected MLH1 variants. To this end, HCT116 cells were co-transfected with both MLH1 and YFP-tagged PMS2. Importantly, the overexpressed YFP-PMS2 protein did not affect the MLH1 level and appeared stable in the absence of MLH1 (Figure 4G), allowing us to directly compare the PMS2-binding activity of the selected MLH1 variants. To ensure that the cells contained sufficient levels of the unstable MLH1 variants, the cells were treated with bortezomib prior to lysis. We found that the wild-type and stable MLH1 variants (E23D, I219V) were efficiently co-precipitated with the YFP-tagged PMS2 (Figure 4H). Several of the unstable MLH1 variants did not display appreciable affinity for PMS2, even after blocking their degradation, suggesting that these MLH1 variants are structurally perturbed or unfolded to an extent that disables complex formation with PMS2. Interestingly, the K618A variant displayed a strong interaction with PMS2 (Figure 4H), indicating that this unstable variant retains the ability to bind PMS2, and therefore potentially engage in mismatch repair. We note that this result is supported by the K618A variant’s ability to stabilize PMS2 (Figure 4E) and the distal positioning of K618 to the PMS2 binding site (Gueneau et al., 2013).

HSP70 is required for degradation of some destabilized MLH1 variants

Since structurally destabilized proteins are prone to expose hydrophobic regions that are normally buried in the native protein conformation, molecular chaperones, including the prominent HSP70 and HSP90 enzymes, often engage such proteins in an attempt to refold them or to target them for proteasomal degradation (Arndt et al., 2007). Indeed, both HSP70 and HSP90 are known to interact with many missense variants though with different specificities and cellular consequences (Karras et al., 2017), and a previous study has linked HSP90 to MLH1 function (Fedier et al., 2005).

To test the involvement of molecular chaperones in degradation of the selected MLH1 variants, we analyzed their interaction with HSP70 and HSP90 by co-immunoprecipitation and western blotting. Similar to above, the cells were treated with bortezomib to ensure detectable amounts of MLH1. Interestingly, four of the destabilized MLH1 variants (G67R, R100P, T117M and R265C) displayed a strong interaction with HSP70, up to approximately 7-fold greater compared to wild-type MLH1 (Figure 5AB). Conversely, in the case of HSP90 we observed binding to all the tested MLH1 variants, including the wild-type (Figure 5CD), suggesting that HSP90 may be involved in the de novo folding of wild-type MLH1 or assembly of MLH1-containing MMR complexes, while HSP70 may be involved in regulation of certain destabilized MLH1 variants, potentially playing a role in their degradation.

Figure 5. Molecular chaperones play a role in the proteasomal degradation of MLH1.

Figure 5.

(A) Co-transfected HSP70-myc was immunoprecipitated (IP) using myc-trap beads and analyzed by blotting with antibodies to the myc-tag (HSP70) and MLH1. Bortezomib was added to all cultures 8 hr prior to cell lysis to ensure ample amounts of the unstable MLH1 variants. (B) Quantification of blots as shown in panel (A) normalized to level of wild-type MLH1. The error bars indicate the standard deviation (n = 3). (C) Co-transfected HSP90-HA was immunoprecipitated (IP) with anti-HA resin, and the precipitated material analyzed by electrophoresis and western blotting using antibodies to the HA-tag (HSP90) and MLH1. As above, bortezomib was added to all cultures prior to cell lysis. (D) Quantification of blots as in panel (C) normalized to amount of precipitated wild-type MLH1. The error bars show the standard deviation (n = 3). (E) Western blotting using antibodies to MLH1 of whole-cell lysates from transfected cells treated with 5 µM YM01 for 24 hr as indicated. (F) Quantification of blots as shown in panel (E) normalized to level of MLH1 without YM01. The error bars indicate the standard deviation (n = 3). (G) Western blotting using antibodies to MLH1 of whole-cell lysates from transfected cells treated 1 µM geldanamycin (GA) for 24 hr. (H) Quantification of blots as shown in panel (G) normalized to level of MLH1 without GA. The error bars indicate the standard deviation (n = 3).

To test this hypothesis, we measured the steady-state levels of the MLH1 variants following inhibition of HSP70 and HSP90, respectively. We treated cells with the HSP70 inhibitor YM01 or the HSP90 inhibitor geldanamycin (GA) and compared with the MLH1 levels in untreated cells. In comparison to the HSP70 binding, the effect of YM01 appeared more subtle. The levels of three variants (G67R, R100P, T117M) were, however, increased (Figure 5EF), and all three were also found to bind HSP70 (Figure 5A) and had the lowest steady-state levels of the eight tested variants prior to HSP70 inhibition. Together, these results suggest that HSP70 actively partakes in detecting and/or directing certain destabilized MLH1 variants for degradation. We did not observe any effect of HSP90 inhibition on the MLH1 protein levels for any of the tested variants (Figure 5GH).

Structural stability calculations for predicting pathogenic mutations

Our results show that unstable protein variants are likely to be rapidly degraded, suggesting that predictions of changed thermodynamic stability of missense MLH1 variants could be used to estimate whether a particular MLH1 missense variant is pathogenic or not. In comparison with the sequence-based tools (e.g. PolyPhen2, PROVEAN and REVEL) that are currently employed in the clinic (Adzhubei et al., 2010; Choi and Chan, 2015), the FoldX energy predictions provide an orthogonal structure-based and sequence-conservation-independent prediction of whether a mutation is likely to be pathogenic. Unlike most variant consequence predictors, FoldX was not trained on whether mutations were benign or pathogenic, but solely on biophysical stability measurements (Guerois et al., 2002). This considerably reduces the risk of overfitting to known pathogenic variants. More importantly, because of the mechanistic link to protein stability, FoldX predictions enable insights into why a particular mutation is problematic (Kiel and Serrano, 2014; Kiel et al., 2016; Pey et al., 2007; Nielsen et al., 2017; Stein et al., 2019).

As a first test for utilizing the biophysical calculations, we analyzed the predicted protein stabilities of MLH1 variants reported in the >140.000 exomes available in the Genome Aggregation Database (gnomAD) (Lek et al., 2016; Karczewski et al., 2019). Gratifyingly, this revealed that those variants reported to occur at a high frequency in the population all displayed low ΔΔG values (Figure 6A), suggesting that these MLH1 proteins are stable. Accordingly, with only a few exceptions, the most common MLH1 alleles reported in gnomAD also appeared functional (high DME scores) (Figure 6A). To further test the performance of the structural stability calculations for identifying pathogenic MLH1 variants, we then compared the ΔΔG values for ClinVar-annotated MLH1 variants. This revealed that the benign MLH1 variants all appeared structurally stable, while many pathogenic variants appeared destabilized (Figure 6B). For example, in our dataset 15 of the 28 pathogenic variants (54%) for which we could calculate a stability change have ΔΔG > 3 kcal/mol and 20 (71%) have ΔΔG > 2 kcal/mol, whereas none of the 11 benign variants have ΔΔG > 1.5 kcal/mol. Applying this calculation to all MLH1 missense variants in ClinVar revealed similar trends, with 55/95 (57%) of the pathogenic variants having ΔΔG > 3 kcal/mol, while only 2/21 benign variants have ΔΔG > 1.5 kcal/mol.

Figure 6. Assessing stability calculations for predicting pathogenicity.

(A) ‘Fishtail plot’ of ΔΔG-values vs. allele frequencies for all variants listed in gnomAD (gray), as well as those analyzed by Takahashi et al. (2007); the latter are color-coded by DME. Note that the leftmost group of colored dots are variants that have been reported in patients, but are not recorded in gnomAD (thus their allele frequency in gnomAD is zero). Variants with common to intermediate frequencies are all predicted to be stable, while some rare variants are predicted to be destabilized. ΔΔGs for gnomAD variants are provided as source data (Figure 6—source data 1), those for variants characterized by Takahashi et al. (2007) in Table 1 and source data (Table 1—source data 1). (B) FoldX ΔΔG for benign (blue), likely benign (cyan), likely pathogenic (orange), and pathogenic (red) variants that are reported in ClinVar with ‘at least one star’ curation. The whiskers represent the mean and standard error of the mean. (C) Evolutionary sequence energies for ClinVar-reported variants, color scheme as in (B). The whiskers represent the mean and standard error of the mean. (D) Landscape of variant tolerance by combination of changes in protein stability (x axis) and evolutionary sequence energies (y axis), such that the upper right corner indicates most likely detrimental variants, while those in the lower left corner are predicted stable and observed in MLH1 homologs. The green background density illustrates the distribution of all variants listed in gnomAD. The combination of metrics captures most non-functional variants (DME scores 0 or 1). Outliers are discussed in the main text. (E) Logistic regression model of FoldX ΔΔGs and evolutionary sequence energies. Pathogenic variants in red, benign in blue. Dot shape indicates whether pathogenicity of the respective variant was correctly predicted by a regression model trained on all but this data point (‘jackknife’, TP, true positives, FN, false negatives, FP, false positives, TN, true negatives). Parameters for a model trained on the full dataset are: FoldX ΔΔG weight 0.52, evolutionary sequence energy weight 3.50, intercept −1.55. (F) ROC curves for logistic regression model, FoldX ΔΔGs, evolutionary sequence energies, and the ensemble-predictor REVEL to assess their performance in separating benign from pathogenic variants. TPR, true positive rate. FPR, false positive rate. Standard deviations in AUC were determined by performing 100 ROC analyses on randomly sampled but balanced subsets, so that there are equal numbers of positive and negative cases. (G) Integrating potential effects these variants may have on splicing in the genomic context. Purple squares indicate pathogenic variants that are predicted to affect splicing (SpliceAI, threshold 0.5). No benign variants are predicted to affect splicing. Mapping to genomic loci (Yates et al., 2015) and SpliceAI Scores for ClinVar entries used in this work are provided as source data (Figure 6—source data 2).

Figure 6—source data 1. Data for gnomAD variants.
DOI: 10.7554/eLife.49138.016
Figure 6—source data 2. Data for ClinVar variants.
DOI: 10.7554/eLife.49138.017

Figure 6.

Figure 6—figure supplement 1. Landscape of ClinVar MLH1 variant tolerance.

Figure 6—figure supplement 1.

Landscape of variant tolerance by combination of changes in protein stability (x axis) and evolutionary sequence energies (y axis), such that the upper right corner indicates most likely detrimental variants, while those in the lower left corner are predicted stable and observed in MLH1 homologs. The green background density illustrates the distribution of all variants listed in gnomAD. Outliers are discussed in the main text.
Figure 6—figure supplement 2. Positioning of selected variants near the active site in the N-terminal domain.

Figure 6—figure supplement 2.

The MLH1 structure (PDB: 4P7A) (Wu et al., 2015) with highlighted stable loss-of-function MLH1 variants (M35R, N64S, F80V, S193P) as spheres, as well as ADP. The domain is colored in a rainbow color scheme, with blue at the N-terminus and red at the C-terminal end of the N-terminal domain (MLH1 sequence position ~300); sidechain and ligand oxygen atoms are red, nitrogens blue, ligand carbons gray. Three of the stable loss-of-function positions (M35R, N64S, F80V) are very close to the ligand. Variation at these sites may thus interfere with ligand binding, which could explain why they lead to loss of function despite wild-type-like cellular protein levels.
Figure 6—figure supplement 3. ROC curves for variant consequence predictors tested in this work.

Figure 6—figure supplement 3.

The indicated predictors were used to score all benign, likely benign, likely pathogenic and pathogenic missense variants reported in ClinVar (Aug 2018). We merged benign and likely benign variants into the negative control set and pathogenic and likely pathogenic into the positive set and generated ROC curves for each predictor, assessing their power in separating benign from pathogenic variants. In addition to the individual predictors, we also assessed a logistic regression model combining FoldX ΔΔG and Gremlin evolutionary sequence energy (E~). AUC, area under the curve. th, threshold (units specific to the respective predictor). Error bars indicate standard deviation from 100 randomly sub-sampled balanced ROC curves. See also Figure 6E in the main manuscript.

Overall, while many pathogenic variants are severely destabilized, a subset of these are predicted to be as stable as non-pathogenic variants (e.g. 7/28 have predicted ΔΔG < 1.0 kcal/mol). This observation could be explained for example by inaccuracies of our stability calculations or by loss of function via other mechanisms such as direct loss of enzymatic activity, post-translational modifications or protein-protein interactions (Wagih et al., 2018). Thus, as a separate method for predicting the biological consequences of mutations, we explored if sequence analysis of the MLH1 protein family across evolution would reveal differences in selective pressure between benign and pathogenic variants. We performed an analysis of a multiple sequence alignment of MLH1 homologs, which considers both conservation at individual sites, but also non-trivial co-evolution between pairs of residues (Balakrishnan et al., 2011; Stein et al., 2019). Turning this data into a statistical model allowed us to score all possible missense MLH1 variants. As this statistical sequence model is based on homologous sequences shaped by evolutionary pressures, it is expected to capture which residues, and pairs of residues, are tolerated (Balakrishnan et al., 2011). As opposed to stability calculations via for example FoldX, this approach is not directly linked to an underlying mechanistic model. Thus, we generally expect destabilizing residues to be recognized as detrimental by both FoldX and the evolutionary sequence energies, while variants in functionally active sites might only be recognized by the latter, if they do not affect protein stability (Stein et al., 2019). On the other hand, stability calculations could capture effects specific to human MLH1 that are more difficult to disentangle through the sequence analyses. In our implementation, low scores indicate mutations that during evolution appear tolerated, while high scores mark amino acid substitutions that are rare and therefore more likely to be detrimental to protein structure and/or function. Indeed, the average sequence-based score for the benign variants is lower (variations more likely to be tolerated) than the average for the ClinVar-curated pathogenic variants (Figure 6C). The full matrix of evolutionary sequence energies is included in the supplemental material (Supplementary file 2).

Next, we compared the structure-based stability calculations and evolutionary sequence energies in a two-dimensional landscape of variant tolerance (Figure 6D), which we find largely agrees with the functional classification by Takahashi et al. (2007). There are, however, three variants with low evolutionary sequence energies (typically indicating tolerance), but predicted and experimentally-confirmed to be destabilized relative to wild-type MLH1 (T662P, I565F, G244V). Further, a number of stable variants have high DME scores (indicating wild-type-like function), but also high evolutionary sequence energies, indicating likely loss of function, and indeed several of these are classified as pathogenic in ClinVar (Figure 6—figure supplements 1 and 2). One possible explanation for these discrepancies is a different sensitivity in the employed yeast assays (Takahashi et al., 2007), that is these variants may be sufficiently functional under assay conditions, but their impaired function relative to wild-type MLH1 may nevertheless render them pathogenic in human variant carriers.

To exploit the complementary nature that ΔΔG and evolutionary sequence energies display for a subset of the variants, we applied logistic regression to combine these two metrics. In a jackknife test (to avoid overfitting) we found that 99 of 116 (85%) ClinVar (Landrum et al., 2018) missense variants were classified correctly by the regression model (Figure 6E).

To compare the capability of the above-described evolutionary sequence energies, FoldX, our regression model, and more traditional sequence-based methods (PolyPhen2, PROVEAN and REVEL) in separating pathogenic and non-pathogenic variants, we applied these approaches to the set of 116 known benign and disease-causing MLH1 variants. We then used receiver-operating characteristic (ROC) analyses to compare how well the different methods are able to distinguish the 21 benign variants from the 95 known pathogenic variants (Figure 6F and Figure 6—figure supplement 3). The results show that although all predictors perform fairly well, the logistic regression model performs best (AUC: 0.90 ± 0.03), and evolutionary sequence energies alone (AUC: 0.88 ± 0.03) are slightly better at distinguishing disease-linked missense variants from harmless variants than REVEL (AUC: 0.83 ± 0.03), PolyPhen2 (AUC: 0.84 ± 0.03) and PROVEAN (AUC: 0.76 ± 0.04). Structure-based ΔΔG calculations show similar performance to these sequence-based predictors (AUC: 0.82 ± 0.04), but, as the shape of the ROC curve illustrates, are particularly informative in the region of high specificity.

Lastly, we assessed whether the underlying genomic changes could affect splicing and thus have pathogenic potential, rather than directly acting on the protein level. Using SpliceAI (Jaganathan et al., 2019) we predicted that 19/116 (16%) of the ClinVar variants may affect splicing. All 19 are pathogenic variants; no benign variants were predicted to affect splicing. Interestingly, 4 of these variants have scores in our regression model that suggest them to be non-detrimental, and thus predicted not to affect protein function or stability (Figure 6G). Hence, more than half (4/7) of these erroneously classified variants may affect splicing rather that protein function, a substantially higher fraction than the 17% (15/88) that are predicted to affect splicing among those variants that are correctly classified by our regression model integrating ΔΔG and evolutionary sequence energies. The overall fraction predicted to affect splicing is similar to that reported in a recent genome-editing-based study on BRCA1 variants (Findlay et al., 2018).

Discussion

Missense variants in the MLH1 gene are a leading cause of Lynch syndrome (LS) and colorectal cancer (Peltomäki, 2016). In recent years, germline mutations that cause structural destabilization and subsequent protein misfolding have surfaced as the cause of several diseases, including cystic fibrosis (Ahner et al., 2007), phenylketonuria (Pey et al., 2007; Scheller et al., 2019), early onset Parkinson’s disease (Mathiassen et al., 2015; Olzmann et al., 2004) and MSH2-linked LS (Arlow et al., 2013; Nielsen et al., 2017). Although previous studies have shown some MLH1 variants to be destabilized (Takahashi et al., 2007; Perera and Bapat, 2008), this has not been systematically addressed and the contribution of MLH1 protein stability for LS remains to be resolved.

In this study, we performed in silico saturation mutagenesis and stability predictions of all single-site MLH1 missense variants in the structurally-resolved regions of MLH1. Comparisons with a selected group of naturally occurring MLH1 variants revealed that those variants that are predicted to be destabilized indeed display substantially reduced steady-state protein levels. The decreased cellular amounts are caused by rapid proteasomal degradation. In turn, the loss of MLH1 causes a dramatic destabilization and proteasomal degradation of both PMS1 and PMS2. This effect suggests that the MutLα and MutLβ heterodimers are likely to be rather stable protein complexes, as the PMS1 and PMS2 proteins would otherwise be required to be stable in the absence of MLH1. These observations are in line with several previous studies on individual MLH1 variants (Cravo et al., 2002; Kosinski et al., 2010; Perera and Bapat, 2008; Raevaara et al., 2005) including a thorough analysis by Takahashi et al. (2007), and also agree with tissue staining of tumor cells from patients with germline MLH1 mutations (Hampel et al., 2008; de Jong et al., 2004).

Our observation that single amino acid changes in MLH1 are sufficient to cause degradation is similar to results from multiple other proteins including recent deep mutational scans on PTEN and TPMT (Matreyek et al., 2018), our previous results on MSH2 in human cells (Nielsen et al., 2017), and earlier observations on Lynch syndrome MSH2 variants in yeast (Gammie et al., 2007; Arlow et al., 2013). Our structural stability calculations predict that a relatively mild destabilization of just a few (~3) kcal/mol is sufficient to trigger MLH1 degradation, an observation in line with previous studies on other proteins (Bullock et al., 2000; Nielsen et al., 2017; DDD Study et al., 2017; Scheller et al., 2019; Jepsen et al., 2019; Caswell et al., 2019). Although the absolute thermodynamic stability of MLH1 is unknown, both in vitro and in a cellular context, it is possible that the 3 kcal/mol destabilization necessary to trigger degradation is lower than that required to reach the fully unfolded state. Although we show that wild-type MLH1 is perhaps only marginally stable, we propose that these effects of slightly destabilizing amino acid substitutions could be the result of local unfolding events rather than global unfolding. Accordingly, it appears that the PQC system is tightly tuned to detect increased amounts of minor or transient structural defects. This is supported by observations on MSH2 showing that in some cases the destabilized variants are even functional when degradation is blocked (Arlow et al., 2013), and results showing that the PQC system preferably targets folding intermediates (Bershtein et al., 2013). As a result, it is surprising that predictions of changes to the global folding stability (using the fully unfolded state as a reference) are so effective in distinguishing variants that are stable in the cell from those that are more rapidly degraded. These observations also suggest that improved understanding of the biophysical basis for cellular quality control might lead to even better predictions of cellular abundance.

In our cellular studies, the MLH1 variants were expressed from a constitutive promoter, thus bypassing any potential transcriptional regulation of MLH1 expression involving, for example, amounts of the MLH1 protein or its function. Hence, in other experimental setups where MLH1 variants are generated at the endogenous locus, additional layers of control may affect the observed correlation between structural stability and steady-state amounts. Another main difference between the cellular context and the predicted stability is the multiple roles played by the PQC system. For example, on one hand, chaperones may aid in folding or stabilizing proteins, but may at the same time act as sensors of misfolded proteins and help target them for degradation. Increased amounts of unfolded or misfolded proteins may affect and titrate PQC components, thus complicating the relationship between protein stability and abundance. Nevertheless, since the predicted protein stabilities can, to a large extent, discriminate between pathogenic and benign variants, we expect that multiple disease-linked MLH1 variants will indeed display reduced cellular levels.

Similar to our previous observations for MSH2 (Nielsen et al., 2017), these results indicate that structural destabilization appears to be a common consequence of many disease-linked MLH1 missense variants. Supported by earlier functional studies (Takahashi et al., 2007), we suggest that the loss-of-function phenotype in many cases can be explained by structural destabilization and subsequent degradation. Indeed, 20 out of the 31 variants with DME = 0 or DME = 1 have steady-state levels less than 70% of wildtype MLH1. Our data also include examples of loss-of-function variants with high steady-state levels, which is expected, as variants can affect function without modifying stability, for example, by changing binding interfaces or active sites (Gueneau et al., 2013), and these would therefore be interesting to analyze biochemically in more detail. Hence, M35R, N64S and F80V are all close to the ATP binding site in the N-terminal domain of MLH1 (Figure 6—figure supplement 2), and might thus interfere with the catalytic activity. This would be consistent with a loss-of-function phenotype (Takahashi et al., 2007) but wild-type-like cellular protein levels (Table 1).

The correlation between the predicted structural stability with both cellular stability and protein function suggests that the stability predictions may be used for classifying MLH1 missense variants. This is particularly relevant for LS, where according to the ClinVar database (Landrum et al., 2018) 711 out of 851 (~84%) reported MLH1 missense variants are assigned as so-called variants of uncertain significance (VUS) (Manolio et al., 2017). Of note, the fact that we did not observe the E23D VUS as destabilized does not preclude this variant from being pathogenic, since it may affect function without being structurally perturbed. Although the variant appears functional in yeast cells (Takahashi et al., 2007), the evolutionary sequence energy of 0.9 indicates that this change is rare across the MLH1 protein family evolution and thus might be detrimental. Our results for the K618A VUS suggest that this variant, albeit being unstable, is still able to associate with PMS2 and may therefore be functional, but untimely degraded. Of the 69 MLH1 variants that we analyzed, 30 have status as VUSs. Of these, our analysis identified several (e.g. G54E, G244V, and L676R) that hold characteristics indicating that they are likely pathogenic: steady-state levels below 50% of WT, high ΔΔG and low functionality score in vivo (Takahashi et al., 2007) (Table 1).

The potential use of stability predictions for LS diagnostics is supported by the predicted MLH1 stabilities clearly separating into disease-linked and benign MLH1 variants. Moreover, since we observe that those MLH1 alleles that occur more frequently in the population are in general predicted as stable, this suggests that these common MLH1 alleles are either benign or at least only disease-causing with a low penetrance. However, certainly not all the unstable variants were accurately detected by the structural predictions. For instance, out of our eight selected variants, three (R100P, R265C, K618A) appeared unstable, but were not predicted to be so (Table 1). As described above, it is important to note that the stability predictions report on the global stability of the protein, while in a cellular context it is unlikely that any of the variants are fully unfolded. Instead, it is likely that local elements unfold (Stein et al., 2019), and although refolding may occur, the locally unfolded state allows chaperones and other protein quality control components to associate and target the protein for proteasomal degradation (Figure 7). A better understanding of the importance of local unfolding events for cellular stability is an important area for further research. Indeed, in addition to the utility for classifying potential LS variants, we believe that an important aspect of our work is that it suggests a single mechanistic origin of ~60% of the LS MLH1 variants that we have studied. This observation is also supported by our ROC analysis (Figure 6D), which demonstrates that the stability calculations can identify ~60% of the pathogenic variants at very high specificity (few false positives) exactly because these are the variants that appear to cause disease via this mechanism. The ability to separately analyze effects on protein stability and other effects that might be captured by the sequence analysis is also an advantage of applying the predictors individually, rather than relying on combined predictors such as our regression model or published ensemble-based meta-predictors. While those have slightly higher overall accuracy, they do not directly indicate the underlying molecular reason for pathogenicity. Incidentally, we note that by analyzing both calculations and multiplexed assays of variant effects, we recently found that ~60% of disease-causing variants in the protein PTEN were caused by destabilization and a resulting drop in cellular abundance (Jepsen et al., 2019). On the other hand, while stability prediction is very useful for accurate identification of many pathogenic variants, it may have lower overall sensitivity. We speculate that this is due to stability being a necessary, but not sufficient criterion. Thus, destabilized variants are likely pathogenic, while a variant being stable does not necessary imply it being functional.

Figure 7. Model for how structural destabilization of MLH1 contributes to disease.

Figure 7.

The wild-type (green) MLH1-PMS2 heterodimer promotes DNA mismatch repair. Disease-linked missense MLH1 variants (red) may also promote DNA repair, but are at risk of dissociating from PMS2 due to structural destabilization. The structural destabilization of MLH1 may also cause a partial unfolding of MLH1 which is recognized by the molecular chaperone HSP70 and causes proteasomal degradation of the MLH1 variant. In turn, the degradation of MLH1 leaves PMS2 without a partner protein, resulting in proteasomal degradation of PMS2.

Lastly, we predicted the effects of variants with known consequences (benign or pathogenic) may have on splicing, which may also lead to pathogenic changes. We found that, overall, about 16% of the variants are predicted to affect splicing, though none of the 19 benign variants are in this category. Interestingly, 4/7 pathogenic variants in the region that the logistic regression model predicts to be benign may affect splicing (Figure 6G), indicating that integration of effects on the genomic level is likely to boost overall predictive power, and should be considered in future developments of pathogenicity predictors.

In line with observations on other destabilized proteins, we found that the degradation of some structurally destabilized MLH1 variants depends on the molecular chaperone HSP70. This suggests that HSP70 recognizes the destabilized MLH1 variants and targets them for proteasomal degradation. Accordingly, we observed that several destabilized MLH1 variants associate with HSP70. Involvement of molecular chaperones in protein degradation is a well-established phenomenon (Samant et al., 2018; Arndt et al., 2007; Kandasamy and Andréasson, 2018). Moreover, our findings are consistent with recent developments in the field, showing that degradation signals, so-called degrons, are buried within the native structure of most globular proteins. Upon exposure when the protein structure is destabilized, the degrons are recognized by chaperones and other protein quality control components, which in turn guide the target protein for degradation (Enam et al., 2018; Geffen et al., 2016; Kim et al., 2013; Maurer et al., 2016; Ravid and Hochstrasser, 2008). Thus, ultimately the degradation of a protein will depend on both the structural destabilization (ΔΔG) as well as the exposed degrons, and how efficiently these are recognized by the degradation machinery. The results presented here suggest that biophysical calculations are able to predict the structural destabilization (ΔΔG), however, since the nature of protein quality control degrons is still largely undefined (Geffen et al., 2016; Maurer et al., 2016; Rosenbaum et al., 2011; van der Lee et al., 2014), in silico prediction of these is currently not possible.

In conclusion, our results support a model (Figure 7) where missense mutations can cause destabilization of the MLH1 protein, leading to exposure of degrons which, in turn, trigger HSP70-assisted proteasomal degradation, causing disruption of the MMR pathway and ultimately leading to an increased cumulative lifetime risk of cancer development in LS patients. Potentially, this opens up for new therapeutic approaches, including inhibiting the PQC-mediated clearance of marginally stable MLH1 variants, or small molecule stabilizers of MLH1.

Materials and methods

Key resources table.

Reagent type
(species) or
resource
Designation Source or reference Identifiers Additional
information
Gene (Homo sapiens) MLH1 - UniProt identifier: P40692-1 -
Cell line (Homo sapiens) HCT116 ATCC CCL-247EMT; RRID:CVCL_0291 -
Antibody anti-MLH1 (rabbit polyclonal) Santa Cruz
Biotechnology
sc-11442; RRID:AB_2145332 Dilution:
1:100 (IF)
1:1000 (WB)
Antibody anti-β-actin (mouse monoclonal) Sigma-Aldrich A5441; RRID:AB_476744 Dilution:
1:20000
Antibody anti-PMS2 (mouse monoclonal) BD Biosciences 556415; RRID:AB_396410 Dilution:
1:2500
Antibody anti-PMS1 (rabbit polyclonal) Invitrogen PA5-35952; RRID:AB_2553262 Dilution:
1:2500
Antibody anti-GFP (rat monoclonal) ChromoTek 3H9; RRID:AB_10773374 Dilution:
1:2000
Antibody anti-myc (rat monoclonal) ChromoTek 9E1; RRID:AB_2631398 Dilution:
1:1000
Antibody anti-HA (rat monoclonal) Roche 3F10; RRID:AB_2314622 Dilution:
1:2000
Antibody anti-GAPDH (rabbit monoclonal) Cell Signaling Technologies 14C10; RRID:AB_10693448 Dilution:
1:2000
Antibody anti-PMCA (mouse monoclonal) Invitrogen MA3-914;
RRID:AB_2061566
Dilution:
1:2000
Recombinant DNA reagent pCMV6-MYC-DDK-HSP70-1A (HSPA1A) OriGene RC200270 -
Recombinant
 DNA reagent
pcDNA3
-HA-HSP90
Addgene 22487; RRID:Addgene_22487 -
Recombinant DNA reagent pEYFP-C2-PMS2 Prof. Lene J. Rasmussen (Andersen et al., 2012) -
Recombinant DNA reagent pEGFP-C1 Clontech Discontinued
by supplier
Available from
NovoPro Labs (Cat. No. V12024)
Recombinant DNA reagent pcDNA3.1-V5-His Invitrogen V81020 -
Recombinant DNA reagent pCMV-MLH1 and MLH1 variants Prof. Chikashi Ishioka (Takahashi et al., 2007) -
Commercial assay or kit FuGENE HD Promega E2311 -
Chemical
compound, drug
Bortezomib LC Laboratories B-1408 -
Chemical compound, drug YM01 StressMarq SIH-121 -
Chemical compound, drug Geldanamycin Sigma-Aldrich G3381 -
Chemical compound, drug Cycloheximide Sigma-Aldrich C1988 -
Software, algorithm UnScanIt gel Silk Scientific V6.1; RRID:SCR_017291 -
Software,
algorithm
FoldX http://foldxsuite.crg.eu/ January 2017; RRID:SCR_008522 Details see Materials and methods
Software, algorithm Gremlin https://github.com/sokrypton/GREMLIN V2.01 Details see Materials and methods
Software, algorithm Custom R script - - -
Software, algorithm SpliceAI https://github.com/Illumina/SpliceAI V1.2.1 -

Plasmids

Plasmids for expression of wild-type and mutant MLH1 variants have been described before (Takahashi et al., 2007). pCMV-MYC-DDK-HSP70 and pcDNA3-HA-HSP90 were kindly provided by Dr. Kenneth Thirstrup (H. Lundbeck A/S). The pEYFP-C2-PMS2 plasmid was kindly provided by Prof. Lene J. Rasmussen (University of Copenhagen). pEGFP was purchased from Clontech. A pcDNA3-V5 vector served as a negative control.

Cell culture

HCT116 cells (kindly provided by Prof. Mads Gyrd-Hansen (University of Oxford)) were maintained in McCoy’s 5A medium (Gibco) supplemented with 10% fetal calf serum (Invitrogen), 2 mM glutamine, 5000 IU/ml penicillin and 5 mg/ml streptomycin at 37°C in a humidified atmosphere with 5% CO2. Cell line authentication by STR analysis was performed by Eurofins. The cells were not contaminated by mycoplasma.

Transfections were performed using FugeneHD (Promega) as described by the manufacturer in reduced serum medium OptiMEM (Gibco). Cells were harvested no later than 72 hr after transfection at a final confluence around 90%. About 24 hr after transfection, cells were treated with serum-free growth medium containing 25 µg/mL cycloheximide (Sigma) for a duration of 4, 8 or 12 hr, 10 µM bortezomib (LC laboratories) for 8 or 16 hr, 5 µM YM01 (StressMarq) for 24 hr or 1 µM geldanamycin (Sigma) for 24 hr. Cells were lysed in SDS sample buffer (94 mM Tris/HCl pH 6.8, 3% SDS, 19% glycerol and 0.75% β-mercaptoethanol) and protein levels were analyzed by SDS-PAGE and western blotting.

SDS-PAGE and western blotting

Proteins were resolved by SDS-PAGE on 7 × 8 cm 12.5% acrylamide gels, and transferred to 0.2 μm nitrocellulose membranes (Advantec, Toyo Roshi Kaisha Ltd.). Blocking was performed using PBS (8 g/L NaCl, 0.2 g/L KCl, 1.44 g/L Na2HPO4, 0.24 g/L KH2PO4, pH 7.4) with 5% dry milk powder and 0.05% Tween-20. Membranes were probed with primary antibodies (see Key Resources Table) at 4°C overnight. HRP-conjugated secondary antibodies were purchased from DAKO. ECL detection reagent (GE Life Sciences) was used for development.

Immunofluorescence and imaging

Transfected cells were seeded 24 hr prior to fixing with 4% formaldehyde in PBS in thin-bottomed 384-well plates. The fixed cells were then washed three times in PBS and permeabilized with 0.25% Triton-X-100 in PBS for 5 min at room temperature (RT). After washing with PBS, 5% bovine serum albumin (BSA, Sigma) in PBS was used for 45 min at RT for blocking. The cells were then washed with PBS and incubated with a 1:100 dilution of the anti-MLH1 antibody (Santa Cruz Biotechnology, Product no.: sc-11442) in 1% BSA in PBS for 1 hr at RT. The cells were washed with PBS and incubated with a 1:1000 dilution Alexa Fluor 568 anti-rabbit antibody (Invitrogen) in 1% BSA in PBS for 1 hr at RT. After additional washing with PBS, the DNA was stained with Höchst 33342 (Sigma) for 10 min. Microscopy was performed using an InCell2200 microscope (GE Healthcare). The filters were Höchst (ex 390 nm, em 432 nm) and TexasRed (ex 575 nm, em 620 nm). The InCell Developer Toolbox (GE Healthcare) was used for image analysis. To determine the abundance of the MLH1 variants, the total intensity of the red channel in each cell was measured after excluding the non-transfected cells.

Co-immunoprecipitation

Transfected cells were lysed in buffer A (50 mM Tris/HCl pH 7.5, 150 mM NaCl, 1 mM EDTA and 0.5% NP-40 supplemented with Complete Mini EDTA-free Protease inhibitor cocktail tablets (Roche)) and left to incubate for 20 min on ice. The lysates were cleared by centrifugation (13000 g, 30 min) and the proteins were captured with GFP-trap (Chromotek), Myc-trap (Chromotek) or HA-agarose beads (Sigma) by tumbling end-over-end overnight at 4°C. The beads were washed three times by centrifugation (1000 g, 10 s) in buffer A. Finally, the beads were resuspended in SDS sample buffer (94 mM Tris/HCl pH 6.8, 3% SDS, 19% glycerol and 0.75% β-mercaptoethanol) and analyzed by SDS-PAGE and western blotting.

Solubility assays

For analyses of wild-type MLH1, HCT116 cells were transfected as described above. Cells were harvested in buffer B (50 mM Tris/HCl pH 7.5, 150 mM NaCl, 1 mM EDTA) supplemented with complete protease inhibitor cocktail tablets (Roche), and lysed by sonication (3 × 10 s). Then the lysate was distributed into different tubes that were incubated at 30 min. at the indicated temperatures. The soluble and insoluble fractions were separated by centrifugation (15000 g, 4°C, 30 min), after which the supernatant was removed and the pellet washed once in buffer B. Subsequently, SDS sample buffer was added, and the final sample volume was kept identical between the pellet and the supernatant. Finally, fractions were analyzed by SDS-PAGE and western blotting as described.

For comparison of the variants, HCT116 cells were transfected and treated with bortezomib as described above. Cells were harvested in buffer B supplemented with complete protease inhibitor cocktail tablets (Roche), and lysed by sonication (3 × 10 s). The soluble and insoluble fractions were then immediately separated by centrifugation and analyzed SDS-PAGE and western blotting as described above.

Stability calculations

The changes in folding stability (ΔΔG) were calculated using FoldX (Guerois et al., 2002) based on PDB IDs 4P7A (Wu et al., 2015) for the N-terminal domain of MLH1, and 3RBN for the C-terminal domain. The ΔΔGs were calculated from each structure individually by first applying the RepairPDB function to fix minor issues in the original coordinates, and then the BuildModel function to generate each individual amino acid variant. Each calculation was repeated five times, and the average difference in stability between wild type and variant is reported, such that values < 0 kcal/mol indicate stabilized variants, and values > 0 kcal/mol indicate destabilized variants with respect to the wild type MLH1 protein. Values > 15 kcal/mol likely indicate clashes in the model FoldX generated. While this does indicate that major destabilization is likely, the actual values are less meaningful for these clashing variants, and we thus truncated them to 15 kcal/mol in our figures.

Evolutionary sequence energy calculations

To assess the likelihood of finding any given variant in the protein family, we created a multiple sequence alignment of human MLH1 using HHblits (Zimmermann et al., 2018) and then calculated a sequence log-likelihood score combining site-conservation and pairwise co-variation using Gremlin (Balakrishnan et al., 2011). Scores were normalized by their rank to a range of (0,1), with low scores indicating tolerated sequences and high scores indicating variants that are rare or unobserved across the multiple sequence alignment. Positions at which the number of distinct homologous sequences was too small to extract meaningful evolutionary sequence energies are set to NA (Supplementary file 2). Other sequence-based predictions of functional variant consequences were retrieved from the webservers of PROVEAN (Choi and Chan, 2015) and PolyPhen2 (Adzhubei et al., 2010), and extracted from the pre-calculated scores for REVEL (Ioannidis et al., 2016).

Dominant mutator effect (DME)

We grouped the functional classification observations from Takahashi et al. (2007) into four categories by summarizing the number of assays each variant showed functional behavior in. Thus, variants in group 0 were non-functional in all three assays, those in group three were functional in all three assays, and the rest were functional in some, but not in other assays.

Acknowledgements

The authors thank Prof. Lene J Rasmussen, Prof. Mads Gyrd-Hansen, and Dr. Kenneth Thirstrup for sharing cells and reagents, and Dr. Elin J Pietras, Dr. Cornelia Steinhauer, Cecilie Søltoft, Anne-Marie Lauridsen, and the Core Facility for Integrated Microscopy for excellent technical assistance.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Amelie Stein, Email: amelie.stein@bio.ku.dk.

Kresten Lindorff-Larsen, Email: lindorff@bio.ku.dk.

Rasmus Hartmann-Petersen, Email: rhpetersen@bio.ku.dk.

Sarel Jacob Fleishman, Weizmann Institute of Science, Israel.

John Kuriyan, University of California, Berkeley, United States.

Funding Information

This paper was supported by the following grants:

  • Novo Nordisk Foundation PRISM Challenge Program to Amelie Stein, Kresten Lindorff-Larsen, Rasmus Hartmann-Petersen.

  • Novo Nordisk Foundation Young Investigator Award, NNF15OC0016662 to Eva R Hoffmann.

  • The A.P. Møller Foundation Project Grant to Rasmus Hartmann-Petersen.

  • Danish Cancer Society Project Grant to Rasmus Hartmann-Petersen.

  • Danish Council for Independent Research Project Grant to Rasmus Hartmann-Petersen.

  • Lundbeck Foundation Project Grant to Kresten Lindorff-Larsen, Rasmus Hartmann-Petersen.

  • Lundbeck Foundation Fellowship Grant to Amelie Stein.

  • Novo Nordisk Foundation Hallas-Møller Program to Kresten Lindorff-Larsen.

Additional information

Competing interests

No competing interests declared.

Author contributions

Formal analysis, Investigation, Visualization, Writing—original draft.

Formal analysis, Investigation, Visualization, Writing—review and editing.

Formal analysis, Methodology, Writing—review and editing.

Formal analysis, Investigation, Writing—review and editing.

Conceptualization, Investigation.

Supervision, Investigation.

Conceptualization, Supervision.

Conceptualization, Writing—review and editing.

Conceptualization, Writing—review and editing.

Resources, Writing—review and editing.

Resources, Writing—review and editing.

Conceptualization, Formal analysis, Supervision, Writing—review and editing.

Conceptualization, Formal analysis, Supervision, Writing—review and editing.

Additional files

Supplementary file 1. Full matrix for FoldX stability predictions.
elife-49138-supp1.xlsx (122.6KB, xlsx)
DOI: 10.7554/eLife.49138.019
Supplementary file 2. Full matrix for evolutionary sequence energies.
elife-49138-supp2.xlsx (178KB, xlsx)
DOI: 10.7554/eLife.49138.020
Transparent reporting form
DOI: 10.7554/eLife.49138.021

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files.

References

  1. Aarnio M, Sankila R, Pukkala E, Salovaara R, Aaltonen LA, de la Chapelle A, Peltomäki P, Mecklin JP, Järvinen HJ. Cancer risk in mutation carriers of DNA-mismatch-repair genes. International Journal of Cancer. 1999;81:214–218. doi: 10.1002/(SICI)1097-0215(19990412)81:2&#x0003c;214::AID-IJC8&#x0003e;3.0.CO;2-L. [DOI] [PubMed] [Google Scholar]
  2. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. A method and server for predicting damaging missense mutations. Nature Methods. 2010;7:248–249. doi: 10.1038/nmeth0410-248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ahner A, Nakatsukasa K, Zhang H, Frizzell RA, Brodsky JL. Small heat-shock proteins select deltaF508-CFTR for endoplasmic reticulum-associated degradation. Molecular Biology of the Cell. 2007;18:806–814. doi: 10.1091/mbc.e06-05-0458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Allen M, Poggiali D, Whitaker K, Marshall TR, Kievit R. Raincloud plots: a multi-platform tool for robust data visualization. PeerJ Preprints. 2018 doi: 10.7287/peerj.preprints.27137v1. [DOI] [PMC free article] [PubMed]
  5. Andersen SD, Liberti SE, Lützen A, Drost M, Bernstein I, Nilbert M, Dominguez M, Nyström M, Hansen TV, Christoffersen JW, Jäger AC, de Wind N, Nielsen FC, Tørring PM, Rasmussen LJ. Functional characterization of MLH1 missense variants identified in lynch syndrome patients. Human Mutation. 2012;33:1647–1655. doi: 10.1002/humu.22153. [DOI] [PubMed] [Google Scholar]
  6. Arlow T, Scott K, Wagenseller A, Gammie A. Proteasome inhibition rescues clinically significant unstable variants of the mismatch repair protein Msh2. PNAS. 2013;110:246–251. doi: 10.1073/pnas.1215510110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Arndt V, Rogon C, Höhfeld J. To be, or not to be--molecular chaperones in protein degradation. Cellular and Molecular Life Sciences. 2007;64:2525–2541. doi: 10.1007/s00018-007-7188-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Balakrishnan S, Kamisetty H, Carbonell JG, Lee SI, Langmead CJ. Learning generative models for protein fold families. Proteins: Structure, Function, and Bioinformatics. 2011;79:1061–1078. doi: 10.1002/prot.22934. [DOI] [PubMed] [Google Scholar]
  9. Barrow E, Alduaij W, Robinson L, Shenton A, Clancy T, Lalloo F, Hill J, Evans DG. Colorectal Cancer in HNPCC: cumulative lifetime incidence, survival and tumour distribution. A report of 121 families with proven mutations. Clinical Genetics. 2008;74:233–242. doi: 10.1111/j.1399-0004.2008.01035.x. [DOI] [PubMed] [Google Scholar]
  10. Bershtein S, Mu W, Serohijos AW, Zhou J, Shakhnovich EI. Protein quality control acts on folding intermediates to shape the effects of mutations on organismal fitness. Molecular Cell. 2013;49:133–144. doi: 10.1016/j.molcel.2012.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bullock AN, Henckel J, Fersht AR. Quantitative analysis of residual folding and DNA binding in mutant p53 core domain: definition of mutant states for rescue in cancer therapy. Oncogene. 2000;19:1245–1256. doi: 10.1038/sj.onc.1203434. [DOI] [PubMed] [Google Scholar]
  12. Cannavo E, Gerrits B, Marra G, Schlapbach R, Jiricny J. Characterization of the interactome of the human MutL homologues MLH1, PMS1, and PMS2. Journal of Biological Chemistry. 2007;282:2976–2986. doi: 10.1074/jbc.M609989200. [DOI] [PubMed] [Google Scholar]
  13. Casadio R, Vassura M, Tiwari S, Fariselli P, Luigi Martelli P. Correlating disease-related mutations to their effect on protein stability: a large-scale analysis of the human proteome. Human Mutation. 2011;32:1161–1170. doi: 10.1002/humu.21555. [DOI] [PubMed] [Google Scholar]
  14. Caswell RC, Owens MM, Gunning AC, Ellard S, Wright CF. Using structural analysis in silico to assess the impact of missense variants in MEN1. bioRxiv. 2019 doi: 10.1101/661512. [DOI] [PMC free article] [PubMed]
  15. Choi Y, Chan AP. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31:2745–2747. doi: 10.1093/bioinformatics/btv195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Cravo M, Afonso AJ, Lage P, Albuquerque C, Maia L, Lacerda C, Fidalgo P, Chaves P, Cruz C, Nobre-Leitao C. Pathogenicity of missense and splice site mutations in hMSH2 and hMLH1 mismatch repair genes: implications for genetic testing. Gut. 2002;50:405–412. doi: 10.1136/gut.50.3.405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. DDD Study. Suri M, Evers JMG, Laskowski RA, O'Brien S, Baker K, Clayton-Smith J, Dabir T, Josifova D, Joss S, Kerr B, Kraus A, McEntagart M, Morton J, Smith A, Splitt M, Thornton JM, Wright CF. Protein structure and phenotypic analysis of pathogenic and population missense variants in STXBP1. Molecular Genetics & Genomic Medicine. 2017;5:495–507. doi: 10.1002/mgg3.304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. de Jong AE, van Puijenbroek M, Hendriks Y, Tops C, Wijnen J, Ausems MG, Meijers-Heijboer H, Wagner A, van Os TA, Bröcker-Vriends AH, Vasen HF, Morreau H. Microsatellite instability, immunohistochemistry, and additional PMS2 staining in suspected hereditary nonpolyposis colorectal Cancer. Clinical Cancer Research. 2004;10:972–980. doi: 10.1158/1078-0432.CCR-0956-3. [DOI] [PubMed] [Google Scholar]
  19. Dowty JG, Win AK, Buchanan DD, Lindor NM, Macrae FA, Clendenning M, Antill YC, Thibodeau SN, Casey G, Gallinger S, Marchand LL, Newcomb PA, Haile RW, Young GP, James PA, Giles GG, Gunawardena SR, Leggett BA, Gattas M, Boussioutas A, Ahnen DJ, Baron JA, Parry S, Goldblatt J, Young JP, Hopper JL, Jenkins MA. Cancer risks for MLH1 and MSH2 mutation carriers. Human Mutation. 2013;34:490–497. doi: 10.1002/humu.22262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Dunlop MG, Farrington SM, Carothers AD, Wyllie AH, Sharp L, Burn J, Liu B, Kinzler KW, Vogelstein B. Cancer risk associated with germline DNA mismatch repair gene mutations. Human Molecular Genetics. 1997;6:105–110. doi: 10.1093/hmg/6.1.105. [DOI] [PubMed] [Google Scholar]
  21. Enam C, Geffen Y, Ravid T, Gardner RG. Protein quality control degradation in the nucleus. Annual Review of Biochemistry. 2018;87:725–749. doi: 10.1146/annurev-biochem-062917-012730. [DOI] [PubMed] [Google Scholar]
  22. Fedier A, Stuedli A, Fink D. Presence of MLH1 protein aggravates the potential of the HSP90 inhibitor radicicol to sensitize tumor cells to cisplatin. International Journal of Oncology. 2005;27:1697–1705. [PubMed] [Google Scholar]
  23. Findlay GM, Daza RM, Martin B, Zhang MD, Leith AP, Gasperini M, Janizek JD, Huang X, Starita LM, Shendure J. Accurate classification of BRCA1 variants with saturation genome editing. Nature. 2018;562:217–222. doi: 10.1038/s41586-018-0461-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Gammie AE, Erdeniz N, Beaver J, Devlin B, Nanji A, Rose MD. Functional characterization of pathogenic human MSH2 missense mutations in Saccharomyces cerevisiae. Genetics. 2007;177:707–721. doi: 10.1534/genetics.107.071084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Geffen Y, Appleboim A, Gardner RG, Friedman N, Sadeh R, Ravid T. Mapping the Landscape of a Eukaryotic Degronome. Molecular Cell. 2016;63:1055–1065. doi: 10.1016/j.molcel.2016.08.005. [DOI] [PubMed] [Google Scholar]
  26. Gueneau E, Dherin C, Legrand P, Tellier-Lebegue C, Gilquin B, Bonnesoeur P, Londino F, Quemener C, Le Du MH, Márquez JA, Moutiez M, Gondry M, Boiteux S, Charbonnier JB. Structure of the MutLα C-terminal domain reveals how Mlh1 contributes to Pms1 endonuclease site. Nature Structural & Molecular Biology. 2013;20:461–468. doi: 10.1038/nsmb.2511. [DOI] [PubMed] [Google Scholar]
  27. Guerois R, Nielsen JE, Serrano L. Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. Journal of Molecular Biology. 2002;320:369–387. doi: 10.1016/S0022-2836(02)00442-4. [DOI] [PubMed] [Google Scholar]
  28. Hampel H, Frankel WL, Martin E, Arnold M, Khanduja K, Kuebler P, Clendenning M, Sotamaa K, Prior T, Westman JA, Panescu J, Fix D, Lockman J, LaJeunesse J, Comeras I, de la Chapelle A. Feasibility of screening for lynch syndrome among patients with colorectal Cancer. Journal of Clinical Oncology. 2008;26:5783–5788. doi: 10.1200/JCO.2008.17.5950. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Heinen CD. Genotype to phenotype: analyzing the effects of inherited mutations in colorectal Cancer families. Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis. 2010;693:32–45. doi: 10.1016/j.mrfmmm.2009.09.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Hinrichsen I, Weßbecher IM, Huhn M, Passmann S, Zeuzem S, Plotz G, Biondi RM, Brieger A. Phosphorylation-dependent signaling controls degradation of DNA mismatch repair protein PMS2. Molecular Carcinogenesis. 2017;56:2663–2668. doi: 10.1002/mc.22709. [DOI] [PubMed] [Google Scholar]
  31. Ioannidis NM, Rothstein JH, Pejaver V, Middha S, McDonnell SK, Baheti S, Musolf A, Li Q, Holzinger E, Karyadi D, Cannon-Albright LA, Teerlink CC, Stanford JL, Isaacs WB, Xu J, Cooney KA, Lange EM, Schleutker J, Carpten JD, Powell IJ, Cussenot O, Cancel-Tassin G, Giles GG, MacInnis RJ, Maier C, Hsieh CL, Wiklund F, Catalona WJ, Foulkes WD, Mandal D, Eeles RA, Kote-Jarai Z, Bustamante CD, Schaid DJ, Hastie T, Ostrander EA, Bailey-Wilson JE, Radivojac P, Thibodeau SN, Whittemore AS, Sieh W. REVEL: an ensemble method for predicting the pathogenicity of rare missense variants. The American Journal of Human Genetics. 2016;99:877–885. doi: 10.1016/j.ajhg.2016.08.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Jaganathan K, Kyriazopoulou Panagiotopoulou S, McRae JF, Darbandi SF, Knowles D, Li YI, Kosmicki JA, Arbelaez J, Cui W, Schwartz GB, Chow ED, Kanterakis E, Gao H, Kia A, Batzoglou S, Sanders SJ, Farh KK-H. Predicting Splicing from Primary Sequence with Deep Learning. Cell. 2019;176:535–548. doi: 10.1016/j.cell.2018.12.015. [DOI] [PubMed] [Google Scholar]
  33. Jepsen MM, Fowler DM, Hartmann-Petersen R, Stein A, Lindorff-Larsen K. Classifying disease-associated variants using measures of protein activity and stability. bioRxiv. 2019 doi: 10.1101/688234. [DOI]
  34. Jiricny J. The multifaceted mismatch-repair system. Nature Reviews Molecular Cell Biology. 2006;7:335–346. doi: 10.1038/nrm1907. [DOI] [PubMed] [Google Scholar]
  35. Jiricny J. Postreplicative mismatch repair. Cold Spring Harbor Perspectives in Biology. 2013;5:a012633. doi: 10.1101/cshperspect.a012633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Jun SH, Kim TG, Ban C. DNA mismatch repair system. Classical and fresh roles. FEBS Journal. 2006;273:1609–1619. doi: 10.1111/j.1742-4658.2006.05190.x. [DOI] [PubMed] [Google Scholar]
  37. Kampmeyer C, Nielsen SV, Clausen L, Stein A, Gerdes A-M, Lindorff-Larsen K, Hartmann-Petersen R. Blocking protein quality control to counter hereditary cancers. Genes, Chromosomes and Cancer. 2017;56:823–831. doi: 10.1002/gcc.22487. [DOI] [PubMed] [Google Scholar]
  38. Kandasamy G, Andréasson C. Hsp70-Hsp110 chaperones deliver ubiquitin-dependent and -independent substrates to the 26S proteasome for proteolysis in yeast. Journal of Cell Science. 2018;131:jcs210948. doi: 10.1242/jcs.210948. [DOI] [PubMed] [Google Scholar]
  39. Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alfoldi J, Wang Q, Collins RL, Laricchia KM, Ganna A, Birnbaum DP, Gauthier LD, Brand H, Solomonson M, Watts NA, Rhodes D, Singer-Berk M, Seaby EG, Kosmicki JA, Walters RK, Tashman K, Farjoun Y, Banks E, Poterba T, Wang A, Seed C, Whiffin N, Chong J, Samocha KE, Pierce-Hoffman E, Zappala Z, Donnell-Luria AH, Minikel E, Weisburd B, Lek M, Ware JS, Vittal C, Armean IM, Bergelson L, Cibulskis K, Connolly KM, Covarrubias M, Donnelly S, Ferriera S, Gabriel S, Gentry J, Gupta N, Jeandet T, Kaplan D, Llanwarne C, Munshi R, Novod S, Petrillo N, Roazen D, Ruano-Rubio V, Saltzman A, Schleicher M, Soto J, Tibbetts K, Tolonen C, Wade G, Talkowski ME, The Genome Aggregation Database Consortium. Daly MJ, MacArthu DG, Neale NB. Variation across 141,456 human exomes and genomes reveals the spectrum of loss-of-function intolerance across human protein-coding genes. bioRxiv. 2019 doi: 10.1101/531210. [DOI]
  40. Karras GI, Yi S, Sahni N, Fischer M, Xie J, Vidal M, D'Andrea AD, Whitesell L, Lindquist S. HSP90 shapes the consequences of human genetic variation. Cell. 2017;168:856–866. doi: 10.1016/j.cell.2017.01.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Kiel C, Benisty H, Lloréns-Rico V, Serrano L. The yin-yang of kinase activation and unfolding explains the peculiarity of Val600 in the activation segment of BRAF. eLife. 2016;5:e12814. doi: 10.7554/eLife.12814. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Kiel C, Serrano L. Structure-energy-based predictions and network modelling of RASopathy and Cancer missense mutations. Molecular Systems Biology. 2014;10:727. doi: 10.1002/msb.20145092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Kim I, Miller CR, Young DL, Fields S. High-throughput analysis of in vivo protein stability. Molecular & Cellular Proteomics : MCP. 2013;12:3370–3378. doi: 10.1074/mcp.O113.031708. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Kondo E, Horii A, Fukushige S. The interacting domains of three MutL heterodimers in man: hmlh1 interacts with 36 homologous amino acid residues within hMLH3, hPMS1 and hPMS2. Nucleic Acids Research. 2001;29:1695–1702. doi: 10.1093/nar/29.8.1695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Kosinski J, Hinrichsen I, Bujnicki JM, Friedhoff P, Plotz G. Identification of lynch syndrome mutations in the MLH1-PMS2 interface that disturb dimerization and mismatch repair. Human Mutation. 2010;31:975–982. doi: 10.1002/humu.21301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kriegenburg F, Jakopec V, Poulsen EG, Nielsen SV, Roguev A, Krogan N, Gordon C, Fleig U, Hartmann-Petersen R. A chaperone-assisted degradation pathway targets kinetochore proteins to ensure genome stability. PLOS Genetics. 2014;10:e1004140. doi: 10.1371/journal.pgen.1004140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Landrum MJ, Lee JM, Benson M, Brown GR, Chao C, Chitipiralla S, Gu B, Hart J, Hoffman D, Jang W, Karapetyan K, Katz K, Liu C, Maddipatla Z, Malheiro A, McDaniel K, Ovetsky M, Riley G, Zhou G, Holmes JB, Kattman BL, Maglott DR. ClinVar: improving access to variant interpretations and supporting evidence. Nucleic Acids Research. 2018;46:D1062–D1067. doi: 10.1093/nar/gkx1153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, O'Donnell-Luria AH, Ware JS, Hill AJ, Cummings BB, Tukiainen T, Birnbaum DP, Kosmicki JA, Duncan LE, Estrada K, Zhao F, Zou J, Pierce-Hoffman E, Berghout J, Cooper DN, Deflaux N, DePristo M, Do R, Flannick J, Fromer M, Gauthier L, Goldstein J, Gupta N, Howrigan D, Kiezun A, Kurki MI, Moonshine AL, Natarajan P, Orozco L, Peloso GM, Poplin R, Rivas MA, Ruano-Rubio V, Rose SA, Ruderfer DM, Shakir K, Stenson PD, Stevens C, Thomas BP, Tiao G, Tusie-Luna MT, Weisburd B, Won HH, Yu D, Altshuler DM, Ardissino D, Boehnke M, Danesh J, Donnelly S, Elosua R, Florez JC, Gabriel SB, Getz G, Glatt SJ, Hultman CM, Kathiresan S, Laakso M, McCarroll S, McCarthy MI, McGovern D, McPherson R, Neale BM, Palotie A, Purcell SM, Saleheen D, Scharf JM, Sklar P, Sullivan PF, Tuomilehto J, Tsuang MT, Watkins HC, Wilson JG, Daly MJ, MacArthur DG, Exome Aggregation Consortium Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536:285–291. doi: 10.1038/nature19057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Li GM, Modrich P. Restoration of mismatch repair to nuclear extracts of H6 colorectal tumor cells by a heterodimer of human MutL homologs. PNAS. 1995;92:1950–1954. doi: 10.1073/pnas.92.6.1950. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Lynch HT, Snyder CL, Shaw TG, Heinen CD, Hitchins MP. Milestones of lynch syndrome: 1895-2015. Nature Reviews Cancer. 2015;15:181–194. doi: 10.1038/nrc3878. [DOI] [PubMed] [Google Scholar]
  51. Manolio TA, Fowler DM, Starita LM, Haendel MA, MacArthur DG, Biesecker LG, Worthey E, Chisholm RL, Green ED, Jacob HJ, McLeod HL, Roden D, Rodriguez LL, Williams MS, Cooper GM, Cox NJ, Herman GE, Kingsmore S, Lo C, Lutz C, MacRae CA, Nussbaum RL, Ordovas JM, Ramos EM, Robinson PN, Rubinstein WS, Seidman C, Stranger BE, Wang H, Westerfield M, Bult C. Bedside back to bench: building bridges between basic and clinical genomic research. Cell. 2017;169:6–12. doi: 10.1016/j.cell.2017.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Mathiassen SG, Larsen IB, Poulsen EG, Madsen CT, Papaleo E, Lindorff-Larsen K, Kragelund BB, Nielsen ML, Kriegenburg F, Hartmann-Petersen R. A Two-step protein quality control pathway for a misfolded DJ-1 variant in fission yeast. Journal of Biological Chemistry. 2015;290:21141–21153. doi: 10.1074/jbc.M115.662312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Matreyek KA, Starita LM, Stephany JJ, Martin B, Chiasson MA, Gray VE, Kircher M, Khechaduri A, Dines JN, Hause RJ, Bhatia S, Evans WE, Relling MV, Yang W, Shendure J, Fowler DM. Multiplex assessment of protein variant abundance by massively parallel sequencing. Nature Genetics. 2018;50:874–882. doi: 10.1038/s41588-018-0122-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Maurer MJ, Spear ED, Yu AT, Lee EJ, Shahzad S, Michaelis S. Degradation signals for Ubiquitin-Proteasome dependent cytosolic protein quality control (CytoQC) in yeast. G3: Genes, Genomes, Genetics. 2016 doi: 10.1534/g3.116.027953. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. McShane E, Sin C, Zauber H, Wells JN, Donnelly N, Wang X, Hou J, Chen W, Storchova Z, Marsh JA, Valleriani A, Selbach M. Kinetic analysis of protein stability reveals Age-Dependent degradation. Cell. 2016;167:803–815. doi: 10.1016/j.cell.2016.09.015. [DOI] [PubMed] [Google Scholar]
  56. Mitchell AL, Attwood TK, Babbitt PC, Blum M, Bork P, Bridge A, Brown SD, Chang HY, El-Gebali S, Fraser MI, Gough J, Haft DR, Huang H, Letunic I, Lopez R, Luciani A, Madeira F, Marchler-Bauer A, Mi H, Natale DA, Necci M, Nuka G, Orengo C, Pandurangan AP, Paysan-Lafosse T, Pesseat S, Potter SC, Qureshi MA, Rawlings ND, Redaschi N, Richardson LJ, Rivoire C, Salazar GA, Sangrador-Vegas A, Sigrist CJA, Sillitoe I, Sutton GG, Thanki N, Thomas PD, Tosatto SCE, Yong SY, Finn RD. InterPro in 2019: improving coverage, classification and access to protein sequence annotations. Nucleic Acids Research. 2019;47:D351–D360. doi: 10.1093/nar/gky1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Mohd AB, Palama B, Nelson SE, Tomer G, Nguyen M, Huo X, Buermeyer AB. Truncation of the C-terminus of human MLH1 blocks intracellular stabilization of PMS2 and disrupts DNA mismatch repair. DNA Repair. 2006;5:347–361. doi: 10.1016/j.dnarep.2005.11.001. [DOI] [PubMed] [Google Scholar]
  58. Møller P, Seppälä TT, Bernstein I, Holinski-Feder E, Sala P, Gareth Evans D, Lindblom A, Macrae F, Blanco I, Sijmons RH, Jeffries J, Vasen HFA, Burn J, Nakken S, Hovig E, Rødland EA, Tharmaratnam K, de Vos Tot Nederveen Cappel WH, Hill J, Wijnen JT, Jenkins MA, Green K, Lalloo F, Sunde L, Mints M, Bertario L, Pineda M, Navarro M, Morak M, Renkonen-Sinisalo L, Valentin MD, Frayling IM, Plazzer JP, Pylvanainen K, Genuardi M, Mecklin JP, Moeslein G, Sampson JR, Capella G, Mallorca Group Cancer risk and survival in path_MMR carriers by gene and gender up to 75 years of age: a report from the prospective lynch syndrome database. Gut. 2018;67:1306–1316. doi: 10.1136/gutjnl-2017-314057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Nielsen SV, Poulsen EG, Rebula CA, Hartmann-Petersen R. Protein quality control in the nucleus. Biomolecules. 2014;4:646–661. doi: 10.3390/biom4030646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Nielsen SV, Stein A, Dinitzen AB, Papaleo E, Tatham MH, Poulsen EG, Kassem MM, Rasmussen LJ, Lindorff-Larsen K, Hartmann-Petersen R. Predicting the impact of lynch syndrome-causing missense mutations from structural calculations. PLOS Genetics. 2017;13:e1006739. doi: 10.1371/journal.pgen.1006739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Olzmann JA, Brown K, Wilkinson KD, Rees HD, Huai Q, Ke H, Levey AI, Li L, Chin LS. Familial parkinson's disease-associated L166P mutation disrupts DJ-1 protein folding and function. Journal of Biological Chemistry. 2004;279:8506–8515. doi: 10.1074/jbc.M311017200. [DOI] [PubMed] [Google Scholar]
  62. Palomaki GE, McClain MR, Melillo S, Hampel HL, Thibodeau SN. EGAPP supplementary evidence review: dna testing strategies aimed at reducing morbidity and mortality from lynch syndrome. Genetics in Medicine. 2009;11:42–65. doi: 10.1097/GIM.0b013e31818fa2db. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Peltomäki P, Aaltonen LA, Sistonen P, Pylkkänen L, Mecklin JP, Järvinen H, Green JS, Jass JR, Weber JL, Leach FS. Genetic mapping of a locus predisposing to human colorectal Cancer. Science. 1993;260:810–812. doi: 10.1126/science.8484120. [DOI] [PubMed] [Google Scholar]
  64. Peltomäki P. Update on lynch syndrome genomics. Familial Cancer. 2016;15:385–393. doi: 10.1007/s10689-016-9882-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Peltomäki P, Vasen HF. Mutations predisposing to hereditary nonpolyposis colorectal Cancer: database and results of a collaborative study. The international collaborative group on hereditary nonpolyposis colorectal Cancer. Gastroenterology. 1997;113:1146–1158. doi: 10.1053/gast.1997.v113.pm9322509. [DOI] [PubMed] [Google Scholar]
  66. Perera S, Bapat B. The MLH1 variants p.arg265cys and p.lys618ala affect protein stability while p.leu749gln affects heterodimer formation. Human Mutation. 2008;29:332. doi: 10.1002/humu.9523. [DOI] [PubMed] [Google Scholar]
  67. Pey AL, Stricher F, Serrano L, Martinez A. Predicted effects of missense mutations on native-state stability account for phenotypic outcome in Phenylketonuria, a paradigm of misfolding diseases. The American Journal of Human Genetics. 2007;81:1006–1024. doi: 10.1086/521879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Plaschke J, Engel C, Krüger S, Holinski-Feder E, Pagenstecher C, Mangold E, Moeslein G, Schulmann K, Gebert J, von Knebel Doeberitz M, Rüschoff J, Loeffler M, Schackert HK. Lower incidence of colorectal Cancer and later age of disease onset in 27 families with pathogenic MSH6 germline mutations compared with families with MLH1 or MSH2 mutations: the german hereditary nonpolyposis colorectal Cancer consortium. Journal of Clinical Oncology. 2004;22:4486–4494. doi: 10.1200/JCO.2004.02.033. [DOI] [PubMed] [Google Scholar]
  69. Raevaara TE, Korhonen MK, Lohi H, Hampel H, Lynch E, Lönnqvist KE, Holinski-Feder E, Sutter C, McKinnon W, Duraisamy S, Gerdes AM, Peltomäki P, Kohonen-Ccorish M, Mangold E, Macrae F, Greenblatt M, de la Chapelle A, Nyström M. Functional significance and clinical phenotype of nontruncating mismatch repair variants of MLH1. Gastroenterology. 2005;129:537–549. doi: 10.1053/j.gastro.2005.06.005. [DOI] [PubMed] [Google Scholar]
  70. Räschle M, Dufner P, Marra G, Jiricny J. Mutations within the hMLH1 and hPMS2 subunits of the human MutLalpha mismatch repair factor affect its ATPase activity, but not its ability to interact with hMutSalpha. Journal of Biological Chemistry. 2002;277:21810–21820. doi: 10.1074/jbc.M108787200. [DOI] [PubMed] [Google Scholar]
  71. Ravid T, Hochstrasser M. Diversity of degradation signals in the ubiquitin-proteasome system. Nature Reviews Molecular Cell Biology. 2008;9:679–689. doi: 10.1038/nrm2468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Rosenbaum JC, Fredrickson EK, Oeser ML, Garrett-Engele CM, Locke MN, Richardson LA, Nelson ZW, Hetrick ED, Milac TI, Gottschling DE, Gardner RG. Disorder targets misorder in nuclear quality control degradation: a disordered ubiquitin ligase directly recognizes its misfolded substrates. Molecular Cell. 2011;41:93–106. doi: 10.1016/j.molcel.2010.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Sachadyn P. Conservation and diversity of MutS proteins. Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis. 2010;694:20–30. doi: 10.1016/j.mrfmmm.2010.08.009. [DOI] [PubMed] [Google Scholar]
  74. Samant RS, Livingston CM, Sontag EM, Frydman J. Distinct proteostasis circuits cooperate in nuclear and cytoplasmic protein quality control. Nature. 2018;563:407–411. doi: 10.1038/s41586-018-0678-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Scheller R, Stein A, Nielsen SV, Marin FI, Gerdes AM, Di Marco M, Papaleo E, Lindorff-Larsen K, Hartmann-Petersen R. Toward mechanistic models for genotype-phenotype correlations in phenylketonuria using protein stability calculations. Human Mutation. 2019;40:444–457. doi: 10.1002/humu.23707. [DOI] [PubMed] [Google Scholar]
  76. Schymkowitz J, Borg J, Stricher F, Nys R, Rousseau F, Serrano L. The FoldX web server: an online force field. Nucleic Acids Research. 2005;33:W382–W388. doi: 10.1093/nar/gki387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Shimodaira H, Filosi N, Shibata H, Suzuki T, Radice P, Kanamaru R, Friend SH, Kolodner RD, Ishioka C. Functional analysis of human MLH1 mutations in Saccharomyces cerevisiae. Nature Genetics. 1998;19:384–389. doi: 10.1038/1277. [DOI] [PubMed] [Google Scholar]
  78. Sijmons RH, Hofstra RMW. Review: clinical aspects of hereditary DNA mismatch repair gene mutations. DNA Repair. 2016;38:155–162. doi: 10.1016/j.dnarep.2015.11.018. [DOI] [PubMed] [Google Scholar]
  79. Stein A, Fowler DM, Hartmann-Petersen R, Lindorff-Larsen K. Biophysical and mechanistic models for Disease-Causing protein variants. Trends in Biochemical Sciences. 2019;44:575–588. doi: 10.1016/j.tibs.2019.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Takahashi M, Shimodaira H, Andreutti-Zaugg C, Iggo R, Kolodner RD, Ishioka C. Functional analysis of human MLH1 variants using yeast and in vitro mismatch repair assays. Cancer Research. 2007;67:4595–4604. doi: 10.1158/0008-5472.CAN-06-3509. [DOI] [PubMed] [Google Scholar]
  81. Thompson BA, Spurdle AB, Plazzer JP, Greenblatt MS, Akagi K, Al-Mulla F, Bapat B, Bernstein I, Capellá G, den Dunnen JT, du Sart D, Fabre A, Farrell MP, Farrington SM, Frayling IM, Frebourg T, Goldgar DE, Heinen CD, Holinski-Feder E, Kohonen-Corish M, Robinson KL, Leung SY, Martins A, Moller P, Morak M, Nystrom M, Peltomaki P, Pineda M, Qi M, Ramesar R, Rasmussen LJ, Royer-Pokora B, Scott RJ, Sijmons R, Tavtigian SV, Tops CM, Weber T, Wijnen J, Woods MO, Macrae F, Genuardi M. Application of a 5-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants in the InSiGHT locus-specific database. Nature Genetics. 2014;46:107–115. doi: 10.1038/ng.2854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Tokuriki N, Tawfik DS. Stability effects of mutations and protein evolvability. Current Opinion in Structural Biology. 2009;19:596–604. doi: 10.1016/j.sbi.2009.08.003. [DOI] [PubMed] [Google Scholar]
  83. Tomer G, Buermeyer AB, Nguyen MM, Liskay RM. Contribution of human mlh1 and pms2 ATPase activities to DNA mismatch repair. Journal of Biological Chemistry. 2002;277:21801–21809. doi: 10.1074/jbc.M111342200. [DOI] [PubMed] [Google Scholar]
  84. van der Lee R, Lang B, Kruse K, Gsponer J, Sánchez de Groot N, Huynen MA, Matouschek A, Fuxreiter M, Babu MM. Intrinsically disordered segments affect protein half-life in the cell and during evolution. Cell Reports. 2014;8:1832–1844. doi: 10.1016/j.celrep.2014.07.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Vasen HF, Wijnen JT, Menko FH, Kleibeuker JH, Taal BG, Griffioen G, Nagengast FM, Meijers-Heijboer EH, Bertario L, Varesco L, Bisgaard ML, Mohr J, Fodde R, Khan PM. Cancer risk in families with hereditary nonpolyposis colorectal Cancer diagnosed by mutation analysis. Gastroenterology. 1996;110:1020–1027. doi: 10.1053/gast.1996.v110.pm8612988. [DOI] [PubMed] [Google Scholar]
  86. Vasen HF, de Vos Tot Nederveen Cappel WH. A hundred years of lynch syndrome research (1913-2013) Familial Cancer. 2013;12:141–142. doi: 10.1007/s10689-013-9654-7. [DOI] [PubMed] [Google Scholar]
  87. Wagih O, Galardini M, Busby BP, Memon D, Typas A, Beltrao P. A resource of variant effect predictions of single nucleotide variants in model organisms. Molecular Systems Biology. 2018;14:e8430. doi: 10.15252/msb.20188430. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Wu X, Platt JL, Cascalho M. Dimerization of MLH1 and PMS2 limits nuclear localization of MutLalpha. Molecular and Cellular Biology. 2003;23:3320–3328. doi: 10.1128/MCB.23.9.3320-3328.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Wu H, Zeng H, Lam R, Tempel W, Kerr ID, Min J. Structure of the human MLH1 N-terminus: implications for predisposition to lynch syndrome. Acta Crystallographica Section F Structural Biology Communications. 2015;71:981–985. doi: 10.1107/S2053230X15010183. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Yanagitani K, Juszkiewicz S, Hegde RS. UBE2O is a quality control factor for orphans of multiprotein complexes. Science. 2017;357:472–475. doi: 10.1126/science.aan0178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Yates A, Beal K, Keenan S, McLaren W, Pignatelli M, Ritchie GRS, Ruffier M, Taylor K, Vullo A, Flicek P. The Ensembl REST API: Ensembl Data for Any Language. Bioinformatics. 2015;31:143–145. doi: 10.1093/bioinformatics/btu613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Zimmermann L, Stephens A, Nam SZ, Rau D, Kübler J, Lozajic M, Gabler F, Söding J, Lupas AN, Alva V. A completely reimplemented MPI bioinformatics toolkit with a new HHpred server at its core. Journal of Molecular Biology. 2018;430:2237–2243. doi: 10.1016/j.jmb.2017.12.007. [DOI] [PubMed] [Google Scholar]

Decision letter

Editor: Sarel Jacob Fleishman1
Reviewed by: Sarel Jacob Fleishman2

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Acceptance summary:

This contribution presents a comprehensive computational-experimental analysis of mutations in the MLH1 gene, which is involved in DNA mismatch repair, and their relationship to protein stability, association with chaperones and cellular expression levels. The manuscript reports a large array of computational and experimental methods, including computational mutation scanning, structural analysis, evolutionary conservation, clinical relevance, cellular localization and expression levels, and chaperone association. Although the association between protein stability and disease has a very long history in molecular biology, the ability to separate potentially disease-causing mutations from benign ones based on energetics is important and the large-scale analysis reported here may in the future enable improved predictive and even diagnostic capabilities.

Decision letter after peer review:

Thank you for submitting your article "Computational and cellular studies reveal structural destabilization and degradation of MLH1 variants in Lynch syndrome" for consideration by eLife. Your article has been reviewed by three peer reviewers, including Sarel Jacob Fleishman as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by John Kuriyan as the Senior Editor.

The reviewers have discussed the reviews with one another, and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

Lynch syndrome is an inherited disorder, associated with a genetic predisposition to certain cancers. A major cause for the development of Lynch disease is the loss of function of the MLH1 protein, which is an essential component of the cellular defective mismatch repair (MMR) complex. In this manuscript, Abildgaard and co-workers used various computational, cellular and biochemical tools to test the link between structural destabilization of disease-linked variants of MLH1, their steady-state levels, and pathogenicity. The authors show that levels of MLH1 variants with lower thermodynamic folding stability (ΔΔG) are reduced, compared to the wild-type protein. This is largely due to their HSP70 chaperone-assisted degradation by the proteasome. Comparison of the prediction of structural changes, due to thermodynamic instability of missense MLH1 variants, to sequence-based tools currently employed in the clinic, indicate that the former is slightly better at distinguishing pathogenic and non-pathogenic variants. Although the association between protein stability and disease has a very long history in molecular biology, the ability to separate potentially disease-causing mutations from benign ones based on energetics is important, and the large-scale analysis reported here may in the future enable improved predictive and even diagnostic capabilities.

This is a well written, coherent study that highlights the potential of missense mutations to trigger protein destabilization, thereby to reduce MLH1 activity, resulting in a disease phenotype. Notably, a similar approach was previously taken by the same research group to explore the role of missense mutations in the destabilization of phenylalanine hydroxylase, leading to phenylketonuria. In the current manuscript, the authors delve deeper into the mechanism of MLH1 pathogenicity, further demonstrating the effects of unstable variants on the stability of PMS1 and PMS2, which co-operate with MLH1 in the MMR complex. The authors also identified HSP70 (but not HSP90) chaperones as key players in the degradation pathway. Altogether, these studies provide new and exciting insights about the mechanisms of certain pathogenies, which are based on structural instability, leading to reduced protein levels.

Essential revisions:

1) The authors linked structural instability, chaperone requirements and proteasomal degradation to reduced levels of pathogenic MLH1 variants. It is unclear, however, how local protein instability leads to proteasomal degradation. One obvious possibility is that major thermodynamic instability leads to protein misfolding. None of the tested variants, however, form notable inclusion bodies in the cell, which is the hallmark of protein misfolding diseases. Hence, in order to show a link between ∆∆G values, protein misfolding and proteasomal degradation the authors can test the fate MLH1 mutants when degradation is inhibited. This can be done by testing the formation of inclusion bodies (via fluorescence microscopy or protein solubility assays) upon proteasome/lysosome inhibition or upon inhibition of HSP70. The accumulation of protein aggregates will be indicative of protein misfolding, thus explaining the need for HSP70 chaperones. Also, in Figure 5E, F, changes in MLH1 steady-state levels upon chaperone inhibition should be quantified and compared to control no treatment as done for Figure 5A, C. Otherwise, it is difficult to discern the differences.

2) FoldX is used to predict ∆∆G values of individual mutations, but Figure 6D suggests that it provides only a minor benefit above standard evolutionary conservation calculations (AUC of 87 and 84%, respectively). The main advantage of evolutionary conservation analysis is that it is independent of structures which are often not available. The authors should comment on whether FoldX represents a sufficiently substantial improvement over evolutionary analysis to warrant the added complication of relying on a structure. Moreover, Figure 6E suggests that in fact the two analyses could be combined to provide a better analysis of harmful/benign mutations but there are no statistics provided for such a combination. Could the authors use logistic regression or similar to suggest cutoffs that use both terms (with appropriate statistical controls, such as cross-validation)? If this analysis is superior to FoldX alone, the message of the paper may need to change from stability to stability+conservation analysis.

3) Somewhat conflicting themes run through the paper. On the one hand, the authors note in several places that misfolding is likely to be an important consequence of destabilizing mutations; yet, in many places, the reference state for the mutations is the unfolded state (Figure 1C, Figure 7, and main text). Additionally, the authors note that above a predicted value of ∆∆G>3kcal/mol, mutations are harmful, finding this low value to be surprising (subsection “Thermodynamic stability calculations predict severely reduced MLH1 steady-state levels”, third paragraph). Have they considered whether the MLH1 protein is marginally stable, which would reconcile these observations. If the protein is not marginally stable, then its intrinsic stability would be predicted to buffer the effects of many of the single-point mutations. The best way to address this question is to show (if this has not been done before) a standard denaturation assay of the protein (temperature melt, for instance). It may also be beneficial to clearly state and illustrate (in the figures showing the reference states) that the misfolded states are likely to be the dominant reference for the mutants and that the predictive value of ∆∆G calculations is likely to be relevant to marginally stable proteins.

4) Additionally, the link that the authors claim between protein stability and cellular expression levels is not absolutely solid since there are numerous layers of control of protein expression in mammalian cells. As noted above, this link is likely to apply more to marginally stable proteins. For the benefits of the readers, it would be good if a more elaborate discussion of this relationship is presented already in the Introduction.

5) The authors should make it clearer that the approach and findings in this manuscript are quite similar to their previous paper on MSH2 (Nielsen et al., 2017) as this is not immediately apparent from the manuscript.

eLife. 2019 Nov 7;8:e49138. doi: 10.7554/eLife.49138.024

Author response


Essential revisions:

1) The authors linked structural instability, chaperone requirements and proteasomal degradation to reduced levels of pathogenic MLH1 variants. It is unclear, however, how local protein instability leads to proteasomal degradation. One obvious possibility is that major thermodynamic instability leads to protein misfolding. None of the tested variants, however, form notable inclusion bodies in the cell, which is the hallmark of protein misfolding diseases. Hence, in order to show a link between ∆∆G values, protein misfolding and proteasomal degradation the authors can test the fate MLH1 mutants when degradation is inhibited. This can be done by testing the formation of inclusion bodies (via fluorescence microscopy or protein solubility assays) upon proteasome/lysosome inhibition or upon inhibition of HSP70. The accumulation of protein aggregates will be indicative of protein misfolding, thus explaining the need for HSP70 chaperones. Also, in Figure 5E, F, changes in MLH1 steady-state levels upon chaperone inhibition should be quantified and compared to control no treatment as done for Figure 5A, C. Otherwise, it is difficult to discern the differences.

We tested the solubility of selected MLH1 variants with and without proteasome inhibitor by separating crude cell lysates into soluble and insoluble fractions by centrifugation. The results show that the destabilized variants are more insoluble than the stable variants, and bortezomib treatment appears to mainly result in accumulation of insoluble MLH1. In the revised manuscript, we discuss these results in the last paragraph of the subsection “Proteasomal degradation causes reduced steady-state levels of destabilized MLH1 variants” and include them as Figure 3—figure supplement 1. We find these additional data supports the notion that the degraded MLH1 variants are structurally destabilized, and that under normal conditions these are cleared by the protein quality control system (rather than forming substantial aggregates).

In addition, we added quantifications of the blots in Figure 5 (subsection “HSP70 is required for degradation of some destabilized MLH1 variants” and Figure 5 legend).

2) FoldX is used to predict ∆∆G values of individual mutations, but Figure 6D suggests that it provides only a minor benefit above standard evolutionary conservation calculations (AUC of 87 and 84%, respectively). The main advantage of evolutionary conservation analysis is that it is independent of structures which are often not available. The authors should comment on whether FoldX represents a sufficiently substantial improvement over evolutionary analysis to warrant the added complication of relying on a structure. Moreover, Figure 6E suggests that in fact the two analyses could be combined to provide a better analysis of harmful/benign mutations but there are no statistics provided for such a combination. Could the authors use logistic regression or similar to suggest cutoffs that use both terms (with appropriate statistical controls, such as cross-validation)? If this analysis is superior to FoldX alone, the message of the paper may need to change from stability to stability+conservation analysis.

The reviewer correctly asserts that the stability calculations by FoldX provide a similar prediction accuracy (AUC~0.82) to the sequence analysis (AUC~0.88) (which in this case takes both site conservation and context into account). Thus, from a purely predictive perspective, these two methods are comparable. We argue, however, that the stability calculations – because they focus on a single molecular effect (change in protein stability) – are useful to help support our observation of Lynch syndrome as a “misfolding/unfolding” disease. Thus, the strength lies in the mechanistic argument and not necessarily in the purely predictive value. Indeed, the two-dimensional analysis in Figure 6D shows that most non-functional variants (DME=0) lie in the upper-right quadrant suggesting that they are unstable and that this why the evolutionary analysis also find these to be likely pathogenic. This is the reason why we generally kept the two analyses (stability and conservation) separate – so that we can examine whether those variants that are predicted to be poor from a conservation analysis are so because of the lack of stability. In the revised manuscript we make this point clearer (Discussion). As suggested by the reviewers, we have also developed a combined predictor using logistic regression on FoldX and evolutionary analysis. As Figure 6F shows, performance of the combined model is above either of the individual predictors, though not by much (AUC~0.90). As we have also previously pointed out, and as the 2D plots illustrate (Figure 6D, E, G), most pathogenic and non-functional variants are both destabilized and rare in the alignment. We hypothesize that they are selected against because those variants would destabilize the protein. Nevertheless, a few pathogenic/non-functional variants are only identified by one of the predictors, and the linear regression model includes several of these from both predictors, thus gaining in overall performance. We tested performance of the regression model in a leave-one-out jackknife procedure, which correctly classified 85% of the respective data points when they had been left out of the training set (see subsection “Structural stability calculations for predicting pathogenic mutations”).

3) Somewhat conflicting themes run through the paper. On the one hand, the authors note in several places that misfolding is likely to be an important consequence of destabilizing mutations; yet, in many places, the reference state for the mutations is the unfolded state (Figure 1C, Figure 7, and main text). Additionally, the authors note that above a predicted value of ∆∆G>3kcal/mol, mutations are harmful, finding this low value to be surprising (subsection “Thermodynamic stability calculations predict severely reduced MLH1 steady-state levels”, third paragraph). Have they considered whether the MLH1 protein is marginally stable, which would reconcile these observations. If the protein is not marginally stable, then its intrinsic stability would be predicted to buffer the effects of many of the single-point mutations. The best way to address this question is to show (if this has not been done before) a standard denaturation assay of the protein (temperature melt, for instance). It may also be beneficial to clearly state and illustrate (in the figures showing the reference states) that the misfolded states are likely to be the dominant reference for the mutants and that the predictive value of ∆∆G calculations is likely to be relevant to marginally stable proteins.

We agree with the reviewer that this was somewhat confusing, and have attempted to be clearer about this issue in the revised manuscript. Due to the size of MLH1 (84.6 kDa) and its binding partner PMS2 (95.8 kDa) a careful in vitro analysis to obtain an absolute measure of MLH1’s thermodynamic stability would be rather difficult to perform. In the revised manuscript we therefore include new data where we have followed the solubility of wild-type MLH1 over a range of temperatures (Figure 2—figure supplement 1). These data clearly show, as the reviewers suspected, that MLH1 is somewhat less stable than the most abundant cellular proteins (Ponceau S stained) and GAPDH. Hence, it is possible that MLH1 is marginally stable. However, under normal conditions wild-type MLH1 is clearly not so unstable that it is rapidly degraded. Our analyses indicate that MLH1 has a half-life > 12 hours, which is in line with previous studies showing a t½ of about 40 hours in B-cells (Mathieson et al. 2018 Nat. Commun. 9:689). In yeast, wild-type MLH1 is also a stable protein, with a cellular stability similar to that of proteasome subunits, histones H3/H4 and certain ribosomal proteins (Christiano et al. 2014 Cell Rep. 9:1959-1965). We agree that stable proteins could buffer the effect of mutations at least in terms of in vitro stability, but a number of recent studies have shown that in many cases a sizable fraction of single amino acid substitutions may render an otherwise perfectly stable protein unstable in cells (e.g. Matreyek et al., 2018; Suiter et al., 2019, bioRxiv https://doi.org/10.1101/740837). Accordingly, it appears that the PQC system is tightly tuned to detect even minor or transient structural defects, which is supported by observations that in some cases the destabilized variants are even functional when the PQC system is blocked (e.g. Arlow et al., 2013). We have included a discussion of this point in the manuscript (Discussion). Since Figure 1B, C describe the ΔΔG predictions that refer to the unfolded state, we have not made any changes to Figure 1. However, in Figure 7, which summarizes the fate for MLH1 variants in the cell we have change “unfolded” to “non-native”, since we fully agree that none of the MLH1 variants are likely to fully unfold within a living cell (also discussed in the Discussion).

Regarding the last point, it remains unclear for which types of proteins these kinds of calculations will be useful to predict pathogenicity and cellular abundance, and to which extent the calculations of effects on global stability are correlated with more subtle conformational changes. We note, however, that we have made similar observations in other proteins (MSH2, phenylalanine hydroxylase, PTEN) that differ in size and likely intrinsic stability. We expect that while our stability calculations use the unfolded state as reference, these values will be correlated with effects on local stability effects that may underlie cellular degradation, though future studies that combine more detailed biophysical experiments are likely needed to answer this more quantitatively. In the revised manuscript we discuss this issue, and also draw parallels to related observations other proteins.

In addition, we have amended the text and Figure 7 so they more clearly reflect that the misfolded state is likely the dominant state for the destabilized MLH1 variants.

4) Additionally, the link that the authors claim between protein stability and cellular expression levels is not absolutely solid since there are numerous layers of control of protein expression in mammalian cells. As noted above, this link is likely to apply more to marginally stable proteins. For the benefits of the readers, it would be good if a more elaborate discussion of this relationship is presented already in the Introduction.

We agree that the situation in a human being will be substantially more complicated than in the cellular system we studied, and that the biophysical calculations only capture some of the effects seen in cells. Related to the specific point the reviewer makes, several of the effects from potential feedback mechanisms have purposely been removed to focus on a single effect. Specifically, we study protein levels from MLH1 expressed from a constitutive promotor, and under conditions where MLH1 function is not essential. Thus, we do not expect MLH1-specific feedback mechanisms to play a substantial role. Nevertheless, we agree that MLH1 misfolding is likely to lead to a cellular response including potentially induction of chaperones and other PQC components. Thus, the abundance levels and degradation rates we measure should be seen in this background, and thus report on the “effective stability” in the cell, which – as the reviewer suggests – may be different from the predicted thermodynamic effect. We now elaborate on this in the revised Discussion.

5) The authors should make it clearer that the approach and findings in this manuscript are quite similar to their previous paper on MSH2 (Nielsen et al., 2017) as this is not immediately apparent from the manuscript.

We now stress this point in the manuscript (Introduction and Discussion).

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Table 1—source data 1. MLH1 variants tested in this work.
    DOI: 10.7554/eLife.49138.004
    Figure 6—source data 1. Data for gnomAD variants.
    DOI: 10.7554/eLife.49138.016
    Figure 6—source data 2. Data for ClinVar variants.
    DOI: 10.7554/eLife.49138.017
    Supplementary file 1. Full matrix for FoldX stability predictions.
    elife-49138-supp1.xlsx (122.6KB, xlsx)
    DOI: 10.7554/eLife.49138.019
    Supplementary file 2. Full matrix for evolutionary sequence energies.
    elife-49138-supp2.xlsx (178KB, xlsx)
    DOI: 10.7554/eLife.49138.020
    Transparent reporting form
    DOI: 10.7554/eLife.49138.021

    Data Availability Statement

    All data generated or analysed during this study are included in the manuscript and supporting files.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES