Summary
To gain insight into the interplay of processes and species that maintain a correctly folded, functional proteome, we have developed a computational model called FoldEco. FoldEco models the cellular proteostasis network of the E. coli cytoplasm, including protein synthesis, degradation, aggregation, chaperone systems, and the folding characteristics of protein clients. We focused on E. coli because much of the needed input information – including mechanisms, rate parameters, and equilibrium coefficients – is available, largely from in vitro experiments; however, FoldEco will shed light on proteostasis in other organisms. FoldEco can generate hypotheses to guide the design of new experiments. Hypothesis generation leads to system-wide questions and shows how to convert these questions to experimentally measurable quantities, such as changes in protein concentrations with chaperone or protease levels, which can then be used to improve our current understanding of proteostasis and refine the model. A web version of FoldEco is available at http://foldeco.scripps.edu.
Introduction
As they fold to their native states, most proteins sample marginally stable misfolded states that are susceptible to self-association (Chiti and Dobson, 2006), which can turn the marginally stable misfolded monomers into stable aggregates, especially at high protein concentrations (Oosawa and Asakura, 1975). To minimize protein misfolding and aggregation, cellular pathways have evolved that either assist protein folding or degrade proteins that have failed to fold (Hartl et al., 2011; Powers et al., 2009). The assisted folding pathways feature chaperones; the degradation pathways feature proteases.
Biologically assisted folding and degradation enable organisms to maintain protein homeostasis, or proteostasis. Proteostasis is critical for the health of an organism. The truth of this statement is demonstrated by the consequences of failures of proteostasis. Deleting chaperone genes in E. coli results in, at best, sensitivity to stresses that affect protein folding (e.g., heat), and at worst, organisms that are not viable (Deuerling and Bukau, 2004). In higher organisms, failure of proteostasis leads to loss-of-function diseases, such as cystic fibrosis, when too little native protein is produced or gain-of-toxicity diseases, such as Alzheimer’s disease, when too much aggregated protein is produced (Chiti and Dobson, 2006; Powers et al., 2009).
Understanding proteostasis requires two steps. The first step is reductionist: the mechanisms of assisted folding pathways must be understood at a molecular level. The second step is holistic: the cooperation and competition among the assisted folding pathways, the “proteostasis network” (Powers et al., 2009), must be understood when they are present and operating simultaneously. Neither of these steps can be accomplished using intuition alone; mathematical models are necessary to make sense of experimental data and to design incisive experiments (Hu et al., 2006; Jewett and Shea, 2008; Tehver and Thirumalai, 2008; Wiseman et al., 2007).
To create a comprehensive, yet detailed, model for proteostasis we focused on a simple organism: E. coli. E. coli have only one major Hsp70-based system, consisting of DnaK, the Hsp40 co-chaperone DnaJ, and the nucleotide exchange factor GrpE (Genevaux et al., 2007). They also have only one chaperonin system, consisting of GroEL and its co-chaperone GroES (Horwich et al., 2006), and one disaggregation system, in which the Hsp104-type chaperone ClpB cooperates with the DnaK/DnaJ/GrpE system (Mogk et al., 2003). A wealth of available information on the biochemistry of these chaperones enabled us to build a model for proteostasis in E. coli. Here we describe the resulting model, called ‘FoldEco’, and illustrate how it can be used to generate hypotheses about proteostasis in E. coli. We also introduce a publically-accessible, web-based version of FoldEco at http://foldeco.scripps.edu. We offer this model to the scientific community as an organism-based paradigm for the interplay of species and processes in proteostasis networks generally.
Results and Discussion
The FoldEco model
FoldEco (Figure 1, Figure S1) tracks what happens to soluble proteins of interest (“client proteins”) as they are produced in the E. coli cytosol (membrane and periplasmic proteins are currently not included in FoldEco; see Extended Experimental Procedures). FoldEco has five systems:
Protein synthesis and folding. The synthesis of a client protein (protein “i”) in FoldEco is represented as a simplified three-step process (see Extended Experimental Procedures). It begins with ribosomes (R) becoming translationally active (Ra,i). The protein then emerges from the ribosomal exit tunnel and the ribosome:nascent protein complex is formed (Ra:Ui). Finally, the ribosome releases the unfolded client protein (Ui) when synthesis is complete. A trigger factor monomer, T, can bind to the ribosome or ribosome:nascent protein complex yielding the species R:T, Ra,i:T, or Ra:Ui:T (Deuerling and Bukau, 2004). The ribosome releases the unfolded client protein free (Ui) or in complex with trigger factor (T:Ui). The unfolded client protein reversibly folds to the native state (Ni) or reversibly misfolds to form an off-pathway intermediate (Mi). Mi kinetically partitions between self-association by nucleated polymerization (Oosawa and Asakura, 1975) to form aggregates (Ai,j, where there are j monomers in the aggregate) or chaperone binding. We have not incorporated the possibility of co-translational folding or misfolding (see Extended Experimental Procedures).
The DnaK/DnaJ/GrpE (KJE) system. The role of the KJE system in proteostasis is to bind misfolded proteins and unfold them, thereby giving them another chance to fold (Mayer and Bukau, 2005). In FoldEco, the mechanism of the KJE system begins with unfolded or misfolded protein binding to dimeric DnaJ (J2) or ATP-bound DnaK (KT), forming binary complexes (J2:Ui/Mi or KT:Ui/Mi, where the “/” indicates “either-or”). J2:Ui/Mi or KT:Ui/Mi can then add KT or J2, respectively, to form the ternary complex KT:Ui/Mi:J2. However, KT binds weakly to substrates (Gisler et al., 1998; Mayer et al., 2000), so binding to J2 is preferred. J2 binding accelerates ATP hydrolysis by DnaK in the KT:Ui/Mi:J2 complex, which causes a conformational change within DnaK. This conformational change is propagated to the bound client protein, causing it to unfold. Thus, both KT:Ui:J2 and KT:Mi:J2 produce KD:Ui:J2 upon ATP hydrolysis (KD = ADP-bound DnaK). KD:Ui:J2 then releases J2 to give KD:Ui (KD:Ui can also be produced by slow ATP hydrolysis in KT:Ui/Mi). Nucleotide exchange in DnaK is induced by binding to dimeric GrpE (E2), producing KT:Ui:E2. Dissociation of E2 yields KT:Ui, which can dissociate to yield free unfolded protein, Ui. Note that because either U or M can enter the KJE cycle, and U is released, this chaperone team provides a conduit from M to U. Note also that KT:Ui can also unproductively re-enter the KJE cycle by binding to J2, depending on the relative rates of substrate dissociation and J2 binding. The total number of KJE cycles that a particular client protein experiences before it folds is determined by the relative rates of entry into the KJE cycle and folding. Our model for the KJE cycle is in part based on that of Hu and co-workers (Hu et al., 2006).
The GroEL/GroES (GroELS) system. The role of the GroELS system in proteostasis is to encapsulate client proteins, thereby providing them with an isolated environment in which to fold (Horwich et al., 2006). In FoldEco the GroELS cycle begins when an unfolded or misfolded client protein binds to the cis ring of GroEL, which is either ATP-bound (GrLT) or not (GrL), thus yielding GrL:Ui/Mi or GrLT:Ui/Mi. Therefore, client proteins can bind to either the ATP-free or ATP-bound state of GroEL, but there should be very little of the former at typical in vivo ATP concentrations (Tyagi et al., 2009). GrL:Ui and GrL:Mi are both converted to GrLT:Ui by ATP-binding-induced forced unfolding of the bound client protein (Lin et al., 2008). GrLT:Ui/Mi is then capped by GroES (GrS), which releases the client protein into the cis cavity (GrLT:Ui/Mi:GrS), where it can fold or, in principle, misfold, to give GrLT:Ui/Mi/Ni:GrS. In FoldEco, the rate of folding of a client protein in the GroEL cavity can be the same as it is in solution or different, as desired by the user; both circumstances have been reported (Apetri and Horwich, 2008; Chakraborty et al., 2010), and both are therefore allowed in FoldEco. ATP hydrolysis in the cis ring yields GrLD:Ui/Mi/Ni:GrS, which can follow one of two pathways. In the first, ATP binding in the trans ring results in GrLD:Ui/Mi/Ni:GrS::GrLT and release of ADP, GrS, and the client protein from the cis ring. In the second pathway, binding of unfolded or misfolded client protein (Uk or Mk; the subscript k indicates that it could be a different type of client protein from the one in the cis cavity) in the trans ring results in GrLD:Ui/Mi/Ni:Grs::GrL:Uk/Mk, which then binds ATP in the trans ring, stimulating release of ADP, GrS, and client protein from the cis ring. The trans ring is left in the GrLT:Uk state, ready to re-enter the cycle. The partitioning between the first and second pathways depends on the rates of binding of ATP vs. substrate. As with the KJE cycle, the total number of GroELS cycles that a particular client protein experiences before it folds is determined by the relative rates of entry into the GroELS cycle and folding. Our model of the GroELS cycle is based in part on those of Tehver and Thirumalai (Tehver and Thirumalai, 2008) and Jewett and Shea (Jewett and Shea, 2008).
The ClpB/DnaK/DnaJ/GrpE (B+KJE) disaggregation system. Disaggregation is critical for the recovery of E. coli from heat shock (Mogk et al., 2003). This function can be performed for small aggregates by the KJE cycle (Diamant et al., 2000), but larger aggregates require the AAA+ ATPase ClpB, ‘B’ for short (Mogk et al., 2003; Weibezahn et al., 2004). We based the mechanism of the B+KJE cycle in FoldEco on several studies that suggest that the KJE system prepares aggregates for B (Acebrón et al., 2008; Acebrón et al., 2009; Doyle et al., 2007), which then removes monomers from aggregates. Thus, client protein aggregates of size j (Ai,j) bind first to J2 or KT (we have assumed a 1:1 stoichiometry of chaperone:aggregate in these complexes; see Extended Experimental Procedures). The resulting complexes (J2:Ai,j or KT:Ai,j) then bind to KT or J2, respectively, to form the ternary complex KT:Ai,j:J2. ATP hydrolysis produces KD:Ai,j*:J2, where the “*” indicates that the aggregate is now prepared for ClpB binding. Release of J2 yields KD:Ai,j*, which has two possible fates. First, it can bind to E2, which induces nucleotide exchange to give KT:Ai,j:E2. For small oligomers (j ≤ 4 in FoldEco), a monomer is lost and E2 dissociates to give KT:Ai,j-1 and a free monomer, Ui (Diamant et al., 2000). For larger aggregates, E2 dissociates and KT:Ai,j is produced without monomer loss, consistent with the observation that the KJE system on its own cannot disperse large aggregates (Diamant et al., 2000). KT:Ai,j-1 and KT:Ai,j can then either re-enter the cycle or dissociate to give a free aggregate and KT. Second, KD:Ai,j* can bind to ClpB (B) to form KD:Ai,j*:B. E2 binding followed by nucleotide exchange gives E2:KT:Ai,j*:B, from which E2 and KT dissociate to give Ai,j*:B. Finally, a monomer is translocated through the central pore of B (Weibezahn et al., 2004) and the complex dissociates giving Ai,j-1, Ui, and B.
Protein degradation. The most important proteases for proteostasis in E. coli are the energy-dependent proteases (Gottesman, 2003). FoldEco incorporates two degradation pathways. The first features Lon, a protease that degrades unfolded and misfolded client proteins (Gottesman, 2003). This pathway begins with Lon and Ui or Mi associating to form the reversible complex Lon:Ui/Mi. The bound substrate is then transferred to the proteolytic chamber (with concurrent forced unfolding, if the substrate is misfolded), to form Lon:Ui*, where the “*” indicates that the substrate is now committed to degradation. Degradation is processive, with the ATPase domains of Lon feeding the substrate into the proteolytic chamber. In the second pathway ClpAP-type proteases (Dn) degrade native proteins (Ni) that have been tagged with a degradation signal, or degron (Gottesman, 2003; Varshavsky, 2011). This pathway begins with Dn binding to Ni, perhaps after delivery by an adapter protein like ClpS (Erbse et al., 2006), to form the reversible complex Dn:Ni. The bound client protein is then forcibly unfolded and transferred to the proteolytic chamber, yielding Dn:Ui*. The substrate is degraded processively as described above. Note that degron installation, adaptor protein binding, and delivery of the substrate to the protease are subsumed into the binding step in FoldEco (see Extended Experimental Procedures).
Figure 1. The FoldEco model for protein folding in E. coli.
The five subsystems in FoldEco are: protein synthesis and folding (light red), the DnaK/DnaJ/GrpE chaperone system (light blue), the GroEL/GroES chaperone system (light green), the ClpB+DnaK/DnaJ/GrpE disaggregation system (light yellow), and protein degradation (light purple). Also included are several client-free processes, like DnaJ dimerization (gray). The free unfolded (Ui), misfolded (Mi), native (Ni), and aggregated (Ai,j) states of the client protein are shown in large bold font. Complexes between proteostasis network components and the client protein are denoted by abbreviations for the species in the complex (see key in upper right) separated by colons. When the states of the cis and trans cavities of GroEL are both defined, they are separated by two colons. The subscripts “T” and “D” refer to the ATP- and ADP-bound states of chaperones. See also Figure S1.
FoldEco is implemented by writing the differential equations that describe the time-dependent concentrations of each species in the model. Solving this system of equations requires the model parameters (the rate constants and initial concentrations) to have numerical values. Fortunately, the rate constants for virtually every step in FoldEco can be estimated from the literature (Figure S1). Note that parameters such as chaperone-substrate binding affinity and on/off rates are protein-specific. The default parameters that we have used here and on the FoldEco website were measured for model proteins and can be changed by users on the input page of the FoldEco website. Literature estimates of the initial concentrations of the proteostasis network components are also available (Table S1). Parameters for the folding of a particular client protein can be estimated from literature data, experiments, or models, for example, that of Ghosh and Dill for protein stability (Ghosh and Dill, 2010; Ghosh and Dill, 2009), or those of Plaxco et al. (Plaxco et al., 1998) or Ouyang and Liang (Ouyang and Liang, 2008) for folding rates. The differential equations are then solved numerically, and the time-dependent concentrations of all species are the output.
Any model of biological networks must by necessity contain some approximations. Thus, FoldEco in its present form does not account for the effect of bacterial growth on proteostasis, stress responses, or possible changes in ATP levels. These and other approximations and their potential effects on the model are discussed in the Extended Experimental Procedures.
Models like FoldEco are powerful tools for rationalizing experimental observations and generating hypotheses. We describe in the following sections some illustrative examples of findings and predictions derived from our own exploration of FoldEco. We have deployed a web version of FoldEco (described in the final section) to allow others to use it for their own purposes.
The Importance of Synthesis and Degradation Rates to Protein Aggregation
An important use for any model of proteostasis is to characterize the circumstances under which a client protein aggregates. Aggregation is possible when the client protein’s aggregation-prone intermediate (in FoldEco, the Mi state) has a concentration higher than its critical concentration for aggregation, or Ccrit (Oosawa and Asakura, 1975). The concentration of Mi, in turn, is determined by the balance between the rates of protein synthesis and of degradation by Lon, and the folding and misfolding energetics (see Extended Experimental Procedures).
To illustrate the effect of synthesis rate and Lon concentration on protein aggregation, we used FoldEco to examine the behavior of a model client protein in the non-physiological, but nevertheless instructive situation of being in the absence of any chaperones. The model protein was assigned a moderately aggregation-prone “biophysical profile”, which includes (see Figures 1 and S1): the folding rate and equilibrium constants (kf and Kf), the misfolding rate and equilibrium constants (km and Km), the monomer-aggregate association rate constant (ka), and the critical concentration for aggregation (Ccrit). Thus, the model protein was given moderately favorable misfolding parameters (km and Km = 1 s−1 and 10, respectively), more favorable folding parameters (kf and Kf = 1 s−1 and 1000, respectively), moderately fast aggregation kinetics (ka = 0.1 µM−1 s−1), and a strong aggregation propensity (Ccrit = 0.1 µM). The chosen value of Ccrit is comparable to the in vitro critical concentration of Aβ42, the 42-residue-long and most aggregation-prone of the major isoforms of the Aβ peptide, the aggregation of which is implicated in Alzheimer’s disease (Usui et al., 2009). The other parameters used in these simulations are listed in Table S2. FoldEco was solved for this model protein at synthesis rates varying from 0.01 µM s−1 to 1 µM s−1. The upper end of this range is roughly equivalent to the protein synthesis rate in bacteria with a dividing time of about 2 h (Bremer and Dennis, 1987). At each synthesis rate, we determined the minimum concentration of Lon hexamer that would suppress aggregation to below 5% of total protein after 10,000 s of simulation time (about 2.75 hours; Figure 2A, black curve). This length of simulation time was arbitrarily chosen, but qualitatively similar results are obtained at different times and at steady state (Figure S2A). The time required to reach steady state appears to depend primarily on the folding energetics (kf and Kf) and degradation rates (see Extended Experimental Procedures).
Figure 2. Aggregation and the balance between synthesis and degradation.
(A) The black curve represents the minimum Lon concentration, or [Lon], required to suppress aggregation to <5% of total protein after 10,000 s of simulation time for an aggregation-prone model protein as a function of synthesis rate in the absence of chaperones. Below the curve (gray), there is substantial aggregation; above the curve (white), aggregation is largely suppressed. The synthesis rates that can be tolerated at the normal and heat shock Lon concentrations are indicated by dashed lines. The concentrations (µM) of native protein present at 10,000 s for points along the black curve are indicated in blue font. The fractions of the total synthesized protein that have been degraded at 10,000 s at the same points are indicated in red font. (B) As (A) but with a full complement of chaperones. The parameters used in this simulation are listed in Table S2. See also Figure S2.
These simulations show the importance of synthesis and degradation rates to protein aggregation. The typical concentration of Lon hexamers in E. coli is about 0.3 µM (Table S1), which is the minimum concentration needed to suppress aggregation at the relatively slow synthesis rate of 0.019 µM s−1 (Figure 2A). Higher synthesis rates for this aggregation-prone model protein require higher concentrations of Lon (or the introduction of chaperones; see below). Heat shock can increase the concentration of Lon by up to four-fold (Zhao et al., 2005); a four-fold increase in Lon concentration would increase the synthesis rate that can be tolerated to 0.074 µM s−1 (Figure 2A). Expression from a plasmid can give even higher concentrations of Lon, which would enable higher synthesis rates.
Higher synthesis and degradation rates may be desirable for protein production because they lead to higher concentrations of native protein (Figure 2A, blue numbers). For example, at a synthesis rate of 0.01 µM s−1 and Lon hexamer concentration of 0.12 µM (the minimum required to suppress aggregation), the native state concentration of our model protein at 10,000 s is 41 µM. Increasing the synthesis rate ten-fold to 0.1 µM s−1 and the Lon hexamer concentration to 1.6 µM gives a native protein concentration of 112 µM at 10,000 s. Analyzing the kinetics of protein accumulation (Figure S2B) and a simplified model for protein synthesis and degradation (Figure S2C) reveals that this increase in the native protein concentration arises not from the higher synthesis rate, but rather from the suppression of misfolding by degradation, which increases the concentration of the unfolded state during protein synthesis, and thereby increases the rate of production of the native state (see Extended Experimental Procedures).
The strategy of jointly increasing the synthesis rate and Lon concentration to increase native protein concentration comes at a cost to the cell. Our simulations suggest that although the native protein concentration increases as the synthesis and degradation rates increase, the fraction of protein that is degraded increases as well (Figure 2A, red numbers). Thus, while our model protein can be kept soluble when the synthesis rate is as high as 1 µM s−1, very high Lon concentrations (13 µM; Figure 2A) are required to do so. In these circumstances, 94% of the total synthesized protein becomes degraded. The enormous metabolic cost of so much wasted protein synthesis would be acceptable to an experimenter expressing a protein in E. coli, but would be a hindrance to an organism in its natural setting (Stoebel et al., 2008).
Proteostasis can be efficiently maintained at considerably lower metabolic cost by using chaperones to minimize aggregation. To demonstrate this notion, we repeated the simulations described above with a full complement of chaperones (20 µM trigger factor, 30 µM DnaK, 1 µM DnaJ, 15 µM GrpE, 3 µM GroEL tetradecamers, 5 µM GroES heptamers, 0.3 µM ClpB hexamers; Table S1). At physiological concentrations of Lon (0.3 µM), chaperones permit synthesis rates up to almost 0.3 µM s−1, about 15-fold higher than in the absence of chaperones, and yield much higher concentrations of native protein (Figure 2B). Higher synthesis rates, however, require sharply increasing concentrations of Lon to suppress aggregation. Thus, chaperones do not entirely obviate the need for degradation during the expression of aggregation-prone proteins.
Misfolding-Prone Proteins Benefit the Most from the KJE System
In addition to enhancing our understanding of in vivo protein aggregation, models for proteostasis should also be useful for revealing how client proteins with particular biophysical profiles benefit from particular chaperoning mechanisms. To find client proteins that benefit strongly from the KJE system, we generated 4,000 random biophysical profiles, input each of them into FoldEco, and solved the model with a modest synthesis rate (0.02 µM s−1), typical concentrations of Lon (0.3 µM hexamer) and trigger factor (20 µM; Table S1), and with or without the KJE system (30 µM DnaK, 1 µM DnaJ, and 15 µM GrpE; Table S1). The GroELS system and ClpB were excluded from the simulations in order to yield a focused view of profiles for which the KJE system alone influenced client outcome. While such conditions cannot exist in reality, since GroEL is an essential protein in E. coli, isolating the KJE system like this is the best way to understand its client preferences and overall capacity to maintain proteostasis.
To generate the random biophysical profiles, kf and km were varied between 10−3 and 103 s−1; ka was varied between 10−3 and 103 µM−1 s−1; Kf was varied between 102 and 108; Km varied between 10−3 and 103; and Ccrit was varied between 1 mM and 1 nM (Table S3). Where possible, the bounds on these variables were based on available protein folding data (see Extended Experimental Procedures). The parameters used for the association between client proteins and DnaK and DnaJ2 were those measured for the heat shock transcription factor σ32, a known substrate of the KJE system (Gamer et al., 1996; Mayer et al., 2000). The other parameters used in these simulations are listed in Table S4.
We examined the biophysical profiles of the client proteins with the largest (top 5%) increases in the concentration of the native state at an arbitrarily chosen time point (10,000 s) upon adding the KJE system (Table S3). The greatest increases varied from about +50 to +139 µM, which are substantial enhancements given that 200 µM of protein was synthesized. The outstanding feature of the biophysical profiles of the top KJE substrates is their uniformly high misfolding propensities (Figure 3A). This feature shows that client proteins that benefit most from the KJE system have a strong tendency to misfold. By pumping client proteins from the Mi state, which cannot fold, to the Ui state, which can, the KJE system promotes folding and rescues client proteins from aggregation or degradation. Which of these two fates a client protein is rescued from depends on its aggregation propensity, as measured by Ccrit. At a synthesis rate of 0.02 µM s−1, client proteins need an exceptionally strong aggregation propensity (low Ccrit value) to aggregate because so little protein is produced: the median Ccrit value for proteins that aggregated more than 5% (10 µM) in the absence of the KJE system was 7.4 nM (Figure S3A). Thus, only 25% of the top KJE substrates are rescued primarily from aggregation (Table S3). The rest are rescued from degradation.
Figure 3. Biophysical profiles for client proteins that benefit the most from the KJE and GroELS systems at a low synthesis rate.
(A) Medians (red points) and interquartile ranges (black lines) for the misfolding, folding, and aggregation parameters of the best substrates for the KJE system. These substrates were identified from a pool of 4,000 client proteins with randomly-generated parameters as having the largest (top 5%) KJE-system-mediated increases in native state concentration after 10,000 s at a synthesis rate of 0.02 µM s−1. The median values of the parameters for the underlying pool of 4,000 randomly-generated client proteins are shown as horizontal dashed lines. The parameters represented are: km = misfolding rate constant; Km = misfolding equilibrium constant; kf = folding rate constant; Kf = folding equilibrium constant; ka = monomer-aggregate association rate constant; Ccrit = critical concentration of misfolded protein required for aggregation. Note the inverted scale for Ccrit. (B) As (A), but for the top GroELS system substrates. (C) Increase in native state concentration, [N], due to the simultaneous presence of the KJE and GroELS systems (the “combined effect”; y axis) plotted against the sum of the increases due to the individual presence of either the KJE or the GroELS system (the “summed effects”; x axis). Region 1 contains the 415 cases for which the combined effects are less than the summed effects; region 2 contains the 161 cases for which the combined and summed effects are roughly equal; and region 3 contains the 81 cases for which the combined effects are greater than the summed effects. Red open circles: cases in which the KJE or GroELS systems by themselves almost completely rescue the client protein. Blue open circles: cases in which neither system by itself completely rescues the client protein. (D) As (A), but for the substrates in region 3 of (C), for which the combined effects of the KJE and GroELS systems are greater than the summed effects. Table S3 lists all of the 4,000 randomly generated biophysical profiles, and the effects of adding the KJE, GroELS, or both systems at a synthesis rate of 0.02 µM s−1. Table S4 lists all of the other parameters used. See also Figure S3.
Slow Folding Proteins Benefit the Most from the GroELS System
To find the biophysical profiles that benefit the most from the GroELS system, we repeated the simulation experiment described above with or without the GroELS system (3 µM GroEL tetradecamers, 5 µM GroES heptamers; Table S1). The KJE system and ClpB were excluded in order to focus on the impact of the GroELS system, and the folding and misfolding parameters were assumed to be the same inside and outside the GroEL cavity. Binding parameters for typical GroELS system substrates were used (Figure S1C, Table S4). At a synthesis rate of 0.02 µM s−1, the top 5% of increases in the concentration of the native state upon adding the GroELS system varied from +102 to +153 µM (Table S3). In addition, the native state concentration increased by more than 10 µM for 2,223 of the 4,000 randomly generated client proteins when the GroELS system was added, but for only 737 client proteins when the KJE system was added (Table S3). This result demonstrates that the GroELS system is beneficial to a broader range of our randomly generated client proteins than the KJE system at this synthesis rate. The efficiency of the GroELS system is especially remarkable considering that we assumed that the folding and misfolding rates in the GroEL cavity were the same as they are in solution. If faster folding rates inside the cavity had been used, the effect of the GroELS system on native state concentration would presumably have been even larger.
A striking feature of the top GroELS substrates is their low and narrowly distributed folding rate constants (Figure 3B). This rate selectivity arises from two factors: fast-folding client proteins (kf > 0.1 s−1) tend not to need chaperone assistance, while very slow-folding client proteins (kf < 0.01 s−1) are not protected by GroEL for long enough to fold, and are vulnerable to degradation after they are released. The optimal folding rate for GroEL substrates is determined by the rate constant for ATP hydrolysis by GroEL, which determines how long substrates remain encapsulated. In FoldEco, the rate constant for this process is set to 0.1 s−1 (corresponding to a half-life of about 7 s), as generally found experimentally (Ranson et al., 1997; Rye et al., 1999). In principle, the optimal folding rate constant for GroELS substrates could be changed by changing the ATP hydrolysis rate constant. However, if this rate constant is too low, the GroELS system retains substrates for so long that it quickly becomes saturated; if this rate constant is too high, the GroELS system does not retain slow-folding client proteins long enough for them to fold appreciably and, as noted above, fast-folding substrates have no need of the GroELS system. The ATP hydrolysis rate constant of GroEL appears to have evolved to an optimum.
The other noteworthy feature of the biophysical profiles of the top GroELS substrates is their low misfolding propensities (Figure 3B). In fact, only 12% of the top GroELS substrates are rescued from aggregation compared with 25% of the top KJE substrates (Table S3). The GroELS system is apparently not optimized to correct misfolding. It is important to note, however, that while misfolding-prone proteins may not be among the top GroELS substrates, they nevertheless benefit substantially from the GroELS system. For example, of the 737 proteins whose native state concentration increases by > 10 µM upon adding the KJE system, 657 (89%) benefit similarly upon addition of the GroELS system. This observation suggests, and experiments have shown (Vorderwulbecke et al., 2004), that the GroELS system can complement the loss of the KJE system, at least partially.
Cooperation between the KJE and GroELS systems
To determine whether the biophysical profiles that benefit the most from the KJE or GroELS systems are different when one system is introduced in the presence of the other than when they are introduced individually, we repeated the simulation experiment described above with both systems present. The results obtained when one system was added in the presence of the other were similar to those obtained when each was added individually. The top 5% of the increases in the concentration of the native state upon adding the KJE system in the presence of the GroELS system ranged from +36 to +122 µM, and the native state concentration increased by more than 10 µM for 572 of the 4,000 client proteins. The top 5% of the increases in the concentration of the native state upon adding the GroELS system in the presence of the KJE system ranged from +101 to +156 µM, and the native state concentration increased by more than 10 µM for 2,013 of the 4,000 client proteins. The biophysical profiles of the top KJE and GroELS substrates were nearly identical in the presence (Figure S3B and S3C, respectively) and absence (Figure 3A and 3B, respectively) of the other system.
The observations above suggest that the activities of the KJE and GroELS systems are independent and complementary. The extent to which this idea is true is demonstrated by Figure 3C. As noted above, 657 of the 4,000 biophysical profiles studied had increases in native state concentration upon adding the KJE system or GroELS system > 10 µM. For this subset, we have plotted in Figure 3C the increase in native state concentration due to the simultaneous presence of the KJE and GroELS systems (the “combined effect”; y axis) against the sum of the increases due to the individual presence of either the KJE or the GroELS system (the “summed effects”; x axis). This plot can be split into three regions: region 1 contains the 415 cases for which the combined effects are less than the summed effects; region 2 contains the 161 cases for which the combined and summed effects are roughly equal; and region 3 contains the 81 cases for which the combined effects are greater than the summed effects.
The cases in region 2 are perhaps the easiest to understand: the effects of the KJE and GroELS systems are simply independent and additive. In contrast, the KJE and GroELS systems appear to interfere with each other to some extent for the cases in region 1. This is partly due to an artifact, as 247 of the cases in region 1 (60%) are there because the KJE or GroELS systems by themselves completely rescue the client protein (red open circles). In these cases, a second chaperone system will always be superfluous. For the remaining 168 cases in region 1, the KJE and GroELS systems are partially redundant (blue open circles).
For the cases in region 3, the KJE and GroELS systems are synergistic. The biophysical profiles of these cases (Figure 3D) show that they combine the features of the top KJE and GroELS substrates: they have high propensities to misfold (high Km) and slow folding rates (low kf). For such substrates, the ability of the KJE system to recover protein from the misfolded state apparently increases the input into the GroELS system, which then increases the flux of protein into the native state. The cases in regions 1 (Figure S3D) and 2 (Figure S3E) have lower misfolding propensities and faster folding rates, so that the KJE and GroELS systems are less able to amplify each other’s effects.
The Effect of Increasing the Synthesis Rate on the Biophysical Profiles of the Top KJE and GroELS Substrates
We probed the effect of a higher synthesis rate on the biophysical profiles of the top KJE and GroELS substrates by repeating the simulation experiments described above with a ten-fold higher synthesis rate (0.2 µM s−1). The top 5% of the increases in the concentration of the native state at 10,000 s upon adding the KJE system by itself or in the presence of the GroELS system are proportionately higher than they were at the lower synthesis rate, ranging from +643 to +1526 µM without the GroELS system and from +512 to +1,507 µM with the GroELS system (2,000 µM of protein was synthesized; Table S5). The biophysical profiles of the top KJE substrates at this synthesis rate (Figure 4A) are nearly identical with and without GroELS (Figure 4A and Figure S4A) and are both similar to those of the top KJE substrates at the lower synthesis rate, but with a new feature. The misfolding propensities of the top KJE substrates are still high, but now their aggregation propensities are high as well (Figure 4A). Thus, aggregation appears to be more important at this synthesis rate than at the lower synthesis rate. Indeed, all of the top KJE substrates are rescued from aggregation rather than degradation at this synthesis rate. Rescue from aggregation is not guaranteed by the presence of the KJE system, however. The aggregation rate constants for the top KJE substrates tend to be low (Figure 4A), suggesting that fast aggregation interferes with the KJE system.
Figure 4. Biophysical profiles for client proteins that benefit the most from the KJE and GroELS systems at a higher synthesis rate.
(A) As Figure 3A: top KJE substrates, but using a 10-fold higher synthesis rate (0.2 µM s−1) in the simulation. (B) As Figure 3B: top GroELS substrates, but using a 10-fold higher synthesis rate (0.2 µM s−1) in the simulation. (C) As Figure 3C, but using a 10-fold higher synthesis rate (0.2 µM s−1) in the simulation. Table S5 lists all of the 4,000 randomly generated biophysical profiles, and the effects of adding the KJE, GroELS, or both systems at a synthesis rate of 0.2 µM s−1. See also Figure S4.
At the higher synthesis rate, the top 5% of the increases in the concentration of the free native state upon adding the GroELS system ranged from +279 to +1050 µM without the KJE system and +135 to +1,119 with the KJE system (Table S5). Unlike the situation with the KJE system, the biophysical profiles of the top GroELS substrates in the absence (Figure 4B) or presence (Figure S4B) of the KJE system are different at this synthesis rate than at the lower synthesis rate. The folding rate constants are not as low as they were before for the top GroELS substrates. Instead, all of the parameters associated with misfolding and aggregation (km, Km, ka, and Ccrit) are high, indicating that the top beneficiaries of the GroELS system at higher synthesis rates are misfolding and aggregation prone rather than slow folding. Consistent with this observation, all of the top GroELS substrates are rescued primarily from aggregation.
These observations suggest that aggregation rather than degradation becomes the dominant problem in proteostasis as the synthesis rate increases, and the KJE and GroELS systems are both capable of rescuing proteins from misfolding and aggregation. This assertion is demonstrated by the concentration of the native state increasing by more than 100 µM for 700 of the 4,000 randomly generated client proteins upon adding the KJE system by itself (and for 554 client proteins when it is added in the presence of the GroELS system), and for 545 client proteins upon adding the GroELS system by itself (and for 288 client protein when it is added in the presence of the KJE system).
The ability of the KJE and GroELS systems to cooperate is illustrated in Figure 4C, which is a plot of the combined vs. summed effects of the KJE and GroELS systems for the 433 cases for which the KJE and GroELS systems each individually increase the concentration of the native state by >100 µM. We split this plot into three regions, as in Figure 3C. Also as in Figure 3C, many of the cases in region 1 (142 out of 273; red open circles) are there because they are almost completely rescued by either the KJE or the GroELS system by itself. Perhaps the most notable feature of Figure 4C is that region 3 is almost completely unpopulated. The effects of the KJE and GroELS systems are at best additive at this synthesis rate, probably because the similarity of the biophysical profiles of their top substrates leaves little room for them to complement the each other’s activity.
Testing Predictions from FoldEco
Here we examine the hypotheses suggested by the simulations in the preceding sections in the light of experimental data from the literature. The first is that aggregation can be suppressed by decreasing the synthesis rate or by increasing the concentration of Lon. Siller et al. recently examined the effect of the synthesis rate on the in vivo folding of firefly luciferase (Luc) (Siller et al., 2010). They found that slowing the rate of ribosomal translation decreased the extent of aggregation and increased the specific activity of Luc (see Extended Experimental Procedures for details). Siller et al. provide data for the relative specific activities of Luc after 15 min of expression at slow and fast translation rates (Siller et al., 2010) and data that can be used to estimate the extent of aggregation (see Figure S5A and Extended Experimental Procedures). We attempted to reproduce their results using FoldEco.
We constrained our simulations by using estimates of the folding (kf) and misfolding (km) rates for Luc folding based on in vitro studies of Luc refolding in the presence (Szabo et al., 1994) and absence (Herbst et al., 1998) of the KJE system (see Extended Experimental Procedures for details on this and other constraints; see also Figure S5B). The other folding parameters had little effect on the results of the simulations. For the initial chaperone concentrations, we used the chaperone concentrations at the top of the confidence interval reported in Table S1 for Luc produced at both slow and fast translation rates (40 µM trigger factor, 50 µM DnaK, 2 µM DnaJ, 20 µM GrpE, 5 µM GroEL tetradecamer, 10 µM GroES heptamer, 0.5 µM Lon hexamer, 0.5 µM ClpB hexamer), because heterologous expression of aggregation-prone proteins like Luc is known to induce the heat shock response (Hoffmann and Rinas, 2004). Moreover, we note that these chaperone concentrations are similar to some experimental measurements of chaperone concentrations in non-stressed E. coli (see Table S1).
The results of our simulations at these chaperone concentrations match the experimental results well. They reproduce the increase in aggregation as well as the decrease in specific activity (Figures 5A and 5B, respectively; see also Table S6 and Extended Experimental Procedures) observed experimentally upon increasing the translation rate (Siller et al., 2010). FoldEco also reproduces the trends in the data at lower chaperone concentrations, but the extent of aggregation is generally overestimated (Figure S5C and D; Table S7). Thus, FoldEco currently encompasses most of the aspects of in vivo proteostasis needed to understand the effect of protein synthesis rate on the partitioning of protein between aggregates and native, functional protein.
Figure 5. Tests of FoldEco predictions.
(A) Comparison of the experimental (gray bars) and predicted (blue bars) fraction of Luc that aggregates when expressed in bacteria at a slow (left) or fast (right) translation rate. (B) Comparison between experimentally determined (gray bar) and FoldEco predicted ratios of the specific activities (SA) of Luc when expressed in bacteria at a slow or fast translation rate. Specific activity normalized to the total mass of protein. The experimental data for (A) and (B) were taken from Siller et al. (Siller et al., 2010); see Extended Experimental Procedures for details.
Despite this success, one observation made by Siller et al. was not reproduced in these simulations: that the absolute activity (not just the specific activity) of Luc is higher in their expression system when translation is slower, despite less protein being produced per cell. Siller et al. attribute this effect to cotranslational folding at lower translation rates (Siller et al., 2010). The current version of FoldEco does not explicitly account for cotranslational folding, but we can mimic it by increasing the folding rate constant (kf) by a factor of about two in the simulations at the slower translation rate, but not the faster translation rate (Figure S5E). This adjustment enables us to reproduce their observation (see Extended Experimental Procedures), and suggests that cotranslational folding doubles the effective folding rate of Luc.
This exercise demonstrates one of the most important uses of models for complicated systems: they enable concrete tests of whether a given set of processes is sufficient to explain a given set of experimental results. When they are not, it is clear that at least one important process have been left out of the model. In this case, cotranslational folding is not necessary to explain the decrease in aggregation of Luc when it is synthesized more slowly, but the importance of cotranslational folding to the production of active Luc is nevertheless unambiguous. The current version of FoldEco can be adapted to account for cotranslational folding by adjusting kf, but future versions will include it explicitly.
The other hypotheses that are suggested by our results with FoldEco have to do with the characteristics of the substrates for the KJE and GroELS systems. For example, based on Figures 3A and 4A, we expect that DnaK substrates in E. coli should be aggregation prone. This prediction is consistent with the results from a recent study of DnaK interactors. Calloni et al. found that proteins that are highly enriched on DnaK (Calloni et al., 2012) tend to be less soluble than the average proteins of soluble cell lysate (where solubility measurements were taken from a proteome-wide measurement of E. coli protein solubilities (Niwa et al., 2009)). Based on Figures 3B and 4B, we expect that GroEL substrates in E. coli should be slow folding, aggregation prone, or both. The latter prediction is consistent with experimental results (Chapman et al., 2006; Fujiwara et al., 2010; Kerner et al., 2005). The former prediction must await proteome-wide determination of folding rate constants to be evaluated, but it is consistent with the finding that obligate GroEL substrates tend to have topologically complicated structures (Fujiwara et al., 2010; Kerner et al., 2005), since proteins with such structures generally fold slowly (Plaxco et al., 1998).
FoldEco as a Web Resource
To enable those interested to perform their own experiments with FoldEco, we have created a web version of FoldEco, the home page of which is http://foldeco.scripps.edu. This website has an overview of the model, descriptions of its components and subsystems, and an basic and advanced interactive pages into which parameters can be input and then sent to a server. The server solves FoldEco with the user-defined parameters and returns the output. There are several options for the output: fully graphical depictions of the simulation results (including static images, movies, and interactive animation); concentration vs. time plots for a selection of the species in the model; and downloadable tables with concentration vs. time data for some or all of the species in the model. Complete guides to the input parameters and the output data are provided on the website. We expect this publically available resource to prove useful as a means to both generate hypotheses and rationalize existing data about proteostasis.
Experimental Procedures
General
The differential equations that make up FoldEco were solved numerically using Mathematica® 8.0.1 (Wolfram Research) on a Dell Precision T7500 Workstation running Windows 7 Professional (64 bit) with a 3.33 GHz Intel® Xeon® W5590 Quadcore CPU and 24 GB of RAM. FoldEco is implemented on the web using Wolfram webMathematica™ 3.0 running on a VMware virtual server running CentOS installed on an Oracle Sun blade server. Details about setting up and solving FoldEco can be found in the Extended Experimental Procedures. A Mathematica® application file with the code for FoldEco and a Mathematica® notebook file demonstrating how to use FoldEco are provided as Supplemental Information.
Setting Up and Solving FoldEco
The differential equations that make up FoldEco can be derived from Figure 1 (or Figure S1). The net rate of change of each species’ concentration is equal to the sum of the rates of the processes that produce it, less the sum of the rates of the processes that consume it (where the rate of a process is its rate constant multiplied by the concentrations of the species that the process consumes). For example, free inactive ribosomes (R in Figure 1) are consumed when they become translationally active (forming Ra,i) or when they bind trigger factor (T). R is produced by dissociation of R:T, and by dissociation of Ra:Ui and Ra:Ui:T after translation is complete. The rate of change of the concentration of R is therefore
(1) |
where the square brackets indicate concentrations, d[R]/dt is the time derivative of [R], and s0,1, si,3, s0,2, si,11, and si,12 are rate constants (the first subscript of these rate constants is 0 if the process does not involve a client protein and i if it does; the second subscript indicates the reaction number in the synthesis system of FoldEco; see Figure S1A). FoldEco is solved using the numerical differential equation solver in Mathematica®, which adaptively chooses its integration methods and step sizes depending on the properties of the system of differential equations (e.g., smoothness and stiffness of the solutions).
Supplementary Material
Acknowledgments
We acknowledge with gratitude the contribution of Daniel W. Farrell (Stony Brook University) to the implementation of the FoldEco program. We thank Jeffery W. Kelly, R. Luke Wiseman, Amber N. Murray (The Scripps Research Institute) and Ken A. Dill (Stony Brook University) for helpful discussions. We thank F. U. Hartl and M. Hayer-Hartl for giving us early access to their DnaK interactome results. This work was funded by National Institutes of Health grants OD000945 and GM027616.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Supplemental Information
Supplemental information includes five figures, seven tables (two as Excel files), a Mathematica® application file with the code for FoldEco, a Mathematica® notebook file demonstrating how to use FoldEco, and Extended Experimental Procedures. This material can be found with this article online.
References
- Acebrón SP, Fernandez-Saiz V, Taneva SG, Moro F, Muga A. DnaJ recruits DnaK to protein aggregates. J. Biol. Chem. 2008;283:1381–1390. doi: 10.1074/jbc.M706189200. [DOI] [PubMed] [Google Scholar]
- Acebrón SP, Martin I, del Castillo U, Moro F, Muga A. DnaK-mediated association of ClpB to protein aggregates. A bichaperone network at the aggregate surface. FEBS Lett. 2009;583:2991–2996. doi: 10.1016/j.febslet.2009.08.020. [DOI] [PubMed] [Google Scholar]
- Apetri AC, Horwich AL. Chaperonin chamber accelerates protein folding through passive action of preventing aggregation. Proc. Natl. Acad. Sci. U. S. A. 2008;105:17351–17355. doi: 10.1073/pnas.0809794105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bremer H, Dennis PP. Modulation of chemical composition and other parameters of the cell by growth rate. In: Neidhardt FC, editor. Escherichia coli and Salmonella typhimurium: Cellular and molecular biology. Washington, D. C.: American Society for Microbiology; 1987. pp. 1527–1542. [Google Scholar]
- Calloni G, Chen T, Schermann SM, Change H-C, Genevaux P, Agostini F, Tartaglia GG, Hayer-Hartl M, Hartl FU. DnaK functions as a central hub in the E. coli chaperone network. Cell Reports. 2012 doi: 10.1016/j.celrep.2011.12.007. In press. [DOI] [PubMed] [Google Scholar]
- Chakraborty K, Chatila M, Sinha J, Shi Q, Poschner BC, Sikor M, Jiang G, Lamb DC, Hartl FU, Hayer-Hartl M. Chaperonin-catalyzed rescue of kinetically trapped states in protein folding. Cell. 2010;142:112–122. doi: 10.1016/j.cell.2010.05.027. [DOI] [PubMed] [Google Scholar]
- Chapman E, Farr GW, Usaite R, Furtak K, Fenton WA, Chaudhuri TK, Hondorp ER, Matthews RG, Wolf SG, Yates JR, et al. Global aggregation of newly translated proteins in an Escherichia coli strain deficient of the chaperonin GroEL. Proc Natl Acad Sci U S A. 2006;103:15800–15805. doi: 10.1073/pnas.0607534103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chiti F, Dobson CM. Protein misfolding, functional amyloid, and human disease. Annu. Rev. Biochem. 2006;75:333–366. doi: 10.1146/annurev.biochem.75.101304.123901. [DOI] [PubMed] [Google Scholar]
- Deuerling E, Bukau B. Chaperone-assisted folding of newly synthesized proteins in the cytosol. Crit. Rev. Biochem. Mol. Biol. 2004;39:261–277. doi: 10.1080/10409230490892496. [DOI] [PubMed] [Google Scholar]
- Diamant S, Ben-Zvi AP, Bukau B, Goloubinoff P. Size-dependent disaggregation of stable protein aggregates by the DnaK chaperone machinery. J. Biol. Chem. 2000;275:21107–21113. doi: 10.1074/jbc.M001293200. [DOI] [PubMed] [Google Scholar]
- Doyle SM, Hoskins JR, Wickner S. Collaboration between the ClpB AAA+ remodeling protein and the DnaK chaperone system. Proc. Natl. Acad. Sci. U. S. A. 2007;104:11138–11144. doi: 10.1073/pnas.0703980104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Erbse A, Schmidt R, Bornemann T, Schneider-Mergener J, Mogk A, Zahn R, Dougan DA, Bukau B. ClpS is an essential component of the N-end rule pathway in Escherichia coli. Nature. 2006;439:753–756. doi: 10.1038/nature04412. [DOI] [PubMed] [Google Scholar]
- Fujiwara K, Ishihama Y, Nakahigashi K, Soga T, Taguchi H. A systematic survey of in vivo obligate chaperonin-dependent substrates. EMBO J. 2010;29:1552–1564. doi: 10.1038/emboj.2010.52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gamer J, Multhaup G, Tomoyasu T, McCarty JS, Rudiger S, Schonfeld HJ, Schirra C, Bujard H, Bukau B. A cycle of binding and release of the DnaK, DnaJ and GrpE chaperones regulates activity of the Escherichia coli heat shock transcription factor σ32. EMBO J. 1996;15:607–617. [PMC free article] [PubMed] [Google Scholar]
- Genevaux P, Georgopoulos C, Kelley WL. The Hsp70 chaperone machines of Escherichia coli: a paradigm for the repartition of chaperone functions. Mol. Microbiol. 2007;66:840–857. doi: 10.1111/j.1365-2958.2007.05961.x. [DOI] [PubMed] [Google Scholar]
- Ghosh K, Dill K. Cellular proteomes have broad distributions of protein stability. Biophys. J. 2010;99:3996–4002. doi: 10.1016/j.bpj.2010.10.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghosh K, Dill KA. Computing protein stabilities from their chain lengths. Proc. Natl. Acad. Sci. U. S. A. 2009;106:10649–10654. doi: 10.1073/pnas.0903995106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gisler SM, Pierpaoli EV, Christen P. Catapult mechanism renders the chaperone action of Hsp70 unidirectional. J. Mol. Biol. 1998;279:833–840. doi: 10.1006/jmbi.1998.1815. [DOI] [PubMed] [Google Scholar]
- Gottesman S. Proteolysis in bacterial regulatory circuits. Annu. Rev. Cell Dev. Biol. 2003;19:565–587. doi: 10.1146/annurev.cellbio.19.110701.153228. [DOI] [PubMed] [Google Scholar]
- Hartl FU, Bracher A, Hayer-Hartl M. Molecular chaperones in protein folding and proteostasis. Nature. 2011;475:324–332. doi: 10.1038/nature10317. [DOI] [PubMed] [Google Scholar]
- Herbst R, Gast K, Seckler R. Folding of firefly (Photinus pyralis) luciferase: aggregation and reactivation of unfolding intermediates. Biochemistry. 1998;37:6586–6597. doi: 10.1021/bi972928i. [DOI] [PubMed] [Google Scholar]
- Hoffmann F, Rinas U. Stress induced by recombinant protein production in Escherichia coli. Adv Biochem Eng Biotechnol. 2004;89:73–92. doi: 10.1007/b93994. [DOI] [PubMed] [Google Scholar]
- Horwich AL, Farr GW, Fenton WA. GroEL-GroES-mediated protein folding. Chem. Rev. 2006;106:1917–1930. doi: 10.1021/cr040435v. [DOI] [PubMed] [Google Scholar]
- Hu B, Mayer MP, Tomita M. Modeling Hsp70-mediated protein folding. Biophys. J. 2006;91:496–507. doi: 10.1529/biophysj.106.083394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jewett AI, Shea JE. Do chaperonins boost protein yields by accelerating folding or preventing aggregation? Biophys. J. 2008;94:2987–2993. doi: 10.1529/biophysj.107.113209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kerner MJ, Naylor DJ, Ishihama Y, Maier T, Chang HC, Stines AP, Georgopoulos C, Frishman D, Hayer-Hartl M, Mann M, et al. Proteome-wide analysis of chaperonin-dependent protein folding in Escherichia coli. Cell. 2005;122:209–220. doi: 10.1016/j.cell.2005.05.028. [DOI] [PubMed] [Google Scholar]
- Lin Z, Madan D, Rye HS. GroEL stimulates protein folding through forced unfolding. Nat. Struct. Mol. Biol. 2008;15:303–311. doi: 10.1038/nsmb.1394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mayer MP, Bukau B. Hsp70 chaperones: cellular functions and molecular mechanism. Cell. Mol. Life Sci. 2005;62:670–684. doi: 10.1007/s00018-004-4464-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mayer MP, Schroder H, Rudiger S, Paal K, Laufen T, Bukau B. Multistep mechanism of substrate binding determines chaperone activity of Hsp70. Nat. Struct. Biol. 2000;7:586–593. doi: 10.1038/76819. [DOI] [PubMed] [Google Scholar]
- Mogk A, Deuerling E, Vorderwulbecke S, Vierling E, Bukau B. Small heat shock proteins, ClpB and the DnaK system form a functional triade in reversing protein aggregation. Mol. Microbiol. 2003;50:585–595. doi: 10.1046/j.1365-2958.2003.03710.x. [DOI] [PubMed] [Google Scholar]
- Niwa T, Ying BW, Saito K, Jin W, Takada S, Ueda T, Taguchi H. Bimodal protein solubility distribution revealed by an aggregation analysis of the entire ensemble of Escherichia coli proteins. Proc Natl Acad Sci U S A. 2009;106:4201–4206. doi: 10.1073/pnas.0811922106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oosawa F, Asakura S. Thermodynamics of the polymerization of protein. London: Academic Press; 1975. [Google Scholar]
- Ouyang Z, Liang J. Predicting protein folding rates from geometric contact and amino acid sequence. Protein Sci. 2008;17:1256–1263. doi: 10.1110/ps.034660.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Plaxco KW, Simons KT, Baker D. Contact order, transition state placement and the refolding rates of single domain proteins. J. Mol. Biol. 1998;277:985–994. doi: 10.1006/jmbi.1998.1645. [DOI] [PubMed] [Google Scholar]
- Powers ET, Morimoto RI, Dillin A, Kelly JW, Balch WE. Biological and chemical approaches to diseases of proteostasis deficiency. Annu. Rev. Biochem. 2009;78:959–991. doi: 10.1146/annurev.biochem.052308.114844. [DOI] [PubMed] [Google Scholar]
- Ranson NA, Burston SG, Clarke AR. Binding, encapsulation and ejection: substrate dynamics during a chaperonin-assisted folding reaction. J. Mol. Biol. 1997;266:656–664. doi: 10.1006/jmbi.1996.0815. [DOI] [PubMed] [Google Scholar]
- Rye HS, Roseman AM, Chen S, Furtak K, Fenton WA, Saibil HR, Horwich AL. GroEL-GroES cycling: ATP and nonnative polypeptide direct alternation of folding-active rings. Cell. 1999;97:325–338. doi: 10.1016/s0092-8674(00)80742-4. [DOI] [PubMed] [Google Scholar]
- Siller E, DeZwaan DC, Anderson JF, Freeman BC, Barral JM. Slowing bacterial translation speed enhances eukaryotic protein folding efficiency. J. Mol. Biol. 2010;396:1310–1318. doi: 10.1016/j.jmb.2009.12.042. [DOI] [PubMed] [Google Scholar]
- Stoebel DM, Dean AM, Dykhuizen DE. The cost of expression of Escherichia coli lac operon proteins is in the process, not in the products. Genetics. 2008;178:1653–1660. doi: 10.1534/genetics.107.085399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Szabo A, Langer T, Schroder H, Flanagan J, Bukau B, Hartl FU. The ATP hydrolysis-dependent reaction cycle of the Escherichia coli Hsp70 system DnaK, DnaJ, and GrpE. Proc Natl Acad Sci U S A. 1994;91:10345–10349. doi: 10.1073/pnas.91.22.10345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tehver R, Thirumalai D. Kinetic model for the coupling between allosteric transitions in GroEL and substrate protein folding and aggregation. J. Mol. Biol. 2008;377:1279–1295. doi: 10.1016/j.jmb.2008.01.059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tyagi NK, Fenton WA, Horwich AL. GroEL/GroES cycling: ATP binds to an open ring before substrate protein favoring protein binding and production of the native state. Proc. Natl. Acad. Sci. U. S. A. 2009;106:20264–20269. doi: 10.1073/pnas.0911556106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Usui K, Hulleman JD, Paulsson JF, Siegel SJ, Powers ET, Kelly JW. Site-specific modification of Alzheimer's peptides by cholesterol oxidation products enhances aggregation energetics and neurotoxicity. Proc. Natl. Acad. Sci. U. S. A. 2009;106:18563–18568. doi: 10.1073/pnas.0804758106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Varshavsky A. The N-end rule pathway and regulation by proteolysis. Protein Sci. 2011 doi: 10.1002/pro.666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vorderwulbecke S, Kramer G, Merz F, Kurz TA, Rauch T, Zachmann-Brand B, Bukau B, Deuerling E. Low temperature or GroEL/ES overproduction permits growth of Escherichia coli cells lacking trigger factor and DnaK. FEBS Lett. 2004;559:181–187. doi: 10.1016/S0014-5793(04)00052-3. [DOI] [PubMed] [Google Scholar]
- Weibezahn J, Tessarz P, Schlieker C, Zahn R, Maglica Z, Lee S, Zentgraf H, Weber-Ban EU, Dougan DA, Tsai FT, et al. Thermotolerance requires refolding of aggregated proteins by substrate translocation through the central pore of ClpB. Cell. 2004;119:653–665. doi: 10.1016/j.cell.2004.11.027. [DOI] [PubMed] [Google Scholar]
- Wiseman RL, Powers ET, Buxbaum JN, Kelly JW, Balch WE. An adaptable standard for protein export from the endoplasmic reticulum. Cell. 2007;131:809–821. doi: 10.1016/j.cell.2007.10.025. [DOI] [PubMed] [Google Scholar]
- Zhao K, Liu M, Burgess RR. The global transcriptional response of Escherichia coli to induced σ32 protein involves σ32 regulon activation followed by inactivation and degradation of σ32in vivo. J. Biol. Chem. 2005;280:17758–17768. doi: 10.1074/jbc.M500393200. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.