Abstract
Using exhaustive Monte Carlo simulations we study the kinetics and mechanism of fibril formation using lattice models as a function of temperature (T) and the number of chains (M). While these models are, at best, caricatures of peptides, we show that a number of generic features thought to govern fibril assembly are captured by the toy model. The monomer, which contains eight beads made from three letters (hydrophobic, polar, and charged), adopts a compact conformation in the native state. In both the single-layered protofilament (seen for M⩽10) and the two-layer fibril (M>10) structures, the monomers are arranged in an antiparallel fashion with the “strandlike” conformation that is perpendicular to the fibril axis. Partial unfolding of the folded monomer that populates an aggregation prone conformation (N*) is required for ordered assembly. The contacts in the N* conformation, which is one of the four structures in the first “excited” state of the monomer, are also present in the native conformation. The time scale for fibril formation is a minimum in the T-range when the conformation N* is substantially populated. The kinetics of fibril assembly occurs in three distinct stages. In each stage there is a cascade of events that transforms the monomers and oligomers to ordered structures. In the first “burst” stage, highly mobile oligomers of varying sizes form. The conversion to the N* conformation occurs within the oligomers during the second stage in which a vast number of interchain contacts are established. As time progresses, a dominant cluster emerges that contains a majority of the chains. In the final stage, the aggregation of N* particles serve as a template onto which smaller oligomers or monomers can dock and undergo conversion to fibril structures. The overall time for growth in the latter stages is well described by the Lifshitz–Slyazov growth kinetics for crystallization from supersaturated solutions. The detailed analysis shows that elements of the three popular models, namely, nucleation and growth, templated assembly, and nucleated conformational conversion are present at various stages of fibril assembly.
INTRODUCTION
The link between aggregation of proteins and a number of neurodegenerative diseases has spurred many experimental1, 2, 3, 4, 5, 6, 7, 8, 9, 10 and theoretical studies.11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23 Aggregation rates depend not only on protein sequence but also on the concentration of proteins and external conditions (temperature, pH, presence of crowding agents, etc.). The observation that many proteins that are unrelated by sequence and structure can aggregate and form fibrils10 with similar morphologies (albeit under different growth conditions) suggests that certain generic aspects of oligomerization and subsequent fibril growth can be gleaned from toy models. Toward this end, a number of lattice models11, 24, 25 have been introduced to probe the fibril formation mechanism. Here, following the important studies by Hall and co-workers,11 we use a three-dimensional lattice model that is, in part, inspired by all-atom simulations of oligomer formation of the peptide fragment Aβ16–22,16 to provide insights into mechanism of fibril formation.
Soluble (S) monomeric polypeptide chains can be either random coil-like (Aβ peptides or α-synuclein) or folded (transthyretin). Typically, fluctuations or denaturation stress can populate one of several aggregation-prone conformations (N*). Because of conformational variations in N*, fibrils with differing molecular structure can form starting from the same sequence. However, the growth mechanism starting from N* to the fibril state is not fully understood. Three mechanisms for fibril assembly have been proposed. In the nucleation-growth (NG) mechanism,26, 27 the first step is the oligomerization of a sufficient number of N* particles that form a critical nucleus, which is a free-energetically uphill process. Upon forming (with n>the size of the critical nucleus nc), S monomers can rapidly add to the oligomer resulting in their growth and eventual fibril assembly. The templated-assembly (TA) process28, 29, 30 suggests that preformed complex, with presumably n>nc, serves as a template onto which S or N* can dock and undergo the needed structural rearrangement to lock onto the template. Based on kinetic data on prion formation in yeasts, the nucleated conformational conversion (NCC) model1, 31 has been proposed. In the NCC model, it is envisioned that S forms mobile disordered oligomers. The monomers in the oligomer undergo S→N* conversion to form nuclei . The species can serve as a template and incorporate other (less structured) oligomers or monomers to rapidly form ordered fibrils. The important feature of NCC is that structural rearrangement S→N*→Nfib (Nfib is the monomer structure in the fibril) occurs within the molten oligomer. In many cases, the structures of N* and Nfib are similar.
In this paper, we study the mechanism of fibril assembly using a simple lattice model for which extensive simulations can be performed. The analysis reveals a complex scenario for protofilament and fibril assembly that has the elements of all three growth models. The dependence of fibril formation time τfib on the number of monomers reveals that late stages of growth have a lot in common with crystallization in supersaturated solutions. These findings arise from detailed Monte Carlo (MC) simulation studies using a toy lattice model in which each chain has N=8 beads of three types, namely, hydrophobic (H), polar (P), and charged (see Sec. 2). Our simulations show that the overall assembly of ordered protofilaments and fibrils occur in three distinct stages. The smallest time scale is associated with a fast “burst phase” during which highly mobile oligomers form. During this stage there is a distribution of oligomers of varying sizes. Because we are forced to simulate a finite number of chains, we cannot quantify the nature of the size distribution. The second stage is the transformation of the burst phase structures into a disordered but compact oligomer in which about half of the interpeptide contacts form. It is likely that the conformational transition from S→N* takes place during this stage, as envisioned in the NCC model. The longest time scale corresponds to the final stage of fibril formation. In this stage, the large clusters grow by incorporating the small clusters. The structural transitions here are best described by a dock-lock mechanism that requires the presence of a template. Thus, even in the toy model there are complex structural transitions that take place in each stage of assembly. It appears that elements of NG, TA, and NCC are operative at different stages of fibril formation.
METHODS
Model
Each chain consists of N connected beads that are confined to the vertices of a cube. The simulations are done using M identical chains with N=8. The sequence of a chain is +HHPPHH−, where + and − are charged beads. The assignment of chemical character and the nature of interactions between the beads should be viewed as a caricature of polypeptide chains and are not a realistic representation of amino acids. Despite such a drastic simplification, it has been shown that lattice models are useful in providing insights into protein folding mechanisms.32, 33, 34
The inter- and intrachain potentials include excluded volume and contact (nearest neighbor) interactions. Excluded volume is imposed by the condition that a lattice site can be occupied by only one bead. The energy of M chains is
(1) |
where rij is the distance between residues i and j, a is the lattice spacing, sm(i) indicates the type of residue i in the mth peptide, and δ(0)=1 and zero, otherwise. The first and second terms in Eq. 1 represent intrapeptide and interpeptide interactions, respectively.
The contact energies between the H beads eHH is −1 (in the units of kBT). The propensity of polar (including charged) residues to be “solvated” is mimicked by using ePα=−0.2, where α=P, + or −. “Salt-bridge” formation between oppositely charged beads is accounted for by a favorable contact energy e+−=−1.4. All other contact interactions are repulsive. The generic value for repulsion eαβ is 0.2. For a pair of like-charged beads, the repulsion is stronger, i.e., e++=e−−=(0.7). The chains were confined to the vertices of the three-dimensional hypercube. For example, when M=10 the length of the hypercube is 10a. Therefore, the volume fraction occupied by the peptides is 0.08, and the concentration corresponds to 250 mM. This is about three orders of magnitude denser than that used in typical experiments.
Simulation details
The simulations were performed by enclosing M chains in a box with periodic boundary conditions. We use MC algorithm to study the kinetics of amyloid formation. At the beginning of each MC cycle, a peptide is selected at random. Then one of the two types of MC moves, global or local, is randomly chosen. The acceptance probabilities of global and local moves are 0.1 and 0.9, respectively. Global moves correspond to either translation of a peptide by a in a randomly chosen direction or rotation by 90° around one of the randomly chosen coordinate axes. The direction of rotation as well as the type of global move are selected at random. A local move corresponds to tail rotation, corner flip, and crankshaft rotation.35 Given the condition that a local move is accepted with 0.9 probability, we used the same relative probabilities for selecting the particular types of local moves as described elsewhere.36 We measure time in units of MC steps (MCS). The combination of local and global moves constitutes one MCS.
For interacting chains, which leads to complex formation, care must be taken to ensure that move sets are efficient enough so that on the simulation time scale equilibrium is established.37 Our previous work25 on lattice model for aggregation showed that the relative rotation of chains, in a complex, is a slow process. Thus, we explicitly allowed for such moves and global translation. The acceptance rates were adjusted to ensure that chain disassociation occurs. Multiple association and disassociation of chains from transiently formed oligomers assures us (but does not prove) that for M=16 we have approached equilibrium at long times. In addition, the kinetic simulations lead to the ordered structure that is identical to the one obtained by an independent simulated annealing protocol.
Structural probes
The contacts in the aggregated state (oligomer or fibrils) are divided into two categories; intrapeptide and interpeptide. If two nonbonded beads (i.e., those that are not covalently linked) of a given chain are near neighbors, then they form an intrachain contact. An interpeptide contact in an ordered conformation is one that is (i) formed between beads belonging to different peptides, and (ii) the associated peptide bonds are in the ordered state. All interpeptide contacts in the fibril structure satisfy condition (ii), although this is not generally the case for an arbitrary oligomeric structure. The numbers of intrapeptide and interpeptide fibril contacts in an arbitrary conformation are denoted as Qm and Qf with Qm,0 and Qf,0 being their values in the fibril state. In what follows, quantities with the subscript 0 correspond to the fibril structure.
In order to probe the growth of the fibril, we obtained the distribution of the fibril clusters in a given oligomer conformation. A fibril cluster is computed by selecting a pair of fibril contacts and adding adjacent fibril contacts whose peptide bonds are parallel or antiparallel to the bonds associated with the original fibril contact pair. The growth of the fibril cluster continues until no more fibril contacts can be added to the cluster in any direction. A typical oligomer contains several fibril clusters of different sizes that are measured by the number of incorporated fibril contacts. The number of fibril contacts in the largest cluster is denoted by Qfc. In the fibril structure, a single fibril cluster consumes all of the residues and chains, and hence Qfc=Qf.
We have also computed the number of interpeptide contacts (of any type) Cout, which describes the formation of the aggregated state. Aggregation of chains is also monitored by computing the distribution of oligomers. An oligomer is defined as a group of aggregated chains. Two oligomers are distinct if none of the chains from one oligomer interacts with any chain from the other. A given multichain conformation may contain several oligomers, and their number N0 is useful to characterize the process of aggregation. In addition, the number of peptides in the largest oligomer Np is computed. As aggregation progresses, Np approaches M.
Kinetics of assembly
To follow the kinetics of aggregation, an initial distribution of M random peptide structures is generated and equilibrated at high temperature (T=3.0) for 105 MCS. The resulting distribution of chains is used as a starting point for initiating fibril assembly which begins by quenching the temperature to Ts (<3.0). Each MC trajectory starts with a unique distribution of chains. The total number of MC trajectories NMC for a given Ts varies from 100 to 400. The first instance, when the fraction of intrapeptide and interpeptide fibril contacts exceed 0.85, is associated with the first passage time τfib,i for fibril assembly for a trajectory i. The condition Q(=0.85), which is the fraction of intra- and interchain fibril contacts, was chosen empirically by analyzing numerous MC trajectories. The mean time of fibril assembly is computed by fitting the yield of the fibril structure Pf(t) in the pool of NMC independent MC trajectories.
Rapid nucleation of fibril structure was analyzed as follows. For each trajectory, we considered an interval of 106 MCS immediately preceding τfib,i and computed various quantities associated with fibril formation as described above. In addition, within the time interval, τfib,i−106<t<τfib,i, we considered the subset of fibril contacts in the largest fibril cluster Qfc, which satisfy two conditions,38, 39 namely, (i) that these fibril contacts are formed at the time of fibril assembly, τfib,i, and (ii) that apart from short lived disruptions they remain stable within the interval (t,τfib,i). The disruptions of fibril contacts must not exceed t=2000 MCS. The results do not depend on the specific value of t when it is varied by ±1000 MCS. The fibril contacts satisfying these two conditions are referred to as “nucleation” fibril contacts and their number is denoted as Qnfc.
RESULTS AND DISCUSSION
Monomeric and fibril structures
Monomer
The exact enumeration of all possible conformations of the monomer of eight beads shows that there are 18 energy levels. The three lowest levels in the spectrum are presented in Fig. 1. The monomeric native state is compact, and it has the lowest energy E=−3.8. It should be noted that the conformation of the chain in the fibril state is not compact, and it belongs to the first excited state (label N* in Fig. 1) which is fourfold degenerate. Fluctuations in the monomer conformations has to populate the structure with E=−3.4 for oligomerization to start. Such fluctuations, under condition when the native structure is stable, can occur spontaneously or through interchain interactions. Clearly, suppression of fluctuations at low temperatures would slow down the process of ologimerization. The toy model captures the well-accepted proposition that aggregation requires partial unfolding of the native conformation.40
Ensemble of peptides
When multiple chains are present in the unit cell, aggregation is readily observed, and in due course they form ordered structures (Fig. 2). Exact enumeration of all the conformations for multichain systems is not possible so the structure of the lowest energy has to be determined using simulations. We used the MC annealing protocol, which allows for the exhaustive conformational search, to find the lowest energy conformation. In the ordered protofilament (M=10) and fibril (M=16) structures, the chains adopt an antiparallel arrangement (Fig. 2).
The nature of ordering changes depending on M, and hence the concentration. For M⩽10 the chains are arranged in a single layer, while for M>10 the fibril state has a double-layer arrangement (Fig. 2). Just as noted, using all-atom molecular dynamics simulations,41 the organization of the chains in the fibril satisfies the principles of amyloid self-assembly (PASA) which states that the fibril structures are determined by maximizing the number of salt bridges and hydrophobic contacts.41 In accord with PASA, we found that the organization of the lowest energy structure demonstrates a remarkable order, leading to the maximization of favorable electrostatic and hydrophobic interactions (Fig. 2). All H (in green) beads located at the core of the fibril are sandwiched between exposed layers of P (in yellow) and charged beads (in blue and red). More importantly, all of the peptides adopt an in-registry antiparallel mutual orientation, which implies that for all bond vectors connecting nearest neighbor pairs of residues (i,N−i+1) and , where m and l are the peptide indices. The antiparallel arrangement is enforced by favorable electrostatic interactions. Figure 2 shows that the nearest neighbors of all negatively charged terminals (in red) are positively charged beads (in blue).
For M=10, there are in all 84 interpeptide fibril contacts and 30 intrapeptide contacts, and the entire protofilament structure in Fig. 2a comprises a single layer. This implies that a given interpeptide antiparallel in-registry arrangement of the chains is translated across the entire volume of the fibril in all directions. It is interesting that all intrapeptide contacts are also found in the native conformation of the monomer [lowest energy conformation in Fig. 1a] and the N* structure [Fig. 1a]. Due to different possible distributions of peptides within the volume of a fibril, the lowest energy fibril structure has nonzero entropy.
The fibril contains both interpeptide and intrapeptide interactions. The structure of the monomer in the ordered fibril coincides with one of the structures that is higher in energy than the native monomer conformation (conformation N* in Fig. 1). Because the fibrils are associated with aggregation of unfolded structures (here the first excited state in the spectrum of allowed monomer conformation), it is logical that other morphologies, which nucleate from different unfolded conformations, can form. By scanning the sequences for N=8, we could not produce fibrils starting from one of the high energy monomer conformations which highlights one of the limitations of the lattice model. This observation suggests that as long as the peptide sequence contains hydrophobic patches and oppositely charged residues distributed along the sequence, the fibril structure is likely to include a mixture of inter- and intrapeptide interactions. A combination of inter- and intrapeptide contacts maximizes the number of hydrophobic and salt bridges thus satisfying the PASA.
There are superficial similarities between structures in Fig. 2 and the model proposed for Aβ1–40, whose sequence is interspersed with charged and hydrophobic residues. The amyloidgenic Aβ peptide contains two hydrophobic regions (central hydrophobic cluster and the C-terminal), as well as charged residues. The proposed fibril model for Aβ1–40 is based on the assumption that an Aβ1–40 monomer contains a turn, which brings the two hydrophobic regions in proximity and facilitates the formation of a salt bridge.42
Time scales for monomer folding and fibril assembly
The short chain (N=8) allows us to compute the times τF for monomer folding as a function of temperature. The decay of the population of unfolded conformations is best described using a single exponential (data not shown), which is a characteristic of well-designed sequence. The folding time τF is well below 103 MCS (Fig. 3) over a wide temperature range. In contrast, the temperature-dependent time for fibril formation τfib is dramatically different (Fig. 3). There are two striking observations about τfib. First, τfib is about four to six orders of magnitude larger than τF. Clearly, the sizes of the monomer and the fibril can cause the vastly greater value of τfib compared to τF. The effect of the system size can be roughly rationalized using the approximate dependence of τF on N.43 It has been shown that .44 Assuming that τF0 does not change significantly and taking into account that the fibril in our model is ten times larger, then the monomer size alone would yield τfib∕τF∼103. In addition, the formation of fibril (or protofibrils) also requires collective fluctuation (e.g., formation of nucleus), which requires that several monomers access the N* structure in the first excited state of the isolated monomer (Fig. 1). There are barriers associated with such processes that also increase τfib. The relative values τfib∼(104–106)τF is not inconsistent with experimental observations. Typical values of τF for small proteins is about 1–100 ms. Thus, our simulations would suggest τfib∼102–104 s if τF∼10 ms.
The most striking aspect of Fig. 3 is the dramatic differences in the T-dependence of τF(T) and τfib(T). The temperature independence of τF(T) in the 0.3TF⩽T⩽1.3TF is typical of well-designed monomer sequences for which TF≈Tθ, where Tθ is the collapse transition temperature.45 In contrast, τfib(T) changes drastically as T varies. In the narrow temperature range (TF⩽T⩽1.4TF), τfib varies by almost two orders of magnitude. At the temperature T≈1.3TF (Fig. 3), when τfib is the smallest, the native structure is less stable than the unfolded ensemble. The structures of the partially unfolded conformations at T≈1.3TF shows that the probability of the “salt bridges” (intramolecular contact between + and − beads) being in contact exceeds 0.5. At T≈1.3TF there is a substantial probability of populating the aggregation-prone monomer [Fig. 1b], which acts as a seed for nucleation and growth. At T≈1.3TF, the fibrils form in the smallest time with 100% yield, whereas at T=TF the yield of the fibril drops to 0.42 during the simulations lasting 108 MCS.
The observation that partial unfolding of the native state is a necessary condition for oligomerization and fibril growth is consistent with experimental observations that many nonhomologous protein sequences assemble into amyloid fibrils under denaturing conditions.46 Although the formation of fibrils is apparently a generic feature of polypeptide sequences, our simulations suggest that for a given sequence, there may be only a narrow window of external conditions that favor rapid fibril assembly. Besides requiring that the native monomer partially unfolds for aggregation to begin, the denaturing conditions must also be relatively mild. Under these conditions, aggregation-prone structures with intramolecular native interactions that are moderately stable can be populated. In our model, the conformation that nucleates and grows (Fig. 1) is homogeneous, which results in a unique fibril structure. Denaturing conditions that favor its formation, with intact salt bridges results in the most rapid assembly (Fig. 1). In polypeptide chains, there may be a collection of conformations that can lead to fibrils. The differences in fibril morphology is probably linked to the variations in the initial conformations of the monomer.
Fibril assembly occurs in three major stages
Formation of protofilaments
To provide microscopic details of fibril assembly, we generated multiple MC trajectories for M=10 at Ts=0.65=1.3TF at which τfib is the smallest (Fig. 3). In all, 100 MC trajectories starting from random initial conditions were generated. The length of the MC trajectories (8×107 MCS) at Ts was sufficiently long to observe ordered structure formation in each trajectory. Figure 4 displays several quantities averaged over 100 trajectories and normalized to vary from 1 (at t=0) to 0 (the equilibrium value). The averaging over the ensemble of trajectories is indicated by angular brackets ⟨ ⟩. The time scales from exponential fits to these functions describe the kinetics of fibril formation. The analysis of the various time dependent quantities and the inspection of the structures sampled enroute to the final fibril gives an intuitive picture of assembly and growth.
Immediately after the temperature quench to Ts, the chains are randomly distributed in the unit cell. The numbers of intra- and interpeptide fibril contacts are negligible, and there are relatively few interchain interactions. The largest oligomer contains, on average, four chains (Np=4). Within a short time, the interchain interactions trigger the formation of oligomers, which represent the growth stage in the route to fibrils. Figure 4b shows that the average number of free chains ⟨Nfree⟩ (i.e., those which do not make interpeptide contacts) is less than 1 in ≈0.03×106 MCS or 0.01τfib. Almost concurrently, the number of peptides in the largest oligomer ⟨Np(t)⟩ exceeds 9. Thus, already in the initial stage, the chains interact and cooperatively form fluidlike oligomers. Indeed, ⟨Np(t)⟩ grows on the time scale of 0.06×106 MCS or 0.02τfib, and approaches its equilibrium value of 9.8. Therefore, virtually all of the chains are incorporated in a single burst phase leading to mobile oligomer formation.
The second stage in fibril assembly is associated with the formation of intra- and interpeptide interactions, which transforms the mobile oligomer formed in the first stage into a compact disordered oligomer. During this stage, structural rearrangement and conversion from S→N* take place, as shown by a number of quantities. The intrapeptide fibril contacts ⟨Qm(t)⟩ [data in blue in Fig. 4a] are formed on the time scale of 0.1τfib. On a similar time scale, the number of interpeptide contacts ⟨Cout(t)⟩ (data in green) approaches the equilibrium value of approximately 67. Interestingly, the number of distinct clusters ⟨Nfc(t)⟩ reaches a maximum during this stage of fibril assembly (data not shown). We surmise that the disordered oligomer contains as many as four distinct fibril clusters, the largest of which already comprises roughly 50% of the entire protofilament. Figure 4a further demonstrates that at t≈0.1τfib the distribution of the volume of fibril clusters extends from predominantly small clusters (Qfc⩽14) to larger ones (15⩽Qfc⩽28). The total number of fibril contacts is still relatively small in the disordered oligomer (⟨Qf(0.2τfib)⟩≈30=0.36Qfc,0). Therefore, disordered oligomers are characterized by a nascent single-layer protofilamentlike structure [Fig. 2a], which emerges in the oligomer volume as a distribution of disjoint fibril clusters of varying sizes.
The transformation of disordered oligomers to an ordered structure occurs during the third stage of fibril assembly. It follows from Fig. 4a that the time scale for the formation of interpeptide fibril contacts ⟨Qf(t)⟩ is 0.5×106 MCS or ≈0.2τfib (data in red). Importantly, on the same time scale, the dominant fibril cluster grows as shown by ⟨Qfc(t)⟩ (data in orange). This result indicates that the formation of the fibril structure occurs via the growth of the largest fibril cluster at the expense of small clusters. The winner-take-all scenario of fibril growth is further described below. The number of fibril clusters ⟨Nfc(t)⟩ decreases to less than 3 in the time interval of 0.2τfib<t<τfib. On the other hand, the maximum in the kinetic distribution of the fibril structure among the clusters shifts to the right, signaling the emergence of large clusters (43⩽Qfc⩽70). By assigning weight in proportion to the size of fibril clusters, we find that the dominant fibril cluster comprises of almost the entire fibril structure. In accord with this conclusion, we found that the number of fibril contacts (i.e., the degree of fibril structure) in the largest clusters is 43⩽Qfc⩽70 (results not shown). It is clear that at t>0.4τfib more than 80% of ordered structure is localized in a single large fibril cluster. Because on these time scales ⟨Nfc(t)⟩≈2, the remaining 10%–20% of fibril contacts are found in a much smaller satellite fibril cluster.
The formation of a dominant cluster containing the protofilament also follows from the calculations of thermodynamic quantities. The thermal averages of the number of fibril contacts ⟨Qf⟩ and the number of fibril contacts in the largest fibril cluster ⟨Qfc⟩ are 52 and 47, respectively. Thus, ⟨Qfc⟩=0.90 ⟨Qf⟩. After the dominant fibril cluster appears at t≈0.4τfib its further growth and consolidation continues until it reaches its equilibrium size (about 60% of all fibril contacts). This kinetic phase can be described by additional time scale with a small amplitude. Due to this additional fibril ordering, the final fibril assembly takes place only at τfib=3.3×106 MCS. Thus, long after the formation of the largest cluster, structural reorganization continues until the ordered stable fibril forms. The slow TA within the large cluster is reminiscent of the lock phase.
Mechanism of fibril assembly
In order to probe the mechanism of fibril formation [two-layer structure in Fig. 2b] at Ts=0.7, we generated 100 trajectories with each being 108 MCS. The mean time for fibril formation is τfib≈2×107 MCS. These long runs ensure that the fully ordered state is reached in each trajectory. Qualitatively, the fibril formation kinetics is the same as in the M=10 case, i.e., it follows three-stage kinetics. However, there are a few quantitative differences. In the protofilament formation case, the interpeptide contacts ⟨Cout(t)⟩, and intrapeptide fibril contacts ⟨Qm(t)⟩ [Fig. 4b] are formed on the same time scale. For M=16 [Fig. 4c] ⟨Cout(t)⟩ approaches the value of 0.5 earlier. A fit of ⟨Cout(t)⟩ using a sum of three-exponential functions gives τ1=0.15×106 MCS≈0.01τfib, τ2≈106 MCS≈0.05τfib and τ3≈11.2×106 MCS≈0.5τfib. Thus, τ1 is a characteristic time scale of the burst phase in which fluidlike clusters form. On this time scale, only a few interpeptide fibril contacts Qf (≈0.6% of total contacts) are formed and the largest oligomer contains, on average, only five peptides (Np=5). Using the three-exponential fit and data presented in Fig. 5a, one can show that the formation of the largest cluster occurs on a time scale of ≈0.02τfib. The number of peptides in this cluster approaches 15 [Fig. 4d], whereas the number of free peptide becomes 0. Almost simultaneously, the number of distinct fibril clusters ⟨Nfc(t)⟩ reaches a maximum (data not shown).
The second stage of fibril assembly, in which the burst phase oligomer is transformed into a compact disordered oligomer, takes place on the times scale τ2≈0.05τfib. Due to the larger value of M, this time is larger than for M=10. At this stage, 50% of the equilibrium values of the intra- (Qm) and interpeptide (Qf) fibril contacts are formed. Contrary to the M=10 case, fibril contacts in the largest cluster Qfc are formed earlier than the total Qf. This is probably due to the increasing role of the satellite clusters as the number of monomers increases. On long time scales, we have more than two and less than two such clusters for M=16 and 10, respectively. The “winner-take-all scenario” is also valid for the M=16 system because for t>0.2τfib the largest cluster contains ≈75% of the fibril contacts. These observations are made quantitative by using the dependence of τfib∼M (see below).
As seen from Fig. 5a, the three exponentials (f(t)=f0−f1 exp(−t∕τ1)−f2 exp(−t∕τ2)−f3 exp(−t∕τ3)) fit the data well (dashed line). Here, we have three different time scales τ1≈0.17×106, τ2≈1.24×106, and τ1≈12.18×106 MCS (the partitions of these phases are f1≈0.19, f2≈0.46, and f3≈0.1). Experiments on the fibril growth kinetics of Aβ-peptides,28 which is fit using a sum of two exponential functions, have been interpreted in terms of a template-assisted “dock-lock” mechanism. From the perspective of the present studies, we conclude that such a mechanism is probably valid during the second and third stages of fibril growth. The lock phase, during which in-registry arrangement of the chains takes place, clearly occurs only during the last part of stage three in the fibril growth process. The early stages of growth reveal a much more complex set of events in which the physical processes described in NG and NCC are manifested (see also the Sec. 4).
Dependence of fibril formation time on number of monomers: Lifshitz–Slyozov growth law
In order to obtain the dependence of τfib on the number of monomers, we fixed the monomer concentration and computed τfib for each system at Ts. The fibril formation time scales linearly with the number of monomer [Fig. 5b], τfib∼M but with different slopes for M⩽10 and M>10. This is probably related to the difference between protofilament and fibril formation [see Figs. 2a, 2b]. The linear dependence of τfib on M supports the template-assisted mechanism in which monomers are added one by one to preformed ordered structures (protofibrils or fibrils), provided that the number of these monomers exceeds the size of the critical nucleus. Thus, the linear dependence characterizes growth only during the late stages of ordered assembly. Our results agrees with the experimental findings of Kowalewski and Holtzman47 who studied the aggregation of Alzheimer’s β-amyloid peptides on hydrophilic mica and hydrophobic graphite surfaces, as well as with the results obtained by Collins et al.48 for the amyloidogenic yeast prion protein Sup35.
Interestingly, the dependence of τfib on M for such a complicated process as fibril assembly seems to follow the well-known Lifshitz–Slyzov (LS) law. Since M∼L3, where L is a typical size of the ologimer, we obtain τfib∼L1∕3 which is the LS law49 describing the growth of a cluster in a supersaturated solution. The finding in Fig. 5b further supports the “winner-take-over” scenario for oligomer growth because the LS law is based on the assumption that the largest cluster grows at the expense of smaller ones.
The evidence for the LS growth law is based on MC simulations involving multiple chains. Although the simplicity of the system permits us to perform exhaustive simulations that gives us confidence that we repeatedly reach the ordered state at long times (for both M=10 and M=16), the effect of the move sets used on the predicted LS law has to be assessed. This is particularly important because in a careful analysis Deeds et al.37 argued that move sets in systems that involve transient complex formation have to be carefully taken into account to ensure approach to equilibrium. The theoretical analysis of LS is based on the assumption that large oligomers devour smaller ones by diffusion which is mimicked here by MC simulations. Thus, LS theory shows L3≈Dt, where D is an effective multiparticle diffusion constant. If one was to vary the move sets in a MC simulations we suspect that only D would be renormalized with the LS growth law remaining intact.
CONCLUDING REMARKS
We have used a lattice model to elucidate the generic features of fibril assembly mechanisms in proteins. Using this toy model, many aspects of the transitions from the monomer to fully formed fibrils can be monitored. The examination of the kinetics of the assembly process reveals that several aspects of a complex set of transitions seen in the simple model is also qualitatively observed in experiments.
-
(1)
The ordered fibrils form as the number of chains becomes greater than a critical value. In our system, we find that for M=16 a stable two-layer fibril is formed, which is perhaps the minimum replicating unit in the infinite fibril. For smaller M [Fig. 2a], ordered protofilaments are the lowest energy conformation. It is likely that there are substantial internal rearrangements of the chains as the number of monomers increases so that stable fibrils can be populated. Although we did not carry out systematic calculations to infer the size of the critical nucleus, it appears both from the temperature dependence of protofilament formation as well as the ease of fibril production for M=16 that the size of the nucleus has to be less than 10.
-
(2)
The kinetics of fibril formation occurs broadly in three distinct stages. In the initial stage, the chains rapidly partition into clusters of varying sizes. Because of finite size limitations, we are unable to determine the precise distribution of cluster sizes. The chains within each cluster is mobile and fluidlike. There are, in all likelihood, substantial conformational fluctuations within each cluster. In the second stage, the chains in the clusters form a number of intra- and interchain contacts that lead to the disordered oligomers. During this stage, bigger clusters grow at the expense of smaller ones. In the process, protofibrils in which many peptides adopt the eventual conformation in the fibrils form. In the third postnucleation stage, the chains add to the largest (single) cluster. In this stage, which is captured in experiments, the addition of a monomer occurs by a lock-dock mechanism. Thus, a cascade of events, starting from conformational fluctuations in the monomer that populate the aggregation-prone conformation [Fig. 1a] through a series of interpeptide interaction-driven conformational changes, results in fibril assembly.
-
(3)
The growth kinetics depends on the depth of the quench ΔT=(Ti−Ts), where Ti is the initial temperature at which the chains are brought at equilibrium. When the depth of the quench is large, there appears to be a lag-time before the fibrils are populated. In this case, the ordered structures form in a highly cooperative manner. In contrast, when the growth process is initiated by equilibrating the monomers at the final growth temperature (ΔT=0), the fibril growth occurs in a continuous manner and is less cooperative (Fig. 6). Because the aggregation-prone structure is unique in the toy model, we do not observe variations in the morphology of the final fibril structure. This is surely an artifact of the lattice model.
-
(4)
The temperature dependence of for M=10 shows Arrhenius behavior with (see inset in Fig. 3). This is in qualitative agreement with experiments.50, 51 In addition, the collective rearrangements of several chains from the S to the N* structure that occurs within the oligomer become slower at low temperatures. These two factors contribute to the barrier that leads to a substantial increase in τfib as T is lowered.
-
(5)
The mechanism of assembly of fibrils even in this toy model is highly complex. While the overall growth kinetics can be summarized using a three-stage growth, the events that transpire in the distinct stages involve large structural transitions. In the initial burst phase loosely bound clusters form in which the chains are essentially “noninteracting.” In the second stage, stable clusters with considerable interparticle interactions form. There is a distribution of oligomers. Due to the finite size of the simulations, the nature of the distribution is unclear. It is within these oligomers, in which the chains are in a mixture of S-like and the aggregation prone N*-like states, that the conversion from S to N* takes place. These transitions result in the formation of large-enough ordered oligomers that can serve as templates for the conversion of additional monomers or oligomers to form mature fibrils. It is the last stage that is best described by the dock-lock mechanism.
-
(6)
Strikingly, the growth of mature fibrils in the third stage occurs by the LS mechanism in which the largest clusters grow at the expense of smaller ones. The proposed mechanism supports the physical picture that the S→N* transition occurs either in the oligomers (NCC model) or upon addition to preformed ordered template (dock-lock mechanism). Thus, we find that the elements of the three models (NG, TA, and NCC) are found in each assembly stage. This conclusion also supports a detailed study of fibril growth in an off-lattice model of polyalanine52 in which multiple routes to fibril formation were found, even in the final stages of the incorporation of ordered structures or disordered monomers. Finally, the proposed LS growth law strongly suggests that seeding with a preformed fibrils should lead to rapid growth because such large structures can incorporate disordered oligomers on time scales that vary linearly with peptide concentration.
ACKNOWLEDGMENTS
The contributions of D.K.K. were made when he was an Assistant Research Scientist at the University of Maryland. M.S.L. thanks the hospitality of IPST where this work was initiated. We are grateful to Dr. Sam Cho for comments and encouragement. J.E.S. and D.T. were supported by a grant from the National Institutes of Health (R01GM076688-05) and M.S.L.’s work was supported by the Ministry of Science and Information in Poland (202-204-234).
References
- Lomakin A., Chung D. S., Bedenek G. B., Kirschner D. A., and Teplow D. B., Proc. Natl. Acad. Sci. U.S.A. 10.1073/pnas.93.3.1125 93, 1125 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rochet J. C. and Lansbury P. T., Curr. Opin. Struct. Biol. 10.1016/S0959-440X(99)00049-4 10, 60 (2000). [DOI] [PubMed] [Google Scholar]
- Wetzel R., Structure (London) 10.1016/S0969-2126(02)00809-2 10, 1031 (2002). [DOI] [PubMed] [Google Scholar]
- Selkoe D. J., Nature (London) 10.1038/nature02264 426, 900 (2003). [DOI] [PubMed] [Google Scholar]
- Dobson C. M., Science 10.1126/science.1093078 304, 1259 (2004). [DOI] [PubMed] [Google Scholar]
- Ross C. A. and Poirier M. A., Nat. Med. 10, S10 (2004). [DOI] [PubMed] [Google Scholar]
- Bossy-Wetzel E., Schwarzenbacher R., and Lipton S. A., Nat. Med. 10, S2 (2004). [DOI] [PubMed] [Google Scholar]
- Lee J. C., Gray H. B., and Winkler J. R., J. Am. Chem. Soc. 10.1021/ja0561901 127, 16388 (2005). [DOI] [PubMed] [Google Scholar]
- Nelson R. and Eisenberg D., Adv. Protein Chem. 10.1016/S0065-3233(06)73008-X 73, 235 (2006). [DOI] [PubMed] [Google Scholar]
- Chiti F. and Dobson C. M., Annu. Rev. Biochem. 10.1146/annurev.biochem.75.101304.123901 75, 333 (2006). [DOI] [PubMed] [Google Scholar]
- Gupta P., Hall C. K., and Voegler A. C., Protein Sci. 7, 2642 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ma B. Y. and Nussinov R., Proc. Natl. Acad. Sci. U.S.A. 10.1073/pnas.212206899 99, 14126 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Massi F. and Straub J. E., Proteins: Struct., Funct., Genet. 42, 217 (2001). [DOI] [PubMed] [Google Scholar]
- Smith A. V. and Hall C. K., J. Mol. Biol. 10.1006/jmbi.2001.4845 312, 187 (2001). [DOI] [PubMed] [Google Scholar]
- Gsponer J., Haberthur U., and Caflisch A., Proc. Natl. Acad. Sci. U.S.A. 10.1073/pnas.0835307100 100, 5154 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klimov D. K. and Thirumalai D., Structure (London) 10.1016/S0969-2126(03)00031-5 11, 295 (2003). [DOI] [PubMed] [Google Scholar]
- Thirumalai D., Klimov D. K., and Dima R. I., Curr. Opin. Struct. Biol. 10.1016/S0959-440X(03)00032-0 13, 146 (2003). [DOI] [PubMed] [Google Scholar]
- Favrin G., Irback A., and Mohanty S., Biophys. J. 10.1529/biophysj.104.046839 87, 3657 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wei G. H., Mousseau N., and Derreumaux P., Biophys. J. 10.1529/biophysj.104.047688 87, 3648 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buchete N. V., Tycko R., and Hummer G., J. Mol. Biol. 353, 804 (2005). [DOI] [PubMed] [Google Scholar]
- Takeda T. and Klimov D. K., J. Mol. Biol. 10.1016/j.jmb.2007.02.066 368, 1202 (2007). [DOI] [PubMed] [Google Scholar]
- Bellesia G. and Shea J. E., J. Chem. Phys. 10.1063/1.2739547 126, 245104 (2007). [DOI] [PubMed] [Google Scholar]
- Baumketner A. and Shea J. E., J. Mol. Biol. 10.1016/j.jmb.2006.11.015 366, 275 (2007). [DOI] [PubMed] [Google Scholar]
- Harrison P. M., Chan H. S., Prusiner S. B., and Cohen F. E., Protein Sci. 10.1110/ps.38701 10, 819 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dima R. I. and Thirumalai D., Protein Sci. 10.1110/ps.4220102 11, 1036 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Teplow D. B., Lazo N. D., Bitan G., Bernstein S., Wyttenbach T., Bowers M. T., Baumketner A., Shea J.-E., Urbanc B., Cruz L., Borreguero J., and Stanley H. E., Acc. Chem. Res. 10.1021/ar050063s 39, 635 (2006). [DOI] [PubMed] [Google Scholar]
- Jarrett J. T. and Lansbury P. T., Cell 10.1016/0092-8674(93)90635-4 73, 1055 (1993). [DOI] [PubMed] [Google Scholar]
- Esler W. P., Stimson E. R., Jennings J. M., Vinters H. V., Ghilardi J. R., Lee J. P., Mantyh P. W., and Maggio J. E., Biochemistry 10.1021/bi992933h 39, 6288 (2000). [DOI] [PubMed] [Google Scholar]
- Cannon M. J., Williams A. D., Wetzel R., and Myszka D. G., Anal. Biochem. 328, 67 (2004). [DOI] [PubMed] [Google Scholar]
- Nguyen P., Li M. S., Staub J. E., and Thirumalai D., Proc. Natl. Acad. Sci. U.S.A. 10.1073/pnas.0607440104 104, 111 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Serio T. R., Cashikar A. G., Kowal A. S., Sawicki G. J., Moslehi J. J., Serpell L., Arnsdorf M. F., and Lindquist S. L., Science 10.1126/science.289.5483.1317 289, 1317 (2000). [DOI] [PubMed] [Google Scholar]
- Thirumalai D., Klimov D. K., and Dima R. I., Adv. Chem. Phys. 120, 35 (2002). [Google Scholar]
- Shakhnovich E., Farztdinov G., Gutin A. M., and Karplus M., Phys. Rev. Lett. 10.1103/PhysRevLett.67.1665 67, 1665 (1991). [DOI] [PubMed] [Google Scholar]
- Socci N. D. and Onuchic J. N., J. Chem. Phys. 10.1063/1.467775 101, 1519 (1994). [DOI] [Google Scholar]
- Hilhorst H. J. and Deutch J. M., J. Chem. Phys. 10.1063/1.431297 63, 5153 (1975). [DOI] [Google Scholar]
- Li M. S., Klimov D., and Thirumalai D., J. Phys. Chem. B 10.1021/jp025837q 106, 8302 (2002). [DOI] [Google Scholar]
- Deeds E. J., Ashenberg O., Gerardin J., and Shakhnovich E. I., Proc. Natl. Acad. Sci. U.S.A. 10.1073/pnas.0702766104 104, 14952 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guo Z. and Thirumalai D., Folding Des. 2, 277 (1997). [DOI] [PubMed] [Google Scholar]
- Klimov D. K. and Thirumalai D., J. Mol. Biol. 287, 471 (1998). [DOI] [PubMed] [Google Scholar]
- Fink A. L., Folding Des. 10.1016/S1359-0278(98)00002-9 3, R9 (1998). [DOI] [PubMed] [Google Scholar]
- Tarus B., Straub J. E., and Thirumalai D., J. Am. Chem. Soc. 10.1021/ja064872y 128, 16159 (2006). [DOI] [PubMed] [Google Scholar]
- Tycko R., Curr. Opin. Struct. Biol. 13, 96 (2004). [DOI] [PubMed] [Google Scholar]
- Thirumalai D., J. Phys. I 10.1051/jp1:1995209 5, 1457 (1995). [DOI] [Google Scholar]
- Li M. S., Klimov D. K., and Thirumalai D., Polymer 10.1016/j.polymer.2003.10.06645, 573 (2004). [DOI] [Google Scholar]
- Klimov D. K. and Thirumalai D., J. Chem. Phys. 10.1063/1.477012 109, 4119 (1998). [DOI] [Google Scholar]
- Chiti F., Stefani M., Taddei N., Ramponi G., and Dobson C. M., Nature (London) 10.1038/nature01891 424, 805 (2003). [DOI] [PubMed] [Google Scholar]
- Kowalewski T. and Holtzman D. H., Proc. Natl. Acad. Sci. U.S.A. 10.1073/pnas.96.7.3688 96, 3688 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Collins S. R., Douglass A., Vale R. D., and Weissman J. S., PLoS Biol. 2, 1582 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lifshitz I. M. and Slyozov V. V., J. Phys. Chem. Solids 10.1016/0022-3697(61)90054-3 19, 35 (1961). [DOI] [Google Scholar]
- Sabaté R., Gallardo M., and Estelrich J., Int. J. Biol. Macromol. 10.1016/j.ijbiomac.2004.11.001 35, 9 (2005). [DOI] [PubMed] [Google Scholar]
- Kusumoto Y., Lomakin A., Teplow D. B., and Benedek G. B., Proc. Natl. Acad. Sci. U.S.A. 10.1073/pnas.95.21.12277 95, 12277 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nguyen H. D. and Hall C. K., J. Biol. Chem. 280, 9074 (2005). [DOI] [PubMed] [Google Scholar]