Skip to main content
PLOS One logoLink to PLOS One
. 2016 Jan 8;11(1):e0146096. doi: 10.1371/journal.pone.0146096

A Monte Carlo Study of the Early Steps of Functional Amyloid Formation

Pengfei Tian 1,2,¤, Kresten Lindorff-Larsen 2, Wouter Boomsma 2,*, Mogens Høgh Jensen 1,*, Daniel Erik Otzen 3,*
Editor: Jie Zheng4
PMCID: PMC4706413  PMID: 26745180

Abstract

In addition to their well-known roles in neurodegenerative diseases and amyloidoses, amyloid structures also assume important functional roles in the cell. Although functional amyloid shares many physiochemical properties with its pathogenic counterpart, it is evolutionarily optimized to avoid cytotoxicity. This makes it an interesting study case for aggregation phenomenon in general. One of the most well-known examples of a functional amyloid, E. coli curli, is an essential component in the formation of bacterial biofilm, and is primarily formed by aggregates of the protein CsgA. Previous studies have shown that the minor sequence variations observed in the five different subrepeats (R1-R5), which comprise the CsgA primary sequence, have a substantial influence on their individual aggregation propensities. Using a recently described diffusion-optimized enhanced sampling approach for Monte Carlo simulations, we here investigate the equilibrium properties of the monomeric and dimeric states of these subrepeats, to probe whether structural properties observed in these early stage oligomers are decisive for the characteristics of the resulting aggregate. We show that the dimerization propensities of these peptides have strong correlations with their propensity for amyloid formation, and provide structural insights into the inter- and intramolecular contacts that appear to be essential in this process.

Introduction

While protein amyloid has typically been associated with pathological conditions such as amyloidosis and neurodegenerative diseases including Parkinson’s, Alzheimer’s and Huntington’s disease [1, 2], it is increasingly clear that amyloid can also be a benign state with various important functional roles in their hosts [35]. Recent advances reveal that these functional amyloids share many physical and chemical properties with the pathological variants [68]. For instance, curli, natural amyloid fibers that are involved in biofilm formation by E.coli, can induce inflammatory responses of host tissue, similar to that known from the Alzheimer associated Aβ [9]. Moreover, a molecule that stimulates the oligomer assembly of α-synuclein has been demonstrated to have similar effect on the curli amyloid protein CsgA [10]. Despite their similarities, the properties of pathogenic and functional amyloid diverge in one important aspect: functional amyloid is regulated by the organism to avoid the cytotoxicity associated with the formation of their pathogenic counterparts [1113]. This renders functional amyloid highly important as model systems for understanding amyloid growth.

Curli is one of the best-understood functional amyloid system [14]. It is mainly formed by aggregation of the subunit protein, CsgA (Fig 1). CsgA is a soluble protein secreted through the outer membrane by protein CsgG, nucleated by the CsgB and assembled into amyloid fibrils on the surface of the cell [15, 16]. Recently, progress has been made on the characterization of the detailed amyloid protein structure by solid state NMR [6, 17], and amino acid proximity information extracted from residue-residue coevolution has revealed that CsgA forms a β-helical structure inside the amyloid fibrils [18]. However, the details of the folding pathway of CsgA and the assembly pathway remain unclear.

Fig 1. CsgA primary sequence including sub-repeat (R1 –R5) with the alignment of internally conserved Ser-X5-Gln-X-Gly-X-Gly-Asn-X-Ala-X3-Gln-X residues.

Fig 1

The amyloid core of CsgA contains five repeating units, R1-R5, each consisting of 19–23 amino acids (Fig 1). Each of these repeats is found to form a single turn in the final β-helical state [18]. A recent set of studies probed these subunits independently, demonstrating that the nucleation and polymerization of CsgA is encoded in the minor variations in amino acid composition of these different subunits [1921]. For example, deletion of R1 or R5 prevents CsgA amyloid formation, while R2 can be omitted without affecting the overall aggregation process. Also, peptides corresponding to R1 and R5 were shown to aggregate efficiently by themselves, while R2 lacked this property. Likewise, studies have examined the effect of the amino acid composition of the turn region separating the repeats, showing similarly strong dependencies [22]. These detailed experimental characterizations of the effects of minor sequence variations on global aggregation properties provide us with a unique opportunity to investigate the link between early oligomeric structure and the mature fibrillar amyloid state. One of the main difficulties in understanding the process of amyloid formation at a molecular level is the range of scales involved—both in time and size, a significant effort has been made towards the study of this problem using computer simulations recently [2327]. Initially, monomers are thought to associate into oligomers or protofibrils, which will eventually, under specific conditions, assemble into full fibrils. The initial dimerization is known to be critical for oligomerization and subsequent fibril growth [25, 28, 29], and it is also considered as a key primary event of the pathogenesis of for instance prion diseases [30], but it is still largely unknown to which extent the structural properties of the dimer are linked to that of the fibril [31]. The small size of the CsgA peptides, combined with the evolutionary optimization of the aggregation propensities, makes them attractive model systems to test the early events in protein aggregation in silico.

In the present study, we performed atomic-level computer simulations of the monomer-monomer interactions of the individual CsgA subrepeats which were exprimentally characterized in the studies described above, and investigate whether the effects we observe in their dimerization translate directly to the experimentally observed aggregation properties. Using a recently developed Monte Carlo (MC) enhanced sampling technique, we obtain a converged thermodynamic characterization of the free energy landscape of the interacting monomers, and use this to quantify the degree to which different inter- and intramolecular contacts are formed. Our study shows a remarkable correspondence between the propensities for dimer formation and the CsgA aggregation propensities previously reported, suggesting that formation of specific contacts already at the dimerization stage can dramatically influence the amyloid pathway.

Results

Characterization of monomer conformations

The subrepeats of the CsgA protein generally have high sequence similarity (Fig 1), but display variations that have been demonstrated to have a dramatic impact on their propensity to aggregate[1921]. This observation raises the question whether the minor variations in primary sequence encode critical change in secondary structure, which might be observed already at the monomer level. To investigate this question, we began by characterizing the conformational ensemble of the monomeric state of the individual R1-R5 repeats, by determining the free energy, F(α, β), calculated as a function of α-helix and β-strand content (Fig 2).

Fig 2. Free energy F(α,β) at 300K of the monomer of R1—R5((a)—(e)), calculated as a function of the α-helix content (Y-axis) and β-strand content (X-axis).

Fig 2

F(α,β) = -ln(P(x))), where P(x) represents the probability of state x.

We conducted our simulations using an all atom force field with an empirical implicit solvent model, PROFASI [32, 33]. This simulation framework has been successfully used to characterize thermodynamics of protein folding, as well as native structure ensembles with experimental data restraints [3437]. Recently, combining with Monte Carlo method, it has been used to study the dimerization and aggregation of Aβ42 [38, 39] and Aβ16–22 peptides [40], the misfolding of α-synuclein [41] and Copper, zinc superoxide dismutase 1 [42], the simulation results agree very well with the experimental studies on the kinetic and equilibrium properties. As an additional test of the forcefield, we conducted a series of simulations on the (AAQAA)3 15-mer peptide, a system which has previously been used to test molecular forcefields. These results show agreement with experiment as good as optimized all-atom MD potentials [43] (Figure A in S1 File) To explore a wide range of conformations more efficiently, in our current study, sampling was performed in a multicanonical (flat histogram) ensemble and subsequently reweighted to obtain Boltzmann statistics at 300K (see Methods). The secondary structure content was calculated based on Ramachandran angles for each residue using STRIDE [44].

Unlike what has previously been observed for monomeric Aβ peptides [25, 38, 45] and α-synuclein [41, 46, 47] which all have strong propensity to form β-strands, we find that the averaged ensembles have α-helix content < 10% and β-strand content < 20% (Fig 2 and Figure B in S1 File). The structural diversities of the monomers are described with more details by calculating the residue contact probability map of the whole ensemble at room temperature (Figure C in S1 File). Our results thus indicate that the R1–R5 monomer is not well structured in its free state, some β-hairpin like residue-residue contacts are formed with very low probabilities at 300K. One potential explanation could be that monomeric disorder is required to maintain solubility and avoid early aggregation of CsgA before it is secreted to the cell surface [16]. The generally low levels of secondary structure make it difficult to differentiate between properties of the individual subrepeats. Consequently, at the monomer level, we find little correlation between simulated secondary structure propensities and the experimentally observed aggregation propensities [19].

Dimer simulations

Peptides binding free energy

Given their disordered nature, the monomer simulations provide little information on the aggregated state of CsgA. We therefore proceed with a simulation analysis of dimer-formation for each of the subrepeats. It is known to be computationally challenging to converge dimer simulations. We initiated our studies using a standard parallel tempering (replica exchange) setup, but failed to obtain reliable free energy estimates (see Methods) and were therefore forced to abandon this approach. To overcome these problems we adopted a generalized ensemble technique that aims at optimizing the number of transitions from high to low energy (in this case between monomeric and dimeric species), using the energy-dependent diffusion profile (see Methods). Compared to the parallel tempering method, the sampling speed was improved by 30% to 460% (depending on the subrepeat) in terms of round trip time, thereby accumulating more statistics especially around the transition states (see Methods). The simulations of all the systems are fully converged as we can see from the error analysis of the thermodynamic properties using the Jackknife method in the Figure D in S1 File. In each simulation, we initialize with random extended structures in a cubic box with side lengths of 100Å, corresponding to a concentration of around 4.8 mg/ml.

To investigate the character of the transition between phases of the dimerized and separated chains, we constructed equilibrium free energy surfaces at 300K (Fig 3). Despite very high sequence identity, the five peptides display dramatically different dimerization tendencies. For R1, R4 and R5, the global free energy minimum at high β-strand content and short distance of the two chains indicates that dimeric β-hairpin states dominate the population. In contrast, the R2 peptides mainly stay in the unbounded disordered states, while R3 shows an intermediate binding affinity, marginally stronger than R2. Compared to the monomers, which are primarily disordered, the ensemble of dimer conformations contains significantly larger β-hairpin population, especially for R1, R3, R4 and R5. We thus see a remarkable stabilizing effect on the hairpin induced by the formation of dimer. The calculated standard binding free energy (ΔG) are -4.7 kcal/mol, -2.9 kcal/mol, -4.3 kcal/mol, -6.0 kcal/mol and -8.3 kcal/mol for R1-R5 respectively. ΔG is calculated as:

ΔG=kBTln(KaVref),

where Ka is the association constant, kB is the Boltzmann constant, T is the room temperature (300K), and Vref is the association constant and Vref is the reference volume in units consistent with the units of concentration in Ka (mol/L). Ka = [AB]/[A][B], where A represents peptide chain A and B represents chain B. Both A and B start out as monomers with an initial concentration of 1 per simulation box (100Å in length, i.e. 10−24 m3 or 10−27 L), corresponding to 1/600 mol/L.

Fig 3. Free energy of binding (-ln(P(x))) for R1 –R5 dimer at the 300K.

Fig 3

D (X-axis) is the distance of the center of the mass between the two-peptide chains and β content (Y-axis) is the percentage of β-strand within the structure. (a)–(e) correspond to R1 –R5, respectively.

Aggregation propensity

A central question is how strongly the observed dimer characteristics are linked to the properties of the fully formed aggregate state. Fortunately, aggregation of CsgA subrepeats has been extensively studied experimentally [1921], showing substantial differences between the repeats. We proceed with a more detailed comparison to these experimental data by calculating melting curves from the temperature dependent dimerization simulations described in the previous section (Fig 4A). We observe that the melting temperatures for R1, R4 and R5 (312K, 300K and 300K, respectively) are clearly higher than those for R2 and R3 (291K and 293K, respectively), pointing toward a considerable difference in stability between the two groups.

Fig 4. (a) Temperature dependent population of the dimer phase. (b) Dimerization propensity obtained by the simulation at 300K compared to the experimental measurement of the self-aggregation propensity at room temperature (i.e. around 300K)[19].

Fig 4

The experimental data takes the form of aggregation propensities for the repeats measured using thioflavin T fluorescence [19, 21]. The simulation results projected to the distance of the center of the mass between the peptide chains (D) show strongly bimodal distribution at the room temperature (Figure E in S1 File), the aggregated phase and the detached phase are separated by a clear free energy barrier at D = 15Å. Therefore, to allow for comparisons to our simulated data, we define the proteins to be in a dimeric state when the D is below 15Å. The percentage of dimer population observed in simulation is shown in Fig 4. After normalizing the populations obtained from both the simulation and the experimental data, we observe that for 4 out of 5 of the subrepeats: R1, R2, R3 and R5, there are very strong correlations between the dimerization propensity obtained by the simulation and the oligomerization propensity obtained by experiments (Fig 4). To further validate the robustness of our conclusion, we reproduced our analyses using a cutoff of D = 10Å, and show that the aggregation profile is consistent with the D = 15Å result (Figure F in S1 File). Furthermore, to rule out disordered states within the cutoff, we have added an additional analysis using β-strand content as a reaction coordinate, calculating the temperature dependent population with D < 15 Å and β-strand-content > 50% (Figure F in S1 File). The relative aggregation propensities of different systems (except for R4) were found to remain consistent with the experimental data at room temperature.

As a control simulation, we include a dimerization study of the 22 amino acids long N-terminal peptide from CsgA that is not involved in curli fibril formation (Nterm in Fig 1). The simulations reveal that this peptide does not substantially dimerize at any temperature within a broad temperature range. This is consistent with previous findings that this part of the sequence is not involved in the amyloid core of the curli structure [20, 21].

The observed correlation between dimerization propensities in simulation and the experimentally observed rate of aggregation for these subrepeats suggests that structural properties of the full fibril are to some extent encoded in the dimer structure. This can be seen as providing some support to previous studies on neurodegenerative-associated amyloid proteins, where dimerization has been reported as the primary trigger for oligomerization and further aggregation [29, 30, 48]. The R4 subrepeat, however, stands out with a significant discrepancy between simulation and experiment. This prompted us to further investigate the detailed structural conformations and contacts formed during the dimerization process for the different repeats.

Intra- and inter-chain contacts

For a more detailed description of the tertiary organization of the dimers, we calculated the probability of each possible intrachain and interchain contact based on the whole ensemble for each system. Contact maps were constructed based on residue pairs with CA-CA distance below 8Å, containing the percentage of time that this contact is formed during a simulation (Fig 5). The parallel and anti-parallel pattern in the contact map demonstrates that dimer states occur with different orientations. R2 is an exception, for which almost no binding contacts between the two chains are observed, which agrees with the low aggregation propensity observed in experiment [19]. To support the visual interpretation of the contact maps, we also isolate representative structures for each subrepeat as cluster centroids (using the Gromacs g_cluster program [49]) (Fig 6).

Fig 5. Residue contact probability map for contacts within and between the two chains at 300K.

Fig 5

Intrachain and interchain pair contact probabilities are shown below and above the main diagonal (dashed white line), respectively. The solid white shows the diagonal of a 19×19 matrix in each plot. (a)–(e) correspond to R1 –R5, respectively.

Fig 6. Representative dimer state of R1 (a), R3 (b), R4 (c) and R5 (d).

Fig 6

Residue index of 8 to 11 are colored in green and residue 19 to the end of the chain is colored in purple as Fig 1. For residues 2 to 7 and 12 to 17, hydrogen-bonded backbone oxygens and protons are illustrated in red and blue, respectively. And the Cβ atoms on the side chains are shown as well. The dashed line is the distance between CA atoms of residues 1 and 19. A more detailed version of these structures is shown in the Figure G in S1 File.

Different dimer conformations can be classified and labeled according to the inner two strands of the dimer (whether the β-strand sequence is closer to either the N-terminus or the C terminus, and whether the two inner strands are either parallel or antiparallel): NN-parallel, NC-parallel, CC-parallel, NN-antiparallel, NC-antiparallel, and CC-antiparallel. From Fig 5 we can see the most populated dimer state for R1 and R3 is CC-antiparallel (Fig 6A and 6B). Moreover, R1 can also form an NN-parallel dimer. R4 forms all types of different dimers, ordered in probability from high to low: NN-parallel, NN-antiparallel, and NC-antiparallel and CC-antiparallel. The dimers of R5 include NN-parallel, NN-antiparallel, and NC-antiparallel. In contrast to the dimer of Aβ which is stabilized mainly by the hydrophobic effect [39, 50], the primary force driving the dimer formation is the inter and intra chain hydrogen bonding. Presumably, the oligomer could potentially be stabilized further with the formation of a hydrophobic core between opposite β sheets, similar to the model of the CsgA monomer [16, 18] or other peptide fibrils [51], when more chains are involved.

The R1 to R5 subrepeats are known to behave differently in their ability to polymerize with CsgA seeds [21]. In contrast to R1, R3, and R5, which polymerize rapidly in contact with a CsgA seed, the R2 and R4 subrepeats have been reported to remain unpolymerized for 48 hours at room temperature [20]. For R2, this result is entirely consistent with our simulations, which demonstrate that this subrepeat does not form stable interchain hydrogen bonds (Figs 3 and 5). In contrast, R4 readily dimerizes in our simulations. However, R4 exhibits several particular characteristics compared to the other repeats. While R4 shares characteristics of the inter-chain contacts with R1, R3 and R5, there are marked differences in the intra-chain contacts: the hairpin turn of R4 is at residue position 10–13 while R1, R3, R5 are located at around residue 8–11 (Figs 5D and 6C). Interestingly, this latter position is identical to the location of the turns in the full-length CsgA amyloid state [16, 18]. The aggregation mode has previously been reported to be highly related to the arrangement of the sequence within the repeat [22], suggesting that the lack of CsgA-seeded R4 polymerization could be explained by this incompatibility in topology.

Conclusion

Accurate descriptions of the multiple length- and time scales involved in protein aggregation remains an important challenge in molecular simulation [5254]. In this study, we investigated whether characteristic properties observed in the early oligomerization steps are reflected in the final aggregate state for the different subrepeats in the functional amyloid protein CsgA. By combining a minimalistic and highly efficient force field with a state-of-the art Monte Carlo simulation methodology, we have successfully converged the free energy landscapes of both the monomer and dimer states. The monomers were found to be primarily disordered. However, the dimerization process induces more well-defined structure, leading to distinct topologies for the different subrepeats. In our simulations, hydrogen bonding seems to be the main stabilizing force of dimer formation, in line with previous findings from both experiments and simulations [5557]. We found the peptide repeats R1, R3, and R5, which have the highest dimerization propensities, to form similar hairpins with the X-G-X-G amino acid motif at the turn position. Finally, we calculated the dimerization propensity for each subrepeat and found remarkable agreement with both the experimentally measured propensities for self-polymerization [19] and for polymerization with CsgA fiber seeds [21]. While strong correlations were observed over the propensities of the R1, R2, R3 and R5 subrepeats, R4 displays substantial discrepancies. Experimentally, R4 was previously reported to stand out for its inability to polymerize even in the presence of CsgA seeds. The contact maps obtained from our simulations suggest that the reason for this incompatibility could be the shift in turn position compared to the other subrepeats. Based on the amyloid state of the CsgA monomer we determined in the previous study [18], the hydrogen bonding seems to be the main stabilizing force between the neighboring subrepeat. And the conserved pattern of hydrogen bonded residue “ladders” are contributed by the residues of index range 1–7 (the first β-strand) and 12 to the end of the subrepeat (the second β-strand) with the neighbouring subrepeat at the same residue positions. The residue index here is numbered on each subrepeat. For instance, the 14th residue ALA on each subrepeat form hydrogen bonds with the ALA14 from the neighboring subrepeat. Given the aggregated model we obtained from the simulation as shown in the Fig 6 and the Figure G in S1 File, for R1, R3 and R5, either the first or the second β-strand could “stack” onto the CsgA fibril with the conserved hydrogen bonding pattern. However, as to R4, the residue 12–14 is in the turn position instead of β–strand as the other subrepeats, it could potentially prevent the formation of hydrogen bonds with the same pattern.

A natural next step would be to extend our simulations to three chains, to evaluate whether the trimeric states follow the same pattern, and perhaps hint at the assembly into the full aggregated state. We have initiated work in this direction, but have so far been unsuccessful in obtaining converged estimates using an unbiased simulation approach. Even attempts at restricting two copies to a fully formed dimeric state to probe the association of the third failed to provide conclusive results. Incorporating bias terms from experimental data to drive the system between states of interest might be a fruitful path forward in this respect [58, 59]. Overall, our results indicate, however, that even an approach limited to two-chain simulations, when fully converged, can potentially provide valuable insights into the early stages of amyloidgenesis, by quantifying the formation of essential contacts that drive the aggregation process. In particular, such contact information is an important complement to the site-specific aggregation propensities that can be obtained using more standard sequence-based aggregation prediction algorithms.

Methods

All simulations were conducted using the PROFASI force field [32]. This implicit solvent force field has previously been used to characterize the folding and refolding thermodynamic properties of various proteins, and has succesfully been applied to aggregation studies [32]. The overall energy potential is composed of four terms:

E=Eloc+Eev+Ehb+Esc, (1)

where Eloc represents the local interactions between atoms connected by only a few covalent bonds, while Eev, Ehb and Esc take into account the non-local character. Eev is the energy term of exclude volume effect. Ehb is the hydrogen bond potential and Esc represents charge-charge interaction between sidechains. For further details, we refer to the original description of the model [32, 60].

The software framework PROFASI [33] is used to carry out the Monte Carlo (MC) simulation, using five different kinds of conformational updates: the pivot move, biased Gaussian steps (semi-local move) [61], side chain move, rotation move and translation move. Default settings are used for the weights of the updates. In PROFASI, by design, the bond lengths of covalent bonds and the bond angle between adjacent covalent bonds are kept fixed, leaving torsion angles in the backbone and side chains as the only degrees of freedom.

The monomer simulations were run using a standard Metropolis-Hastings Monte Carlo setup at 300K. For the dimer simulations we experienced severe convergence problems and experimented with various enhanced sampling algorithms in our efforts to obtain reliable free energy estimates. One method that turned out to be particularly well suited for this problem is a recently developed generalized ensemble technique based on flattening the energy-specific diffusion profile [62, 63]. Below, we briefly introduce the method and compare its performance to the commonly used parallel tempering (replica exchange) technique.

Diffusion-optimized ensembles

In generalized ensemble techniques, the standard Boltzmann expression is replaced with a weight function w(E). Various proposals for choices of efficient weight functions have been presented in the literature [6466]. One of the most common strategies is the multicanonical form:

w(E)=1/g(E), (2)

where g(E) is the density of state. Thus, the energy histogram converges to a flat distribution. However, a flat histogram is not the optimal choice if one wishes to minimize the first passage time (τ) between the minimum and maximum energies (Emin and Emax) [63]. If we assume that the dynamics can be described as one-dimensional diffusion along the reaction coordinate (in our case, along the energy E), then the mean first-passage time τ for “round trips” that cross the whole energy space is

τ=EminEmax1D(E)P(E)dE, (3)

where D(E) is the position-dependent diffusion coefficient along the reaction coordinate E and P(E) is the probability distribution of energy.

Eq 3 obtains its minimum when

Popt(E)=1D(E), (4)

with

D(E)=πΔtZC(E)2ZH(E)2, (5)

in which Δt is the sample interval, ZH(E) is the ordinary energy histogram and ZC(E) is the transition histogram. Specifically, given the simulation trajectory of Ei = E(iΔt) (i = 1,2 …), ZH(E) = NE where N is the total number of samples (Ei) that drop into the same bin of the width ΔE, ZC(E) is the number of transitions that cross a specific energy “cut”:

ZC(E)=1/2iΘ[(E(iΔt)E)(EE(iΔt+Δt))], (6)

where Θ is the Heaviside step function. In practice, the optimized weights are estimated through a feedback iteration,

lnwn(E)=lnwn1(E)lnZC(E), (7)

where wn−1 and wn are the old and new weights respectively. After this, the Boltzmann statistics can be recovered using reweighting techniques [67]. The details of calculating the free energy are introduced in the previous study [68]. All the monomer and dimer simulations we carried out were started from fully extended structures.

In our simulation of dimers, the initial runs of 1 × 108 MC sweeps (one sweep is equal to the number of degrees of freedom in the system) of each system are carried out to generate the optimized weight by the Eq 7. The diffusion-optimized distributions are illustrated in Fig 7.

Fig 7. Energy distribution of R1-R5 after the feedback (Eq 7).

Fig 7

Using the optimized weights for the simulation, we observe a dramatic speedup of sampling in terms of the number of tunneling events within the same simulation time (Fig 8, Table 1). Compared to the parallel tempering (PT) technique, the diffusion-optimized ensemble approach consistently gains speedup from 30% to 460% for different subrepeats. Both kinds of the simulations are carried on 16 parallel replicas within the same temperature range 279−367 K. In our setup, the PT method uses the default settings of the software package PROFASI [33], where the 16 temperatures are chosen according to a geometric distribution in the temperature range 279−367 K. We note that it has been shown that the efficiency of PT can be optimized considerably using diffusion considerations similar to those described above [69]. In our case, the diffusion optimized ensemble is carried out in a generalized ensemble framework, using the MUNINN [68, 70] software package.

Fig 8. (a) Representative simulation trajectories (of R1 dimer) of energy by the PT method and by (b) the diffusion-optimized ensemble method.

Fig 8

Table 1. Round-trip speed comparison.

Repeat Num of tunneling events (PT) Num of tunneling events (Diffusion Optimized)
R1 9 50
R2 19 65
R3 55 82
R4 14 54
R5 17 22

Supporting Information

S1 File. Comparison of secondary structure propensities between calculation and experiment.

We show the temperature dependent helical fraction of the (AAQAA)3 peptide obtained from previous experiment and simulations using PROFASI force field (Figure A). Free energy of R1 R5 monomer at the 300K projected to the reaction coordinates of the β-strand content (beta) and the α-helix content (alpha) (Figure B). Residue contact probability map for R1–R5 (a-e) monomer at 300K. The solid white shows the diagonal of a 19×19 matrix in each plot. The minimum residue distance is 5. Contact maps were constructed based on residue pairs with CA-CA distance below 8Å, containing the percentage of time that this contact is formed during a simulation (Figure C). Mean and error estimated for the temperature dependent population of the dimer phase (main text Fig 4). Since we have 8 independent simulations for each system, the mean and error here are estimated with the Jackknife method by excluding samples from one simulation at a time (Figure D). Free energy of binding for R1 R5 dimer at the 300K(main text Fig 3) projected to the reaction coordinate D (a) and β-strand content (b). The dashed lines in (a) correspond to D = 10 Å and 15 Å. The dashed line in (b) corresponds to beta (β-strand content) = 0.5 (Figure E). Temperature dependent population of the dimer phase with the D < 10Å(a) and Temperature dependent population of the dimer phase with D < 15Å and β-strand content > 50%(b) (Figure F). The detailed version of the dimer states of R1 (a), R3 (b), R4 (c) and R5 (d) in the main text Fig 6. Nitrogen donor and oxygen acceptor involved in the backbone hydrogen-bonding are illustrated in blue and red, respectively. The intra- and inter- chain hydrogen bonds are shown as yellow dashed line. All the sidechain atoms are not displayed here. The models are drawn with PyMOL (Figure G).

(DOCX)

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

This work was supported by the Lundbeck Foundation, Villum Foundation, Carlsberg Foundation and Novo Nordisk Foundation.

References

  • 1.Chiti F, Dobson CM. Protein misfolding, functional amyloid, and human disease. Annual Review of Biochemistry. 2006;75:333–66. [DOI] [PubMed] [Google Scholar]
  • 2.Ross CA, Poirier MA. Protein aggregation and neurodegenerative disease. Nature Medicine. 2004;10:S10–S7. [DOI] [PubMed] [Google Scholar]
  • 3.Otzen D, Nielsen PH. We find them here, we find them there: functional bacterial amyloid. Cellular and Molecular Life Sciences. 2008;65(6):910–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Fowler DM, Koulov AV, Balch WE, Kelly JW. Functional amyloid–from bacteria to humans. Trends in biochemical sciences. 2007;32(5):217–24. [DOI] [PubMed] [Google Scholar]
  • 5.Dueholm MS, Larsen P, Finster K, Stenvang MR, Christiansen G, Vad BS, et al. The Tubular Sheaths Encasing Methanosaeta thermophila Filaments Are Functional Amyloids. Journal of Biological Chemistry. 2015;290(33):20590–600. 10.1074/jbc.M115.654780 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Shewmaker F, McGlinchey RP, Wickner RB. Structural insights into functional and pathological amyloid. Journal of Biological Chemistry. 2011;286(19):16533–40. 10.1074/jbc.R111.227108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hufnagel DA, Tükel Ç, Chapman MR. Disease to dirt: the biology of microbial amyloids. PLoS pathogens. 2013;9(11):e1003740 10.1371/journal.ppat.1003740 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Trusova VM. Protein Fibrillar Nanopolymers: Molecular-Level Insights into Their Structural, Physical and Mechanical Properties. Biophysical Reviews and Letters. 2015;10(03):135–56. [Google Scholar]
  • 9.Tükel Ç, Wilson RP, Nishimori JH, Pezeshki M, Chromy BA, Bäumler AJ. Responses to amyloids of microbial and host origin are mediated through toll-like receptor 2. Cell host & microbe. 2009;6(1):45–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Horvath I, Weise CF, Andersson EK, Chorell E, Sellstedt M, Bengtsson C, et al. Mechanisms of protein oligomerization: inhibitor of functional amyloids templates α-synuclein fibrillation. Journal of the American Chemical Society. 2012;134(7):3439–44. 10.1021/ja209829m [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Evans ML, Chapman MR. Curli biogenesis: order out of disorder. Biochimica et Biophysica Acta (BBA)-Molecular Cell Research. 2014;1843(8):1551–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Landreh M, Rising A, Presto J, Jörnvall H, Johansson J. Specific chaperones and regulatory domains in control of amyloid formation. Journal of Biological Chemistry. 2015;290(44):26430–6. 10.1074/jbc.R115.653097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Evans ML, Chorell E, Taylor JD, Åden J, Götheson A, Li F, et al. The Bacterial Curli system possesses a potent and selective inhibitor of amyloid formation. Molecular cell. 2015;57(3):445–55. 10.1016/j.molcel.2014.12.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Olsén A, Jonsson A, Normark S. Fibronectin binding mediated by a novel class of surface organelles on Escherichia coll. Nature. 1989;(338):4. [DOI] [PubMed] [Google Scholar]
  • 15.Blanco LP, Evans ML, Smith DR, Badtke MP, Chapman MR. Diversity, biogenesis and function of microbial amyloids. Trends in microbiology. 2012;20(2):66–73. 10.1016/j.tim.2011.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Barnhart MM, Chapman MR. Curli biogenesis and function. Annual review of microbiology. 2006;60:131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Schubeis T, Yuan P, Ahmed M, Nagaraj M, van Rossum BJ, Ritter C. Untangling a Repetitive Amyloid Sequence: Correlating Biofilm‐Derived and Segmentally Labeled Curli Fimbriae by Solid‐State NMR Spectroscopy. Angewandte Chemie International Edition. 2015. [DOI] [PubMed] [Google Scholar]
  • 18.Tian P, Boomsma W, Wang Y, Otzen DE, Jensen MH, Lindorff-Larsen K. Structure of a Functional Amyloid Protein Subunit Computed Using Sequence Variation. Journal of the American Chemical Society. 2014;137(1):22–5. 10.1021/ja5093634 [DOI] [PubMed] [Google Scholar]
  • 19.Wang X, Smith DR, Jones JW, Chapman MR. In vitro polymerization of a functional Escherichia coli amyloid protein. Journal of Biological Chemistry. 2007;282(6):3713–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wang X, Hammer ND, Chapman MR. The molecular basis of functional bacterial amyloid polymerization and nucleation. Journal of Biological Chemistry. 2008;283(31):21530–9. 10.1074/jbc.M800466200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wang X, Zhou Y, Ren J-J, Hammer ND, Chapman MR. Gatekeeper residues in the major curlin subunit modulate bacterial amyloid fiber biogenesis. Proceedings of the National Academy of Sciences. 2010;107(1):163–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Cherny I, Rockah L, Levy-Nissenbaum O, Gophna U, Ron EZ, Gazit E. The formation of Escherichia coli curli amyloid fibrils is mediated by prion-like peptide repeats. Journal of molecular biology. 2005;352(2):245–52. [DOI] [PubMed] [Google Scholar]
  • 23.Lipfert J, Franklin J, Wu F, Doniach S. Protein misfolding and amyloid formation for the peptide GNNQQNY from yeast prion protein Sup35: simulation by reaction path annealing. Journal of molecular biology. 2005;349(3):648–58. [DOI] [PubMed] [Google Scholar]
  • 24.Nguyen HD, Hall CK. Molecular dynamics simulations of spontaneous fibril formation by random-coil peptides. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(46):16180–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Urbanc B, Cruz L, Ding F, Sammond D, Khare S, Buldyrev S, et al. Molecular dynamics simulation of amyloid β dimer formation. Biophysical journal. 2004;87(4):2310–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Buchete N-V, Tycko R, Hummer G. Molecular dynamics simulations of Alzheimer's β-amyloid protofilaments. Journal of molecular biology. 2005;353(4):804–21. [DOI] [PubMed] [Google Scholar]
  • 27.Auer S, Dobson CM, Vendruscolo M. Characterization of the nucleation barriers for protein aggregation and amyloid formation. HFSP journal. 2007;1(2):137–46. 10.2976/1.2760023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Kim J, Lee H, Lee JH, Kwon D-y, Genovesio A, Fenistein D, et al. Dimerization, Oligomerization, and Aggregation of Human Amyotrophic Lateral Sclerosis Copper/Zinc Superoxide Dismutase 1 Protein Mutant Forms in Live Cells. Journal of Biological Chemistry. 2014;289(21):15094–103. 10.1074/jbc.M113.542613 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Roostaee A, Beaudoin S, Staskevicius A, Roucou X. Aggregation and neurotoxicity of recombinant alpha-synuclein aggregates initiated by dimerization. Molecular Neurodegeneration. 2013;8(5). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Roostaee A, Côté S, Roucou X. Aggregation and amyloid fibril formation induced by chemical dimerization of recombinant prion protein in physiological-like conditions. Journal of Biological Chemistry. 2009;284(45):30907–16. 10.1074/jbc.M109.057950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Chi EY, Krishnan S, Randolph TW, Carpenter JF. Physical stability of proteins in aqueous solution: mechanism and driving forces in nonnative protein aggregation. Pharmaceutical research. 2003;20(9):1325–36. [DOI] [PubMed] [Google Scholar]
  • 32.Irbäck A, Mitternacht S, Mohanty S. An effective all-atom potential for proteins. BMC Biophysics. 2009;2(1):2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Irbäck A, Mohanty S. PROFASI: a Monte Carlo simulation package for protein folding and aggregation. Journal of computational chemistry. 2006;27(13):1548–55. [DOI] [PubMed] [Google Scholar]
  • 34.Valentin JB, Andreetta C, Boomsma W, Bottaro S, Ferkinghoff‐Borg J, Frellsen J, et al. Formulation of probabilistic models of protein structure in atomic detail using the reference ratio method. Proteins: Structure, Function, and Bioinformatics. 2014;82(2):288–99. [DOI] [PubMed] [Google Scholar]
  • 35.Olsson S, Vögeli BR, Cavalli A, Boomsma W, Ferkinghoff-Borg J, Lindorff-Larsen K, et al. Probabilistic determination of native state ensembles of proteins. Journal of Chemical Theory and Computation. 2014;10(8):3484–91. 10.1021/ct5001236 [DOI] [PubMed] [Google Scholar]
  • 36.Boomsma W, Tian P, Frellsen J, Ferkinghoff-Borg J, Hamelryck T, Lindorff-Larsen K, et al. Equilibrium simulations of proteins using molecular fragment replacement and NMR chemical shifts. Proceedings of the National Academy of Sciences. 2014;111(38):13852–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Boomsma W, Frellsen J, Harder T, Bottaro S, Johansson KE, Tian P, et al. PHAISTOS: A framework for Markov Chain Monte Carlo simulation and inference of protein structure. Journal of computational chemistry. 2013;34(19):1697–705. 10.1002/jcc.23292 [DOI] [PubMed] [Google Scholar]
  • 38.Mitternacht S, Staneva I, Härd T, Irbäck A. Comparing the folding free‐energy landscapes of Aβ42 variants with different aggregation properties. Proteins: Structure, Function, and Bioinformatics. 2010;78(12):2600–8. [DOI] [PubMed] [Google Scholar]
  • 39.Mitternacht S, Staneva I, Härd T, Irbäck A. Monte Carlo study of the formation and conformational properties of dimers of Aβ42 variants. Journal of molecular biology. 2011;410(2):357–67. 10.1016/j.jmb.2011.05.014 [DOI] [PubMed] [Google Scholar]
  • 40.Irbäck A, Mitternacht S. Spontaneous β‐barrel formation: An all‐atom Monte Carlo study of Aβ16–22 oligomerization. Proteins: Structure, Function, and Bioinformatics. 2008;71(1):207–14. [DOI] [PubMed] [Google Scholar]
  • 41.Jónsson SÆ, Mohanty S, Irbäck A. Distinct phases of free α‐synuclein—A Monte Carlo study. Proteins: Structure, Function, and Bioinformatics. 2012;80(9):2169–77. [DOI] [PubMed] [Google Scholar]
  • 42.Bille A, Jónsson SÆ, Akke M, Irbäck A. Local unfolding and aggregation mechanisms of SOD1: a Monte Carlo exploration. The Journal of Physical Chemistry B. 2013;117(31):9194–202. 10.1021/jp404500b [DOI] [PubMed] [Google Scholar]
  • 43.Lindorff-Larsen K, Maragakis P, Piana S, Eastwood MP, Dror RO, Shaw DE. Systematic validation of protein force fields against experimental data. PloS one. 2012;7(2):e32131 10.1371/journal.pone.0032131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Frishman D, Argos P. Knowledge‐based protein secondary structure assignment. Proteins: Structure, Function, and Bioinformatics. 1995;23(4):566–79. [DOI] [PubMed] [Google Scholar]
  • 45.Han W, Wu Y-D. A strand-loop-strand structure is a possible intermediate in fibril elongation: Long time simulations of amyloid-β peptide (10–35). Journal of the American Chemical Society. 2005;127(44):15408–16. [DOI] [PubMed] [Google Scholar]
  • 46.Sandal M, Valle F, Tessari I, Mammi S, Bergantino E, Musiani F, et al. Conformational equilibria in monomeric α-synuclein at the single-molecule level. PLoS biology. 2008;6(1):e6 10.1371/journal.pbio.0060006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Kim HY, Heise H, Fernandez CO, Baldus M, Zweckstetter M. Correlation of Amyloid Fibril β‐Structure with the Unfolded State of α‐Synuclein. Chembiochem. 2007;8(14):1671–4. [DOI] [PubMed] [Google Scholar]
  • 48.Tiernan TM, Nur AO, Dainan Z, Aleksey L, Brian ON, Sara L, et al. Abeta dimers differ from monomers in structural propensity, aggregation paths and population of synaptotoxic assemblies. Biochemical Journal. 2014;461(3):413–26. 10.1042/BJ20140219 [DOI] [PubMed] [Google Scholar]
  • 49.Hess B, Kutzner C, Van Der Spoel D, Lindahl E. GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. Journal of chemical theory and computation. 2008;4(3):435–47. 10.1021/ct700301q [DOI] [PubMed] [Google Scholar]
  • 50.Watson D, Castaño E, Kokjohn TA, Kuo Y-M, Lyubchenko Y, Pinsky D, et al. Physicochemical characteristics of soluble oligomeric A β and their pathologic role in Alzheimer's disease. Neurological research. 2005;27(8):869–81. [DOI] [PubMed] [Google Scholar]
  • 51.Nagy-Smith K, Moore E, Schneider J, Tycko R. Molecular structure of monomorphic peptide fibrils within a kinetically trapped hydrogel network. Proceedings of the National Academy of Sciences. 2015;112(32):9816–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Wei G, Mousseau N, Derreumaux P. Computational simulations of the early steps of protein aggregation. Prion. 2007;1(1):3–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Buchete N-V. Unlocking the atomic-level details of amyloid fibril growth through advanced biomolecular simulations. Biophysical journal. 2012;103(7):1411–3. 10.1016/j.bpj.2012.08.052 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Morriss-Andrews A, Shea J-E. Simulations of protein aggregation: Insights from atomistic and coarse-grained models. The Journal of Physical Chemistry Letters. 2014;5(11):1899–908. 10.1021/jz5006847 [DOI] [PubMed] [Google Scholar]
  • 55.Auer S, Trovato A, Vendruscolo M. A condensation-ordering mechanism in nanoparticle-catalyzed peptide aggregation. PLoS Computational Biology. 2009;5(8):e1000458 10.1371/journal.pcbi.1000458 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Dobson CM. Protein folding and misfolding. Nature. 2003;426(6968):884–90. [DOI] [PubMed] [Google Scholar]
  • 57.Benzinger TL, Gregory DM, Burkoth TS, Miller-Auer H, Lynn DG, Botto RE, et al. Propagating structure of Alzheimer’s β-amyloid (10–35) is parallel β-sheet with residues in exact register. Proceedings of the National Academy of Sciences. 1998;95(23):13407–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Han W, Schulten K. Fibril Elongation by Aβ17–42: Kinetic Network Analysis of Hybrid-Resolution Molecular Dynamics Simulations. Journal of the American Chemical Society. 2014;136(35):12450–60. 10.1021/ja507002p [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Baftizadeh F, Pietrucci F, Biarnés X, Laio A. Nucleation process of a fibril precursor in the C-terminal segment of amyloid-β. Physical review letters. 2013;110(16):168103 [DOI] [PubMed] [Google Scholar]
  • 60.Irbäck A, Mohanty S. Folding thermodynamics of peptides. Biophysical journal. 2005;88(3):1560–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Favrin G, Irbäck A, Sjunnesson F. Monte Carlo update for chain molecules: biased Gaussian steps in torsional space. The Journal of Chemical Physics. 2001;114(18):8154–8. [Google Scholar]
  • 62.Jiang P, Yaşar F, Hansmann UH. Sampling of Protein Folding Transitions: Multicanonical Versus Replica Exchange Molecular Dynamics. Journal of chemical theory and computation. 2013;9(8):3816–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Trebst S, Huse DA, Troyer M. Optimizing the ensemble for equilibration in broad-histogram Monte Carlo simulations. Physical Review E. 2004;70(4):046701. [DOI] [PubMed] [Google Scholar]
  • 64.Berg BA, Neuhaus T. Multicanonical algorithms for first order phase transitions. Physics Letters B. 1991;267(2):249–53. [Google Scholar]
  • 65.Hesselbo B, Stinchcombe RB. Monte Carlo simulation and global optimization without parameters. arXiv preprint cond-mat/9501114. 1995. [DOI] [PubMed]
  • 66.Lee J. New Monte Carlo algorithm: entropic sampling. Physical Review Letters. 1993;71(2):211 [DOI] [PubMed] [Google Scholar]
  • 67.Ferrenberg AM, Swendsen RH. Optimized monte carlo data analysis. Physical Review Letters. 1989;63(12):1195 [DOI] [PubMed] [Google Scholar]
  • 68.Frellsen J. Probabilistic methods in macromolecular structure prediction.: Ph.D. thesis, University of Copenhagen; 2011.
  • 69.Nadler W, Meinke JH, Hansmann UH. Folding proteins by first-passage-times-optimized replica exchange. Physical Review E. 2008;78(6):061905. [DOI] [PubMed] [Google Scholar]
  • 70.Ferkinghoff-Borg J. Optimized Monte Carlo analysis for generalized ensembles. The European Physical Journal B-Condensed Matter and Complex Systems. 2002;29(3):481–4. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 File. Comparison of secondary structure propensities between calculation and experiment.

We show the temperature dependent helical fraction of the (AAQAA)3 peptide obtained from previous experiment and simulations using PROFASI force field (Figure A). Free energy of R1 R5 monomer at the 300K projected to the reaction coordinates of the β-strand content (beta) and the α-helix content (alpha) (Figure B). Residue contact probability map for R1–R5 (a-e) monomer at 300K. The solid white shows the diagonal of a 19×19 matrix in each plot. The minimum residue distance is 5. Contact maps were constructed based on residue pairs with CA-CA distance below 8Å, containing the percentage of time that this contact is formed during a simulation (Figure C). Mean and error estimated for the temperature dependent population of the dimer phase (main text Fig 4). Since we have 8 independent simulations for each system, the mean and error here are estimated with the Jackknife method by excluding samples from one simulation at a time (Figure D). Free energy of binding for R1 R5 dimer at the 300K(main text Fig 3) projected to the reaction coordinate D (a) and β-strand content (b). The dashed lines in (a) correspond to D = 10 Å and 15 Å. The dashed line in (b) corresponds to beta (β-strand content) = 0.5 (Figure E). Temperature dependent population of the dimer phase with the D < 10Å(a) and Temperature dependent population of the dimer phase with D < 15Å and β-strand content > 50%(b) (Figure F). The detailed version of the dimer states of R1 (a), R3 (b), R4 (c) and R5 (d) in the main text Fig 6. Nitrogen donor and oxygen acceptor involved in the backbone hydrogen-bonding are illustrated in blue and red, respectively. The intra- and inter- chain hydrogen bonds are shown as yellow dashed line. All the sidechain atoms are not displayed here. The models are drawn with PyMOL (Figure G).

(DOCX)

Data Availability Statement

All relevant data are within the paper and its Supporting Information files.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES