Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Feb 16.
Published in final edited form as: J Phys Chem B. 2016 Oct 27;120(44):11395–11404. doi: 10.1021/acs.jpcb.6b08678

Refining Disordered Peptide Ensembles with Computational Amide I Spectroscopy: Application to Elastin-Like Peptides

Mike Reppert †,, Anish R Roy , Jeremy O B Tempkin , Aaron R Dinner , Andrei Tokmakoff †,*
PMCID: PMC5312701  NIHMSID: NIHMS849093  PMID: 27736076

Abstract

The characterization of intrinsically disordered protein (IDP) ensembles is complicated both by inherent heterogeneity and by the fact that many common experimental techniques function poorly when applied to IDPs. For this reason, the development of alternative structural tools for probing IDP ensembles has attracted considerable attention. Here we describe our recent work in developing experimental and computational tools for characterizing IDP ensembles using Amide I (backbone carbonyl stretch) vibrational spectroscopy. In this approach, the infrared (IR) absorption frequencies of isotope-labeled amide bonds probe their local electrostatic environments and structures. Empirical frequency maps allow us to use this spectroscopic data as a direct experimental test of atomistic structural models. We apply these methods to a family of short elastin-like peptides (ELPs), fragments of the elastin protein based around the Pro-Gly turn motif characteristic of the elastomeric segments of the full protein. Using a maximum entropy analysis of experimental spectra on the basis of predicted spectra from molecular dynamics (MD) ensembles, we find that peptides with Ala or Val sidechains preceding the Pro-Gly turn unit exhibit a stronger tendency toward extended structures than do Gly-Pro-Gly motifs, suggesting an important role for steric interactions in tuning the molecular properties of elastin.

Introduction

Over the last half-century, structural biology has played a pivotal role in our ability to understand and control biological systems at the molecular level. With the continued development of high-throughput structural methods like X-ray crystallography and nuclear magnetic resonance (NMR), the importance of these structural methods in biology and medicine is likely only to increase. Standard structural tools are often difficult to apply, however, when the function of the protein of interest depends explicitly on the absence of a unique native structure.13 A prototypical example of such an intrinsically disordered protein (IDP) is the mammalian structural protein elastin that lends elasticity to skin, lungs, and other connective tissues.4 Although the molecular mechanisms of the process are not well understood, the presence of structural heterogeneity—that is, the availability of a large number of energetically similar structures at many different total protein extension lengths—appears to play a central role in the elastic function of the system.4,5

The characterization of such a disordered conformational ensemble is difficult for two reasons. First, the problem is inherently complicated by the sheer number of distinct structures involved. Second, at a more technical level, many classic structure-determination methods are either inapplicable (e.g., X-ray crystallography) or more difficult to interpret (e.g., NMR) when applied to IDPs. 6,7

Among the tools available to us, NMR spectroscopy provides by far the most comprehensive method for IDP ensemble characterization. Even for NMR, however, the interpretation of measured signals is complicated by motional narrowing when the intrinsic measurement time scale of the NMR experiment exceeds the conformational exchange time between IDP structures. In (13C) NMR spectroscopy, frequency differences between structures typically occur over a range of tens of kHz, giving rise to a peak coalescence time (or “shutter speed”) on the order of a microsecond.7,8 For stably folded proteins, this microsecond cutoff presents no obstacle since large-scale structural rearrangements typically occur on a time scale of milliseconds or longer. The local conformational fluctuations so abundant in IDPs, however, occur on the much shorter scale of nanoseconds to microseconds, complicating the analysis of NMR data on disordered systems.911

As a result of these technical complications, interest has increased in the development of new structural tools with faster intrinsic measurement time scales. One such technique is ultrafast vibrational spectroscopy, with a particular focus on the Amide I (backbone carbonyl stretch) vibration of peptides and proteins.12,13 Experimentally, Amide I frequency differences Δν̄ are typically on the order of tens of cm−1, corresponding to a coalescence time of a few ps, an increase in time resolution of nearly 6 orders of magnitude relative to 13C NMR chemical shift measurements.8

Conversely, however, the correspondingly short vibrational lifetime (~1.3 ps for Amide I vibrations14) produces significant peak broadening in IR absorption spectra, with severe spectral congestion often complicating the interpretation of experimental data. Experimentally, this congestion can be partially eliminated through isotope-labeling strategies.1522 A 13C or 13C18O isotope label introduced into an amide bond decreases the corresponding Amide I mode frequency by ~43 or ~65 cm−1. This shift places the isotope-labeled absorption peak out of resonance with the main Amide I peak from the remainder of the peptide, allowing individual peptide groups to be monitored independently.16,23,24

This spectroscopic data translates into structural information thanks to the acute sensitivity of Amide I vibrational frequencies to their local electrostatic environment. These Stark-like frequency shifts provide a quantifiable metric for local structural factors such as solvent exposure and hydrogen bond (HB) count.15,2527 As a rule of thumb, each HB donated to an amide carbonyl group reduces its vibrational frequency by roughly 16 cm−1.

To facilitate a quantitative connection between experimental spectroscopy and protein structure, our group, along with many others, has been developing spectroscopic maps that translate structures from molecular dynamics (MD) simulations into predicted Amide I spectra.17,25,2832 By parameterizing electrostatic frequency shift coefficients directly against experimental data, Amide I vibrational frequencies may be predicted within an accuracy of a few cm−1 for both short dipeptide fragments25 and isotope labels embedded within larger proteins17,32. Although these models are expected to improve further, the predictability of Amide I spectra has reached a point where the method can be seriously evaluated as a tool to probe disordered peptide ensembles.

With these considerations in mind, we set out to apply Amide I spectroscopy to the structural analysis of a family of short (eight-residue) elastin-like peptides (ELPs), fragments of the disordered elastin protein introduced earlier. Although the full elastin protein contains a variety of different sequence motifs, it is believed that much of the protein’s bulk elasticity results from the reversible extensibility of the quasi-repeat motifs APGV and VPGV. These motifs occur approximately seven and thirteen times, respectively, in the ~786-residue human tropoelastin (precursor) sequence, along with numerous other variants of the XPGX motif.33,34 Indeed, low-complexity synthetic polypeptides of the form GVG(VPGVG)n have been found to reproduce many of the properties of wild-type elastin when n is on the order of 200.35 Early studies suggested that the VPGV motif might be a nucleation point for ordered Type II β-turn structures35, although subsequent study demonstrates that the long-range protein structure is disordered, regardless of conformational preferences at the local scale.36 Our interest in these systems is in exploring the molecular origins of elastic function by characterizing the conformational ensembles occupied by short ELP fragments based on the VPGV sequence.

In an earlier study, some of us applied Amide I spectroscopy to three short ELPs with sequences GVGXPGVG with X = G, A, or V.37 For brevity, we refer to these peptides here as GP, AP, and VP, respectively. In combination with spectroscopic modeling of structures from MD simulations using the most advanced spectral map available at that time, our experimental data for the VP peptide suggested a high population of a non-standard variant on the classic Type II β-turn in which two cross-turn HBs are formed between the X4 amide carbonyl and the V7 and G8 amide NH groups.37 Since then, improvements have been made in the reliability of Amide I spectroscopic simulations.17 Furthermore, we have questioned whether harvesting structures by sampling from relatively short MD simulations could bias the assignment of conformers. In particular, our earlier work did not quantitatively account for computational uncertainty, either simulation error or inadequate sampling.

Our objective in the present study is to assess and develop isotope-edited Amide I spectroscopy as a quantitative tool for interpreting and refining ELP conformational ensembles. In this analysis, we aim both to investigate the influence of peptide sequence on ELP structure and to test the degree to which MD force fields (FFs) are able to predict these trends. Experimentally, we use peptides with a 13C-labeled X4P5 Amide I group, allowing full separation from the Amide I main band. In addition, we study a fourth peptide GVGVPVVG (designated simply “VPV”), intentionally designed to be unable to form compact turn structures due to steric interference between the V4 and V6 residues. At the computational level, we compare calculations from two different spectroscopic models, the Jansen/Roy (JR) map30,38 adopted in our previous work and our new, empirically-parameterized single-point field (1F) map.25,32 These simulations make use of extensively sampled MD ensembles from four different force fields (FFs) on each peptide. In order to place our structural assignments on a quantitative footing, we develop a maximum-entropy (ME) method for ensemble refinement.

Materials and Methods

FMOC Synthesis

13C-labeled peptides NH2-GVGVPVVG-COOH, NH2-GVGVPGVG-COOH, NH2-GVGAPGVG-COOH, and NH2-GVGGPGVG-COOH were synthesized at a 0.02 mM scale with 9-fluorenylmethyloxycarbonyl (FMOC) solid phase peptide synthesis. Isotope-labeling was achieved by incorporating 99% enriched 1-13C Gly, Ala, and Val FMOC-protected amino acids (Cambridge Isotopes Laboratories) at the X4 position. Synthesized peptides were purified using reverse phase chromatography and characterized by mass spectrometry, with an estimated purity of the target product greater than 90% for all peptides. See Supporting Information for further details.

FTIR Spectroscopy

Peptides were dissolved in D2O/DCl at proton concentration of pD ≈ 1 and lyophilized overnight to remove residual hydrogen. Under these conditions, the peptides exist with protonated terminal groups ND3+ and COOD, leaving a net charge of +1. For spectroscopy, the samples were again dissolved in D2O/DCl at pD ≈ 1 and placed between two CaF2 windows with a 50 μm spacer. FTIR spectra were recorded at room temperature using a Bruker Tensor 27 FTIR spectrometer at a resolution of 2 cm−1. A background spectrum of D2O/DCl at pD ≈ 1 was taken beforehand and subtracted from the peptide spectra to remove solvent bands. In addition, a linear baseline correction was applied to provide a flat baseline between 1545 and 1800 cm−1. The OD of the main Amide I band was in the range of 0.05–0.15 for all peptides, while the OD of the isotopically labeled band was in the range of 0.01–0.04.

Isotope Label Absorption Profile

For ME calculations, we require the mean and variance of the IR line shape

μ=s(ω)ωdω (1)

and

σ2=s(ω)(ω-μexp)2dω (2)

where s(ω) is the absorption profile of the isotope-labeled XP amide unit, normalized to unit area. The tilde in the definition of σ̃2 highlights that the expectation is calculated with respect to the experimental mean frequency μexp for both experimental and simulated spectra. For experimental spectra σ̃2 is simply the variance. For simulated spectra, the non-standard definition is adopted for consistency with ME reweighting. See Supporting Information for further details. In order to evaluate these integrals accurately for the experimental data, we first performed a (single) Gaussian fit to the red edge of the main band region (1620 cm−1 to 1655 cm−1) of the raw experimental spectrum; this main-band fit was then subtracted from the raw data to isolate the contribution of the isotope-labeled absorption peak. To avoid bias due to baseline drift over the experimental frequency window, we filtered the experimental isotope-labeled spectrum by a ~60 cm−1 logistic function window before normalization. The width and steepness of this window were chosen to ensure a smooth decay of the spectrum to an accurately zeroed baseline outside of the region where the data provides meaningful signal. For consistency, we applied the same logistic function window to simulated spectra when evaluating Eqs. (1) and (2) so that any (weak) distortions induced by this window will be identical in simulated and experimental spectra. See Supporting Information for further details.

Molecular Dynamics Simulations

MD simulations were carried out in the GROMACS 4.6 simulation package39 using the AMBER99sb-ildn,40 GROMOS53a6,41 CHARMM27,42,43 and OPLS-AA44,45 FFs in explicit solvent boxes of ~4,000 water molecules (cubic box dimensions of ~5 nm). The choice of water model was tied to the corresponding FF: TIP4P-Ew46 for AMBER99sb-ildn, SPC47 for GROMOS53a6, SPC/E48 for OPLS-AA, and TIP3P49 for CHARMM27. Additional simulation details are provided in the Supporting Information. For spectroscopic calculations, we ran 400 ns of production data for each peptide/FF combination. In these trajectories, full structural coordinates were sampled at intervals of 20 fs to provide the finely-sampled structural trajectory required for spectroscopic simulations. In addition, we ran 1 μs production trajectories saving only peptide coordinates every 200 fs to better sample backbone conformations. For the CHARMM27 and OPLS-AA FFs, we also ran a third set of production runs of length 1 μs (CHARMM27) or 400 ns (OPLS-AA; 330 ns for the AP peptide). In total, we thus analyzed 1.4 μs of MD data for AMBER99sb-ildn and GROMOS53a6, 2.4 μs for CHARMM27, and ~1.8 μs for OPLS-AA.

Spectroscopic Simulations

Spectroscopic simulations were carried out using a mixed quantum/classical method described elsewhere.17 Briefly, beginning with our 400 ns MD structural trajectories, Amide I frequencies, couplings, and transition dipole moments were calculated frame-by-frame for each amide bond. The resulting excitonic Hamiltonian and dipole trajectory was converted to a simulated vibrational spectrum using either the dynamic wavefunction propagation method of Torii50,51 (long trajectories) or the time-averaging approximation (TAA) of Auer and Skinner52 (10 ps trajectories, unsuitable for wavefunction propagation). For TAA spectra, we use a cosine averaging window (decaying to zero by 725 fs) whose width was set by comparison with numerically exact wavefunction propagation. In both cases, the Amide I lifetime was set to 1.3 ps.

Two different literature electrostatic maps were used to generate spectroscopic parameters (site energies, couplings, and dipoles). The JR map was employed in our previous study.29,30,38 The 1F map is the newest generation of our single-point field models trained to a library of dipeptide fragments.25,32 It differs from our previous 1F map (Ref.25) in the use of modified Glycine charges and TIP3P water model charges (rather than SPC/E) as described in Ref.32 See Supporting Information for further details. Note that with the exception of the GROMOS53a6 calculations, CHARMM27 atomic charges were applied in all spectroscopic calculations (i.e. in post-processing after the MD run). This convention is based on our observation that CHARMM27 charges typically provide better spectroscopic predictions than other FF charges, even when applied to structures culled from other FFs.25,32 Furthermore, this choice allows all three force fields (AMBER99sb-ildn, CHARMM27, and OPLS-AA) to be analyzed using the same spectroscopic map, minimizing bias due to different map parameterizations. (The GROMOS53a6 force field uses united-atom representation for some groups and so cannot be assigned CHARMM27 charges.)

Results

Amide I Absorption Spectroscopy

Our objective in this work is to extract conformational preferences for the GP, AP, VP, and VPV peptides from experimental, isotope-edited Amide I spectra. The starting point for this study is the set of experimental absorption spectra presented in Figure 1. In these spectra, the main band near 1650 cm−1 corresponds to absorption from unlabeled peptide bonds, while the low-frequency peak between 1574 and 1592 cm−1 corresponds absorption from the 13C-labeled X4P5 amide bond. The observed peak frequencies for the X4P5 bond are summarized in Table 1 and increase in the order VPV < VP < AP < GP, with a fairly small shift between the VPV (1573 cm−1) and VP (1576 cm−1) peptides and a large jump in frequency between the AP (1581 cm−1) and GP (1592 cm−1) peptides.

Figure 1.

Figure 1

Amide I absorption spectra for the four 13C-labeled ELPs studied in this work.

Table 1.

Peak frequencies for experimental and simulated ELP isotope-labeled peptides, quoted in cm−1.

GP AP VP VPV
Experiment 1592 1581 1576 1574
1F 1586 1581 1580 1578
JR 1581 1581 1583 1583

Given the conclusion of our previous work that the VP peptide adopts a largely collapsed structure in solution,37 the similarity between the VP and VPV peptide spectra is surprising. As noted above, steric clash between the V4 and V6/V7 sidechains is expected to restrict the VPV peptide to a largely extended ensemble in solution. In this light, the similarity of the VP and VPV spectra suggests an extended conformation for the VP peptide. This conclusion is consistent with other findings in which the absorption bands of isotope-labeled sites shift toward lower frequencies with increased solvent exposure.19 In our previous study, however, we were unable (using OPLS-AA FF simulations and the JR spectroscopic map) to account for the experimental trend for the GP, AP, and VP peptides without assuming a substantial population of two-HB β-turn structures for the VP peptide.37 To investigate the source of this discrepancy, we turn to simulation data.

MD Simulations

To explore these contrasting assignments from an MD perspective, we next carried out 400 ns MD simulations for each peptide in the NVT ensemble using the OPLS-AA FF with explicit SPC/E solvent. As illustrated in Figure 2A, the resulting structural ensembles show a distribution of conformations that shift on average from collapsed structures for the GP peptide toward extended structures for the VP and VPV peptides. A more quantitative view is provided in Figure 2B, where population histograms are presented for each peptide as a function of the cross-turn hydrogen-bonding distance

Figure 2.

Figure 2

Conformational ensembles predicted by 400 ns MD simulations. A: Uniformly sampled structural ensembles aligned around the central proline residue. Structures are colored from red to white to blue with decreasing turn extension distance d. B: Population histograms for the cross-turn distance d of Eq.(3). C. Conformational populations for different hydrogen-bonding configurations to the X4 carbonyl. Indices m/n indicate m peptide-to-peptide HBs and n peptide-to-solvent HBs.

d=rox4-rNV7, (3)

where rOX4 and rNV7 are, respectively, the coordinates of the X4 oxygen atom and the V7 nitrogen (illustrated schematically in Figure 4 below). In a β-turn, these two atoms participate in a HB, as indicated by the histogram peak near d ≈ 3.5 Å for the GP peptide in Figure 2B. In the AP histogram, this 3.5 Å population occurs at nearly equal frequency with a fully-extended population near d ≈ 7 Å. The VP peptide, in contrast, shows a partial preference for the fully-extended 7 Å ensemble, while the VPV peptide adopts-fully extended conformations almost exclusively.

Figure 4.

Figure 4

Simulated isotope-label absorption spectra (white-filled spectra) as a function of turn distance d. Simulated spectra are calculated for 10 ps intervals from our 400 ns MD trajectory and are binned by the average turn distance d over the corresponding 10 ps interval. Structural bins are divided into 1 Å intervals with the darkest curve corresponding to collapsed structures with 2.5 Å < d < 3.5 Å, and the lightest corresponding to extended structures with 7.5 Å < d < 8.5 Å. All spectra are scaled to the same peak intensity. Experimental spectra (color-shaded curves) are shown in the background for reference. Top: Isotope-labeled spectra simulated using the 1F spectroscopic map. Bottom: Isotope-labeled spectra simulated using the JR spectroscopic map. Right: Graphical definition of the turn extension coordinate d used to define conformational bins.

These backbone conformational trends are mirrored in the local solvent exposure at the X4 carbonyl site, as illustrated in Figure 2C. Here pie charts for each peptide present the population of various HB configurations at the X4 site. The labels “m/n” indicate the number of HBs accepted by the X4 oxygen atom from peptide NH groups (m) and solvating water molecules (n). The extended VPV ensemble is composed almost entirely of conformations in which the V4 carbonyl forms one (0/1) or two (0/2) HBs to the surrounding solvent. In only ~1% of sampled conformations does the X4 carbonyl participate in intra-peptide HBs. In contrast, the VP, AP, and GP peptides show increasing populations of 1/0 and 2/0 conformations in which the X4 carbonyl accepts HBs from within the peptide itself, primarily from the V7 and G8 NH groups. Note, however, that the overall HB occupation number—the total number of HBs formed, on average, between the XP carbonyl group and either solvent or peptide—changes only quite modestly from peptide to peptide, increasing by ~10% from GP to VPV. As represented in the pie charts, HBs lost from solvent in the AP and GP peptides are largely replaced by those formed within the peptide itself. The trends observed thus represent a transition in the nature (i.e. peptide-solvent vs. peptide-peptide) rather than the number of XP carbonyl HBs.

Spectroscopic Simulations

Any quantitative comparison between experiment and simulation must rely on the spectroscopic maps described in the introduction. To test our MD conformational ensembles against experimental data, we next performed simulations using the 1F and JR spectroscopic maps. The results are presented in Figure 3. For both maps, the line shape of the main-band region (1600 – 1700 cm−1) is poorly captured by the simulation, likely a result of inaccuracies in our treatment of coupling interactions. As summarized in Table 1, however, the behavior in the isotope-label region (1550 – 1600 cm−1) shows clear differences between the two maps. Although by no means in quantitative agreement with the experimental spectra, the 1F map does capture the correct qualitative trend, a steady decrease in isotope-label peak frequency from 1586 cm−1 for the GP peptide to 1578 cm−1 for the VPV peptide. In contrast, simulations using the JR map predict a slight increase in frequency from GP to VPV.

Figure 3.

Figure 3

Experimental (shaded) and simulated (open) spectra for GP, AP, VP, and VPV peptides. The left-hand panel shows results using the 1F spectroscopic map; the right-hand panel shows results for the JR map. For reference, thin, gray, vertical lines mark 1575 cm−1 in each frame.

These trends are brought out more clearly by dividing the conformational ensemble into distinct structural bins based on the cross-turn distance d. To achieve this, we split up our 400 ns MD trajectory into 10 ps intervals In with n ranging from 1 to 40,000. On this time scale the turn distance d remains relatively static (fluctuations about the interval mean are approximately Gaussian with a standard deviation of ~0.3 Å). For each interval, we calculate an Amide I isotope-label spectrum using the TAA method. To emphasize the isotope-label absorption, in these calculations we set the oscillator strength of all sites other than the XP label to zero; in this way we treat all coupling interactions between sites correctly, while producing a simulated spectrum corresponding only to the absorption of the isotope-labeled unit. Finally, we bin each interval In based on average turn extension, combining their spectra to produce simulated spectra as a function of turn extension. For example, by averaging all spectra whose corresponding 10 ps interval shows an average turn distance between 2.5 Å and 3.5 Å, we obtain simulated spectra for tightly-collapsed, turn-like structures.

Figure 4 presents spectra binned by turn distance for each peptide using both the 1F and JR maps. Interestingly, the two maps display qualitatively different behavior as a function of turn extension. While the 1F map indicates a gradual blue shift of the absorption peak as turn extension decreases corresponding to the loss of the solvent-induced redshift, the JR map shows minimal variation with turn extension. Both maps show distinct behavior for the most tightly-collapsed structures (d < 3.5 Å). For this bin, the 1F map reveals two distinct subpopulations. The red-shifted peak corresponds to the sparsely populated 2/0 population identified earlier in which the NH groups of both V7 and G8 donate HBs to the X4 carbonyl (see Figure 2C). The blue-shifted peak corresponds to the 1/0 bin in which only the V7 NH group donates a HB. In the JR simulations, these bins are not distinguished clearly, with simulated spectra showing only a single broadened, red-shifted peak. Note that the 2/0 conformation is only accessible for extremely short turn distances; hence the absence of a red-shifted peak in bins with larger turn extensions. Likewise, the 0/1 and 0/2 bins fail to give rise to distinct spectral features due to strong solvent disorder; although spectra for 0/1 structures are, on average, blue-shifted from those for 0/2 structures, the spectra are sufficiently broadened by heterogeneity in the XP solvation shell that only a single broadened peak is observed in simulations using either spectroscopic map.

Taken together, these results highlight the critical role that the spectroscopic map plays in analyzing Amide I vibrational spectra. Indeed, the assignment in our previous study37 of a high population of 2/0 turn structures appears to be due largely to the use of the JR map. For our present application, we believe that the 1F map provides a more accurate description of the experimental data. This assessment is based both on the favorable performance of the 1F map in recent isotope-labeling studies of protein standards17 and on the substantially better agreement in the isotope-labeled region displayed in Figure 3 and Figure 4. From this point we focus exclusively on 1F simulations.

Maximum Entropy Analysis

Our primary objective in this analysis is to test and refine predicted MD ensembles by comparison against experimental data. Qualitatively, the dependence of the 1F simulated spectra on turn distance supports the idea that the GP peptide adopts a predominantly collapsed conformational ensemble, while the VP peptide is biased toward more extended structures. To put such statements on a quantitative footing, however, we need an objective method for incorporating experimental constraints into our simulated structural ensemble. For this purpose, we turn to the Maximum Entropy (ME) method, a probability-reweighting scheme advanced by E. T. Jaynes as a means of incorporating external constraints into a probabilistic system with minimal bias.53 In our context, we wish to reweight our MD conformational ensemble to produce a refined ensemble that is consistent with our experimental data. The ME method provides a rigorous means of identifying from all possible experimentally-consistent ensembles the one that deviates minimally from the initial MD ensemble. Given a set of initial (MD-based) estimates pno for the population of each structure n in a conformational ensemble, the ME principle states that the least-biased means of incorporating external constraints into the structural ensemble is to maximize the relative-entropy functional

η[ppo]=-npnlnpnpno (4)

subject to the normalization constraint Σn pn = 1 and any external constraints imposed by the user. In our analysis, the individual structures n that are to be reweighted according to Eq. (4) correspond to the “micro-ensemble” of structures sampled during the 10 ps intervals In rather than to the structure at any single frame in the MD trajectory. In the reweighting process, we constrain the ensemble to match the experiment in both mean frequency μ and variance σ̃2 as defined by Eqs. (1) and (2). The ME reweighting procedure could be carried out with reference only to the 400 ns trajectory we used for spectroscopic simulations. In this case, the intervals In would be assigned equal starting probabilities pno since they are sampled uniformly in time across the simulation. To ensure that our starting ensemble accurately reflects FF predictions, however, we chose to run addition simulations (1.8 μs in total) for each peptide, adjusting the starting probabilities to more accurately reflect the better-sampled structural ensemble. Explicitly, to each interval In, we assign a probability

pno=iInPfull(di)P400ns(di)α(di) (5)

where the sum extends over all structures i within the 10 ps interval In, the distance di is the cross-turn distance of structure i, and the probabilities P400ns (di) and Pfull (di) refer, respectively, to the ensemble frequency of structures with turn distance di in the 400 ns ensemble we used for spectroscopic simulations and in the longer (1.8 μs) reference ensemble. The scaling factor α(di) avoids numerical instability due to the finite sampling of P400ns (di) as described in detail in the Supporting Information. Briefly, α(di) = 1 for those structures whose turn distance di falls into the upper 99.5% of all structures sampled in the 400 ns trajectory; for the 0.5% of structures whose turn distance is sampled less frequently, α(di) is a cosine window interpolating smoothly to P400ns (di)/Pfull (di), so that the weighting factor for these sparsely-sampled structures is unchanged by reference to the longer simulation ensemble. Our final structural ensemble corresponds to the sum of all micro-ensembles from each of the 10 ps intervals In weighted by the probabilities pn. Further details on the numerical implementation of this procedure are provided in the Supporting Information.

The results of this ME refinement procedure are illustrated in Figure 5. On the whole, the changes induced by this ME refinement on our OPLS-AA ensemble are fairly mild. The primary trend is a uniform shift toward more pronounced structural trends, with the GP ensemble shifting toward more collapsed structures in the ME-refined ensemble and the VP ensemble shifting toward more extended structures. Only the VPV peptide shows no discernible shifts in overall turn extension, reflecting the close match between simulated and experimental spectra in the raw MD ensemble.

Figure 5.

Figure 5

ME-refinement results for the OPLS-AA FF. Left: Simulated spectra for each peptide before (dashed) and after (solid) ME refinement. Lightly shaded curves represent raw experimental absorption spectra; dark shaded curves are extracted spectra for the isotope-labeled site. Right: Raw (shaded) and ME-refined (solid black line) turn-distance histograms.

Force Field Dependence

To gain a sense for the reliability of these results, it is instructive to explore the sensitivity of these conclusions to both FF parameters and simulation error. To explore FF effects, we repeated our analysis using three additional FF/water model combinations: CHARMM27/TIP3P, AMBER99sb-ILDN/TIP4P-Ew, and GROMOS53a6/SPC, with total MD simulation times ranging from 1.4 μs (GROMOS53a6 and AMBER99sb-ildn) to 2.4 μs (CHARMM27). As a qualitative benchmark, we note that exchange times between extended and compact conformations typically occurred in our simulations on a time scale of a few tens of nanoseconds, so that a 1 μs trajectory can be expected to sample tens to hundreds of transitions. At a more quantitative level, we carried out a block-averaging analysis to test the adequacy of these trajectories for obtaining a reliable estimate of FF predictions. From this analysis, we estimate a standard error of approximately ±5% in the fraction of extended conformations in the starting MD ensemble (see Supporting Information for further details). To check the sensitivity of our ME analysis to spectroscopic errors, we repeated our analysis under the assumption of various systematic errors in Amide I frequency prediction. A reasonable range of errors for this purpose is ±4 cm−1, the 2σ confidence interval for the 1F map against the original dipeptide data set for which it was parameterized.32

The results of this analysis are summarized in Figure 6 and Table 2. Interestingly, while population predictions vary significantly from FF to FF in the raw MD data, the ME-refined ensembles show a consistent structural trend. The results for the CHARMM27 FF are particularly informative: the predictions of the ME-refined ensembles fairly closely resemble those of the remaining FFs, despite the fact that the raw MD data predicts essentially identical extended populations (45 – 47%) for all three XP ELPs. This increased uniformity in the ME-refined ensembles reflects the influence of the population restraints imposed by the experimental data on our MD ensembles. Significantly, only for the OPLS-AA FF do all raw MD values fall within the error bars of our ME analysis. The effects of the ME reweighting procedure are particularly pronounced for the GP peptide, with initial estimates of the extended population ranging from 44% for OPLS-AA to 63% for AMBER and ME-refined values ranging from 15% (GROMOS) to 33% (OPLS-AA). All four FFs thus appear to underestimate the propensity of this peptide to adopt collapsed turns in solution.

Figure 6.

Figure 6

ME analysis of the fraction of extended conformations in raw and ME-refined MD ensembles for the four force fields investigated. Extended conformations are defined by d > 5 Å. Raw MD fractions are represented by thick black lines. ME-refined values are represented by vertical bars. Error bars reflect uncertainty in spectroscopic predictions. The reported values correspond to the maximum and minimum populations obtained in repeated ME analyses for sys-tematic map errors of −4 cm 1, −3 cm 1, …, +4 cm 1.

Table 2.

ME-refined average turn distance d in Å. Raw force field estimates are in parentheses.

GP AP VP VPV
OPLS 4.6 (4.9) 5.2 (5.4) 5.8 (5.5) 6.6 (6.6)
CHARMM 4.1 (4.9) 4.8 (4.9) 5.1 (4.9) 6.1 (6.1)
AMBER 4.1 (5.4) 4.7 (5.7) 5.3 (5.9) 6.4 (6.4)
GROMOS 4.3 (5.3) 4.9 (5.5) 5.4 (5.6) 6.5 (6.5)

Discussion

The results presented above highlight both important strengths and limitations of isotope-edited Amide I spectroscopy as a tool for ensemble refinement. On the one hand, our experiments impose considerable restraints on the sampled conformational ensembles, drawing out a clear structural trend that would have been difficult to assign with confidence from the raw MD data alone. In this process, the ME method plays the critical role of providing a rigorous framework on which to base the ensemble refinement process.

On the other hand, our results also highlight that even modest spectroscopic simulation errors have a substantial impact on ensemble refinement. Taking the 2 (95%) confidence interval of our dipeptide 1F map as a reasonable measure of spectroscopic uncertainty, we expect our predicted frequencies to be within 4 cm−1 of the experimental result for ~95% of structures. Even this modest error, however, results in variations in the estimated population of extended states as high as ±15% (AMBER FF). At first, this high population error might seem surprising. After all, a frequency prediction error of ±4 cm−1 appears quite low in absolute terms (less than 1% error for a 1600 cm−1 vibration). The significance of this error is more apparent, however, when one considers the limited range over which Amide I frequencies are distributed. Absorption frequencies for isotope-labeled sites typically span a range of around 30 cm−1 in experimental data, from ~1570 cm−1 for strongly hydrogen-bonded species to just above ~1600 cm−1 for solvent-shielded sites. In comparison with this limited dynamic range, a frequency prediction error of ±4 cm−1 appears quite significant, more than a quarter of the total dynamic range. In this light, our observed population assignment errors appear quite reasonable.

In large part, this spectroscopic error should be removable as advances in simulation methods improve our ability to accurately predict Amide I vibrational frequencies. A question of more fundamental interest involves the sensitivity of the ensemble refinement results to errors in the starting ensemble. In short, we wish to know how strong are the constraints that can be imposed by Amide I spectroscopic data. For example, can Amide I spectroscopic constraints “rescue” a bad FF prediction to produce an accurate conformational ensemble even given a poorly sampled or strongly biased starting point?

The FF comparison of Figure 6 provides at least a partial answer. Even in the absence of spectroscopic error, differences between the raw FF ensembles are large enough that the fraction of extended populations predicted for the GP peptide span a range of nearly 20%, from 15% for the GROMOS FF to 33% for OPLS-AA. In this sense, Amide I spectroscopy appears to serve as a relatively weak constraint on peptide structure: a wide range of structural ensembles can be made to reasonably fit the experimental data, leaving only a limited capacity for a priori structure prediction. At a physical level, this lack of discrimination stems from the fact that isotope-edited Amide I spectroscopy is primarily sensitive to short-range electrostatics, and reports only indirectly on larger-scale coordinates such as overall turn extension. Because solvent fluctuations give rise to significant variations in local electrostatic parameters at any given turn extension—and hence significant variation in the instantaneous Amide I frequency—Amide I spectroscopic data will rarely be capable of absolutely excluding any given conformation from a predicted ensemble.

Looking forward, it bears noting that such a weak-constraint situation is well-suited to the Bayesian ensemble refinement approaches.5456 These methods provide a concrete framework both for incorporating multiple weak constraints (e.g. spectroscopic data from multiple labeling sites) into a single structural ensemble and for testing the robustness of the result against experimental and simulation error. For Amide I spectroscopy in particular, multi-site labeling strategies should be particularly powerful once coupling models achieve sufficient accuracy that spectroscopic features like main band line shapes, energy transfer times, and (in multidimensional spectra) cross-peak intensity can be implemented as structural constraints. Multi-site labeling experiments will be particularly critical in the analysis of larger peptide or protein systems where larger-scale structural parameters are of interest (e.g. inter-strand distances or secondary structure content). Ultimately, of course, Amide I spectroscopy will be most effective when used together with other structural tools. In combination with ensemble-averaged constraints from NMR or electron paramagnetic resonance (EPR), Amide I spectroscopy should provide a useful tool for distinguishing between different conformational ensembles that yield identical ensemble averages but different degrees of disorder around the average structure.

Finally, with regard to the molecular properties of elastin, our results provide direct, experimental evidence that the conformational ensembles of ELPs in aqueous solution follow a bimodal distribution between extended and collapsed states. In particular, our finding of a ~40% population of collapsed structures for the VP peptide is in excellent agreement with the solid-state NMR findings of Ohgo et al. and Yao and Hong who assigned populations of 40% and 33%, respectively, to collapsed β-turn-like structures in related VPG-based ELPs.57,58 Our results indicate that these low-temperature results are preserved in aqueous solution and that the AP peptide adopts a similar (but slightly more collapsed) ensemble.

At a mechanistic level, our results support the notion that the molecular properties of elastin are tuned to preserve local conformational disorder. In contrast to the preference of the GP and VPV peptides for collapsed and extended populations, respectively, the AP and VP peptides, which mimic the most common sequence motifs in native elastin, are more evenly split between collapsed and extended conformations. Recent global sequence analysis and mutational studies indicate that elastomeric function is, to a significant extent, a direct result of the periodic “kinking” of the protein chain conformation by PG units.5,59 The conformational restrictions imposed by the proline residue thus disrupt the formation of long-range secondary structural motifs (α-helices or β-sheets), while the flexibility of the adjacent glycine facilitates the thermal accessibility of a large number of distinct conformation states at a wide range of overall extension lengths. Our analysis adds to this discussion the observation that the identity of the residue preceding the PG turn unit appears to tune (likely via steric effects) the average extension of each monomeric unit and thus the persistence length of the bulk polymer. In light of the measured persistence length of only 3.5 Å for native tropoelastin60 (comparable to the turn length for a single XPGX unit), these observations are consistent with a molecular model for elastomeric function in which individual PG units switch independently between extended and turn-like structures, adding up collectively to yield bulk elasticity through an entropic chain mechanism.

In summary, we have explored here the utility of isotope-labeled Amide I spectroscopy as a quantitative tool for IDP conformational analysis. As a test case, we investigate the ensemble refinement of a family of short ELPs based on the elastin-like motifs APGVG and VPGVG. Experimental isotope-labeled data is compared with MD simulations and mixed quantum/classical spectroscopic simulations. We adopt a maximum entropy approach to provide an objective refinement of the conformational ensembles predicted by MD simulations against experimental data. The resulting conformational ensembles for the peptides tested reveal a distinct trend in overall peptide extension lengths in the order GP < AP< VP< VPV. The OPLS-AA FF is found to produce the best agreement with the experimentally-predicted trend, with populations of extended and compact structures falling within the error bars of our analysis. The GROMOS53a6 and AMBER99sb-ildn FFs capture the correct qualitative trend, although the assigned populations require substantial adjustments before reading agreement with experimental data. The raw CHARMM27 predictions differ most strongly from the refinement, assigning essentially identical populations for the GP, AP, and VP peptides. With regard to elastin structure, our findings suggest that steric effects imposed at the monomer level by the residue preceding proline in the characteristic elastin PG turn motif tune the persistence length of the bulk protein. In a broader sense, our results indicate that Amide I vibrational spectroscopy should serve as a valuable tool for ensemble refinement when used in conjunction with complementary methods such as NMR, EPR, and FRET spectroscopies.

Our results offer insight into the intertwined factors involving experimental data, spectroscopic models, and FFs that must be addressed in protein structural analysis. Experiment-based ensemble refinement requires, first of all, conformational sampling that is complete enough to visit all important energy basins. The populations predicted in those basins should be enough to bias against unphysically strained conformers. Maximum entropy or Bayesian refinement tools provide a critical framework incorporating experimental constraints into a refined ensemble. For Amide I, the accuracy of the spectroscopic model (particularly vibrational coupling) seems to be the leading factor to consider to improve confidence in these methods. As spectroscopic models improve, simulations can provide an active role not only in interpreting experimental data, but in guiding what experiments are performed. This feedback between simulation and experiment will be especially helpful in selecting isotope label sites to provide useful constraints for ensemble refinement. Already the current accuracy of our ensemble refinement indicates that these strategies for improving Amide I ensemble refinement will result in a useful tool for analysis of IDPs.

Supplementary Material

SI

Acknowledgments

We thank the National Science Foundation (NSF) (Grant No. CHE-1414486) and the National Institutes of Health (NIH) (Grant No. 1 R01 GM118774-01) for support of this research. We thank the NIH (grant No. 5 R01 GM109455-02) for funding for computational resources that were used in this research. M.R. and J. O. B. T. thank the NSF for Graduate Research Fellowships. This work was completed in part with resources provided by the University of Chicago Research Computing Center.

Abbreviations

IDP

intrinsically disordered protein

IR

infrared

ELP

elastin-like peptide

NMR

nuclear magnetic resonance

FF

force field

HB

hydrogen bond

ME

maximum entropy

EPR

electron paramagnetic resonance

Footnotes

Supporting Information Description

Online Supporting Information includes detailed descriptions of FMOC synthesis, spectral processing, MD simulations, spectroscopic maps, ME reweighting, and block averaging analysis.

References

  • 1.Lee R, Van Der Buljan M, Lang B, Weatheritt RJ, Daughdrill GW, Dunker AK, Fuxreiter M, Gough J, Gsponer J, Jones DT, et al. Classification of Intrinsically Disordered Regions and Proteins. Prog Biophys Mol Biol. 2015;114:6589–6631. doi: 10.1021/cr400525m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Tompa P. Intrinsically Disordered Proteins: A 10-Year Recap. Trends Biochem Sci. 2012;37:509–516. doi: 10.1016/j.tibs.2012.08.004. [DOI] [PubMed] [Google Scholar]
  • 3.Uversky VN, Dunker AK. Understanding Protein Non-Folding. Biochim Biophys Acta - Proteins Proteomics. 2010;1804:1231–1264. doi: 10.1016/j.bbapap.2010.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Green EM, Mansfield JC, Bell JS, Winlove CP. The Structure and Micromechanics of Elastic Tissue. Interface Focus. 2014;4:20130058. doi: 10.1098/rsfs.2013.0058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Rauscher S, Baud S, Miao M, Keeley FW, Pomès R. Proline and Glycine Control Protein Self-Organization into Elastomeric or Amyloid Fibrils. Structure. 2006;14:1667–1676. doi: 10.1016/j.str.2006.09.008. [DOI] [PubMed] [Google Scholar]
  • 6.Uversky VN. Biophysical Methods to Investigate Intrinsically Disordered Proteins: Avoiding an “Elephant and Blind Men” Situation. In: Felli IC, Pierattelli R, editors. Intrinsically Disordered Proteins Studied by NMR Spectroscopy. Vol. 870. Springer International Publishing; Switzerland: 2015. pp. 215–260. [DOI] [PubMed] [Google Scholar]
  • 7.Dyson HJ, Wright PE. Nuclear Magnetic Resonance Methods for Elucidation of Structure and Dynamics in Disordered States. Vol. 339. Elsevier Masson SAS; 2001. [DOI] [PubMed] [Google Scholar]
  • 8.Bryant RG. The NMR Time Scale. J Chem Educ. 1983;60:933–935. [Google Scholar]
  • 9.Eaton Wa, Munoz V, Hagen SJ, Jas GS, Lapidus LJ, Henry ER, Hofrichter J. Fast Kinetics and Mechanisms in Protein Folding. Annu Rev Biophys Biomol Struct. 2000;29:327–359. doi: 10.1146/annurev.biophys.29.1.327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Chung H, Piana-Agostinetti S, Shaw D, Eaton W. Structural Origin of Slow Diffusion in Protein Folding. Science. 2015;349:1504–1510. doi: 10.1126/science.aab1369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lindorff-Larsen K, Piana S, Dror RO, Shaw DE. How Fast-Folding Proteins Fold. Science. 2011;334:517–520. doi: 10.1126/science.1208351. [DOI] [PubMed] [Google Scholar]
  • 12.Baiz C, Reppert M, Tokmakoff A. An Introduction to Protein 2D IR Spectroscopy. In: Fayer MD, editor. Ultrafast Infrared Vibrational Spectroscopy. Taylor & Francis; Boca Raton: 2013. pp. 361–404. [Google Scholar]
  • 13.Barth A, Zscherp C. What Vibrations Tell Us about Proteins. Q Rev Biophys. 2002;35:369–430. doi: 10.1017/s0033583502003815. [DOI] [PubMed] [Google Scholar]
  • 14.Hamm P, Lim M, Hochstrasser RM. Structure of the Amide I Band of Peptides Measured by Femtosecond Nonlinear-Infrared Spectroscopy. J Phys Chem B. 1998;102:6123–6138. [Google Scholar]
  • 15.Decatur SM. Elucidation of Residue-Level Structure and Dynamics of Polypepitdes via Isotope-Edited Infrared Spectroscopy. Acc Chem Res. 2006;39:169–175. doi: 10.1021/ar050135f. [DOI] [PubMed] [Google Scholar]
  • 16.Torres J, Kukol a, Goodman JM, Arkin IT. Site-Specific Examination of Secondary Structure and Orientation Determination in Membrane Proteins: The Peptidic (13)C=(18)O Group as a Novel Infrared Probe. Biopolymers. 2001;59:396–401. doi: 10.1002/1097-0282(200111)59:6<396::AID-BIP1044>3.0.CO;2-Y. [DOI] [PubMed] [Google Scholar]
  • 17.Reppert M, Roy AR, Tokmakoff A. Isotope-Enriched Protein Standards for Computational Amide I Spectroscopy. J Chem Phys. 2015;142:125104/1–10. doi: 10.1063/1.4915271. [DOI] [PubMed] [Google Scholar]
  • 18.Brewer SH, Song B, Raleigh DP, Dyer RB. Residue Specific Resolution of Protein Folding Dynamics Using Isotope-Edited Infrared Temperature Jump Spectroscopy. Biochemistry. 2007;46:3279–3285. doi: 10.1021/bi602372y. [DOI] [PubMed] [Google Scholar]
  • 19.Smith AW, Lessing J, Ganim Z, Peng CS, Tokmakoff A, Roy S, Jansen TLC, Knoester J. Melting of a Beta-Hairpin Peptide Using Isotope-Edited 2D IR Spectroscopy and Simulations. J Phys Chem B. 2010;114:10913–10924. doi: 10.1021/jp104017h. [DOI] [PubMed] [Google Scholar]
  • 20.Remorino A, Hochstrasser RM. Three-Dimensional Structures by Two-Dimensional Vibrational Spectroscopy. Acc Chem Res. 2012;45:1896–1905. doi: 10.1021/ar3000025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Petty Sa, Decatur SM. Intersheet Rearrangement of Polypeptides during Nucleation of β-Sheet Aggregates. Proc Natl Acad Sci U S A. 2005;102:14272–14277. doi: 10.1073/pnas.0502804102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lin YS, Shorb JM, Mukherjee P, Zanni MT, Skinner JL. Empirical Amide I Vibrational Frequency Map: Application to 2D-IR Line Shapes for Isotope-Edited Membrane Peptide Bundles. J Phys Chem B. 2009;113:592–602. doi: 10.1021/jp807528q. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Haris PI, Robillard GT, van Dijk aa, Chapman D. Potential of 13C and 15N Labeling for Studying Protein-Protein Interactions Using Fourier Transform Infrared Spectroscopy. Biochemistry. 1992;31:6279–6284. doi: 10.1021/bi00142a016. [DOI] [PubMed] [Google Scholar]
  • 24.Fang C, Wang J, Charnley AK, Barber-Armstrong W, Smith AB, III, Decatur SM, Hochstrasser RM. Two-Dimensional Infrared Measurements of the Coupling between Amide Modes of an α-Helix. Chem Phys Lett. 2003;382:586–592. [Google Scholar]
  • 25.Reppert M, Tokmakoff A. Electrostatic Frequency Shifts in Amide I Vibrational Spectra: Direct Parameterization against Experiment. J Chem Phys. 2013;138:134116. doi: 10.1063/1.4798938. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Cho M. Correlation between Electronic and Molecular Structure Distortions and Vibrational Properties. I. Adiabatic Approximations. J Chem Phys. 2003;118:3480–3490. [Google Scholar]
  • 27.Schmidt JR, Corcelli SA, Skinner JL. Ultrafast Vibrational Spectroscopy of Water and Aqueous N-Methylacetamide: Comparison of Different Electronic Structure/molecular Dynamics Approaches. J Chem Phys. 2004;121:8887–8896. doi: 10.1063/1.1791632. [DOI] [PubMed] [Google Scholar]
  • 28.Ham S, Kim JH, Lee H, Cho M. Correlation between Electronic and Molecular Structure Distortions and Vibrational Properties. II. Amide I Modes of NMA–nD[sub 2]O Complexes. J Chem Phys. 2003;118:3491–3498. [Google Scholar]
  • 29.la Cour Jansen T, Dijkstra AG, Watson TM, Hirst JD, Knoester J. Modeling the Amide I Bands of Small Peptides. J Chem Phys. 2006;125:44312/1–9. doi: 10.1063/1.2218516. [DOI] [PubMed] [Google Scholar]
  • 30.la Cour Jansen T, Knoester J. A Transferable Electrostatic Map for Solvation Effects on Amide I Vibrations and Its Application to Linear and Two-Dimensional Spectroscopy. J Chem Phys. 2006;124:044502/1–11. doi: 10.1063/1.2148409. [DOI] [PubMed] [Google Scholar]
  • 31.Wang L, Middleton CT, Zanni MT, Skinner JL. Development and Validation of Transferable Amide I Vibrational Frequency Maps for Peptides. J Phys Chem B. 2011;115:3713–3724. doi: 10.1021/jp200745r. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Reppert M, Tokmakoff A. Communication: Quantitative Multi-Site Frequency Maps for Amide I Vibrational Spectroscopy. J Chem Phys. 2015;143:61102. doi: 10.1063/1.4928637. [DOI] [PubMed] [Google Scholar]
  • 33.Indik Z, Yeh H, Ornstein-Goldstein N, Sheppard P, Anderson N, Rosenbloom JC, Peltonen L, Rosenbloom J. Alternative Splicing of Human Elastin mRNA Indicated by Sequence Analysis of Cloned Genomic and Complementary DNA. Proc Natl Acad Sci U S A. 1987;84:5680–5684. doi: 10.1073/pnas.84.16.5680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Foster JA, Bruenger E, Gray WR, Sandberg LB. Isolation and Amino Acid Sequences of Tropoelastin Peptides. J Biol Chem. 1973;248:2876–2879. [PubMed] [Google Scholar]
  • 35.Urry DW. Protein Elasticity Based on Conformations of Sequential Polypeptides: The Biological Elastic Fiber. J Protein Chem. 1984;3:403–436. [Google Scholar]
  • 36.Tamburro AM, Bochicchio B, Pepe A. Dissection of Human Tropoelastin: Exon-By-Exon Chemical Synthesis and Related Conformational Studies. Biochemistry. 2003;42:13347–13362. doi: 10.1021/bi034837t. [DOI] [PubMed] [Google Scholar]
  • 37.Lessing J, Roy S, Reppert M, Baer M, Marx D, Jansen TLC, Knoester J, Tokmakoff A. Identifying Residual Structure in Intrinsically Disordered Systems: A 2D IR Spectroscopic Study of the GVGXPGVG Peptide. J Am Chem Soc. 2012;134:5032–5035. doi: 10.1021/ja2114135. [DOI] [PubMed] [Google Scholar]
  • 38.Roy S, Lessing J, Meisl G, Ganim Z, Tokmakoff A, Knoester J, Jansen TLC. Solvent and Conformation Dependence of Amide I Vibrations in Peptides and Proteins Containing Proline. J Chem Phys. 2011;135:234507/1–11. doi: 10.1063/1.3665417. [DOI] [PubMed] [Google Scholar]
  • 39.Hess B, Kutzner C, van der Spoel D, Lindahl E. GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. J Chem Theory Comput. 2008;4:435–447. doi: 10.1021/ct700301q. [DOI] [PubMed] [Google Scholar]
  • 40.Lindorff-Larsen K, Piana S, Palmo K, Maragakis P, Klepeis JL, Dror RO, Shaw DE. Improved Side-Chain Torsion Potentials for the Amber ff99SB Protein Force Field. Proteins Struct Funct Bioinforma. 2010;78:1950–1958. doi: 10.1002/prot.22711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Oostenbrink C, Villa A, Mark AE, van Gunsteren WF. A Biomolecular Force Field Based on the Free Enthalpy of Hydration and Solvation: The GROMOS Force-Field Parameter Sets 53A5 and 53A6. J Comput Chem. 2004;25:1656–1676. doi: 10.1002/jcc.20090. [DOI] [PubMed] [Google Scholar]
  • 42.Bjelkmar P, Larsson P, Cuendet MA, Hess B, Lindahl E. Implementation of the CHARMM Force Field in GROMACS: Analysis of Protein Stability Effects from Correction Maps, Virtual Interaction Sites, and Water Models. J Chem Theory Comput. 2010;6:459–466. doi: 10.1021/ct900549r. [DOI] [PubMed] [Google Scholar]
  • 43.Mackerell AD, Feig M, Brooks CL. Extending the Treatment of Backbone Energetics in Protein Force Fields: Limitations of Gas-Phase Quantum Mechanics in Reproducing Protein Conformational Distributions in Molecular Dynamics Simulation. J Comput Chem. 2004;25:1400–1415. doi: 10.1002/jcc.20065. [DOI] [PubMed] [Google Scholar]
  • 44.Kaminski GA, Friesner RA, Tirado-Rives J, Jorgensen WL. Evaluation and Reparametrization of the OPLS-AA Force Field for Proteins via Comparison with Accurate Quantum Chemical Calculations on Peptides. J Phys Chem B. 2001;105:6474–6487. [Google Scholar]
  • 45.Jorgensen WL, Maxwell DS, Tirado-Rives J. Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids. J Am Chem Soc. 1996;118:11225–11236. [Google Scholar]
  • 46.Horn HW, Swope WC, Pitera JW, Madura JD, Dick TJ, Hura GL, Head-Gordon T. Development of an Improved Four-Site Water Model for Biomolecular Simulations: TIP4P-Ew. J Chem Phys. 2004;120:9665–9678. doi: 10.1063/1.1683075. [DOI] [PubMed] [Google Scholar]
  • 47.Berendsen HJC, Postma JPM, van Gunsteren WF, Hermans J. Interaction Models for Water in Relation to Protein Hydration. In: Pullman B, editor. Intermolecular Forces. Reidel; Dordrecht: 1981. pp. 331–342. [Google Scholar]
  • 48.Berendsen HJC, Grigera JR, Straatsma TP. The Missing Term in Effective Pair Potentials. J Phys Chem. 1987;91:6269–6271. [Google Scholar]
  • 49.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of Simple Potential Functions for Simulating Liquid Water. J Chem Phys. 1983;79:926–935. [Google Scholar]
  • 50.Torii H. Effects of Intermolecular Vibrational Coupling and Liquid Dynamics on the Polarized Raman and Two-Dimensional Infrared Spectral Profiles of Liquid N,N-Dimethylformamide Analyzed with a Time-Domain Computational Method. J Phys Chem A. 2006;110:4822–4832. doi: 10.1021/jp060014c. [DOI] [PubMed] [Google Scholar]
  • 51.Jansen TLC, Knoester J. Nonadiabatic Effects in the Two-Dimensional Infrared Spectra of Peptides: Application to Alanine Dipeptide. J Phys Chem B. 2006;110:22910–22916. doi: 10.1021/jp064795t. [DOI] [PubMed] [Google Scholar]
  • 52.Auer BM, Skinner JL. Dynamical Effects in Line Shapes for Coupled Chromophores: Time-Averaging Approximation. J Chem Phys. 2007;127:104105/1–10. doi: 10.1063/1.2766943. [DOI] [PubMed] [Google Scholar]
  • 53.Jaynes ET. Information Theory and Statistical Mechanics. Physical Review. 1955:1–11. [Google Scholar]
  • 54.Hummer G, Köfinger J. Bayesian Ensemble Refinement by Replica Simulations and Reweighting. J Chem Phys. 2015;143:243150/1–14. doi: 10.1063/1.4937786. [DOI] [PubMed] [Google Scholar]
  • 55.Beauchamp KA, Pande VS, Das R. Bayesian Energy Landscape Tilting: Towards Concordant Models of Molecular Ensembles. Biophys J. 2014;106:1381–1390. doi: 10.1016/j.bpj.2014.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Rieping W, Habeck M, Nilges M. Inferential Structure Determination. Science. 2005;309:303–306. doi: 10.1126/science.1110428. [DOI] [PubMed] [Google Scholar]
  • 57.Ohgo K, Ashida J, Kumashiro KK, Asakura T. Structural Determination of an Elastin-Mimetic Model Peptide, (Val-Pro-Gly-Val-Gly)6, Studied by 13C CP/MAS NMR Chemical Shifts, Two-Dimensional off Magic Angle Spinning Spin-Diffusion NMR, Rotational Echo Double Resonance, and Statistical Distribution of. Macromolecules. 2005;38:6038–6047. [Google Scholar]
  • 58.Yao X, Hong M. Structure Distribution in an Elastin-Mimetic Peptide (VPGVG)3 Investigated by Solid-State NMR. J Am Chem Soc. 2004;126:4199–4210. doi: 10.1021/ja036686n. [DOI] [PubMed] [Google Scholar]
  • 59.Muiznieks LD, Keeley FW. Proline Periodicity Modulates the Self-Assembly Properties of Elastin-like Polypeptides. J Biol Chem. 2010;285:39779–39789. doi: 10.1074/jbc.M110.164467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Baldock C, Oberhauser AF, Ma L, Lammie D, Siegler V, Mithieux SM, Tu Y, Chow JYH, Suleman F, Malfois M, et al. Shape of Tropoelastin, the Highly Extensible Protein That Controls Human Tissue Elasticity. Proc Natl Acad Sci U S A. 2011;108:4322–4327. doi: 10.1073/pnas.1014280108. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SI

RESOURCES