Skip to main content
NIST Author Manuscripts logoLink to NIST Author Manuscripts
. Author manuscript; available in PMC: 2020 Dec 8.
Published in final edited form as: J Chem Phys. 2018 Jul 21;149(3):035101. doi: 10.1063/1.5036677

A Spin–1 Representation for Dual–Funnel Energy Landscapes

Justin E Elenewski 1,2, Kirill A Velizhanin 3, Michael Zwolak 1,a)
PMCID: PMC7723752  NIHMSID: NIHMS1510622  PMID: 30037251

Abstract

The interconversion between left– and right–handed helical folds of a polypeptide defines a dual–funneled free energy landscape. In this context, the funnel minima are connected through a continuum of unfolded conformations, evocative of the classical helix–coil transition. Physical intuition and recent conjectures suggest that this landscape can be mapped by assigning a left– or right–handed helical state to each residue. We explore this possibility using all–atom replica exchange molecular dynamics and an Ising–like model, demonstrating that the energy landscape architecture is at odds with a two–state picture. A three–state model – left, right, and unstructured – can account for most key intermediates during chiral interconversion. Competing folds and excited conformational states still impose limitations on the scope of this approach. However, the improvement is stark: Moving from a two-state to a three-state model decreases the fit error from 1.6 kBT to 0.3 kBT along the left-to-right interconversion pathway.

I. INTRODUCTION

While most biomolecular helices possess a right–handed orientation, a small minority are predisposed to assume a left–handed fold.1 This chiral propensity may be enhanced through the introduction of achiral amino acids, such as 2–aminoisobutyric acid (Aib) — noted for its ability to induce a prominent left–handed helical population in biomimetic peptides (Fig. 1).2 While useful as a probe of biological organization, this observation also constitutes a general rule: a helix–forming polymer will demonstrate a preferred axial chirality when it is constructed from chiral blocks. In contrast, polymers derived from achiral blocks exhibit a degeneracy of left–(L) and right–handed (R) helical folds, forming an effective two–state system.3 Deviations from this behavior must be associated with induced chirality, either from the solvent environment or from chiral structural elements flanking the polymer.

FIG. 1.

FIG. 1.

Interconversion between left– and right–handed helical conformations in an Aib10 peptide foldamer.

This tunable two–state character has been exploited in the design of foldamer–based nanodevices.4-6 By introducing structural features that extend beyond standard biological motifs, these materials can be engineered to include distinctive photoreactive,7 thermoresponsive,8,9 pore–forming,10 and ligand binding functionalities.11,12 Aib–containing decamers, in particular, form stable helices and have found use as actuators within biomimetic receptors6,13,14 and photoswitches.7 In this case, triggering a conformational shift in the N–terminal sensor domain — through either ligand binding or photoexcitation — presumably biases the free energy landscape toward the opposite helical chirality, analogous to solvent–driven transitions in polyproline peptides.15,16 This shift results in an LR interconversion, the subsequent structural reorganization of a covalently linked C–terminal reporter and the ultimate detection of an activation signal. While promising as biomimetic devices, these systems also afford a fundamental opportunity to explore the translation of local molecular interactions into folding and function, outside the confines of natural biological systems.17

The static and dynamic properties of a macromolecule are dictated by the architecture of its potential energy landscape.18-20 In the case of biomolecules, these landscapes are minimally frustrated, containing the smallest possible set of competing low–energy conformations, separated by high potential barriers. This principle often affords a sharply funneled profile, with the so–called native state lying at the global minimum and successive ‘excited’ states occupying higher–lying local minima. These minima collectively define the stable conformers of the molecule, while barriers in the landscape dictate the kinetic profile for conformational transitions.

Multifunctional biomolecules deviate from this paradigm, delivering a potential energy landscape that contains separate funnels for each functional conformer.21 This organizational principle extends to helical foldamers, where left– and right–handed orientations correspond to separate funnel minima in a dual–funnel energy landscape.22,23 While increasingly recognized in a biological context, the initial theoretical foundation for these systems was derived to describe dual–funnel, solid–solid transitions of small Lennard–Jones (LJ) clusters.24,25 These investigations delivered fundamental insight into the hallmarks of multifunnel landscapes, and illustrated how landscape features can mediate processes that range from kinetic trapping to phase changes in extremely finite systems. Explorations of cluster landscapes have likewise inspired, and serve as a benchmark for, ergodic sampling methods in molecular simulations,26-34 global optimization schemes,35 and path–sampling frameworks for rare–event dynamics.36-40 These computational advances have, in turn, facilitated more recent studies of biological and biomimetic systems, including those undergoing helix chirality inversions15,22,23,41 and containing bistable switching motifs.42-44

While certain biomolecules have been well–explored, a systematic, theory–guided approach to foldamer design is impeded by the complex nature of multifunnel energy landscapes. In principle, these may only be mapped using costly numerical simulations.4,45 Nonetheless, a suitable analytical model — parameterized using discrete calculations at the single–block level — might dramatically accelerate these efforts while retaining acceptable quantitative accuracy. Numerous computational techniques have been applied to Aib–containing helices in order to understand the form that such a model should take.46-53 Nonetheless, the majority of these are limited in their sampling of conformational dynamics during the LR transition. Motivated by studies of energy transport in the Aib9 peptide foldamer,54-58 more recent efforts have long–timescale molecular dynamics simulations, Markov state modeling, and principal component analysis (PCA) to these systems.59,60 The resulting data suggest that Aib9 has a conformational landscape where long timescale (ns to μs) dynamics are ultimately slaved to short timescale (ps) hydrogen bonding transitions through a hierarchy of timescales — a feature characteristic of systems with a multi–funnel architecture.61-63 A coarse–grained energy landscape was proposed, parameterized in terms of backbone dihedral angles (ϕ, ψ), with conformational substates determined by discrete, per–residue changes in helical chirality. In this manner, each residue is assigned to either a left (L) or right (R) conformational substate, though the authors did not make this statement quantitative.59,60 While more elaborate coarse–grained models can capture the dynamics of helix chirality inversion,23 it is unclear if this minimal description can do the same.

To explore this physically intuitive proposal, and assess the limits of coarse–graining, we have constructed an explicit representation of the L/R–model using an Ising–like Hamiltonian. Large–scale replica–exchange molecular dynamics (REMD) simulations and an unsupervised clustering approach were employed for parameterization, affording a granular picture of the potential energy landscape. Our observations indicate that a two–state representation is insufficient to describe the structural diversity inherent in the LR transition. A three–state, spin–1 model yields better performance by explicitly including unstructured regions. However, the model still exhibits systematic deviations from all–atom simulations. These inconsistencies arise from numerous factors, including the presence of ‘excited’ conformational substates and the existence of distinct, competing helical folds. Taken together, our observations provide a minimal bound on the complexity of analytical models that accurately describe simple foldamers such as AibN, where N is the number of repeats, and attest to the necessity of explicit simulation in characterizing these systems.

II. THEORY AND METHODS

A. Spin Models

We consider an unbranched foldamer containing N linearly–ordered blocks. In the simplest case, each of these blocks might be classified as either a left–handed (L) or right–handed (R) configuration according to its backbone dihedral angles (Appendix I). An energetic gain of J < 0 per site is expected when consecutive blocks share the same helical orientation (… RR … or … LL …) — promoting the formation of homochiral domains — and an energetic penalty of −J > 0 encountered at a domain wall between left– and right–helical regions (… LR … or … RL …). The simplest Hamiltonian that describes these interactions is a classical ‘ferromagnetic’ spin–1/2 Ising model,

EI=α=1N1J^(σα+1,σα) (1)

where the spin σα ∈ {L, R} residing on block α is assigned to either a right–handed (R) or left–handed (L) configuration. The spin-spin coupling matrix J^ is symmetric, and all elements are of the same magnitude: J=J^(R,R)=J^(L,L)=J^(L,R)=J^(R,L). The supplementary material (SM) shows the effect of asymmetric couplings and discusses our approach to fitting (SM Section IA, Figs. S1-S5). We will discuss the difficulties encountered with this simple approach.

The energy, Eq. (1), can be extended to a spin–1 scenario, where an unstructured random coil state U exists alongside the L and R configurations (σα ∈ {L, U, R}). We take the coil state to always refer to the unstructured polymer or region. For simplicity, we assume that extended coil stretches have no inter–site coupling J^(U,U)=0, nor do they make an energetic contribution when contacting helical regions J^(LR,U)=0 (SM Section IB, Fig. S6, S7) This construction differs from Zimm–Bragg64 and Lifson–Roig65 models for the helix–coil transition, where an additional statistical weight for nucleation is assigned to helix termini that flank unfolded regions. The inclusion of a nucleation penalty J^(LR,U)0, or a correction for flexible helix termini, has only a modest impact on our model (see the SM, Figs. S7-S8).

The persistence of extended coils will depend on a variety of factors, including the interaction of side chain and backbone atoms with the encapsulating solvent. This effect may be captured through an on–site solvation energy KS that is associated with unstructured peptide regions. We may also define a contribution K^(LUR) that reflects the tendency of a given conformation to reside in rotameric minima (not including cooperative factors, such as hydrogen bonding), leading to an on–site Hamiltonian term:

EO=α=1NK^(σα). (2)

In practice, it is convenient to set K^(L)=K^(R)=0 and introduce a single nonzero on–site parameter K^(U)=KSK0 that reflects the impact of solvation (and other factors that impact the “on site” energy of a block) on the extended coil state (see the SM, Figs. S9-S10). Under these considerations K^(U)<0 promotes and K^(U)>0 penalizes the formation of extended coil regions. For Aib peptides in chloroform, one would expect a K^(U)>0 as contacts between the polar backbone amides and the nonpolar solvent would be disfavored.

In a realistic foldamer, coiled regions will admit a multitude of conformations, while the comparatively rigid helical residues are likely to cluster around a single conformational state. This behavior may be accommodated by introducing the contribution of rotameric entropy to the free energy of the system

SR=kBiC(ni1)s, (3)

where ni is the length of the i–th coiled stretch, kB · s is a unit of conformational (rotameric) entropy, and kB is the Boltzmann constant. The summation runs over indices in the set of coiled regions, C, defined as a repeat of two or more consecutive U sites in a given peptide conformation. The high flexibility of helix termini, coupled with their ambiguous dihedral assignments, necessitates their exclusion when determining membership in C. The appearance of (ni − 1) in Eq. (3) and the definition of C both require an explanation. While a single U site results in a kinked helix – and a slight increase in entropy – broad conformational (rotameric) sampling only occurs when there is junction between two U sites (an upper bound for the kink entropy is given by Fig. S11b; however the minute conformational variability observed in all–atom MD is beyond the scope of our model). As a consequence of this, an expression scaling as ni results in an overestimation of the entropy for otherwise structured states, leading to the use of an (ni − 1) term (see the SM, Fig. S11).

These considerations collectively define the relevant contributions to the Gibbs free energy for a solvated, helical foldamer, containing both helical and unfolded segments:

GC=EO+EITSR+pV. (4)

As a matter of convention, the free energy of the j–th spin configuration ΔGC,j = GC,jGC,0 will be measured relative to the absolute free energy GC,0 of the lowest energy spin configuration(s) in a given ensemble. The Hamiltonian components of this model are exactly solvable, and the partition function may be evaluated using transfer matrix techniques. Equation (4) includes a contribution from volume, pV, where p (V) is the pressure (volume), that is necessary for completeness. We may alternatively drop this explicit dependence, effectively wrapping this parameter into other terms appearing in Eq. (4) by fitting simulation data. This will be addressed later.

B. All–Atom Simulations

Molecular dynamics simulations were performed using a ten residue stretch (Aib10) of 2–aminoisobutyric acid, with initial backbone dihedrals assigned from the Dunbrack rotamer library for a right–handed 310 helix.66 The model was embedded in a (5 nm)3 cubic cell, containing 922 chloroform molecules, and packed to the density of bulk solvent.67 C– and N–termini were methylated and acetylated, respectively, and simulation physics described using the CHARMM36 force field68 and the LAMMPS package.69 CHARMM cross–term map (CMAP) corrections were not employed.70 While this modification has been demonstrated to improve α–helix folding cooperativity,68,71 it dramatically overestimates α–helical character for model helices.72 Since Aib10 is characterized by nontrivial 310–helical content, this would have questionable transferability without major reparameterization.

Lennard–Jones and Coulomb interactions were computed using conventional CHARMM pair potentials, conjoined with an additional electrostatic damping term to maintain compatibility with particle–particle–particle mesh summation (force cutoff = 6.95 × 10−3 pN). Switching functions were employed to rescale coupling between atomic pairs separated by more than 1.0 nm, with the interatomic potential vanishing beyond 1.35 nm separation. A timestep of 1.0 fs was used for all calculations within a scheme73 that employs a velocity Verlet integrator, modified Nosé–Hoover thermostat and Martyna–Tobias–Klein barostat,74 alongside a Parrinello–Rahman representation75 for the strain energy, allowing us to reproduce the correct probability distribution for the isobaric–isothermal (NPT) ensemble. The damping period of the thermostat was set to 100 fs, while the damping period of the barostat was set to 1000 fs and coupled to a chain of eight members. Isotropic cell fluctuations were allowed in all directions, and initial velocities assigned according to Gaussian distributions for both linear and angular momenta at a given temperature.

The Aib10 model was subjected to an initial 22 ns NPT equilibration, providing a starting point for subsequent replica exchange simulations. REMD runs were performed using 48 replicas in an NPT scheme,76,77 with temperatures spanning between Ti = 230 K and Tf = 465 K. Exchanges were attempted every 500 fs, yielding an acceptance ratio of 32 % for all simulations. REMD simulations were equilibrated for a further 150 ns, followed by a 500 ns production run. Conformational clusters were determined using a running k–means scheme and a metric based on the root-mean-square deviation (RMSD) of heavy backbone atoms (0.26 nm, cutoff radius). This procedure is sufficient to converge the energies of ground–state clusters — presumed to be degenerate at all temperatures — to within a deviation of 0.07 kBT when sampling below 300 K. The persistence of this criterion for over 50 ns of sampling was taken as a hallmark for convergence, as force field deficiencies may prevent complete degeneracy from occurring on an accessible timescale during our simulations.59

III. RESULTS AND DISCUSSION

A. Aib10 Energy Landscape

The conformational dynamics of Aib–based polypeptides have been experimentally characterized in a spectrum of solvents. However, their behavior in chloroform has a particularly distinguished history. Under these conditions, it has been proposed that a glass–like dynamical transition occurs in Aib-Ala-(Aib)6 derivatives,54-57,78 as reflected through temperature–dependent energy transport, though the nature of this transition – and the changes in energy transport – remains contentious, even among the same group of authors.58,59,79,80 More concretely, a nonpolar environment mimics the artificial membranes in which many foldamer–based molecular sensors are intended to operate.7,14 These factors – combined with the absence of solvent hydrogen bonding or complex electrostatics – motivated us to adopt this model system.

An enhanced–sampling approach was employed to facilitate the exploration of conformational space, using all–atom REMD simulations to sample the NPT ensemble for an explicitly–solvated Aib10 peptide. This protocol generates approximately 106 conformers for each temperature, however, the construction of an energy landscape requires either projection onto a lower–dimensional subspace or the reductive classification of these peptide conformations into a smaller set of states. We adopted the latter approach — a granular quantification of minor intermediates along the transition pathway. While a full–dimensional approach might involve the use of disconnectivity graphs and related classification methods, a simple clustering scheme is sufficient for our purposes.24 This character may be obscured when using PCA–based methods. Classification was accomplished using a k–means clustering scheme, in which a given trajectory frame is assigned to a cluster only if the running intra–cluster RMSD is less than 0.26 nm from the cluster centroid. For a 500 ns REMD trajectory, this affords clusters containing ≈ 400 states at each temperature that is sampled between 230 K and 330 K. With this classification in hand, a relative free energy

ΔGjk=kBTlog[PkPj] (5)

may be calculated between states of populations Pj and Pk, respectively. For simplicity, we assume that free energies are measured with respect to the most populous cluster in each ensemble, allowing ΔGj to be indexed by a single parameter.

REMD simulations reveal an energy landscape that contains two distinct folding funnels, corresponding to left– and right–handed conformations of the peptide helix (Fig. 2). The native helical states in each funnel are nearly isoenergetic, however, a degree of asymmetry exists in distribution of clusters. This is likely a collective effect, resulting from clustering artifacts, initial conditions, sampling limitations, and a degree of bias induced by the terminal patches that maintain electrostatic neutrality. Interestingly, (a more sizable) asymmetry was observed in a previously reported map of the Aib landscape derived following long–timescale MD simulations and dimensional reduction of the data.59 On a separate note, the gap in helical content between left– and right–handed funnels is intrinsic to the function Hlx[{xi}] (see appendix), which accounts for oriented twists in the peptide backbone, reflecting an ‘intrinsic helical content’ associated with gyration of the peptide chain.

FIG. 2.

FIG. 2.

Free energy landscape of Aib10 at T = 300 K. Each data point represents one of N = 433 clusters, and the size of each point is scaled relative to the number of states contained within the respective cluster.

While the precise cluster assignments in each funnel are not identical, the overall distribution of these states remains quite similar. Both funnels are dominated by highly–populated clusters of helix–rich states at low energies, expanding into a large set of high–entropy states with a low helical content at higher energies. This high energy region also contains ‘excited’ helical states, in which an energetically unfavorable conformation is assumed while preserving the overall helical fold. An apparent crossing between funnels, where the fold becomes largely unstructured, occurs around T = 4 kBT consistent with prior PCA–based landscapes.59,60 This system is distinguished from other foldamers by the achiral nature of Aib, affording a highly symmetric and degenerate energy landscape. In contrast, helices that are derived from chiral blocks15,22,23 often demonstrate some degree of energetic asymmetry, while clusters such as LJ38 possess energetically asymmetric basins with markedly different topographies.25

The low energy clusters in this ensemble are primarily α–helical, transitioning to a combination of unfolded and 310–character with increasing energy (Fig 3). At first glance, this appears to contradict experiment, as crystallography,81-83 optical,84-86 and magnetic resonance spectroscopies suggest that short (≤ 7 residues) Aib peptides exist as 310–helices in weakly dielectric environments. This picture is nuanced, as both α– and 310–conformations – with a dominant α–helical population – have been observed in polar and moderately polar solvents (water, DMSO).48,87-89 Taken together, these data indicate that Aib conformations exhibit a high degree of environmental sensitivity, with 310–helical content predominating for short helices in nonpolar media and at higher temperatures.90 Since our ten–residue model is longer than the peptides employed in most experimental efforts, it is plausible that a greater degree of α–helical content may occur in this system, with the early stages of folding templated by a 310–helix.91 This behavior is consistent with MD simulations using a custom, AMBER–based force field for Aib52,53 and long–timescale trajectories of Aib9 dynamics using GROMOS96, which reveal an increasing proportion of α–helical character for longer chains.59 Nonetheless, the dominant helical fold, and any force–field dependence, will not markedly alter our conclusions. They key observation herein is the coexistence of two distinct helical populations with different physical characteristics.

FIG. 3.

FIG. 3.

Temperature–dependent helical content within the Aib10 landscape, demonstrating an excess of either α–helical (blue) or 310–helical (red) character. Clusters are ranked in order of decreasing population (i.e., increasing free energy).

The state distribution within the Aib10 landscape is readily analyzed using the density of conformational cluster states ρ(E) that are observed at each replica temperature

ρ(E)=j=1NCPje(EΔGj)22σ2 (6)

where NC is the number of k–means clusters calculated for a given replica, Pj is the population of the j–th cluster, ΔGj is the free energy of the j–th cluster, and σ2 accounts for the inter–cluster conformational variance (Fig. 4). These data exhibit a weak temperature dependence, characterized by a 5 % increase in the number of conformers lying below 4 kBT as the temperature is increased from 230 K to 330 K. Notably, these data do not demonstrate any obvious signatures of a transition between 250 K and 270 K, where prior experiments and simulations have suggested at the Aib might undergo a protein dynamical transition.54-56,58,59,79,80 A dynamical transition cannot be excluded without a more comprehensive analysis, as these inherently coarse–grained clusters may obscure subtle changes associated with this phenomenon. Furthermore, a dynamical transition is canonically associated with a redistribution of barrier heights in the energy landscape, which is not determined in this work. These effects are expected, at most, to modestly perturb the landscape minima.

FIG. 4.

FIG. 4.

Density of conformational clusters, populated according to the number of members in each cluster, for all–atom replica exchange simulations with base temperatures spanning between 220 K and 330 K. The density of states was calculated using σ2 = 0.01 as a smearing parameter.

B. Spin Representations of the Energy Landscape

The energy landscape of Aib10 – parameterized by the backbone dihedrals – reflects the secondary structure of conformational substates and bounds the complexity of coarse–grained models. The free energy surface corresponding to a single Aib residue, ΔG1 (ϕ, ψ), is foundational to this classification. When calculated from the REMD ensemble at T = 300 K, this surface contains four primary minima, including left–handed (ϕ ≈ −50°, ψ ≈ −55°) and right–handed (ϕ ≈ 50°, ψ ≈ 55°) helical regions alongside a pair of broad–shouldered basins (right–handed: ϕ ≈ −65°, ψ ≈ 55°; left–handed: ϕ ≈ 65°, ψ ≈ −55°) that define extended conformations (Fig. 5a, Fig. S9). REMD simulations reveal a distribution of minima and interconversion barriers (ranging between 4kBT to 5kBT) that resemble earlier simulations of Aib dynamics, suggesting that our computational approach captures the general condensed phase behavior of Aib.52,59

FIG. 5.

FIG. 5.

Quantification of the Aib10 energy landscape: (a) free energy landscape ΔG1(ϕ, ψ) assumed by Aib when incorporated into Aib10, derived from the REMD distribution of (ϕj, ψj) backbone dihedral angles; (b) free energy surface ΔG10(ϕ¯,ψ¯) for individual Aib10 conformers calculated from a distribution of the average backbone dihedrals (ϕ¯j,ψ¯j);(c) overlay of k–means centroids (white dots) with the ΔG10(ϕ¯,ψ¯) landscape; and (d) cross–correlation between the j–th cluster energy, ΔGj determined from REMD simulations, and the j–th centroid energy ΔGC,j calculated using the spin–1/2 Hamiltonian ΔGC=ΔEI with unit coupling (J^(R,R)=1.0kBT;J^(L,R)=1.0). The root mean square error (RMSE) between ΔGj and ΔGC,j is 2.8 kBT, however this is biased toward unstructured states; the RMSE for states lying along the helix–coil transition (Fig. 6) is 1.6 kBT. Energy landscapes were determined using 106 Aibi0 conformers, extracted from REMD simulations at T = 230 K. Boxed areas in (b,c) denote the preferred subregions of dihedral space for left– and right–handed helices.

In a similar manner, it is straightforward to derive a free energy surface ΔG10(ϕ¯,ψ¯) for entire Aib10 peptides in terms of the mean backbone dihedrals ϕ¯j=(N2)1α=2N1ϕj,α and ψ¯j=(N2)1α=2N1ψj,α, where the index α runs over residues within the j–th Aib10 configuration. Summation is restricted to interior residues as the highly–flexible terminal sites can only be assigned a single dihedral parameter. The resulting landscape is more complex than that of a single Aib residue, containing helical substates that are connected by a near–continuum of weakly structured intermediate configurations (Fig. 5b). This architecture is well mapped by k–means clusters, with centroids that are both localized in minima of ΔG10(ϕ¯,ψ¯) and diffusely distributed throughout the interstitial parameter space (Fig. 5c).

While this energy profile is consistent with earlier simulations, the preceding efforts identified fewer states within in the energy landscape.59 This deviation is likely associated with the PCA–based dimensional reduction employed by other authors. In this case, PCA component vectors were shown to convolve the ϕ and ψ backbone dihedrals with undetermined lower–weight parameters. The resulting landscape corresponds (approximately) to a configuration in which conformations from our k–means clusters are averaged according to their backbone dihedrals within overlapping neighborhoods (c.f. Fig. 5). This redistribution and averaging affords a smoothed map of conformational space, while impeding detection of nuanced details that are captured by our clustering. In a similar manner, the landscape given by ΔG10 contains numerous minima that are better differentiated by the geometric (ϕ, ψ) backbone dihedrals than the admixed parameters resulting from PCA. The marginalization of fine landscape features with certain order parameters is a well–known complication of dimensional reduction34,92-94 though any given pair of these parameters may be related through a well–defined scaling transformation.94

The Aib10 landscape contains numerous conformations (70.8 % of the ensemble) that lie outside the basins dominated by left– and right–handed helical character (Fig. 5; Fig. S9). At first glance, this would appear to preclude a two state model, even when restricted to core regions of the Aib helix. To test this assumption, the centroids at T = 230 K were given a binary classification by setting σα = R when ψα ≤ −ϕα and σα = L when ψα ≥ −ϕα, following an earlier proposal.59 Using this ensemble, the centroid energies calculated using the spin–1/2 Hamiltonian ΔGC,j=ΔEI exhibit extremely weak correlation (particularly for structured conformers ) with the energies ΔGj derived from the REMD landscape, indicating that unstructured configurations are critical to constructing a simplified model of Aib10 dynamics. This observation is underscored by theoretical investigations of other bistable helical foldamers, which note the importance of unstructured states to either primary or secondary pathways for helix chirality inversion.15,22,41

A more meaningful classification scheme becomes apparent when introducing an unfolded coil (U) configuration, leading to a three-state, spin–1 representation of the energy landscape. To implement this approach, every cluster centroid may be encoded as a spin configuration between residues i and i + 1 using the function hcx[{xi}, {xi+1}] (Eq. 11). These data may then be employed to compute the the energy ΔGC,j of the j–th centroid, and compared directly to with the corresponding cluster energy ΔGj from the all–atom replica ensemble.

The simplest three-state model for Aib10 employs only the spin–spin Hamiltonian term ΔGC=ΔEI to quantify interactions between structural motifs. This model demonstrates modest agreement with the REMD cluster distribution at low energies, however, a pronounced deviation is observed for high–entropy states in which random coil character is dominant (Fig. 6a). The inclusion of a correction ΔGC=ΔEI+ΔEO that disfavors solvent–exposed coils strengthens this correspondence for a number of clusters in the well–folded region (ΔGj, ΔGC,j ≤ 4kBT), particularly for states lying within the right–handed funnel (Fig. 6b). While this term accommodates the short helical segments that occur within the bulk of a helix, or the fraying terminal regions associated with the canonical helix–coil transition, there is a dramatic overestimation of solvation penalties for centroids with low helical content. The introduction of an entropic term ΔGC=ΔEI+ΔEOTΔSR strengthens this correspondence (Fig. 6c), and may be rationalized as a form of energy–entropy compensation that corrects for the loss of inter–helical hydrogen bonds and helix dipole reinforcement. Nonetheless, notable deviations persist for a series of states lying within the low–energy regime, as well as within the large high entropy cluster consisting largely of unstructured configurations. REMD simulations indicate that a pΔV term would shift the highest lying centroid energies by approximately 0.7 kBT (see the SM, Fig. S12). However, the on-site and entropic terms already capture part of this correction due to the fitting. We thus neglect this correction.

FIG. 6.

FIG. 6.

Cross–correlation between the j–th cluster energy, ΔGj determined from REMD simulations at T = 230 K, and the j–th centroid energy ΔGC,j calculated using spin–1 Hamiltonians. Configurations correspond to (a) ΔGC=ΔEI, (b) ΔGC=ΔEI+ΔEO and (c) ΔGC=ΔEI+ΔEOTΔS. Points are colored according to their helical content, spanning from right–handed helices (blue) to random coil (green) to left–handed helices (red) (parameters are determined by setting J^(L,L)=J^(R,R)=1.0kBT and J^(L,R)=1.0kBT (See SM, Section 1), and employing a systematic fitting procedure to find KU = 0.9 kBT, s = 0.9). Dashed lines denote a range within ±0.5 kBT of the diagonal. Fitting was performed by maximizing the number of clusters (Nfit = 26) for which ∣ΔGC,j − ΔGj∣ ≤ 0.5 kBT and simultaneously minimizing ∣ΔGC,j − ΔGj∣, affording an RMSE of 0.3 kBT for fitting points (overall RMSE = 2.4 kBT). This corresponds to 15 % of the spin–1/2 RMSE for clusters lying along the helix–coil transition (Fig. 5d). We note that the solvent and entropic terms are both physically tied to the three-state model, as they reflect behavior of the unstructured state. Therefore, they are not included in the spin–1/2 model of Fig. 5d.

It is natural to ask if these deviations are unique to a particular simulation temperature. To assess this possibility, an identical set of calculations was performed by transferring these parameters, and the full spin model (Eq. 4), to four different environmental conditions (Fig. 7). A similar pattern of deviations between ΔGC,j and ΔGj is observed at each temperature, with a comparable quality of fit, suggesting a surprising degree of transferability for this model. The consistency of this behavior is highly suggestive of a systematic deviation between the spin model and all–atom simulations.

FIG. 7.

FIG. 7.

Temperature dependence of cross–correlation between the j–th cluster energy ΔGj and the j–th centroid energy ΔGC,j calculated using the full spin–1 model ΔGC=ΔEI+ΔEOTΔS. Parameters are defined at each temperature so that J^(R,R)=1.0kBT,K^(U)=0.9kBT, and s = 0.9, following Fig. 6. Dashed lines denote the fitting range within ±0.5 kBT of the diagonal (overall RMSE230K = 2.4 kBT; fit RMSE230K = 0.3 kBT (26 states); overall RMSE270K = 2.6 kBT; fit RMSE270K = 0.3 kBT (19 states); overall RMSE300K = 2.5 kBT; fit RMSE300K = 0.3 kBT (25 states); overall RMSE330K = 2.4 kBT; fit RMSE230K = 0.3 kBT (19 states)).

To dissect the origin of this behavior, it is helpful to examine a series of centroids that exhibit tight agreement with replica exchange clusters. An ideal set is afforded by the right–handed helical funnel within the T = 230 K ensemble (Fig. 8a). Cursory analysis of these states reveals a classical helix–coil transition, proceeding from a well–formed α–helical native state to a series of higher energy conformers characterized by fraying of the helix termini and the ultimate unwinding of the segment. The robust fit for this series is consistent with the enhanced stability of α–helical populations within Aib foldamers at low temperatures. Furthermore, the majority of these centroids retain a degree of helicity, suggesting that all but the highest–energy configurations in this series correspond to folds in which a helix has already nucleated.

FIG. 8.

FIG. 8.

Structural elements of the Aib10 energy landscape: (a) selected centroids exhibiting tight cross–correlation between the full spin model (Eq. 4) and REMD simulations at T = 230 K, demonstrating a robust helix–coil transition; (b) cluster demonstrating both α– and 310–helical domains, shown in white (on top) and cyan (on bottom), respectively. Hydrogen bonding patterns are indicated by dotted lines and diffuse shading; (c) comparison between ‘ground’ and ‘excited’ state conformers sharing the same spin assignment; (d) anomalous cluster demonstrating an incommensurate hydrogen bonding pattern.

It is notable that low energy clusters — exhibiting a reasonable correspondence between the spin–1 model and replica exchange simulations — are dominated by structures with substantial α–helical character (Fig. 3). Higher energy clusters are dominated by a 310–fold when nontrivial helicity is present. Manual inspection of poorly fit clusters reveals that many contain at least a small 310–helical twist, either in isolation (high–entropy cluster) or abutting an α–helical segment through a small unstructured linker (states below the diagonal). Owing to their distinct physical characteristics, these 310–helical folds constitute a distinct helical population that coexists alongside the dominant α–helical distribution (Fig. 8b). A more physically accurate model might accommodate two distinct helical populations, forming a five–state representation. The prevalence of 310–helical configurations in short stretches underscores a role as nucleation sites for α–helices, consistent with their purported role in macromolecular structure formation and with traditional analytical models for the helix–coil transition.64,65

An additional family of deviations is associated with clusters that classify into the same spin encoding, yet differ conformationally within the REMD ensemble. These structures are generally related through a localized conformational distortion, which preserves the overall secondary structure yet places one conformer into a higher–energy configuration (Fig. 8c). These energetically ‘excited’ states were anticipated in earlier PCA analyses59 and fall outside the scope of simple spin–based models (one needs further fine graining, i.e., a higher dimensional spin, to account for them), attesting to the importance of all–atom simulations for characterizing macromolecular energy landscapes. Underscoring this point, several of these configurations are highly populated (ΔGj ranging between 2 kBT and 3 kBT) and are associated with alterations in the terminal domains of the helix.

One final anomaly deserves further discussion. The atypical cluster located at ΔGj ≈ 1.5 kBT and ΔGC,j ≈ 6 kBT is accompanied by several similarly folded counterparts in both left and right helical funnels, and deviates strongly from a helical fold while retaining a well–defined secondary structure (Fig. 8d). These folds contain hydrogen bonding patterns consistent with single turns of 310– and α–helical character. Nonetheless, consecutive backbone hydrogen bonds from the i–th residue to residue i + 3 (310–helix) and from the i–th residue to residue i + 4 (α–helix) lie out of registry and thus a long–distance helical structure fails to form. This behavior may be unique to the Aib10 model, which exceeds the length of prior experimental constructs and lacks the bulky flanking groups present in synthetic helices. It is unlikely that this unconventional fold plays a role in the experimentally observed LR interconversion due to constraints from the helix termini or the surrounding membrane environment. Nonetheless, simulations of the helix–coil transition in polyalanine indicate the presence of alternate folds that reside within shoulders of the folding funnel.95 It is unclear if these clusters represent similar behavior, or if they correspond to a unique transition pathway between left–handed and right–handed funnels. While the latter possibility is less probable, it may only be excluded by mapping a kinetic network for these energy landscapes.

IV. CONCLUSIONS

Taken together, the observations herein underscore the complexity of seemingly simple macromolecular energy landscapes. From an analytical perspective, a three–state, spin–1 model can accommodate the general structural motifs present within the conformational ensemble of Aib10 — corresponding to left–handed helices, right–handed helices, and unstructured coils. Nonetheless, the presence of competing α– and 310–helical subpopulations limits the scope of this approach. More complex five–state models can be constructed following Ising or Potts Hamiltonians, however, the presence of high–energy (‘excited’) and low–energy (‘ground’) conformational states with identical spin encodings suggests that increasing the dimensions of the model will be met with diminishing returns. If a high resolution picture is required for the entire state spectrum, Markov state models96 or transition path representations36-40 may afford a more efficacious — yet costly — approach.

Despite these limitations, the construction of simplified (and, in our case, analytical) models helps to reveal key features of the energy landscape. These considerations extend to techniques for reduction of dimensionality — while PCA analysis reveals major minima, an approach based around replica exchange and unsupervised clustering captures states that might otherwise be masked when the resolution of simulation data is low. The importance of excited states and competing folds, as revealed through the spin–1 model, attests to the importance of careful landscape quantification. A judicious choice of methods is essential, balanced by a tradeoff between computational cost and resolution required of the resulting coarse–grained representation.

On a more sobering note, it is widely known that force-field dependence of the free energy landscape – and secondary structure in particular – is a constant source of figurative, and sometimes literal, frustration97. For Aib in particular, we find that α–helical content dominates at low energy (and 310 at higher energy). The funnel crossover energetics are in agreement with other force fields, but the ratio of α to 310 content for those cases is not available. Furthermore, experimental data and other MD simulations are not directly transferrable to the longer, unmodified, Aib polypeptide that we examine. Nevertheless, the inclusion of unstructured regions vastly improves coarse graining, irrespective of the force field – in this manner we consider Aib10 to be a model system for other helical macromolecules.15,22,41

This gives a general lesson for coarse-graining, whether into discrete structural states or at the atomic scale: Full characterization of the energy landscape for protein fragments and polypeptides is possible. A comparison of structural features between all-atom, atomically coarsegrained, and discrete representations can therefore give a strong and quantitative assessment of what is physically occurring in these models – in our case, a 0.3 kBT deviation of coarse grained states between 0 kBT and 4 kBT (this is in addition to constraints that ensure a correspondence with thermodynamic parameters and other data). In doing so, one can to pinpoint deviations, determine symmetries and asymmetries and, all-in-all, see what these complex atomic models are really yielding.

Supplementary Material

SI

VI. ACKNOWLEDGEMENTS

J. E. E. acknowledges support under the Cooperative Research Agreement between the University of Maryland and the National Institute for Standards and Technology Center for Nanoscale Science and Technology, Award 70NANB14H209, through the University of Maryland. K. A. V. was supported by the U.S. Department of Energy through the LANL/LDRD Program. Computing resources were made available through the Los Alamos National Laboratory Institutional Computing Program, which is supported by the U.S. DOE National Nuclear Security Administration under contract no. DE-AC52-06NA25396 as well as the Maryland Advanced Research Computing Center (MARCC).

VII. APPENDICES

A. Definition of Helical Order Parameters

Helical content is quantified98-100 using a function Hlx[{xi}] that scores a given peptide configuration {xi} based on (i) conformity to the angle formed by consecutive α–carbons in an ideal helix and (ii) consistency with an ideal hydrogen bonding arrangement for either an α–helix or a 310–helix:

Hlx[{xi}]=12(N2)i=1N2Ang(xα,i,xα,i+1,xα,i+2)+12(Nm)i=1NmHb(xO,i,xN,i+m) (7)

where m = 3 for a 310–helix, m = 4 for an α–helix, θ0 = 90° reflects the angle formed by three consecutive α–carbons and Δθtol = 15° defines an acceptance tolerance for deviations from an ideal helix. In this case, xα,i, xO,i, and xN,i, denote, respectively, the α–carbon, amide oxygen, and amide nitrogen coordinates for the i–th residue. The angular deviation function Ang(xα,i, xα,i+1, xα,i+2) is defined as

Ang(xα,i,xα,i+1,xα,i+2)=1[θ(xα,i,xα,i+1,xα,i+2)θ0]2(Δθtol))21[θ(xα,i,xα,i+1,xα,i+2)θ0]4(Δθtol))4 (8)

where θ(xα,i, xα,i+1, xα,i+2) is the angle formed by consecutive α–carbons and the hydrogen bonding contribution is quantified through

Hb(xO,i,xN,i+R)=1[xO,ixN,i+Rd0]41[xO,ixN,i+Rd0]6. (9)

The orientation of a given peptide configuration {xi} is assigned using the function

Hcx[{xi}]=i=1N1hcx[{xi},{xi+1}], (10)

where the pairwise helical content is defined in terms of the (ϕ, ψ) dihedrals so that

hcx[{xi},{xi+1}]={1(100ϕ30;80ψ5)0otherwise1(100ϕ30;80ψ5)} (11)

reflects the net right–handed (positive) or left–handed helical content (negative), respectively.

Footnotes

V.

SUPPLEMENTARY MATERIAL

The supplementary material contains a general evaluation of spin–1/2 and spin–1 model parameters.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SI

RESOURCES