Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2024 Sep 27;123(21):3798–3811. doi: 10.1016/j.bpj.2024.09.028

An integrative characterization of proline cis and trans conformers in a disordered peptide

Alice J Pettitt 1,3,4, Vaibhav Kumar Shukla 1,4, Angelo Miguel Figueiredo 1, Lydia S Newton 1, Stephen McCarthy 2, Alethea B Tabor 2, Gabriella T Heller 1, Christian D Lorenz 3,, D Flemming Hansen 1,4,∗∗
PMCID: PMC11560310  PMID: 39340152

Abstract

Intrinsically disordered proteins (IDPs) often contain proline residues that undergo cis/trans isomerization. While molecular dynamics (MD) simulations have the potential to fully characterize the proline cis and trans subensembles, they are limited by the slow timescales of isomerization and force field inaccuracies. NMR spectroscopy can report on ensemble-averaged observables for both the cis-proline and trans-proline states, but a full atomistic characterization of these conformers is challenging. Given the importance of proline cis/trans isomerization for influencing the conformational sampling of disordered proteins, we employed a combination of all-atom MD simulations with enhanced sampling (metadynamics), NMR, and small-angle x-ray scattering (SAXS) to characterize the two subensembles of the ORF6 C-terminal region (ORF6CTR) from SARS-CoV-2 corresponding to the proline-57 (P57) cis and trans states. We performed MD simulations in three distinct force fields: AMBER03ws, AMBER99SB-disp, and CHARMM36m, which are all optimized for disordered proteins. Each simulation was run for an accumulated time of 180–220 μs until convergence was reached, as assessed by blocking analysis. A good agreement between the cis-P57 populations predicted from metadynamic simulations in AMBER03ws was observed with populations obtained from experimental NMR data. Moreover, we observed good agreement between the radius of gyration predicted from the metadynamic simulations in AMBER03ws and that measured using SAXS. Our findings suggest that both the cis-P57 and trans-P57 conformations of ORF6CTR are extremely dynamic and that interdisciplinary approaches combining both multiscale computations and experiments offer avenues to explore highly dynamic states that cannot be reliably characterized by either approach in isolation.

Significance

This study employs MD simulations (with metadynamics), NMR spectroscopy, and SAXS to elucidate the individual cis-proline and trans-proline conformations of ORF6CTR from SARS-CoV-2. The good agreement on proline cis/trans populations observed in experiments (NMR) and those calculated from simulations in the AMBER03ws force field (with SAXS reweighting) showcases the efficiency of this interdisciplinary approach, which can be used to characterize highly dynamic disordered protein states, even for very slow processes. Furthermore, our study emphasizes the importance of considering both computational and experimental methodologies to gain a more holistic understanding of highly dynamic proteins. The presented integrative approach sets a precedent for future studies aiming to explore complex and dynamic biological systems with slow transitions such as proline isomerizations.

Introduction

Intrinsically disordered proteins (IDPs) and disordered regions, which represent at least 30% of the human proteome (1), are particularly common in cancer-associated proteins, with up to 80% containing disordered regions (2), and in viruses, where their coverage ranges from 3 to 55% depending on the viral species (3). Unlike folded proteins, disordered proteins are highly dynamic, and they often exist as an ensemble of diverse heterogeneous conformations that lack a single three-dimensional (3D) structure. Compared with folded proteins, the primary sequences of disordered proteins have a nearly 2-fold increase of proline residues (4), which are well-known to reduce the formation of secondary structure in proteins (5). In particular, proline residues in disordered proteins have been shown to play key roles in regulating protein-protein interactions (6,7), posttranslational modifications (8), and liquid-liquid phase separation (9).

Most peptide bonds within proteins exist almost exclusively in the energetically favorable trans conformation. However, for proline residues, the free energy difference between the cis and trans isomers is lower due to the cyclic structure of this amino acid. Given the high energy barrier to rotation, approximately 84 kJ mol−1 (10), proline isomerization is generally a slow process (11), occurring at a rate of 10−3–10−2 s−1 at room temperature, depending on the adjacent residues (12,13). The cis-proline population typically ranges between 5 and 10% in disordered proteins (4), but this can vary substantially depending on the length and composition of the amino acid sequence (6,14,15). Consequently, multiple cis-proline conformations may be present within polyproline disordered protein ensembles. These ensembles sample a vast conformational space of very slowly exchanging conformers, further increasing their complexity (16).

Molecular dynamics (MD) simulations are often used to characterize the ensemble of disordered proteins as they can resolve individual conformations within an ensemble at atomic resolution, which is a challenge for many experimental techniques. Significant progress has been made over the last decade to optimize force fields for modeling disordered proteins (17,18,19), as well as advances in the integration of MD simulations and experimental data to improve their accuracy (20,21). Despite these advances, sampling the full configurational energy landscape of disordered protein ensembles in all-atom explicit solvent MD simulations is extremely computationally expensive. Proline cis/trans isomerization presents an additional challenge due to the slow timescales of this process (12,13), which are generally not accessible in brute-force MD simulations alone (22), even on today’s most powerful computers. However, when suitable collective variables (CVs) can be identified, metadynamics, an enhanced sampling approach, offers an effective method for sampling slow motions (23,24). Indeed, metadynamics has been used to encourage exploration of the full configurational space of disordered proteins (7,25) and proline cis/trans isomerization in simulations of dipeptides and folded systems (26,27). In the latter cases, the ζ angle (Cαi–1, Oi–1, Cδi, Cαi, where i = proline) was employed as one CV for the isomerization and pyramidalization of the amide nitrogen and the ψ angle (Ni, Cαi, C’i, Ni+1) was employed as an additional CV to control the amide orientation, which may affect the rate of transition between the cis-proline and trans-proline conformations. Both CVs are required to enhance proline cis/trans sampling as they compensate for each other.

NMR spectroscopy is a well-suited experimental technique to characterize ensemble-averaged properties of disordered proteins at atomic resolution under physiological conditions (pH, temperature, and salt concentrations) (28). Furthermore, NMR can uniquely characterize and quantify the populations of cis-proline and trans-proline conformations. The distinct chemical environments for the two proline isomers, coupled with their slow exchange, can result in the detection of two separate peaks for neighboring residues or the proline itself. NMR has therefore not only been used to characterize the overall ensemble of disordered proteins (29,30), but NMR has also been used extensively to characterize the structural propensities and dynamics of cis-proline conformations in disordered proteins (6,14,15,31).

Another experimental technique that can report on the ensembles of disordered proteins in solution is small-angle x-ray scattering (SAXS). This technique can provide coarse structural information relating to a protein’s size and shape. The capability to predict SAXS profiles from atomic coordinates makes it possible to compare conformational ensembles from MD simulations with experimental SAXS data (32). While SAXS measurements offer powerful global information, they report ensemble-averaged states and cannot generally distinguish between the cis-proline and trans-proline configurations. Complementary approaches, such as NMR, are essential for providing detailed experimental information at the local scale.

Here, we used an integrative approach anchored in all-atom explicit solvent metadynamic simulations to characterize the C-terminal region of open reading frame 6 (ORF6CTR) from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). This region of ORF6 is predicted to be disordered (Fig. S1, A and B) and binds to host proteins via an essential methionine residue at position 58 (M58), leading to suppression of the innate immune response (33,34,35). Moreover, this 21-residue peptide contains a single proline residue at position 57 (P57), which may influence its binding to host proteins as this residue is at a preceding position to M58 (33,34). We sampled the conformational space of ORF6CTR using three different force fields, each optimized for disordered proteins: AMBER03ws (a03ws) (17), AMBER99SB-disp (a99SB-disp) (18), and CHARMM36m (C36m) (19). We employed metadynamics to enhance sampling (23,24), using various local and global CVs, including those on the P57 ζ and ψ angles (26,27). To reweight and validate resulting conformational ensembles, we compared ensemble-averaged properties from each force field to NMR and SAXS data. Specifically, we employed NMR chemical shifts to report on the local properties and populations of the cis-P57 and trans-P57 states, NMR diffusion experiments to compare the global properties of both states, and NMR spin-relaxation experiments to probe dynamics. Moreover, SAXS data were used to select the most accurate force field for predicting the ORF6CTR global conformational ensemble. To further refine the conformational ensembles, we updated the statistical reweighting using a Bayesian/maximum entropy (BME) approach (21,36).

By integrating metadynamic simulations, SAXS, and NMR, we can characterize the highly dynamic cis-P57 and trans-P57 subensembles of ORF6CTR. We show that metadynamics with the P57 ζ and ψ angle CVs enhances sampling of P57 isomerization, and we observe convergence of these two CVs for the a03ws and C36m force fields. We find that a03ws most accurately predicts the cis-P57 and trans-P57 populations in the ORF6CTR. By employing SAXS BME reweighting (21,36) and two independent a03ws runs, we obtain cis-P57 populations in the a03ws force field that match those from NMR. NMR diffusion experiments suggest that the cis-P57 subensemble is slightly more compact than the trans-P57 subensemble, in qualitative agreement with the metadynamic simulation predictions. Furthermore, NMR spin-relaxation experiments and metadynamic simulations indicate that both the cis-P57 and the trans-P57 conformations of ORF6CTR are extremely dynamic. We anticipate that this interdisciplinary approach can be broadly applied to the many disordered proteins that undergo complex dynamics across varying timescales.

Materials and methods

ORF6CTR peptide synthesis

The ORF6CTR with sequence SKSLTENKYSQLDEEQPMEID was initially made by solid-phase peptide synthesis using a MultiSynTech Syro Peptide Synthesiser. Fmoc-Asp(OtBu)-NovaSyn TGT resin and standard Fmoc-amino acids, coupling, and deprotection conditions were used. All residues were double-coupled. The resin was cleaved using a cleavage mixture of TFA/TIPS/H2O (95:2.5:2.5) and the peptide was isolated by precipitation from diethyl ether, centrifugation, and lyophilization. A portion (4 mg) of the crude peptide was purified by semipreparative HPLC to give pure ORF6CTR (1.5 mg) (see Fig. S2, A and B in the supporting material).

N-Acetylated ORF6CTR peptide synthesis

The unlabeled N-acetylated ORF6CTR peptide (NAc-ORF6CTR) with sequence Ac-SKSLTENKYSQLDEEQPMEID was produced synthetically (>96.3% purity) by GenScript (GenScript Biotech UK, Oxford, UK).

Expression and purification of the isotopically labeled ORF6CTR peptide

The uniformly isotopically labeled ORF6CTR with sequence SKSLTENKYSQLDEEQPMEID was produced by recombinant protein expression with an N-terminal glutathione S-transferase (GST) tag followed by a tobacco etch virus protease cleavage site in the pGEX-6P-1 expression vector (GenScript Biotech UK, Oxford, UK). ORF6CTR was expressed in Escherichia coli BL21(DE3) strain. The cells were grown at 37°C in minimal M9 medium containing 1 g/L 15NH4Cl as the sole nitrogen source and 10 g/L glucose for the 15N-labeled ORF6CTR peptide. For the 13C,15N-labeled ORF6CTR peptide, 1 g/L 15NH4Cl was used as the sole nitrogen source and 3 g/L 13C-glucose as the sole carbon source. Cultures were grown at 37°C with vigorous shaking. Expression was induced at an OD600 of 0.5–0.6 by addition of 1 mM isopropyl β-D-thiogalactopyranoside and left shaking for 4 h at 37°C.

The cell pellet was collected by centrifugation and resuspended in lysis buffer containing 50 mM Tris (pH 8.0), 300 mM NaCl, 10 mM β-mercaptoethanol, and 5% glycerol. Prior to lysing the cells via sonication, 0.25% IGEPAL was added to the mixture along with small amounts of DNase, lysozyme, and one protease inhibitor tablet per 50 mL. Next, the cell lysate was added to Glutathione Agarose 4B resin (Protino) and binding was allowed to occur by gently rocking the mixture for 1 h at 4°C. The gravity columns were then washed with the lysis buffer to remove any unbound proteins. ORF6CTR was cleaved from the GST-tag via the addition of tobacco etch virus protease overnight, shaking at 22°C. The next day, the flowthrough containing the cleaved ORF6CTR was concentrated by ultracentrifugation through 1 kDa cutoff centricons (PALL, New York, NY) ultrafiltration membranes. For further purification, size-exclusion chromatography on a Superdex 75 column was carried out in NMR buffer containing 25 mM HEPES (pH 6.9), 150 mM NaCl at 5°C. Fractions containing ORF6CTR were pooled together and concentrated by 1 kDa cutoff centricons (PALL) ultrafiltration membranes. The yield of purified peptide was around 1.4–1.6 mg from 1 L of culture medium.

NMR spectroscopy

Unless specified otherwise, NMR spectra were collected on uniformly 15N-labeled or 13C,15N-labeled ORF6CTR peptide samples at concentrations of 300 μM and unlabeled NAc-ORF6CTR at concentrations of 400 μM. All peptide samples were prepared in 25 mM HEPES buffer (pH 6.9), 150 mM NaCl, containing 5% D2O, 1 mM sodium azide, and 1 mM EDTA. Before recording experiments, ORF6CTR samples were boiled in the NMR tube (sample volume ∼600 μL) at 100°C for ∼3 min to remove proteases. NMR data were recorded at 15°C unless stated otherwise. NMR data were acquired at three different static magnetic fields: 14.1 T (600 MHz) on a Bruker NEO spectrometer with a TXO cryoprobe, 18.8 T (800 MHz) on a Bruker Avance III HD spectrometer equipped with Z-gradient triple-resonance TCI cryoprobe, and 22.3 T (950 MHz) on a Bruker Avance NEO spectrometer equipped with a QCI-F cryoprobe.

Resonance assignments were obtained from a standard suite of double-resonance and triple-resonance experiments at a static magnetic field strength of 14.1 T. Backbone 15N relaxation rates, including R1, R, and {1H}-15N steady-state heteronuclear NOEs (hetNOEs), were measured at two static magnetic field strengths (14.1 and 18.8 T). The backbone 15N exchange-free relaxation rates (Rdd) were recorded using a previously described method (37) at a static magnetic field strength of 14.1 T. Diffusion ordered spectroscopy (DOSY) experiments were measured using a pseudo-3D 1H-15N heteronuclear single quantum coherence (HSQC)-type experiment at a static magnetic field strength of 22.3 T. Additional information regarding the NMR experiments can be found in the supporting material.

NMR data analysis

All NMR spectra were processed using NMRPipe (38), and analyzed with NMRFAM-SPARKY software (39) and FuDA (37). To analyze residual secondary structure for the cis-P57 and trans-P57 configurations, we used the secondary structure propensity score algorithm (40), with the ORF6CTR 1Hα, 13Cα, and 13Cβ chemical shifts as input. R1 and R rate-constants were calculated by fitting the intensity profiles to monoexponential decay functions with FuDA (37). The {1H}-15N hetNOEs were calculated as the ratios of the peak intensities in the saturated and reference subspectra. The transverse relaxation rate constants (R2) were calculated from the R1 and R relaxation rates (see Eqs. S1 and S2 in the supporting material). Peak intensities of the diffusion NMR data were obtained with FuDA (37) and diffusion coefficients were calculated (see Eq. S3 in the supporting material).

SAXS measurements

The NAc-ORF6CTR was dissolved in 25 mM HEPES (pH 6.9), 150 mM NaCl and centrifuged at 5600 × g for 10 min at 4°C to give a final concentration of 2 mg/mL (800 μM). The SAXS data were obtained on Instrument B21 at Diamond Light Source (Didcot, UK) (41). Measurements were recorded at 37°C. Data sets of 26 frames with a frame exposure time of 1 s each were acquired. ScÅtter IV (42) was used for buffer subtraction and data reduction, in which the 26 frames were averaged. The Guinier peak analysis function in ScÅtter IV was used to calculate the radius of gyration (Rg). Furthermore, we rescaled the error bars for the SAXS intensities by a factor estimated through the Bayesian indirect Fourier transform using the BioXTAS Raw software (43).

MD simulations

All-atom metadynamic simulations were performed using GROMACS 2021.2 (44) patched with the open-source, community-developed PLUMED library version 2.7.1 (45). Simulations were setup using three different force fields and their corresponding water models: a03ws force field (17) with the TIP4P/2005 water model (46), a99SB-disp (18) with the TIP4P-D water model (47), and C36m (19) with the CHARMM-modified TIP3P water model (48). ORF6CTR was N-acetylated for the simulations. The initial starting structure for NAc-ORF6CTR was prepared as a linear peptide using PyMOL (49). Topology and coordinate files were then generated for the peptide using the heavy hydrogen flag for hydrogen mass repartitioning (50). The system was then solvated in a rhombic dodecahedron box with an initial volume of 775 nm3. Ions were added to neutralize the charge of the system and to maintain a concentration of 150 mM NaCl to match experiments. The solvated system was subjected to an energy minimization simulation using the steepest descent method with a target maximum force of 2,000 kJ mol−1 nm−1. The extended structure was collapsed by running a high-temperature simulation at 600 K for 20 ns in the canonical (NVT) ensemble with a 2 fs time step. To ensure a sufficiently large box size, the Rg was calculated for each frame in the 600 K NVT trajectory. From the 95th percentile of the Rg distribution, a collapsed peptide starting structure was randomly selected. This was repeated for each force field.

The collapsed peptide structure was then resolvated and neutralized at a concentration of 150 mM NaCl in a new rhombic dodecahedron box with volumes between 290 and 380 nm3 and 9,600–12,200 explicit water molecules, depending on the force field. Each force field system was subjected to the energy minimization simulation by steepest descent with a target maximum force of 2,000 kJ mol−1 nm−1. All subsequent simulations employed either a 4 or 5 fs time step to enhance computational efficiency. A 20 ns simulation at 600 K in the NVT ensemble was performed to obtain 128 diverse starting structures. These 128 starting structures were taken at even intervals after 5 ns of equilibration time. Thermalization was implemented for 1,250 ps at 310 K in the canonical (NVT) ensemble using the Bussi-Donadio-Parrinello thermostat (51). Then the density of each system was equilibrated for 50 ns at 310 K in the isothermal-isobaric (NPT) ensemble using the Parrinello-Rahman barostat (52).

Production runs were executed in the NPT ensemble with a target pressure of 1 bar and temperature of 310 K, employing the Parrinello-Rahman barostat (52), and using a 5 fs time step (50). LINCS constraints on all bonds were used (53). A Verlet list cutoff scheme was used for the nonbonded interactions. The van der Waals and Coulomb interactions were cut off at 1.2 nm for all a03ws and a99SB-disp simulations and at 0.95 nm for C36m. Long-range electrostatic effects were treated with the particle-mesh Ewald method (54). Metadynamics was performed with the parallel-bias, well-tempered, and multiple-walkers protocols using a Gaussian deposition stride set to 2.5 ps, an initial height of 1.2 kJ mol−1, and a bias factor of 30 for 128 replicas (23,24,25). Nine CVs were selected to enhance conformational sampling (see Eqs. S4–S8; Table S1). Metadynamic simulations were run for an accumulated time of 192 μs (a03ws run 1 and a03ws run 2), 181 μs (a99SB-disp), and 222 μs (C36m) until convergence was reached as assessed by blocking analysis for every CV (see Figs. S3, AC and S4; Eq. S9) (32,55).

Structural ensemble analysis

Statistical weights were calculated for each force field at the end of the simulation (see Eq. S10). Analysis was conducted using the open-source package MDTraj (56). Chemical shift predictions were back-calculated from the simulations at each time step using CamShift (57). SAXS intensity curves were calculated for each structure using Pepsi-SAXS (58) and we followed a previously reported method to perform the fitting (36). We used the BME software to update the metadynamics weights to better match experimental SAXS data (21).

Results

Sampling slowly exchanging proline cis/trans conformers with metadynamic simulations

To characterize the structural ensemble of ORF6CTR, all-atom explicit solvent MD simulations were set up with enhanced sampling provided by metadynamics (23,24). Given the sensitivity of simulations of disordered proteins to the force field used (59), we employed three diverse parameter sets (force fields) optimized for disordered proteins, capable of sampling both folded and disordered regions, or primarily extended conformations. These included the force fields a03ws (17), a99SB-disp (18), and C36m (19). The ORF6CTR peptide was N-acetylated (NAc-ORF6CTR) in the simulations to ensure a neutral charge at the N-terminus and to better match the peptide in its native context.

To obtain a good sampling of the highly heterogeneous free energy surface (FES) adopted by the expected disordered peptide (Fig. S1, A and B), we selected a variety of metadynamics CVs to explore both local and global properties of NAc-ORF6CTR. These CVs included total α-helical content, total β-sheet content, the radius of gyration (Rg), the number of salt bridges, the distance between the Cα of the first and last residue (end-to-end distance), the correlation between consecutive ψ dihedral angles (dihedral correlation), and the number of contacts between hydrophobic residues (25). To enhance proline cis/trans isomerization (Fig. 1 A), we also implemented two CVs on the P57 improper ζ dihedral angle and the P57 ψ dihedral angle (Fig. 1 B) (see the supporting material for more information on the CVs) (26,27). Each CV for each force field was assessed for convergence by blocking analysis (see Fig. S3, AC; Eq. S9) (32,55). We observed that the FES standard-error plateaus at a constant value for the CVs listed above in all three force fields, except for the P57 ζ improper dihedral angle CV in the a99SB-disp force field.

Figure 1.

Figure 1

All three force fields predict NAc-ORF6CTR is disordered but only the a03ws and C36m force fields predict that NAc-ORF6CTR undergoes proline cis/trans isomerization. (A) cis-proline (pink) and trans-proline (green) conformations with the positions of Cγ and Cβ indicated. (B) Definition of the ζ (gray) and ψ (purple) dihedral angles used as CVs to enhance proline cis/trans isomerization. (C) Rg probability distributions were calculated using kernel density estimates to compare the a03ws run 1 (orange), a99SB-disp (red), and C36m (blue) ensembles. Ensemble-averaged Rg are shown for each system. The associated error represents the standard deviation between the first and second halves of the analyzed trajectories. The experimental SAXS data and error (standard deviation from the Guinier analysis) are shown in gray. (D) Probability distributions for the ω angle about the Q56-P57 peptide bond in the a03ws run 1 (orange), a99SB-disp (red), and C36m (blue) ensembles. The thickness of the gray distribution represents the uncertainty. The cis-P57 populations are displayed for each system and the error represents the standard deviation between the first and second halves of the analyzed trajectories.

For the a03ws and C36m force fields, convergence was observed for the P57 ζ improper dihedral angle (Fig. S3, A and C). This, combined with convergence of the ψ dihedral angle, suggests that P57 cis/trans isomerization was converged despite the very slow timescale generally observed for proline isomerization, typically 10–103 s at room temperature (12,13). Sampling of the cis-P57 isomer was not observed in the simulation with the a99SB-disp force field (Fig. S3 B). This is potentially due to the exclusion of glycine and proline residues from torsion optimizations, along with the enhanced Lennard-Jones pair override (18). New and improved force fields, such as DES-Amber (60), have further advanced upon this and may be better suited for proteins with both ordered and disordered regions or for investigating a disordered protein in complex with a folded protein.

To assess the sampling of secondary structures, we calculated ensemble-averaged Cα minimum distance contact maps for all three force fields, which showed low probabilities of α-helix and β-sheet content (Fig. S5, A–C), and thus a lack of secondary structure. This result is consistent with predictions from s2D (Fig. S1 A), a protein secondary structure propensity predictor trained on solution-based NMR data (61), and the AlphaFold2 pLDDT score (62), in which low confidence scores have been found to correlate with disordered regions (Fig. S1 B) (63).

Assessing the conformational ensembles using experimental data

Histograms of the Rg, calculated from the ensembles obtained with metadynamic simulations, show that NAc-ORF6CTR adopts a continuum of states in all three force fields, ranging from collapsed to extended, suggesting that NAc-ORF6CTR exists as an ensemble of disordered conformations (Fig. 1 C), in agreement with the lack of stable secondary structure (Fig. S5, A–C). It is well established that various force fields for disordered proteins can predict dramatically different features for the same disordered protein sequence, particularly in terms of ensemble-averaged properties such as Rg values (59). The three force fields used herein predict slightly different Rg values (a03ws run 1: 1.194 ± 0.002 nm, a99SB-disp: 1.131 ± 0.005 nm, and C36m: 1.18179 ± 0.00002 nm; Fig. 1 C). To assess which of the simulations, if any, was the most in agreement with experimental measurements, we performed SAXS experiments on NAc-ORF6CTR and determined an ensemble-averaged Rg of 1.27 ± 0.02 nm. Notably, a03ws and C36m performed only marginally better than the coarse-grained force field CALVADOS, which predicts an Rg of 1.180 ± 0.007 nm (64,65,66). Furthermore, the coarse-grained disordered region ensemble predictor ALBATROSS (67) predicts an Rg of 1.29 nm, which is closer in agreement with the experimental Rg than any of the all-atom models tested here. Both coarse-grained force field predictors specifically used the Rg parameter for optimization (66,67).

In contrast to SAXS data, which report on global properties, NMR chemical shifts report on local dihedral angles, ring current shifts, and electrostatics (68), and are regularly used to improve and assess the accuracy of simulation ensembles because they are sensitive to secondary structure (21,25). To this end, we recorded and assigned 2D 1H–1H total correlation spectroscopy (TOCSY) and 1H-13C HSQC spectra of unlabeled NAc-ORF6CTR to obtain 13Cα, 13Cβ, 1Hα, and 1HN chemical shifts. Using CamShift (57), we back-calculated chemical shifts for all nonproline residues in our metadynamics ensembles for all three force fields. All experimentally determined chemical shifts were within the CamShift error (Fig. S6, A–C). To probe secondary structure in the Nac-ORF6CTR we recorded a 2D 1H–1H TOCSY temperature titration at eight temperature values between 5 and 37°C. The linear temperature coefficients of the 1HN chemical shifts are indicative of the hydrogen bonding state of individual amides (69). For example, at higher temperatures hydrogen bonds are weakened, which causes the relative upfield shifting of the 1HN chemical shift. All residues in the Nac-ORF6CTR have temperature coefficients more negative than −4.5 ppb K−1 (Fig. S7), suggesting that none of the 1HN are involved in a long-lived intramolecular hydrogen bond. This indicates that there is no substantial secondary structure in Nac-ORF6CTR, in agreement with the metadynamic simulations.

Solution-state NMR confirms proline cis/trans isomerization

Like force-field-dependent variations predicted in the Rg distributions, the cis-P57 population predictions also vary dramatically between different force fields (Fig. 1 D). Our analysis suggests cis-P57 populations of 18 ± 1, 0, and 1.6 ± 0.3% using the a03ws run 1, a99SB-disp, and C36m force fields, respectively. To assess the accuracy of these cis-P57 populations, we recombinantly expressed and purified uniformly 15N-labeled and 13C, 15N-labeled ORF6CTR for NMR experiments (see materials and methods). The production of isotopically labeled ORF6CTR in E. coli required a GST-tag, making it extremely difficult to N-acetylate the system. While N-acetylation has been shown to decrease the cis-proline population in small tetrapeptide systems, the effect of N-acetylation on cis-proline sampling is reduced upon increasing the length of the peptide (14). As the N-acetylation in ORF6CTR would occur at residue S41, we anticipate that potential cis/trans isomerization at residue P57, separated by 16 residues from the acetylation site, is unlikely to have its chemical environment influenced by the acetyl group (see below).

2D 1H-15N HSQC spectra show sharp resonances with a limited chemical shift dispersion in the 1HN dimension, suggesting that these resonances arise from disordered residues (Fig. 2 A). This finding agrees with spectra of the unlabeled, N-acetylated peptide (Fig. S8), and with the secondary structure predictions (Fig. S1, A and B). The chemical shifts in the ORF6CTR are very similar to those obtained from the NAc-ORF6CTR (see Table S2; Eq. S11). The a03ws and C36m simulations discussed above predict that the ORF6CTR conformational ensemble undergoes proline cis/trans isomerization about the Q56-P57 peptide bond. Indeed, in the 2D 1H-15N HSQC spectrum, two distinct signals with different signal intensities are observed for residues in proximity to P57 (Fig. 2 A).

Figure 2.

Figure 2

NMR verifies the disordered ORF6CTR undergoes proline cis/trans isomerization. (A) 2D 1H-15N HSQC spectrum of 300 μM 15N-labeled ORF6CTR measured at 15°C, pH 6.9, and 14.1 T. Residues colored in green originate from the trans-P57 conformational ensemble, while residues colored in pink and labeled with an asterisk () report on the cis-P57 conformation. The black peak is unassigned. (B) 2D 1H-13C strip taken from the 3D CC(CO)NH spectrum of the 300 μM 13C,15N-labeled ORF6CTR at the 15N chemical shift of M58. (C) The ORF6CTR13Cα, 13Cβ, and 1Hα chemical shifts were used to calculate the secondary structure propensity scores for both the cis-P57 and trans-P57 subensembles. A positive value indicates α-helical propensity, a negative value indicates a β-strand propensity, whereas a value near zero indicates random coil. (D) The cis-P57 populations at 15°C (solid fill) and 37°C (dashed fill) are shown for each well-resolved cis-P57 and trans-P57 peak in the 2D 1H-15N HSQC spectrum. The integrated peak volume was used to calculate the mean cis-P57 population and standard deviation across residues E55, Q56, and E59. (E) The van ’t Hoff analysis of P57 cis/trans isomerization determined from the integrated peak volume. The mean natural logarithm of the equilibrium constant for proline cis/trans isomerization about the Q56-P57 peptide bond as a function of temperature was calculated using residues E55, Q56, and E59. Error bars represent the standard deviation across the three residues used in the analysis. The van ’t Hoff linear fit yielded a cis-P57 population of 10 ± 2% at 37°C.

To verify that the additional set of minor peaks corresponds to residues in the cis-P57 conformational ensemble, we analyzed the 13C side-chain chemical shifts of P57 using a 3D CC(CO)NH experiment (Fig. 2 B)(70). The difference in chemical shifts between 13Cβ and 13Cγ is highly diagnostic of a cis-proline (∼9.5 ppm) or a trans-proline (∼4.5 ppm) peptide bond conformation (31,71). Similarly, peaks from the cis-P57 ensemble were detected for residues E55, Q56, M58, and E59. Most cis-P57 peaks are relatively close to their corresponding trans-P57 peak. For Q56, the residue preceding P57, the change in the 15N chemical shift between the cis and trans states is ∼3 ppm. A large change in chemical shift between cis-proline and trans-proline configurations is not uncommon for peaks of residues preceding the proline undergoing isomerization (71). Thus, NMR provides evidence of a cis-P57 conformation, while also enabling the characterization of the properties of both the proline cis and trans conformational subensembles. Initially, to characterize the secondary structure content of the cis-P57 and trans-P57 subensembles, we calculated the secondary structure propensity scores for the two configurations (40). These scores indicate that both conformations are disordered (Fig. 2 C), although there is a slight increase in β-strand propensity near the C-terminus for the cis-P57 subensemble.

In the slow-exchange regime, NMR peak intensities are proportional to the concentration of the species giving rise to the observed peaks (72). The population of the cis-P57 and trans-P57 conformations can thus be determined, assuming slow exchange and similar dynamics for the two conformations. The NMR experiments described above were recorded at 15°C to reduce signal loss due to backbone amide proton exchange with the solvent. The cis-P57 and trans-P57 peak intensities at 15°C were calculated for residues E55, Q56, M58, and E59 using the integrated peak volume or the peak height (Figs. 2 D and S9 A) (31). To enhance the cis-P57 population, study the system at a biologically relevant temperature, and match our metadynamic simulations, the cis-P57 and trans-P57 integrated peak volumes and peak heights were also analyzed at 37°C. Residue M58 was excluded from the analysis at 37°C since the two proline isomer peaks overlap at higher temperatures. The integrated peak volume analysis yielded a cis-P57 population of 5.4 ± 0.9% at 15°C and 8.8 ± 2.5% at 37°C (Fig. 2 D), which agrees most closely with the free energy difference between the cis-P57 and trans-P57 conformations in the a03ws run 1 metadynamic simulation.

To investigate the enthalpic and entropic factors influencing the formation of the cis-P57 state, we measured the equilibrium constant K = [cis]/[trans] for residues E55, Q56, and E59 at 10 temperature values between 2.5 and 40°C. From the van ’t Hoff analysis, using integrated peak volumes, we observed that the cis-P57 conformation is enthalpically disfavored (ΔΗ = 16 ± 2 kJ mol−1) but entropically favored (ΔS = 34 ± 7 J K−1 mol−1) over the trans-P57 conformation (Fig. 2 E). Similar results were obtained using peak heights, where ΔΗ = 14 ± 1 kJ mol−1 and ΔS = 25 ± 3 J K−1 mol−1 (Fig. S9 B). The cis-P57 population is consistent with results reported in the literature for other disordered protein systems (31) and is a reasonable population given that the Q56 residue, preceding P57 in ORF6CTR, is neither favorable nor unfavorable for proline isomerization (71).

Refining the metadynamics ensembles using Bayesian/maximum entropy

Given that the a03ws force field (a03ws run 1) produced an ensemble that agreed best with the experimental cis-P57 populations and SAXS data, we repeated the a03ws simulation (a03ws run 2) to assess reproducibility. To do this, and to avoid positive bias for the cis-P57 configuration, we randomly selected 128 frames only from the trans-P57 configuration within the initial a03ws run 1 ensemble. We then repeated the metadynamic simulation until convergence was reached at an accumulated simulation time of 192 μs (Fig. S4). The a03ws run 2 conformational ensemble was again consistent with the NMR data (Fig. S10), gave a cis-P57 population of 13 ± 1% (Fig. S11 A), and agreed with the characterizations from a03ws run 1 (Figs. 1 C, S5 A, and S11, B and C). The robust sampling of the cis-P57 conformation, even when using starting structures with 100% trans-P57, and the consistency with both SAXS and chemical shift data from two separate metadynamic simulations, indicate that the a03ws force field produces an NAc-ORF6CTR ensemble that is both robust and in agreement with the experimental data for this system.

To further refine the conformational ensemble with minimal perturbation, we used BME reweighting with the SAXS data for the two independent a03ws ensembles and the C36m ensemble (21,36). While all three ensembles were already in quite good agreement with the SAXS data (Figs. 3 A and S12 A), we were interested in exploring whether reweighting using SAXS data could improve the cis-P57 population predictions. As expected, reweighting resulted in a more extended ensemble-averaged Rg for the a03ws run 1 (1.292 ± 0.001 nm) and the a03ws run 2 (1.295 ± 0.005 nm) ensembles (Fig. 3 B), indicating that the reweighting was successful. Reweighting also increased the ensemble-averaged Rg of the C36m ensemble (1.299 ± 0.003 nm) and shifted the C36m Rg distribution to match the a03ws distributions (Fig. S12 BD).

Figure 3.

Figure 3

The a03ws force field most accurately predicts the disordered NAc-ORF6CTR ensemble and P57 cis/trans populations after SAXS BME reweighting. (A) The calculated SAXS intensities from the a03ws run 1 metadynamic simulation (orange) and the SAXS BME reweighted simulation (brown) compared with the experimental SAXS intensities (gray), and the error associated with each intensity (gray). (B) Rg probability distributions were calculated using kernel density estimates to compare the SAXS BME reweighted a03ws run 1 ensemble (brown) and the SAXS BME reweighted a03ws run 2 ensemble (teal). The associated error represents the standard deviation between the first and second halves of the analyzed trajectories. The experimental SAXS data and error (standard deviation from the Guinier analysis) are shown in gray. (C) Probability distributions for the ω angle about the Q56-P57 peptide bond in the SAXS BME reweighted a03ws run 1 ensemble (brown) and the SAXS BME reweighted a03ws run 2 ensemble (teal). The two reweighted independent a03ws ensembles were used to calculate the mean cis-P57 population and standard deviation. Additional plots for the SAXS BME reweighted a03ws run 2 and C36m ensembles can be found in the supporting material.

Across all three systems, we observed that SAXS BME reweighting led to a reduction in the cis-P57 populations (Figs. 3 C and S12 E). Although reweighting improved the consistency between predicted cis-P57 populations in both a03ws ensembles and the NMR data, it did not improve the prediction in the C36m ensemble. This discrepancy underscores the challenge of using BME reweighting to improve accuracy when experimental data are minimally informative on the desired observable, as in this case, where SAXS does not distinguish the cis-proline and trans-proline conformers. Using experimental data that are informative on both isomeric states could potentially improve the accuracy of the cis-proline population predictions. The reweighted ensembles were also compared with the chemical shift data and no significant changes were observed (Fig. S13, AC). Again, this indicates that the chemical shifts alone are not sufficient for reweighting the ensembles, as changes to the global conformation have limited influence on this parameter. Taking the two SAXS BME reweighted a03ws ensembles we calculated a mean cis-P57 population of 13 ± 2%, which is in good agreement with the population derived from NMR (∼10 ± 2%).

ORF6CTR is highly dynamic in both proline configurations

Using the reweighted metadynamics ensembles, we next sought to elucidate the structural differences between the cis-P57 and trans-P57 states. We observed that both conformations have similar Rg distributions (Fig. 4 A), with the cis-P57 state exhibiting a slightly smaller mean conformation (1.2510 ± 0.0004 nm) compared with the trans-P57 state (1.299 ± 0.003 nm). We then calculated ensemble-averaged Cα minimum distance contact maps for all 21 residues (Fig. 4 B) and the secondary structure populations for both P57 states (Fig. 4 C). Both analyses suggest an overall lack of stable secondary structure for the cis-P57 and the trans-P57 subensembles. Within the N-terminal region of ORF6CTR, the secondary structure propensity scores calculated from NMR chemical shifts and the secondary structure populations from the metadynamic simulations indicate the sampling of very transient α-helical structures (Figs. 2 C, 4 C, S14 A–C, and S15 A–C). Whereas the chemical shifts near the P57 residue indicate a slight increase in β-strand (Fig. 2 C). These near-miniscule changes in secondary structure were not predicted by the metadynamic simulations ( Figs. 4 C, S14 C, and S15 C).

Figure 4.

Figure 4

NAc-ORF6CTRcis-P57 and trans-P57 subensembles predicted by the a03ws run 1 SAXS BME reweighted metadynamic simulation have very similar profiles. (A) Rg probability distributions were calculated using kernel density estimates to compare the cis-P57 (pink) and trans-P57 (green) conformational subensembles. The associated error represents the standard deviation between the first and second halves of the analyzed trajectory. The experimental SAXS data and error (standard deviation from the Guinier analysis) are shown in gray. (B) Ensemble-averaged Cα minimum distance contact maps for the cis-P57 (pink) and trans-P57 (green) conformations. Color intensity represents the probability of a contact between residues, with white indicating no contact, and increasing color intensity suggesting higher contact probabilities. (C) Secondary structure populations for all residues in the cis-P57 (pink) and the trans-P57 (green) conformations based on SAXS BME statistical weights. Coil populations are represented by solid lines and α-helical populations by dashed lines. β-strand represents less than 0.5% of the population for each residue so was not included. Error bars (black) represent the standard deviation between the first and second halves of the analyzed trajectory. (D) Consistency of the 1HN and 15N cis-P57 and trans-P57 ORF6CTR experimental chemical shifts with the predicted chemical shifts from the reweighted NAc-ORF6CTR a03ws run 1 ensemble. The error in CamShift (silver) is shown. The standard deviation for predicted chemical shifts between the first and second halves of the analyzed trajectory were also plotted but are too small to see. See the supporting material for the a03ws run 2 and C36m analysis.

We also wondered whether the metadynamic simulations could accurately predict both the cis-P57 and trans-P57 chemical shifts using CamShift (57). Thus, we compared 1HN and 15N chemical shifts for residues E55, Q56, M58, and E59 with those predicted by CamShift (Figs. 4 D, S14 D, and S15 D). The errors in CamShift are generally greater than the chemical shift differences between cis-P57 and trans-P57 states, except for the backbone nitrogen for residue Q56, which precedes P57. For this experimental observable, CamShift accurately predicts the value for the trans-P57 state, but inaccurately predicts that of the cis-P57 state (Figs. 4 D, S14 D, and S15 D). As more data for proline isomerization of disordered proteins becomes readily available, we anticipate that force field parameters for this important state will improve, as well as the prediction of chemical shifts for cis-proline states.

To qualitatively compare the global features of the cis-P57 and trans-P57 ORF6CTR subensembles predicted by the metadynamic simulations to experiments, we recorded 1H-15N-DOSY-HSQC experiments at a magnetic field strength of 22.3 T. The P57 cis/trans isomerization exchange rate is slow compared with the diffusion timescales (Δ = 200 ms), and slow compared with the longitudinal relaxation (kexR1), making it possible to measure distinct diffusion rates for the cis-P57 and trans-P57 subensembles. We determined the average diffusion coefficients for the cis-P57 and trans-P57 conformations using data from residues Q56, M58, and E59. The diffusion measurements for the 15N-labeled ORF6CTR at pH 6.9 and 15°C revealed a diffusion coefficient of (2.52 ± 0.09) × 10−10 m2 s−1 for the cis-P57 conformation and (2.34 ± 0.05) × 10−10 m2 s−1 for the trans-P57 conformation. The observed faster diffusion coefficient for the cis-P57 subensemble suggests a slightly more compact conformation compared with the trans-P57, with statistical significance of 98% (determined via a one-tailed Welch’s t-test, df = 4, t = 3.179, p = 0.042). This finding is in qualitative agreement with our SAXS BME reweighted metadynamic simulations, which predict that the ORF6CTR cis-P57 subensemble is slightly more compact than the trans-P57 subensemble in the two independent a03ws simulations and the C36m simulation (Figs. 4 A, S14 A, and S15 A). However, small discrepancies are observed when comparing the relative differences in the predicted ensemble-averaged Rg values between the cis-P57 and trans-P57 subensembles with their corresponding NMR diffusion coefficients, highlighting the accuracy limitations of all three metadynamic simulations.

To characterize the dynamics of the ORF6CTR experimentally, we recorded the standard set of backbone heteronuclear 1H-15N relaxation experiments at 15°C and at two different static magnetic field strengths (14.1 and 18.8 T) (73). 15N R2 rates were determined from the corresponding 15N R rates, to minimize the effect of off-resonance effects and potential microsecond exchange dynamics (see Eqs. S1 and S2) (74). The 15N R1 and R2 relaxation rates report on protein motions occurring at timescales faster than the effective rotational correlation time, which is usually on the nanosecond timescale for disordered proteins. The rates obtained for the ORF6CTR cis-P57 and trans-P57 subensembles are very similar, and both show the expected bell shape for a random coil disordered state (Figs. 5, A and B and S16, A and B) (31,75).

Figure 5.

Figure 5

ORF6CTR is very dynamic in both the trans-P57 and cis-P57 conformations. (A) 15N longitudinal relaxation rates (R1), (B) 15N transverse relaxation rates (R2), (C) {1H}-15N steady-state hetNOEs, and (D) J(0) values of 300 μM 15N-labeled ORF6CTR at 15°C and pH 6.9. The backbone motions are shown for all peaks corresponding to the trans-P57 conformation (green), and only for the well-resolved peaks with a measurable intensity corresponding to the cis-P57 (pink) conformation. Error bars represent the fitting of the NMR data in (A) and (C). The error bars for (B) were calculated by error propagation. The error bars for (D) represent the standard deviations for each residue, which were calculated using Monte Carlo uncertainty propagation. All experiments were recorded at a static magnetic field strength of 14.1 T.

The {1H}-15N hetNOEs measured for ORF6CTR at a static magnetic field strengths of 14.1 T were mostly near 0 in the central region and decreased to −1 at the N- and C-termini, which is indicative of picosecond dynamics associated with a disordered protein (Fig. 5 C) (73). At higher static magnetic field strengths (18.8 T), the values of the steady-state hetNOEs increase relatively, suggesting the absence of slow motions in ORF6CTR (Fig. S16 C). We also evaluated the spectral density function, J(ω), at zero frequency, J(0), on a per-residue basis by reduced spectral density mapping (Fig. 5 D) (37). This analysis indicated that both P57 subensembles are highly dynamic on the nanosecond timescale and there are no substantial changes in secondary structure propensity between the two ORF6CTR P57 states.

Transverse 15N relaxation rates (R2) also report on the potential chemical exchange contribution (Rex) due to dynamics occurring on the microsecond to millisecond timescale. One way of distinguishing between the contributions to the 15N R2 rates arising from picosecond to nanosecond dynamics and those associated with slower microsecond to millisecond dynamics is through the use of the 15N exchange-free relaxation rate (Rdd) (37). Thus, we determined the Rdd rates at a static magnetic field strength of 14.1 T (Fig. S17 A). We did not detect any exchange occurring on the microsecond to millisecond timescale (Fig. S17, B and C), suggesting minimal long-range contacts and/or stabilization of secondary structures. This is consistent with the lack of secondary structure observed for both P57 ORF6CTR subensembles from the secondary structure propensity scores (Fig. 2 C).

Discussion

Significant progress has been made over the last decade to accurately characterize the conformational ensembles of disordered proteins with MD simulations and experimental techniques. The extensive amounts of data generated by MD simulations, alongside the capabilities of deep-learning, have now made it possible to predict some overall properties of disordered proteins from just the amino acid sequence alone (66,67). Using deep-learning techniques, global features such as the ensemble-averaged Rg can be determined within a couple of minutes. This approach achieves an accuracy that matches experimental data and surpasses the accuracy of all-atom explicit solvent MD simulations accumulated over hundreds of microseconds. While these coarse-grained deep-learning approaches have the potential to accurately characterize many global features of disordered proteins, atomic resolution is currently required to characterize local features, such as proline cis and trans configurations. Proline cis/trans isomerization, which in certain cases can dramatically alter the conformational ensemble of disordered proteins and thus regulate their function (6,7,15,31), is challenging to characterize with simulations given force field inaccuracies and the long simulation times required to reach convergence for this slow motion. Although NMR generally reports only on ensemble-averaged properties, it provides key insights into the local features of disordered proteins, specifically proline cis/trans isomerization, as these slow conversions result in separate NMR signals reporting on the individual proline conformations (14,31).

Using a combination of all-atom explicit solvent metadynamic simulations, and experimental NMR and SAXS, we have accurately characterized both the cis-P57 and trans-P57 conformational subensembles of the 21-residue ORF6CTR from SARS-CoV-2. This includes an accurate description of many of the features of both cis-P57 and trans-P57 subensembles, including secondary structure content, Rg, and P57 cis/trans populations. Applying the ζ and ψ CVs for P57 in ORF6CTR shows that it is possible to achieve convergence for a process that generally occurs on a timescale of seconds to minutes (12,13) and in a system that is highly computationally expensive to simulate given its heterogeneous FES. While all three force fields used here produced ensembles with Rg, chemical shifts predictions, and secondary structure content that agreed with experimental data, only the a03ws force field generated an ensemble with a cis-P57 population that agreed with those calculated from NMR. The a03ws force field appears to agree best with the experimental data in the case of the highly disordered peptide presented above; however, further considerations will be required if simulating other systems with distinct features. Our integrative approach will be particularly relevant for investigating the interactions of disordered protein ensembles with ligands, where proline isomerization can be influenced by binding. In the context of ORF6CTR, the proximity of residue P57 to the essential M58 residue (33,34,35) suggests that proline isomerization could play a regulatory role in binding to host proteins. Although the potential influence of proline isomerization has previously been discussed for the isomeric configuration of P57 in the bound-state crystal structure (34), the effects of proline isomerization on binding free energy and potential long-range dynamics remain open questions. We anticipate that the ongoing accumulation of data will further enhance the accuracy of force fields to characterize the cis-proline state, and agreement between experimental ensemble-averaged cis-proline conformations and MD resolved cis-proline conformations will only continue to improve.

Data and code availability

Code that supports the findings of this study is available from GitHub at https://github.com/hansenlab-ucl/orf6-ctr_cis_trans_conformers and data files are available from Zenodo at https://doi.org/10.5281/zenodo.13748215. NMR chemical shifts have been deposited in the Biological Magnetic Resonance Data Bank (BMRB) (www.bmrb.wisc.edu) under the following accession codes: 52459 for the ORF6CTR cis-P57 and trans-P57 configurations, and 52460 for the unlabeled NAc-ORF6CTR. Additional data related to this paper may be requested from the authors.

Acknowledgments

The authors acknowledge Carla Molteni, Kresten Lindorff-Larsen, Francesco Pesce, Charlie Buchanan, Carlo Camilloni, and Thomas Löhr for useful discussions. We acknowledge Greg Towers and Morten Andreas Govasli Larsen for providing the GenScript NAc-ORF6CTR peptide. This work used the ARCHER2 UK National Supercomputing Service (https://www.archer2.ac.uk) under the ARCHER2 Pioneer Projects proposal (project name e692). We acknowledge PRACE for awarding us access under the special allocation COVID-19 in the PRACE Call 21, on TGCC Joliot-Curie (project name COVID-1986). We acknowledge Diamond Light Source for time on beamline BL21 under proposal 32676, and we acknowledge Katsuaki Inoue and Nathan Cowieson for their assistance with the sample preparation, beamline experiments, and analysis. A.J.P. was supported by a Biotechnology and Biological Sciences Research Council (BBSRC) UK Research and Innovation (UKRI) funded studentship, BB/T008709/1, with the London Interdisciplinary Biosciences Consortium Doctoral Training Partnership. L.S.N. was supported by the UCL-Birkbeck Medical Research Council (MRC) DTP (MR/N013867/1). The Wellcome Trust is thanked for the award of a PhD studentship to S.M. (109073/Z/15/Z). G.T.H. was supported by Schmidt Science Fellows, Rosaland Franklin Research Fellowship from Newnham College, Cambridge, and a BBSRC Discovery Fellowship (BB/X009955/1). D.F.H was supported by the Engineering and Physical Sciences Research Council (EPSRC) grant (EP/X036782/1). The BBSRC (BB/R000255/1), Wellcome Trust (ref. 101569/z/13/z), and the EPSRC are acknowledged for supporting the NMR facility at University College London. Access to ultra-high field NMR spectrometers was supported by the Francis Crick Institute through provision of access to the MRC Biomedical NMR Centre. The Francis Crick Institute receives its core funding from Cancer Research UK (FC001029), the UK Medical Research Council (FC001029), and the Wellcome Trust (FC001029). For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising. This research is supported by the UKRI and EPSRC.

Author contributions

A.J.P., G.T.H., and C.D.L. performed and analyzed the metadynamic simulations. A.J.P., V.K.S., and S.M. produced the samples. A.J.P., A.M.F., L.S.N., V.K.S., and D.F.H. performed and analyzed NMR data. A.J.P. analyzed the SAXS data. A.B.T., G.T.H., C.D.L., and D.F.H. designed and supervised the research. All authors discussed the results and wrote the paper.

Declaration of interests

The authors declare no competing interests.

Editor: Jianhan Chan.

Footnotes

Supporting material can be found online at https://doi.org/10.1016/j.bpj.2024.09.028.

Contributor Information

Christian D. Lorenz, Email: chris.lorenz@kcl.ac.uk.

D. Flemming Hansen, Email: d.hansen@ucl.ac.uk.

Supporting citations

References (76,77,78,79,80,81,82,83,84,85,86,87,88) appear in the supporting material.

Supporting material

Document S1. Figures S1–S17, Tables S1 and S2, and Equations S1–S11
mmc1.pdf (2.9MB, pdf)
Document S2. Article plus supporting material
mmc2.pdf (5.3MB, pdf)

References

  • 1.Heller G.T., Sormanni P., Vendruscolo M. Targeting disordered proteins with small molecules using entropy. Trends Biochem. Sci. 2015;40:491–496. doi: 10.1016/j.tibs.2015.07.004. [DOI] [PubMed] [Google Scholar]
  • 2.Iakoucheva L.M., Brown C.J., et al. Dunker A.K. Intrinsic Disorder in Cell-signaling and Cancer-associated Proteins. J. Mol. Biol. 2002;323:573–584. doi: 10.1016/s0022-2836(02)00969-5. [DOI] [PubMed] [Google Scholar]
  • 3.Peng Z., Yan J., et al. Kurgan L. Exceptionally abundant exceptions: Comprehensive characterization of intrinsic disorder in all domains of life. Cell. Mol. Life Sci. 2015;72:137–151. doi: 10.1007/s00018-014-1661-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Theillet F.-X., Kalmar L., et al. Uversky V.N. The alphabet of intrinsic disorder: I. Act like a Pro: On the abundance and roles of proline residues in intrinsically disordered proteins. Intrinsically Disord. Proteins. 2013;1 doi: 10.4161/idp.24360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Levitt M. Conformational preferences of amino acids in globular proteins. Biochemistry. 1978;17:4277–4285. doi: 10.1021/bi00613a026. [DOI] [PubMed] [Google Scholar]
  • 6.Theisen F.F., Prestel A., et al. Skriver K. Molecular switching in transcription through splicing and proline-isomerization regulates stress responses in plants. Nat. Commun. 2024;15:592. doi: 10.1038/s41467-024-44859-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zosel F., Mercadante D., et al. Schuler B. A proline switch explains kinetic heterogeneity in a coupled folding and binding reaction. Nat. Commun. 2018;9:3332. doi: 10.1038/s41467-018-05725-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Martin E.W., Holehouse A.S., et al. Mittag T. Sequence determinants of the conformational properties of an intrinsically disordered protein prior to and upon multisite phosphorylation. J. Am. Chem. Soc. 2016;138:15323–15335. doi: 10.1021/jacs.6b10272. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Zhang X., Vigers M., et al. Kosik K.S. The proline-rich domain promotes Tau liquid-liquid phase separation in cells. J. Cell Biol. 2020;219 doi: 10.1083/jcb.202006054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cheng H.N., Bovey F.A. Cis-Trans equilibrium and kinetic studies of acetyl-L-proline and glycyl-L-proline. Biopolymers. 1977;16:1465–1472. doi: 10.1002/bip.1977.360160707. [DOI] [PubMed] [Google Scholar]
  • 11.Camilloni C., Sahakyan A.B., et al. Vendruscolo M. Cyclophilin a catalyzes proline isomerization by an electrostatic handle mechanism. Proc. Natl. Acad. Sci. USA. 2014;111:10203–10208. doi: 10.1073/pnas.1404220111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Grathwohl C., Wüthrich K. NMR studies of the rates of proline cis-trans isomerization in oligopeptides. Biopolymers. 1981;20:2623–2633. [Google Scholar]
  • 13.Reimer U., Scherer G., et al. Fischer G. Side-chain effects on peptidyl-prolyl cis/trans isomerisation. J. Mol. Biol. 1998;279:449–460. doi: 10.1006/jmbi.1998.1770. [DOI] [PubMed] [Google Scholar]
  • 14.Alderson T.R., Lee J.H., et al. Bax A. Propensity for cis-Proline Formation in Unfolded Proteins. Chembiochem. 2018;19:37–42. doi: 10.1002/cbic.201700548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Mateos B., Conrad-Billroth C., et al. Pierattelli R. The Ambivalent Role of Proline Residues in an Intrinsically Disordered Protein: From Disorder Promoters to Compaction Facilitators. J. Mol. Biol. 2020;432:3093–3111. doi: 10.1016/j.jmb.2019.11.015. [DOI] [PubMed] [Google Scholar]
  • 16.Alcantara J., Stix R., et al. Ball K.A. An Unbound Proline-Rich Signaling Peptide Frequently Samples Cis Conformations in Gaussian Accelerated Molecular Dynamics Simulations. Front. Mol. Biosci. 2021;8 doi: 10.3389/fmolb.2021.734169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Best R.B., Zheng W., Mittal J. Balanced Protein-Water Interactions Improve Properties of Disordered Proteins and Non-Specific Protein Association. J. Chem. Theor. Comput. 2014;10:5113–5124. doi: 10.1021/ct500569b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Robustelli P., Piana S., Shaw D.E. Developing a molecular dynamics force field for both folded and disordered protein states. Proc. Natl. Acad. Sci. USA. 2018;115:E4758–E4766. doi: 10.1073/pnas.1800690115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Huang J., Rauscher S., et al. MacKerell A.D. CHARMM36m: an improved force field for folded and intrinsically disordered proteins. Nat. Methods. 2017;14:71–73. doi: 10.1038/nmeth.4067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bonomi M., Heller G.T., et al. Vendruscolo M. Principles of protein structural ensemble determination. Curr. Opin. Struct. Biol. 2017;42:106–116. doi: 10.1016/j.sbi.2016.12.004. [DOI] [PubMed] [Google Scholar]
  • 21.Bottaro S., Bengtsen T., Lindorff-Larsen K. Integrating Molecular Simulation and Experimental Data: A Bayesian/Maximum Entropy Reweighting Approach. Methods Mol. Biol. 2020;2112:219–240. doi: 10.1007/978-1-0716-0270-6_15. [DOI] [PubMed] [Google Scholar]
  • 22.Piana S., Lindorff-Larsen K., Shaw D.E. Atomic-level description of ubiquitin folding. Proc. Natl. Acad. Sci. USA. 2013;110:5915–5920. doi: 10.1073/pnas.1218321110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Barducci A., Bonomi M., Parrinello M. Metadynamics. WIREs Comput. Mol. Sci. 2011;1:826–843. [Google Scholar]
  • 24.Pfaendtner J., Bonomi M. Efficient Sampling of High-Dimensional Free-Energy Landscapes with Parallel Bias Metadynamics. J. Chem. Theor. Comput. 2015;11:5062–5067. doi: 10.1021/acs.jctc.5b00846. [DOI] [PubMed] [Google Scholar]
  • 25.Heller G.T., Aprile F.A., et al. Vendruscolo M. Small-molecule sequestration of amyloid-β as a drug discovery strategy for Alzheimer’s disease. Sci. Adv. 2020;6 doi: 10.1126/sciadv.abb5924. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Melis C., Bussi G., et al. Molteni C. Trans-cis switching mechanisms in proline analogues and their relevance for the gating of the 5-HT3 receptor. J. Phys. Chem. B. 2009;113:12148–12153. doi: 10.1021/jp9046962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Maschio M.C., Fregoni J., et al. Corni S. Proline isomerization effects in the amyloidogenic protein β 2-microglobulin. Phys. Chem. Chem. Phys. 2021;23:356–367. doi: 10.1039/d0cp04780e. [DOI] [PubMed] [Google Scholar]
  • 28.Heller G.T., Yu L., Hansen D.F. NMR Spectroscopy for Probing Functional Dynamics at Biological Interfaces. The Royal Society of Chemistry. 2022. Chapter 14: Characterising Intrinsically Disordered Proteins Using NMR Spectroscopy and MD Simulations; pp. 383–410. [Google Scholar]
  • 29.Jensen M.R., Zweckstetter M., et al. Blackledge M. Exploring free-energy landscapes of intrinsically disordered proteins at atomic resolution using NMR spectroscopy. Chem. Rev. 2014;114:6632–6660. doi: 10.1021/cr400688u. [DOI] [PubMed] [Google Scholar]
  • 30.Felli I.C., Pierattelli R. Novel methods based on 13C detection to study intrinsically disordered proteins. J. Magn. Reson. 2014;241:115–125. doi: 10.1016/j.jmr.2013.10.020. [DOI] [PubMed] [Google Scholar]
  • 31.Alderson T.R., Benesch J.L.P., Baldwin A.J. Proline isomerization in the C-terminal region of HSP27. Cell Stress Chaperones. 2017;22:639–651. doi: 10.1007/s12192-017-0791-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Pesce F., Newcombe E.A., et al. Lindorff-Larsen K. Assessment of models for calculating the hydrodynamic radius of intrinsically disordered proteins. Biophys. J. 2023;122:310–321. doi: 10.1016/j.bpj.2022.12.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Li T., Wen Y., et al. Ji X. Molecular Mechanism of SARS-CoVs Orf6 Targeting the Rae1–Nup98 Complex to Compete With mRNA Nuclear Export. Front. Mol. Biosci. 2021;8 doi: 10.3389/fmolb.2021.813248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Gao X., Tian H., et al. Cui S. Structural basis for Sarbecovirus ORF6 mediated blockage of nucleocytoplasmic transport. Nat. Commun. 2022;13:4782. doi: 10.1038/s41467-022-32489-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Miorin L., Kehrer T., et al. García-Sastre A. SARS-CoV-2 Orf6 hijacks Nup98 to block STAT nuclear import and antagonize interferon signaling. Proc. Natl. Acad. Sci. USA. 2020;117:28344–28354. doi: 10.1073/pnas.2016650117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Ahmed M.C., Skaanning L.K., et al. Lindorff-Larsen K. Refinement of α-Synuclein Ensembles Against SAXS Data: Comparison of Force Fields and Methods. Front. Mol. Biosci. 2021;8 doi: 10.3389/fmolb.2021.654333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Hansen D.F., Yang D., et al. Kay L.E. An exchange-free measure of 15N transverse relaxation: an NMR spectroscopy application to the study of a folding intermediate with pervasive chemical exchange. J. Am. Chem. Soc. 2007;129:11468–11479. doi: 10.1021/ja072717t. [DOI] [PubMed] [Google Scholar]
  • 38.Delaglio F., Grzesiek S., et al. Bax A. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR. 1995;6:277–293. doi: 10.1007/BF00197809. [DOI] [PubMed] [Google Scholar]
  • 39.Lee W., Tonelli M., Markley J.L. NMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy. Bioinformatics. 2015;31:1325–1327. doi: 10.1093/bioinformatics/btu830. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Marsh J.A., Singh V.K., et al. Forman-Kay J.D. Sensitivity of secondary structure propensities to sequence differences between α- and γ-synuclein: Implications for fibrillation. Protein Sci. 2006;15:2795–2804. doi: 10.1110/ps.062465306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Cowieson N.P., Edwards-Gayle C.J.C., et al. Rambo R.P. Beamline B21: high-throughput small-angle X-ray scattering at Diamond Light Source. J. Synchrotron Radiat. 2020;27:1438–1446. doi: 10.1107/S1600577520009960. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Tully M.D., Tarbouriech N., et al. Hutin S. Analysis of SEC-SAXS data via EFA deconvolution and Scatter. J. Vis. Exp. 2021;167 doi: 10.3791/61578. [DOI] [PubMed] [Google Scholar]
  • 43.Hopkins J.B., Gillilan R.E., Skou S. BioXTAS RAW: Improvements to a free open-source program for small-angle X-ray scattering data reduction and analysis. J. Appl. Crystallogr. 2017;50:1545–1553. doi: 10.1107/S1600576717011438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Abraham M.J., Murtola T., et al. Lindahl E. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;1–2:19–25. [Google Scholar]
  • 45.Bonomi M., Bussi G., et al. White A. Promoting transparency and reproducibility in enhanced molecular simulations. Nat. Methods. 2019;16:670–673. doi: 10.1038/s41592-019-0506-8. [DOI] [PubMed] [Google Scholar]
  • 46.Abascal J.L.F., Vega C. A general purpose model for the condensed phases of water: TIP4P/2005. J. Chem. Phys. 2005;123 doi: 10.1063/1.2121687. [DOI] [PubMed] [Google Scholar]
  • 47.Piana S., Donchev A.G., et al. Shaw D.E. Water dispersion interactions strongly influence simulated structural properties of disordered protein states. J. Phys. Chem. B. 2015;119:5113–5123. doi: 10.1021/jp508971m. [DOI] [PubMed] [Google Scholar]
  • 48.Jorgensen W.L., Chandrasekhar J., et al. Klein M.L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. [Google Scholar]
  • 49.Schrödinger L., DeLano W. PyMOL. http://www.pymol.org/pymol
  • 50.Hopkins C.W., Le Grand S., et al. Roitberg A.E. Long-Time-Step Molecular Dynamics through Hydrogen Mass Repartitioning. J. Chem. Theor. Comput. 2015;11:1864–1874. doi: 10.1021/ct5010406. [DOI] [PubMed] [Google Scholar]
  • 51.Bussi G., Donadio D., Parrinello M. Canonical sampling through velocity rescaling. J. Chem. Phys. 2007;126 doi: 10.1063/1.2408420. [DOI] [PubMed] [Google Scholar]
  • 52.Parrinello M., Rahman A. Polymorphic transitions in single crystals: A new molecular dynamics method. J. Appl. Phys. 1981;52:7182–7190. [Google Scholar]
  • 53.Hess B., Bekker H., et al. Fraaije J.G.E.M. LINCS: A Linear Constraint Solver for Molecular Simulations. J. Comput. Chem. 1997;18:1463–1472. [Google Scholar]
  • 54.Essmann U., Perera L., et al. Pedersen L.G. A smooth particle mesh Ewald method. J. Chem. Phys. 1995;103:8577–8593. [Google Scholar]
  • 55.Bussi G., Tribello G.A. Analyzing and Biasing Simulations with PLUMED. Methods Mol. Biol. 2019;2022:529–578. doi: 10.1007/978-1-4939-9608-7_21. [DOI] [PubMed] [Google Scholar]
  • 56.McGibbon R.T., Beauchamp K.A., et al. Pande V.S. MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics Trajectories. Biophys. J. 2015;109:1528–1532. doi: 10.1016/j.bpj.2015.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kohlhoff K.J., Robustelli P., et al. Vendruscolo M. Fast and accurate predictions of protein NMR chemical shifts from interatomic distances. J. Am. Chem. Soc. 2009;131:13894–13895. doi: 10.1021/ja903772t. [DOI] [PubMed] [Google Scholar]
  • 58.Grudinin S., Garkavenko M., Kazennov A. Pepsi-SAXS: an adaptive method for rapid and accurate computation of small-angle X-ray scattering profiles. Acta Crystallogr. D Struct. Biol. 2017;73:449–464. doi: 10.1107/S2059798317005745. [DOI] [PubMed] [Google Scholar]
  • 59.Rauscher S., Gapsys V., et al. Grubmüller H. Structural Ensembles of Intrinsically Disordered Proteins Depend Strongly on Force Field: A Comparison to Experiment. J. Chem. Theor. Comput. 2015;11:5513–5524. doi: 10.1021/acs.jctc.5b00736. [DOI] [PubMed] [Google Scholar]
  • 60.Piana S., Robustelli P., et al. Shaw D.E. Development of a Force Field for the Simulation of Single-Chain Proteins and Protein-Protein Complexes. J. Chem. Theor. Comput. 2020;16:2494–2507. doi: 10.1021/acs.jctc.9b00251. [DOI] [PubMed] [Google Scholar]
  • 61.Sormanni P., Camilloni C., et al. Vendruscolo M. The s2D Method: Simultaneous Sequence-Based Prediction of the Statistical Populations of Ordered and Disordered Regions in Proteins. J. Mol. Biol. 2015;427:982–996. doi: 10.1016/j.jmb.2014.12.007. [DOI] [PubMed] [Google Scholar]
  • 62.Jumper J., Evans R., et al. Hassabis D. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Zhao B., Ghadermarzi S., Kurgan L. Comparative evaluation of AlphaFold2 and disorder predictors for prediction of intrinsic disorder, disorder content and fully disordered proteins. Comput. Struct. Biotechnol. J. 2023;21:3248–3258. doi: 10.1016/j.csbj.2023.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Eastman P., Swails J., et al. Pande V.S. OpenMM 7: Rapid development of high performance algorithms for molecular dynamics. PLoS Comput. Biol. 2017;13 doi: 10.1371/journal.pcbi.1005659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Tesei G., Lindorff-Larsen K. Improved predictions of phase behaviour of intrinsically disordered proteins by tuning the interaction range. Open Res. Eur. 2022;2:94. doi: 10.12688/openreseurope.14967.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Tesei G., Trolle A.I., et al. Lindorff-Larsen K. Conformational ensembles of the human intrinsically disordered proteome. Nature. 2024;626:897–904. doi: 10.1038/s41586-023-07004-5. [DOI] [PubMed] [Google Scholar]
  • 67.Lotthammer J.M., Ginell G.M., et al. Holehouse A.S. Direct prediction of intrinsically disordered protein conformational properties from sequences. Nat. Methods. 2024;21:465–476. doi: 10.1038/s41592-023-02159-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Oldfield E. Chemical shifts and three-dimensional protein structures. J. Biomol. NMR. 1995;5:217–225. doi: 10.1007/BF00211749. [DOI] [PubMed] [Google Scholar]
  • 69.Baxter N.J., Williamson M.P. Temperature dependence of 1H chemical shifts in proteins. J. Biomol. NMR. 1997;9:359–369. doi: 10.1023/a:1018334207887. [DOI] [PubMed] [Google Scholar]
  • 70.Grzesiek S., Anglister J., Bax A. Correlation of Backbone Amide and Aliphatic Side-Chain Resonances in 13C/15N-Enriched Proteins by Isotropic Mixing of 13C Magnetization. J. Magn. Reson. B. 1993;101:114–119. [Google Scholar]
  • 71.Shen Y., Bax A. Prediction of Xaa-Pro peptide bond conformation from sequence and chemical shifts. J. Biomol. NMR. 2010;46:199–204. doi: 10.1007/s10858-009-9395-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Hansen D.F., Led J.J. Implications of using approximate Bloch-McConnell equations in NMR analyses of chemically exchanging systems: application to the electron self-exchange of plastocyanin. J. Magn. Reson. 2003;163:215–227. doi: 10.1016/s1090-7807(03)00062-4. [DOI] [PubMed] [Google Scholar]
  • 73.Kay L.E., Torchia D.A., Bax A. Backbone Dynamics of Proteins As Studied by 15N Inverse Detected Heteronuclear NMR Spectroscopy: Application to Staphylococcal Nuclease. Biochemistry. 1989;28:8972–8979. doi: 10.1021/bi00449a003. [DOI] [PubMed] [Google Scholar]
  • 74.Korzhnev D.M., Skrynnikov N.R., et al. Kay L.E. An NMR experiment for the accurate measurement of heteronuclear spin-lock relaxation rates. J. Am. Chem. Soc. 2002;124:10743–10753. doi: 10.1021/ja0204776. [DOI] [PubMed] [Google Scholar]
  • 75.Abyzov A., Salvi N., et al. Blackledge M. Identification of Dynamic Modes in an Intrinsically Disordered Protein Using Temperature-Dependent NMR Relaxation. J. Am. Chem. Soc. 2016;138:6240–6251. doi: 10.1021/jacs.6b02424. [DOI] [PubMed] [Google Scholar]
  • 76.Kay L., E, Keifer P., Saarinen T. Pure Absorption Gradient Enhanced Heteronuclear Single Quantum Correlation Spectroscopy with Improved Sensitivity. J. Am. Chem. Soc. 1992;114:10663–10665. [Google Scholar]
  • 77.Bermel W., Bertini I., et al. Vasos P.R. Complete Assignment of Heteronuclear Protein Resonances by Protonless NMR Spectroscopy. Angew. Chem., Int. Ed. Engl. 2005;44:3089–3092. doi: 10.1002/anie.200461794. [DOI] [PubMed] [Google Scholar]
  • 78.Kay L.E., Ikura M., et al. Bax A. Three-dimensional triple-resonance NMR spectroscopy of isotopically enriched proteins. J. Magn. Reson. 1990;89:496–514. doi: 10.1016/j.jmr.2011.09.004. [DOI] [PubMed] [Google Scholar]
  • 79.Clubb R.T., Thanabal V., Wagner G. A constant-time three-dimensional triple-resonance pulse scheme to correlate intraresidue 1HN, 15N, and 13C′ chemical shifts in 15N-13C-labelled proteins. J. Magn. Reson. 1992;97:213–217. [Google Scholar]
  • 80.Wittekind M., Mueller L. HNCACB, a High-Sensitivity 3D NMR Experiment to Correlate Amide-Proton and Nitrogen Resonances with the Alpha- and Beta-Carbon Resonances in Proteins. J. Magn. Reson. B. 1993;101:201–205. [Google Scholar]
  • 81.Frenkiel T., Bauer C., et al. Feeney J. HMQC-NOESY-HMQC, a three-dimensional NMR experiment which allows detection of nuclear overhauser effects between protons with overlapping signals. J. Magn. Reson. 1990;90:420–425. [Google Scholar]
  • 82.Marion D., Driscoll P.C., et al. Clore G.M. Overcoming the overlap problem in the assignment of 1H NMR spectra of larger proteins by use of three-dimensional heteronuclear 1H-15N Hartmann-Hahn-multiple quantum coherence and nuclear Overhauser-multiple quantum coherence spectroscopy: application to interleukin 1 beta. Biochemistry. 1989;28:6150–6156. doi: 10.1021/bi00441a004. [DOI] [PubMed] [Google Scholar]
  • 83.Farrow N.A., Muhandiram R., et al. Kay L.E. Backbone dynamics of a free and phosphopeptide-complexed Src homology 2 domain studied by 15N NMR relaxation. Biochemistry. 1994;33:5984–6003. doi: 10.1021/bi00185a040. [DOI] [PubMed] [Google Scholar]
  • 84.Hansen D.F., Kay L.E. Improved magnetization alignment schemes for spin-lock relaxation experiments. J. Biomol. NMR. 2007;37:245–255. doi: 10.1007/s10858-006-9126-6. [DOI] [PubMed] [Google Scholar]
  • 85.Ferrage F., Cowburn D., Ghose R. Accurate sampling of high-frequency motions in proteins by steady-state 15N-{ 1H} nuclear overhauser effect measurements in the presence of cross-correlated relaxation. J. Am. Chem. Soc. 2009;131:6048–6049. doi: 10.1021/ja809526q. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Wu D.H., Chen A., Johnson C.S. An Improved Diffusion-Ordered Spectroscopy Experiment Incorporating Bipolar-Gradient Pulses. J. Magn. Reson. 1995;115:260–264. [Google Scholar]
  • 87.Stejskal E.O., Tanner J.E. Spin Diffusion Measurements: Spin Echoes in the Presence of a Time - Dependent Field Gradient. J. Chem. Phys. 1965;42:288–292. [Google Scholar]
  • 88.Pietrucci F., Laio A. A Collective Variable for the Efficient Exploration of Protein Beta-Sheet Structures: Application to SH3 and GB1. J. Chem. Theor. Comput. 2009;5:2197–2201. doi: 10.1021/ct900202f. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S17, Tables S1 and S2, and Equations S1–S11
mmc1.pdf (2.9MB, pdf)
Document S2. Article plus supporting material
mmc2.pdf (5.3MB, pdf)

Data Availability Statement

Code that supports the findings of this study is available from GitHub at https://github.com/hansenlab-ucl/orf6-ctr_cis_trans_conformers and data files are available from Zenodo at https://doi.org/10.5281/zenodo.13748215. NMR chemical shifts have been deposited in the Biological Magnetic Resonance Data Bank (BMRB) (www.bmrb.wisc.edu) under the following accession codes: 52459 for the ORF6CTR cis-P57 and trans-P57 configurations, and 52460 for the unlabeled NAc-ORF6CTR. Additional data related to this paper may be requested from the authors.


Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES