Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 May 2.
Published in final edited form as: J Phys Chem B. 2013 Apr 11;117(17):4740–4754. doi: 10.1021/jp311723a

Structural Refinement from Restrained-Ensemble Simulations Based on EPR/DEER Data: Application to T4 Lysozyme

Shahidul M Islam 1, Richard A Stein 2, Hassane Mchaourab 2, Benoît Roux 1,*
PMCID: PMC3684008  NIHMSID: NIHMS467321  PMID: 23510103

Abstract

DEER (Double Electron Electron Resonance) is a powerful pulsed ESR (electron spin resonance) technique allowing the determination of distance histograms between pairs of nitroxide spin-labels linked to a protein in a native-like solution environment. However, exploiting the huge amount of information provided by ESR/DEER histograms to refine structural models is extremely challenging. In this study, a restrained ensemble (RE) molecular dynamics (MD) simulation methodology is developed to address this issue. In RE simulation, the spin-spin distance distribution histograms calculated from a multiple-copy MD simulation are enforced, via a global ensemble-based energy restraint, to match those obtained from ESR/DEER experiments. The RE simulation is applied to 51 ESR/DEER distance histogram data from spin-labels inserted at 37 different positions in T4 lysozyme (T4L). The rotamer population distribution along the five dihedral angles connecting the nitroxide ring to the protein backbone is determined and shown to be consistent with available information from X-ray crystallography. For the purpose of structural refinement, the concept of a simplified nitroxide dummy spin-label is designed and parameterized on the basis of these all-atom RE simulations with explicit solvent. It is demonstrated that RE simulations with the dummy nitroxide spin-labels imposing the ESR/DEER experimental distance distribution data are able to systematically correct and refine a series of distorted T4L structures, while simple harmonic distance restraints are unsuccessful. This computationally efficient approach allows experimental restraints from DEER experiments to be incorporated into RE simulations for efficient structural refinement.

Keywords: Structural refinement, spin-labeled T4 Lysozyme, ESR/DEER, rotamer population, molecular dynamics (MD), restrained ensemble (RE)

INTRODUCTION

Structural information is critical to understand the function of proteins. While X-ray crystallography is the best technique to obtain high-resolution structural information, it is also necessary to examine the protein in its native environment, free from the constraints imposed by the crystal lattice, to achieve a complete picture of its function. Two important spectroscopic approaches occupy a central role in these efforts, nuclear magnetic resonance (NMR) and electron spin resonance (ESR). One advantage of NMR is that it can report directly on the protein conformation through the nuclei in the system. In contrast, ESR requires the introduction of spectroscopic probes into the system via site-directed spin-labeling (SDSL) techniques, with the possible drawback that these could introduce unwanted perturbations. On the other hand, NMR investigations are limited by the size of the protein system, whereas the strong signal from the spin-label unpaired electron confers a sensitivity advantage to ESR even in the case of extremely large macromolecular complexes. Of particular interest, DEER (Double Electron-Electron Resonance) is a powerful ESR technique that makes it possible to measure the distance histogram between a pair of spin-labels inserted in a macromolecule.1-3 While the experimental distance histograms between pairs of spin-labels contain a lot of information, designing a structural refinement strategy able to exploit the huge amount of data available from ESR/DEER is confronted with a number of challenges.

Any probe-based spectroscopic technique requires one to translate experimental measurements into valid information about the protein structure. In the case of ESR spectroscopy, such an operation is complicated by the internal dynamics of the spin-labels, rendering the extraction of detailed structural and dynamic information of the protein difficult. The most commonly used nitroxide spin-label used in ESR is MTSSL (1-oxyl-2,2,5,5-tetramethylpyrroline-3-methyl-methanethiosulfonate), which is typically linked to a cysteine residue in the protein through a disulphide bond (Figure 1A). The MTSSL moiety has five dihedral bonds, resulting in a highly flexible side-chain once it is linked to a protein. A reliable characterization of the dynamical properties of the spin-label is essential to fully exploit the available structural information from ESR/DEER spectra. The well-characterized soluble protein, T4 lysozyme (T4L, Figure 1B), has provided an important model system in the characterization of the conformational properties of MTSSL. Structural and spectroscopic analyses revealed general properties regarding the spin-label motion in protein capture in the “χ45 model” for solvent exposed helix surface (SEHS) sites in T4L.4 According to this model, internal motion of the spin-label side chain is essentially restricted to rotations about the dihedrals χ4 and χ5 and the remaining dihedrals are effectively “locked” on the ESR time scale. The χ3 disulphide torsion is opposed by a large energy barrier,5 while the χ1 and χ2 torsions are hindered by the formation of a hydrogen bond between the sulfur atom of the spin-label and the backbone amide6 and Cα.7 Such sulfur-backbone contacts are indeed observed in a number of X-ray crystal structures of T4L with spin-labels8-15 in support of the χ45 model. Despite these efforts, there remains a paucity of data on the population of accessible rotamers of the MTSSL spin-label.

Figure 1.

Figure 1

(A) Side-chain resulting from linking MTSSL (1-oxyl-2,2,5,5-tetramethylpyrroline-3-methyl-methanethiosulfonate) to a cysteine through a disulfide bond. (B) Cartoon representation of T4 lysozyme with the 37 spin-labeled sites (colored in blue).

Computations can provide a “virtual route” to link the atomic structures to the experimental ESR observations. In principle, quantum mechanical ab initio methods offer the most accurate approaches to characterize the energy and conformations of the spin-labels.16-18 However, these methods are too generally onerous to treat large protein systems and account for thermal fluctuations. Molecular dynamics (MD) simulations based on molecular mechanical force fields offer a realistic alternative strategy to calculate ESR observables.19,20 For example, MD simulations were able to quantitatively reproduce multi-frequency spectra of spin-labels attached to T4 lysozyme with remarkable accuracy.21 Furthermore, the distance distributions from ESR/DEER experiments have been used to ascertain the validity and accuracy of MD simulations and structural models.2,3,22-27 Computational modeling methods, such as the Multiscale Modeling of Macromolecular systems (MMM) software package of Yevhen Polyhach and Gunnar Jeschke 28,29, and the PRONOX algorithm of Hatmal et al,30 have been developed to determine the inter-label distances distributions based on the analysis of spin-label rotamers. Attempts were also made to build structural models based on elastic network model and MD simulations that satisfies ESR/DEER distance restraints. 23 In spite of these efforts, there remains a need to develop a robust and effective computational method for making best use of ESR/DEER data in structural refinement.

Translating the ESR/DEER distance histograms into constraints that can be utilized in structural refinement is not straightforward. All the previous computational simulation studies22-26 and modeling methods28,30 use the ESR/DEER distance histogram in post-analysis, to assess the correctness of models that were generated independently from the experimental data. In other words, none of the existing methods “drive” the structural model toward a 3D conformation that satisfies the ESR/DEER data. The standard approach to incorporate experimental knowledge of any average property, 〈q(x)〉=Q, consists in simulating the system in the presence of an artificial energy restraint, k(q(x)-Q)2, that biases the configuration x of the system toward the desired outcome.31 For example, this is how NOE (nuclear Overhauser effect) distances are imposed in NMR refinement. This approach, however, is not applicable in the case of ESR/DEER data; the measurements report distance histograms and trying to ascribe a single average distance between any pair of atoms (e.g., backbone Cα) is not a valid approximation. A more sophisticated strategy is based on the notion of “restrained-ensemble” (RE) molecular dynamics (MD) simulations introduced by Vendruscolo and co-workers,32-36 and exploited by Im and co-workers to determine the structure of membrane-bound peptides on the basis of solid state NMR observables.37-40 Such restrained-ensemble MD simulation scheme consists in carrying out parallel MD simulations of N replicas of the basic system in the presence of a biasing potential that enforces the ensemble-average of a given property toward its known experimental value. Recently, the formal equivalence between the results from restrained-ensemble MD simulation scheme and the maximum entropy method for biasing thermodynamics ensembles41 was established.42 This analysis led to the development of a novel method to incorporate ESR/DEER distance histogram data into multiple-copy restrained-ensemble MD simulations.43 In the RE simulations, distributions of the spin-spin distances are constrained by the spin pair distance histograms obtained from ESR/DEER experiments. While the methodology based on RE simulations is designed to enforce a given set of real-space distance histograms for the purpose of structural refinement, in practice, such distance histograms are obtained by post-processing the measured time-dependent ESR/DEER signal.29 The construction of distance histograms from DEER measurements is a challenging inverse problem that necessitates the utilization of specialized regulation procedures. While our main focus is to enable structural refinement based on experimental data, it is important to keep in mind that the distance histograms can be sensitive to the treatment of the inverse problem.

In the present study, detailed RE simulations based on ESR/DEER data for 51 spin-label pairs inserted at 37 positions in T4L are used to provide a rich source of detailed information about the rotameric states of the MTSSL spin-labels. Using this unprecedented new information from RE simulations and ESR/DEER data, a simplified nitroxide dummy spin-label is then parameterized for the purpose of structural refinement. Finally, the usefulness of the method is illustrated by showing that RE simulations with the dummy nitroxide spin-labels imposing the ESR/DEER experimental distance distribution data are able to systematically correct a series of distorted T4L structures.

METHODS

a) ESR experiments

Nitroxide spin-labels were inserted at 37 different positions in T4 lysozyme (T4L), for a total of 51 different pairs for DEER experiments. The position of the spin-labels in the T4L structure and their relationship to secondary structure elements is summarized in Table 1. All double mutants of T4L used in this study are shown in Supplemental Table S1. Cysteine mutants generated for this study were introduced into the cysteine-free T4L DNA by site-directed mutagenesis.3 Protein expression, purification, and labeling with MTSSL were carried out as previously described.1,3,6 DEER measurements were preformed on a Bruker 580 pulsed ESR spectrometer operating at either X- (9.6 GHz) or Q- (33.4 GHz) band using a standard four pulse protocol.44 Data was collected with the samples at 83 K with 23% (v/v) glycerol as cryoprotectant. Analysis of the DEER data to determine the distance distributions was carried in DeerAnalysis2010 or DeerAnalysis2011.45 The data was fit with Tikhonov regularization and L-curve determination of the optimal regularization parameter.46 The distance distributions were subsequently converted to 1 Å bins and normalized for use in the RE simulations using a script implemented in MATLAB.

Table 1.

Spin-label positions in T4 Lysozyme, number of spin-spin distance distributions and spin-label rotameric states obtained from RE simulation.

Secondary structures Number of spin-labels Number of ESR/DEER histograms Labeled residues χ12rotamer
χ3 =+90°(p) χ3 =-90° (m)
Turn (59) 1 1 59 tp -
H1 (60-80) 9 25 60 - mm
61 - mm
62 mm mm
64 mt -
65* - mm
72* mm, mt -
75 mt -
76* - mm, tm
79 mt -
H2 (82-90) 6 8 82* - mm
83 - mm
85 - mm
86 mt -
89 - mm
90 - tt
H3 (93-106) 2 6 93 - mm
94 mt -
H4 (108-113) 3 6 108 - mt, mm
109 - mm
112 mt -
H5 (115-123) 5 13 115* mt -
116 mt -
119* mt -
122 mt -
123 pp1 pt1, pp1
H6 (126-134) 5 15 127 - mm
128 mt -
131* - mm
132 mt mm
134 mt -
Turn (135) 1 2 135 - mm
H7 (137-141) 1 1 140 - mm
H8 (143-155) 3 13 151* mt mm
154 mt -
155 - mm
Turn (159) 1 1 159 mt -
1

these rotameric states were populated by less than 30%

*

A crystallographic X-ray structure is available for a spin-label at this position (Table 5)

b) Computational details

Molecular dynamic simulations of spin-labeled T4L were carried out with the CHARMM47 and NAMD48 programs, using the all-atom CHARMM27 protein force field49 with the CMAP corrections.50 The force field parameters for the nitroxide spin-label developed by Sezer et al.19, which were found to provide very accurate multi-frequency ESR spectrum in previous studies,19-21 were employed for the description of the nitroxide spin-labels. The crystal structure of native T4L (2LZM)51 was used to construct the initial geometry of the T4L system with its spin-labels. Since 37 sites in total were experimentally labelled, it is not possible to introduce all of the nitroxide spin-labels simultaneously into a single protein without extensive steric clashes. To avoid this problem, four copies of the protein (A, B, C, D) were included in the simulation cell (Figure 2) and the 37 spin-labels were distributed among the four copies to ensure that they do not interact with each other. Each copy comprises 9, 9, 9 and 10 spin-labels, respectively. The labeled residues are: [59, 64, 76, 83, 89, 108, 128, 135, 140, 154] in A, [60, 65, 85, 90, 115, 123, 127, 131, 151] in B, [62, 72, 79, 86, 93, 112, 119, 132, 155] in C, and [61, 75, 82, 94, 109, 116, 122, 134, 159] in D. All spin-labels are separated by at least a distance of 10-15Å. To ensure that the four proteins do not interact with each other, the center of mass of each adjacent T4L were separated by 80Å. The whole system thus constructed has a total of 11349 atoms, and is solvated by 41547 TIP3P water molecules within a 160 × 60 × 130 Å rectangular box. The salt concentration was maintained at 0.15 mM/mol by adding 44 potassium and 117 chloride ions. For all the simulations, a weak positional harmonic restraint with a force constant of 1 (kcal/mol)/Å2 was applied to the protein backbone to avoid any large displacement. Three different simulation methodologies were utilized in this study: conventional molecular dynamics (MD), locally enhanced sampling (LES)52-54 and restrained-ensemble (RE) simulations.43

Figure 2.

Figure 2

The simulation set up consisting of 37 spin-labeled side chains distributed in four T4L systems. The distance between the center of mass of each T4L are 80Å from each other; thereby, they are far enough to cause any interaction. Similarly, the spin-label side chains were also kept at least four residues apart with a distance of about 10-15Å from each other; therefore, the interaction between the spin-labels are also minimum. The copies of the spin-labels were generated with CHARMM using the command REPLICA.

Conventional MD Simulation Protocol

Prior to LES and RE simulations, the simulation box was minimized, equilibrated and a 140 ns MD simulation was performed with NAMD48. Both equilibration and production simulations were performed under NPT conditions where the temperature was kept at 300 K and the pressure at 1 atm to remain consistent with experimental conditions. The simulations were performed with Langevin55 thermostat to control the temperature of the simulation box. A collision frequency, γ, of 5.0 ps-1 was used for the Langevin thermostat. A dielectric constant of 1.0 was used during the simulation. Bonds involving hydrogen atoms in water were constrained to their equilibrium values using the SHAKE56 algorithm. Periodic boundary conditions (PBC) were imposed and a cutoff of 10Å was used for nonbonded interactions with a switch distance of 8Å and pairlist distance of 12Å. Long-range electrostatic behaviour was controlled with the particle mesh Ewald (PME) method.57,58 First, the conjugate gradient energy minimization was performed for 1000 steps to eliminate initial bad contacts between various atoms. This was then followed by a short equilibration run of 5ns. Then the production MD simulation was performed for 140ns from which the ensemble averaged properties of the spin-labels were calculated. Even with such a long trajectory, it is possible that these ensemble average properties are weighted to the starting rotamers due to the lower probability of the interconversion of the disulfide bond.5

Locally Enhanced Sampling Simulation Protocol

The Locally Enhanced Sampling (LES) method,52-54,59 aims at increasing the sampling efficacy of conformational space of selected groups of atoms relative to conventional MD simulations. LES consists in introducing N non-interacting copies (replicas) of selected groups of atoms, which then interact with the rest of the system with forces that are scaled down by a factor of 1/N. According to an approximate mean-field argument, the statistical distribution of the configurations resulting from LES is expected to be representative of the single-copy conventional MD. To try improving the sampling efficacy of the nitroxide spin-labels conformations using the LES method, all 37 labeled residues were replicated 25 times including their backbone atoms. The multiple copies were generated using the psfgen module of vmd (Visual Molecular Dynamics)60 visualization program. The temperature of the replicated atoms was scaled by a factor of 1/N and the masses of the enhanced atoms were also reduced by the same factor to restore the normal velocity distribution. The temperature of the rest of the molecule was kept at 300 K. The last frame from the 140 ns MD simulation was used to start the LES simulation; therefore the system is already well equilibrated. The simulation was performed with NAMD48 keeping the backbone atoms of the T4L restrained to their X-ray structure by applying positional restraints with a harmonic force constant of 1 (kcal/mol)/Å2. The production LES simulation was performed for 7ns after an initial 1000 steps of minimization and 1ns of equilibration. All the other simulation parameters were kept the same to those used in the conventional MD simulation.

Restrained Ensemble Simulation Protocol

The restrained ensemble (RE) method was recently developed to exploit the information from experimental distance histograms obtained from ESR/DEER experiments.43 The RE ensemble as implemented here builds on the framework provided by the multiple-copy LES simulation method. As in LES, an ensemble of N non-interacting replicas was created for each spin-label side chain of the basis system (Figure 2). The multiple replicas yield a total of N2 distances for each pair of spin-labels. Based on these distances, an energy restraint was implemented in CHARMM47 so that at every step of the simulation the histogram obtained from the N2 spin-spin distances would be forced to match the experimental distance histogram obtained from the DEER experiment. The possibility of restraining the simulations from the time-dependent DEER signal rather than the post-processed distance histograms was discussed previously.43 To avoid discontinuities between the bins, a smooth Gaussian of width σ is used in the construction of the histograms. Let ij(n) be the ensemble-averaged “instantaneous” histogram produced by the multiple replica and Hij(n) be the target experimental histogram. We introduce the energy restraint on all the spin-label i-j pairs, in the system,

URE=12K{pairij}{binn}(hij(n)Hij(n))2 (1)

where K is a large harmonic force constant used to enforce the restraint. The ensemble-average histogram ij(n) is calculated as,

hij(n)=1N2s=1Ns=1N12πσ2exp[(nΔrrisrjs)2σ2] (2)

where risrjs is the distance between the spin-label i and spin-label j. Both the experimental and the calculated histograms are recorded with the same bin size Δr of 1 Å, and are normalized to 1 (the distributions ij(n) and Hij(n) have dimension of inverse length, and the units of the force constant K is (kcal/mol)/Å2. For details about the implementation of the RE method, see Ref. 43.

All the spin-labels were replicated 25 times using the command REPLICA in CHARMM. Interactions between the replicas and the rest of the system were scaled by 1/25 using the BLOCK command; interactions within each replica were not scaled. A harmonic force constant K of 10000 (kcal/mol)/Å2 was used to impose the energy restraint to match the calculated distributions ij(n) with those of the experiment Hij(n), and the natural spread σ of the Gaussian was set to 1.7Å. A total of 51 spin-spin distance histograms obtained from ESR/DEER experiments are used simultaneously in the RE simulations of the system. Large displacements of the protein backbone atoms were prevented by applying positional restraints with a harmonic force constant of 1 (kcal/mol)/Å2 relative to the X-ray structure. At each time-step, the distance histogram between any selected pair of spin-labels was calculated from Cartesian coordinates of the four protein translated to a unique reference frame. For example, there is an experimental DEER histogram for spin-labels at positions 62 and 109 even though they are located in protein C and D, respectively. To calculate the instantaneous ensemble-average histogram of the pair 62,109(n), the C and D protein systems are translated to a unique reference frame and then the distance between the spin-label at position 62 and the spin-label at position 109 is calculated.

To reduce the size of the simulated system, only the water molecules within 35Å from the center of mass of each T4L system were kept, for a total of 19479. A spherical half-harmonic containing restraint with a force constant of 0.5 (kcal/mol)/Å2 was used to keep the waters near the proteins. Both equilibration and production simulations were performed under NVT conditions where the temperature for both replica spin-label atoms and normal atoms were kept at 300 K. The rest of the simulation parameters were kept the same to those used in the conventional single-copy MD and LES simulations. To increase the efficiency, five realization of the RE ensemble were simulated simultaneously. They were initiated from the frames at 60, 80, 100, 120 and 140 ns of the single-copy MD simulation after energy minimization and equilibration for 1 ns; 8ns RE simulation was performed for each of the five systems, providing a total of 40ns RE simulation.

Data Analysis Protocol

Spin pair distances and dihedral angles were collected every 10 ps from 140 ns conventional single-copy MD simulation. Thus a total of 14000 data points were obtained for each spin pair distances and dihedral angles. These data were used to calculate the spin pair distance distributions and the rotamer population distributions along five dihedral angles of a particular spin-label. A total of 518000 data points for every dihedral angle were obtained from 37 spin-labels. This enormous amount of data was used to calculate the global rotameric states of the spin-label side chain. For both LES and RE simulations, a total of 625 spin pair distances were obtained from a single simulation step since each spin-label is replicated to 25 copies. The spin pair distances were collected every 10 ps from the last 1ns of the trajectory providing a total of 625000 distances from the simulations, which were then used to calculate the distance distributions. Data for the dihedral angles were collected every 10 ps from the 7ns LES and the 40 ns RE simulations. Since for every spin-label there are 25 copies, a total of 17500 data points were obtained for every spin-label yielding a total of 647500 data points from LES simulation. Following the same procedure, the 40 ns RE simulation provided a total of 987500 data points for every spin-label and a total of 3695375 data points from all spin-labels. These data were used to calculate the dihedral angle distributions. The potential of mean force (PMF) or the change in free energy along the dihedral angles was also calculated by integrating the rotamer population distributions.

RESULTS AND DISCUSSION

As expected, the 51 spin-label distance histograms obtained from the multiple-copy restrained-ensemble (RE) simulation are very similar to those obtained from ESR/DEER data. It is, however, worth emphasizing that satisfying all the distance histograms simultaneously did not generate any unacceptable distortions of the spin-labels, which are attached to protein backbone restrained near the X-ray structure. This is strongly suggestive that the set of 51 distance histograms generated from the post-processing of the time-dependent DEER signal is broadly consistent with the X-ray structure of T4L and the chemically-acceptable range of conformational flexibility of the MTSSL spin-label. In contrast, the distance histograms obtained from conventional MD and the locally enhanced sampling (LES) differ considerably from experiment (see Figure S6 in the supporting information for all 51 distance distributions). It is possible to rapidly quantify differences in the 51 distance distributions obtain from MD, LES and RE simulation from the total mean-square deviation relative to the DEER data,

χ2={pairij}{binn}(hij(n)Hij(n))2 (3)

As shown in Figure 3, the values of χ2 for most of the spin-spin distance histograms are large for MD and LES, while they are very small for RE. In most cases, χ2 from LES is slightly larger than from MD, which is perhaps explained by the approximate nature of the multiple-copy mean-field argument underlying the method. Because the protein backbone was restrained to remain near the X-ray structure, these differences can be directly traced back to the conformations of the flexible spin-labels themselves. Clearly, the unbiased sampling from MD and LES produces spin-label conformations that are in fairly substantial disagreement with experimental data. The discrepancy may, in part, reflect inaccuracies in the molecular mechanical force field used in the simulations. However, it is also important to recall that the samples for ESR/DEER experiments are fast-frozen, and the conformational probability of the flexible spin-labels would reflect the proper Boltzmann distribution at room temperature only if the freezing process was infinitely fast (quenching). On the other hand, if the freezing is slow on the timescale of some molecular motions, the conformational distribution will partly drift toward lower temperature population distributions during the process (annealing). Ultimately, it is likely that some slow degrees of freedom are truly quenched while fast degrees of freedom such as side-chain rotamers are partly annealed to local energy conformations.61 The implication is that the information from the distance histograms imposed via the RE simulations significantly restricts the conformation of the spin-labels, which provides a unique opportunity to determine the conformational propensity of MTSSL nitroxide spin-labels in multiple positions of a soluble protein under the conditions of ESR/DEER experiments.

Figure 3.

Figure 3

Difference in spin-spin distance histograms between experiment and conventional MD (red line), locally enhanced sampling (blue line) and restrained ensemble (green line) simulations.

a) Analysis of the T4L simulations and rotameric distributions of MTSSL

The conformation of the MTSSL nitroxide spin-label is completely characterized by the five dihedral angles, χ1, χ2, χ3, χ4 and χ5 along the flexible side-chain Cα-Cβ-Sγ-Sδ-Cη-Cζ (Figure 1A). The distribution about the χ1, χ2, χ3, χ4 and χ5 of all the spin-labels obtained from MD, LES, and RE simulations is shown in Figure 4. For the sake of simplicity, it will be useful to adopt the following convention in the following discussion. The dihedral angles χ1, χ2, and χ4 can adopt 3-fold conformations, +60° (or gauche+), 180°(or trans), and –60° (or gauche-), which will be denoted by p, t and m, respectively. According to the RE simulations, the m rotamer for χ1 is the most populated, while the rotamers from the conventional MD and LES simulations are more broadly distributed. Similarly, the m rotamer for χ2 is also the most populated according to the RE simulations, with again a broader distribution from the conventional MD and LES simulations. The distributions of the dihedral angle χ3 of the slowly converting Sγ-Sδ disulfide bond are similar for all three simulations. Essentially, χ3 can adopt only two stable positions, around +90° and -90° (=270°), which will be denoted as p and m, respectively. The last dihedral angle, χ5, between the chain and the ring is broadly distributed, with two positions slightly more populated around +90° and -90°, which will be denoted as p and m, respectively. The p and m states were visited for 22 spin-labels in the RE simulation, but only for 11 spin-labels in the conventional MD simulation (Figure S7). Lastly, the rotamer distributions for χ4 and χ5 obtained from all simulation methods are very similar (Figure 4), suggesting that considerable fluctuations around these two dihedrals are consistent with the ESR/DEER data.

Figure 4.

Figure 4

Rotamer population distribution of the dihedral angle χ1, χ2, χ3, χ4 and χ5 of the MTSSL spin-label obtained from the conventional MD (red line), locally enhanced sampling (blue line) and restraint ensemble (green line) simulations. All the data from all the spin-labels attached to T4L were combined together to produce the distributions.

The joint 2D distributions along χ1 and χ2 for prescribed values of χ3 obtained from the MD, LES and RE simulations are shown in Figure 5. The populations of the χ12 rotameric states are given in Table 2. According to all simulation methods, it is clear that the χ12 rotamer distribution is strongly affected by the χ3 dihedral angle of the Sγ-Sδ bond. This is consistent with previous studies that have highlighted the correlations between the various dihedrals.19,28 Only two predominant χ123 joint rotameric states are observed with the RE simulation, mtp (33%) and mmm (32%), whereas several are observed in the LES and MD simulations. This shows that sampling of the conformational space while restraining the spin-label pair via the ESR/DEER distance histograms leads to a considerable reduction in the observed rotameric states. The 2D PMF calculated from the distributions reveal that the free energy barrier between the mt and mm states is only about 1-2 kcal/mol (Figure S8) suggesting the possibility of rapid inter-conversion rates for these dihedral angles. The 2D rotamer population distribution along χ4 and χ5 shows that they are only weakly correlated. Regardless of the simulation methods, the distribution along χ4 and χ5 are very similar and fairly independent of the value of χ3 (Figure S9). The populations of the χ45 rotameric states are given in Table 3. The results are consistent with the notion that the χ4 and χ5 dihedral are very mobile, an underlying hypothesis of the so-called χ45 model 4,7, which assumes that the internal motion of the spin-label is largely limited to rotations about the last two dihedrals. Nonetheless, as observed previously21, the χ4 and χ5 dihedral display some intrinsic rotameric preferences. Overall, the LES method appears to produce distributions that cover somewhat less rotameric space than the conventional MD simulation. This might indicate that there are differences in the amount of sampling. Alternatively, it is possible that the results are adversely affected by the mean-field approximation that underlies the LES method.52,59 Nevertheless, the differences are not sufficiently large to cause concerns, which is reassuring given that the same mean-field approximation is also used in the multiple-copy RE ensemble implementation based on ESR/DEER distance histograms.43

Figure 5.

Figure 5

Rotamer population distribution along χ1 and χ2 of spin-label attached to T4 Lysozyme at χ3 ≈ 90° (left panel) and 270° (right panel) obtained from conventional MD (upper panel), locally enhanced sampling (middle panel) and restraint ensemble ((lower panel) simulations. The units of the angles χ1 and χ2 are in degrees. Population densities are indicated with contour lines with densities increasing from blue to red color.

Table 2.

Rotamer population (%) along χ1 and χ2 of the spin-label attached to T4 Lysozyme at χ3 ≈ 90° and 270° obtained from conventional MD (MD), locally enhanced sampling (LES) and restraint ensemble (RE) simulations.

χ 3 χ12 MD LES RE χ 3 χ12 MD LES RE
pp 1.10 0.00 1.51 pp 0.03 0.00 0.66
tp 13.33 10.00 4.17 tp 5.04 12.95 3.89
mp 1.20 0.09 0.90 mp 2.58 2.66 0.16
pt 0.72 0.00 1.18 pt 1.25 0.02 4.23
p (+90°) tt 11.31 8.35 1.10 m (270°) tt 7.71 5.80 4.16
mt 12.20 22.09 33.00 mt 8.44 7.20 6.13
pm 0.0 0.00 0.01 pm 0.02 0.00 0.34
tm 0.12 0.04 0.19 tm 6.74 5.27 1.10
mm 9.25 10.89 5.63 mm 18.96 14.66 31.64
Total 49.23 51.46 47.48 Total 50.77 48.56 52.31

Table 3.

Rotamer population (%) along χ4 and χ5 of the spin-label attached to T4 Lysozyme at χ3 ≈ 90° and 270° obtained from conventional MD (MD), locally enhanced sampling (LES) and restraint ensemble (RE) simulations.

χ 3 χ45 MD LES RE χ 3 χ45 MD LES RE
pp 1.62 1.11 4.13 pp 1.33 0.27 5.59
tp 16.44 15.65 12.65 tp 15.26 8.92 14.63
mp 8.06 10.09 6.21 mp 5.89 2.46 5.31
pt 0.62 0.47 0.07 pt 0.09 0.07 0.11
p (+90°) tt 0.61 0.68 0.25 m (270°) tt 0.72 0.36 0.30
mt 0.20 0.18 0.09 mt 0.33 0.11 0.10
pm 6.10 8.26 3.79 pm 7.71 5.45 8.20
tm 12.71 9.71 14.16 tm 17.78 30.20 13.48
mm 2.85 5.30 6.31 mm 1.68 0.70 4.60
Total 49.21 51.45 47.67 Total 50.79 48.54 52.33

It is of interest to examine how the rotamer distribution at specific positions departs from the average trend. The results are summarized in Table 4. Due to its particular importance, it is helpful to first focus on the dihedral around the Sγ-Sδ bond, χ3. Isomerization of the Sγ-Sδ bond is opposed by a large energy barrier5 and is a relatively slow process on the timescale of ESR.21 Furthermore, the m:p equilibrium ratio was previously shown to be a critical determinant of the line shape of CW-ESR spectra.21 As observed in Table 4, χ3 shows a preference for either +90° (p) or -90° (m) in most cases. Spin-labels at positions 59, 64, 72, 75, 79, 86, 112, 115, 116, 122, 128, 134, 154 and 159 mainly prefer +90° (p) with population more than 70%, while spin-labels at positions 60, 61, 65, 76, 83, 85, 89, 90, 93, 108, 109, 123, 131, 135, 140 and 155 mainly prefer-90° (m).Only 7 spin-labels (positions62, 82, 94, 119, 127, 132 and 151) show significant occupancy of both rotamers. When χ3 is +90° (p), the preferred χ12 rotameric state is typically mt(Table 1 and Figure 6). This is observed at 16 positions along the α-helices, a number of them being near the C-terminus (79, 94, 112, 122, 132, 134, 154). When χ3 is -90° (m), the preferred χ12 rotameric state is typically mm. This is observed at 18 positions located along α-helices, a number of them being near the N-terminus (60, 61, 82, 83, 93, 108, 109, 127, 151). Few spin-labels (72, 108, 132, 151) also display both the mm and mt rotamers. The marked preference of χ12 for the mm and mt rotamers is consistent with the energy surface of the spin-label,19 suggesting that the surrounding residues do not lead to large effects. Returning to Table 1, it is clear that spin-labels inserted within α-helics largely prefer the χ123 rotameric states mmm or mtp on average. The spin-labels that display a wider variability are either near the ends of an helix, in a loop, or buried. For example, spin-labels at positions 59, 90, and 123 are found to be in the unusual χ123 rotameric state tpp ttm, and ppp, respectively. Position 59 is located in a loop while position 90 is buried within the solvent inaccessible protein core, and position 123 is the last residue of helix H5. Finally, regarding the χ45 rotameric state, the majority are either tp or tm. Only two positions are in the rotameric states mm(position 109) or tp (position 115) with a population greater than 50%. The predominance of these tp and tm rotamers is consistent with the initial parameterization of the spin-label.19 This suggests that these rotamers are predominant unless there are steric clashes with neighboring residues.

Table 4.

Total rotamer population (%) of χ1 and χ2 at χ3 ≈ 90° and 270° of the spin-label attached to the T4 Lysozyme obtained from restraint ensemble (RE) simulations.

Residue p (+90°) m (270°) Residue p (+90°) m (270°) Residue p (+90°) m (270°)
59 92.78 7.22 86 89.23 10.77 127 39.42 60.58
60 17.86 82.14 89 1.20 98.80 128 82.71 17.29
61 4.74 95.26 90 8.65 91.35 131 7.18 92.82
62 41.57 58.43 93 4.22 95.78 132 59.82 40.18
64 87.14 12.86 94 68.26 31.74 134 84.17 15.83
65 18.79 81.21 108 6.75 93.25 135 5.73 94.27
72 81.82 18.18 109 7.48 92.52 140 25.25 74.75
75 99.63 0.37 112 98.30 1.70 151 49.47 50.53
76 7.16 92.84 115 100.00 0.00 154 73.75 26.25
79 84.13 15.87 116 96.52 3.48 155 6.53 93.47
82 41.28 58.72 119 63.04 36.96 159 93.39 6.61
83 4.47 95.53 122 78.40 21.6
85 2.98 97.02 123 29.98 70.02

Figure 6.

Figure 6

Population of χ12 rotamers when χ3 is in the p state around +90° (left panel) or when χ3 is in the m state around -90° (right panel) for the 37 individual spin-labels attached to T4L obtained from RE simulations.

b) Comparison with available rotamer data from X-ray crystallography

Comparison with the available information from X-ray crystallographic structures of MTSSL spin-labels inserted into T4L9-11 offers a powerful route to examine the validity of the rotamers populations extracted from the RE simulations. However, several X-ray structures, the electron density along the flexible side-chain Cα-Cβ-Sγ-Sδ-Cη-Cζ is missing beyond the Sδ atom, leaving the position and the orientation of the remaining chain and nitroxide ring undefined. In the small number of X-ray structures in which the Cη is resolved, it is observed that the dihedral angle χ3 of the Sγ-Sδ bond adopts either values of +90° or -90°. Transitions between these two well-identified states are opposed by a large energy barrier and occur infrequently according to computational analysis.21,49 For this reason, comparison with X-ray data is mainly limited to χ12, corresponding to the region of the side-chain nearest to the backbone Cα-Cβ-Sγ-Sδ. The result of a comparison for spin-labels at nine positions (65, 72, 75, 76, 82, 115, 119, 131, 151) is given in Table 5.

Table 5.

Comparison of χ1 and χ2 rotamers of the spin-labels obtained from X-ray structures and RE simulations.

Residue PDB ID Exptl. Rotamer* RE simulations*
65 3K2R tp a,b,c mm (83%)
72 (mm, tp)d mm (64%), mt (32%)
75 - mt a mt (93%)
76 3K2R tm c tm; mm (30% each)
82 1ZYT mm a,e mm, mt (30% each)
115 2IGC, 2OU8 mm a,f mt (84%), mm (15%)
2OU8 tp a,f
119 3L2X mm f mm, mt (35% each)
131 2CUU(1), 3G3V(1) mm a,e mm, mt (30% each)
(mm, tp, mt, tt)d tt (25%), tp (10%)
2CUU(2), 3G3V(2) tp d,g
151 3G3X mm e mt (45%), mm (40%)
a

Fleissner et al.13

b

Langen et al.15

c

Toledo et al.14

d

Sezer et at.21

e

Fleissner et al.11,12

f

Guo et al.10

g

Guo et al.9

The χ12 rotamers extracted from the RE simulations are broadly consistent with the information from the X-ray crystal structures for seven positions: a mm rotamer is observed at 72, 82, 115, 119, 131, and 151, a mt rotamer is observed at 75, and a tm rotamer is observed at 76. Analysis of the RE simulations suggests that the free energy difference between mm and mt is small. In the case of position 131, analysis of multifrequency ESR spectra using MD simulations concluded the rotamers mm, mt, tp, and tt were predominant.21 All four rotamers are observed in the RE simulations, and the first one is observed in the X-ray structure. In the case of position 72, there is no X-ray structure, but there is indirect information from analysis of multifrequency ESR spectra using MD simulations indicating that the mm and tp rotamers are the most stable,21 which is consistent with the results from RE simulations. The clearest disagreement concerns spin-label at position 65, as the rotamer is tp in the X-ray structure, whereas it is mm in the RE simulations. However, this position is problematic because the spin-label is involved in crystal contacts.15

For the dihedral angle χ1, the most frequently observed rotamer in X-ray structures is m, although the t rotamer is also observed at a few positions. The m rotamer is also predicted to be the most stable energetically according to computational studies.18,19 It has been argued that the m and t rotameric states of χ1 might be stabilized by an interaction between the sulfur Sδ atom of the spin-label with the backbone nitrogen N and oxygen O atoms,18 though further analysis indicates that such interactions is very weak.19 For the dihedral angle χ2, the predominant rotamer observed in X-ray structures is m, which is also frequently observed in the RE simulations but absent from the conventional MD or the LES simulations. Thus, there is closer agreement between the conformations from X-ray structures and those extracted from ESR/DEER experiments using RE simulations than with those obtained from unbiased MD or LES simulations. It is likely that these differences reflect the fact that the rate at which the experimental samples are frozen is slow on the molecular timescale, thereby leading to a partial annealing of some side-chains rotameric conformations to local minimum energy conformations. It is noteworthy that the nine sites (65, 72, 75, 76, 82, 115, 119, 131, 151), which are in the helix surface sites in T4L, either prefer the mm or mt rotamer for χ12.

c) Simplified representation of the spin-label for structural refinement

An important outcome of the present analysis of RE simulations is a detailed characterization of the rotamer population for MTSSL spin-labels bound to T4L based on ESR/DEER data. This information is critically important for the interpretation of ESR experiments, particularly given the general paucity of data about the conformation of spin-label. Nevertheless, these results were obtained by applying an extensive computational framework with RE simulations that may seem somewhat cumbersome if one's sole purpose is to utilize ESR/DEER data for structural refinement. This framework is not without some drawbacks. For instance, because the replicas of the spin-labels are represented with all atomic details, fairly long RE simulations may be required to allow for the conformational transitions of the five dihedrals χ1 to χ5 in order to adequately sample all the accessible rotameric states. Furthermore, introducing multiple copies of the protein was necessary in the present application to avoid steric clashes between neighboring spin-labels.

Although all these technical difficulties can be surmounted, it is tempting to try to find ways to further simplify the computational framework without compromising accuracy. A key observation guiding our efforts to design a simplified method is that structural refinement based on ESR/DEER data with RE simulations ought to be concerned only with translating the information from histograms of inter-atomic distances between the multiple copies of the oxygen of the nitroxide spin-labels into backbone constraints. In the context of structural refinement, the detailed population of accessible rotameric states represents an excess of information that is used only indirectly. In fact, the detailed molecular representation of the spin-labels is needed because it serves to accurately position the oxygen atom relative to the protein backbone. Therefore, one potential route towards a simplified framework is to take a shortcut and discard the rest of the spin-label and retain only the nitroxide oxygen as a dummy ON particle linked to the backbone. Such a simplified representation avoids the burdensome task of accounting for a large ensemble of Boltzmann-weighted spin-label rotamers. Yet, it is perfectly adequate for the purpose of translating the information from ESR/DEER data into the protein structure—as long as one has the ability to determine the position of the ON atom accurately relative to the backbone without modeling the molecular structure of the spin-label itself. The position of this dummy ON particle relative to the N, Cα, and Cβ of the backbone can be parameterized via a few simple energy terms (bond, angle dihedral) to best-reproduce the statistical distribution of the spin-label side chains deduced from the all-atom RE simulations analysis (Figure 7A). As shown in Figure 7B, the configurations extracted from the RE simulations reveal that the oxygen atom of the nitroxide spin-label is distributed within a half-sphere around the Cα atom with respect to the N, Cα, and Cβ atoms of the labeled residue. Interestingly, the distribution is almost uniform and does not seem to be strongly correlated with any given rotameric states.

Figure 7.

Figure 7

(A) Cartoon representation of the spin-label side chain, R1, is replaced with a single dummy nitroxide atom, which is attached to the Cα atom of the protein backbone. Three valuables, Cα – ON distance, Cβ – Cα – ON angle and ON-Cβ – Cα – N torsion are used to parameterize the dummy atom; (B) the dynamics of nitroxide oxygen of spin-labels around the Cα atom obtain from RE simulation; (C) Comparison of distribution and potential of mean force of the distance, angle, dihedral angle and nonbond interactions obtained from MD simulations with R1 and dummy nitroxide atoms.

The parameterization of the dummy nitroxide atom was carried out by determining the force constant for the Cα-ON bond (r), the Cβ–Cα–ON angle (θ) and the N–Cα–Cβ–ON dihedral angel (ϕ) (Figure 7A). The spatial distribution of the nitroxide oxygen atom is mapped in terms of the Cα-ON bond (r), the Cβ–Cα–ON angle (θ) and the N–Cα–Cβ–ON dihedral angel (ϕ), under the assumption that the values of three variables are uncorrelated (Figure S12). The distributions over r, θ and ϕ extracted from the RE simulation are shown in Figure 7C (red lines). The probability of finding the ON is maximum at a distance of about 8.0 Å from the Cα and the width of the distribution is about 5 Å. The distribution for θ is quite broad going from 0° to 110° and there appears to be two shallow humps at about 25° and 85°. There is one predominant rotameric state for the dihedral angle ϕ positioned around 180-280°, peaking around 240°. The probability distribution functions are then modeled on the basis of 3 simple energy terms, Vr = kr(rr0)2, Vθ = kθ(θθ0)2 and Vϕ = kϕ (1+cos(ϕ0)), where kr, kθ, and kϕ are the force constants for the distance, the angle and dihedral potentials and n represents the dihedral multiplicity. In addition, a Lennard-Jones 6-12 potential was used to account for the excluded-volume interactions between the ON particle and the rest of the protein. The radial distribution (R) of all nonbond interactions involving ON atom of the spin-label was calculated in a shell of 5Å radius and the distribution was converted to the corresponding Boltzmann energy. The result, shown in Figure 7C, indicates that the probability of finding an ON atom is almost zero between 0Å and 2.5Å from any protein atom, which is also evident from the large energy at this region, while the change in energy between 4Å and 5Å is very small (<0.5 kcal/mol). Optimized Lennard-Jones parameters are Rmin = 4Å and Emin = -0.05 kcal/mol. Non-bonded interactions between the ON dummy atoms and the water molecules are switched off by using the keyword NBFIX in the parameter file, which can be read by both the NAMD48 and CHARMM47 program package. The optimal force constants for kr, kθ, kϕ are 0.5 (kcal/mol)/Å2, 1.0 (kcal/mol)/rad2 and 1.9 (kcal/mol)/rad2, respectively. A multiplicity of n=1 is used for the N–Cα–Cβ–ON dihedral angle.

Having parameterized the ON dummy atom representation, we performed 5 ns MD simulation of T4L labeled with ON dummy atom at 37 sites. The calculated ON-ON distance distributions are slightly broader than the experimental ESR/DEER histograms, although the results are somewhat improved compared with the conventional MD simulations carried out with the detailed spin-label models with explicit solvent (Figures S6 and S13). The average distances obtained from MD simulation and experimental spin-spin distance distributions were compared using a linear regression analysis. The correlation between averaged measured and simulated distances is 0.91. These results were compared with those obtained from the Multiscale Modeling of Macromolecular systems (MMM) software package developed by Yevhen Polyhach and Gunnar Jeschke to analyze ESR/DEER data 28. With MMM, the modeling is carried out on the basis of a pre-calculated library of ~200 rotamers which are then inserted at individual sites on a fixed T4L protein backbone. Configurations with clashes are discarded and a Boltzmann-weighted probability distribution is produced for each inserted spin-label, and finally an average pair distance is calculated for each pair of spin-labels. The correlation coefficient of the average distances obtained from MMM with ESR/DEER data is 0.85, which is slightly smaller to the correlation coefficient of 0.91 obtained from MD simulations with the simplified dummy ON spin-label described above.

d) Illustrative structural refinement with artificially distorted conformations of T4L

The simplified dummy ON spin-label model, together with the RE simulation method, can be used to refine protein structures. As a first illustrative test, the T4L structure was refined with the RE simulation from its X-ray structure. Dummy ON atoms were attached to the 37 sites of the spin-labels in T4L and were replicated 25 times for the RE simulations. A 2ns RE simulation was performed starting from the X-ray crystal structure equilibrated from MD (in vacuum and with explicit solvent). The root mean square deviation (rmsd) over the trajectory calculated with respect to the original X-ray structure shows that the rmsd flattens out after 400ps and remains almost constant to a value of about 1.4Å (see Figure 8A). The inter-spin distance distributions are very similar to those obtained from experiment and with those obtained from RE simulation performed with the whole spin-labels (Figure S18). As a second test, the ability of the method to correct artificially distorted structures of T4L was examined. Figure 8 shows some of the distorted structures that were refined with the RE simulations. A total of 8 distorted structures were created by altering the dihedral angles in the turns or loops that are connecting pairs of helices, starting from the C-terminal helix of T4L (Figure 8). Some of the distorted models have a backbone atom rmsd of more than 15Å relative to the X-ray structure. In the refinement, an initial stage of MD simulation was first carried out using a flat-bottom harmonic distance restraints as commonly used in NMR refinement based on NOE relaxation data. The 51 average distances obtained from ESR/DEER data were used. After several ns of simulation, the rmsd of the backbone atoms of all distorted T4L with respect to the X-ray structure flattens out. The distorted structures of T4L improved for all the structures. However, the rmsd for seven distorted structures remained large, in the range of 5-17Å, from the X-ray structure. Only one distorted model reached a value of 1.7Å rmsd (Figure 8). To see if the models could be further improved, RE simulations were then performed starting from the final configuration obtained from the previous simulations. The rmsd of the backbone atoms of most of the models improved considerably, demonstrating that the RE method with the simplified dummy ON spin-label is able to further refine these structures (Figure 8). However, the backbone rmsd for some distorted models remained larger than 6Å (Figures S19 and S20). This mainly shows the limitation of a sampling algorithm that is based on straight MD simulations, even in the presence of the RE biasing. There are, however, alternative and more powerful computational strategies that have the ability to refine such highly distorted protein structures, and improve the overall rmsd down to ~4Å.27,62 Therefore, as a final illustrative test, we sought to examine the ability of the RE method to refine 10 structures that are only moderately distorted to ~4Å rmsd (Figure 8B). In comparison, MD simulations were also performed with a NOE-like flat-bottom harmonic distance restraints. The average backbone atom rmsd, obtained from the last frame of the RE simulations, were found to be about 1.8Å with respect to the X-ray structure. In contrast, many of the models refined with the NOE-like distance restraint remained at 3.5Å rmsd (3 models refined to 2.0-3.5 Å rmsd). These results demonstrate the usefulness of RE simulation method with the simplified representation of the spin-label as a dummy ON particle for structural refinement.

Figure 8.

Figure 8

Cartoon representation of X-ray and distorted structures of T4L and the respective root mean square deviation of the distorted backbone atoms with respect to the X-ray structure obtained from NOE and RE simulations colored in black and blue, respectively. A. The x-ray structure is denoted by T4L and the distorted structures are denoted by 1, 2 and 3 depending on the number of distorted helices. The distorted and undistorted residues of the T4L structures are colored in yellow and purple, respective. B. Ten distorted T4L structures, with a backbone atom rmsd of about 4Å from the X-ray crystal structure, are refined using the NOE and RE simulations colored in black and blue, respectively. Each distorted structure is represented with a particular color scheme. Three black traces from NOE (black) going down to rmsd of 3.1Å, 2.8Å and 2.2Å, respectively, are not shown for the sake of clarity.

CONCLUSION

An extensive characterization of the rotamer population of the spin-labels attached to T4 lysozyme (T4L) has been carried out using a novel multiple-copy restrained-ensemble (RE) simulation method to incorporate ESR/DEER distance histogram data. In RE simulations, a global energy restraint forces the simulation spin pair distances collected from multiple copies of the system to match with the experimental distance histograms obtained from ESR/DEER. Here, 51 distance distributions for 37 nitroxide spin-labeled T4L were constrained to the experimental distributions. In contrast, results obtained from single-copy MD and unrestrained LES simulations deviate significantly from the experiments and therefore the conformational preferences of the spin-labels could be very different with these simulations when compared to experiment. In most cases, the rotamers along χ1 and χ2 at χ3 ≈ 90° and 270° are predominantly mt and mm, respectively. On the other hand, multiple rotameric states are found for χ4 and χ5 at χ3 ≈ 90° and 270°. Overall, these findings are in good agreement with the available information from X-ray crystallographic structures,9-15,21 ESR lineshape analysis6,21,63 and ab initio studies.16,18 Few spin-label side chains prefer other rotameric states or multiple rotamers depending on their positions in T4L. Based on the rotameric preferences, we classified the spin-labels broadly into two groups: (1) sites in the helices will most likely prefer either mm or mt or both mm and mt rotamers and (2) sites on to the solvent-exposed loop structures, or sites completely buried in the solvent inaccessible hydrophobic cores will prefer variable rotameric states.

To develop an efficient strategy aimed at protein structural refinement based on ESR/DEER distance histograms, we designed and parameterized a simplified dummy nitroxide atom “ON” reproducing the statistical distribution of the spin-label side chains deduced from the all-atom RE simulations analysis. The 51 average spin pair distances calculated from these dummy ON spin-label are found in excellent agreement with those obtained from experiment, with a correlation coefficient of 0.91. This result demonstrates that the simplified ON dummy representation is an effective approach to accurately calculate distances between spin-labels at two sites on a protein. Artificially distorted structures of T4L were successfully refined with the final structures deviating from the X-ray structures by an rms deviation of only a few angstroms. The extent of distortion of the initial structure will dictate over the refinement process. Therefore, it is important to obtain an X-ray or NMR derived initial structure that is close to the target structure. In most cases, short several nanosecond RE simulations were sufficient to refine the distorted T4L structures. Current efforts with the RE simulation method are aimed at determining the various conformational states of ion channels and membrane transporters using ESR/DEER data.

Supplementary Material

1_si_001

ACKNOWLEDGMENT

The work was carried out in the context of the Membrane Protein Structural Dynamics Consortium funded by grant U54-GM087519 from the National Institute of Health (NIH). We are grateful to XSEDE for computer time. We also thank Dr. Hanane Koteiche for assistance with generation of the double mutants of T4L.

Footnotes

Supporting Information: T4L double mutants, simulation setup, distance distributions, time dependence for χ3 dihedral angle, rotamer population distribution along χ4 and χ5, 2D-PMFs along χ1χ2 and χ4χ5, correlation of the Cβ–Cα–ON angle and the N–Cα–Cβ–ON dihedral angle, force field parameter and topology for dummy nitroxide atom, correlation of inter label experimental and MD average distances, correlation of inter label experimental and MMM average distances, structures used in refinement. This material is available free of charge via the internet at http://pubs.acs.org.

References

  • 1.Borbat PP, Mchaourab HS, Freed JH. Electron Spin Resonance in Studies of Membranes and Proteins. J. Am. Chem. Soc. 2002;124:5304–5314. doi: 10.1021/ja020040y. [DOI] [PubMed] [Google Scholar]
  • 2.Bhatnagar J, Freed JH, Crane BR. Rigid Body Refinement of Protein Complexes with Long-Range Distance Restraints from Pulsed Dipolar ESR. Method Enzymol. 2007;423:117–133. doi: 10.1016/S0076-6879(07)23004-6. [DOI] [PubMed] [Google Scholar]
  • 3.Kazmier K, Alexander NS, Meiler J, Mchaourab HS. Algorithm for Selection of Optimized EPR Distance Restraints for De Novo Protein Structure Determination. J. Struct. Biol. 2011;173:549–557. doi: 10.1016/j.jsb.2010.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Columbus L, Kalai T, Jeko J, Hideg K, Hubbell WL. Molecular Motion of Spin Labeled Side Chains in Alpha-Helices: Analysis by Variation of Side Chain Structure. Biochemistry. 2001;40:3828–3846. doi: 10.1021/bi002645h. [DOI] [PubMed] [Google Scholar]
  • 5.Jiao D, Barfield M, Combariza JE, Hruby VJ. Ab Initio Molecular Orbital Studies of the Rotational Barriers and the S-33 and C-13 Chemical Shieldings for Dimethyl Disulfide. J. Am. Chem. Soc. 1992;114:3639–3643. [Google Scholar]
  • 6.Mchaourab HS, Lietzow MA, Hideg K, Hubbell WL. Motion of Spin-Labeled Side Chains in T4 Lysozyme, Correlation with Protein Structure and Dynamics. Biochemistry. 1996;35:7692–7704. doi: 10.1021/bi960482k. [DOI] [PubMed] [Google Scholar]
  • 7.Columbus L, Hubbell WL. A New Spin on Protein Dynamics Trends. Biochem. Sci. 2002;27:288–295. doi: 10.1016/s0968-0004(02)02095-9. [DOI] [PubMed] [Google Scholar]
  • 8.Barnes JP, Liang ZC, Mchaourab HS, Freed JH, Hubbell WL. A Multifrequency Electron Spin Resonance Study of T4 Lysozyme Dynamics. Biophys. J. 1999;76:3298–3306. doi: 10.1016/S0006-3495(99)77482-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Guo ZF, Cascio D, Hideg K, Hubbell WL. Structural Determinants of Nitroxide Motion in Spin-Labeled Proteins: Solvent-Exposed Sites in Helix B of T4 Lysozyme. Protein Sci. 2008;17:228–239. doi: 10.1110/ps.073174008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Guo ZF, Cascio D, Hideg K, Kalai T, Hubbell WL. Structural Determinants of Nitroxide Motion in Spin-Labeled Proteins: Tertiary Contact and Solvent-Inaccessible Sites in Helix G of T4 Lysozyme. Protein Sci. 2007;16:1069–1086. doi: 10.1110/ps.062739107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Fleissner MR, Bridges MD, Brooks EK, Cascio D, Kalai T, Hideg K, Hubbell WL. Structure and Dynamics of a Conformationally Constrained Nitroxide Side Chain and Applications in EPR Apectroscopy. Proc. Natl. Acad. Sci. USA. 2011;108:16241–16246. doi: 10.1073/pnas.1111420108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Fleissner MR, Cascio D, Hubbell WL. Structural Origin of Weakly Ordered Nitroxide Motion in Spin-Labeled Proteins. Protein Sci. 2009;18:893–908. doi: 10.1002/pro.96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Fleissner MR. Ph.D. Thesis. University of California; Los Angeles: 2007. [Google Scholar]
  • 14.Toledo Warshaviak D, Cascio D, Khramtsov VV, Hubbell WL. Crystal Structure of Spin Labeled T4 Lysozyme Mutant K65V1/R76V1. PDB Data Bank.
  • 15.Langen R, Oh KJ, Cascio D, Hubbell WL. Crystal Structures of Spin Labeled T4 Lysozyme Mutants: Implications for the Interpretation of EPR Spectra in Terms of Structure. Biochemistry. 2000;39:8396–8406. doi: 10.1021/bi000604f. [DOI] [PubMed] [Google Scholar]
  • 16.Tombolato F, Ferrarini A, Freed JH. Dynamics of the Nitroxide Side Chain in Spin-Labeled Proteins. J. Phys. Chem. B. 2006;110:26248–26259. doi: 10.1021/jp0629487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Tombolato F, Ferrarini A, Freed JH. Modeling the Effects of Structure and Dynamics of the Nitroxide Side Chain on the ESR Spectra of Spin-Labeled Proteins. J. Phys. Chem. B. 2006;110:26260–26271. doi: 10.1021/jp062949z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Warshaviak DT, Serbulea L, Houk KN, Hubbell WL. Conformational Analysis of a Nitroxide Side Chain in an Alpha-Helix with Density Functional Theory. J. Phys. Chem. B. 2011;115:397–405. doi: 10.1021/jp108871m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Sezer D, Freed JH, Roux B. Parametrization, Molecular Dynamics Simulation, and Calculation of Electron Spin Resonance Spectra of a Nitroxide Spin Label on a Polyalanine Alpha-Helix. J. Phys. Chem. B. 2008;112:5755–5767. doi: 10.1021/jp711375x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Sezer D, Freed JH, Roux B. Using Markov Models to Simulate Electron Spin Resonance Spectra from Molecular Dynamics Trajectories. J. Phys. Chem. B. 2008;112:11014–11027. doi: 10.1021/jp801608v. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Sezer D, Freed JH, Roux B. Multifrequency Electron Spin Resonance Spectra of a Spin-Labeled Protein Calculated from Molecular Dynamics Simulations. J. Am. Chem. Soc. 2009;131:2597–2605. doi: 10.1021/ja8073819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Polyhach Y, Godt A, Bauer C, Jeschke G. Spin Pair Geometry Revealed by High-Field DEER in the Presence of Conformational Distributions. J. Magn. Reson. 2007;185:118–129. doi: 10.1016/j.jmr.2006.11.012. [DOI] [PubMed] [Google Scholar]
  • 23.Tikhonova IG, Best RB, Engel S, Gershengorn MC, Hummer G, Costanzi S. Atomistic Insights into Rhodopsin Activation from a Dynamic Model. J. Am. Chem. Soc. 2008;130:10141–10149. doi: 10.1021/ja0765520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Ding F, Layten M, Simmerling C. Solution Structure of HIV-1 Protease Flaps Probed by Comparison of Molecular Dynamics Simulation Ensembles and EPR Experiments. J. Am. Chem. Soc. 2008;130:7184–7185. doi: 10.1021/ja800893d. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Sale K, Song LK, Liu YS, Perozo E, Fajer P. Explicit Treatment of Spin Labels in Modeling of Distance Constraints from Dipolar EPR and DEER. J. Am. Chem. Soc. 2005;127:9334–9335. doi: 10.1021/ja051652w. [DOI] [PubMed] [Google Scholar]
  • 26.Boura E, Rozycki B, Herrick DZ, Chung HS, Vecer J, Eaton WA, Cafiso DS, Hummer G, Hurley JH. Solution Structure of the ESCRT-I Complex by Small-Angle X-ray Scattering, EPR, and FRET Spectroscopy. Proc. Natl. Acad. Sci. USA. 2011;108:9437–9442. doi: 10.1073/pnas.1101763108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Hirst SJ, Alexander N, Mchaourab HS, Meiler J. RosettaEPR: An Integrated Tool for Protein Structure Determination from Sparse EPR Data. J. Struct. Biol. 2011;173:506–514. doi: 10.1016/j.jsb.2010.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Polyhach Y, Bordignon E, Jeschke G. Rotamer Libraries of Spin Labelled Cysteines for Protein Studies. Phys. Chem. Chem. Phys. 2011;13:2356–2366. doi: 10.1039/c0cp01865a. [DOI] [PubMed] [Google Scholar]
  • 29.Jeschke G. DEER Distance Measurements on Proteins. Annu Rev Phys Chem. 2012;63:419–446. doi: 10.1146/annurev-physchem-032511-143716. [DOI] [PubMed] [Google Scholar]
  • 30.Hatmal MM, Li Y, Hegde BG, Hegde PB, Jao CC, Langen R, Haworth IS. Computer Modeling of Nitroxide Spin Labels on Proteins. Biopolymers. 2012;97:35–44. doi: 10.1002/bip.21699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Brunger AT, Adams PD, Clore GM, DeLano WL, Gros P, Grosse-Kunstleve RW, Jiang JS, Kuszewski J, Nilges M, Pannu NS, Read RJ, Rice LM, Simonson T, Warren GL. Crystallography & NMR system: A New Software Suite for Macromolecular Structure Determination. Acta Crystallogr., Sect. D: Biol. Crystallogr. 1998;54:905–921. doi: 10.1107/s0907444998003254. [DOI] [PubMed] [Google Scholar]
  • 32.Dedmon MM, Lindorff-Larsen K, Christodoulou J, Vendruscolo M, Dobson CM. Mapping Long-Range Interactions in Alpha-Synuclein Using Spin-Label NMR and Ensemble Molecular Dynamics Simulations. J. Am. Chem. Soc. 2005;127:476–477. doi: 10.1021/ja044834j. [DOI] [PubMed] [Google Scholar]
  • 33.Lindorff-Larsen K, Best RB, DePristo MA, Dobson CM, Vendruscolo M. Simultaneous Determination of Protein Structure and Dynamics. Nature. 2005;433:128–132. doi: 10.1038/nature03199. [DOI] [PubMed] [Google Scholar]
  • 34.Lindorff-Larsen K, Best RB, Vendruscolo M. Interpreting Dynamically-Averaged Scalar Couplings in Proteins. J. Biomol. NMR. 2005;32:273–280. doi: 10.1007/s10858-005-8873-0. [DOI] [PubMed] [Google Scholar]
  • 35.Best RB, Lindorff-Larsen K, DePristo MA, Vendruscolo M. Relation Between Native Ensembles and Experimental Structures of Proteins. Proc. Natl. Acad. Sci. USA. 2006;103:10901–10906. doi: 10.1073/pnas.0511156103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Lindorff-Larsen K, Rogen P, Paci E, Vendruscolo M, Dobson CM. Protein Folding and the Organization of the Protein Topology Universe. Trends. Biochem. Sci. 2005;30:13–19. doi: 10.1016/j.tibs.2004.11.008. [DOI] [PubMed] [Google Scholar]
  • 37.Lee J, Chen JH, Brooks CL, Im WP. Application of Solid-State NMR Restraint Potentials in Membrane Protein Modeling. J. Magn. Reson. 2008;193:68–76. doi: 10.1016/j.jmr.2008.04.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Jo S, Im W. Transmembrane Helix Orientation and Dynamics: Insights from Ensemble Dynamics with Solid-State NMR Observables. Biophys. J. 2011;100:2913–2921. doi: 10.1016/j.bpj.2011.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kim T, Jo S, Im W. Solid-State NMR Ensemble Dynamics as a Mediator Between Experiment and Simulation. Biophys. J. 2011;100:2922–2928. doi: 10.1016/j.bpj.2011.02.063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Im W, Jo S, Kim T. An Ensemble Dynamics Approach to Decipher Solid-State NMR Observables of Membrane Proteins. Biochim. Biophys. Acta. 2012;1818:252–262. doi: 10.1016/j.bbamem.2011.07.048. [DOI] [PubMed] [Google Scholar]
  • 41.Jaynes ET. Information Theory and Statistical Mechanics. Phys. Rev. 1957;106:620–630. [Google Scholar]
  • 42.Roux B, Weare J. On the Statistical Equivalence of Restrained-Ensemble Simulations With the Maximum Entropy Method. J. Chem. Phys. 2013 doi: 10.1063/1.4792208. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Roux B, Islam SM. Restrained-Ensemble Molecular Dynamics Simulations Based on Distance Histograms from Double Electron-Electron Resonance Spectroscopy. J. Phys. Chem. B. 2013 doi: 10.1021/jp3110369. submitted. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Jeschke G. Distance Measurements in the Nanometer Range by Pulse EPR. Chemphyschem. 2002;3:927–932. doi: 10.1002/1439-7641(20021115)3:11<927::AID-CPHC927>3.0.CO;2-Q. [DOI] [PubMed] [Google Scholar]
  • 45.Jeschke G, Chechik V, Ionita P, Godt A, Zimmermann H, Banham J, Timmel CR, Hilger D, Jung H. DeerAnalysis2006 - A Comprehensive Software Package for Analyzing Pulsed ELDOR Data. Appl. Magn. Reson. 2006;30:473–499. [Google Scholar]
  • 46.Chiang YW, Borbat PP, Freed JH. The Determination of Pair Distance Distributions by Pulsed ESR using Tikhonov Regularization. J. Magn. Reson. 2005;172:279–295. doi: 10.1016/j.jmr.2004.10.012. [DOI] [PubMed] [Google Scholar]
  • 47.Brooks BR, Brooks CL, Mackerell AD, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S, et al. CHARMM: The Biomolecular Simulation Program. J. Comput. Chem. 2009;30:1545–1614. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, Chipot C, Skeel RD, Kale L, Schulten K. Scalable Molecular Dynamics with NAMD. J. Comput. Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.MacKerell AD, Bashford D, Bellott M, Dunbrack RL, Evanseck JD, Field MJ, Fischer S, Gao J, Guo H, Ha S, Joseph-McCarthy D, Kuchnir L, Kuczera K, Lau FTK, Mattos C, Michnick S, Ngo T, Nguyen DT, Prodhom B, Reiher WE, Roux B, Schlenkrich M, Smith JC, Stote R, Straub J, Watanabe M, Wiorkiewicz-Kuczera J, Yin D, Karplus M. All-atom Empirical Potential for Molecular Modeling and Dynamics Studies of Proteins. J. Phys. Chem. B. 1998;102:3586–1616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
  • 50.Mackerell AD, Feig M, Brooks CL. Extending the Treatment of Backbone Energetics in Protein Force Fields: Limitations of Gas-Phase Quantum Mechanics in Reproducing Protein Conformational Distributions in Molecular Dynamics Simulations. J. Comput. Chem. 2004;25:1400–1415. doi: 10.1002/jcc.20065. [DOI] [PubMed] [Google Scholar]
  • 51.Weaver LH, Matthews BW. Structure of Bacteriophage-T4 Lysozyme Refined at 1.7 a Resolution. J. Mol. Biol. 1987;193:189–199. doi: 10.1016/0022-2836(87)90636-x. [DOI] [PubMed] [Google Scholar]
  • 52.Roitberg A, Elber R. Modeling Side-Chains in Peptides and Proteins - Application of the Locally Enhanced Sampling and the Simulated Annealing Methods to Find Minimum Energy Conformations. J. Chem. Phys. 1991;95:9277–9287. [Google Scholar]
  • 53.Simmerling C, Lee MR, Ortiz AR, Kolinski A, Skolnick J, Kollman PA. Combining MONSSTER and LES/PME to Predict Protein Structure from Amino Acid Sequence: Application to the Small Protein CMTI-1. J. Am. Chem. Soc. 2000;122:8392–8402. [Google Scholar]
  • 54.Simmerling C, Fox T, Kollman PA. Use of Locally Enhanced Sampling in Free Energy Calculations: Testing and Application to the Alpha ->Beta Anomerization of Glucose. J. Am. Chem. Soc. 1998;120:5771–5782. [Google Scholar]
  • 55.Adelman SA, Doll JD. Generalized Langevin Equation Approach for Atom-Solid-Surface Scattering - General Formulation for Classical Scattering Off Harmonic Solids. J. Chem. Phys. 1976;64:2375–2388. [Google Scholar]
  • 56.Ryckaert JP, Ciccotti G, Berendsen HJC. Numerical-Integration of Cartesian Equations of Motion of a System with Constraints - Molecular-Dynamics of N-Alkanes. J. Comput. Phys. 1977;23:327–341. [Google Scholar]
  • 57.Darden T, York D, Pedersen L. Particle Mesh Ewald - An N.Log(N) Method for Ewald Sums in Large Systems. J. Chem. Phys. 1993;98:10089–10092. [Google Scholar]
  • 58.Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, Pedersen LG. A Smooth Particle Mesh Ewald Method. J. Chem. Phys. 1995;103:8577–8593. [Google Scholar]
  • 59.Elber R, Karplus M. Enhanced Sampling in Molecular-Dynamics - Use of the Time-Dependent Hartree Approximation for a Simulation of Carbon-Monoxide Diffusion through Myoglobin. J. Am. Chem. Soc. 1990;112:9161–9175. [Google Scholar]
  • 60.Humphrey W, Dalke A, Schulten K. VMD: Visual Molecular Dynamics. J. Mol. Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
  • 61.Georgieva ER, Roy AS, Grigoryants VM, Borbat PP, Earle KA, Scholes CP, Freed JH. Effect of Freezing Conditions on Distances and Their Distributions Derived from Double Electron Electron Resonance (DEER): A Study of Doubly-Spin-Labeled T4 Lysozyme. J. Magn. Reson. 2012;216:69–77. doi: 10.1016/j.jmr.2012.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Brunger AT. Version 1.2 of the Crystallography and NMR System. Nat. Protoc. 2007;2:2728–2733. doi: 10.1038/nprot.2007.406. [DOI] [PubMed] [Google Scholar]
  • 63.Mchaourab HS, Kalai T, Hideg K, Hubbell WL. Motion of Spin-Labeled Side Chains in T4 Lysozyme: Effect of Side Chain Structure. Biochemistry. 1999;38:2947–2955. doi: 10.1021/bi9826310. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1_si_001

RESOURCES