Abstract
Disulfide bridges in proteins are formed by the oxidation of pairs of cysteine residues. These cross-links play a critical role in stabilizing the 3D-structure of small disulfide rich polypeptides such as hormones and venom toxins. The arrangement of the multiple disulfide bonds directs the peptide fold into distinct structural motifs that have evolved for resistance against biochemical and physical insults. These structural scaffolds have, therefore, proven to be very attractive in bioengineering efforts to develop novel biologics with applications in health and agriculture. Structural characterization of small disulfide rich peptides (DRPs) presents unique challenges when using commonly applied biophysical methods. NMR is the most commonly used method for studying such molecules, where the relatively small size of these molecules results in highly precise structural ensembles defined by a large number of distance and dihedral angle restraints per amino acid. However, in NMR the sulfur atoms that are involved in three of the five dihedral angles in a disulfide bond cannot be readily measured. Given the central role of disulfide bonds in the structure of these molecules, it is unclear what the inherent resolution of such NMR structures is when using traditional NMR methods. Here, we use an extensive set of long-range residual dipolar couplings (RDCs) to assess the resolution of the NMR structure of a disulfide-rich peptide. We find that structures based primarily on NOEs, yield ensembles that are equivalent to a crystallographic resolution of 2-3 Å in resolution, and that incorporation of RDCs reduces this to ~1-1.5 Å resolution. At this resolution the sidechain of ordered amino acids can be defined accurately, allowing the geometry of the cysteine bridges to be better defined, and allowing for disulfide-bond connectivities to be determined with high confidence. The observed improvements in resolution when using RDCs is remarkable considering the small size of these peptides.
Keywords: disulfide-rich peptides, nuclear magnetic resonance, peptides, NMR, residual dipolar couplings, RDCs
Introduction
Disulfide bridges are naturally occurring cross-links formed between the side chains of two cysteine residues and are one of the most important post-translational modifications in proteins. The significance of disulfide bonds in proteins can be appreciated by their prevalence accounting for ~18% of all known protein structures (9,709 of the 55,032 proteins deposited in the PDB contain at least one disulfide bond – excluding structures with >90% identity, PDB accessed 2019-10-15).
Disulfide-rich peptides and proteins are commonly secreted, and include biopharmaceutical targets such as hormones and antibodies (Lewis and Garcia, 2003; Mamathambika and Bardwell, 2008; Gongora-Benitez et al., 2014). In these molecules, the disulfide bonds serve to stabilize the protein fold in the extracellular environment. This property is perhaps most dramatically demonstrated in venom peptides, where disulfide-rich peptide toxins are not only excreted but further injected into a foreign host where they exert their function, often with devastating consequences (Undheim et al., 2015).
The potency and portability of disulfide-rich peptides have attracted much attention from the growing biotechnology sector as a potential source of leads for development of therapeutics or agricultural agents (Gongora-Benitez et al., 2014). As research efforts in this field intensify there is an interest in defining the high-resolution structure of these proteins to interpret structure-activity relationship studies and ultimately for rational peptide engineering (Brust et al., 2013).
Structural analysis of small disulfide-rich peptides, however, presents unique challenges in commonly applied high-resolution biophysical characterization by NMR spectroscopy and X-ray crystallography. In X-ray studies the crystal packing forces can have a significant effect on the structure of these molecules (de Araujo et al., 2014), where the peptide fold is often more reliant on the disulfide bonds than an extensive hydrophobic core (Undheim et al., 2016). The dynamic nature of peptides, often including extended loops, can further complicate the crystallization process itself. An approach to overcome these problems is the use of racemic crystallization methods (Zawadzke and Berg, 1993), which have gained popularity with the reduced cost of production of the D-form of peptides (Yeates and Kent, 2012). Indeed, where sufficient quantities of the D-form of the peptide can be synthesized and folded readily, this provides an attractive approach to structural characterization of peptides.
In NMR, the sulfur atoms that are involved in three of the five dihedral angles in a disulfide bond cannot be readily measured (Figure 1) (Mobli and King, 2010). This can be particularly detrimental to structure determination of these molecules, as the disulfide bonds may act as hinges, connecting distal parts of the peptide. Large inaccuracies in the definition of the geometry of disulfide bonds can therefore lead to reorientation of the relative position of different segments of the peptide. Efforts to replace sulfur with the NMR amenable 77Se isotope of selenium offer a solution (Mobli et al., 2009), and whilst very useful in defining disulfide connectivities, it remains unclear how the selenium itself affects the overall 3D structure.
Despite the above noted disadvantages, NMR remains the preferred method for structural characterization of peptides, with 73% (2524/3436) of all structures of peptides (proteins smaller than 6 kDa) solved by NMR spectroscopy. For disulfide containing peptides this statistic increases to 93% (800/853 – PDB accessed 2019-09, including only representative structures at >90% sequence identity). In principle, the small size of peptides makes them ideally suited to structural characterization by NMR spectroscopy, where the relatively small number of atoms provides largely unequivocal assignment of all atoms and their interatomic interactions. This is particularly true where uniform isotope labeling can be applied, allowing for use of multidimensional heteronuclear NMR methods (Ikura et al., 1990). Indeed, in general, a large number (>12) of restraints (distance and dihedral angle) per residue can be generated in these structures leading to a very high precision in the structure calculation step – RMSD of 0.1-0.2 Å often reported over ordered regions of the backbone (Klint et al., 2015; Undheim et al., 2015). However, the achieved precision can be deceptive as it reflects convergence of the numerical optimization and does not necessarily correlate with the accuracy of the structural models generated. Indeed, recent work investigating the structure of a series of disulfide containing proteins by NMR and X-ray crystallography found structural discrepancies (0.5–0.8 Å along the Cα atoms) (Alex et al., 2019), with the NMR structures having higher calculated disulfide bond energies (Schmidt et al., 2006).
Residual dipolar couplings (RDC) provide an excellent independent measure of structural accuracy of NMR models and can themselves be used to improve the resolution of NMR structures (Tjandra and Bax, 1997; Bax and Grishaev, 2005). Here, we seek to apply RDCs in assessment of the accuracy of disulfide-rich peptide structures generated by NMR spectroscopy and also investigate if these structures can be further refined by the inclusion of RDCs in the refinement step. We perform an in-depth structural analysis of a disulfide-rich peptide (Ta1a) previously reported with a precision below 1 Å using standard heteronuclear NMR methods (Undheim et al., 2015). Our results show that the accuracy of this structure is consistent with an X-ray structure of ~2.5 Å resolution. RDC refinement improves this to the equivalent of a 1-1.5 Å X-ray structure resolution. At this resolution there is a significantly better definition of sidechain orientations, and critically, improved definition of the cysteine bridges and their connectivities.
Materials and Methods
Ta1a Production
A pLicC vector harboring a codon optimized Ta1a gene (GeneArt, Life technologies) was transformed into BL21 (DE3) E. coli cells. A single colony was used to inoculate a culture and grown over night in 100 ml Luria-Bertani (LB) media containing 100 μg/ml ampicillin and the culture was grown at 37°C at 200 rpm until the optical density at 600 nm (OD600) reached 0.8. 5% inoculum was used from the starter culture to inoculate 1 L of LB medium containing 100 μg/ml of ampicillin.
The culture was induced at an OD600 of 0.8, with IPTG (isopropyl-β-D-thiogalactopyranoside) at a final concentration of 500 μM, and then further grown for another 14 h at 18°C. The bacterial cells were harvested by centrifugation at 6,000 rpm for 20 min at 4°C, and then resuspended in 10 ml of lysis buffer (40 mM Tris, 300 mM NaCl, 10 mM imidazole pH 8.0). The cells, kept on ice, were then lysed using sonication. Subsequently, the supernatant was collected after centrifugation at 17,000 rpm for 45 min at 4°C and filtered through a 0.45 μm filter.
The cell lysate was applied to a buffer-equilibrated, 5 ml His-Trap column (GE Healthcare) using a peristaltic pump at a flow rate of 3 ml/min. The column was then washed with 30 column volumes of wash buffer (40 mM Tris, 300 mM NaCl, 40 mM imidazole pH 8.0). The protein was eluted with 40 mM Tris, 300 mM NaCl at pH 8.0 with 250 mM imidazole. The eluted protein was concentrated and buffer exchanged using 15 ml centrifugal filters (Millipore) with a 10 kDa cut-off membrane, using a Tris buffer (40 mM Tris, 300 mM NaCl, pH 8.0) to remove imidazole.
Ta1a was separated from the (His)6-MBP fusion by Tobacco Etch Virus (TEV) protease. The cleavage was performed by adding TEV protease (1 mg/ml) to the protein solution [at a UV absorption at 280 nm (A280) ratio of 1:20] in a redox buffer (2.5 mM GSH and 0.25 mM GSSG) and incubated at 25°C overnight. The reaction mixture was loaded onto a 5 ml His-Trap column (GE Healthcare) and the flow-through containing Ta1a was collected.
The Ta1a sample was acidified with 0.05% Trifluoroaceticacid (TFA) and filtered through a 0.45 μm filter, and loaded onto a semi-preparative column (C3-Zorbax resin, Agilent) at a flowrate of 3 ml/min with a linear gradient of 5-80% acetonitrile (0.043% TFA) in water (0.05% TFA) over 50 min using an Agilent HPLC system. Elution was monitored by UV absorption at 214 and 280 nm. The fraction containing the pure peptide was lyophilized and stored at −20°C.
Uniformly enriched protein was produced by growing the transformed E. coli cells in minimal media supplemented with 4.0 g/L 13C6-glucose and 1.0 g/L 15NH4Cl as the sole carbon and nitrogen sources, respectively (Marley et al., 2001).
Preparation of Liquid Crystalline Solutions
A Pf1-phage aligned sample was obtained by mixing the stock solution of 50 mg/ml Pf1 phage (http://www.asla-biotech.com) with the protein solution and gently pipetting the final mixture a few times. PEG solution (Ruckert and Otting, 2000) was prepared by mixing the pentaethylene glycol monododecyl ether (C12E5; Sigma Aldrich), with hexanol at a molar ratio [PEG]:[hexanol] of 3:2. All the anisotropic data in aligned media were recorded at 25°C.
NMR Measurements
Details of all NMR experiments are provided in Tables S1, S2. First, a 3D CT-HNCA (Grzesiek and Bax, 1992) spectrum was recorded to confirm the assignments of Ta1a in 20 mM phosphate buffer pH 6.2 against the published values (BMRBID: 16667). All subsequent experiments were performed in the same buffer using a peptide concentration of ~500 μM (unless otherwise stated).
For RDC measurements data were acquired under isotropic conditions as well as when using the two different alignment media. Where values deviate from the details in Table S2, these are provided here. 2D IPAP-HSQC (Ottiger et al., 1998) spectra were obtained by acquiring two datasets in an interleaved manner for measurement of 1JNH splittings. 3D CT-HN(CO)CA (Bax et al., 2001) spectra without Hα decoupling were recorded for measurement of 1JCαHα. 3D HNCO (Bax et al., 2001) spectra without Cα decoupling were acquired for measurement of 1 splittings. In this case, for the isotropic sample, the data was acquired using non-uniform sampling mode and the data reconstructed using the sparse multidimensional iterative lineshape-enhanced method (Ying et al., 2017). 3D CT-HN(COCA)CB (Li et al., 2015) spectra without Hβ decoupling were recorded for measurement of sums of 1JCβHβ2 and 1JCβHβ3 splittings.
For χ1 measurements a 3D HA[HB,HN](CACO)NH (Lohr et al., 1999) spectrum was acquired to obtain 3JHαHβ couplings. A 3D HNHB (Archer et al., 1991) spectrum was recorded for the measurement of 3JNHβ coupling constants.
A 3D 13C-edited NOESY (Davis et al., 1992) spectrum as well as a 3D 15N-edited NOESY (Kay et al., 1992) spectrum, each using a mixing time of 150 ms, were recorded at 900.1 MHz 1H frequency for stereospecific assignment of Hβ2 and Hβ3 protons from Hα-Hβ2, Hα-Hβ3, HN-Hβ2, and HN-Hβ3 cross-peak intensities. Steady-state 1H-15N heteronuclear NOEs were recorded at 900.1 MHz 1H frequency as a qualitative probe for large amplitude backbone dynamics.
The NMRPipe software system (Delaglio et al., 1995) was used for processing the 3D CT-HNCA, 2D IPAP-HSQC, 3D HNCO, 3D HN(CO)CA, 3D CT-HN(COCA)CB and 3D HA[HB,HN](CACO)NH spectra. The 3D HNHB, 3D 13C edited NOESY, 1H-15N heteronuclear NOE and 3D 15N edited NOESY data were processed using the Rowland NMR toolkit (Hoch and Stern, 1996). The CCPNMR (Vranken et al., 2005) and Sparky (Goddard and Kneller, 2008) programs were used for analysis. Peak positions and intensities were determined using parabolic interpolation in all three dimensions of local peak maxima. Resonance assignments of Ta1a were made using the acquired spectra in agreement with previously published data (Undheim et al., 2015).
Refinement of the Ta1a Structure
The structure of Ta1a was refined starting from the coordinates of the PDB deposition 2KSL (Undheim et al., 2015), against the N-HN, Cα-Hα, C′-Cα, N-C′ and ΣCβHβ RDCs (in both alignment media where available) using the program XPLOR-NIH (Schwieters et al., 2003), which uses a simulated annealing protocol. The RDC refinements are here performed using a standard Cartesian molecular dynamics simulated annealing refinement protocol, starting from the coordinates of 2KSL structure, and with all structural restraints used in the CYANA calculations (Table S7). The protocol included 200,000 steps of 1 fs each, with the temperature linearly ramped down from 1000 to 25 K, followed by a Powell energy minimization. Empirical force fields included quadratic bond, angle, and improper terms with force constants of 5,000 kcal Å−2 mol, 500 kcal rad−2 mol−1 and 500 kcal rad−2 mol, respectively, as well as a quartic repulsive-only non-bonded potential with a force constant of 4 kcal Å−2 mol−1. In addition, backbone/backbone hydrogen bonding geometries were restrained via a potential of mean force (HBDB term in XPLOR-NIH). Varied magnitude alignment tensors were used for the RDCs of each alignment condition during the structural calculations. Force constants for different types of RDCs in two different alignment media were obtained from a combination of force constants (0.20, 0.15, 0.20, 0.20, 0.15 kcal Hz−2 mol−1for 1DNH, 1DCaHa, 1, 1 and 1DCbHb, respectively, from Pf1 phage medium; 0.20, 0.20, 0.10, 0.20 kcal Hz−2 mol−1 for 1DNH, 1DCaHa, 1 and 1, respectively, from PEG liquid crystals medium, and with all the values being normalized7 to the 1DNH couplings) that yielded the best cross validation performance according to a grid searching procedure. The 1DNH RDC force constant multipliers (and thereby the multipliers for the other types of RDCs) were ramped up with a constant multiplicative factor throughout the protocol from 0.05 to 5.0; i.e., at 25 K, the 1DNH force constant was ramped up to 1 kcal Hz−2 mol−1. A total of 50 structures was generated, and the twenty lowest energy structures were retained and then deposited in the PDB (entry 6URP). All figures of protein structures were prepared using PyMol (Schrodinger, 2015).
Results
Protein Expression and Purification
We transformed a plasmid containing the gene encoding a Ta1a-fusion protein into E. coli (BL21(DE3) strain) cells for expression. This gene also includes a periplasmic localization sequence followed by a (His)6 tagged maltose binding protein (MBP) – both N-terminal to the peptide sequence. The fusion also includes a TEV-protease cleavage site between the peptide and the fusion partner. Using this construct, we purified the fusion protein using IMAC chromatography followed by cleavage of the peptide from the fusion partner by TEV protease. The TEV protease and the released fusion partner were removed by an additional round of IMAC chromatography. We further purified the peptide using reverse-phase HPLC (Figure S1). The final yield of Ta1a was ~0.6 mg/L.
Optimization of Alignment Media Concentrations for RDC Measurements
We used two liquid crystalline media that align the protein differently relative to the magnetic field: a suspension of the negatively charged filamentous phage Pf1 (Hansen et al., 1998) and a polyethylene glycol (PEG) based liquid crystal (Ruckert and Otting, 2000). The prepared liquid crystalline medium will not always align in the magnetic field, if it does, the protein will also align. There is also a probability the protein will interact with the alignment media resulting in a higher degree of alignment than desired. Hence the strength of alignment in the particular liquid crystalline medium needs to be assessed. To determine the level of alignment of the liquid crystals themselves and how the peptide aligns with the Pf1 medium, we acquired a series of 2H spectra and 1H spectra of Ta1a, while reducing the Pf1-phage concentration from a starting value of 20 mg/ml. We found that the highest concentration of Pf1 phage at which the 1H spectrum shows good agreement with its isotropic counterpart (by visual comparison) is 5.8 mg/ml of Pf1 phage (Figures S2, S3). Similarly, we optimized the PEG bicelles concentration by measuring RDC data in either 5 or 8% w/v of PEG (Figure S4). Both concentrations yield good spectral data, with good agreement of backbone amide residual dipolar couplings (1DNH) when compared to the back-calculated values from the published Ta1a structure (2KSL). Based on the magnitudes of 1H-15N couplings, we chose the higher PEG concentration as it resulted in RDCs having a favorable magnitude in the 15-20 Hz range.
We used the optimized alignment conditions to acquire NMR data for subsequent structural refinement. The NMR data included a number of two-dimensional (2D) and three-dimensional (3D) experiments (Table S1), for extraction of J-splittings used to derive RDCs and dihedral angles. The majority of experiments were acquired under isotropic and two different anisotropic conditions, resulting in a total of 16 datasets.
J-Coupling Measurements and Analysis of χ1 Angles and Rotameric Distribution
To improve the resolution of the peptide sidechains, we analyzed the χ1 rotamer positions and distributions using a combination of J-couplings, NOE intensities and RDCs.
First, we assigned prochiral β-methylene protons using a combination of 3JHαHβ and qualitative 3JN−Hβ couplings (Bax et al., 1994). These coupling constants have a characteristic pattern for each of the three energetically preferred staggered rotamer positions (χ1 = 60°, 180° or −60°) (West and Smith, 1998). Following this procedure, we were able to stereospecifically assign 16 β-methylenes in Ta1a (Table S3). For 15 residues we found evidence of motional averaging, with 3JHαHβ couplings in the range of 5.0-9.0 Hz and/or qualitative 3JNHβ couplings classified as “medium-medium” pairs. In these cases, the χ1 angle was classified as “average.” We were unable to determine the χ1 angle of six residues due to overlap of their Hβ2 and Hβ3 resonances.
For valine, isoleucine and threonine residues (each with a β-methine proton), the side-chain is assumed to adopt a single staggered conformation when the measured 3JHαHβ couplings is greater than ~10 Hz (~9 Hz for Thr because of the effects of high electronegativity of the oxygen substituent) or less than ~5 Hz (Smith et al., 1991; Li et al., 2015). Three out of the 6 residues (Table S4) in Ta1a fit into this category with supporting data from the qualitative 3JN−Hβ measurements. For cases where the 3JHαHβ couplings does not fit into this category, the side-chain may either occupy a non-staggered conformation or be a rapid average of multiple conformations. Solvent exposed residues Ile-6 and Thr-12 exhibit this behavior, likely as a consequence of conformational averaging.
χ1 rotamer positions can also be derived from characteristic intra-residue 1H-1H NOE intensities. We were, therefore, able to examine the consistency of the above determined χ1 angles with experimentally measured NOE intensities (Table S5). Overall, we found good agreement between the two datasets, however, for Glu-36, Phe-38 and Asp-41, there is an apparent inconsistency. For these residues the HN-Hβ2 and HN-Hβ3 NOE intensities suggest χ1 rotamer averaging whereas the analysis of the J-couplings is consistent with a single staggered rotamer position. Given the apparent uncertainty, the χ1 angles determined for these residues were excluded from the subsequent structural refinement step.
Finally, we can also determine the χ1 rotameric states by comparison of appropriately scaled pairs of RDCs (Chou and Bax, 2001). This is done by assuming that the Cβ–Hβ bonds are at staggered conformations, and parallel to the Cα–Hα and Cα–C′ bonds. Under these conditions the sum of DCβHβ2 and DCβHβ3 is related to normalized sum of either:
[1] (Cα–Hα)i and (Cα–C′)i, or
[2] (Cα–Hα)i and (Cα–C′)i−1 of the preceding residue, or
[3] (Cα–C′)i and (Cα–C′)i−1 of the preceding residue.
Close agreement with [1], [2] or [3] indicates a χ1 value of 180°, 60°, or −60°, respectively (Table S6 shows examples of this analysis). For example, in Phe–16, Cys–26, Tyr–43, and His–47 the DCβHβ2+DCβHβ3 is closest to the sum of and DCαHα, indicating a χ1 angle of 180°. For Asn-40 the sum of DCβHβ2 and DCβHβ3 is closest to the sum of intraresidue DCαHα and of the preceding residue, indicating a χ1 angle of 60°. This analysis does not clearly identify residues that exhibit χ1 = −60°, because in these cases the intra-residue (Cα–N)i vector is not exactly parallel to the preceding residue's (Cα–C′) bond and therefore additional data is required to unambiguously identify the χ1 as being either −60° or 180°. Nevertheless, this approach is useful for extracting χ1 rotamer information for a significant fraction of residues in a protein, and further complements the conventional rotamer analysis using J-couplings and NOE data.
Structural Refinement of the Ta1a Structure
We wanted to compare the previously published structure of Ta1a with that of the refined structure using the RDCs and dihedral angles determined here. The previously reported structure (Undheim et al., 2015) of Ta1a was calculated using NOE-based distance restraints supplemented with backbone dihedral-angle restraints derived from TALOS chemical shift analysis and hydrogen bonding restraints (Cornilescu et al., 1999). This data was further supplemented with stereospecific assignments and χ1 angles obtained from initial structure calculations.
Compared to the published structure we have made a number of changes in the CYANA structure calculation protocol. First, to identify disordered residues, we acquired a heteronuclear NOE dataset, where 15N-1H NOEs are used as reporters of fast dynamics along the peptide backbone. Overall, the structure was found to be highly ordered except for residues E2, I6, and K51 (Figure S5). RDC and dihedral angle restraints involving these residues were thus removed in all subsequent structure calculation steps.
Next, we replaced the computationally derived stereospecific assignments and χ1 angles with those we have determined experimentally in this study. In our initial CYANA calculations, using this new data, we found that inclusion of the χ1 angle of N41 led to Ramachandran violations. We note that the sidechain of this residue also shows evidence of hydrogen bonding in the D2O exchange experiment, although we were here not able to unambiguously determine the hydrogen bonding partner of this residue. Both the χ1 and sidechain hydrogen bond constraints involving this residue were excluded from subsequent analyses. All other hydrogen bonds based on D2O exchange were included as previously assigned. No further violations were observed in subsequent CYANA calculations using these updated parameters.
The RDC refinements are here performed using NIH-XPLOR, thus the CYANA constraints were translated to the appropriate format for this software. In NIH-XPLOR, all constraints are weighted equally (1.0), and initial structure calculations revealed that some of the experimentally determined dihedral angles were being violated in some structures. Thus, to reflect the higher confidence of the experimentally derived dihedral angle constraints, we used a higher weighting for these (3.0 vs. 1.0), which resolved the observed violations without introducing any additional ones. These constraints formed the basis of all subsequent structure calculations (with and without inclusion of RDCs).
Measurement of Backbone and Side-Chain RDCs
RDCs can be used to both assess the resolution of an existing structure and to improve the resolution of a structure during the refinement step. To assess the resolution of the published Ta1a structure we fitted the backbone RDCs to the existing structure by order matrix analysis using singular value decomposition (SVD) (Losonczi et al., 1999). This method returns the predicted RDCs and the parameters of the alignment tensor determined by the fitting procedure. To quantify the agreement between the structure and the measured dipolar couplings, the quality factor Q is used as proposed by Cornilescu et al. (1998):
(1) |
where Dcalc and Dobs are calculated and observed dipolar couplings in the above equation (Equation 1). This factor offers a straightforward and unambiguous way to evaluate the structural quality.
The Q factor obtained for the published Ta1a structure was compared with the RDC data from the Pf1 phage and PEG aligned samples, where Q factors of 0.39 and 0.58 were found, respectively (Figure 2). In literature reports, Q factors of ~0.4 are commonly found for structures with a resolution equivalent to an X-ray crystallographic resolution of 2–3 Å resolution (Chen and Tjandra, 2011).
To refine the structure of Ta1a, we used all of the RDC data generated from the experiments listed in Table S1. This included a total of 355 backbone RDCs and 37 sidechain RDCs. The sidechain RDCs were derived from the sum of 1Hβ-13Cβ RDCs in the 3D HN(COCA)CB (Li et al., 2015) spectrum recorded without 1H decoupling in the 13Cβ dimension in the Pf1 phage aligned protein sample. In this experiment we were able to define side-chain RDCs for 37 out of the 49 non-Gly residues. Inclusion of these RDC restraints resulted in a computed structural ensemble having a Q factor of 0.08 and 0.21 in the Pf1 phage and PEG aligned samples, respectively (Figure 2 and Table S7).
Since the RDC data used to derive the structure is also used to calculate the Q factor, there is a clear potential for overfitting, and an alternative approach is required to assess the resolution of our RDC refined structure. This can be achieved by omitting some of the RDCs, and to use these excluded RDCs to cross-validate the refined structure. This procedure allows for an unbiased “Qfree-value” to be determined. Here, we omitted 10% of all RDCs in each of the eight experiments (N-HN, Cα-C′, Cα-Hαand C′-N in two alignment media) involving backbone atoms. The refinement was repeated using this reduced dataset, and the back-calculated values of the omitted RDCs were used to calculate the Qfree value. The procedure was repeated 10 times (ensuring each RDC was left out across the 10 runs) and the average Qfree value was 0.12 ± 0.03 for the Pf1-aligned sample and 0.24 ± 0.02 for the PEG-aligned data. A Qfree value in the low 20% range (0.20) roughly translates to structures consistent with an X-ray crystallographic resolution of 1–1.5 Å resolution.
Discussion
We have used residual dipolar couplings (RDCs) to assess the resolution of a disulfide-rich peptide structure. This structure had previously been solved using a large number of NMR restraints derived from high-quality heteronuclear NMR data (Undheim et al., 2015). Although the structure of this relatively small peptide had been defined with high precision, the quality of the structure is similar to what is typically achieved using similar methods for larger proteins (~2−3 Å). We subsequently used the measured RDCs to see if we could improve the resolution of this structure further. Our results show that inclusion of RDCs dramatically improves the attainable resolution.
Resolution of Peptide Structures
NMR remains the preferred method for solving peptide structures. Analysis of the protein databank (2019/10) reveals that three quarters of PDB structures of peptides smaller than 6 kDa have been solved by NMR spectroscopy. The fast-molecular tumbling of these molecules results in sharp NMR lines and the relatively low number resonances further reduces the complexity of the spectral data.
The favorable NMR conditions experienced by peptides in solution, results in data with low levels of ambiguity and in principle in a better-defined structure. It is, therefore, not unusual to find peptide structures that are defined by more than 10 experimental restraints per amino acid, yielding structural ensembles computed with a precision of 0.1–0.5 Å root-mean-squared difference (RMSD) over structured regions, along the peptide backbone (de Araujo et al., 2014; Klint et al., 2015; Undheim et al., 2015).
It remains unclear, however, if the high precision of peptide structural ensembles reflects the accuracy of these structures. The high precision is a direct consequence of the level of ambiguity in the data, which has been argued to be a good corollary with the accuracy of an NMR structure (Tikole et al., 2013; Buchner and Guntert, 2015). Thus, whilst it would appear reasonable to assume that the higher precision achieved for small peptides makes these more accurately defined, we note that there is no consensus on how NMR accuracy should be defined (Rosato et al., 2013).
In the case of disulfide rich peptides, there is a unique challenge arising from NMR blind-spots near the sulfur atoms. This arises as NMR signals from sulfur atoms cannot be readily measured in macromolecules. In the case of methionines this is not a significant concern as inaccuracies in defining the local environment about the sulfur atom only results in lower accuracy of side-chain dihedral angles near the periphery of the amino acid. In disulfide bonds, however, the quiescent sulfur atoms obscure three of the five dihedral angles that connect backbone atoms of often distal segments of the peptide.
The non-uniform distribution of structural restraints in disulfide-rich peptides has the potential of providing deceptively favorable ensemble statistics—in particular for helical peptides. In general NMR structures are defined overwhelmingly by short range NOE interactions within an amino acid or between neighboring amino acids. Orientation of segments of secondary structure are, however, often organized either through backbone-to-backbone hydrogen bonds (in β-sheets) or in proton-rich regions in the hydrophobic core of the protein. Thus, helices or loop regions that are connected by disulfide bonds in peptides rely critically on well-defined disulfide bonds.
In this study we revisited the high-precision structure of Ta1a, a largely helical disulfide-rich peptide. The peptide contains a disulfide bond connecting two helices as well as two disulfide bonds connecting a loop region with a helix. The peptide structure was solved using a large number of NOE and dihedral angle restraints generated by heteronuclear NMR measurements using an isotope-labeled sample. The peptide displays excellent NMR properties and consequently the structural ensemble can be computed with very high precision.
We used RDCs to assess the accuracy of the Ta1a structure and found it to be substantially lower than the reported precision. The Ta1a structure was originally reported with a precision of ~0.4 Å along the structured regions of the backbone. In this study we have refined the original structure using additional J-coupling and 3D NOE-derived restraints and an additional molecular dynamics refinement step in Xplor-NIH (Schwieters et al., 2018). The precision is still very high, albeit slightly lower than the reported structure (0.6 Å along the structured regions of the backbone). We then assessed the quality of the structure using an extensive RDC dataset acquired in two different alignment media (392 RDCs in total with 355 backbone RDCs). The agreement of the structures with the measured RDCs reveals that the two structures have similar quality factors, with the structure refined here agreeing slightly better than the published structure with the RDC data measured in the PEG-hexanol liquid crystals (see Figures 2, 3). The quality factors themselves are consistent with structures that have an equivalent crystallographic resolution of ~2.5 Å (Bax, 2003). Indeed, when we align the two structures along their structured regions there is a ~1 Å RMS difference in atomic coordinates (over residues 3–5 and 7–50).
We next used the RDC data to refine the peptide structure, which resulted in a structure that fits the RDC data very well (Figure 2). It is important to validate these results by omitting some of the measured RDCs to see how these fit the calculated structure using the remaining RDCs. Given the large number of RDCs generated here we excluded 10% of the backbone RDCs in each dataset (~35 RDCs randomly omitted from ~350) and performed the structure calculation using the remaining constraints. We repeated this procedure ten times, each time randomly omitting a different set of 10%. The quality factor generated using the omitted RDCs shows that the structure is of high quality consistent with a high-resolution crystal structure (1-1.5 Å) (Bax, 2003). We also aligned the ten generated structures with each other and to the structure generated using all RDCs. In each case the RMSD between these structures was <0.2 Å (all atoms in structured regions: residues 3-5 and 7-50).
Having the three structures (published, refined here with and without RDCs – Figure 3) we investigated what the likely source of discrepancy between them was. Given that there was a ~1 Å difference in structural alignment between the two structures that were generated without RDCs, when we align each of these with that solved using the RDCs. We found that both of these were ~1 Å different to the RDC refined structure as well (2KSL = 1.2 Å, Xplor-NIH refinement without RDCs = 1.0 Å). When we compared the alignment of each individual helix from either the published structure or that refined here without RDCs we find these to align very well with the RDC refined structure along the backbone (RMSD ~ 0.2 Å), indicating that the helices are locally accurately defined in all structures. The difference is, therefore, likely to be in the alignment of the helices with respect to each other. To test this, we represented each helix with a vector and calculated the angle formed between the central Helix-2 vector and the remaining three vectors (Figure 4). We find that Helix-3 is particularly displaced with respect to Helix-2. Further we find that when RDCs are not used in the refinement there is a much larger spread of inter-helix vector angles between different members of the same ensemble. This results in a larger standard deviation of the average angle within each structural ensemble (see Figure 4). The difference in average angle between Helix-2 and Helix-3 may appear to be small, but a 6° displacement of two connected 10 Å vectors is equivalent to about a 1 Å rotation at the tip of one the vectors. Thus, the vectorial displacement of the helical elements in a structure may be a better indicator of the resolution of a helical peptide than the ensemble RMSD.
Defining Disulfide Geometries in Peptide Structures
Conformations of disulfide bridges are classified based on the five side-chain dihedral angles as shown in Figure 1. Different methods have been proposed to classify disulfide conformers (Srinivasan et al., 1990; Harrison and Sternberg, 1996; Hutchinson and Thornton, 1996; Schmidt et al., 2006; Ozhogina and Bominaar, 2009). Here, we used the method proposed by Schmidt et al. (2006). There are three basic disulfide types based on the combination of signs of the χ2, χ3, and χ2' angles and they are designated spirals, hooks or staples. The classification depends on the sign and order of the angles, for instance all positive or all negative angles are designated as spirals. Disulfide bonds are further classified as right handed (RH) or left handed (LH) depending on whether the sign of the χ3 angle is positive or negative, respectively. Schmidt et al. included the χ1 and χ1′ angles to further refine the classification (Schmidt et al., 2006). This has expanded the number of types from 6 to 20 different types.
Using the above classification system, we analyzed the geometry of the three disulfide bonds in our Ta1a structures. In the NOE-derived Ta1a structure, the Cys7–Cys37 disulfide bridge exhibits 3 different conformers (–RH-hook; –RH-staple; –LH-spiral), the Cys23–Cys33 disulfide bridge predominantly adopts a –LH-spiral conformation with 9 structural models also adopting the +/−LH-spiral and for the Cys26–Cys46 disulfide bridge all the structural models in the ensemble adopt the +/−RH spiral conformation (Table S8)—note that the sign refers to the sign of the χ1 and χ1' angles. After refining the NOE derived structure with RDC restraints, the ensembles of all 20 structural models uniquely adopt –LH-hook, –LH-spiral, and –LH-hook for the disulfides Cys7–Cys37, Cys23–Cys33, and Cys26–Cys46, respectively (see Figure 5). We also compared our findings with those obtained using predictions from the DISH software (Armstrong et al., 2018). This software uses a trained neural network to predict the rotameric state of χ1 and χ2 dihedral angles (assuming idealized geometries) in disulfide bonds from input chemical shift values. The software produced reliable angles (>90% probability) for χ2 of residues 23 (180°), 33 (−60°) and 37 (180°). Compared to our RDC refined structure, the algorithm correctly predicted the rotameric state of residues 33 and 37, while residue 23 deviates from our results. The lack of reliable predictions for the other χ angles and the observed discrepancy may reflect structural heterogeneity as discussed further below.
Although we are able to classify the geometry of our disulfide bonds qualitatively, we note that there are some notable deviations from idealized geometries. Energetically the χ1 and χ2 angles in disulfide bonds have minima between −30° and −90° (gauche−), 30° and 90° (gauche+) and between 150° and −150° (trans). The χ3 angle has minima between −60° and −120° (left) as well as between 60° and 120° (right). The disulfide between Cys23–Cys33 fits into these limits whereas the disulfides between Cys7–Cys37 and Cys26–Cys46 do not satisfy the defined limits of the χ1/χ2 and χ2 angles, respectively (Figure 5). The χ1 rotamer analysis from J-couplings and NOE data further supports that Cys7 and Cys37 exhibit rotameric averaging. As χ1 is not locked in a staggered rotamer position this will affects the degree of freedom of the χ2 angle thereby exceeding the defined limits. While Cys7–Cys37 shows averaging at the level of the χ1 angle, Cys26–Cys46 shows a well-defined χ1 dihedral angle, but we find non-ideal χ2 angles. Further investigation of this disulfide bond revealed that some of the higher energy structures generated during structure calculations had a slightly different configuration of this bond (Figure 6). In the two alternative structures, we find a flip of the handedness of the disulfide bridge. What is particularly interesting is that while the χ2 and χ3 angles vary in these structures the χ1 angles remain largely the same (close to idealized staggered positions). Furthermore, the relative orientation of the C-H and C-C bond vectors remain largely the same, suggesting that the RDC restraints in this case would not be able to easily resolve this problem. This observed heterogeneity highlights the challenge in defining χ2 and χ3 angles by NMR spectroscopy—and suggests that beyond defining the χ1 angle we are largely reliant on the internal forcefield of molecular dynamics programs to define these angles. The observation of a number of dihedral angles at non-ideal staggered conformations in the disulfide bonds of our structures suggests that the internal forcefields for disulfide bonds can be better parameterized for structural characterization using NMR restraints. This is particularly problematic in CYANA where no torsion angle parameters exist for χ2 and χ3 angles, and disulfide bonds are introduced through a set of distance restraints across the disulfide bridge.
RDCs to Define Disulfide Connectivities in Peptide Structures
Determination of disulfide-bond connectivities in DRPs remains a significant area of research without a clear and unique solution (Mobli and King, 2010; Poppe et al., 2012; Lakbub et al., 2018). An interesting approach is to measure precise distances across the disulfide bond using selective deuteration (Takeda et al., 2012). The method provides excellent accuracy of the distance between hydrogen atoms across the disulfide bond and may be used to infer disulfide-bond connectivitites (in the absence of chemical shift overlap). Similarly, disulfide proxies may be used in the form of 77Se enriched seleno-cystines, allowing for unequivocal determination of diselenide connectivities (Mobli et al., 2009). The methods, however, require highly specialized labeling strategies, placing them beyond routine use.
The question then remains what impact RDCs may have on resolving disulfide-bond connectivities. The above analysis of the geometry of disulfide bonds shows that although the position of the Cβ atoms may be resolved using RDCs, it is unlikely that the RDC data will resolve the position of the sulfur atoms uniquely in solution. Further, our analysis of the quality of our structures, shows that inclusion of RDCs results in an improvement in resolution from ~2.5 Å to < 1.5 Å when RDCs are included as restraints. Based on this information, we downloaded all structures in the protein databank (PDB) that contain a disulfide bond, have a crystallographic resolution of < 1.5 Å and have a molecular weight <50 kDa (2019-09-28). We further excluded highly homologous structures (only including one representative structure when sequences have >90% identity). This resulted in a dataset of ~900 structures. We then queried Cβ–Cβ distances between atoms in a disulfide bond (within the same chain) and also extracted Cβ–Cβ distances for atoms that are not in a disulfide bond (regardless of chain).
Analysis of the PDB database showed that Cβ-Cβ distances between residues in a disulfide bond (intra) overlap with those not in a disulfide bond (inter). The intra-disulfide bonds (2447 bonds in our data set) have Cβ–Cβ distance shorter than 5 Å (average of 3.8 A ± 0.18)—note that one highly strained outlier was removed (1SO7.pdb). Further, our data contains approximately 200 Cβ–Cβ distance shorter than 5 Å between cysteine residues not in a disulfide bonds (inter). This would suggest that finding a solution using NMR data may be difficult based on the Cβ positions alone.
However, manual inspection of the 20 structures with the shortest Cβ–Cβ distances of non-connected cysteines shows that such a connection would result in significant violations of other disulfide bonds. There are two particular violations that can be observed, the first is that accommodating the shorter inter-disulfide bond connection results in at least one other disulfide bond having a Cβ–Cβ distance ≥ 5 Å. The second observation is that in all cases reviewed we find that correctly paired cysteines yield the shortest average Cβ–Cβ distances overall. It would, therefore, seem reasonable to determine disulfide bond connectivities from such data by minimizing the Cβ–Cβ distances between connected cysteine pairs.
Practically, this approach can be implemented by repeating the structure calculation step for each possible disulfide isoform and choosing the solution that provides the shortest overall Cβ-Cβ distances. Historically, this approach has been applied where the structural constraints are used to optimize an appropriate function (Jordan et al., 2009). However, when only NOE data are used this approach may yield ambiguous results which has in the past led to incorrect conclusions [see discussions elsewhere (Mobli and King, 2010; Poppe et al., 2012)]. Our analysis suggests that including RDCs in such a data-driven approach provides much higher confidence in determining disulfide bond connectivities and is unlikely to lead to incorrect solutions.
Conclusion
Structural characterization of disulfide-rich peptides is chiefly conducted using NMR spectroscopy. Although, these molecules have excellent properties for solution studies, the presence of multiple disulfide bonds poses a significant challenge in attainable resolution.
Analysis of the structure of a largely helical disulfide-rich peptide (Ta1a), using RDCs, shows that although the structure had been determined at very high precision, the overall resolution of the structure was consistent with an X-ray crystallographic resolution of ~2.5 Å. Including RDCs as restraints improves this resolution to < 1.5 Å resolution.
We find that despite inclusion of RDC restraints non-ideal geometries of cysteine bridges are found where evidence of rotamer averaging is present. We further find that χ2 and χ3 angles may display heterogeneity that cannot be resolved by RDCs alone.
Finally, we note that at the resolution achieved here, Cβ–Cβ distance measurements are sufficient to determine disulfide-bond connectivities with high confidence.
Data Availability Statement
The datasets generated for this study can be found in the Protein Data Bank 6URP.
Author Contributions
MM conceived and directed the project. VR prepared all of the samples and conducted the experiments with input and guidance from all authors. VR, YS, and MM analyzed the data. VR and MM prepared the figures and tables and wrote the manuscript with input from all authors.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors thank Dr. Johan Rosengren and Mr. Colton Payne for assistance with execution of DISH. Finally, we are indebted to Dr. Ad Bax for valuable input to the design of the RDC experiments and their interpretation.
Footnotes
Funding. This project was supported by the Australian Research Council (ARC grants: DP140101098, DP190101177, and FT110100925), The University of Queensland (UQ Fellowship to MM, and travel award to VR) and the National Health and Medical Research Council (NHMRC APP1162597). VR was supported by an International Postgraduate Award.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fchem.2019.00889/full#supplementary-material
References
- Alex J. M., Rennie M. L., Engilberge S., Lehoczki G., Dorottya H., Fizil A., et al. (2019). Calixarene-mediated assembly of a small antifungal protein. IUCrJ 6, 238–247. 10.1107/S2052252519000411 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Archer S. J., Ikura M., Torchia D. A., Bax A. (1991). An alternative 3d-Nmr technique for Correlating backbone N-15 with side-chain H-beta-resonances in larger proteins. J. Magn. Reson. 95, 636–641. 10.1016/0022-2364(91)90182-S [DOI] [Google Scholar]
- Armstrong D. A., Kaas Q., Rosengren K. J. (2018). Prediction of disulfide dihedral angles using chemical shifts. Chem. Sci. 9, 6548–6556. 10.1039/C8SC01423J [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bax A. (2003). Weak alignment offers new NMR opportunities to study protein structure and dynamics. Protein Sci. 12, 1–16. 10.1110/ps.0233303 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bax A., Grishaev A. (2005). Weak alignment NMR: a hawk-eyed view of biomolecular structure. Curr. Opin. Struct. Biol. 15, 563–570. 10.1016/j.sbi.2005.08.006 [DOI] [PubMed] [Google Scholar]
- Bax A., Kontaxis G., Tjandra N. (2001). Dipolar couplings in macromolecular structure determination. Methods Enzymol. 339, 127–174. 10.1016/S0076-6879(01)39313-8 [DOI] [PubMed] [Google Scholar]
- Bax A., Vuister G. W., Grzesiek S., Delaglio F., Wang A. C., Tschudin R., et al. (1994). Measurement of homo- and heteronuclear J couplings from quantitative J correlation. Methods Enzymol. 239, 79–105. 10.1016/S0076-6879(94)39004-5 [DOI] [PubMed] [Google Scholar]
- Brust A., Wang C. I., Daly N. L., Kennerley J., Sadeghi M., Christie M. J., et al. (2013). Vicinal disulfide constrained cyclic peptidomimetics: a turn mimetic scaffold targeting the norepinephrine transporter. Angew. Chem. Int. Ed Engl. 52, 12020–12023. 10.1002/anie.201304660 [DOI] [PubMed] [Google Scholar]
- Buchner L., Guntert P. (2015). Increased reliability of nuclear magnetic resonance protein structures by consensus structure bundles. Structure 23, 425–434. 10.1016/j.str.2014.11.014 [DOI] [PubMed] [Google Scholar]
- Chen K., Tjandra N. (2011). “The use of residual dipolar coupling in studying proteins by NMR,” in NMR of Proteins and Small Biomolecules, ed G. Zhu (Berlin: Springer; ), 47–67. 10.1007/128_2011_215 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chou J. J., Bax A. (2001). Protein side-chain rotamers from dipolar couplings in a liquid crystalline phase. J. Am. Chem. Soc. 123, 3844–3845. 10.1021/ja015660y [DOI] [PubMed] [Google Scholar]
- Cornilescu G., Delaglio F., Bax A. (1999). Protein backbone angle restraints from searching a database for chemical shift and sequence homology. J. Biomol. NMR 13, 289–302. 10.1023/A:1008392405740 [DOI] [PubMed] [Google Scholar]
- Cornilescu G., Marquardt J. L., Ottiger M., Bax A. (1998). Validation of protein structure from anisotropic carbonyl chemical shifts in a dilute liquid crystalline phase. J. Am. Chem. Soc. 120, 6836–6837. 10.1021/ja9812610 [DOI] [Google Scholar]
- Davis A. L., Keeler J., Laue E. D., Moskau D. (1992). Experiments for recording pure-absorption heteronuclear correlation spectra using pulsed field gradients. J. Magn. Reson. 98, 207–216. 10.1016/0022-2364(92)90126-R [DOI] [Google Scholar]
- de Araujo A. D., Mobli M., Castro J., Harrington A. M., Vetter I., Dekan Z., et al. (2014). Selenoether oxytocin analogues have analgesic properties in a mouse model of chronic abdominal pain. Nat. Commun. 5:3165. 10.1038/ncomms4165 [DOI] [PubMed] [Google Scholar]
- Delaglio F., Grzesiek S., Vuister G. W., Zhu G., Pfeifer J., Bax A. (1995). NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6, 277–293. 10.1007/BF00197809 [DOI] [PubMed] [Google Scholar]
- Goddard T. D., Kneller D. G. (2008). SPARKY 3. San Francisco, CA: University of California. [Google Scholar]
- Gongora-Benitez M., Tulla-Puche J., Albericio F. (2014). Multifaceted roles of disulfide bonds. Peptides as therapeutics. Chem. Rev. 114, 901–926. 10.1021/cr400031z [DOI] [PubMed] [Google Scholar]
- Grzesiek S., Bax A. (1992). Improved 3d triple-resonance nmr techniques applied to a 31-Kda Protein. J. Magn. Reson. 96, 432–440. 10.1016/0022-2364(92)90099-S [DOI] [Google Scholar]
- Hansen M. R., Mueller L., Pardi A. (1998). Tunable alignment of macromolecules by filamentous phage yields dipolar coupling interactions. Nat. Struct. Biol. 5, 1065–1074. 10.1038/4176 [DOI] [PubMed] [Google Scholar]
- Harrison P. M., Sternberg M. J. (1996). The disulphide beta-cross: from cystine geometry and clustering to classification of small disulphide-rich protein folds. J. Mol. Biol. 264, 603–623. 10.1006/jmbi.1996.0664 [DOI] [PubMed] [Google Scholar]
- Hoch J. C., Stern A. S. (1996). NMR Data Processing. New York, NY: Wiley-Liss. [Google Scholar]
- Hutchinson E. G., Thornton J. M. (1996). PROMOTIF-a program to identify and analyze structural motifs in proteins. Protein Sci. 5, 212–220. 10.1002/pro.5560050204 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ikura M., Kay L. E., Bax A. (1990). A novel approach for sequential assignment of 1H, 13C, and 15N spectra of proteins: heteronuclear triple-resonance three-dimensional NMR spectroscopy. Application to calmodulin. Biochemistry 29, 4659–4667. 10.1021/bi00471a022 [DOI] [PubMed] [Google Scholar]
- Jordan J. B., Poppe L., Haniu M., Arvedson T., Syed R., Li V., et al. (2009). Hepcidin revisited, disulfide connectivity, dynamics, and structure. J. Biol. Chem. 284, 24155–24167. 10.1074/jbc.M109.017764 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kay L. E., Keifer P., Saarinen T. (1992). Pure absorption gradient enhanced heteronuclear single quantum correlation spectroscopy with improved sensitivity. J. Am. Chem. Soc. 114, 10663–10665. 10.1021/ja00052a088 [DOI] [Google Scholar]
- Klint J. K., Chin Y. K., Mobli M. (2015). Rational engineering defines a molecular switch that is essential for activity of spider-venom peptides against the analgesics target NaV1.7. Mol. Pharmacol. 88, 1002–1010. 10.1124/mol.115.100784 [DOI] [PubMed] [Google Scholar]
- Lakbub J. C., Shipman J. T., Desaire H. (2018). Recent mass spectrometry-based techniques and considerations for disulfide bond characterization in proteins. Anal. Bioanal. Chem. 410, 2467–2484. 10.1007/s00216-017-0772-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lewis R. J., Garcia M. L. (2003). Therapeutic potential of venom peptides. Nat. Rev. Drug Discov. 2, 790–802. 10.1038/nrd1197 [DOI] [PubMed] [Google Scholar]
- Li F., Grishaev A., Ying J., Bax A. (2015). Side chain conformational distributions of a small protein derived from model-free analysis of a large set of residual dipolar couplings. J. Am. Chem. Soc. 137, 14798–14811. 10.1021/jacs.5b10072 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lohr F., Schmidt J. M., Ruterjans H. (1999). Simultaneous measurement of (3)J(HN,H alpha) and (3)J(H alpha,H beta) coupling constants in C-13,N-15-labeled proteins. J. Am. Chem. Soc. 121, 11821–11826. 10.1021/ja991356h [DOI] [Google Scholar]
- Losonczi J. A., Andrec M., Fischer M. W., Prestegard J. H. (1999). Order matrix analysis of residual dipolar couplings using singular value decomposition. J. Magn. Reson. 138, 334–342. 10.1006/jmre.1999.1754 [DOI] [PubMed] [Google Scholar]
- Mamathambika B. S., Bardwell J. C. (2008). Disulfide-linked protein folding pathways. Annu. Rev. Cell Dev. Biol. 24, 211–235. 10.1146/annurev.cellbio.24.110707.175333 [DOI] [PubMed] [Google Scholar]
- Marley J., Lu M., Bracken C. (2001). A method for efficient isotopic labeling of recombinant proteins. J. Biomol. NMR 20, 71–75. 10.1023/A:1011254402785 [DOI] [PubMed] [Google Scholar]
- Mobli M., De Araujo A. D., Lambert L. K., Pierens G. K., Windley M. J., Nicholson G. M., et al. (2009). Direct visualization of disulfide bonds through diselenide proxies using77Se NMR spectroscopy. Angew. Chem. Int. Ed Eng. 48, 9312–9314. 10.1002/anie.200905206 [DOI] [PubMed] [Google Scholar]
- Mobli M., King G. F. (2010). NMR methods for determining disulfide-bond connectivities. Toxicon 56, 849–854. 10.1016/j.toxicon.2010.06.018 [DOI] [PubMed] [Google Scholar]
- Ottiger M., Delaglio F., Bax A. (1998). Measurement of J and dipolar couplings from simplified two-dimensional NMR spectra. J. Magn. Reson. 131, 373–378. 10.1006/jmre.1998.1361 [DOI] [PubMed] [Google Scholar]
- Ozhogina O. A., Bominaar E. L. (2009). Characterization of the kringle fold and identification of a ubiquitous new class of disulfide rotamers. J. Struct. Biol. 168, 223–233. 10.1016/j.jsb.2009.06.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poppe L., Hui J. O., Ligutti J., Murray J. K., Schnier P. D. (2012). PADLOC: a powerful tool to assign disulfide bond connectivities in peptides and proteins by NMR spectroscopy. Anal. Chem. 84, 262–266. 10.1021/ac203078x [DOI] [PubMed] [Google Scholar]
- Rosato A., Tejero R., Montelione G. T. (2013). Quality assessment of protein NMR structures. Curr. Opin. Struct. Biol. 23, 715–724. 10.1016/j.sbi.2013.08.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruckert M., Otting G. (2000). Alignment of biological macromolecules in novel nonionic liquid crystalline media for NMR experiments. J. Am. Chem. Soc. 122, 7793–7797. 10.1021/ja001068h [DOI] [Google Scholar]
- Schmidt B., Ho L., Hogg P. J. (2006). Allosteric disulfide bonds. Biochemistry 45, 7429–7433. 10.1021/bi0603064 [DOI] [PubMed] [Google Scholar]
- Schrodinger L. L. C. (2015). “The PyMOL Molecular Graphics System, Version 1.7” (New York, NY: Shrodinger LLC; ). [Google Scholar]
- Schwieters C. D., Bermejo G. A., Clore G. M. (2018). Xplor-NIH for molecular structure determination from NMR and other data sources. Protein Sci. 27, 26–40. 10.1002/pro.3248 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schwieters C. D., Kuszewski J. J., Tjandra N., Clore G. M. (2003). The Xplor-NIH NMR molecular structure determination package. J. Magn. Reson. 160, 65–73. 10.1016/S1090-7807(02)00014-9 [DOI] [PubMed] [Google Scholar]
- Smith L. J., Sutcliffe M. J., Redfield C., Dobson C. M. (1991). Analysis of. vphi. and. chi. 1 torsion angles for hen lysozyme in solution from proton NMR spin-spin coupling constants. Biochemistry 30, 986–996. 10.1021/bi00218a015 [DOI] [PubMed] [Google Scholar]
- Srinivasan N., Sowdhamini R., Ramakrishnan C., Balaram P. (1990). Conformations of disulfide bridges in proteins. Int. J. Pept. Protein Res. 36, 147–155. 10.1111/j.1399-3011.1990.tb00958.x [DOI] [PubMed] [Google Scholar]
- Takeda M., Terauchi T., Kainosho M. (2012). Conformational analysis by quantitative NOE measurements of the beta-proton pairs across individual disulfide bonds in proteins. J. Biomol. NMR 52, 127–139. 10.1007/s10858-011-9587-0 [DOI] [PubMed] [Google Scholar]
- Tikole S., Jaravine V., Orekhov V. Y., Guntert P. (2013). Effects of NMR spectral resolution on protein structure calculation. PLoS ONE 8:e68567. 10.1371/journal.pone.0068567 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tjandra N., Bax A. (1997). Measurement of dipolar contributions to 1JCH splittings from magnetic-field dependence of J modulation in two-dimensional NMR spectra. J. Magn. Reson. 124, 512–515. 10.1006/jmre.1996.1088 [DOI] [PubMed] [Google Scholar]
- Undheim E. A., Grimm L. L., Low C. F., Morgenstern D., Herzig V., Zobel-Thropp P., et al. (2015). Weaponization of a hormone: convergent recruitment of hyperglycemic hormone into the venom of arthropod predators. Structure 23, 1283–1292. 10.1016/j.str.2015.05.003 [DOI] [PubMed] [Google Scholar]
- Undheim E. A. B., Mobli M., King G. F. (2016). Toxin structures as evolutionary tools: using conserved 3D folds to study the evolutionary trajectory of rapidly evolving peptides. BioEssays 38, 539–548. 10.1002/bies.201500165 [DOI] [PubMed] [Google Scholar]
- Vranken W. F., Boucher W., Stevens T. J., Fogh R. H., Pajon A., Llinas M., et al. (2005). The CCPN data model for NMR spectroscopy: development of a software pipeline. Proteins 59, 687–696. 10.1002/prot.20449 [DOI] [PubMed] [Google Scholar]
- West N. J., Smith L. J. (1998). Side-chains in native and random coil protein conformations. Analysis of NMR coupling constants and chi1 torsion angle preferences. J. Mol. Biol. 280, 867–877. 10.1006/jmbi.1998.1911 [DOI] [PubMed] [Google Scholar]
- Yeates T. O., Kent S. B. (2012). Racemic protein crystallography. Annu. Rev. Biophys. 41, 41–61. 10.1146/annurev-biophys-050511-102333 [DOI] [PubMed] [Google Scholar]
- Ying J., Delaglio F., Torchia D. A., Bax A. (2017). Sparse multidimensional iterative lineshape-enhanced (SMILE) reconstruction of both non-uniformly sampled and conventional NMR data. J. Biomol. NMR 68, 101–118. 10.1007/s10858-016-0072-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zawadzke L. E., Berg J. M. (1993). The structure of a centrosymmetric protein crystal. Proteins 16, 301–305. 10.1002/prot.340160308 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated for this study can be found in the Protein Data Bank 6URP.