The nuclear Overhauser effect (NOE) has been the workhorse of structural studies of macromolecules in solution by NMR spectroscopy. The NOE is a measure of the rate of magnetization transfer between nuclei, induced by modulation of the magnetic dipole–dipole coupling between nuclear spins by overall tumbling of the molecule in solution (1). The efficiency of magnetization transfer via this mechanism is low; consequently, NOEs are typically only observed for protons within ≈5 Å, and lengthy experiments are required in order to detect the weak effect. NOE magnitudes provide a “spectroscopic ruler” through the approximate r−6 dependence on the distance between proton pairs. Although it takes surprisingly few such distance estimates to determine the overall fold of a protein (2)—provided the estimates are distributed evenly throughout the protein—even a complete set of NOEs yields far fewer constraints on the structure than are required to determine all 3N − 6 degrees of freedom for an N-atom protein. The fold is even more highly underdetermined when protein flexibility is taken into account. By complementing NMR measurements with prior knowledge of protein structure, usually in the form of a potential energy function that describes the physical plausibility of a model structure, NMR has emerged as a valuable complement to x-ray crystallography for structure determination. The Protein Data Bank (PDB) (3) now contains >7,000 NMR protein structures, and >1,000 new structures are anticipated to be added in 2008. One way to further hasten the rate of protein structure determination by NMR is to develop alternatives that avoid the need to measure and assign NOEs. In this issue of PNAS, Shen et al. (4) describe a viable alternative to NOE-based structure determination for small proteins.
Nevertheless, reliance on NOEs for determination of biomolecular structure presents real challenges. A NOE must be assigned to two specific nuclei before it can be used to constrain a structural model, and assignment becomes more challenging as the size of the molecule increases. Even a modest-sized protein can yield many thousands of detectable NOEs. NOEs are frequently missing from flexible regions, and NOEs that can be measured are usually highly correlated with one another. The information content of all NOEs is not the same: NOEs between residues that are distant in the primary sequence (called “long-range NOEs”) are more important for constraining the overall fold than are NOEs between neighboring residues (“short-range NOEs”) or intraresidue NOEs that provide no additional structural information (“redundant NOEs”). The latter two types of NOEs are far more prevalent. The confidence level in an assignment is greater when a NOE is correlated with other assigned NOEs, but distinguishing a unique long-range NOE from an erroneous assignment can be challenging. Indeed, the small number of demonstrably incorrect NMR structures reported can nearly all be attributed to a handful of erroneous NOE assignments.
NOEs are by no means the only NMR-observable parameters that are sensitive to structure. Vicinal couplings, residual dipolar couplings, and chemical shifts are all sensitive to structure and are all routinely used to supplement NOE-derived distance restraints in protein structure determination (5). Chemical shifts are the easiest to measure and are the first parameters to be determined in assigning resonances to specific nuclei. Chemical shift is a measure of the Larmour precession frequency of a magnetic spin under the influence of the local magnetic field (6). The field experienced by a nucleus is perturbed from the applied field strength of the instrument by the distribution of electrons near the nucleus, which can either diminish or augment the applied field. As the name implies, chemical shift is sensitive to the number and nature of substituents covalently connected to the nucleus, and to their conformation. The correlation between 1H, 13C, and 15N chemical shifts of the polypeptide backbone and backbone dihedral angles has long been viewed as a potential alternative to NOEs for determining solution structure, but the imprecision of the correlation has proved limiting. Nevertheless, parallel progress in improving the correlation between chemical shifts and structure (both quantum mechanical and empirical) and de novo methods for protein structure prediction led Wishart and Case (7) to anticipate in 2001 that a combination of chemical shifts and structure prediction methods would become a viable method for determining the solution structure of small proteins.
The article by Shen et al. (4) describes a confluence of precisely the two approaches foreseen by Wishart and Case (7). Baker and colleagues (8) are the developers of ROSETTA, a program that predicts protein structure from sequence data alone by using conformational ensembles for peptide fragments selected from the PDB on the basis of sequence homology, combined with an empirical cost function that quantifies the physical plausibility of conformations, to assemble the fragments into the intact protein. ROSETTA has consistently out-performed competing approaches in an annual competition [Critical Assessment of Techniques for Protein Structure Prediction (CASP); www.predictioncenter.org] designed to test the state of the art in protein structure prediction. Bax and colleagues (9) developed TALOS (torsion angle likelihood obtained from shift and sequence similarity), a widely used program for predicting torsion angles for peptide fragments on the basis of chemical shift measurements, again based on structures in the PDB and a corresponding database of chemical shifts, derived from the Biomagnetic Resonance Data Bank (10). To determine small protein structures from chemical shifts alone, Shen et al. employ an expanded TALOS-like database of chemical shifts to select ensembles of peptide fragment conformations, which are assembled into a final structure by using ROSETTA.
The results of this approach—applied to 16 proteins ranging up to 129 residues—are remarkable. The root-mean-square deviation (rmsd) for atoms making up the peptide backbone averages only 1.4 Å from the corresponding target structures, as determined by more-conventional NOE-based approaches. To explore the question of unintentional bias, the authors also performed “blind” structure computations by using chemical shifts for nine proteins whose structures were not yet deposited in the PDB, in collaboration with two structural genomics consortia. The rmsds from the target structures for these blind computations averaged 1.0 Å.
A very similar approach was recently reported by Cavalli et al. (11), also using a database of measured chemical shifts and known protein structures and using ROSETTA to assemble the final structure from ensembles of peptide fragment conformations. Of the five proteins in common to the two studies, only three used the same PDB entry as the target structure. For these three, the results by Shen et al. (4) average 1.1 Å from the target structures (backbone rmsd), whereas the results by Cavalli et al. differ by 1.5 Å. The significantly improved accuracy obtained by Shen et al. seems to be attributable to their use of 1HN and 13C′ chemical shifts, in addition to the 15N, 13Cα, 1Hα, and 13Cβ shifts used by Cavalli et al.—a telling difference suggesting that the addition of more chemical shifts (for example 1Hβ) may allow for future improvements. Different prior structure determinations of the same protein yielded backbone rmsds of 0.6 Å for ubiquitin (PDB entries 1D3Z vs. 1UBQ) and 0.8 Å for calbindin (1ICB vs. 4ICB), illustrating that the precision of the structures determined by chemical shift approaches that of traditional methods.
The impetus for using chemical shifts instead of NOEs for structural studies extends beyond the desire to avoid the time-consuming experiments needed to detect weak NOEs or the desire to reduce the number of assignments. The local nature of chemical shift structural restraints means that the effects of misassignment are similarly local (Fig. 1). In contrast, misassignment of even a single long-range NOE can have a profoundly negative impact on the accuracy of the computed structure. Indeed, all of the demonstrably incorrect NMR protein structures resulted from misassignment of a mere handful of NOEs. Conversely, because chemical shift structural restraints are by their very nature local, they are well suited for defining the structure of smaller proteins where all effects are local, while being less sufficient for larger structures. This may account for the increase in rmsd vs. number of protein residues observed by both Shen et al. (4) and Cavalli et al. (11). For these larger systems, supplementing the chemical shift data with a few longer-range constraints (NOEs, residual dipolar couplings, or spin labels) may prove effective in improving the accuracy of the determination. An interesting exercise would be to apply the method of Shen et al. to compute the structures of the incorrect NMR protein structures reported previously. We predict that, in many instances, the correct fold will be obtained despite the presence of misassignments. Another potential benefit of chemical shifts is that they can be measured for disordered or flexible regions of a protein that typically fail to yield detectable NOEs. Berjanskii and Wishart (12) have recently shown that backbone chemical shifts reflect local flexibility, as well as structure.
Fig. 1.
Structural information derived from chemical shifts is more local than information derived from NOEs. (Left) NOEs observed between the same pair of residues, depicted as arrows between the Hα atom of one residue and the Hβ atom (green) and methyl group (yellow) of a “distant” residue, are highly correlated and long-range in their impact: each provides approximately the same structural information, highly constraining the values of all 10 intervening dihedral angles (magenta). (Right) In contrast, chemical shift measurements for the atoms depicted in green provide more local structural constraints for the adjoining dihedral angles. The figure was prepared by using VMD (15).
The relative independence of structural information derived from chemical shifts reflects the different contributions to the chemical shifts experienced by different nuclei along the peptide backbone (7). In contrast, the high degree of correlation among NOEs (Fig. 1) hinders the use of powerful resampling statistical methods (13) for error analysis or for detecting incorrect assignments. With chemical shift-based structure determination, these tools, which have proven so powerful for x-ray crystallography, could be returned to the armamentarium of NMR spectroscopists.
The useful protein size limit of chemical shifts for structure determination appears to be the result of cumulative error in the correlation between chemical shifts and structure. If this is so, then improvements in accuracy should lead to a concomitant increase in the size of the protein for which the method works. Possible improvements are straightforward in principle. More accurate and extensive correlation between chemical shift and local conformation can be anticipated, either through additional empirical data or from recent improvements in the quantum mechanical computation of chemical shifts (14). In the approach of Shen et al. (4), chemical shifts were used to select families of peptide conformations but not to refine the final structure. Adding chemical shift terms to the cost function used for final assembly and refinement should help extend the applicability of the method. Although clearly there is more work to be done, it can be said that 2008 is the year in which an alternative to NOE-based determination of protein structure in solution became truly viable for small proteins.
Footnotes
The authors declare no conflict of interest.
See companion article on page 4685.
References
- 1.Noggle JH, Shirmer RE. The Nuclear Overhauser Effect: Chemical Applications. New York: Academic; 1971. [Google Scholar]
- 2.Connolly PJ, Stern AS, Hoch JC. J Am Chem Soc. 1994;116:2675–2676. [Google Scholar]
- 3.Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Shen Y, Lange O, Delaglio F, Rossi P, Aramini JM, Liu G, Eletsky A, Wu Y, Singarapu KK, Lemak A, et al. Proc Natl Acad Sci USA. 2008;105:4685–4690. doi: 10.1073/pnas.0800256105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Clore GM, Gronenborn AM. Proc Natl Acad Sci USA. 1998;95:5891–5898. doi: 10.1073/pnas.95.11.5891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Abragam A. Principles of Nuclear Magnetism. Oxford: Oxford Univ Press; 1961. [Google Scholar]
- 7.Wishart DS, Case DA. Methods Enzymol. 2001;338:3–34. doi: 10.1016/s0076-6879(02)38214-4. [DOI] [PubMed] [Google Scholar]
- 8.Simons KT, Bonneau R, Ruczinski I, Baker D. Proteins Suppl. 1999;3:171–176. doi: 10.1002/(sici)1097-0134(1999)37:3+<171::aid-prot21>3.3.co;2-q. [DOI] [PubMed] [Google Scholar]
- 9.Cornilescu G, Delaglio F, Bax A. J Biomol NMR. 1999;13:289–302. doi: 10.1023/a:1008392405740. [DOI] [PubMed] [Google Scholar]
- 10.Ulrich EL, Akutsu H, Doreleijers JF, Harano Y, Ioannidis YE, Lin J, Livny M, Mading S, Maziuk D, Miller Z, et al. Nucleic Acids Res. 2008;36:D402–D408. doi: 10.1093/nar/gkm957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cavalli A, Salvatella X, Dobson CM, Vendruscolo M. Proc Natl Acad Sci USA. 2007;104:9615–9620. doi: 10.1073/pnas.0610313104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Berjanskii MV, Wishart DS. J Biomol NMR. 2008;40:31–48. doi: 10.1007/s10858-007-9208-0. [DOI] [PubMed] [Google Scholar]
- 13.Brünger A. Nature. 1992;355:472–475. doi: 10.1038/355472a0. [DOI] [PubMed] [Google Scholar]
- 14.Moon S, Case DA. J Comput Chem. 2006;27:825–836. doi: 10.1002/jcc.20388. [DOI] [PubMed] [Google Scholar]
- 15.Humphrey W, Dalke A, Schulten K. J Mol Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]

