Abstract
One of the stiffest challenges in structural studies of proteins using NMR is the assignment of sidechain resonances. Typically, a panel of lengthy 3D experiments are acquired in order to establish connectivities and resolve ambiguities due to overlap. We demonstrate that these experiments can be replaced by a single 4D experiment that is time-efficient, yields excellent resolution, and captures unique carbon-proton connectivity information. The approach is made practical by the use of non-uniform sampling in the three indirect time dimensions and maximum entropy reconstruction of the corresponding 3D frequency spectrum. This 4D method will facilitate automated resonance assignment procedures and it should be particularly beneficial for increasing throughput in NMR-based structural genomics initiatives.
Introduction
The development of non-uniform sampling (NUS) approaches to data collection in multidimensional NMR experiments represents the most fundamental change in the way that NMR data are collected since the development of Fourier transform (FT) NMR by Ernst and Anderson some 50 years ago.[1] Uniform sampling that is required by the discrete Fourier transform (DFT) prevents full realization of the benefits from costly ultra-high-field magnets because it is impractical to sample long evolution times, required for high resolution, in the indirect dimensions of multidimensional experiments. The use of non-uniform sampling is essential for making high-resolution 3D and 4D NMR experiments practical, especially for marginally soluble or transiently stable systems.
High-resolution multidimensional spectra are of particular interest in studies of complex biomolecules that are isotopically enriched with 13C and 15N.[2] These experiments are used both for resonance assignment of all 1H, 13C and 15N atoms in the molecule and to extract interproton distance restraints from dipolar interactions. For proteins, this generally involves sequential assignment of backbone amide resonances followed by assignment of sidechain nuclei and, finally, extraction of distance and dihedral-angle restraints.
One of the most challenging steps in this process is the assignment of sidechain resonances. This process commonly involves the acquisition of two complementary 3D experiments, the H(CC)(CO)NH-TOCSY and (H)CC(CO)NH-TOCSY, which produce proton and carbon correlations respectively.[3] These experiments reveal correlations between the amide 1H-15N pair of residue i and the aliphatic sidechain atoms of the previous residue, i-1. Thus, each spectrum contains roughly one resonance per aliphatic sidechain nucleus. These experiments, however, do not provide information about carbon-hydrogen connectivities, often leading to ambiguous assignments. This problem can be ameliorated using a 4D version of this experiment[4], but conventional uniform sampling of this experiment will require a prohibitively long acquisition time. In principle, it should be possible to reduce the experiment time through NUS.
A variety of methods have recently been introduced for processing NUS data with more than three dimensions.[5; 6; 7; 8; 9; 10] These methods are generally restricted to situations of high signal-to-noise ratio (S/N). The relatively low sensitivity of HCC(CO)NH-TOCSY type experiments (compared to HNCO, CBCA(CO)NH etc.) makes them poor candidates for these reconstruction methods. An alternative approach that works well in situations of low S/N is maximum entropy reconstruction (MaxEnt).[11; 12] However, an implementation capable of reconstructing three indirect dimensions simultaneously (i.e., 4D experiments) has not been available (note that very recently the FM approach was applied to a 4D-NOESY dataset[13], this method attempts to recover signal amplitudes accurately but suffers from very long processing times). Here, we use a new implementation of a previously-described MaxEnt algorithm[14; 15], applicable to 3D reconstruction in the indirect dimensions of a new 4D HCC(CO)NH-TOCSY experiments acquired using NUS (see Fig. 1).
Projections of a related 5D experiment[16] have recently been used to extract sidechain resonance frequencies. Such projections, however, have been shown to produce significant artifacts if the fully-dimensional spectrum is reconstructed.[17] Thus without further processing or analysis (e.g., APSY[16], PRODECOMP[18], etc.), the data are not sufficient to accurately determine the full spectrum. Similarly, non-uniform FT (also referred to as sparse multidimensional Fourier transform) processing of this 5D experiment has been reported, using a 1.5 mM sample of ubiquitin.[19] The advantages of MaxEnt reconstruction over non-uniform FT, particularly in situations of low S/N, have been extensively demonstrated.[12; 20]
We present here the first NUS 4D HCC(CO)NH-TOCSY spectrum, reconstructed using 3D MaxEnt, which we compare with corresponding conventional 3D spectra.
Results
The maximum entropy algorithm used here has been described in detail elsewhere[14; 15], and the formulae for the entropy and constraint readily generalize to multidimensional data. However, because MaxEnt reconstruction is nonlinear, MaxEnt must be the last time-to-frequency transformation applied. Thus, if MaxEnt is to be used in t1, then the t2 dimension must be transformed to the frequency domain first. If MaxEnt is to be used for both t1 and t2, then it must be applied to both dimensions simultaneously, rather than one at a time. The IDFT (inverse discrete Fourier transform) operation used and the corresponding sums can be applied along as many dimensions as desired, with concomitant scaling of processing and memory requirements.
The experimental data consist of the 3D versions (uniformly sampled) and a 4D-NUS version of the HCC(CO)NH-TOCSY experiment, acquired using a 300 µM sample of a 4.5-kDa spider toxin known as PcTx1 (Fig. 2). The high-resolution 3D experiments were acquired in 1.5 days each to achieve resolution similar to the 4D experiment. The 3D data were processed using linear prediction extrapolation, apodization, zero-filling and discrete Fourier transform (DFT) in the indirect dimensions. The 4D experiment was acquired using 2000 hypercomplex samples distributed within a cube of 60×64×64 points in the 15N, 13C, and 1H dimensions, respectively (Fig. 2); less than 1% of the complete (uniformly sampled) cube was sampled and the total acquisition time was 1.5 days.
The non-uniform sampling schedule was generated using an algorithm not yet applied to NMR data. In this approach, sample times are selected randomly without replacement so that the same sample point is not acquired more than once. The sample points are distributed using an exponentially decaying sampling density matched to the decay of the signal envelope; samples thus occur more densely at short times where the S/N is higher, for non-constant time (CT) dimensions. The algorithm differs from those previously described in that it is based on a one-pass approach described by Eframidis and Spirakis[21], who demonstrated that this is equivalent to weighted random selection without replacement. The algorithm is applicable to any dimensionality, but currently developed for up to three NUS dimensions in the program Sched3d. The algorithm calculates for each time point t(i) a weighting factor w(i), using the decay envelope of the signal, viz:
where lwk and swk are the linewidth and sweep width in the kth dimension, respectively, dim is the number of dimensions and the equation may be further altered to include, for example, J modulation. The algorithm then calculates a rank for each point r(i) according to:
where x(i) is a random number between 0 and 1 generated for each point. A subset of M points having the highest values of r(i) are then selected from N total points. This method is fast and it avoids sampling the same point multiple times.
All data were processed using the Rowland NMR Toolkit (http://rnmrtk.uchc.edu). The 4D experiment was processed using the new 3D MaxEnt program (msa3d). The MaxEnt reconstruction parameters were determined using a previously reported approach (def=0.01, λ=0.02).[22; 23] Fig. 3 shows a 2D cross-section of the indirect 1H and 13C dimensions, at the direct 1H and indirect 15N frequencies corresponding to residue Ser30, revealing correlations from the sidechain protons of Arg29. This example illustrates: (i) the superb resolution of the 4D experiment, which is particularly useful for assigning complex sidechains; (ii) the excellent sensitivity of the 4D spectrum (note that Ser30 is one of the weakest peaks in the 1H-15N HSQC spectrum; see Fig. 4); (iii) the ability to unambiguously determine which protons are attached to which carbons using the 4D experiment.
Discussion
The number of sample points required to faithfully reconstruct a spectrum depends on the complexity of the underlying data (e.g., the number of signals, dynamic range etc.), with the dimensionality of the experiment being of little consequence.[20] Since the number of unknowns (i.e., frequencies and phases of signals) in each of the two 3D experiments is roughly the same as in the 4D experiment, the number of data points required to obtain spectra with sufficient information content should be the same in all three experiments. The time required to carry out a 4D experiment would be about the same as the time required for two 3D experiments if all the hypercomplex components are collected (8 elements for 3D and 16 for 4D).
Crucially, however, in the two 3D experiments the signal acquired in one experiment does not contribute to the other whilst in the 4D all acquired data contribute to the sensitivity of the final reconstruction. This suggests that the 4D experiment will have the same overall sensitivity as both 3D experiments combined (neglecting losses due to differential relaxation), without incurring additional experiment time.
The 4D HCC(CO)NH-TCOSY experiment described here lends itself particularly well to rapid sidechain assignment in the context of structural genomics initiatives. The ability to unequivocally elucidate C–H correlations should facilitate automated assignment approaches, which currently suffer from the ambiguity present when the conventional experiments are used for aliphatic sidechain assignment.[16; 24] Furthermore, dispersion of resonances into an additional dimension significantly reduces the possibility of peak overlap; for example, in the conventional 3D H(CC)(CO)NH-TOCSY experiment it would not be possible to resolve two overlapping peaks from protons attached to carbon atoms resonating at different frequencies, whereas in the 4D experiment these would be resolved and unambiguously assigned.
Conclusions
In conclusion, we have described an approach for aliphatic sidechain assignment employing NUS and non-Fourier spectrum analysis that offers significant advantages over existing methods. This is an attractive alternative to existing fast methods as it is practical, simple to implement and applicable to most proteins routinely analysed by NMR, and is likely to benefit all proteins where sampling to an optimal evolution time of 1.6 × T2 in all dimensions is unfeasible.[25; 26] Furthermore, the experiment yields results that can be readily analysed using either manual or automated approaches.
Experimental
The direct dimension was Fourier transformed. At each frequency point in the resulting interferogram, a 3D hyperplane (cube) was reconstructed using the MaxEnt approach. The time required to compute each cube using MaxEnt reconstruction varied and depends on the data in that cube. The example 2D plane of the 4D NUS experiment shown in Fig. 3 was taken from a cube processed on a laptop computer and converged after a total of 4 h; the average processing time per cube was ~2 h/CPU. Thus to reconstruct 512 pts would take ~1000 h on a single CPU or 128 h (5 days) on an 8 core computer. The full dataset presented here was processed on an Apple X-grid cluster consisting of 104 CPUs and converged overnight.
The 3D H(CC)(CO)NH-TOCSY experiment was acquired using 32 and 64 complex samples in the nitrogen and proton dimensions, respectively, while the 3D (H)CC(CO)NH-TOCSY experiment was acquired using 28 and 56 complex samples in the nitrogen and carbon dimensions, respectively. Both 3D experiments were acquired using 16 transients per time increment. The 3D data was processed using linear prediction extrapolation, shifted sine-bell apodization, zero-filling and discrete Fourier transform (DFT) in the indirect dimensions. The 4D experiment was collected using 8 scans per increment and processed using the new 3D MaxEnt algorithm (msa3d) to yield a digital resolution of 64×256×256 in the 15N, 13C, and 1H dimensions, respectively. Each experiment (3Ds and 4D) was acquired in ~1.5 days.
Complete 1H, 15N, and 13C chemical shifts for PcTx1 have been deposited in BioMagResBank (accession number 16468), while coordinates for both low-resolution and high-resolution structures are available from the Protein Data Bank (coordinate files 1LMM and 2KNI, respectively).
Acknowledgments
Supported by grants from the Australian Research Council (DP0774245 to GFK), the US National Institutes of Health (GM-47467 and RR-020125 to JCH), and the Queensland Smart State Research Facilities Fund. The authors thank Natalie Saez for the sample of PcTx1.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Ernst RR, Anderson WA. Application of Fourier transfroim spectroscopy to magnetic resonance. Review of Scientific Instruments. 1966;37:93–102. [Google Scholar]
- 2.Sattler M, Schleucher J, Griesinger C. Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients. Progress in Nuclear Magnetic Resonance Spectroscopy. 1999;34:93–158. [Google Scholar]
- 3.Montelione GT, Lyons BA, Emerson SD, Tashiro M. An efficient triple resonance experiment using carbon-13 isotropic mixing for determining sequence-specific resonance assignments of isotopically-enriched proteins. Journal of the American Chemical Society. 1992;114:10974–10975. [Google Scholar]
- 4.Clowes RT, Boucher W, Hardman CH, Domaille PJ, Laue ED. A 4D HCC(CO)NNH experiment for the correlation of aliphatic side-chain and backbone resonances in 13C/15N-labelled proteins. Journal of Biomolecular NMR. 1993;3:349–354. [Google Scholar]
- 5.Jaravine VA, Zhuravleva AV, Permi P, Ibraghimov I, Orekhov VY. Hyperdimensional NMR Spectroscopy with Nonlinear Sampling. Journal of the American Chemical Society. 2008;130:3927–3936. doi: 10.1021/ja077282o. [DOI] [PubMed] [Google Scholar]
- 6.Kazimierczuk K, Zawadzka A, Kozminski W, Zhukov I. Determination of spin-spin couplings from ultrahigh resolution 3D NMR spectra obtained by optimized random sampling and multidimensional fourier transformation. Journal of the American Chemical Society. 2008;130:5404–5405. doi: 10.1021/ja800622p. [DOI] [PubMed] [Google Scholar]
- 7.Snyder DA, Xu Y, Yang D, Bruschweiler R. Resolution-Enhanced 4D 15N/13C NOESY Protein NMR Spectroscopy by Application of the Covariance Transform. Journal of the American Chemical Society. 2007;129:14126–14127. doi: 10.1021/ja075533n. [DOI] [PubMed] [Google Scholar]
- 8.Coggins B, Zhou P. High resolution 4-D spectroscopy with sparse concentric shell sampling and FFT-CLEAN. Journal of Biomolecular NMR. 2008;42:225–239. doi: 10.1007/s10858-008-9275-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zawadzka-Kazimierczuk A, Kazimierczuk K, Kozminski W. A set of 4D NMR experiments of enhanced resolution for easy resonance assignment in proteins. Journal of Magnetic Resonance. 202:109–116. doi: 10.1016/j.jmr.2009.10.006. [DOI] [PubMed] [Google Scholar]
- 10.Hiller S, Ibraghimov I, Wagner G, Orekhov VY. Coupled Decomposition of Four-Dimensional NOESY Spectra. Journal of the American Chemical Society. 2009;131:12970–12978. doi: 10.1021/ja902012x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Sibisi S. Two-dimensional reconstructions from one-dimensional data by maximum entropy. Nature. 1983;301:134–136. [Google Scholar]
- 12.Barna JCJ, Tan SM, Laue ED. Use of CLEAN in conjunction with selective data sampling for 2D NMR experiments. Journal of Magnetic Resonance. 1988;78:327–332. [Google Scholar]
- 13.Hyberts S, Frueh D, Arthanari H, Wagner G. FM reconstruction of non-uniformly sampled protein NMR data at higher dimensions and optimization by distillation. Journal of Biomolecular NMR. 2009;45:283–294. doi: 10.1007/s10858-009-9368-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hoch JC, Stern AS, James VDUS Thomas L. Methods in Enzymology. Academic Press; 2002. Maximum entropy reconstruction, spectrum analysis and deconvolution in multidimensional nuclear magnetic resonance; pp. 159–178. [DOI] [PubMed] [Google Scholar]
- 15.Hoch JC, Stern AS. NMR data processing. New York: Wiley-Liss; 1996. [Google Scholar]
- 16.Hiller S, Joss R, Wider G. Automated NMR Assignment of Protein Side Chain Resonances Using Automated Projection Spectroscopy (APSY) Journal of the American Chemical Society. 2008;130:12073–12079. doi: 10.1021/ja803161d. [DOI] [PubMed] [Google Scholar]
- 17.Mobli M, Stern AS, Hoch JC. Spectral reconstruction methods in fast NMR: Reduced dimensionality, random sampling and maximum entropy. Journal of Magnetic Resonance. 2006;182:96–105. doi: 10.1016/j.jmr.2006.06.007. [DOI] [PubMed] [Google Scholar]
- 18.Malmodin D, Billeter M. Multiway Decomposition of NMR Spectra with Coupled Evolution Periods. Journal of the American Chemical Society. 2005;127:13486–13487. doi: 10.1021/ja0545822. [DOI] [PubMed] [Google Scholar]
- 19.Kazimierczuk K, Zawadzka A, Komiski W. Narrow peaks and high dimensionalities: Exploiting the advantages of random sampling. Journal of Magnetic Resonance. 2009;197:219–228. doi: 10.1016/j.jmr.2009.01.003. [DOI] [PubMed] [Google Scholar]
- 20.Mobli M, Hoch JC. Maximum entropy spectral reconstruction of nonuniformly sampled data. Concepts in Magnetic Resonance Part A. 2008;32:436–448. doi: 10.1002/cmr.a.20126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Efraimidis PS, Spirakis PG. Weighted random sampling with a reservoir. Information Processing Letters. 2006;97:181–185. [Google Scholar]
- 22.Mobli M, Maciejewski MW, Gryk MR, Hoch JC. An automated tool for maximum entropy reconstruction of biomolecular NMR spectra. Nature Methods. 2007 doi: 10.1038/nmeth0607-467. [DOI] [PubMed] [Google Scholar]
- 23.Mobli M, Maciejewski M, Gryk M, Hoch J. Automatic maximum entropy spectral reconstruction in NMR. Journal of Biomolecular NMR. 2007;39:133–139. doi: 10.1007/s10858-007-9180-8. [DOI] [PubMed] [Google Scholar]
- 24.Bahrami A, Assadi AH, Markley JL, Eghbalnia HR. Probabilistic Interaction Network of Evidence Algorithm and its Application to Complete Labeling of Peak Lists from Protein NMR Spectroscopy. PLoS Comp. Biol. 2009;5:e1000307. doi: 10.1371/journal.pcbi.1000307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Stern AS, Li KB, Hoch JC. Modern spectrum analysis in multidimensional NMR spectroscopy: Comparison of linear-prediction extrapolation and maximum-entropy reconstruction. Journal of the American Chemical Society. 2002;124:1982–1993. doi: 10.1021/ja011669o. [DOI] [PubMed] [Google Scholar]
- 26.Rovnyak D, Frueh DP, Sastry M, Sun ZJ, Stern AS, Hoch JC, Wagner G. Accelerated acquisition of high resolution triple-resonance spectra using non-uniform sampling and maximum entropy. J. Magn. Reson. 2004;170:15–21. doi: 10.1016/j.jmr.2004.05.016. [DOI] [PubMed] [Google Scholar]
- 27.Emsley L, Bodenhausen G. Optimization of shaped selective pulses for NMR using a quaternion description of their overall propagators. Journal of Magnetic Resonance. 1992;97:135–148. [Google Scholar]
- 28.Boyd J, Soffe N. Selective excitation by pulse shaping combined with phase modulation. Journal of Magnetic Resonance. 1989;85:406–413. [Google Scholar]
- 29.Shaka AJ, Lee CJ, Pines A. Iterative schemes for bilinear operators; application to spin decoupling. Journal of Magnetic Resonance. 1988;77:274–293. [Google Scholar]
- 30.Freeman R, Kupce E. Decoupling: theory and practice I. Current methods and recent concepts. NMR in Biomed. 1997;10:372–380. doi: 10.1002/(sici)1099-1492(199712)10:8<372::aid-nbm478>3.0.co;2-z. [DOI] [PubMed] [Google Scholar]
- 31.Logan TM, Olejniczak ET, Xu RX, Fesik SW. A general method for assigning NMR spectra of denatured proteins using 3D HC(CO)NH-TOCSY triple resonance experiments. Journal of Biomolecular NMR. 1993;3:225–231. doi: 10.1007/BF00178264. [DOI] [PubMed] [Google Scholar]
- 32.Grzesiek S, Bax A. Amino acid type determination in the sequential assignment procedure of uniformly 13C/15N-enriched proteins. Journal of Biomolecular NMR. 1993;3:185–204. doi: 10.1007/BF00178261. [DOI] [PubMed] [Google Scholar]
- 33.Grzesiek S, Anglister J, Bax A. Correlation of Backbone Amide and Aliphatic Side-Chain Resonances in 13C/15N-Enriched Proteins by Isotropic Mixing of 13C Magnetization. Journal of Magnetic Resonance B. 1993;101:114–119. [Google Scholar]
- 34.Marion D, Ikura M, Tschudin R, Bax A. Rapid recording of 2D NMR spectra without phase cycling. Application to the study of hydrogen exchange in proteins. Journal of Magnetic Resonance. 1989;85:393–399. [Google Scholar]
- 35.Sklenar V, Piotto M, Leppik R, Saudek V. Gradient-Tailored Water Suppression for 1H-15N HSQC Experiments Optimized to Retain Full Sensitivity. Journal of Magnetic Resonance A. 1993;102:241–245. [Google Scholar]
- 36.Piotto M, Saudek V, Sklenář V. Gradient-tailored excitation for single-quantum NMR spectroscopy of aqueous solutions. Journal of Biomolecular NMR. 1992;2:661–665. doi: 10.1007/BF02192855. [DOI] [PubMed] [Google Scholar]