Skip to main content
Bioinformatics logoLink to Bioinformatics
. 2011 Mar 3;27(7):1039–1040. doi: 10.1093/bioinformatics/btr054

ncIDP-assign: a SPARKY extension for the effective NMR assignment of intrinsically disordered proteins

Kamil Tamiola 1,*, Frans A A Mulder 1,*
PMCID: PMC3065690  PMID: 21372082

Abstract

Summary: We describe here the ncIDP-assign extension for the popular NMR assignment program SPARKY, which aids in the sequence-specific resonance assignment of intrinsically disordered proteins (IDPs). The assignment plugin greatly facilitates the effective matching of a set of connected resonances to the correct position in the sequence by making use of IDP random coil chemical shifts.

Availability: The ncIDP-assign extension is available at http://www.protein-nmr.org/.

Contact: k.tamiola@rug.nl; f.a.a.mulder@rug.nl; support@protein-nmr.org

1 INTRODUCTION

The structural characterization of intrinsically disordered proteins (IDPs) is a rapidly growing field in structural molecular biology. Over the past decade NMR spectroscopy has proven to be singular in its capacity to provide detailed structural characterization of these dynamic entities (Eliezer, 2009). Although diverse experimental approaches have been developed (Bermel et al., 2006; Mäntylahti et al., 2010; O'Hare et al., 2009), the sequential resonance assignment of > 10 kDa natively unfolded polypeptides is still not a trivial task.

Here, we describe an enhanced version of the SPARKY (Goddard and Kneller, 2006) sequence repositioning extension, which assist in matching, connecting and assigning consecutive residues, and is specifically designed for intrinsically disordered proteins: ncIDP-assign (neighbor-corrected IDP chemical shift assignment). This tool makes use of a novel random-coil chemical shift library, enabling the accurate prediction of the chemical shifts of a queried protein on a basis of tripeptides (Tamiola et al., 2010). Predicted sequence-specific chemical shifts are used as a template for re-assignment and validation of existing resonance assignments. The newly designed ncIDP-assign greatly accelerates the process of sequential resonance assignment of large intrinsically disordered proteins by drastically reducing the level of assignment ambiguities.

2 IMPLEMENTATION

The process of assignment validation and repositioning begins with the computation of the sequence-specific random-coil chemical shifts for the intrinsically disordered protein under study, based on its primary sequence.The random-coil chemical shift for a nucleus n ∈ {1Hα, 1HN, 13Cα, 13Cβ, 13CO, 15N} of amino acid residue a, within a tripeptide xay, can be expressed as,

graphic file with name btr054m1.jpg (1)

where δrcn(a) is the ‘random−coil’ chemical shift in the reference peptide GlyaGly , Δ−1n(x) and Δ+1n(y), are the neighbor corrections due to the preceding and the sequential residue, respectively. Subsequently, a collection of experimentally assigned resonances in a range kk + l specified by the user is retrieved and compared against the predicted chemical shifts from Equation (1). All plausible locations for the fragment along the sequence of a protein are considered. Each investigated position is characterized with a chemical shift deviation score S,

graphic file with name btr054m2.jpg (2)

where Inline graphic, δexpn and σpredn are experimental chemical shifts and the standard deviations of expected chemical shieldings for kk + l sequence combination, respectively. A score of 1.0 implies that at the queried sequence position, the experimental shifts are one standard deviation from the computed values, on average. Hence, the ‘best-fit’ solutions are characterized by the lowest S values.

3 RESULTS AND CONCLUSION

To assess the performance of ncIDP-assign we used the NMR chemical shift assignments for the 140-residue intrinsically disordered, cytoplasmic domain of human neuroligin-3 (hNLG3cyt) (Paz et al., 2008). The robustness of the assignment procedure was established using the data obtained in 1H-15N HSQC (Bodenhausen and Ruben, 1980), COCAHA (Dijkstra et al., 1994) and HNCACB (Wittekind and Mueller, 1993) experiments, providing access to backbone and 13Cβ chemical shifts. Table 1 displays the level of completeness of the resonance assignment for hNLG3cyt, and the accuracy of the chemical shift back computation modules in the standard and ncIDP versions of the SPARKY sequence repositioning plugins. As borne out by Table 1, ncIDP-assign offers an almost two-fold improvement in the estimation of chemical shifts for hNLG3cyt. The superior predictive power of ncIDP-assign translates into detection sensitivity of chemical shift deviations from the sequence-specific ‘random-coil’ values due to resonance misassignment. This point is demonstrated by Fig. 1 where the repositioning performance of the two methods is compared against known resonance assignments for hNLG3cyt. ncIDP-assign identifies correct solutions much more readily, and already at the level of dipeptides for the considered experiments (Fig. 1A). Further expansion of the length of a query fragment to tripeptide rapidly shifts the probability of assigning the correct solution. Consequently, the information content contained within a combination of resonance frequencies in short peptides (length ≥2) is unique enough to make the correct position guess in most cases.

Table 1.

Comparative analysis of chemical shift back-computation for hNLG3cyt using standard chemical shift computation module available in SPARKY, and ncIDP-assign

Nucleus (n) Standarda ncIDPa Assignments
1Hα 0.122 0.044 120
1HN 0.148 0.111 128
13Cα 0.810 0.324 128
13Cβ 0.410 0.212 120
13CO 0.717 0.393 131
15N 1.314 0.664 129

aChemical shift RMSD computed after removal of mean systematic offsets between the computed and experimental resonance assignments for hNLG3cyt in order to minimize chemical shift referencing errors. Root mean square difference (RMSD) values are given in ppm.

Fig. 1.

Fig. 1.

Comparative analysis of accuracy of standard and ncIDP SPARKY sequence repositioning extensions, using resonance assignments in: 1H-15N HSQC, COCAHA and HNCACB experiments, for the intrinsically disordered protein hNLG3cyt. (A) Normalized frequency of correct repositioning solutions as a function of fragment length in: standard (blue) and ncIDP-enhanced (green) repositioning, respectively. (B) Sensitivity as a function of sequence length for: standard (blue) and ncIDP (green) repositioning extensions.

Another critical parameter in the assignment process is the relative separation of the ‘best-fit’ score Sbest with respect to the second-best scoring suggestion Ssecond-best, expressed here as the sensitivity Ssecond-best/Sbest (Fig. 1B). Values for the sensitivity close to 1.0 indicate ambiguity. Given a sequentially assigned dipeptide, ncIDP-assign generates a list of solutions in which the ‘best-fit’ scenario scores appreciably better than the next considered option. Already significant improvements are observed in the analysis of 1H-15N HSQC resonance lists, clearly demonstrating that information content of 1H-15N resonance pairs in sequentially connected dipeptides can be effectively used in the assignment process of an intrinsically disordered protein.

In conclusion, we have shown here that ncIDP-assign is an effective tool to aid the sequential NMR resonance assignment of (intrinsically) disordered proteins.

ACKNOWLEDGEMENTS

Renee Otten (University of Groningen) and Thomas Goddard (University of California, San Francisco) are gratefully acknowledged for help with Python code development and valuable suggestions.

Funding: Netherlands Organization for Scientific Research (NWO) (VIDI grant to F.A.A.M.).

Conflict of Interest: none declared.

REFERENCES

  1. Bermel W., et al. Protonless NMR experiments for sequence-specific assignment of backbone nuclei in unfolded proteins. J. Am. Chem. Soc. 2006;128:3918–3919. doi: 10.1021/ja0582206. [DOI] [PubMed] [Google Scholar]
  2. Bodenhausen G., Ruben D.J. Natural abundance nitrogen-15 NMR by enhanced heteronuclear spectroscopy. Chem. Phys. Lett. 1980;69:185–189. [Google Scholar]
  3. Dijkstra K., et al. The COCAH experiment to correlate intraresidue carbonyl, c[alpha], and h[alpha] resonances in proteins. J. Magn. Reson. Ser. A. 1994;107:102–105. [Google Scholar]
  4. Eliezer D. Biophysical characterization of intrinsically disordered proteins. Curr. Opin. Struct. Biol. 2009;19:23–30. doi: 10.1016/j.sbi.2008.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Goddard T.D., Kneller D. SPARKY 3. San Francisco: University of California; 2006. [Google Scholar]
  6. Mäntylahti S., et al. HA-detected experiments for the backbone assignment of intrinsically disordered proteins. J. Biomol. NMR. 2010;47:171–181. doi: 10.1007/s10858-010-9421-0. [DOI] [PubMed] [Google Scholar]
  7. O'Hare B., et al. Incorporating 1H chemical shift determination into 13C-direct detected spectroscopy of intrinsically disordered proteins in solution. J. Magn. Reson. 2009;200:354–358. doi: 10.1016/j.jmr.2009.07.014. [DOI] [PubMed] [Google Scholar]
  8. Paz A., et al. Biophysical characterization of the unstructured cytoplasmic domain of the human neuronal adhesion protein neuroligin 3. Biophys. J. 2008;95:1928–1944. doi: 10.1529/biophysj.107.126995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Tamiola K., et al. Sequence-specific random coil chemical shifts of intrinsically disordered proteins. J. Am. Chem. Soc. 2010;132:18000–18003. doi: 10.1021/ja105656t. [DOI] [PubMed] [Google Scholar]
  10. Wittekind M., Mueller L. HNCACB, a High-Sensitivity 3D NMR experiment to correlate Amide-Proton and nitrogen resonances with the alpha- and Beta-Carbon resonances in proteins. J. Magn. Reson. Ser. B. 1993;101:201–205. [Google Scholar]

Articles from Bioinformatics are provided here courtesy of Oxford University Press

RESOURCES