Abstract
Biomolecular NMR structures are now routinely used in biology, chemistry, and bioinformatics. Methods and metrics for assessing the accuracy and precision of protein NMR structures are beginning to be standardized across the biological NMR community. These include both knowledge-based assessment metrics, parameterized from the database of protein structures, and model vs. data assessment metrics. On line servers are available that provide comprehensive protein structure quality assessment reports, and efforts are in progress by the world-wide Protein Data Bank (wwPDB) to develop a biomolecular NMR structure quality assessment pipeline as part of the structure deposition process. These quality assessment metrics and standards will aid NMR spectroscopists in determining more accurate structures, and increase the value and utility of these structures for the broad scientific community.
Introduction
Nuclear Magnetic Resonance (NMR) spectroscopy, along with X-ray diffraction and cryo-electron microscopy, is one of the three major experimental techniques providing three-dimensional (3D) structures of biological macromolecules. In addition to its unique role in characterizing biomolecular dynamics, NMR is routinely used for structure determinations of small (< 20 kDa) proteins [1-3] and is beginning to be used more routinely for determining structures of larger (20 – 50 kDa) soluble and membrane proteins (e.g. refs [4-7]). NMR-derived structure models can be used interchangeably with models generated by X-ray crystallography in many biological applications. It is therefore natural that many quality assessment metrics are common between the two techniques. In addition, there is a portfolio of metrics that are specific to NMR, which take into account the distinctive features of the NMR data used in the structure determination process.
In this review, we outline some of the metrics in common use for protein NMR structure quality assessment. A large part of our review reflects recently published recommendations of the world-wide Protein Data Base (wwPDB) task forces on validation of biomolecular structures determined by X-ray crystallography [8] and NMR methods [9].
Knowledge-based measures
Knowledge-based (KB) metrics describe how well the structure model conforms to expectations with respect to selected features that can be assessed by comparison with the extensive data base of experimental structures. These include bond length and bond angle distributions, dihedral angle distributions, atomic packing, hydrogen bond geometries, and other geometric features. Ideal values or value distributions are derived from statistical analyses of high-resolution X-ray structures, and are generally consistent with basic principles of biophysical chemistry.
There has been some debate regarding the use of KB information derived from X-ray crystal structures of biomolecules in assessing solution NMR structures. There are often differences in the sample conditions used in determining NMR and X-ray structures, and particular conformations from the distribution present in solution may be selected by the requirements of the crystal lattice. Nevertheless, there is no cogent reason to adopt different KB parameter distributions for assessing solution or solid-state protein NMR structures with respect to those used to assess structures determined by X-ray crystallography. Problems indicated by KB assessments can also be mapped onto the 3D structure to identify local hot spots of structural inaccuracy [10,11].
The most general protein model assessment tools look at residue pair-distribution functions (e.g. PROSA2 [12]) or distributions of hydrophobic and hydrophilic residues (e.g. Verify3D [13]) which are characteristic of native protein structures. These analyses are important first steps in structure validation. However, their value is primarily in identifying severely incorrect folds [14], which rarely result from NMR structure determinations done with high restraint densities. These scores may, however, be important when assessing structures determined from sparse restraint networks. Non-globular protein folds (e.g. coil-coil structures) may also exhibit poor PROSA2 or Verify3D scores even when the models are accurate.
Dihedral angle distributions are the most prominent KB statistic used in assessing protein NMR structures. They are generally reported as Z-scores relative to distributions observed in high-resolution crystal structures, or across all structures that have been deposited in the wwPDB. Dihedral angle distributions are generally reported separately for amino-acid residue backbone and side chains. Backbone ϕ, ψ distributions are generally assessed based on compliance with the Ramachandran plot. Historically, the program ProCheck [15] has been used for this analysis, but recent work using a larger set of X-ray structures determined at high resolution suggests that more accurate assessments can be made using improved backbone ϕ, ψ distribution statistics [8,16]. The assessment of side chain dihedral angle distributions (also referred to as rotamer normality) can be more subtle. Protein side chains have been observed to largely adopt standard rotamer states (g-, t, g+) even when buried in the cores of protein crystal structures. While NMR data can in principle determine accurate side-chain conformations, surface side chains are often dynamically-averaged complicating the interpretation of the corresponding NMR data.
Several tools have been used to assess core atom packing, including the Molprobity [11] program for assessment of overpacking, and both the Molprobity and RosettaHoles [17] programs for assessment of underpacking. Severe atomic overlaps are rarely an issue in NMR structures, unless there are errors in the restraints, because of the use of lower-distance bounds. High-energy contacts can occur due to simplified treatments of van der Waals and hydrogen-bonded interactions, but are generally relieved by energy refinement. Underpacking, however, may be a more general problem for NMR-derived structures than generally appreciated, and methods like RosettaHoles [17] should be a key component of protein NMR structure assessment.
The role of KB-driven energy refinement in determining protein NMR structures, such as KB potentials or fragment libraries [18,19], is also somewhat controversial. In our opinion and experience, appropriate protocols for energy refinement, including the use of KB potentials and fragment libraries, can significantly improve the accuracy of NMR structures (see e.g., [1,3,20,21]), particularly for larger proteins determined with sparse restraint networks [4,5]. Approaches based on molecular dynamics can also often provide appreciable improvements in structure quality [22].
Model vs data measures
Model vs data (MvD) metrics describe how well the NMR structure model matches experimental data. MvD quality assessments include data that have been used in the structure generation process and, where feasible, cross-validation using data that have not been used in structure generation calculations.
NMR Restraint Analysis
The most general form of MvD validation involves comparison of distances and dihedral angles in models with the corresponding experimental restraints. Table 1 provides a standard format used to report such restraint violations [9,20], although other formats are also common in the NMR literature. These metrics provide an overview of global (or average) restraint violation statistics, as well as information on the most significant outliers. Clusters of restraint violations in regions of the 3D structure may indicate errors in the local structure. Methods have been described to convert between restraint formats [20,23,24], allowing initial restraint lists generated for one structure generation program to be used with alternative structure generation programs. The NOE completeness score [20,25] is a useful metric of structural accuracy, assessing the fraction of short distances in the model structure that are consistent with restraint data set.
Table 1.
Alr2454a | ||
---|---|---|
Completeness of resonance assignmentsb: | ||
Backbone (%) | 99.4 | |
Side chain (%) | 98.3 | |
Aromatic (%) | 96.6 | |
Stereospecific methyl (%) | 100 | |
Conformationally-restricting restraintsc: | ||
Distance restraints | ||
Total | 2478 | |
intra-residue (i = j) | 688 | |
sequential (i-j| = 1) | 619 | |
medium range (1 < |i - j| < 5) | 462 | |
long range (|i - j| ≥ 5) | 709 | |
Dihedral angle restraints | 162 | |
Hydrogen bond restraints | 0 | |
Disulfide bond restraints | 0 | |
No. of restraints per residue | 25.5 | |
No. of long range restraints per residue | 6.8 | |
Residual restraint violationsc: | ||
Average no. of distance violations per structure: | ||
0.1 - 0.2 Å | 8.75 | |
0.2 - 0.5 Å | 1.85 | |
> 0.5 Å | 0 | |
Largest distance violation (Å) | 0.35 | |
Average no. of dihedral angle violations per structure: | ||
1 - 10° | 8.75 | |
° 10° | 0 | |
Largest dihedral angle violation (°) | 3.8 | |
NOE Completeness Score | 0.692 | |
Model Qualityc: | ||
RMSD backbone atoms (Å)d | 0.6 | |
RMSD heavy atoms (Å)d | 0.9 | |
RMSD bond lengths (Å) | 0.018 | |
RMSD bond angles (°) | 1.1 | |
MolProbity Ramachandran statisticsc,d | ||
most favored regions (%) | 96.8 | |
allowed regions (%) | 3.1 | |
disallowed regions (%) | 0.1 | |
Global quality scores (Raw / Z-score)c | ||
Verify3D | 0.40 | −0.96 |
ProsaII | 0.66 | 0.04 |
ProCheck (phi-psi)d | −0.15 | −0.28 |
ProCheck (all)d | −0.03 | −0.18 |
MolProbity clash score | 12.51 | −0.62 |
RPF Scorese | ||
Recall / Precision | 0.976 | 0.934 |
F-measure / DP-score | 0.955 | 0.817 |
Model Contents: | ||
Ordered residue ranged | 1-100 | |
Total no. of residues | 108 | |
BMRB accession number: | 17965 | |
PDB ID: | 2LJW a |
Structural statistics computed for an ensemble of 20 structures.
Computed using AVS software [68] from the expected number of assignable resonances, excluding: highly exchangeable protons (N-terminal and Lys amino groups, Arg guanido groups, hydroxyls of Ser, Thr, Tyr), carboxyls of Asp and Glu, non-protonated aromatic carbons, and the C-terminal His6 tag. Methyl protons are counted as a single assignable resonance.
Calculated using PSVS 1.4 [10]. Ramachandran statistics were calculated by Molprobity [11]. Average distance violations were calculated using the sum over r−6 for degenerate protons and stereochemically-distinct protons lacking stereospecific assignments.
Based on “well defined” residue ranges [S(phi) + S(psi) > 1.8].
NOESY Data
Restraint analysis has the significant shortcoming that the restraints are themselves interpretations of NOESY and other NMR data. Accordingly, NMR structure quality assessment should also include some metrics validating models against uninterpreted spectral data. In the case of NOESY data, several methods have been developed for back-calculating NOESY spectral data (e.g. refs [26-28]), although to date none of these has come into general use. A more rapid, though approximate, approach is to compare models with unassigned NOESY peak lists. The RPF program [29] quantifies the agreement between the NOESY peak list, chemical shift assignments, and NMR models by calculating recall, precision, and F measures, as well as a normalized F-measure (called a Discriminating Power DP score). The normalized DP score is highly correlated with the accuracy of the NMR model [30,31].
RDC, Scalar Coupling, Paramagnetic, and SAXS Data
MvD metrics could also include assessment of scalar coupling, residual dipolar coupling (RDC), chemical shift anisotropy (CSA), paramagnetic resonance enhancement (PRE), paramagnetic pseudo-contact shift (PCS), solid-state dipolar coupling, and small angle X-ray or neutron scattering (SAXS or SANS ) data. Several tools for validating structures against these data are available, including methods for validation of protein structures against RDC data [32-34], CSA data [35] and SAXS data [36]. Residual dipolar coupling data provide information about the orientations of internuclear bond vectors with respect to the molecular orientation tensor, and hence provide long-range information including, for example, the relative orientations of secondary structure elements, such as helix tilt angles, which are sometimes inaccurate in protein NMR structures. The most commonly used MvD metric for these data is the RDC Q-factor [37]. Aromatic RDC's also have been found to provide an important approach for validating accuracy in core structures of proteins [38].
Free R Factors
In spite of extensive discussions and of various formulations that have been proposed [25,27-29,37,39], a cross-validation metric analogous to the free R-factor of X-ray crystallography has not yet been broadly adopted by the biological NMR community. This is attributable, at least in part, to the sparseness of NMR data, and the need to manually evaluate each individual peak in NOESY spectra when applying truly quantitative methods. The RDC free Q-factor [37] is potentially more accessible, as quantifying RDC data is generally more straight forward than the quantification of large numbers of NOESY peaks.
Chemical Shift Data
Chemical shift data have great potential for NMR structure validation. The most straightforward validation involves comparing experimental chemical shift data with values predicted from the 3D model structure [40-47]. Extensive chemical shift data must be obtained at the onset of any protein structure determination, and chemical shift data often used indirectly as restraints on dihedral angles [48]. Several recent papers have described important progress in calculating chemical shifts from molecular models using both empirical [49-52] and quantum chemical [44,47,53,54] approaches. However, systematic, large scale tests using chemical shifts as a standard metric for protein NMR structure validation are not yet available.
Which parts of the structure should be validated?
Protein NMR structures are generally represented by ensembles of conformers having the same level of agreement with the experimental data. Different regions of the structure are often converged to different degrees among conformers. In common practice, a distinction is made between well-defined and ill-defined (i.e., not-well-defined) regions [9,20]. In single-domain proteins, the backbone or all-heavy-atom root mean square deviation of coordinates (RMSD) computed after superposition of all the well-defined regions is taken as a measured of structure precision. Similarly, the per-residue RMSD is taken as a measure of local precision. Alternative measures of local or global precision include dihedral angle order parameters (DAOP) [55-57] and distance variance matrix methods [57-60] ( Fig. 1). Often, there is a good, albeit qualitative, correlation between local restraint density and local precision, implying that the ill-defined regions of the structure result from the experimental data providing insufficient information. This variability can be due to local protein dynamics (so that locally a single conformation in solution in fact does not exist) or to experimental factors limiting the information that can be extracted from spectra (e.g., extensive resonance degeneracy).
The precision of NMR structures indicated by the convergence across the ensemble of NMR conformers is operational, and does not provide a true representation of the Boltzmann distribution of conformations actually present in the NMR sample. Indeed, NMR structure ensembles do not even provide a statistically meaningful description of the true precision of coordinates given the experimental uncertainties in deriving distance restraints. For example, fast exchange may give rise to inconsistent restraints, which when simultaneously satisfied can pin the local conformational distribution into an unrealistically narrow range of conformations [61].
Because NMR experiments do not provide enough data to characterize them, the conformations observed in ill-defined regions largely result from of the combination of random initial conformations with the potential energy functions. Hence, these regions should not be included in global structure validation. However, even such ill-defined regions may be restrained to some degree by sparse experimental data, which may be biologically relevant. Ill-defined regions may also contribute to the back-calculation of NMR observables (e.g. NOESY peak lists). Hence, in our opinion all atoms for which experimental data are available (including only chemical shift data) should be included in the atomic coordinates that are deposited in the PDB, consistent with recommendations of the wwPDB NMR VTF [9].
Protein structure quality assessment servers
Recently, several on-line servers have become available which integrate both KB and MvD assessment metrics to provide a comprehensive NMR structure quality assessment report. Web-based tools described in Table 2 include the CiNG [62,63], Molprobity [11], PSVS [10,14], Vivaldi [64], and ResProx [43] servers. These, as well as other programs for NMR structure quality assessment, have been critically reviewed in a recent publication [57]. The wwPDB NMR VTF has also recommended metrics and standards for biomolecular structure quality assessment [8,9], and these guidelines are being used in developing software pipelines that will generate standardized structure quality reports for all NMR and X-ray crystal structures submitted to the wwPDB.
Table 2.
Server | url | Validation statistics provided |
---|---|---|
CING [62]v.1.0: Common Interface for NMR Structure Generation | http://nmr.cmbi.ru.nl/cing/Home.html | KB: Bond lengths and bond angles, backbone Ramachandran distributions, peptide omegas, packing, sidechain rotomer normalilty, disulfides, salt-bridges, chemical shift validations. Provides residue-specific ROC scores [69]. Uses DSSP [70,71], WHATIF [72], PROCHECK [15,73], and BMRB chemical shift validation. |
MvD: DOAP analysis. Restraint violation analysis, redundancy and duplicated restraints, back calculation of chemical shift from structure. Uses SHIFTX [49] and VASCO [74]. | ||
Molprobity [11] | http://molprobity.biochem.duke.edu/ | KB: Backbone Ramachandran distributions, peptide omegas, packing, H-bond satisfaction, sidechain rotomer normalilty. |
MvD: RDC analysis with RDCvis[75]. | ||
PSVS [10,14] v.1.5. Protein Structure Validation Software suite. | http://psvs.nesg.org/ | KB: Bond lengths and bond angles, backbone Ramachandran distributions, peptide omegas, packing, sidechain rotomer normality, chemical shift validation and completeness using AVS [68]. Provides Z scores relative to high-resolution crystal strutures for ProsaII [12], Verify3D [13], Procheck_bb [15,73], Procheck_all [15,73], and MolProbity [11]. Also uses DSSP [70,71], PDBStat [20], , LACS [76], and PDB Clash score (deposit.rcsb.org/validate). |
MvD: DAOP or Variance Matrix analysis. Restraint violation analysis, redundancy and duplicated restraints, NOE completeness, NOE DP scores, RDC Q scores, and GLM-RMSD [31] “equivalent resolution” score. Provides mapping of Ramachanran outliers and RPF violations onto 3D structure. Also uses PDBStat [20], FindCore [20,58,59], and RPF [29,30] software. | ||
ResProx [43] Resolution by Proxy | http://www.resprox.ca/ | KB: Assesses “equivalent resolution” based on 25 protein features including backbone Ramachandran distributions, peptide omegas, H-bond geometry, over and underpacking. Uses Vader [40], PROSESS [42], Molprobity [11], RosettaHoles [17], and GeNMR [41] software. |
KB: Bond lengths and bond angles, peptide omegas, chemical shift validations, backbone Ramachandran distributions, CING [62] ROC scores. Uses WHATIF [72], CING [62], and VASCO [74]. | ||
Vivaldi [64] v.1: Visualization and validation of biomacromolecular NMR structures from the PDB | http://www.ebi.ac.uk/pdbe/vivaldi/ | MvD: DAOP and Variance Matrix analysis. Restraint violation and RDC analysis. Uses NMRCore [77] |
Quality measures vs. structure accuracy
Satisfaction of KB metrics is a necessary but not sufficient criteria for validating the accuracy of protein NMR structures. This conclusion was illustrated by the recent Critical Assessment of Protein Structure Determination by NMR (CASD-NMR) [1,2], in which NMR data was publicly released, and the results of automated NOE assignment and structure generation methods were compared in a blind fashion with manually-refined protein NMR structures. In many cases, models with excellent energetics were in fact quite different from the manually-refined “gold standard” structures. However, energetic considerations can identify inaccurate restraint data, and thus guide the experimentalist to more accurate interpretations of the raw NMR data in terms of experimental restraints [20,21].
The CASD-NMR experiment also compared correlations between several KB and MvD metrics with structural accuracy. In this analysis, the DP score was observed to provide the highest correlation between submitted models and the manually-refined target structure (Fig. 2). Although the DP score has significant shortcomings to the extent that it uses an interpreted NOESY peak list rather than raw NOESY spectral data, these results demonstrate the importance of combining KB and MvD validation scores in structure quality assessment. For example, the DP score has recently been combined with several KB scores by linear regression analysis to provide an “equivalent resolution” metric [31,65].
Areas of current research
The trend towards determining larger biomolecular structures using sparse NMR data and hybrid methods incorporating small angle X-ray scattering, chemical cross links, and other data, requires increasing use of KB information and potential energy functions in the structure determination process. Some solid-state NMR structure determinations may also rely on relatively sparse networks of experimental data. These methods provide less independent information for use in cross-validation, creating important challenges to the field in terms of robust structure quality assessment.
Quality assessment considerations for NMR structures of nucleic acids and carbohydrates, while generally similar to those for proteins, has not yet been extensively investigated. Methods also need to be developed for generating Boltzmann ensembles of conformers that best satisfy the experimental data (e.g., [37,66]), rather than fitting all the data to a single conformer multiple times, as is the common practice. Ensemble-averaging interpretations are particularly important in highly dynamic regions of a biomolecular structure, and new methods are needed for quality assessment of the ensembles of models proposed for such disordered regions.
Conclusions
A high quality NMR-derived structure should meet minimal standards based on a wide range of KB and MvD validation assessment metrics. No single metric score is sufficient to validate a protein NMR structure. Users of biomolecular NMR structures also need to be informed about which parts of the structure are well-defined by the data, and which are less-reliably defined in terms of a unique structure. This information can be critical in interpreting structure-function relationships.
Structure determinations by NMR are often marginally underdetermined, and rely in either subtle or substantial ways on KB information, including at the very least standard values of bond lengths and bond angles. Although they are useful for biological studies, and in some cases quite accurate, the relatively low density of data constituting a typical protein NMR structure presents challenges to cross validation, using part of the data to generate the structure and another part of the data to validate the structure. In this regard, chemical shift data, which are available for most atoms reported in modern biomolecular NMR structures, hold promise for providing a robust and general approach for data-based protein structure quality assessments.
Highlights – Montelione.
- Model quality is assessed with Knowledge-Based (KB) and Model-vs-Data (MvD) metrics
- KB metrics compare structural features with the database of protein structures
- MvD metrics compare NMR-derived models against experimental NMR data
- Assessment requires annotation of well-defined regions the protein structure model
- NMR structure quality assessment provides new opportunities for methods development
Acknowledgements
Special thanks to J. Aramini, L. Ferella, Y.J. Huang, B. Mao, D. Snyder, and the members of the wwPDB NMR Validation Task Force for helpful scientific discussions. This work was supported by grants from the Protein Structure Initiative – Biology of the National Institutes of Health, Grant U54-GM094597 (to G.T.M), by the CONSOLIDER INGENIO CSD2010-00065 and Generalitat Valenciana PROMETEO 2011/008 (to R.T), and by the European Community FP7 e-Infrastructure “WeNMR” project, Grant 261572 (to A.R.).
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1••.Rosato A, Aramini JM, Arrowsmith C, Bagaria A, Baker D, Cavalli A, Doreleijers JF, Eletsky A, Giachetti A, Guerry P, Gutmanas A, et al. Blind testing of routine, fully automated determination of protein structures from nmr data. Structure. 2012;20(2):227–236. doi: 10.1016/j.str.2012.01.002. [This paper reports the first results of the CASD-NMR initiative. In this collaborative effort, raw data were provided to eight teams developing software for automated NMR structure determination, while the corresponding 3D coordinates were not released. An independent team then compared the automatically generated structures with the reference structure, manually determined from the same data. All automated NOESY-based methods performed well on all targets.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Rosato A, Bagaria A, Baker D, Bardiaux B, Cavalli A, Doreleijers JF, Giachetti A, Guerry P, Guntert P, Herrmann T, Huang YJ, et al. Casd-nmr: Critical assessment of automated structure determination by nmr. Nat Methods. 2009;6(9):625–626. doi: 10.1038/nmeth0909-625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3••.Mao B, Guan R, Montelione GT. Improved technologies now routinely provide protein nmr structures useful for molecular replacement. Structure. 2011;19(6):757–766. doi: 10.1016/j.str.2011.04.005. [This paper demonstrates that NMR structures determined and validated using modern methods provide coordinates which are sufficiently accurate to allow their routine use in determining phases of corresponding or homologous X-ray crystal structures by molecular replacement methods. In some cases, however, the accuracy of these NMR structures, as assessed by their phasing power, is improved by KB refinement with Rosetta.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4•.Lange OF, Rossi P, Sgourakis NG, Song Y, Lee HW, Aramini JM, Ertekin A, Xiao R, Acton TB, Montelione GT, Baker D. Determination of solution structures of proteins up to 40 kda using cs-rosetta with sparse nmr data from deuterated samples. Proc Natl Acad Sci U S A. 2012;109(27):10873–10878. doi: 10.1073/pnas.1203013109. [This paper demonstrates the use of KB methods, including fragment libraries, in determining accurate protein stuctures bu NMR, using perdeuterated, methyl protonated proteins, in the 20 - 40 kDa size range. Related results are presented in ref. 5.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5•.Raman S, Lange OF, Rossi P, Tyka M, Wang X, Aramini J, Liu G, Ramelot TA, Eletsky A, Szyperski T, Kennedy MA, et al. Nmr structure determination for larger proteins using backbone-only data. Science. 2010;327(5968):1014–1018. doi: 10.1126/science.1183649. [This paper demonstrates the use of KB methods, including fragment libraries, in determining accurate protein stuctures by NMR using perdeuterated proteins without methyl protonation in the 20 - 40 kDa size range. Related results are presented in ref. 4.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Hiller S, Wagner G. The role of solution nmr in the structure determinations of vdac-1 and other membrane proteins. Current opinion in structural biology. 2009;19(4):396–401. doi: 10.1016/j.sbi.2009.07.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hiller S, Garces RG, Malia TJ, Orekhov VY, Colombini M, Wagner G. Solution structure of the integral human membrane protein vdac-1 in detergent micelles. Science. 2008;321(5893):1206–1210. doi: 10.1126/science.1161302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8••.Read RJ, Adams PD, Arendall WB, 3rd, Brunger AT, Emsley P, Joosten RP, Kleywegt GJ, Krissinel EB, Lutteke T, Otwinowski Z, Perrakis A, et al. A new generation of crystallographic validation tools for the protein data bank. Structure. 2011;19(10):1395–1412. doi: 10.1016/j.str.2011.08.006. [An overview of the recommendations of wwPDB X-ray Crystallography Validation Task Force, providing guidelines for both KB and MvD validation of biomolecular strutures determined by X-ray crystallography. Recommendations are also outlinee for the presentation of these results in structure validation reports to be prepared by the wwPDB.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9••.Montelione GT, Nilges M, Bax A, Güntert P, Herrmann T, Richardson JS, Schwieters CD, Vranken WF, Vuister GW, Wishart DS, Berman HM, et al. Recommendations of the wwpdb nmr validation task force. Structure. 2013 doi: 10.1016/j.str.2013.07.021. In press. [An overview of consensus conclusions by the wwPDB NMR Validation Task Force providing recommendations for validation of protien NMR structures in three phases: (i) validation methods ready to implement immediately as part of a consensus NMR structure validation pipeline, (ii) validation methods which work well, but have not yet been broadly adopted by the biomolecular NMR community, and (iii) key issues of biomolecular struture validation which require additional research.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10••.Bhattacharya A, Tejero R, Montelione GT. Evaluating protein structures determined by structural genomics consortia. Proteins. 2007;66(4):778–795. doi: 10.1002/prot.21165. [This paper describes the PSVS serever, a popular tool for NMR structure validation. PSVS provides a extensive analysis of both KB and MvD validation metrics for protein NMR strutures, and a comprehensive structure validation report. Although the original paper was published in 2007, the server is updated regularly by the introduction of new validation metrics. See also ref 14 exploring the use of PSVS for structure quality assessment of homology models.] [DOI] [PubMed] [Google Scholar]
- 11••.Chen VB, Arendall WB, 3rd, Headd JJ, Keedy DA, Immormino RM, Kapral GJ, Murray LW, Richardson JS, Richardson DC. Molprobity: All-atom structure validation for macromolecular crystallography. Acta crystallographica Section D, Biological crystallography. 2010;66(Pt 1):12–21. doi: 10.1107/S0907444909042073. [Molprobity software is a key component of most protein NMR structure validation servers. It provides comprehensive KB validation of protein structures, and includes the RDCvis module (ref 75) for visual validation of protein NMR structures using RDC data.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Sippl MJ. Recognition of errors in three-dimensional structures of proteins. Proteins. 1993;17(4):355–362. doi: 10.1002/prot.340170404. [DOI] [PubMed] [Google Scholar]
- 13.Luthy R, Bowie JU, Eisenberg D: Assessment of protein models with three-dimensional profiles. Nature. 1992;356(6364):83–85. doi: 10.1038/356083a0. [DOI] [PubMed] [Google Scholar]
- 14.Bhattacharya A, Wunderlich Z, Monleon D, Tejero R, Montelione GT. Assessing model accuracy using the homology modeling automatically software. Proteins. 2008;70(1):105–118. doi: 10.1002/prot.21466. [DOI] [PubMed] [Google Scholar]
- 15.Laskowski RA, Rullmannn JA, MacArthur MW, Kaptein R, Thornton JM. Aqua and procheck-nmr: Programs for checking the quality of protein structures solved by nmr. J Biomol NMR. 1996;8(4):477–486. doi: 10.1007/BF00228148. [DOI] [PubMed] [Google Scholar]
- 16.Arendall WB, 3rd, Tempel W, Richardson JS, Zhou W, Wang S, Davis IW, Liu ZJ, Rose JP, Carson WM, Luo M, Richardson DC, et al. A test of enhancing model accuracy in high- throughput crystallography. Journal of structural and functional genomics. 2005;6(1):1–11. doi: 10.1007/s10969-005-3138-4. [DOI] [PubMed] [Google Scholar]
- 17.Sheffler W, Baker D. Rosettaholes: Rapid assessment of protein core packing for structure prediction, refinement, design, and validation. Protein science : a publication of the Protein Society. 2009;18(1):229–239. doi: 10.1002/pro.8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18••.van der Schot G, Zhang Z, Vernon R, Shen Y, Vranken WF, Baker D, Bonvin AM, Lange OF. Improving 3d structure prediction from chemical shift data. J Biomol NMR. 2013 doi: 10.1007/s10858-013-9762-6. [This work demonstrates a new and improved fragment picker and the iterative sampling algorithm RASREC to provide significant improvements in convergence and accuracy of the CS-Rosetta method (ref 19). They also introduce improved criteria for assessing the accuracy of the models produced by CS-Rosetta.] [DOI] [PubMed] [Google Scholar]
- 19.Shen Y, Lange O, Delaglio F, Rossi P, Aramini JM, Liu G, Eletsky A, Wu Y, Singarapu KK, Lemak A, Ignatchenko A, et al. Consistent blind protein structure generation from nmr chemical shift data. Proc Natl Acad Sci U S A. 2008;105(12):4685–4690. doi: 10.1073/pnas.0800256105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20•.Tejero R, Snyder D, Mao B, Aramini JM, Montelione GT. Pdbstat: A universal restraint converter and restraint analysis software package for protein nmr. J Biomol NMR. 2013 doi: 10.1007/s10858-013-9753-7. [PDBStat software provides a robust tool for MvD validations, including extensive restraint analysis and conversion utlilities The software can also be used to convert an initial restraint list into restraint lists for other software (e.g. CYANA restraints to Rosetta restraints), allowing validation by consensus structure generation with multiple software packages. PDBStat also provides MvD validation against RDC data.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ramelot TA, Raman S, Kuzin AP, Xiao R, Ma LC, Acton TB, Hunt JF, Montelione GT, Baker D, Kennedy MA. Improving nmr protein structure quality by rosetta refinement: A molecular replacement study. Proteins. 2009;75(1):147–167. doi: 10.1002/prot.22229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bertini I, Case DA, Ferella L, Giachetti A, Rosato A. A grid-enabled web portal for nmr structure refinement with amber. Bioinformatics. 2011;27(17):2384–2390. doi: 10.1093/bioinformatics/btr415. [DOI] [PubMed] [Google Scholar]
- 23.Nederveen AJ, Doreleijers JF, Vranken W, Miller Z, Spronk CA, Nabuurs SB, Guntert P, Livny M, Markley JL, Nilges M, Ulrich EL, et al. Recoord: A recalculated coordinate database of 500+ proteins from the pdb using restraints from the biomagresbank. Proteins. 2005;59(4):662–672. doi: 10.1002/prot.20408. [DOI] [PubMed] [Google Scholar]
- 24.Vranken WF, Boucher W, Stevens TJ, Fogh RH, Pajon A, Llinas M, Ulrich EL, Markley JL, Ionides J, Laue ED. The ccpn data model for nmr spectroscopy: Development of a software pipeline. Proteins. 2005;59(4):687–696. doi: 10.1002/prot.20449. [DOI] [PubMed] [Google Scholar]
- 25.Doreleijers JF, Raves ML, Rullmann T, Kaptein R. Completeness of noes in protein structure: A statistical analysis of nmr. J Biomol NMR. 1999;14(2):123–132. doi: 10.1023/a:1008335423527. [DOI] [PubMed] [Google Scholar]
- 26.Borgias BA, James TL. Two-dimensional nuclear overhauser effect: Complete relaxation matrix analysis. Methods in enzymology. 1989;176:169–183. doi: 10.1016/0076-6879(89)76011-0. [DOI] [PubMed] [Google Scholar]
- 27.Gronwald W, Kirchhofer R, Gorler A, Kremer W, Ganslmeier B, Neidig KP, Kalbitzer HR. Rfac, a program for automated nmr r-factor estimation. J Biomol NMR. 2000;17(2):137–151. doi: 10.1023/a:1008360715569. [DOI] [PubMed] [Google Scholar]
- 28.Zhu L, Dyson HJ, Wright PE. A noesy-hsqc simulation program, spirit. J Biomol NMR. 1998;11(1):17–29. doi: 10.1023/a:1008252526537. [DOI] [PubMed] [Google Scholar]
- 29.Huang YJ, Powers R, Montelione GT. Protein nmr recall, precision, and f-measure scores (rpf scores): Structure quality assessment measures based on information retrieval statistics. J Am Chem Soc. 2005;127(6):1665–1674. doi: 10.1021/ja047109h. [DOI] [PubMed] [Google Scholar]
- 30•.Huang YJ, Rosato A, Singh G, Montelione GT. Rpf: A quality assessment tool for protein nmr structures. Nucleic Acids Res. 2012;40(Web Server issue):W542–546. doi: 10.1093/nar/gks373. [This article describes a web server for the calculation of RPF scores for NMR structures. This MvD measure exploits unassigned NOESY data. One of the output measures, the DP-score, is an useful predictor of the accuracy of the query structure (see also ref 29).] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31•.Bagaria A, Jaravine V, Huang YJ, Montelione GT, Guntert P. Protein structure validation by generalized linear model root-mean-square deviation prediction. Protein science : a publication of the Protein Society. 2012;21(2):229–238. doi: 10.1002/pro.2007. [This article describes a predictor of the quality of protein structures that integrates a variety of different measures into a single “equivalent resolution” score. The authors demonstrate that this score provides an indication of structural accuracy, provided as the estimated RMSD from the true structure.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Clore GM, Robien MA, Gronenborn AM. Exploring the limits of precision and accuracy of protein structures determined by nuclear magnetic resonance spectroscopy. J Mol Biol. 1993;231(1):82–102. doi: 10.1006/jmbi.1993.1259. [DOI] [PubMed] [Google Scholar]
- 33.Valafar H, Prestegard JH. Redcat: A residual dipolar coupling analysis tool. J Magn Reson. 2004;167(2):228–241. doi: 10.1016/j.jmr.2003.12.012. [DOI] [PubMed] [Google Scholar]
- 34.Bryson M, Tian F, Prestegard JH, Valafar H. Redcraft: A tool for simultaneous characterization of protein backbone structure and motion from rdc data. J Magn Reson. 2008;191(2):322–334. doi: 10.1016/j.jmr.2008.01.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Cornilescu G, Marquardt JL, Ottiger M, Bax A. Validation of protein structure from anisotropic carbonyl chemical shifts in a dilute liquid crystalline phase. J Am Chem Soc. 1998;120:6836–6837. [Google Scholar]
- 36.Grishaev A, Wu J, Trewhella J, Bax A. Refinement of multidomain protein structures by combination of solution small-angle x-ray scattering and nmr data. J Am Chem Soc. 2005;127(47):16621–16628. doi: 10.1021/ja054342m. [DOI] [PubMed] [Google Scholar]
- 37.Clore GM, Schwieters CD. How much backbone motion in ubiquitin is required to account for dipolar coupling data measured in multiple alignment media as assessed by independent cross-validation? Journal of the American Chemical Society. 2004;126(9):2923–2938. doi: 10.1021/ja0386804. [DOI] [PubMed] [Google Scholar]
- 38•.Sathyamoorthy B, Singarapu KK, Garcia AE, Szyperski T. Protein conformational space populated in solution probed with aromatic residual dipolar 13C-1H couplings. Chembiochem. 2013;14(6):684–688. doi: 10.1002/cbic.201300016. [An innovative approach demonstrating the correlation of aromatic Tyr and Phe 13C-1H RDC measurements with protein structure accuracy. These data are relatively easy to measure and provide important information useful for either structure determination, refinement, or validation.] [DOI] [PubMed] [Google Scholar]
- 39.Brunger AT, Clore GM, Gronenborn AM, Saffrich R, Nilges M. Assessing the quality of solution nuclear magnetic resonance structures by complete cross-validation. Science. 1993;261(5119):328–331. doi: 10.1126/science.8332897. [DOI] [PubMed] [Google Scholar]
- 40.Willard L, Ranjan A, Zhang H, Monzavi H, Boyko RF, Sykes BD, Wishart DS. Vadar: A web server for quantitative evaluation of protein structure quality. Nucleic Acids Res. 2003;31(13):3316–3319. doi: 10.1093/nar/gkg565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Berjanskii M, Tang P, Liang J, Cruz JA, Zhou J, Zhou Y, Bassett E, MacDonell C, Lu P, Lin G, Wishart DS. Genmr: A web server for rapid nmr-based protein structure determination. Nucleic Acids Res. 2009;37(Web Server issue):W670–677. doi: 10.1093/nar/gkp280. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Berjanskii M, Liang Y, Zhou J, Tang P, Stothard P, Zhou Y, Cruz J, MacDonell C, Lin G, Lu P, Wishart DS. Prosess: A protein structure evaluation suite and server. Nucleic Acids Res. 2010;38(Web Server issue):W633–640. doi: 10.1093/nar/gkq375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43•.Berjanskii M, Zhou J, Liang Y, Lin G, Wishart DS. Resolution-by-proxy: A simple measure for assessing and comparing the overall quality of nmr protein structures. J Biomol NMR. 2012;53(3):167–180. doi: 10.1007/s10858-012-9637-2. [ResProx calculates equivalent X-ray crystal structure resolution of a protein NMR structure from 25 measurable protein features. It can be used to identify under-restrained, poorly refined or inaccurate NMR structures.] [DOI] [PubMed] [Google Scholar]
- 44••.Vila JA, Aramini JM, Rossi P, Kuzin A, Su M, Seetharaman J, Xiao R, Tong L, Montelione GT, Scheraga HA. Quantum chemical 13c(alpha) chemical shift calculations for protein nmr structure determination, refinement, and validation. Proc Natl Acad Sci U S A. 2008;105(38):14389–14394. doi: 10.1073/pnas.0807105105. [Quantum chemistry calculations by the program CheShift are used to predict 13Cα chemical shifts for various residue types, flanking sequences, and backbone ϕ,ψ values. The resulting web-accessible database provides a rapid approach for protein backbone structure quality assessment.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Martin OA, Villegas ME, Vila JA, Scheraga HA. Analysis of 13calpha and 13cbeta chemical shifts of cysteine and cystine residues in proteins: A quantum chemical approach. J Biomol NMR. 2010;46(3):217–225. doi: 10.1007/s10858-010-9396-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Vila JA, Serrano P, Wuthrich K, Scheraga HA. Sequential nearest-neighbor effects on computed 13calpha chemical shifts. J Biomol NMR. 2010;48(1):23–30. doi: 10.1007/s10858-010-9435-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47•.Martin OA, Vila JA, Scheraga HA. Cheshift-2: Graphic validation of protein structures. Bioinformatics. 2012;28(11):1538–1539. doi: 10.1093/bioinformatics/bts179. [CheShift-2 provides a graphical user interface to identify possible flaws in an NMR structure based on quantum mechanical computation of 13Cα shifts. The shifts are calculated as averages over all members of the ensemble, and their average compared to the experimental data.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48••.Shen Y, Bax A. Sparta+: A modest improvement in empirical nmr chemical shift prediction by means of an artificial neural network. J Biomol NMR. 2010;48(1):13–22. doi: 10.1007/s10858-010-9433-9. [Together with ShiftX2, this is one of the most commonly used and reliable mehods for NMR chemical shift predictions.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49•.Han B, Liu Y, Ginzinger SW, Wishart DS. Shiftx2: Significantly improved protein chemical shift prediction. J Biomol NMR. 2011;50(1):43–57. doi: 10.1007/s10858-011-9478-4. [Together with Sparta+, this is one of the most commonly used and reliable mehods for NMR chemical shift predictions. See also ref 50.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Wang B, Wang Y, Wishart DS. A probabilistic approach for validating protein nmr chemical shift assignments. J Biomol NMR. 2010;47(2):85–99. doi: 10.1007/s10858-010-9407-y. [DOI] [PubMed] [Google Scholar]
- 51•.Sahakyan AB, Vranken WF, Cavalli A, Vendruscolo M. Using side-chain aromatic proton chemical shifts for a quantitative analysis of protein structures. Angew Chem Int Ed Engl. 2011;50(41):9620–9623. doi: 10.1002/anie.201101641. [The authors demonstrate the use of aromatic chemical shift data in distinguishing correct structural models from incorrect ones. They also explore its use to detect differences caused by cofactor or ligand binding, or by sequence alterations between structures.] [DOI] [PubMed] [Google Scholar]
- 52••.Sahakyan AB, Vranken WF, Cavalli A, Vendruscolo M. Structure-based prediction of methyl chemical shifts in proteins. J Biomol NMR. 2011;50(4):331–346. doi: 10.1007/s10858-011-9524-2. [This paper describes the CH3Shift method of performing structure-based predictions of methyl chemical shifts. The method achieves high accuracy in the predictions of 1H chemical shifts for Ala, Thr, Val, Leu and Ile methyl groups. This approach is useful both for assignment of methyl groups from 3D protein structure, and for validation of 3D protein structures, particularly those determined using perdeuterated, methyl protonated samples.] [DOI] [PubMed] [Google Scholar]
- 53.Vila JA, Sue SC, Fraser JS, Scheraga HA, Dyson HJ. Cheshift-2 resolves a local inconsistency between two x-ray crystal structures. J Biomol NMR. 2012;54(2):193–198. doi: 10.1007/s10858-012-9663-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Vila JA, Arnautova YA, Vorobjev Y, Scheraga HA. Assessing the fractions of tautomeric forms of the imidazole ring of histidine in proteins as a function of ph. Proc Natl Acad Sci U S A. 2011;108(14):5602–5607. doi: 10.1073/pnas.1102373108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Hyberts SG, Goldberg MS, Havel TF, Wagner G. The solution structure of eglin c based on measurements of many noes and coupling constants and its comparison with x-ray structures. Protein Sci. 1992;1(6):736–751. doi: 10.1002/pro.5560010606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56•.Kirchner DK, Guntert P. Objective identification of residue ranges for the superposition of protein structures. BMC Bioinformatics. 2011;12:170. doi: 10.1186/1471-2105-12-170. [The CYRANGE method for the identification of well-ordered regions in NMR structures combines different aspects of approaches based on both dihedral angle order parameters and distance matrices, and integrates them with new ideas. Overall, the aim is to define residue ranges that contain few gaps and cover most of the sequence. This method has been adopted by the wwNMR VTF as a standard for defining well-defined residue ranges for quality assessment.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57••.Vuister GW, Fogh RH, Hendrickx PM, Doreleijers JF, Gutmanas A. An overview of tools for the validation of protein nmr structures. J Biomol NMR. 2013 doi: 10.1007/s10858-013-9750-x. [This review reports a comparison of the application of nine different validation tools on two example NMR structures. Interestingly, the same property as analyzed by different tools is not always consistently assessed as being good or bad.] [DOI] [PubMed] [Google Scholar]
- 58.Snyder DA, Montelione GT. Clustering algorithms for identifying core atom sets and for assessing the precision of protein structure ensembles. Proteins. 2005;59(4):673–686. doi: 10.1002/prot.20402. [DOI] [PubMed] [Google Scholar]
- 59••.Snyder DA, Grullon J, Huang YJ, Tejero R, Montelione GT. The expanded findcore method for identification of a core atom set for assessment of protein structure prediction. Proteins. 2013 doi: 10.1002/prot.24490. [In review. An extension of the variane matrix method of ref 58 for annotating well-defined vs ill-defined atoms in an NMR ensemble. FindCore also provides criteria for identifying regions of the protein structure that are internally well-defined, but not well-defined with respect to one another, guiding the independent superimposition of these regions for RNSD calculations.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Andrec M, Snyder DA, Zhou Z, Young J, Montelione GT, Levy RM. A large data set comparison of protein structures determined by crystallography and nmr: Statistical test for structural differences and the effect of crystal packing. Proteins. 2007;69(3):449–465. doi: 10.1002/prot.21507. [DOI] [PubMed] [Google Scholar]
- 61.Tejero R, Bassolino-Klimas D, Bruccoleri RE, Montelione GT. Simulated annealing with restrained molecular dynamics using congen: Energy refinement of the nmr solution structures of epidermal and type-alpha transforming growth factors. Protein Sci. 1996;5(4):578–592. doi: 10.1002/pro.5560050403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62••.Doreleijers JF, Sousa da Silva AW, Krieger E, Nabuurs SB, Spronk CA, Stevens TJ, Vranken WF, Vriend G, Vuister GW. CiNG: An integrated residue-based structure validation program suite. J Biomol NMR. 2012;54(3):267–283. doi: 10.1007/s10858-012-9669-7. [CiNG is a powerful web-based tool for NMR structure validation. CiNG provides a extensive analysis of both KB and MvD parameters, and provide residue-specific quality scores, ROC scores, which are useful for protein structure validation and refinement.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63•.Doreleijers JF, Vranken WF, Schulte C, Markley JL, Ulrich EL, Vriend G, Vuister GW. NRG-CiNG: Integrated validation reports of remediated experimental biomolecular nmr data and coordinates in wwpdb. Nucleic Acids Res. 2012;40(Database issue):519–524. doi: 10.1093/nar/gkr1134. [NRG-CiNG contains validation reports by CiNG for all NMR structures in the PDB, based on remediated, i.e. corrected for a variety of possible errors or inconsistencies, NMR restraints.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64•.Hendrickx PM, Gutmanas A, Kleywegt GJ. Vivaldi: Visualization and validation of biomacromolecular nmr structures from the pdb. Proteins. 2013;81(4):583–591. doi: 10.1002/prot.24213. [Vivaldi provides a web-based graphical interface to interactively visualize and analyze NMR structures and the corresponding restraints that is directly embedded in the PDBe. Vivaldi also permits the analysis of RDC data.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Bagaria A, Jaravine V, Guntert P. Estimating structure quality trends in the protein data bank by equivalent resolution. Comput Biol Chem. 2013;46C:8–15. doi: 10.1016/j.compbiolchem.2013.04.004. [DOI] [PubMed] [Google Scholar]
- 66.Lindorff-Larsen K, Best RB, Depristo MA, Dobson CM, Vendruscolo M. Simultaneous determination of protein structure and dynamics. Nature. 2005;433(7022):128–132. doi: 10.1038/nature03199. [DOI] [PubMed] [Google Scholar]
- 67.Aramini JM, Petrey D, Lee DY, Janjua H, Xiao R, Acton TB, Everett JK, Montelione GT. Solution nmr structure of alr2454 from nostoc sp. Pcc 7120, the first structural representative of pfam domain family pf11267. J Struct Funct Genomics. 2012;13(3):171–176. doi: 10.1007/s10969-012-9135-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Moseley HN, Sahota G, Montelione GT. Assignment validation software suite for the evaluation and presentation of protein resonance assignment data. J Biomol NMR. 2004;28(4):341–355. doi: 10.1023/B:JNMR.0000015420.44364.06. [DOI] [PubMed] [Google Scholar]
- 69.Doreleijers JF, Sousa da Silva AW, Krieger E, Nabuurs SB, Spronk CA, Stevens TJ, Vranken WF, Vriend G, Vuister GW. Cing: An integrated residue-based structure validation program suite. J Biomol NMR. 2012 doi: 10.1007/s10858-012-9669-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Kabsch W, Sander C. Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22(12):2577–2637. doi: 10.1002/bip.360221211. [DOI] [PubMed] [Google Scholar]
- 71.Joosten RP, te Beek TA, Krieger E, Hekkelman ML, Hooft RW, Schneider R, Sander C, Vriend G. A series of pdb related databases for everyday needs. Nucleic Acids Res. 2011;39(Database issue):D411–419. doi: 10.1093/nar/gkq1105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Vriend G. What if: A molecular modeling and drug design program. J Mol Graph. 1990;8(1):52–56. 29. doi: 10.1016/0263-7855(90)80070-v. [DOI] [PubMed] [Google Scholar]
- 73.Laskowski RA, MacArthur MW, Moss DS, Thornton JM. Procheck: A program to check the stereochemical quality of protein structures. Journal of Applied Crystallography. 1993;26(2):283–291. [Google Scholar]
- 74.Rieping W, Vranken WF. Validation of archived chemical shifts through atomic coordinates. Proteins. 2010;78(11):2482–2489. doi: 10.1002/prot.22756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Block JN, Zielinski DJ, Chen VB, Davis IW, Vinson EC, Brady R, Richardso JS, Richardson DC. Kinimmerse: Macromolecular vr for nmr ensembles. Source Code Biol Medicine. 2009;4:3. doi: 10.1186/1751-0473-4-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Wang L, Eghbalnia HR, Bahrami A, Markley JL. Linear analysis of carbon-13 chemical shift differences and its application to the detection and correction of errors in referencing and spin system identifications. J Biomol NMR. 2005;32(1):13–22. doi: 10.1007/s10858-005-1717-0. [DOI] [PubMed] [Google Scholar]
- 77.Kelley LA, Gardner SP, Sutcliffe MJ. An automated approach for defining core atoms and domains in an ensemble of nmr-derived protein structures. Protein Eng. 1997;10(6):737–741. doi: 10.1093/protein/10.6.737. [DOI] [PubMed] [Google Scholar]