On average, the mother liquor or solvent and its constituents occupy about 50% of a macromolecular crystal. Ordered as well as disordered solvent components need to be accurately accounted for in modelling and refinement, often with considerable complexity.
Keywords: macromolecular crystals, solvent content, bulk solvent, ordered solvent
Abstract
The mother liquor from which a biomolecular crystal is grown will contain water, buffer molecules, native ligands and cofactors, crystallization precipitants and additives, various metal ions, and often small-molecule ligands or inhibitors. On average, about half the volume of a biomolecular crystal consists of this mother liquor, whose components form the disordered bulk solvent. Its scattering contributions can be exploited in initial phasing and must be included in crystal structure refinement as a bulk-solvent model. Concomitantly, distinct electron density originating from ordered solvent components must be correctly identified and represented as part of the atomic crystal structure model. Herein, are reviewed (i) probabilistic bulk-solvent content estimates, (ii) the use of bulk-solvent density modification in phase improvement, (iii) bulk-solvent models and refinement of bulk-solvent contributions and (iv) modelling and validation of ordered solvent constituents. A brief summary is provided of current tools for bulk-solvent analysis and refinement, as well as of modelling, refinement and analysis of ordered solvent components, including small-molecule ligands.
1. The solvent in macromolecular crystals
Crystals of proteins and other biological materials almost always grow in a drop of aqueous mother liquor containing a variety of reagents which promote the self-assembly of the irregularly shaped macromolecules into an ordered and regular crystal structure. This self-assembly during crystallogenesis leads to an ordered network of molecules connected by few, but specific, weak noncovalent intermolecular interactions (Fig. 1 ▶). Given the irregular shape of protein molecules, it is not surprising that when they self-assemble into a crystal, substantial intermolecular space (on average approximately 50% of the crystal volume; Matthews, 1968 ▶) remains between the molecules, and it is filled with the mother liquor or crystallization cocktail from which the crystal grew. Much of this medium in the intermolecular space is disordered bulk solvent, and its content estimation and treatment in modelling and refinement are discussed in §2.
The high solvent content in macromolecular crystals compared with solid-sphere close packing (where approximately 26% of the space is not occupied by spheres) on one hand limits the physical-mechanical stability of the crystals, but on the other does allow ions or small molecules to enter an already formed crystal through its solvent channels. Given that some components or additives of the mother liquor are chosen on the basis of their ability to mediate crystal contacts, it is also reasonable to expect that, in addition to ubiquitous water molecules, some of these special solvent components will form an integral part of the crystal structure. As ordered solvent molecules and ions, these components will exhibit distinctly visible electron density which must be modelled, refined and validated, as outlined in §3.
The analysis of crystal solvent content thus needs to consider a minimum of two major contributions to the data and the model: (i) the properties of disordered bulk solvent and (ii) the ordered solvent components displaying distinct electron density. A third intermediate region, the semi-ordered and inhomogeneous solvent, which often contains biologically relevant compounds such as lipids or membrane components, is not adequately described at present, which may be a contributing factor to the large gap observed between refinement R values and data quality (Holton et al., 2014 ▶).
For modelling and refinement, the simple binary and discontinuous ‘coordinates plus bulk solvent’ model does work reasonably well, although the dynamic nature of the macromolecule–solvent interface, as well as nonhomogenous solvent density, provide opportunities for methodological advances and better understanding of this biologically highly relevant and active interface region. In this review, we examine how to (i) estimate the overall solvent content of a macromolecular crystal, (ii) use this information to our advantage (for example in phase improvement), (iii) account for and model disordered bulk solvent and (iv) properly identify and model the distinct electron (or nuclear, in the case of neutron diffraction) density of ordered solvent molecules.
2. Bulk solvent
Shortly after about 100 crystal forms of globular proteins became available in the late 1960s, Matthews analysed the fractional volume of solvent in protein crystals (Matthews, 1968 ▶). His original estimate that on average close to half of the unit-cell volume is filled with mother liquor or solvent remains valid today. While the distribution of the solvent content peaks at around 50% of the crystal volume, the distribution is relatively broad, ranging from approximately the close-packing limit of 26% for spheres up to almost 90%, and in extreme cases even exceeding these limits in both directions; see Weichenberger & Rupp (2014 ▶) and Fig. 2 ▶. Knowing the solvent content of a biomolecular crystal generates useful information during the initial analysis of diffraction data (§2.1), and this knowledge is subsequently valuable in the phasing stage (§2.2 and §2.3). Proper modelling of the bulk-solvent contributions to the diffraction pattern is necessary for accurate reciprocal-space refinement of a crystal structure (§2.4).
2.1. Estimating the unit-cell content
The first step in the process of structure determination is almost always the estimation of the molecular unit-cell content, which can be performed as soon as the unit-cell dimensions and possible lattice types have been determined from indexed diffraction data. Solvent-content analysis often informs the choice of internal symmetry and point-group symmetry. Not only can the numbers of possible molecular entities fitting in the asymmetric unit cell be estimated, but highly improbable values of these numbers can indicate problems such as the presence of twinning, pseudo-symmetry or incorrect point (space) group assignment (Zwart et al., 2008 ▶). An accurate estimate of the solvent content is an important parameter in density-modification techniques (Hoppe & Gassmann, 1968 ▶; Barrett & Zwick, 1971 ▶; Bricogne, 1974 ▶; Zhang & Main, 1990 ▶; Podjarny et al., 1996 ▶; Zhang et al., 2001 ▶; Cowtan, 2010 ▶). Density modification is crucial in breaking the phase-angle ambiguity in single-wavelength anomalous diffraction phasing (Wang, 1985 ▶; Mueller-Dieckmann et al., 2007 ▶), for phase improvement (Abrahams & Leslie, 1996 ▶) and for phase extension (Sheldrick, 2010 ▶).
2.1.1. Matthews coefficient
Based on an analysis of 116 different crystal forms of globular proteins (Matthews, 1968 ▶, 1976 ▶), Matthews observed that the fraction of the crystal volume occupied by solvent ranged from 27 to 78%, with the most common value being about 43%. Matthews further defined V M, known as the Matthews coefficient, as the crystal (asymmetric unit) volume V A per unit of protein molecular weight, M,
and showed that V M bears a straightforward relationship to the fractional volume of solvent V S in the crystal,
where the dimensions of the conversion factor 1.230 are Å−3 Da. The derivation of (2) was not explicitly provided in Matthews’ original publications, but it is elaborated in Weichenberger & Rupp (2014 ▶) together with a more complete compilation of historic and current overall solvent-content data.
2.1.2. Matthews probabilities: solvent content affects resolution
Matthews realised that a relationship between the solvent content and the resolution of the diffraction data is plausible: tightly packed crystals with more contacts between individual molecules should be more stable and more ordered and possibly diffract better. Matthews’ intuitive prediction has been statistically verified using larger protein data sets, and a conditional probabilistic estimate for possible unit-cell contents as a function of resolution, termed the Matthews probability (MP), has been developed (Kantardjieff & Rupp, 2003 ▶). The MATTPROB web applet, the corresponding probability distribution function and its parameters for cumulative resolution bins have been provided to the crystallographic community, and the original MP estimator has been implemented in some form in the major crystallographic structure-determination packages [MATTHEWS_COEF in CCP4 (Winn et al., 2011 ▶), Phaser (McCoy et al., 2007 ▶) and PHENIX (Adams et al., 2010 ▶)]. A recent update of MATTPROB (Weichenberger & Rupp, 2014 ▶) employs a nonparametric kernel density estimator (http://www.ruppweb.org/mattprob/) which, by calculating the Matthews probabilities directly from empirical solvent-content data, avoids the need to revise the multiple parameters of the original binned empirical fit function presented in Kantardjieff & Rupp (2003 ▶). This updated analysis further reinforced the idea that solvent content and resolution are highly correlated. Modifications of the specific density for low-molecular-weight proteins (Quillin & Matthews, 2000 ▶; Fischer et al., 2004 ▶) have no practical effect on solvent-content prediction, and there is no correlation with oligomerization state (Chruszcz et al., 2008 ▶; Weichenberger & Rupp, 2014 ▶). While a weak, in practice irrelevant, dependence on symmetry and molecular weight was found, it cannot be satisfactorily explained using simple linear or categorical models.
A distinct feature of the Matthews probability is the implicit assumption of a Bayesian prior that the observed resolution represents only an empirical lower limit for the resolution. In other words, a crystal diffracted under certain experimental circumstances to at least the reported resolution, but in principle it could have diffracted better. The assumption of this Bayesian prior is also compatible with the fact that the distribution of reported resolution cutoffs is distinctly skewed towards cutoff values higher than the common mean I/σ(I) of 2.0 (Weichenberger & Rupp, 2014 ▶; Luo et al., 2014 ▶). At present, no agreement on objective criteria exists for the nontrivial selection of a resolution cutoff for model refinement, but it seems that commonly applied resolution cutoffs in fact under-report the actual diffraction potential of the crystals (Diederichs & Karplus, 2013 ▶; Luo et al., 2014 ▶).
2.2. Solvent in phasing: electron-density modification and masking
Disordered bulk solvent constitutes a rather substantial amount of scattering matter in a macromolecular crystal. Despite the bulk solvent not being structured, its average electron density in a crystal is still periodic and therefore it contributes to the Bragg reflections. The resulting bulk-solvent contribution to the structure factors is not the same across the entire resolution range, but varies from almost negligible at high resolution to quite significant at resolutions lower than approximately 6–8 Å (Fig. 3 ▶). The effect of bulk-solvent contributions on observed structure-factor amplitudes and the possibility of deriving the approximate molecular shape from their systematic change with bulk-solvent density was recognized and used by Bragg & Perutz (1952 ▶).
One of the most significant benefits of the large solvent content in protein crystals is that the difference between protein electron density and that of the surrounding bulk solvent provides the basis for exceptionally powerful phase-improvement techniques. In addition to handedness ambiguity of the heavy-atom solution, single anomalous diffraction (SAD) phasing experiments (Dauter et al., 2002 ▶; Mueller-Dieckmann et al., 2007 ▶; Read & McCoy, 2011 ▶) often have poor (centroid) starting phases as a consequence of the inherent phase-angle ambiguity. Such poor starting phases would not be routinely useable without the benefit of density modification for phase improvement and phase extension. Such methods were pioneered by Wang (1985 ▶), have been steadily improved (see, for example, Bricogne, 1984 ▶; Abrahams & Leslie, 1996 ▶; Podjarny & Urzhumtsev, 1997 ▶; Terwilliger, 2000 ▶; Zhang et al., 2001 ▶; Cowtan, 2010 ▶) and have been implemented in various forms in almost all macromolecular crystallographic structure-solution (i.e. phasing) software.
2.3. Delineation of atomic model and bulk solvent
The idea behind most real-space-based phase-improvement methods (a.k.a. density modification) is in principle simple: make a (poor) initial electron-density map look more like a ‘real’ protein electron-density map1. A high solvent content is beneficial for density modification (phase improvement), because the description of a large bulk-solvent area as flat, continuous electron density is in fact a reasonable approximation of reality. A high solvent content therefore means that a large part of the crystal content is well described, and this is knowledge that comes into our hands essentially for free. Solvent flattening-based density-modification methods generally work better with higher solvent contents and perform poorly below 30%, where the solvent content essentially approaches the void volume of close-packed spheres, and the ratio of correctly described flat solvent versus poorly described partially ordered transition area decreases. In addition, high solvent contents benefit density modification following SAD phasing because the map noise (or when the substructure is centrosymmetric, the inverted image of the structure; see Table 10-2 in Rupp, 2009 ▶) can be better distinguished from the correct electron density. The various density-modification and phase-improvement methods have been reviewed, for example, in Podjarny et al. (1996 ▶), Rupp (2009 ▶) and Cowtan (2010 ▶) and in additional references provided below. The role and application of density modification and density averaging in the ab initio phasing of virus crystal structures has been reviewed separately by McPherson & Larson (2015 ▶).
2.3.1. Solvent flattening
Common to the real-space-based methods is that a delineation is made between what is believed to be solvent and the parts of the map which might represent the protein model. In solvent flattening, a solvent mask (see §2.4.3) is generated within which the density is constant and belongs to bulk solvent (0.33 e Å−3 for pure water), and outside the mask protein is assumed (which has a higher average density of approximately 0.44 e Å−3). The phases generated from map inversion with the solvent region set to the flat density value then are used to improve the phases of the entire electron-density map, the solvent mask is updated and the process is iteratively repeated until convergence (Bricogne, 1974 ▶; Kleywegt & Read, 1997 ▶; Podjarny et al., 1987 ▶).
2.3.2. Solvent flipping
The related technique of solvent flipping (Abrahams & Leslie, 1996 ▶) does not set the density values in a solvent region to a constant low value, but rather flips (i.e. changes the sign of) the grid point (or density voxel) values in the solvent region. Flipping the solvent density introduces independence between the partial maps, and thus allows unbiased phase probability combination. The powerful solvent-flipping procedure has been implemented, for example, in the program SOLOMON available in CCP4 (Winn et al., 2011 ▶) and as part of AutoSHARP (Vonrhein et al., 2007 ▶) and PHENIX (Adams et al., 2010 ▶).
2.3.3. Sphere of influence
Flipping of density voxels that are unlikely to represent protein-atom sites is also an essential component of the SHELXE Sphere of Influence (SoI) algorithm (Sheldrick, 2010 ▶). In contrast to the binary methods that essentially integrate over the volume of a Wang sphere, SoI calculates the variance of the density on the surface of a 2.42 Å sphere. The reason for this choice of radius is to maintain some plausible chemical information: 2.42 Å is in fact the dominant 1–3 atom distance in proteins and nucleic acids. The covered density region is then split into solvent, macromolecule and a crossover region (Sheldrick, 2002 ▶). The variable (fuzzy) crossover region prevents the solvent mask from becoming locked in owing to an incorrect initial solvent-content estimate. Furthermore, the fuzzy region allows a smooth transition from solvent to protein. The final phase combination uses σA-weighted maximum-likelihood coefficients (Read, 1986 ▶), which reduce the partial bias from the parts of the map assigned as protein. The procedure is then repeated until convergence.
2.3.4. Boundary-region complexity
The dynamic boundary layer surrounding a macromolecule in its solvent environment is in all likelihood incompletely described by a discontinuous binary transition in a ‘coordinates plus bulk solvent’ model. Efforts to use a smoothly changing continuum boundary (Fenn et al., 2010 ▶) have led to algorithmic advances, but no significant improvements in R values have been achieved. The indications are that the large gap between the R values observed in refinement of macromolecular structures at typical resolutions (R values of ∼20–30%) and the precision with which data are measured (R merge of ∼4–7% on I) may originate at least partially from an inadequate description of the true electron density (Holton et al., 2014 ▶). Because the zone between the dynamically moving bulk solvent and the macromolecule is also the area where interesting interactions actually take place, one can expect that better modelling of and accounting for this boundary region will also improve the understanding of the associated biological phenomena.
2.4. Bulk-solvent refinement
The bulk-solvent contribution to Bragg reflections originating from periodic bulk solvent is not the same across the entire resolution range, but instead varies from almost negligible at high resolution to rather strong at low resolution (Fig. 3 ▶). At low resolution, the contributions of the atomic model and bulk solvent at low resolution are comparable in magnitude but are opposite in phase. As a consequence, uncorrected model-only calculated structure factors are too large, and correspondingly higher R values result upon refinement without bulk-solvent correction; see the figures in Kostrewa (1997 ▶), Fokine & Urzhumtsev (2002b ▶) and Afonine & Adams (2012 ▶).
For the first few decades of protein crystallography, it was common practice to truncate the low-resolution data in refinement simply owing to the lack of a proper bulk-solvent model. Unfortunately, missing data, especially high-intensity low-resolution reflections (sometimes as few as 5%) can significantly deteriorate the quality of crystallographic maps (Lunin, 1988 ▶; Urzhumtsev et al., 1989 ▶; Cowtan, 1996 ▶; Urzhumtseva & Urzhumtsev, 2011 ▶), adversely impact crystallographic structure refinement (Kostrewa, 1997 ▶) and limit the success of molecular replacement (Fokine & Urzhumtsev, 2002a ▶).
2.4.1. Bulk-solvent models
Because reciprocal-space refinement of a crystal structure model minimizes a likelihood target function derived from a sum of squared residuals between observed (F obs) and calculated model structure-factor amplitudes (F model), the bulk-solvent contribution F bulk needs to be accounted for during the refinement of a crystal structure. For the purpose of reciprocal-space refinement, a physical model describing the entire crystal structure is needed, from which in turn model structure factors can be calculated. Such a model contains in general (i) atomic model parameters (parameters that describe coordinates, displacements etc. of specific atoms) and (ii) non-atomic model parameters that describe everything else such as bulk-solvent contributions, twinning fractions, various scales, overall anisotropy etc.). The complete model of a crystal structure in reciprocal space can then be represented in a generalized form by the total model structure factor (F model) consisting of contributions from the individual atoms in the model (F atoms) and the disordered bulk solvent (F bulk) (Moews & Kretsinger, 1975 ▶; Afonine et al., 2012 ▶),
with the corresponding scales and components determined as described, for example, by Afonine et al. (2013 ▶) and Murshudov et al. (2011 ▶).
Currently, two major bulk-solvent models are in use in macromolecular crystallographic studies to calculate F bulk(h):
(i) the exponential (Babinet) bulk-solvent model based on Babinet’s principle, where a solvent contribution proportional but with the opposite phase of the protein contributions (that is with a negative sign) is added to the atomic model scattering contributions (Moews & Kretsinger, 1975 ▶; Tronrud, 1997 ▶); and
(ii) the mask-based, flat bulk-solvent model, where the solvent contribution is accounted for as a partial structure contribution from a homogeneous, masked bulk-solvent region (Phillips, 1980 ▶; Jiang & Brünger, 1994 ▶). Although earlier refinement programs such as EREF (Jack & Levitt, 1978 ▶) and early versions of TNT (Tronrud, 1997 ▶) included masked bulk model options, they were nontrivial to use and were computationally very expensive at the time.
2.4.2. Exponential (Babinet) bulk-solvent model
Because Babinet’s principle (see Sidebar 11-7 in Rupp, 2009 ▶) only holds for uniformly scattering objects, the exponential bulk-solvent model2
is strictly valid only for the correction of low-resolution reflections, but it is still effective to about 6 Å (Tronrud, 1997 ▶; Podjarny & Urzhumtsev, 1997 ▶; Glykos & Kokkinidis, 2000 ▶). In the exponential model, the only adjustable parameters are k sol, the so-called contrast, a term first coined by Bragg & Perutz (1952 ▶) and defined as the ratio of the mean solvent electron density and mean protein electron density (∼0.76), and B sol, a high attenuation factor (∼125–200 Å2) affecting the extent of the bulk-solvent contribution. As the highly attenuated solvent contribution is subtracted from the protein structure factors, the exponential model is most effective at low resolution. The Babinet model does not need a mask, and Babinet scaling can therefore be applied to compute improved scale factors before molecular-replacement searches, as implemented, for example, via improved σA estimates in the molecular-replacement likelihood functions in Phaser (McCoy et al., 2007 ▶; McCoy, 2007 ▶).
The Babinet-based model is available in the refinement programs SHELXL (Sheldrick, 2008 ▶) and BUSTER-TNT (Tronrud, 1997 ▶; Blanc et al., 2004 ▶) and in REFMAC, where it also can be used in combination with the flat solvent model (Murshudov et al., 2011 ▶).
2.4.3. Flat (masked) bulk-solvent model
The flat or masked solvent model
or more generally, as described in Moews & Kretsinger (1975 ▶) and Afonine et al. (2013 ▶),
is presently the default choice to account for bulk solvent in macromolecular crystallographic studies. F mask is the contribution from a homogeneous, masked bulk-solvent region. The physical meaning of the bulk-solvent parameter k mask in the flat mask-based model is different from the interpretation of k sol in the Babinet model (§2.4.2): in the flat mask model k mask is the mean solvent electron density in e Å−3, while the Babinet k sol is the contrast (ratio) between solvent and protein electron density. Both B sol and B mask are smearing (attenuation) factors in Å2 which describe how deeply (on average) the bulk-solvent and macromolecular regions interpenetrate (Fokine & Urzhumtsev, 2002b ▶). Owing to the presence of an already defined mask boundary region, B mask, with a mean of 42 Å2, is significantly smaller than B sol (Fig. 4 ▶). Depending on the refinement program, additional mask-generation parameters such as the solvent probe radius extending the solvent boundary beyond the van der Waals surface of the protein and a back-fill probe (shrink) radius determining the penetration of the solvent back into the gap between the protein surface and solvent boundary can be optimized in the flat bulk-solvent model (Richards, 1985 ▶; Jiang & Brünger, 1994 ▶; Kostrewa, 1997 ▶).
The flat solvent model is available in programs such as CNS (Brünger et al., 1998 ▶), PHENIX (Adams et al., 2010 ▶) and REFMAC, which also allows the flat masked solvent model to be combined with a modified exponential Babinet model (Murshudov et al., 2011 ▶). CNS uses an improved flat model that incorporates grid searches for solvent parameters as suggested by Fokine & Urzhumtsev (2002b ▶) to improve convergence. CNS also refines the parameters involved in mask calculation such as solvent and shrink-truncation radii (Brunger, 2007 ▶). PHENIX uses an even more enhanced flat solvent model in which the solvent parameters are calculated analytically and simultaneously with all of the other scale parameters such as overall anisotropic scaling and twin fraction, making this method fast and independent of minimizer convergence (Afonine et al., 2013 ▶). Other enhancements to the flat solvent model that address discontinuity at the solvent–macromolecule boundary have been proposed (Fenn et al., 2010 ▶). In some cases the distribution of bulk solvent may be naturally non-uniform across the volume of the unit cell (Burling et al., 1996 ▶; Sonntag et al., 2011 ▶). In these cases the simple flat mask model may not be sufficient and better models that would account for distinguishable differences in the bulk-solvent region will have to be developed.
2.4.4. Physical meaning of solvent parameters
In a simple and most used case (except for PHENIX, where the parameterization is different; for details, see Afonine et al., 2013 ▶), the two adjustable parameters (apart from mask calculation parameters) of the flat bulk-solvent model have a clear physical meaning and predictable values (Fokine & Urzhumtsev, 2002b ▶): the mean solvent density k mask (e Å−3) and a smearing factor B mask (Å2) which describes how deeply (on average) the bulk-solvent and macromolecular regions interpenetrate. Because these two parameters are good indicators of data and model quality, they are useful criteria for the validation of new structures or checking the consistency of existing entries (for a systematic study, see, for example, Afonine, Grosse-Kunstleve et al., 2010 ▶). Unusual values of these parameters may indicate serious problems with the data, model or refinement. Typically, k mask approaching zero means no bulk-solvent contribution (which is highly improbable) and may indicate serious problems with the model, the data or both (Janssen et al., 2007 ▶; Rupp, 2012 ▶).
2.4.5. Bulk-solvent correction caveats
Mask-based bulk-solvent models can produce (difference) electron-density map artefacts depending on the choice of the parameters describing the mask surrounding the macromolecule. Examples include the following.
(i) If the mask around the protein and/or the associated shrink radius are selected so that the protein mask intrudes into true bulk solvent (i.e. no bulk solvent density is calculated in these solvent regions), the resulting electron-density difference map will show positive peaks or ‘blobs’ for the missing bulk-solvent density, which might be misinterpreted as ordered solvent components.
(ii) When unmodelled protein density is assigned as solvent, a background of solvent density will be calculated there, reducing the positive difference density of any ordered protein features.
(iii) If the protein mask excludes channels that are in fact empty but large enough so that the mask probe can enter them, they are mistakenly assigned as bulk-solvent density, resulting in negative electron-density difference map peaks.
Precautions against such artefacts include cavity-detection algorithms that eliminate holes inside a protein mask (Murshudov et al., 2011 ▶) or the introduction of an additional, unmodelled protein density contribution (Blanc et al., 2004 ▶) in the refinement.
Comparing maps obtained from the application of the masked, flat bulk-solvent model with those obtained using the exponential Babinet bulk-solvent correction can serve as a control if solvent correction-induced artefacts are suspected. The Babinet model differs from the flat mask model in two crucial aspects. The high effective attenuation (B ≃ 125–200 Å2) restricts the effect of the Babinet correction to low-resolution data, keeping the remaining high-resolution reflections (and the overall scaling dominated by them) unaffected. In addition, in the Babinet model anything that is not part of the modelled protein (F atoms) is automatically and implicitly assigned as solvent, eliminating local masking bias. As a consequence, situation (i) described above, for example, is not possible: anything not assigned as protein is treated as belonging to solvent, leading to a situation similar to that described in (ii) but likely less dramatic. Although the effects of incorrect modelling of protein versus solvent are in principle similar in the Babinet model, the type and severity of the errors that can be introduced are limited. Ultimately, better methods to define, refine and update mask parameters, probably combined with a better description of disordered protein regions (see Blanc et al., 2004 ▶), or even more complex transition regions, need to be developed to address solvent correction-induced artefacts in electron-density reconstructions.
2.5. Diffuse scattering
In contrast to the periodically present solvent density in the crystal that contributes to Bragg reflections, thermal diffuse scattering (TDS) arises from temporal and spatial disorder in the lattice [with additional diffuse scattering contributions to the recorded diffraction pattern originating from Compton scattering of the entire object(s) in the X-ray beam, not just the crystal]. Any deviation from the periodicity in the crystal lattice, including variation in the solvent density and not limited to disorder introduced by atomic displacements, causes variation in the periodicity of the electron density of a crystal and gives rise to diffuse scattering. Bragg spots reflect the time-averaged structure, while diffuse scattering contributions to the diffraction pattern provide a measure of the dynamic motions or deviations within the crystal (Clarage & Phillips, 1997 ▶). TDS can therefore provide a window into the dynamics of biological macromolecules. The advance of high-speed data collection with pixel-array detectors (PADs) and the potential to take advantage of the time structure of free-electron laser (FEL) X-ray sources have renewed interest in TDS analysis of proteins (Wall et al., 2014 ▶).
3. Ordered solvent components
Solvent components which have become ordered in the crystal lattice contribute to Bragg scattering and their electron-density reconstruction will display distinct features, allowing them to be modelled as part of the atomic structure model. Their identification is often nontrivial because their electron density may not be unique (NH4 +, Na+ and Mg2+, for example, are isoelectronic with water), and more often than not, the composition of the crystallization cocktail, chemical plausibility, charge complementarity, local environment, coordination geometry and stereochemistry provide additional and crucial clues about their identity (Table 1 ▶). Additional complications arise from the possibility of partial occupancies (which are not restrained to sum to 1.0, as is the case, for example, for multiple side-chain conformations) and from the fact that moieties possessing symmetry can be located on corresponding special positions that share the same or higher symmetry than the object. At typical macromolecular resolution, electron-density features representing water molecules or single ions appear spherical and they can be located on any special position. Small-molecule ligands or cofactors often possess symmetry compatible with crystallographic symmetry operations and can be located on corresponding special positions. Alternate partial occupancies of ligands on special positions such as twofold axes are also possible and are not uncommon. Because water molecules or ions are not as strongly bound as, for example, covalently connected side-chain atoms in a protein molecule, their B factors are not tightly restrained and can be significantly higher than those of the neighbouring protein atoms. Complicating matters further, although the resulting electron-density distributions are theoretically different, the distinction between low occupancy versus high B factor of an atom is not clear at the resolution of most protein structures.
Table 1. Common constituents of intermolecular solvent space.
Solvent component | Common source | Geometry | Electron density |
---|---|---|---|
Water | CC, PS | Single, tetrahedral, five- and six-membered rings, networks | Spherical, can be on any special position |
Buffer | CC, PS | Molecular shape, e.g. tetrahedral (SO4 2, PO4 3), some zwitterionic etc. | Molecular shape, sometimes only parts visible |
Metal ions, anions | CC, PS, S | Varying typical distances/environments, often octahedral | Spherical, higher density value according to Z, can be on any special position |
Ligand | CC, S | Molecular pose, specific contacts | Molecular shape, sometimes only parts visible |
Cofactor | CC, PS, S | Molecular pose, specific contacts, some covalent (e.g. PLP) | Molecular shape, sometimes only parts visible |
Other precipitant | CC | According to molecule (e.g. glycerol, cryo-buffer) | Molecular shape, sometimes only parts visible |
Lipid | PS, LCP | Tail in hydrophobic environment, head polar | Molecular shape, sometimes only parts visible |
Detergent | CC, PS | According to molecule, some zwitterionic | Molecular shape, sometimes only parts visible |
3.1. Water molecules
In its natural environment, a protein is surrounded by and interacts with a wide variety of other moieties, and a large part of this molecular environment consists of water molecules. The physicochemical properties of the surface amino acids of the protein impose a dynamic hydration shell of one or two layers of ordered water molecules around the protein (Fogarty & Laage, 2014 ▶), which contribute to the measured X-ray diffraction intensities and therefore must be included in the structure model. Electron density of water molecules can be readily identified by spherical 2mF o − DF c and/or mF o − DF c Fourier maps located at a hydrogen-bond distance from hydrogen-bond acceptors or donors. Only in ultrahigh-resolution electron-density maps (resolution better than ∼1.0 Å) or neutron diffraction nuclear density maps (Afonine, Mustyakimov et al., 2010 ▶) can individual H atoms be clearly discerned.
3.1.1. Hydrogen-bond networks
Owing to their polarity, water molecules are highly versatile hydrogen-bond donors and acceptors. A single molecule can donate two hydrogen bonds through its covalently bound H atoms and can accept two hydrogen bonds via the two lone pairs of electrons on its oxygen, resulting in a total of four hydrogen bonds. The hydrogen-bond network of modelled water molecules needs to make corresponding chemical sense, and a network of tetrahedrally coordinated water molecules and five-ring or six-ring assemblies can often be observed (Fig. 5 ▶). The number of water molecules that can be identified and built depends on the molecular structure, but in general more ordered water molecules can be placed at higher resolution (Fig. 6 ▶).
Because H or D atoms have significant neutron scattering factors (the former with a negative sign but significant background scattering) comparable with those of the heavier atoms, water networks and protonation states can be often assigned in nuclear density. PHENIX provides the option for separate or combined X-ray/neutron refinement (Afonine, Mustyakimov et al., 2010 ▶).
3.1.2. Automated water building
Almost all major crystallographic packages and model-building programs have been furnished with automated water-picking and analysis procedures. In the CCP4 (Winn et al., 2011 ▶) program suite, REFMAC (Murshudov et al., 2011 ▶) is coupled with calls to arp_waters (Lamzin & Wilson, 1993 ▶), which removes atoms that do not fit into the electron density and places new water atoms into unoccupied electron-density peaks under the consideration of various distance criteria. The phenix.refine program (Afonine et al., 2012 ▶) provides a command-line option for water picking (ordered_solvent) and SHELXL (Sheldrick & Schneider, 1997 ▶) provides the program SHELXWAT for ‘automated water divining’ based on the ARP/wARP procedure (Lamzin & Wilson, 1993 ▶). The CNS refinement protocols (Brunger, 2007 ▶) provide input files for water placement and removal. In general, these programs follow the same strategy as a human model builder: waters are selected by a peak search in the difference electron-density map. Additional constraints ensure that the putative water molecule is at a hydrogen-bonding distance from protein atoms or other water molecules and that it is realistically close to the protein, manifesting a reasonable hydration-shell model. In the presence of noncrystallographic symmetry (NCS), the CCP4 program WATNCS uses this information to remove waters that do not follow the NCS constraints, whereas WATERTIDY can be seen as a post-processing step after water picking that moves symmetry-related waters close to the protein chain and attempts to establish sensible hydration shells. The popular interactive model-building program Coot (Emsley et al., 2010 ▶) combines water picking and water-molecule analysis with real-space validation.
Recent developments in program packages such as PHENIX have led to much improved ‘water’-divining procedures (Echols, Morshed et al., 2014 ▶) which are able to discern water molecules from other ions based on a number of electron-density, refinement and plausibility criteria, as detailed in §3.2.
3.1.3. B factors and occupancies of ordered water molecules
After refinement, it is worthwhile taking a critical look at the B factors of modelled water molecules, which are highly variable and depend on the local environment. Water molecules are not linked by covalent bonds to other atoms, and they become increasingly more mobile with increasing distance from the protein. While the B factors of such water molecules can be considerably higher than the B factors of neighbouring protein atoms, closely bound water molecules in a stable hydrogen-bond network with protein atoms, which de facto provides a restraint on increasing displacement or mobility, will in general have B factors that are not much higher than their environmental neighbours.
3.1.4. High B-factor water molecules
With increasing distance from the ordered protein surface, displacement and mobility generally increase and, correspondingly, electron-density peaks will become wider and lower and refined B factors will increase. Very high B factors are often associated with spurious water molecules, and any density that has no sensible noncovalent interaction of less than 4–5 Å with other moieties is hard to justify using a simple water model. While significant positive difference density should be explained when possible, indiscriminately placing water molecules into each and every positive difference density creates a nonparsimonious model (Dauter et al., 2014 ▶) that is overfitted and often has inferior R free values (Brünger, 1992 ▶) or larger R–R free gaps (Tickle et al., 2000 ▶) than a simpler, plausible model. Similar considerations hold for water molecules uncritically placed in density peaks of low-resolution maps: would a single water molecule of about 1.5 Å hydrogen-to-hydrogen distance really generate a distinct peak in a 3.5 Å electron-density map? While Fig. 6 ▶ does allow a coarse estimate of how may waters can be expected to be built at a certain resolution, the ultimate criteria are reasonable electron density (satisfying the need for evidence) and physico-chemical plausibility (satisfying the need for compliance with well established prior expectations).
3.1.5. Partially occupied water molecules
When water molecules refine to high B factors, which is simply an indication that the refinement program wants less scattering contribution at the proposed water location, it can be justified to assign a partial occupancy of the water atom. Doing so will reduce the B factor and, given the (at medium resolution) almost identical effect of decreasing the occupancy or increasing the B factor, it will have little effect on the global R values. Justification therefore must come from necessity, such as distinct water densities that are perhaps too close to another one, indicating reduced occupancies (with their sum ≤1.0) or water positions that are correlated with alternate side-chain confirmations. Respective occupancy groups can be defined in the refinement programs and, if appropriate, their sum constrained to 1.0. Fig. 7 ▶ exemplifies a relatively simple instance of such a situation.
3.1.6. Low B-factor ‘water’ molecules
Low B-factor waters that deviate significantly below the B factors of neighbouring protein atoms or even approach the hard-coded lower limits for B factors (often set at 1–5 Å2 in refinement programs) are indicative of the incorrect assignment of metal cations, or of anions such as Cl−, as water molecules (see Echols, Morshed et al., 2014 ▶ and Fig. 8 ▶). In many cases, the coordination geometry and bond distances for ions (see §3.2), a review of the crystallization-cocktail components (see §3.2) and/or anomalous difference map peaks will allow a sensible explanation.
3.2. Elemental ions
Elemental ions such as cations of various natively present metals or those originating from the crystallization cocktail are almost always present in a crystal structure. Similarly, but less frequently, anions from cryo-salts can be present in crystal structures. In the case of quick ion soaks for anomalous phasing, the presence of ordered (but not necessarily fully occupied) ions and their anomalous scattering contributions is a desired outcome. Distinction between ions such as NH4 +, Na+ and Mg2+, all of which are practically isoelectronic with water (ten electrons), cannot in general be achieved on the basis of electron density or atomic displacement parameters alone. Discrimination of such isoelectronic ion sites is possible via interpretation of their bonding and coordination environments. For heavier ions, the electron density will be correspondingly higher than for water molecules, and erroneously modelled waters in place of heavier ions therefore refine to implausibly low B factors. In cases where anomalous data have been collected, anomalous difference density maps (see Strahs & Kraut, 1968 ▶; Mueller-Dieckmann et al., 2007 ▶; Read & McCoy, 2011 ▶) can be a powerful means to provide experimental evidence for localized metals. A corresponding X-ray absorption/fluorescence edge scan can identify the ion, and to some degree its local environment (Frankaer et al., 2014 ▶).
Coordination distances and geometries for most elemental ions have been tabulated, for example by Harding (2004 ▶, 2006 ▶) and Hsin et al. (2008 ▶). These parameters provide a basis for validation servers such as CheckMyMetal (Zheng et al., 2014 ▶). Coordinate-based a posteriori validation is necessarily not as powerful as combined validation/identification that includes electron density and anomalous difference peak analysis (Echols, Morshed et al., 2014 ▶), which is a convincing reason to record, keep and deposit unmerged anomalous diffraction data if at all possible.
3.3. Small-molecule ligands
Binding sites have, by their nature, evolved to attract ligand moieties. While specifics assure that the correct substrate is processed in vivo, even remotely similar molecular moieties (i.e. anything from expression host cellular contents to purification buffers to crystallization-cocktail components) can be forced into a binding site by a high enough concentration (and can even partly replace or entirely compete out the desired ligand). The potential of weak binders being forced into active sites given sufficient concentration can be used to advantage in fragment-based drug-lead discovery (Burley, 2004 ▶; Hajduk & Greer, 2007 ▶), but it can lead to unexpected ligands such as buffers in the binding site (Gokulan et al., 2005 ▶) or to the misinterpretation of spurious density, which may then be modelled with a desired but fictional ligand (Rupp, 2010 ▶; Pozharski et al., 2013 ▶; Dauter et al., 2014 ▶).
The presence of large solvent channels in macromolecular crystals allows the soaking of small molecules (mostly ligands, inhibitors, therapeutic drug leads, cofactors or nonhydrolysable substrates) into pre-formed crystals. Should soaking not be possible, co-crystallization remains an option. The techniques are well established (Danley, 2006 ▶; Hassell et al., 2007 ▶). Prerequisites for successful soaking experiments are solvent channels that are large enough to allow the respective ligand to diffuse into the crystal. The specific diffusion rate and the on-rate for ligand binding determine the time that it will take to achieve binding of the ligand, which ranges from several minutes to many hours. In small single-ion soaking, as is used for SAD phasing (Nagem et al., 2001 ▶, 2003 ▶), partial occupancies are acceptable, which can be achieved in as little as 60–300 s.
The occupancy of a ligand site is a direct function of the binding affinity and ligand concentration; it can be derived (Danley, 2006 ▶) from the definition of the dissociation constant K d and is illustrated in Fig. 9 ▶. If a ligand does not have a high binding affinity, it will not have full occupancy at low concentrations and therefore even less of its already low relative scattering mass will contribute to the (uninformative for ligand-validation purposes) global refinement residuals (Pozharski et al., 2013 ▶; Weichenberger et al., 2013 ▶). Similarly, any contributions to the corresponding ligand electron density will be reduced in proportion to the ligand occupancy. However, high occupancy can still be reached even at a low K d by sufficiently high concentrations, effectively forcing the ligand (or a crystallization-cocktail component) into a binding site (Fig. 9 ▶).
3.3.1. Experimental evidence
The proposition that a ligand in a complex structure is located in a specific position, exhibiting a unique conformation (i.e. present in a specific pose) is a very strong and powerful statement. Correspondingly, clear experimental evidence to back this strong claim is necessary. Clear positive OMIT difference electron density (Bhat, 1988 ▶) has been proposed as a minimum requirement to justify ligand placement based on experimental crystallographic data (Kleywegt, 2007 ▶; Pozharski et al., 2013 ▶; Wlodawer et al., 2013 ▶; Dauter et al., 2014 ▶). The OMIT difference density should be generated without the ligand molecule ever being placed into the elsewhere almost finished model. Otherwise, the ligand phase bias must be removed at a minimum by means such as re-refinement of the model after slight coordinate perturbation (Reddy et al., 2003 ▶) or more rigorous methods (Hodel et al., 1992 ▶; Adams et al., 1997 ▶; Brünger et al., 1997 ▶; Terwilliger et al., 2008 ▶).
In general, very high ligand B factors and low partial ligand occupancies, particularly when combined, do not provide sufficient scattering contributions, and convincing positive difference density can seldom be reconstructed. The absence of supporting ligand electron density is typically revealed by high real-space R (RSR) values and low real-space correlation coefficients (RSCCs), as calculated by various refinement packages or with separate tools such as Coot (Emsley et al., 2010 ▶), EDSTATS (Tickle, 2012 ▶) or OVERLAPMAP (Winn et al., 2011 ▶). The EDS electron-density server (Kleywegt et al., 2004 ▶) provides electron-density analysis for most published PDB entries. Note that EDS density is not ligand-omit density, and is therefore biased towards the presence of a ligand rather than its absence. The latest PDB validation reports provide another, combined measure of ligand fit, the local ligand density fit (LLDF; http://wwpdb.org/ValidationPDFNotes.html) comparing the RSR of the ligand to the mean and standard deviation of the RSR for protein atoms within 5 Å. In cases where ligands contain heavier atoms, anomalous difference data can provide clues towards their identification and evidence for their presence (Strahs & Kraut, 1968 ▶; Thorn & Sheldrick, 2011 ▶). While obvious for ligands containing phosphate moieties or heavy-metal ions, an educational example of the identification of the S atom in HEPES buffer by means of anomalous difference Fourier density is provided in a publicly available tutorial collection (Faust et al., 2008 ▶).
3.3.2. Plausibility
The weaker the scattering contributions and therefore the reconstructed electron density, the more one must rely on stereochemical restraints and the nature of the chemical environment of the proposed ligand. Poor restraint files often lead to implausible ligand geometries upon refinement (Kleywegt, 2007 ▶), and databases have been created to allow the comparison of ligand geometry with known (but not necessarily correct) PDB ligand entries (Kleywegt & Harris, 2007 ▶). PDB Ligand Expo (http://ligand-expo.rcsb.org) provides a collection of tools to access ligand structures already present in PDB files. Multiple tools exist to generate ligand restraint files based on high-resolution small-molecule crystallographic data and/or quantum-mechanical optimization (Schüttelkopf & van Aalten, 2004 ▶; Moriarty et al., 2009 ▶; Smart et al., 2011 ▶; Lebedev et al., 2012 ▶), with additional resources tabulated in Deller & Rupp (2015 ▶).
In addition to the ligand geometry, the pose of the ligand needs to be plausible and to allow corresponding contacts with the macromolecule which can be examined in model-building tools such as Coot (Emsley et al., 2010 ▶). Graphical display programs such as LIGPLOT (Laskowski & Swindells, 2011 ▶) enable visualization of the local environment of a ligand in its binding site.
3.3.3. R free set selection and model bias
Protein–ligand complex structures are often determined by molecular replacement from already known protein structure models. In the case of an isomorphous structure, simple rigid-body refinement followed by rebuilding and individual coordinate refinement may suffice. The same reflections as selected for the R free set of the original data set should be kept for proper cross-validation (Brünger, 1997 ▶). In addition, if another isomorphous structure with a ligand has been used as a starting model for re-refinement, spurious density resembling the original ligand can be reproduced as a result of model phase bias. Although modern maximum-likelihood methods are relatively robust against model bias, this possibility should be kept in mind. An initial round of simulated-annealing molecular-dynamics refinement (Adams et al., 1997 ▶) or small coordinate perturbations (Perrakis et al., 1997 ▶; Reddy et al., 2003 ▶), always with the ligand omitted (Bhat, 1988 ▶), can be used to minimize bias issues if these are suspected. If data from an isomorphous and ligand-free apo crystal are also available, F obs(ligand) − F obs(apo) difference maps revealing unbiased ligand density can be generated (Stryer et al., 1963 ▶).
3.4. Automated ligand-building tools
Ligand identification in itself is often the very goal of a crystal structure determination, initiating time- and resource-expensive steps down the line. The application and limitations of X-ray crystal structure models for structure-guided lead discovery (Blundell et al., 2002 ▶; Tickle et al., 2004 ▶), fragment screening (Burley, 2004 ▶; Hartshorn et al., 2005 ▶; Fischer & Hubbard, 2009 ▶) and drug design (Goodwill, 2001 ▶) have been reviewed elsewhere (Davis et al., 2008 ▶). As in the case of ordered solvent modelling outlined in §3.1.2, automated programs for ligand placement have been developed and implemented in most crystallographic structure-determination (Oldfield, 2001 ▶; Zwart et al., 2004 ▶; Wlodek et al., 2006 ▶; Terwilliger et al., 2006 ▶, 2007 ▶; Binkowski et al., 2010 ▶; Klei et al., 2014 ▶; Echols, Moriarty et al., 2014 ▶; Carolan & Lamzin, 2014 ▶) and validation (Kleywegt, 2007 ▶; Kleywegt & Harris, 2007 ▶; Smart et al., 2011 ▶; Pozharski et al., 2013 ▶; Weichenberger et al., 2013 ▶) packages.
While it may seem counterintuitive, ligand identification and placement are conceptually simpler tasks than ‘single-atom’ (such as water and metal ion) model building. Differentiating a single peak in a Fourier map from noise and interpreting it in terms of an atom type are difficult tasks, because at typical macromolecular resolutions Fourier maps alone do not convey the information necessary to identify the chemical element type. Consequently, most water-building tools are limited and use only very basic information, such as difference map peak height, density shape (sphericity) and position with respect to neighbouring peaks or already placed atoms. Identification of non-water single-atom ions can use more heuristics, such as preferred chemical environments and characteristic peaks in anomalous difference maps, when available.
Ligands containing more than one atom carry extra information, which is crucial to uniquely and unambiguously identify and place them: the shape. However, a combination of imperfections may transform the shape of the ligand density quite significantly and make its interpretation a challenging task. Contributing factors are (i) pathologies related to crystal quality (anisotropy, twinning etc.), (ii) widely varying diffraction data quality (resolution, completeness, experimental errors etc.) and (iii) the intrinsic nature of ligands being flexible and dynamic. As a result model quality can vary widely, and corresponding validation against evidence and prior expectations as outlined in §§3.3.1 and 3.3.2 is necessary.
3.5. Crystallization-cocktail components
The vast majority of proteins only crystallize in the presence of a highly concentrated precipitant cocktail which, together with components from high concentrations of cryoprotecting agents, can provide a source of unintended ligands. Consequently, components of the crystallization cocktail are often the source of some electron density visible in a known binding site. Occasionally, specific interactions are formed in these sites, and the identity of the unexpected ligand is clearly revealed (Gokulan et al., 2005 ▶). In the case of unexpected ligands that are disordered and appear in or near the predicted target binding sites, it may be rather tempting to place the ligand of interest in an arbitrary or even a plausible pose into such uninterpretable density (Fig. 10 ▶). The poor fit may then be explained by invoking the possibility that the ligand binds in multiple conformations. For instance, in PDB entry 3qd1/4i8e (Pyburn et al., 2011 ▶) a disaccharide was positioned into difference density that can readily be identified as originating from a HEPES molecule (Muller, 2013 ▶). In a recent analysis (Pozharski et al., 2013 ▶; Weichenberger et al., 2013 ▶) it was found that a significant fraction of problematic ligands belong to the class of misinterpreted crystallization-cocktail components. Using critical analysis and examining plausible sources of not clearly interpretable electron density can almost always prevent the biased misinterpretation of such electron density as the desired ligands.
4. Summary of solvent constituents and tools for their treatment
The solvent is an integral and also often an intricate part of almost any macromolecular crystal structure. Its disordered bulk components as well as its ordered constituents of varying nature need to be accounted for in modelling and refinement. The improvement of bulk-solvent description from a fundamental perspective is largely driven by methods development. In bulk-solvent refinement, users have relatively little choice beyond solvent-model selection and not much opportunity for the introduction of bias or specific model errors, with the caveat that suboptimal masking can introduce density artefacts. Modelling of distinct solvent electron density requires thoughtful interpretation, and using appropriate tools for (automated) building and validation can greatly improve the quality of structure models. A summary of frequently encountered tasks and current tools for solvent treatment, modelling and validation are provided in Table 2 ▶.
Table 2. Summary of common tasks and current tools for bulk-solvent treatment and modelling and validation of ordered solvent constituents.
Task | Program | Reference |
---|---|---|
Solvent-content estimate | MATTPROB | Matthews (1968 ▶), Kantardjieff Rupp (2003 ▶), Weichenberger Rupp (2014 ▶) |
MATTHEWS_COEF | Matthews (1968 ▶), Winn et al. (2011 ▶) | |
Phaser | McCoy et al. (2007 ▶) | |
BUSTER | Bricogne et al. (2010 ▶) | |
PHENIX | Adams et al. (2010 ▶) | |
Solvent masking, density modification | SOLOMON | Abrahams Leslie (1996 ▶) |
AutoSHARP | Vonrhein et al. (2007 ▶) | |
DM, DMMULTI | Kostrewa (1997 ▶), Cowtan (2010 ▶) | |
SHELXE | Sheldrick (2010 ▶) | |
PHENIX | Adams et al. (2010 ▶) | |
Water building, sorting | arp_waters | Perrakis et al. (1997 ▶), Winn et al. (2011 ▶) |
WATNCS | Winn et al. (2011 ▶) | |
WATERTIDY | Winn et al. (2011 ▶) | |
SHELXWAT | Sheldrick (2008 ▶) | |
CNS | Brunger (2007 ▶) | |
Coot | Emsley et al. (2010 ▶) | |
PHENIX | Adams et al. (2010 ▶) | |
Metal identification, validation | CheckMyMetal | Zheng et al. (2014 ▶) |
PHENIX | Echols, Morshed et al. (2014 ▶) | |
Anomalous difference maps | Read McCoy (2011 ▶), Mueller-Dieckmann et al. (2007 ▶) | |
ANODE | Thorn Sheldrick (2011 ▶) | |
Distance-geometry table | Hsin et al. (2008 ▶), Harding Hsin (2014 ▶) | |
Ligand restraint files | PDB Ligand Expo | Feng et al. (2004 ▶) |
PRODRG | Schttelkopf van Aalten (2004 ▶) | |
Grade | Smart et al. (2011 ▶) | |
JLigand | Lebedev et al. (2012 ▶) | |
PHENIX eLBOW | Moriarty et al. (2009 ▶) | |
Coot | Debreczeni Emsley (2012 ▶) | |
Mogul | Bruno et al. (2004 ▶) | |
Ligand building | Coot | Emsley et al. (2010 ▶) |
ARP/wARP ligand | Carolan Lamzin (2014 ▶) | |
PHENIX | Echols, Moriarty et al. (2014 ▶) | |
Ligand validation | EDS | Kleywegt et al. (2004 ▶) |
EDSTATS | Tickle (2012 ▶) | |
OVERLAPMAP | Winn et al. (2011 ▶) | |
HIC-Up | Kleywegt (2007 ▶) | |
ValLigURL | Kleywegt Harris (2007 ▶) | |
VHELIBS | Cereto-Massagu et al. (2013 ▶) | |
Twilight | Weichenberger et al. (2013 ▶), Pozharski et al. (2013 ▶) | |
LIGPLOT | Laskowski Swindells (2011 ▶) | |
Coot | Emsley et al. (2010 ▶), Debreczeni Emsley (2012 ▶) | |
BUSTER | Smart et al. (2011 ▶) | |
PDB (RSR Z score, LLDF) | Read et al. (2011 ▶), http://wwpdb.org/ValidationPDFNotes.html |
Supplementary Material
Supplementary material with additional figures for nucleic acids.. DOI: 10.1107/S1399004715006045/jm5003sup1.pdf
Acknowledgments
BR acknowledges support from the European Union under an FP7 Marie Curie People Action grant PIIF-GA-2011-300025 (SAXCESS). PVA acknowledges support by the NIH (Project 1P01 GM063210), the Phenix Industrial Consortium and the US Department of Energy under Contract No. DE-AC02-05CH11231. Extended discussions with Dale Tronrud (Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon, USA) and Dirk Kostrewa (LMU Gene Center, Munich, Germany) as well as review comments and suggestions have led to significant improvements of the manuscript. Dale Tronrud also kindly provided Figs. 10(e) and 10(f). Randy Read (CIMR, Cambridge, England) and Alexandre Urzhumtsev (IGBMC, Strasbourg, France) provided implementation details for their respective computer programs.
Footnotes
Note that crystallographers never work with true electron-density maps. The maps we construct and display are only finite-resolution Fourier images (transforms) of the true electron density.
S 2(h) = h t G*h, where G * is the reciprocal-space metric tensor and h t = (h, k, l) is the transpose of the Miller-index column vector h.
References
- Abrahams, J. P. & Leslie, A. G. W. (1996). Acta Cryst. D52, 30–42. [DOI] [PubMed]
- Adams, P. D. et al. (2010). Acta Cryst. D66, 213–221.
- Adams, P. D., Pannu, N. S., Read, R. J. & Brünger, A. T. (1997). Proc. Natl Acad. Sci. USA, 94, 5018–5023. [DOI] [PMC free article] [PubMed]
- Afonine, P. V. & Adams, P. D. (2012). Comput. Crystallogr. Newsl. 3, 18–21.
- Afonine, P. V., Grosse-Kunstleve, R. W., Adams, P. D. & Urzhumtsev, A. (2013). Acta Cryst. D69, 625–634. [DOI] [PMC free article] [PubMed]
- Afonine, P. V., Grosse-Kunstleve, R. W., Chen, V. B., Headd, J. J., Moriarty, N. W., Richardson, J. S., Richardson, D. C., Urzhumtsev, A., Zwart, P. H. & Adams, P. D. (2010). J. Appl. Cryst. 43, 669–676. [DOI] [PMC free article] [PubMed]
- Afonine, P. V., Grosse-Kunstleve, R. W., Echols, N., Headd, J. J., Moriarty, N. W., Mustyakimov, M., Terwilliger, T. C., Urzhumtsev, A., Zwart, P. H. & Adams, P. D. (2012). Acta Cryst. D68, 352–367. [DOI] [PMC free article] [PubMed]
- Afonine, P. V., Mustyakimov, M., Grosse-Kunstleve, R. W., Moriarty, N. W., Langan, P. & Adams, P. D. (2010). Acta Cryst. D66, 1153–1163. [DOI] [PMC free article] [PubMed]
- Barrett, A. N. & Zwick, M. (1971). Acta Cryst. A27, 6–11.
- Bhat, T. N. (1988). J. Appl. Cryst. 21, 279–281.
- Binkowski, T. A., Cuff, M., Nocek, B., Chang, C. & Joachimiak, A. (2010). J. Struct. Funct. Genomics, 11, 21–30. [DOI] [PMC free article] [PubMed]
- Blanc, E., Roversi, P., Vonrhein, C., Flensburg, C., Lea, S. M. & Bricogne, G. (2004). Acta Cryst. D60, 2210–2221. [DOI] [PubMed]
- Blundell, T. L., Jhoti, H. & Abell, C. (2002). Nature Rev. Drug. Discov. 1, 45–54. [DOI] [PubMed]
- Bragg, W. L. & Perutz, M. F. (1952). Acta Cryst. 5, 277–283.
- Bricogne, G. (1974). Acta Cryst. A30, 395–405.
- Bricogne, G. (1984). Acta Cryst. A40, 410–445.
- Bricogne, G., Blanc, E., Brandl, M., Flensburg, C., Keller, P., Paciorek, P., Roversi, P., Sharff, A., Smart, O., Vonrhein, C. & Womack, T. O. (2010). BUSTER v.2.9. Global Phasing Ltd, Cambridge, England.
- Brünger, A. T. (1992). Nature (London), 355, 472–475. [DOI] [PubMed]
- Brünger, A. T. (1997). Methods Enzymol. 277, 366–396. [DOI] [PubMed]
- Brunger, A. T. (2007). Nature Protoc. 2, 2728–2733. [DOI] [PubMed]
- Brünger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J.-S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T. & Warren, G. L. (1998). Acta Cryst. D54, 905–921. [DOI] [PubMed]
- Brünger, A. T., Adams, P. D. & Rice, L. M. (1997). Structure, 5, 325–336. [DOI] [PubMed]
- Bruno, I. J., Cole, J. C., Kessler, M., Luo, J., Motherwell, W. D., Purkis, L. H., Smith, B. R., Taylor, R., Cooper, R. I., Harris, S. E. & Orpen, A. G. (2004). J. Chem. Inf. Comput. Sci. 44, 2133–2144. [DOI] [PubMed]
- Burley, S. (2004). Mod. Drug. Discov. 7, 53–56.
- Burling, F. T., Weis, W. I., Flaherty, K. M. & Brünger, A. T. (1996). Science, 271, 72–77. [DOI] [PubMed]
- Carolan, C. G. & Lamzin, V. S. (2014). Acta Cryst. D70, 1844–1853. [DOI] [PMC free article] [PubMed]
- Cereto-Massagué, A., Ojeda, M. J., Joosten, R. P., Valls, C., Mulero, M., Salvado, M. J., Arola-Arnal, A., Arola, L., Garcia-Vallvé, S. & Pujadas, G. (2013). J. Cheminform. 5, 36. [DOI] [PMC free article] [PubMed]
- Chruszcz, M., Potrzebowski, W., Zimmerman, M. D., Grabowski, M., Zheng, H., Lasota, P. & Minor, W. (2008). Protein Sci. 17, 623–632. [DOI] [PMC free article] [PubMed]
- Clarage, J. B. & Phillips, G. N. (1997). Methods Enzymol. 277, 407–432. [DOI] [PubMed]
- Cowtan, K. D. (1996). Proceedings of the CCP4 Study Weekend. Macromolecular Refinement, edited by E. Dodson, M. Moore, A. Ralph & S. Bailey, pp. 23–28. Warrington: Daresbury Laboratory.
- Cowtan, K. (2010). Acta Cryst. D66, 470–478. [DOI] [PMC free article] [PubMed]
- Danley, D. E. (2006). Acta Cryst. D62, 569–575. [DOI] [PubMed]
- Dauter, Z., Dauter, M. & Dodson, E. J. (2002). Acta Cryst. D58, 494–506. [DOI] [PubMed]
- Dauter, Z., Wlodawer, A., Minor, W., Jaskolski, M. & Rupp, B. (2014). IUCrJ, 1, 179–193. [DOI] [PMC free article] [PubMed]
- Davis, A. M., St-Gallay, S. A. & Kleywegt, G. J. (2008). Drug Discov. Today, 13, 831–841. [DOI] [PMC free article] [PubMed]
- Debreczeni, J. É. & Emsley, P. (2012). Acta Cryst. D68, 425–430. [DOI] [PMC free article] [PubMed]
- Deller, M. & Rupp, B. (2015). J. Comput. Aided Mol. Des. 29, 1–20. [DOI] [PMC free article] [PubMed]
- Diederichs, K. & Karplus, P. A. (2013). Acta Cryst. D69, 1215–1222. [DOI] [PMC free article] [PubMed]
- Echols, N., Moriarty, N. W., Klei, H. E., Afonine, P. V., Bunkóczi, G., Headd, J. J., McCoy, A. J., Oeffner, R. D., Read, R. J., Terwilliger, T. C. & Adams, P. D. (2014). Acta Cryst. D70, 144–154. [DOI] [PMC free article] [PubMed]
- Echols, N., Morshed, N., Afonine, P. V., McCoy, A. J., Miller, M. D., Read, R. J., Richardson, J. S., Terwilliger, T. C. & Adams, P. D. (2014). Acta Cryst. D70, 1104–1114. [DOI] [PMC free article] [PubMed]
- Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. [DOI] [PMC free article] [PubMed]
- Faust, A., Panjikar, S., Mueller, U., Parthasarathy, V., Schmidt, A., Lamzin, V. S. & Weiss, M. S. (2008). J. Appl. Cryst. 41, 1161–1172.
- Feng, Z., Chen, L., Maddula, H., Akcan, O., Oughtred, R., Berman, H. M. & Westbrook, J. (2004). Bioinformatics, 20, 2153–2155. [DOI] [PubMed]
- Fenn, T. D., Schnieders, M. J. & Brunger, A. T. (2010). Acta Cryst. D66, 1024–1031. [DOI] [PMC free article] [PubMed]
- Fischer, M. & Hubbard, R. E. (2009). Mol. Interv. 9, 22–30. [DOI] [PubMed]
- Fischer, H., Polikarpov, I. & Craievich, A. F. (2004). Protein Sci. 13, 2825–2828. [DOI] [PMC free article] [PubMed]
- Fogarty, A. C. & Laage, D. (2014). J. Phys. Chem. B, 118, 7715–7729. [DOI] [PMC free article] [PubMed]
- Fokine, A. & Urzhumtsev, A. (2002a). Acta Cryst. A58, 72–74. [DOI] [PubMed]
- Fokine, A. & Urzhumtsev, A. (2002b). Acta Cryst. D58, 1387–1392. [DOI] [PubMed]
- Frankaer, C. G., Mossin, S., Ståhl, K. & Harris, P. (2014). Acta Cryst. D70, 110–122. [DOI] [PMC free article] [PubMed]
- Glykos, N. M. & Kokkinidis, M. (2000). Acta Cryst. D56, 1070–1072. [DOI] [PubMed]
- Gokulan, K., Khare, S., Ronning, D., Linthicum, S. D., Sacchettini, J. C. & Rupp, B. (2005). Biochemistry, 44, 9889–9898. [DOI] [PubMed]
- Goodwill, K. E. (2001). Drug Discov. Today, 6, 113–118.
- Hajduk, P. J. & Greer, J. (2007). Nature Rev. Drug Discov. 6, 211–219. [DOI] [PubMed]
- Harding, M. M. (2004). Acta Cryst. D60, 849–859. [DOI] [PubMed]
- Harding, M. M. (2006). Acta Cryst. D62, 678–682. [DOI] [PubMed]
- Harding, M. M. & Hsin, K.-Y. (2014). Methods Mol. Biol. 1091, 333–342. [DOI] [PubMed]
- Hartshorn, M. J., Murray, C. W., Cleasby, A., Frederickson, M., Tickle, I. J. & Jhoti, H. (2005). J. Med. Chem. 48, 403–413. [DOI] [PubMed]
- Hassell, A. M. et al. (2007). Acta Cryst. D63, 72–79.
- Hodel, A., Kim, S.-H. & Brünger, A. T. (1992). Acta Cryst. A48, 851–858.
- Holton, J. M., Classen, S., Frankel, K. A. & Tainer, J. A. (2014). FEBS J. 281, 4046–4060. [DOI] [PMC free article] [PubMed]
- Hoppe, W. & Gassmann, J. (1968). Acta Cryst. B24, 97–107.
- Hsieh, Y.-C., Wu, Y.-J., Chiang, T.-Y., Kuo, C.-Y., Shrestha, K. L., Chao, C.-F., Huang, Y.-C., Chuankhayan, P., Wu, W., Li, Y.-K. & Chen, C.-J. (2010). J. Biol. Chem. 285, 31603–31615. [DOI] [PMC free article] [PubMed]
- Hsin, K., Sheng, Y., Harding, M. M., Taylor, P. & Walkinshaw, M. D. (2008). J. Appl. Cryst. 41, 963–968.
- Hunter, J. D. (2007). Comput. Sci. Eng. 9, 90–95.
- Jack, A. & Levitt, M. (1978). Acta Cryst. A34, 931–935.
- Janssen, J. C., Read, R. J., Brünger, A. T. & Gros, P. (2007). Nature (London), 448, E1–E2. [DOI] [PMC free article] [PubMed]
- Jiang, J.-S. & Brünger, A. T. (1994). J. Mol. Biol. 243, 100–115. [DOI] [PubMed]
- Kantardjieff, K. A., Höchtl, P., Segelke, B. W., Tao, F.-M. & Rupp, B. (2002). Acta Cryst. D58, 735–743. [DOI] [PubMed]
- Kantardjieff, K. A. & Rupp, B. (2003). Protein Sci. 12, 1865–1871. [DOI] [PMC free article] [PubMed]
- Klei, H. E., Moriarty, N. W., Echols, N., Terwilliger, T. C., Baldwin, E. T., Pokross, M., Posy, S. & Adams, P. D. (2014). Acta Cryst. D70, 134–143. [DOI] [PMC free article] [PubMed]
- Kleywegt, G. J. (2007). Acta Cryst. D63, 94–100. [DOI] [PMC free article] [PubMed]
- Kleywegt, G. J. & Harris, M. R. (2007). Acta Cryst. D63, 935–938. [DOI] [PubMed]
- Kleywegt, G. J., Harris, M. R., Zou, J., Taylor, T. C., Wählby, A. & Jones, T. A. (2004). Acta Cryst. D60, 2240–2249. [DOI] [PubMed]
- Kleywegt, G. J. & Read, R. J. (1997). Structure, 5, 1557–1569. [DOI] [PubMed]
- Kostrewa, D. (1997). CCP4 Newsl. Protein Crystallogr. 34, 9–22.
- Lamzin, V. S. & Wilson, K. S. (1993). Acta Cryst. D49, 129–147. [DOI] [PubMed]
- Laskowski, R. A. & Swindells, M. B. (2011). J. Chem. Inf. Model. 51, 2778–2786. [DOI] [PubMed]
- Lebedev, A. A., Young, P., Isupov, M. N., Moroz, O. V., Vagin, A. A. & Murshudov, G. N. (2012). Acta Cryst. D68, 431–440. [DOI] [PMC free article] [PubMed]
- Li, J., Derewenda, U., Dauter, Z., Smith, S. & Derewenda, Z. S. (2000). Nature Struct. Biol. 7, 555–559. [DOI] [PubMed]
- Lunin, V. Y. (1988). Acta Cryst. A44, 144–150.
- Luo, Z., Rajashankar, K. & Dauter, Z. (2014). Acta Cryst. D70, 253–260. [DOI] [PMC free article] [PubMed]
- Matthews, B. W. (1968). J. Mol. Biol. 33, 491–497. [DOI] [PubMed]
- Matthews, B. W. (1976). Annu. Rev. Phys. Chem. 27, 493–523.
- McCoy, A. J. (2007). Acta Cryst. D63, 32–41. [DOI] [PMC free article] [PubMed]
- McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658–674. [DOI] [PMC free article] [PubMed]
- McPherson, A. & Larson, S. B. (2015). Crystallogr. Rev. 21, 3–54.
- Moews, P. C. & Kretsinger, R. H. (1975). J. Mol. Biol. 91, 201–225. [DOI] [PubMed]
- Moriarty, N. W., Grosse-Kunstleve, R. W. & Adams, P. D. (2009). Acta Cryst. D65, 1074–1080. [DOI] [PMC free article] [PubMed]
- Mueller-Dieckmann, C., Panjikar, S., Schmidt, A., Mueller, S., Kuper, J., Geerlof, A., Wilmanns, M., Singh, R. K., Tucker, P. A. & Weiss, M. S. (2007). Acta Cryst. D63, 366–380. [DOI] [PubMed]
- Muller, Y. A. (2013). Acta Cryst. F69, 1071–1076. [DOI] [PMC free article] [PubMed]
- Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367. [DOI] [PMC free article] [PubMed]
- Nagem, R. A. P., Dauter, Z. & Polikarpov, I. (2001). Acta Cryst. D57, 996–1002. [DOI] [PubMed]
- Nagem, R. A. P., Polikarpov, I. & Dauter, Z. (2003). Methods Enzymol. 374, 120–137. [DOI] [PubMed]
- Navratna, V., Nadig, S., Sood, V., Prasad, K., Arakere, G. & Gopal, B. (2010). J. Bacteriol. 192, 134–144. [DOI] [PMC free article] [PubMed]
- Oldfield, T. J. (2001). Acta Cryst. D57, 696–705. [DOI] [PubMed]
- Parkin, S., Rupp, B. & Hope, H. (1996). Acta Cryst. D52, 18–29. [DOI] [PubMed]
- Perrakis, A., Sixma, T. K., Wilson, K. S. & Lamzin, V. S. (1997). Acta Cryst. D53, 448–455. [DOI] [PubMed]
- Phillips, S. E. V. (1980). J. Mol. Biol. 142, 531–554. [DOI] [PubMed]
- Podjarny, A. D., Bhat, T. N. & Zwick, M. (1987). Annu. Rev. Biophys. Biophys. Chem. 16, 351–373. [DOI] [PubMed]
- Podjarny, A. D., Rees, B. & Urzhumtsev, A. G. (1996). Methods Mol. Biol. 56, 205–226. [DOI] [PubMed]
- Podjarny, A. D. & Urzhumtsev, A. G. (1997). Methods Enzymol. 276, 641–658. [DOI] [PubMed]
- Pozharski, E., Weichenberger, C. X. & Rupp, B. (2013). Acta Cryst. D69, 150–167. [DOI] [PubMed]
- Pyburn, T. M., Bensing, B. A., Xiong, Y. Q., Melancon, B. J., Tomasiak, T. M., Ward, N. J., Yankovskaya, V., Oliver, K. M., Cecchini, G., Sulikowski, G. A., Tyska, M. J., Sullam, P. M. & Iverson, T. M. (2011). PLoS Pathog. 7, e1002112. [DOI] [PMC free article] [PubMed]
- Quillin, M. L. & Matthews, B. W. (2000). Acta Cryst. D56, 791–794. [DOI] [PubMed]
- Read, R. J. (1986). Acta Cryst. A42, 140–149.
- Read, R. J. & McCoy, A. J. (2011). Acta Cryst. D67, 338–344. [DOI] [PMC free article] [PubMed]
- Read, R. J. et al. (2011). Structure, 19, 1395–1412. [DOI] [PMC free article] [PubMed]
- Reddy, V., Swanson, S. M., Segelke, B., Kantardjieff, K. A., Sacchettini, J. C. & Rupp, B. (2003). Acta Cryst. D59, 2200–2210. [DOI] [PubMed]
- Richards, F. M. (1985). Methods Enzymol. 115, 440–464. [DOI] [PubMed]
- Rupp, B. (2009). Biomolecular Crystallography: Principles, Practice, and Application to Structural Biology. New York: Garland Science.
- Rupp, B. (2010). J. Appl. Cryst. 43, 1242–1249.
- Rupp, B. (2012). Acta Cryst. F68, 366–376. [DOI] [PMC free article] [PubMed]
- Schüttelkopf, A. W. & van Aalten, D. M. F. (2004). Acta Cryst. D60, 1355–1363. [DOI] [PubMed]
- Sheldrick, G. M. (2002). Z. Kristallogr. 217, 644–650.
- Sheldrick, G. M. (2008). Acta Cryst. A64, 112–122. [DOI] [PubMed]
- Sheldrick, G. M. (2010). Acta Cryst. D66, 479–485. [DOI] [PMC free article] [PubMed]
- Sheldrick, G. M. & Schneider, T. R. (1997). Methods Enzymol. 277, 319–343. [PubMed]
- Smart, O. S., Womack, T. O., Flensburg, C., Keller, P., Paciorek, W., Sharff, A., Vonrhein, C. & Bricogne, G. (2011). Acta Cryst. A67, C134. [DOI] [PMC free article] [PubMed]
- Sonntag, Y., Musgaard, M., Olesen, C., Schiøtt, B., Møller, J. V., Nissen, P. & Thøgersen, L. (2011). Nature Commun. 2, 304. [DOI] [PubMed]
- Strahs, G. & Kraut, J. (1968). J. Mol. Biol. 35, 503–512. [DOI] [PubMed]
- Stryer, L., Kendrew, J. C. & Watson, H. C. (1963). J. Mol. Biol. 8, 96–104. [DOI] [PubMed]
- Terwilliger, T. C. (2000). Acta Cryst. D56, 965–972. [DOI] [PMC free article] [PubMed]
- Terwilliger, T. C., Adams, P. D., Moriarty, N. W. & Cohn, J. D. (2007). Acta Cryst. D63, 101–107. [DOI] [PMC free article] [PubMed]
- Terwilliger, T. C., Grosse-Kunstleve, R. W., Afonine, P. V., Moriarty, N. W., Adams, P. D., Read, R. J., Zwart, P. H. & Hung, L.-W. (2008). Acta Cryst. D64, 515–524. [DOI] [PMC free article] [PubMed]
- Terwilliger, T. C., Klei, H., Adams, P. D., Moriarty, N. W. & Cohn, J. D. (2006). Acta Cryst. D62, 915–922. [DOI] [PMC free article] [PubMed]
- Thorn, A. & Sheldrick, G. M. (2011). J. Appl. Cryst. 44, 1285–1287. [DOI] [PMC free article] [PubMed]
- Tickle, I. J. (2012). Acta Cryst. D68, 454–467. [DOI] [PMC free article] [PubMed]
- Tickle, I. J., Laskowski, R. A. & Moss, D. S. (2000). Acta Cryst. D56, 442–450. [DOI] [PubMed]
- Tickle, I. J., Sharff, A., Vinkovic, M., Yon, J. & Jhoti, H. (2004). Chem. Soc. Rev. 33, 558–565. [DOI] [PubMed]
- Trofimov, A. A., Polyakov, K. M., Boyko, K. M., Tikhonova, T. V., Safonova, T. N., Tikhonov, A. V., Popov, A. N. & Popov, V. O. (2010). Acta Cryst. D66, 1043–1047. [DOI] [PubMed]
- Tronrud, D. E. (1997). Methods Enzymol. 277, 306–319. [DOI] [PubMed]
- Tronrud, D. E. & Allen, J. P. (2012). Photosynth. Res. 112, 71–74. [DOI] [PMC free article] [PubMed]
- Urzhumtseva, L. & Urzhumtsev, A. (2011). J. Appl. Cryst. 44, 865–872. [DOI] [PMC free article] [PubMed]
- Urzhumtsev, A. G., Lunin, V. Y. & Luzyanina, T. B. (1989). Acta Cryst. A45, 34–39.
- Viola, R., Carman, P., Walsh, J., Miller, E., Benning, M., Frankel, D., McPherson, A., Cudney, B. & Rupp, B. (2007). J. Appl. Cryst. 40, 539–545. [DOI] [PMC free article] [PubMed]
- Vonrhein, C., Blanc, E., Roversi, P. & Bricogne, G. (2007). Methods Mol. Biol. 364, 215–253. [DOI] [PubMed]
- Wall, M. E., Adams, P. D., Fraser, J. S. & Sauter, N. K. (2014). Structure, 22, 182–184. [DOI] [PMC free article] [PubMed]
- Wang, B.-C. (1985). Methods Enzymol. 115, 90–112. [DOI] [PubMed]
- Weichenberger, C. X., Pozharski, E. & Rupp, B. (2013). Acta Cryst. F69, 195–200. [DOI] [PMC free article] [PubMed]
- Weichenberger, C. X. & Rupp, B. (2014). Acta Cryst. D70, 1579–1588. [DOI] [PubMed]
- Winn, M. D. et al. (2011). Acta Cryst. D67, 235–242.
- Wlodawer, A., Minor, W., Dauter, Z. & Jaskolski, M. (2013). FEBS J. 280, 5705–5736. [DOI] [PMC free article] [PubMed]
- Wlodek, S., Skillman, A. G. & Nicholls, A. (2006). Acta Cryst. D62, 741–749. [DOI] [PubMed]
- Wunderlich, M., Max, K. E. A., Roske, Y., Mueller, U., Heinemann, U. & Schmid, F. X. (2007). J. Mol. Biol. 373, 775–784. [DOI] [PubMed]
- Zhang, K. Y. J., Cowtan, K. D. & Main, P. (2001). International Tables for Crystallography, Vol. F, editeted by M. G. Rossmann & E. Arnold, pp. 311–324. Dordrecht: Kluwer Academic Publishers.
- Zhang, K. Y. J. & Main, P. (1990). Acta Cryst. A46, 41–46.
- Zheng, H., Chordia, M. D., Cooper, D. R., Chruszcz, M., Müller, P., Sheldrick, G. M. & Minor, W. (2014). Nature Protoc. 9, 156–170. [DOI] [PMC free article] [PubMed]
- Zwart, P. H., Grosse-Kunstleve, R. W., Lebedev, A. A., Murshudov, G. N. & Adams, P. D. (2008). Acta Cryst. D64, 99–107. [DOI] [PMC free article] [PubMed]
- Zwart, P. H., Langer, G. G. & Lamzin, V. S. (2004). Acta Cryst. D60, 2230–2239. [DOI] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary material with additional figures for nucleic acids.. DOI: 10.1107/S1399004715006045/jm5003sup1.pdf