Abstract
X-ray crystallography is experiencing a renaissance as a method for probing the protein conformational ensemble. The inherent limitations of Bragg analysis, however, which only reveals the mean structure, have given way to a surge in interest in diffuse scattering, which is caused by structure variations. Diffuse scattering is present in all macromolecular crystallography experiments. Recent studies are shedding light on the origins of diffuse scattering in protein crystallography, and provide clues for leveraging diffuse scattering to model protein motions with atomic detail.
Introduction
With over 100 000 X-ray structures deposited in the wwPDB [1], improvements in data processing pipelines, and the advent of completely unattended data collection, it seems hard to imagine that there are any aspects of protein X-ray crystallography that remain to be optimized. However, only about half of the X-rays scattered by the crystalline sample are currently being analyzed —those in the Bragg peaks. The weaker, more smoothly varying features in diffraction images, known as diffuse scattering, are largely ignored by current practices. While the analysis of diffuse scattering is an established method in the fields of small molecule crystallography [2] and materials science [3], there are only very few foundational studies of diffuse scattering in macromolecular crystallography [4–15,16•,17,18]. However, the relative scarcity of diffuse scattering studies is poised to change as activity in the field has recently increased.
A small group of researchers (including MEW and JSF) met in 2014 to discuss the challenges and opportunities of investigating macromolecular diffuse scattering [20]. Our attention was drawn to several key developments in the field of macromolecular crystallography that motivated and enabled assessment of the diffuse signal. First, structural models obtained using traditional methods appear to be reaching a plateau in quality, as R factors remain relatively high compared to what can be achieved in small-molecule crystallography. The origin of this ‘R-factor gap’ is likely due to the underlying inadequacies of the structural models refined against crystallographic data [23]. These inadequacies can only be overcome if we can improve the modeling, including, for example, conformational heterogeneity (especially in data collected at room temperature [24]), solvation, and lattice imperfections that break the assumptions of ‘perfect crystals’ used in data reduction and refinement. Second, new detectors were enabling collection of data with lower noise, higher dynamic range, and highly localized signal. Third, new light sources were emerging with very bright, micro-focused beams (e.g. X-ray free-electron lasers). Collectively, these factors made us optimistic that diffuse scattering data both was needed and could be measured accurately enough to improve structural modeling. In early 2017, many of us met again to discuss the progress of the field with respect to each of these challenges identified in 2014 [25]. In this review, we provide our perspective on this progress and the status of the field, informed in part by our observations at that meeting and advances covered by Meisburger et al. [26••]. While there have been exciting developments in recent years, there are still major challenges ahead, include modeling atomic motions in protein crystals using diffuse scattering data with accuracy comparable to the Bragg analysis, and utilizing these models of protein motions to distinguish between competing biochemical mechanisms.
Data collection
Extraction of diffuse scattering data from conventional protein crystallography experiments is becoming straightforward thanks to the increased accessibility of photon-counting pixel array detectors (PADs, e.g. Pilatus detectors). These detectors have greater dynamic range and do not suffer from ‘blooming’ overloads that obscured diffuse signals near Bragg peaks on conventional charge-coupled device (CCD) detectors. (An early CCD detector was programmed to drain excess charge away from over-flowing pixels to enable measurement of diffuse scattering data [18,27]; however, this feature was not implemented in commercial detectors.) Additionally the use of PADs has led to changes in data collection strategies, such as the use of fine phi angle scans, that facilitate analysis of Bragg peaks and diffuse features from the same set of images [19••]. A second major advance is the measurement of diffuse scattering using an X-ray free-electron laser (XFEL) in a serial femtosecond crystallography (SFX) experiment [28••]. Using an XFEL enables collection of radiation-damage-free room temperature data, as well the potential to examine time-resolved changes in the diffuse scattering signal.
Despite these advances in collection of diffuse scattering data, minimizing background scattering remains the most important obstacle to collecting high quality data. While it is possible to remove some background scattering during data processing, the cleanest separation requires one to remove scattering extraneous to the crystal during the experiment. Factors to consider during collection of single crystal datasets include the thickness and orientation of the loop (for relevant mounting schemes), the volume of liquid surrounding the crystal, and the amount of airspace between the crystal and the detector. Background air scatter can be also reduced by a Helium or vacuum path between sample and detector. Collection of SFX data adds additional complexity, as the injection stream and crystal size will vary. Ayyer et al. [28••] addressed this challenge by selecting only the frames with the strongest diffuse scattering signal, in which the size of the crystal was expected to be comparable to the width of the jet. As the landscape of sample delivery devices for SFX and conventional crystallography continues to evolve, mounted sample delivery on materials such as graphene [29•] provides a promising route for minimization of background scattering.
Data integration
Early studies of protein diffuse scattering focused on explaining features in individual diffraction images. The introduction of methods for three-dimensional diffuse data integration enabled quantitative validation of models of correlated motions [18]. Several approaches to 3D data integration now have been implemented [27,28••,30,31••,32]. These approaches differ in several key ways: the scaling of intensities when merging the data; the handling of intensities in the neighborhood of the Bragg peak; and the strategy for sampling of reciprocal space. In the Lunus software for diffuse scattering (https://github.com/mewall/lunus) we have chosen:
To use the diffuse intensity itself to scale the diffuse data (as opposed to using the Bragg peaks, as in Ref. [31••]). This choice avoids artifacts due to potential differences in the way the Bragg and diffuse scattering vary with radiation damage and other confounding factors. The response of these signals to damage requires further study before a definitive scaling strategy can be chosen.
To ignore or filter intensity values in regions where the variations are sharper than the 3D grid that will hold the integrated data. This can include masking halo intensities too close to a Bragg peak, and kernel-based image processing to remove Bragg peaks from diffraction images. These steps avoid the mixing of signal associated with sharp features into the signal associated with larger-scale, cloudy diffuse features. The sharply varying features (e.g. streaks) are an important component of the signal; however, to avoid artifacts in analysis, we prefer to measure them on a grid that is fine enough to resolve them [17]. If the sampling is finer than one measurement per integer Miller index, but still too coarse to resolve the halos, and if the halo intensity is nevertheless included (as in Ref. [31• •]) then the measurements at integer Miller indices may be segregated from the rest of the data and analyzed separately.
To sample the data on a grid that includes points at Miller indices (corresponding to where the Bragg peaks are located), and, for finer sampling, points corresponding to integer subdivisions of Miller indices. Sampling strategies that are not tied to the reciprocal lattice also are valid (as used in Refs. [28••,30]); however, on-lattice strategies enable leveraging of existing crystallographic analysis and modeling tools for diffuse scattering.
Efforts are now underway to decrease the burden of diffuse data integration and make diffuse data collection accessible for any protein crystallography lab. Recent algorithmic improvements have led to scalable, parallelized methods for real-time processing of single-crystal synchrotron data, decreasing the time required to extract a diffuse dataset from diffraction images. These improvements aim to keep pace with real-time analysis of Bragg data at high frame rates, such as those expected at LCLS-II and euXFEL. Initial tests mapped staphylococcal nuclease diffuse data onto a fine-grained reciprocal lattice, using two samples per Miller index [33•]. This implementation of the Lunus software is capable of processing thousands of diffraction images within a few minutes on a small computing cluster.
In addition to improving the scalability of diffuse scattering data processing, efforts are underway to create a pushbutton diffuse data processing pipeline. The Sematura pipeline (https://github.com/fraser-lab/diffuse_scattering) was inspired by the user-friendly environment provided by software for analyzing Bragg peaks, such as xia2 [34]. To ensure portability the project was built upon the CCTBX framework [35], with future work focusing on developing Sematura as a CCTBX module for ease of access.
Building and refining models of protein motions
Liquid-like motions
After early experiments on tropomyosin [15], the liquid-like motions (LLM) model became a key tool in interpreting diffuse features in diffraction images [4,6]. In the LLM model, the crystal is treated as a soft material. All atoms are assumed to exhibit statistically identical normally distributed displacements about their mean position. The correlation between atom displacements is a decreasing function of the distance between the atoms, usually an exponential decay. A LLM was used to interpret early 3D diffuse data sets, refined using a correlation coefficient as a target function [18]. Successful refinement of a LLM model was used to demonstrate that interpretable diffuse data-sets can be extracted from Bragg diffraction experiments, when data collection is not specifically targeted at measuring the diffuse signal [19••]. Peck et al. [31••] recently found the ability of the LLM to include correlations across unit cell boundaries was essential for modeling the diffuse signal in several 3D datasets. This result is intuitive, as many atoms in a typical protein crystal are within 5–10 Å of symmetry related molecules. Overall the LLM model has proven to be a simple means of explaining the general features of the data with a straightforward interpretation, and therefore remains an important first approach to analysis of protein diffuse X-ray scattering data.
Normal mode analysis and elastic network models
Beyond the LLM model, normal mode analysis (NMA) of elastic network models (ENMs) can provide insights into the soft modes of protein dynamics in more detail, helping to reveal mechanisms that bridge protein structure and function [36]. In an ENM, the atoms of the crystal structure are connected by springs, and the resulting network is coupled to a thermal bath. NMA then yields the covariance matrix of atom displacements. The diagonal elements of the covariance matrix correspond to the crystallographic B factors. ENMs are often used to predict B factors, which come from the Bragg analysis through the crystal structure model. However, Riccardi et al. [37] showed how to renormalize the entire covariance matrix using the crystallographic B factors. Importantly, this renormalization enables any ENM to be entirely consistent with the Bragg data, while preserving differences in diffuse scattering. Diffuse scattering could help differentiate between these ENMs because the off-diagonal elements directly influence the diffuse signal. Thus, there is an opportunity for carefully measured diffuse data to be used in refinement of ENM models, and subsequent refinement of models of protein structure and dynamics.
Indeed, many key elements needed for refinement of normal modes models using diffuse scattering already have been demonstrated. Cloudy diffuse features in X-ray diffraction from lysozyme crystals resemble the diffuse scattering predicted from simulations of normal modes models [9,13]. Similarly, sharper diffuse features in the neighborhood of Bragg peaks in ribonuclease crystals can be captured by lattice normal modes [38]. Different varieties of ENMs for staphylococcal nuclease give rise to distinct diffuse scattering patterns, even when renormalized using the crystallographic B factors [37].
Three-dimensional diffuse scattering data from trypsin and proline isomerase (CypA) recently were modeled using ENMs [19••]. The agreement was substantial, considering that the models were not refined. On the other hand, Peck et al. [31••] found a low agreement between ENM models and diffuse data. How much can refinement improve the agreement of an ENM model? Here we provide an example. In our example, the asymmetric unit of PDB ID 4WOR was expanded to the P1 unit cell, and an ENM was constructed as in Ref. [19••]. The spring force constants between C-alpha atoms were computed as e−rij/λ, where rij is the closest distance between atoms i and j, either in the same unit cell or in neighboring unit cells of the crystal structure. All atoms on the same residue as the C-alpha were assumed to move rigidly as a unit. The initial value λ = 10.5 Å yielded a linear correlation of 0.07 with the anisotropic component of the diffuse data, as computed in Ref. [19••]. Powell minimization using the scipy.optimize.minimize method was used to refine the value of λ, using the negative correlation as a target. The final correlation was 0.54 for a value λ = 0.157 Å — a substantial improvement, but one that indicates that the direct interactions are essentially limited to nearest atomic neighbors. Simulated diffuse intensity in diffraction images calculated using the model vs. the data show similarities in cloudy diffuse features (Figure 2). Key strategies for improving the model are: extending from a C-alpha network to an all-atom network; using crystalline normal modes that extend beyond a single unit cell (prior studies used the Born von Karman method to compute these modes [37,38], but did not fully include the resulting modes in the thermal diffuse scattering calculation [39]); and allowing spring constants to deviate locally from the exponential behavior. Optimizing this type of model has applications beyond diffuse scattering validation and model refinement, as structures derived from normal modes analysis of network models have been useful for providing alternative starting points for molecular replacement [40] and have recently been used in an exciting local refinement procedure in cryo electron microscopy [41].
Figure 2.

Comparison of simulated diffuse intensity in diffraction images computed from (a) a refined ENM of staphylococcal nuclease and (b) experimental data from Ref. [18].
Ensemble refinement
A great promise of diffuse scattering is the potential to validate ensemble or multiconformer models of protein structures (Figure 1). As for as with Translation-Libration-Screw (TLS) refinement [42•] and ENM models, diffuse signal might differ for ensembles that result in the same average structure. Even if information about atomistic conformations remains out of reach, the signal could potentially be leveraged to improve ensemble models derived from time-averaged refinement using the scheme by Gros and colleagues [43]. Currently, this procedure operates on the rationale that large scale deviations can be modeled using a TLS model, and the residual local deviations are then sampled by a molecular dynamics simulation with a time-averaged difference electron density term. Our work has revealed that diffuse scattering calculated from TLS models of disorder do not match the measured diffuse signal, however, indicating that TLS is a poor descriptor of the disorder within the protein crystals we considered [19••]. Given the improvements seen when including neighboring unit cells in LLM models [31••], the disorder of the crystal environment might be better accounted for by a coarse-grained model of intramolecular motion using a NMA model refined against the diffuse scattering signal. In addition, due to the limited number of copies in ensemble models, they can exhibit artificially long length scales compared to molecular dynamics simulations, which contain orders of magnitude more finely time-sliced ‘snapshots’ of the system [44]. Ensemble models of diffuse scattering data will therefore need to include the effect of decoherence corresponding to the finer scale motions that are filtered out in conformational selection. Once large-scale disorder is accounted for by NMA, local anharmonic deviations from the modes can be explored using MD simulations restrained by the X-ray data. As diffuse analysis becomes more sensitive, the selection of the final representative ensemble also might be optimized against the diffuse data. This selection step could supplement the current practice of selecting an ensemble that matches the Bragg data.
Figure 1.
A typical detector image in X-ray crystallography (from [19••]) (upper, left) records all of the X-rays scattered by a protein crystal during a single exposure. Dark pixels correspond to high X-ray intensities. A cartoon crystal is depicted (lower, left) that contains a series of unit cells, with the contents of any given unit cell adopting one of two conformations (the conformations are expected to be more varied in a real protein crystal). Conformation A is shown in orange, while conformation B is shown in black (lower panel). During analysis, data are reduced by examining only the Bragg peaks (upper, middle), which report on the average charge density within a unit cell (lower, middle). The electron density is shown in blue, with areas of especially strong charge highlighted in purple. While multiple conformations may be modeled into the average density, assigning which conformations occur together across residues requires additional information. Current modeling practices use geometric constraints to help classify different alternative conformation groups. The diffuse scattering left behind during data reduction (upper, right) is an additional potential source of such information. Diffuse scattering includes an isotropic component that is determined both by protein and solvent scattering [21,22••], and an anisotropic component that is dominated by correlated protein motions within the crystal [22••]. Analyzing this anisotropic signal might help to distinguish networks of residues that move together (lower, right).
Molecular dynamics simulations
In addition to refining models of protein motions, diffuse scattering can be used to validate MD simulations [7,9,21,22••,33•,45–47]. Early efforts were hindered by the use of 10 ns or shorter simulation durations [7,9,21,45], which lacked sufficient sampling for the calculations. Microsecond duration simulations of protein crystals are now becoming routine [22••,33•,48,49•]. For staphylococcal nuclease, microsecond simulations overcome the sampling limitations for diffuse scattering calculations, while providing insight into ligand binding and catalysis [22••].
The agreement of the total diffuse intensity with MD simulations is high for staphylococcal nuclease [22,46], yielding a linear correlation of 0.94 for a microsecond simulation [22••]. Agreement with the 10-fold weaker anisotropic component is lower [22••,33•], but is more sensitive to the details of the simulation, creating opportunities for increasing the accuracy of MD models. Expanding the staphylococcal nuclease model from a single periodic unit cell to a 2 × 2 × 2 supercell increased the correlation with the anisotropic component to 1.6 Å resolution from 0.42 to 0.68 for a microsecond simulation [33•].
Even though MD simulations provide a picture of crystalline dynamics at atomic detail, the accuracy of the published MD models is not yet high enough to validate the atomic details using diffuse scattering. Although we do not know what level of accuracy will be required for diffuse scattering to reveal the atomic details of protein motions, the possibility of validating the atomic details of protein motions strongly motivates improving the MD models. Ideas for improving the MD model include: increasing the size of the supercell even further, to 3x3x3 or beyond; improving force fields; increasing the simulation duration; and introducing crystal imperfections such as vacancies (missing copies of the protein) or dislocations. It is also possible that higher quality experimental data would be required to improve the model. Additional insights for increasing model accuracy might come from solid state NMR (ssNMR) experiments combined with crystalline protein simulations [50–52], which create opportunities for joint validation of MD simulations using crystallography and NMR.
Phasing and resolution extension
In a high-profile publication, the Chapman and Fromme groups integrated the first three-dimensional diffuse scattering dataset from a serial femtosecond protein crystallography experiment at an X-ray free electron laser [28••]. Their analysis focused on the potential for phasing and resolution extension of a charge density map of photosystem II (PSII). In this study, the method, based on the difference-map algorithm [53], depends critically on the assumption that the diffuse signal is proportional to the molecular transform of the PSII dimer. In this respect, the work is closely related to that of Stroud and Agard [54] and Makowski [55] on phasing using continuous diffraction data. In Ref. [28••], and in a follow-on study using improved data integration [56], the validity of the method was argued by assuming independent, rigid body motions of the dimer.
Using diffuse scattering for phasing and resolution extension might prove to be useful in rare cases when diffuse scattering extends to higher resolution than the Bragg diffraction; however, many technical questions remain both about the origin of diffuse scattering in PSII and the role of diffuse scattering in yielding the resulting charge density. What effect does the presence of Bragg peaks have on phasing and resolution extension in the 4.5–3.5 Å range, which is where the diffuse intensity was measured? How does the improvement in the PSII map compare to what would be obtained by using randomized intensities, due to the free lunch effect [57] and solvent flattening [58]? How would the R-factors in the extended resolution range reported in PDB 5E79 compare to pseudo-crystallographic refinement [59] of using either random intensities or the uniform average intensity in these bins? How robust are the improved features of the charge density in Ref. [28••] to omit map analysis [60], especially at the solvent/protein interface? Might a LLM model (or a ENM or MD model) more accurately describe the diffuse scattering than rigid-body translations of PSII dimers (in Ref. [56], the agreement was improved when the intensities were convoluted with a 4 × 4 × 4 voxel kernel)? Can the model be improved by assuming the rigid units are coupled instead of independent [8], or if the model included rotations as well as translations [14] (in Ref. [56], the intensities were rotationally blurred, but this does not correspond to rigid-body rotations [61])? What is the role of substitution disorder [62] (e.g. unit cells in which one or more copies of the PSII dimer are missing) in determining the diffuse signal? Understanding the implications of this method for protein crystallography will rely on answering these questions and determining whether signals at higher angle than Bragg are commonly observed.
Future perspective
The massive investment in structural genomics in the 2000s dramatically increased the robustness of X-ray crystallography data collection, processing, and refinement. The resulting technological improvements and standardizations have led to more robust methods and instrumentation for data collection that are well-aligned with the requirements for diffuse scattering experiments, enabling measurement of diffuse scattering data from traditional crystallography experiments [19••,31••]. These advances, along with software that makes the data processing and analysis more accessible, will enable diffuse scattering studies at any modern beamline, by any crystallography lab. Why should crystallographers take advantage of this offering? Diffuse features in protein crystallography can be myriad and complex: a mixture streaks, satellite reflections, isotropic scattering, cloudy patterns, circles, and others. Sometimes the diffuse signal appears to reflect imperfections in crystal packing, and offers a possible explanation for why a structure cannot be solved. In other cases, the diffuse signal might be so complicated as to be uninterpretable, or might be so weak that it is difficult to learn anything interesting.
Integrating diffuse scattering with Bragg diffraction to improve crystallographic models could become a major application [17,32,63]. Although assuming proteins are rigid provides the greatest potential for phasing using diffuse scattering data [28••,56], multiple studies of both Bragg and diffuse scattering point to a more dynamic picture of crystalline proteins. A model with internal motions such as the LLM tends to obscure the molecular transform signal and to limit the information to what is available from the crystal transform, at Miller indices [31••]. Nevertheless, because the diffuse signal can extend well beyond the resolution limit of the Bragg peaks in rare cases, it still allows for the possibility of resolution extension. The blurring of the signal implied by the LLM means there is a loss of information in the diffuse Patterson function at long distances, so the path to resolution extension might require model refinement in addition to, or instead of, direct methods. In addition, the apparent success of the LLM [4,6,17–19••,31••] and MD simulations [21,22••,33•,46,47] in obtaining insights into diffuse scattering data points to a picture in which internal motions are important. This opens up the possibility that diffuse scattering can be used to reveal atomic models of protein motions, a possibility that is eliminated when proteins are treated as rigid units.
Ironically, the strongly diffracting model systems that have enabled experimental measurements of diffuse scattering may contain less informative signals than more poorly ordered crystals. In the case of poorly ordered systems, multiple significant conformational minima may co-exist in the crystal. This disorder would limit the power of Bragg diffraction, while the correlations present in the ensemble or the spread of conformations between extremes [64] would lead to diffuse signal. However, increased disorder, which may or may not be biochemically meaningful, may also present additional challenges in processing the diffuse scattering data. Nevertheless, a small but growing number of systems have shown simpler patterns of strong diffuse features that appear to be connected to protein motions. Modeling of diffuse scattering for such systems has improved substantially in recent years. For some models, like normal modes, the agreement with the diffuse signal is still relatively weak (Figure 2); for others, such as MD, the agreement is stronger [33•], but still lower than what is typical in Bragg analysis.
What will be required to take diffuse scattering to the next level and make it an equal partner to Bragg analysis? We can look to traditional crystallography for clues. In Bragg analysis, using model phases, the details of the atomic structure only are revealed when the model of the whole system is sufficiently accurate. As the accuracy is increased, the effect of refining the model of small numbers of atoms can be seen in the analysis. When we have a sufficiently accurate model for all the diffracted X-rays, diffuse scattering might become sensitive to atomic details of correlated variations, with the caveat that the diffuse signal might contain less information than the Bragg diffraction, as measurements nearby in reciprocal space can be correlated. If atomic details of correlated motions can be revealed for the systems that have the clearest diffuse signal, diffuse scattering would then begin to provide very interesting information for these systems. As capabilities evolve, more complex patterns might be tackled and the analysis methods applied more generally. Meanwhile, it will be important to scour the diffraction image repositories such as SBGrid Databank [65] or CXIdb [66] and archive raw data from new experiments, identifying cases where the diffuse signal is strong and amenable to analysis [67].
Despite being present in all macromolecular diffraction patterns, the origins of diffuse scattering in protein crystallography are in many cases still mysterious. Our current short-term outlook is that, for a small number of cases, whether it is due to long-range [28••] or short-range disorder [19••,31••], diffuse scattering will provide valuable information for structural modeling. The types of conformational heterogeneity that can be validated and, potentially, refined against diffuse scattering data can guide us to define better models of protein structure and dynamics. As the structural biology toolkit expands, X-ray scattering, including diffuse scattering, still provides unique capabilities to probe conformational ensembles over many length scales, as captured in the recent review by Meisburger et al. [26••]. Ultimately, the better models of concerted motions will have far ranging impact beyond the average structure that is accessible using conventional X-ray crystallography and cryo-electron microscopy data, yielding a deeper understanding of biochemical mechanism [68].
Acknowledgments
We thank TJ Lane, R Stroud, H Chapman, and K Ayyer, GN Phillips, Jr, and anonymous reviewers for helpful comments on the preprint. JSF is supported by a Pew Scholar Award from the Pew Charitable Trusts, a Packard Fellowship from the David and Lucile Packard Foundation, NSF STC-1231306, NIH GM123159, and NIH GM124149. JSF and MEW are supported by UC Office of the President Laboratory Fees Research Program LFR-17-476732. MEW is supported by US Department of Energy, via the Exascale Computing Project.
Footnotes
Los Alamos National Laboratory Unclassified Release #LA-UR-17-30486.
Conflict of interest
The authors declare there is no conflict of interest.
References and recommended reading
Papers of particular interest, published within the period of review, have been highlighted as:
• of special interest
•• of outstanding interest
- 1.Burley SK, Berman HM, Kleywegt GJ, Markley JL, Nakamura H, Velankar S. Protein Data Bank (PDB): the single global macromolecular structure archive. Methods Mol Biol. 2017;1607:627–641. doi: 10.1007/978-1-4939-7000-1_26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Welberry TR. Diffuse X-Ray Scattering and Models of Disorder. Oxford: Oxford University Press; 2004. [Google Scholar]
- 3.Keen DA, Goodwin AL. The crystallography of correlated disorder. Nature. 2015;521:303–309. doi: 10.1038/nature14453. [DOI] [PubMed] [Google Scholar]
- 4.Caspar DL, Clarage J, Salunke DM, Clarage M. Liquid-like movements in crystalline insulin. Nature. 1988;332:659–662. doi: 10.1038/332659a0. [DOI] [PubMed] [Google Scholar]
- 5.Chacko S, Phillips GN., Jr Diffuse X-ray scattering from tropomyosin crystals. Biophys J. 1992;61:1256–1266. doi: 10.1016/S0006-3495(92)81934-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Clarage JB, Clarage MS, Phillips WC, Sweet RM, Caspar DL. Correlations of atomic movements in lysozyme crystals. Proteins. 1992;12:145–157. doi: 10.1002/prot.340120208. [DOI] [PubMed] [Google Scholar]
- 7.Clarage JB, Romo T, Andrews BK, Pettitt BM, Phillips GN., Jr A sampling problem in molecular dynamics simulations of macromolecules. Proc Natl Acad Sci U S A. 1995;92:3288–3292. doi: 10.1073/pnas.92.8.3288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Doucet J, Benoit JP. Molecular dynamics studied by analysis of the X-ray diffuse scattering from lysozyme crystals. Nature. 1987;325:643–646. doi: 10.1038/325643a0. [DOI] [PubMed] [Google Scholar]
- 9.Faure P, Micu A, Pérahia D, Doucet J, Smith JC, Benoit JP. Correlated intramolecular motions and diffuse X-ray scattering in lysozyme. Nat Struct Biol. 1994;1:124–128. doi: 10.1038/nsb0294-124. [DOI] [PubMed] [Google Scholar]
- 10.Glover ID, Harris GW, Helliwell JR, Moss DS. The variety of X-ray diffuse-scattering from macromolecular crystals and its respective components. Acta Crystallogr B. 1991;47:960–968. [Google Scholar]
- 11.Helliwell JR, Glover ID, Jones A, Pantos E, Moss DS. Protein dynamics — use of computer-graphics and protein crystal diffuse-scattering recorded with synchrotron X-radiation. Biochem Soc Transact. 1986;14:653–655. [Google Scholar]
- 12.Kolatkar AR, Clarage JB, Phillips GN., Jr Analysis of diffuse scattering from yeast initiator tRNA crystals. Acta Crystallogr D. 1994;50:210–218. doi: 10.1107/S0907444993011692. [DOI] [PubMed] [Google Scholar]
- 13.Mizuguchi K, Kidera A, Go N. Collective motions in proteins investigated by X-ray diffuse scattering. Proteins. 1994;18:34–48. doi: 10.1002/prot.340180106. [DOI] [PubMed] [Google Scholar]
- 14.Perez J, Faure P, Benoit JP. Molecular rigid-body displacements in a tetragonal lysozyme crystal confirmed by X-ray diffuse scattering. Acta Crystallogr D: Biol Crystallogr. 1996;52:722–729. doi: 10.1107/S0907444996002594. [DOI] [PubMed] [Google Scholar]
- 15.Phillips GN, Jr, Fillers JP, Cohen C. Motions of tropomyosin. Crystal as metaphor. Biophys J. 1980;32:485–502. doi: 10.1016/S0006-3495(80)84985-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16•.Polikanov YS, Moore PB. Acoustic vibrations contribute to the diffuse scatter produced by ribosome crystals. Acta Crystallogr D: Biol Crystallogr. 2015;71:2021–2031. doi: 10.1107/S1399004715013838. Careful analysis of the diffuse scattering present in 70S ribosome crystals revealed that lattice vibrations may explain a significant portion of the diffuse signal, highlighting the importance of models that account for correlated variations across unit cell boundaries. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wall ME, Clarage JB, Phillips GN. Motions of calmodulin characterized using both Bragg and diffuse X-ray scattering. Structure. 1997;5:1599–1612. doi: 10.1016/s0969-2126(97)00308-0. [DOI] [PubMed] [Google Scholar]
- 18.Wall ME, Ealick SE, Gruner SM. Three-dimensional diffuse X-ray scattering from crystals of staphylococcal nuclease. Proc Natl Acad Sci U S A. 1997;94:6180–6184. doi: 10.1073/pnas.94.12.6180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19••.Van Benschoten AH, Liu L, Gonzalez A, Brewster AS, Sauter NK, Fraser JS, Wall ME. Measuring and modeling diffuse scattering in protein X-ray crystallography. Proc Natl Acad Sci U S A. 2016;113:4069–4074. doi: 10.1073/pnas.1524048113. Diffuse data is extracted from data collected under optimal conditions for Bragg analysis, revealing that modern PAD detectors and fine phi slicing can make diffuse data widely available. Also, LLM and normal modes models of disorder account for a substantial portion of the diffuse signal isolated from cypa and trypsin. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Wall ME, Adams PD, Fraser JS, Sauter NK. Diffuse X-ray scattering to model protein motions. Structure. 2014;22:182–184. doi: 10.1016/j.str.2014.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Meinhold L, Smith JC. Correlated dynamics determining X-ray diffuse scattering from a crystalline protein revealed by molecular dynamics simulation. Phys Rev Lett. 2005;95:218103. doi: 10.1103/PhysRevLett.95.218103. [DOI] [PubMed] [Google Scholar]
- 22••.Wall ME, Van Benschoten AH, Sauter NK, Adams PD, Fraser JS, Terwilliger TC. Conformational dynamics of a crystalline protein from microsecond-scale molecular dynamics simulations and diffuse X-ray scattering. Proc Natl Acad Sci U S A. 2014;111:17887–17892. doi: 10.1073/pnas.1416744111. In this study, the authors demonstrate the importance of long time-scale simulations to accurately sample a protein’s correlated motions. The improvement is most evident when examining the anisotropic portion of the diffuse signal attributed to protein dynamics alone. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Holton JM, Classen S, Frankel KA, Tainer JA. The R-factor gap in macromolecular crystallography: an untapped potential for insights on accurate structures. FEBS J. 2014;281:4046–4060. doi: 10.1111/febs.12922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Fraser JS, van den Bedem H, Samelson AJ, Lang PT, Holton JM, Echols N, Alber T. Accessing protein conformational ensembles using room-temperature X-ray crystallography. Proc Natl Acad Sci U S A. 2011;108:16247–16252. doi: 10.1073/pnas.1111325108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wall ME, Sweet RM, Ando N, Fraser JS, Phillips GN. Measurement and Interpretation of Diffuse Scattering in X-Ray Diffraction for Macromolecular Crystallography. 2017 https://www.osti.gov/scitech/biblio/1400134.
- 26••.Meisburger SP, Thomas WC, Watkins MB, Ando N. X-ray scattering studies of protein structural dynamics. Chem Rev. 2017;117:7615–7672. doi: 10.1021/acs.chemrev.6b00790. This excellent review thoroughly lays out the connection between diffuse scattering, solution scattering, and crystallography. The assumptions and limitations of various approaches to analyzing diffuse data are clearly explained, and several disorder models are explored using case studies of biochemical interest. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wall ME. (PhD) Physics. Princeton University; 1996. Diffuse features in X-ray diffraction from protein crystals. [Google Scholar]
- 28••.Ayyer K, Yefanov OM, Oberthur D, Roy-Chowdhury S, Galli L, Mariani V, Basu S, Coe J, Conrad CE, Fromme R, et al. Macromolecular diffractive imaging using imperfect crystals. Nature. 2016;530:202–206. doi: 10.1038/nature16949. This work extends the analysis of diffuse X-ray scattering into the realm of XFELS and serial crystallography, while also advocating for the use of diffuse scattering for phasing and resolution extension. Rigid body translations of the PSII dimer are the proposed source of the diffuse signal. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29•.Sui S, Wang Y, Kolewe KW, Srajer V, Henning R, Schiffman JD, Dimitrakopoulos C, Perry SL. Graphene-based microfluidics for serial crystallography. Lab Chip. 2016;16:3082–3096. doi: 10.1039/c6lc00451b. Graphene coated microfluidic chips enable collection of diffraction data with very high signal-to-noise, and may provide an alternative to jet based delivery systems for SFX experiments. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Estermann MA, Steurer W. Diffuse scattering data acquisition techniques. Phase Transit. 1998;67:165–195. [Google Scholar]
- 31••.Peck A, Poitevin F, Lane TJ. Intermolecular correlations are necessary to explain diffuse scattering from protein crystals. IUCrJ. 2018;5 doi: 10.1107/S2052252518001124. in press. The authors approach diffuse scattering from a modelling perspective, and rigorously test current disorder models against a series of experimental datasets. Quantitative tests allow them to discern that most disorder models explain a limited portion of the diffuse signal, and that a LLM model is the best option, especially when correlations across unit cell boundaries are included. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wall ME. Methods and software for diffuse X-ray scattering from protein crystals. Methods Mol Biol. 2009;544:269–279. doi: 10.1007/978-1-59745-483-4_17. [DOI] [PubMed] [Google Scholar]
- 33•.Wall ME. Internal protein motions in molecular dynamics simulations of Bragg and diffuse X-ray scattering. IUCrJ. 2017;5 doi: 10.1107/S2052252518000519. A molecular dynamics simulation of diffuse X-ray scattering from staphylococcal nuclease crystals is greatly improved when the unit cell model is expanded to a 2 × 2 × 2 layout of eight unit cells. The dynamics are dominated by internal protein motions rather than rigid packing interactions. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Winter G, Lobley CM, Prince SM. Decision making in xia2. Acta Crystallogr D: Biol Crystallogr. 2013;69:1260–1273. doi: 10.1107/S0907444913015308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Grosse-Kunstleve RW, Sauter NK, Moriarty NW, Adams PD. The Computational Crystallography Toolbox: crystallographic algorithms in a reusable software framework. J Appl Crystallogr. 2002;35:126–136. [Google Scholar]
- 36.Haliloglu T, Bahar I. Adaptability of protein structures to enable functional interactions and evolutionary implications. Curr Opin Struct Biol. 2015;35:17–23. doi: 10.1016/j.sbi.2015.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Riccardi D, Cui Q, Phillips GN., Jr Evaluating elastic network models of crystalline biological molecules with temperature factors, correlated motions, and diffuse X-ray scattering. Biophys J. 2010;99:2616–2625. doi: 10.1016/j.bpj.2010.08.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Meinhold L, Merzel F, Smith JC. Lattice dynamics of a protein crystal. Phys Rev Lett. 2007;99:138101. doi: 10.1103/PhysRevLett.99.138101. [DOI] [PubMed] [Google Scholar]
- 39.James R. The Optical Principles of the Diffraction of X-Rays. London: Bell; 1948. [Google Scholar]
- 40.Suhre K, Sanejouand YH. ElNemo: a normal mode web server for protein movement analysis and the generation of templates for molecular replacement. Nucleic Acids Res. 2004;32:W610–W614. doi: 10.1093/nar/gkh368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Schilbach S, Hantsche M, Tegunov D, Dienemann C, Wigge C, Urlaub H, Cramer P. Structures of transcription pre-initiation complex with TFIIH and Mediator. Nature. 2017;551:204–209. doi: 10.1038/nature24282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42•.Van Benschoten AH, Afonine PV, Terwilliger TC, Wall ME, Jackson CJ, Sauter NK, Adams PD, Urzhumtsev A, Fraser JS. Predicting X-ray diffuse scattering from translation-libration-screw structural ensembles. Acta Crystallogr D: Biol Crystallogr. 2015;71:1657–1667. doi: 10.1107/S1399004715007415. Predicted diffuse scattering patterns differ substantially across different TLS models derived from the same data. This provides an important proof of principle for the use of diffuse scattering in refinement of macromolecular models. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Burnley BT, Afonine PV, Adams PD, Gros P. Modelling dynamics in protein crystal structures by ensemble refinement. Elife. 2012;1:e00311. doi: 10.7554/eLife.00311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Levin EJ, Kondrashov DA, Wesenberg GE, Phillips GN., Jr Ensemble refinement of protein crystal structures: validation and application. Structure. 2007;15:1040–1052. doi: 10.1016/j.str.2007.06.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Héry S, Genest D, Smith JC. X-ray diffuse scattering and rigid-body motion in crystalline lysozyme probed by molecular dynamics simulation. J Mol Biol. 1998;279:303–319. doi: 10.1006/jmbi.1998.1754. [DOI] [PubMed] [Google Scholar]
- 46.Meinhold L, Smith JC. Fluctuations and correlations in crystalline protein dynamics: a simulation analysis of Staphylococcal nuclease. Biophys J. 2005;88:2554–2563. doi: 10.1529/biophysj.104.056101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Meinhold L, Smith JC. Protein dynamics from X-ray crystallography: anisotropic, global motion in diffuse scattering patterns. Proteins. 2007;66:941–953. doi: 10.1002/prot.21246. [DOI] [PubMed] [Google Scholar]
- 48.Janowski PA, Cerutti DS, Holton J, Case DA. Peptide crystal simulations reveal hidden dynamics. J Am Chem Soc. 2013;135:7938–7948. doi: 10.1021/ja401382y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49•.Janowski PA, Liu C, Deckman J, Case DA. Molecular dynamics simulation of triclinic lysozyme in a crystal lattice. Protein Sci. 2016;25:87–102. doi: 10.1002/pro.2713. MD simulations of lysozyme in a crystalline lattice reveal enhanced agreement with structural models derived from Bragg data. Nonetheless, convergence is slow, the lattice becomes disordered, and fluctuations of residues involved in crystal contacts are too high, indicating the need for improved MD force fields. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Kurauskas V, Izmailov SA, Rogacheva ON, Hessel A, Ayala I, Woodhouse J, Shilova A, Xue Y, Yuwen T, Coquelle N, et al. Slow conformational exchange and overall rocking motion in ubiquitin protein crystals. Nat Commun. 2017;8:145. doi: 10.1038/s41467-017-00165-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Ma P, Xue Y, Coquelle N, Haller JD, Yuwen T, Ayala I, Mikhailovskii O, Willbold D, Colletier JP, Skrynnikov NR, et al. Observing the overall rocking motion of a protein in a crystal. Nat Commun. 2015;6:8361. doi: 10.1038/ncomms9361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Mollica L, Baias M, Lewandowski JR, Wylie BJ, Sperling LJ, Rienstra CM, Emsley L, Blackledge M. Atomic-resolution structural dynamics in crystalline proteins from NMR and molecular simulation. J Phys Chem Lett. 2012;3:3657–3662. doi: 10.1021/jz3016233. [DOI] [PubMed] [Google Scholar]
- 53.Elser V, Millane RP. Reconstruction of an object from its symmetry-averaged diffraction pattern. Acta Crystallogr Sect A. 2008;64:273–279. doi: 10.1107/S0108767307050684. [DOI] [PubMed] [Google Scholar]
- 54.Stroud RM, Agard DA. Structure determination of asymmetric membrane profiles using an iterative Fourier method. Biophys J. 1979;25:495–512. doi: 10.1016/S0006-3495(79)85319-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Makowski L. The use of continuous diffraction data as a phase constraint. 1. One-dimensional theory. J Appl Crystallogr. 1981;14:160–168. [Google Scholar]
- 56.Chapman HN, Yefanov OM, Ayyer K, White TA, Barty A, Morgan A, Mariani V, Oberthuer D, Pande K. Continuous diffraction of molecules and disordered molecular crystals. J Appl Crystallogr. 2017;50:1084–1103. doi: 10.1107/S160057671700749X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Caliandro R, Carrozzini B, Cascarano GL, De Caro L, Giacovazzo C, Siliqi D. Phasing at resolution higher than the experimental resolution. Acta Crystallogr D: Biol Crystallogr. 2005;61:556–565. doi: 10.1107/S090744490500404X. [DOI] [PubMed] [Google Scholar]
- 58.Wang BC. Resolution of phase ambiguity in macromolecular crystallography. Methods Enzymol. 1985;115:90–112. doi: 10.1016/0076-6879(85)15009-3. [DOI] [PubMed] [Google Scholar]
- 59.Fischer N, Neumann P, Konevega AL, Bock LV, Ficner R, Rodnina MV, Stark H. Structure of the E coli ribosome-EF-Tu complex at <3 A resolution by Cs-corrected cryo-EM. Nature. 2015;520:567–570. doi: 10.1038/nature14275. [DOI] [PubMed] [Google Scholar]
- 60.Bhat TN, Cohen GH. OMITMAP: an electron density map suitable for the examination of errors in a macromolecular model. J Appl Cryst. 1984;17:244–248. [Google Scholar]
- 61.Moore PB. On the relationship between diffraction patterns and motions in macromolecular crystals. Structure. 2009;17:1307–1315. doi: 10.1016/j.str.2009.08.015. [DOI] [PubMed] [Google Scholar]
- 62.Guinier A. X-Ray Diffraction in Crystals, Imperfect Crystals, and Amorphous Bodies. San Francisco: W. H. Freeman and Company; 1963. [Google Scholar]
- 63.Clarage JB, Phillips GN., Jr Analysis of diffuse scattering and relation to molecular motion. Methods Enzymol. 1997;277:407–432. doi: 10.1016/s0076-6879(97)77023-x. [DOI] [PubMed] [Google Scholar]
- 64.Chaudhry C, Horwich AL, Brunger AT, Adams PD. Exploring the structural dynamics of the E coli chaperonin GroEL using translation-libration-screw crystallographic refinement of intermediate states. J Mol Biol. 2004;342:229–245. doi: 10.1016/j.jmb.2004.07.015. [DOI] [PubMed] [Google Scholar]
- 65.Meyer PA, Socias S, Key J, Ransey E, Tjon EC, Buschiazzo A, Lei M, Botka C, Withrow J, Neau D, et al. Data publication with the structural biology data grid supports live analysis. Nat Commun. 2016;7:10882. doi: 10.1038/ncomms10882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Maia FR. The coherent X-ray imaging data bank. Nat Methods. 2012;9:854–855. doi: 10.1038/nmeth.2110. [DOI] [PubMed] [Google Scholar]
- 67.Kroon-Batenburg LM, Helliwell JR, McMahon B, Terwilliger TC. Raw diffraction data preservation and reuse: overview, update on practicalities and metadata requirements. IUCrJ. 2017;4:87–99. doi: 10.1107/S2052252516018315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Shoemaker S, Ando N. X-rays in the cryo-EM era: structural biology’s dynamic future. Biochemistry. 2017;57:277–285. doi: 10.1021/acs.biochem.7b01031. [DOI] [PMC free article] [PubMed] [Google Scholar]

