Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2023 Jul 4:2023.07.04.547724. [Version 1] doi: 10.1101/2023.07.04.547724

Modeling the Correlation between Z and B in an X-ray Crystal Structure Refinement

Trixia M Buscagan a, Douglas C Rees a,*
PMCID: PMC10350028  PMID: 37461620

Abstract

We have examined how the refined B-factor changes as a function of Z (the atomic number of a scatterer) at the sulfur site of the [4Fe:4S] cluster of the nitrogenase iron protein by refinement. A simple model is developed that quantitatively captures the observed relationship between Z and B, based on a Gaussian electron density distribution with a constant electron density at the position of the scatterer. From this analysis, the fractional changes in B and Z are found to be similar. The utility of B-factor refinement to potentially distinguish atom types reflects the Z dependence of X-ray atomic scattering factors; the weaker dependence of electron atomic scattering factors on Z implies that distinctions between refined values of B in an electron scattering structure will be less sensitive to the atomic identity of a scatterer than for the case with X-ray-diffraction. This behavior provides an example of the complementary information that can be extracted from different types of scattering studies.

Keywords: occupancy refinement, temperature factor refinement, iron-sulfur clusters, metalloproteins

Introduction

We recently reported a series of selenium-incorporated nitrogenase iron (Fe) protein crystal structures in which mixed occupancies of sulfur and selenium were observed at the chalcogenide sites of the [4Fe:4X] cluster (X = S, Se (Buscagan et al., 2022)). Occupancy refinements of these sites were correlated with shifts in the B-factor, reflecting the well-recognized correlation between occupancy and B-factor parameters that is most frequently encountered with solvents during protein structure refinement (Watenpaugh et al., 1978, Kundrot & Richards, 1987, Bhat, 1989, Jensen, 1990). By fixing the B-factors of the X atoms to match the B-factors of the Fe atoms, consistent with observations on other metalloprotein systems (Wittenborn et al., 2018, Jeoung et al., 2022), we refined occupancies at the mixed chalcogenide sites with minimal residual density in the Fobs - Fcalc difference maps.

From that study, we became interested in characterizing the correlation between the atomic number, Z (as a proxy for occupancy), and the refined B-factor, to address two questions:

  1. for a given change in Z, how much does B change?

  2. can a physical model be devised that captures this relationship?

To address the first question, the sulfide ligands in the [4Fe:4S] clusters of nitrogenase Fe protein PDB data sets 7TPW and 7TPY (with resolutions of 1.18 Å and 1.48 Å, respectively) were modeled with different elements in place of sulfur, and the B-factors refined. In these structures, the sulfurs in the cluster were fully occupied; i.e., no selenium was present. PHENIX (Liebschner et al., 2019), and COOT (Emsley et al., 2010) were used for this analysis, and the cluster coordinates were fixed so that only the B-factors were refined. Neutral scattering factors were used for the cluster atoms. The refined B values are provided in Appendix A. Over a Z range from 7 (N) to 34 (Se), the expected positive correlations between Z and refined B values are evident for both structures (Figure 1). A linear fit to this data for 8 ≤ Z ≤ 25 indicates that over this range, B varies approximately linearly with changes in Z; the fractional change in B relative to the fractional change in Z was found to be approximately 0.9 and 0.8 for the 7TPW and 7TPY data sets, respectively (Appendix A). Thus, the answer to the first question is that for this system, a 10% change in Z results in an approximately 8–9% change in B.

Figure 1.

Figure 1

Refined B2,iso values as a function of Z2/Z1, the ratio of the atomic number of the scatterer refined in the chalcogenide site (Z2) relative to Z1 = 16 (the true scatterer, sulfur), in the [4Fe:4S] cluster of the nitrogenase Fe protein (PDB data sets 7TPW (red circles) and 7TPY (blue circles)). The solid and dashed lines represent the fits to Eq. 4 and Eq. 6, respectively, with Z1 = 16 e and B0 = 6 Å2 for both structures, and B1,iso = 12.0 Å2 and 19.8 Å2 for the 7TPW and 7TPY structures, respectively.

Modeling the relationship between Z and B

To model the relationship between Z and B, we represent a scatterer by a single Gaussian with atomic number Z and overall temperature factor B (Teneyck, 1977). The electron density ρ(r) is then described:

ρ(r)=Z(4πB)3/2e4π2r2/B Eq. 1

with

ρ(0)=Z(4πB)3/2 Eq. 2

It is important to recognize that the B in Eq. 1 and Eq. 2 includes contributions from both the atomic scattering factor B0 and the isothermal temperature factor Biso, with B = B0 + Biso. From Eq. 2 and ρ(0) calculated with the Cromers and Mann atomic scattering factors (Cromer & Mann, 1968) and Biso = 16 Å2, B0 is found to be approximately 8 Å2 and 6 Å2 for N and S, respectively. If the true Z/B for a given atom are Z1 and B1, but the refinement is conducted with Z2, the corresponding B2 will be shifted from the true value to compensate for the incorrect occupancy. We developed two simple models to capture the possible relationship between Z2 and B2:

  • Model 1:B2 is calculated for a given Z2 such that the density at the atomic position, ρ(0), has the same value as for Z1, B1. For a single Gaussian, this is equivalent to equating ρ(0) in Eq. 2 calculated for either Z1, B1 or Z2, B2, which gives
    Z1B13/2=Z2B23/2B2=B1(Z2Z1)2/3 Eq. 3
    B2,iso=(B1,iso+B0)(Z2Z1)2/3B0 Eq. 4
  • The ratio Z2/Z1 corresponds to the occupancy of the Z1 scatterer at the site (to within the approximation that the shape of the atomic scattering factor is independent of Z).

  • Model 2:In this case, B2 is calculated for a given Z2 to minimize the square of the difference density over the atomic volume:
    Δρ2=0(ρ1(r)ρ2(r))24πr2dr=0([Z1(4πB1)3/2e4π2r2/B1][Z2(4πB2)3/2e4π2r2/B2])24πr2dr Eq. 5
    From the condition that Δρ2B2=0 at the minimum, one can derive (Appendix B)
    B2=B1(Z2Z1)2/52(Z2Z1)2/5B2,iso=(B1,iso+B0)(Z2Z1)2/52(Z2Z1)2/5B0 Eq. 6
  • The variations in B2,iso as a function of Z2/Z1 were evaluated from Eq. 4 and Eq. 6 (Figure 1). For these calculations, Z1 = 16 e and B0 = 6 Å2, with B1,iso = 12.0 Å2 and 19.8 Å2 for the 7TPW and 7TPY structures, respectively. (These B1,iso values correspond to the average B-factor for the two Fe sites in each structure; Appendix A.) As illustrated in Figure 1, while both Eq. 4 and Eq. 6 fit the refined B values reasonably well for Z2/Z1 < 1, the fit of Eq. 4 is superior over the entire range tested. This was a surprising result to us, as we anticipated that the Δρ2 model would better capture the structure refinement process; instead, the isolated atom approximation (reflected in the upper limit of r = ∝ in Eq. 5) for a macromolecular structure refinement is evidently less accurate relative to the localized treatment implicit in the derivation of Eq. 4.

Discussion

We recognize that the approximations used to derive the relationship between Z and B in Eq. 4 are inferior to the results of a full structure refinement. Nevertheless, Eq. 4 can provide a useful starting point to evaluate the relationship between Z and B for scatterers involving either species of unknown atomic identity (such as the original analysis of the interstitial ligand in the nitrogenase FeMo-cofactor (Einsle et al., 2002), that prompted the initial development of model 2); partial occupancy (such as solvents (Watenpaugh et al., 1978)); mixed atomic composition (exchange reactions or disorder, as in (Spatzal et al., 2015, Buscagan et al., 2022)); or incorrect modeling of residues adopting distinct flipped orientations, such as the side chains of asparagine and glutamine residues where flipping interchanges N and O atoms (Word et al., 1999, Weichenberger & Sippl, 2006).

The utility of B-factor refinement to potentially distinguish atom types reflects the Z dependence of X-ray atomic scattering factors. An instructive comparison may be drawn to electron atomic scattering factors that depend on the Coulomb potential of the scatterer and have a less significant dependence on Z. In particular, ρ(0)for fully occupied C, N and O atoms calculated using electron scattering factors (parameterized in (Saha et al., 2021)) with a B = 16 Å2 vary by less than 2%, while the corresponding variation between the C and O atoms using X-ray scattering factors is approximately 50%. Consequently, the distinctions between the C, N, and O atoms in an electron scattering map are less evident than in X-ray scattering maps. This property of electron scattering was reflected in our recent structure determination of an antibiotic peptide by micro-electron diffraction, where the orientation of a histidine sidechain in a novel cross link could not be established from an analysis of the B-factors for the two distinct rotamers (Miller et al., 2022). A “multi-messenger” approach using combinations of X-ray (with anomalous scattering, if applicable), electron, and neutron diffraction can provide additional experimental restraints to help resolve ambiguities arising in structure refinements from the correlation between Z and/or occupancy with the B-factor of scatterers.

Acknowledgments

We thank Jens Kaiser for enlightening discussions. The authors are grateful to the Gordon and Betty Moore Foundation, Don and Judy Voet, and the Beckman Institute at Caltech for their generous support of the Molecular Observatory at Caltech. Use of the Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, is supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences under Contract No. DE-AC02–76SF00515. The SSRL Structural Molecular Biology Program is supported by the DOE Office of Biological and Environmental Research, and by the National Institutes of Health, National Institute of General Medical Sciences (including P41GM103393). This work was supported by the National Institute of Health (NIH Grant GM45162) and the Howard Hughes Medical Institute (HHMI).

Appendix A: B-factor refinement of different scatterers in the sulfur sites of a [4Fe:4S] cluster

7TPW refinement
X = element N O Mg S Cl Ti Mn Co Zn Se
atomic number (Z) 7 8 12 16 17 22 25 27 30 34
Z2/Z1 0.44 0.50 0.75 1.00 1.06 1.38 1.56 1.69 1.88 2.13
cluster atom refined B values (Å2)
Fe1 (Å2) 11.82 11.90 11.99 12.02 12.11 12.07 12.12 12.28 12.27 12.30
Fe2 (Å2) 11.86 11.88 12.01 11.98 12.03 12.06 12.03 12.14 12.29 12.18
X3 (Å2) 3.41 5.38 10.85 12.75 13.30 16.23 18.18 19.42 21.87 23.77
X4 (Å2) 2.80 4.61 10.22 11.99 12.49 15.41 17.18 18.53 20.84 22.21
X Bavg2) 3.11 5.00 10.54 12.37 12.90 15.82 17.68 18.98 21.36 22.99
Rwork 14.20 14.16 14.11 14.07 14.09 14.12 14.14 14.17 14.24 14.24
Rfree 16.09 16.03 15.92 15.89 15.84 16.06 16.04 16.04 16.34 16.34
7TPY refinement
X = element N O Mg S Cl Ti Mn Co Zn Se
cluster atom refined B values (Å2)
Fe1 (Å2) 19.82 19.73 19.91 19.73 19.91 19.77 19.60 19.60 19.44 19.38
Fe2 (Å2) 19.84 20.01 20.13 19.83 20.02 20.02 19.75 19.75 19.76 19.59
X3 (Å2) 8.33 10.86 17.83 20.38 21.47 26.02 28.68 30.40 33.01 35.31
X4 (Å2) 7.90 10.39 17.38 19.81 20.96 25.40 28.03 29.80 32.30 34.76
X Bavg2) 8.12 10.63 17.61 20.10 21.22 25.71 28.36 30.10 32.66 35.04
Rwork 17.34 17.25 17.18 17.17 17.23 17.24 17.26 17.23 17.27 17.36
Rfree 19.21 18.98 18.85 19.01 18.92 19.02 19.03 19.00 19.15 19.06

The B-factors are refined as a function of the element in the chalcogenide site of the [4Fe:4S] cluster for two nitrogenase Fe protein structures (PDB IDs 7TPW and 7TPY, with resolutions of 1.18 Å and 1.48 Å, respectively)) detailed in (Buscagan et al., 2022). The cluster sits on a crystallographic two-fold axis, so that the crystallographically unique sites are designated Fe1, Fe2, X3 and X4. Least squares lines fit to the Bavg values of the scatterers at the X position over the range 8 ≤ Z ≤ 25 yielded slopes of 0.692 and 0.987 for the 7TPW and 7TPY data sets, respectively; when normalized to the appropriate Z and B values for S, the fractional changes in B with a fractional change in Z are found to be 0.896 and 0.786, respectively.

Appendix B: Derivation of Equation 6

Eq. 6 can be derived from Eq. 5 as follows. The integral and derivative were evaluated with Mathematica®.

Δρ2=0(ρ1(r)ρ2(r))24πr2dr=0([Z1(4πB1)3/2e4π2r2/B1][Z2(4πB2)3/2e4π2r2/B2])24πr2dr=2π3/2[2Z12B13/28Z1Z2(B1+B2)3/2+2Z22B23/2]Δρ2B2=2π3/2Z2(12Z1(B1+B2)5/23Z22B25/2)=0B25/2(B1+B2)5/2=3Z2122Z1=1(2)5/2Z2Z1(2)5/2B25/2(B1+B2)5/2=Z2Z12B2(B1+B2)=(Z2Z1)2/52B2=(Z2Z1)2/5(B1+B2)B2(2(Z2Z1)2/5)=B1(Z2Z1)2/5B2=B1(Z2Z1)2/5(2(Z2Z1)2/5)

References

  1. Bhat T. N. (1989). Acta Crystallographica Section A 45, 145–146. [DOI] [PubMed] [Google Scholar]
  2. Buscagan T. M., Kaiser J. T. & Rees D. C. (2022). eLife 11, e79311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Cromer D. T. & Mann J. B. (1968). Acta Crystallographica Section A 24, 321–323. [Google Scholar]
  4. Einsle O., Tezcan F. A., Andrade S. L. A., Schmid B., Yoshida M., Howard J. B. & Rees D. C. (2002). Science 297, 1696–1700. [DOI] [PubMed] [Google Scholar]
  5. Emsley P., Lohkamp B., Scott W. G. & Cowtan K. (2010). Acta Crystallographica Section D 66, 486–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Jensen L. H. (1990). Acta Crystallographica Section B 46, 650–653. [DOI] [PubMed] [Google Scholar]
  7. Jeoung J. H., Fesseler J., Domnik L., Klemke F., Sinnreich M., Teutloff C. & Dobbek H. (2022). Angewandte Chemie-International Edition 61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Kundrot C. E. & Richards F. M. (1987). Acta Crystallographica Section B 43, 544–547. [Google Scholar]
  9. Liebschner D., Afonine P. V., Baker M. L., Bunkoczi G., Chen V. B., Croll T. I., Hintze B., Hung L. W., Jain S., McCoy A. J., Moriarty N. W., Oeffner R. D., Poon B. K., Prisant M. G., Read R. J., Richardson J. S., Richardson D. C., Sammito M. D., Sobolev O. V., Stockwell D. H., Terwilliger T. C., Urzhumtsev A. G., Videau L. L., Williams C. J. & Adams P. D. (2019). Acta Crystallographica Section D 75, 861–877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Miller R. D., Iinishi A., Modaresi S. M., Yoo B. K., Curtis T. D., Lariviere P. J., Liang L. B., Son S., Nicolau S., Bargabos R., Morrissette M., Gates M. F., Pitt N., Jakob R. P., Rath P., Maier T., Malyutin A. G., Kaiser J. T., Niles S., Karavas B., Ghiglieri M., Bowman S. E. J., Rees D. C., Hiller S. & Lewis K. (2022). Nature Microbiology 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Saha A., Evans M., Holton T. & Rodriguez J. (2021). Acta Crystallographica Section A 77, a50. [Google Scholar]
  12. Spatzal T., Perez K. A., Howard J. B. & Rees D. C. (2015). eLife 4, e11620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Teneyck L. F. (1977). Acta Crystallographica Section A 33, 486–492. [Google Scholar]
  14. Watenpaugh K. D., Margulis T. N., Sieker L. C. & Jensen L. H. (1978). Journal of Molecular Biology 122, 175–190. [DOI] [PubMed] [Google Scholar]
  15. Weichenberger C. X. & Sippl M. J. (2006). Structure 14, 967–972. [DOI] [PubMed] [Google Scholar]
  16. Wittenborn E. C., Merrouch M., Ueda C., Fradale L., Leger C., Fourmond V., Pandelia M. E., Dementin S. & Drennan C. L. (2018). Elife 7, e39451. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Word J. M., Lovell S. C., Richard J. S. & Richardson D. C. (1999). Journal of Molecular Biology 285, 1735–1747. [DOI] [PubMed] [Google Scholar]

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES