Skip to main content
Journal of Applied Crystallography logoLink to Journal of Applied Crystallography
. 2011 Nov 12;44(Pt 6):1285–1287. doi: 10.1107/S0021889811041768

ANODE: anomalous and heavy-atom density calculation

Andrea Thorn a, George M Sheldrick a,*
PMCID: PMC3246834  PMID: 22477786

The program ANODE determines anomalous (or heavy-atom) densities by reversing the usual procedure for experimental phase determination. Instead of adding a phase shift to the heavy-atom phases to obtain a starting value for the native protein phase, this phase shift is subtracted from the native phase to obtain the heavy-atom substructure phase.

Keywords: anomalous density, heavy-atom density, experimental phasing, computer programs

Abstract

The new program ANODE estimates anomalous or heavy-atom density by reversing the usual procedure for experimental phase determination by methods such as single- and multiple-wavelength anomalous diffraction and single isomorphous replacement anomalous scattering. Instead of adding a phase shift to the heavy-atom phases to obtain a starting value for the native protein phase, this phase shift is subtracted from the native phase to obtain the heavy-atom substructure phase. The required native phase is calculated from the information in a Protein Data Bank file of the structure. The resulting density enables even very weak anomalous scatterers such as sulfur to be located. Potential applications include the identification of unknown atoms and the validation of molecular replacement solutions.

1. Introduction

The programs SHELXC/D/E (Sheldrick, 2008, 2010) adopt a simplified but effective approach to the experimental phasing of macromolecules. SHELXC estimates the marker-atom structure factors |F A| and phase shifts α from the experimental data. The marker atoms are typically heavy metals, halides, selenium or bromine incorporated specifically for phasing, or naturally present metal or sulfur atoms. The |F A| values are used by SHELXD to find the marker-atom positions by integrated Patterson and direct methods. Approximate native phases ϕT are then estimated by adding the phase shifts α to the calculated phases ϕA for the marker-atom substructure:

1.

Given high-quality multiple-wavelength anomalous difffraction (MAD) or single isomorphous replacement anomalous scattering (SIRAS) data, the resulting native phases may suffice to give an interpretable map, but for single-wavelength anomalous difffraction (SAD) and SIR phasing these phases will always need to be improved by density modification, e.g. with SHELXE. In the MAD method, the analysis of data collected at two or more wavelengths close to an absorption edge theoretically enables estimates of |F A| and α to be obtained, which are limited only by the accuracy of the measured data. In the SAD method, |F A| is assumed to be proportional to the absolute value of the anomalous difference Δano = |F hkl| − |F Inline graphic|, and α is assumed to be 90° when Δano is positive and 270° when it is negative. In fact a much better approximation would be Δano = |F A|sinα, with α in the range 0–360°, but it is not possible to deduce two parameters (|F A| and α) from one observation (Δano). That SAD phasing works despite these drastic assumptions is a tribute to the power of density modification, though it helps that these approximations hold best for the reflections with the largest anomalous differences.

If the structure and hence ϕT are known, equation (1) can be rearranged to estimate ϕA:

1.

When |F A| and α originate from SAD data, a map calculated using these values and equation (2) is often colloquially referred to as an ‘anomalous Fourier map’. Such maps were probably first used by Strahs & Kraut (1968). This approach is, however, equally valid for SIR, MAD and SIRAS phasing, for which ‘heavy-atom density map’ might be a more appropriate description. The program ANODE described here simply applies equation (2) to calculate such maps. The phases ϕT are obtained by a structure factor calculation using the information in a Protein Data Bank (PDB; Berman et al., 2000) file, and the α and |F A| values are conveniently provided by the programs SHELXC or XPREP (from SHELXTL; Sheldrick, 2008), which are currently used to estimate these parameters for experimental phasing. This always creates maps with the same choice of unit-cell origin as the input PDB file, but care may still be needed if the crystal symmetry permits alternative indexing (see §2.4).

1.1. Input and output files

ANODE is started by a command line containing a file-name stem and optionally one or more switches:

1.1.

reads a PDB-format file name.ent, or if that is not found name.pdb, and extracts the unit-cell parameters, space-group name and atom coordinates from this file. The file name_fa.hkl from SHELXC or XPREP is then read to extract the reflection indices and the |F A| and α values. For the subsequent calculations, it makes no difference whether these originate from SAD, SIR, SIRAS, MAD or radiation-damage-induced phasing experiments. A structure factor calculation using the information from the PDB file generates phases ϕT for these reflections and ϕA is then calculated for them using equation (2). The switches control the amount of output required and may be used to truncate the data beyond a specified resolution (−d) or multiply the Fourier coefficients by a damping term of the form exp(−8π2 Bsin2θ/λ2). In practice the default settings for these switches are almost always adequate (Thorn, 2011). ANODE calculates the density map by fast Fourier transform and then derives σ, the square root of the variance of the density, and outputs the following:

(1) The average density (in units of σ) at each type of atom site, e.g. ‘S_Met’, using the coordinates from the PDB file.

(2) The heights and coordinates of the unique peaks in the map, and their distances from the nearest atom in the PDB file, taking space-group symmetry and unit-cell translations into account.

(3) A file name.pha containing the map coefficients in a format understood by the program Coot (Emsley et al., 2010) for display of the density.

(4) A file name_fa.res in the same format as written by the program SHELXD for SAD or MAD phasing etc. This can be input into SHELXE (Sheldrick, 2010) for molecular replacement with SAD phasing (Schuermann & Tanner, 2003), e.g. to reduce the model bias often associated with molecular replacement structure solutions.

(5) A listing file name.lsa.

Entering anode without a file-name stem on the command line lists the available options. Fig. 1 shows a flow diagram for ANODE.

Figure 1.

Figure 1

Flow diagram for the ANODE program.

2. Examples

2.1. Very weak sulfur-SAD data

For attempted phasing based on sulfur as anomalous scatterer, it appears that, even when the Friedel differences are too weak for the structure to be solved by SAD phasing, the S atoms can often be readily identified by ANODE. An example is viscotoxin B2 (PDB code 2v9b; Pal et al., 2008), which resisted all attempts to solve the structure by sulfur-SAD despite the availability of synchrotron data at a wavelength of 1.70 Å and the fact that there are six disulfide bonds in the asymmetric unit (it was in fact solved by ab initio direct methods using high-resolution native data from another crystal without using the anomalous data). ANODE found a mean density of 9.20σ at the disulfide S atoms and 6.21σ at the S atoms of the six sulfate ions. The next strongest density at an atomic site was 0.84σ. The corresponding anomalous density map is shown in Fig. 2.

Figure 2.

Figure 2

Anomalous density for viscotoxin B2, contoured at 2.8σ. Even though the anomalous signal was too weak for sulfur-SAD phasing, the six disulfide bridges and the S atoms of the sulfate anions are clearly visible in the anomalous density. The density of the sulfate ions is lower because of their higher mobility and because some of them may be partially replaced by phosphate ions.

2.2. Magic triangle phasing

An attractive alternative to sulfur-SAD and halide soaking (Dauter et al., 2000) is to soak or co-crystallize with the magic triangle (Beck et al., 2008), which contains three iodine atoms that make an equilateral triangle with 6.0 Å sides. This structure is easy to recognize and provides much more phasing power than sulfur-SAD. Using the data of Beck et al. (2008) for tetragonal lysozyme (PDB entry 3e3d), ANODE obtained average densities of 22.6σ at the 12 iodine sites (four bound I3C molecules), 2.71σ for the SD_Met sites, 2.30σ for the SG_Cys sites and 2.30σ at the S atom of the single HEPES buffer molecule. The next strongest density at an atomic site was 0.54σ. The location of sulfur sites in this way might be useful in tracing a structure phased with the help of I3C.

2.3. A MAD example

At first sight one might expect the density maps created by ANODE to show only the atoms with absorption edges straddled by the wavelengths employed in a MAD experiment. However because of their anomalous scattering, other atoms also contribute to the F A values, though less strongly than the targeted atoms. The situation becomes clearer when the MAD heavy-atom map is compared with the SAD maps for the different wavelengths used in the MAD experiment. ANODE was used to analyse the results of a three-wavelength Zn-MAD experiment on thermolysin (PDB code 3fgd; P. Pfeffer, G. Neudert, L. Englert, T. Ritschel, B. Baum & G. Klebe, in preparation) in which excess Zn2+ had been added to ensure full occupancy of the zinc site. All three data sets were collected at beamline 14.2 at the Bessy synchrotron in Berlin to a resolution of 2.06 Å. In addition to strong density at the zinc site, the Ca and S atoms correspond to density maxima. There is also a peak with about 35% of the density of the primary zinc site about 3.25 Å from it. Unlike the calcium and sulfur densities, this density appears as a nearly constant fraction of the primary zinc density, as shown in Table 1, indicating that it must also be a zinc site (possibly connected to the primary site by one or more O atoms). This confirms the conclusions of Holland et al. (1995) based on the native density and chemical environment of the site. The heavy-atom density map is shown in Fig. 3.

Table 1. Heavy-atom densities in σ units from ANODE for the three-wavelength Zn-MAD data for thermolysin with excess Zn2+ .

Whereas the ratio of the average density at the calcium sites to that at the zinc site varies with the wavelength for the three SAD experiments because f′ for zinc varies strongly, the ratio of the density at the unknown site to that at the zinc site is almost constant (at 35%) for the MAD experiment and for the same data treated as three separate SAD experiments, strongly indicating that this site is also occupied by zinc. Anomalous density is also observed for the methionine S atoms but is barely significant.

Data Three wavelengths Inflection point Peak High-energy remote
Experiment MAD SAD SAD SAD
Zn2+ 82.5 55.7 66.4 56.0
Ca2+ (mean) 11.2 15.1 11.1 12.8
SD_Met (mean) 1.8 3.5 2.3 2.9
Unknown 28.5 18.2 24.7 20.1
Ratio Ca2+/Zn2+ 0.136 0.271 0.167 0.229
Ratio unknown/Zn2+ 0.345 0.326 0.372 0.359

Figure 3.

Figure 3

Heavy-atom density from the three-wavelength Zn-MAD data contoured at 3.5σ for thermolysin with excess Zn2+, showing the additional partially occupied zinc site about 3.25 Å from the primary zinc site.

2.4. Inconsistent indexing

When the Laue symmetry is lower than the metric symmetry of the lattice, there are alternative ways of indexing the reflections that are incompatible with each other. Thus the indices in the name_fa.hkl file may not be compatible with those in the data used to generate the PDB file, which may well be from a different crystal. Fortunately, for all but three of the 65 Sohnke space groups, not more than one reorientation matrix needs to be considered. The appropriate matrix is applied to the reflection data when the -i flag is specified on the ANODE command line. For the three Sohnke space groups that can each be indexed in four different ways, the flags -i1, -i2 and -i3 may be used. ANODE outputs a warning when alternative indexing is possible, so the user does not need to be familiar with such technical details.

3. Program availability and distribution

ANODE is available as a Fortran source and as precompiled binaries for most modern Windows, Mac and Linux systems as part of the SHELX system (http://shelx.uni-ac.gwdg.de/SHELX/), which is distributed free for academic use. The statically linked binaries require no further programs, libraries, data files or environment variables except that the program SHELXC (or XPREP) is needed to prepare the name_fa.hkl input file for ANODE.

4. Conclusions

Despite the rather drastic simplifications and approximations involved – in particular the assumption that only one type of anomalous scatterer is present, and the restriction of α to 90 or 270° for SAD data – ANODE proves to be a rather effective way to generate and analyse anomalous or heavy-atom densities. ANODE appears to work well with appreciably weaker anomalous data than are required for experimental phasing, and should prove useful in the identification of unknown atoms (e.g. to distinguish between chloride ions and water molecules), for the validation of molecular replacement solutions (e.g. by locating S atoms) or as part of the validation of the final model.

Acknowledgments

We are grateful to Tobias Beck, Marianna Biadene, Gabor Buncoczi, Judit Debreczeni, Ina Dix, Christian Grosse, Tim Gruene, Uwe Müller and Manfred Weiss for providing test data and/or testing ANODE. Figs. 2 and 3 were prepared using COOT (Emsley et al., 2010) and PyMOL (DeLano, 2002).

References

  1. Beck, T., Krasauskas, A., Gruene, T. & Sheldrick, G. M. (2008). Acta Cryst. D64, 1179–1182. [DOI] [PubMed]
  2. Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N. & Bourne, P. E. (2000). Nucleic Acids Res. 28, 235–242. [DOI] [PMC free article] [PubMed]
  3. Dauter, Z., Dauter, M. & Rajashankar, K. R. (2000). Acta Cryst. D56, 232–237. [DOI] [PubMed]
  4. DeLano, W. L. (2002). PyMOL DeLano Scientific, San Carlos, USA, http://www.pymol.org/.
  5. Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. [DOI] [PMC free article] [PubMed]
  6. Holland, D. R., Hausrath, A. C., Juers, D. & Matthews, B. W. (1995). Protein Sci. 4, 1955–1965. [DOI] [PMC free article] [PubMed]
  7. Pal, A., Debreczeni, J. É., Sevvana, M., Gruene, T., Kahle, B., Zeeck, A. & Sheldrick, G. M. (2008). Acta Cryst. D64, 985–992. [DOI] [PubMed]
  8. Schuermann, J. P. & Tanner, J. J. (2003). Acta Cryst. D59, 1731–1736. [DOI] [PubMed]
  9. Sheldrick, G. M. (2008). Acta Cryst. A64, 112–122. [DOI] [PubMed]
  10. Sheldrick, G. M. (2010). Acta Cryst. D66, 479–485. [DOI] [PMC free article] [PubMed]
  11. Strahs, G. & Kraut, J. (1968). J. Mol. Biol. 35, 503–512. [DOI] [PubMed]
  12. Thorn, A. (2011). PhD thesis, Georg-August-Universität Göttingen, Germany.

Articles from Journal of Applied Crystallography are provided here courtesy of International Union of Crystallography

RESOURCES