Skip to main content
Journal of Applied Crystallography logoLink to Journal of Applied Crystallography
. 2008 Apr 8;41(Pt 3):644–646. doi: 10.1107/S0021889808007966

SnB version 2.3: triplet sieve phasing for centrosymmetric structures

Hongliang Xu a,*, Alexander B Smith b, Nikolaos V Sahinidis b,, Charles M Weeks a
PMCID: PMC2391309  NIHMSID: NIHMS48630  PMID: 18497890

A new version of SnB implements the triplet sieve method for phasing centrosymmetric structures.

Keywords: SnB, triplet sieve method, phasing of centrosymmetric structures, computer programs

Abstract

A new version of the direct-methods program SnB has been developed. This version incorporates the triplet sieve method for phasing centrosymmetric structures in a way that is transparent to users. The triplet sieve procedure may decrease significantly the time required to achieve a solution for such structures.

1. Introduction

SnB is a computer program (Miller et al., 1994; Weeks & Miller, 1999; Weeks et al., 2002) that implements the direct-methods phasing algorithm known as Shake-and-Bake (DeTitta et al., 1994; Weeks et al., 1994). Shake-and-Bake is an example of a ‘multi-solution’ or ‘multi-trial’ procedure (Germain & Woolfson, 1968). First, multiple trial structures are created by a random number generator that is used to assign initial atomic coordinates. These trial structures are then subjected to a dual-space refinement procedure that automatically and repetitively alternates reciprocal-space phase refinement, either by using the tangent formula (Karle & Hauptman, 1956) or by reducing the value of the minimal function (Debaerdemaeker & Woolfson, 1983), with complementary peak picking in real space to impose physical constraints. Potential solutions are identified on the basis of figures of merit such as the minimal function (R min) itself or a crystallographic R factor (R cryst) calculated at the end of SnB refinement.

The time required to achieve a solution depends on (i) the computational time of an individual SnB refinement cycle and (ii) the success rate or percentage of trial structures that refine to solutions. The success rate can be increased by providing a better-than-random set of starting atoms or phases. For example, the phasing program SHELXD (Schneider & Sheldrick, 2002), which is also based on the Shake-and-Bake algorithm, uses Patterson minimum functions (Buerger, 1959; Nordman, 1966) to derive sets of starting atoms that are, in some way, consistent with the Patterson function. Alternatively, the triplet sieve method (Smith et al., 2007) uses an integer minimal principle to provide a subset of perfect, or nearly perfect, initial phases that can be expanded using standard Shake-and-Bake refinement.

Direct methods rely on the fact that the structure invariants or triplet phases,

1.

are approximately equal to 0 if the corresponding values of Inline graphic are large. (N is the number of non-H atoms in the unit cell, and the |E| values are normalized structure factors.) In the centrosymmetric case, Inline graphic equals 0 or 180° only, and, given a subset of Inline graphic which are all equal to 0, it is possible to solve the system of homogeneous equations (1) with the triplet sieve technique and to obtain the desired subset of perfectly correct phases. However, two complications exist. First, the number of phases (NSP) appearing in the triplets involved in the sieving process must be limited to a small number with the largest |E| values in order to avoid inclusion of Inline graphic with values of 180° in the set used for sieving. If such triplets are included, it cannot be guaranteed that the correct set of phases can be found. Consequently, the NSP is significantly smaller than the total number of reflections that need to be phased in an SnB job. In some cases, it will be necessary to reduce the NSP iteratively in order to find a solution, but there is also a minimum value of the NSP below which solutions will never be found. To avoid inclusion of triplets with values of 180°, use of the sieving technique should also be limited to structures with fewer than ˜100 atoms in the asymmetric unit. The second complication to the sieving process is that, depending on the number of phases required to fix the origin in the particular space group, as well as the nature of the triplet interactions among the NSP, the homogeneous system of equations will have a variable number of degrees of freedom, leading to the generation of a variable number of sieve phase sets or trial structures.

2. Materials and methods

The sieving process can be incorporated into the Shake-and-Bake procedure, as illustrated in Fig. 1, with the addition of three new operational parameters. These parameters are the number of sieve phases (NSP), the reduction in the number of phases (SPR) considered in each successive sieving step and the minimum number of phases to be used for sieving (NSPmin). A subroutine implementing the procedure described by Smith et al. (2007) was added to the SnB program, and the additional steps introduced by triplet sieving are indicated by a gray background in the flow chart (Fig. 1). If the structure is centrosymmetric and the number of atoms in the asymmetric unit is less than 100, the variable ‘UseSieve’ is set to TRUE, and the phases of the trial structures are generated by the new subroutine. If the number of degrees of freedom (DF) is too large, the number of phases used for sieving is reduced and the trial phase sets are regenerated. If satisfactory trial structures cannot be generated, the variable ‘UseSieve’ is set to FALSE, and the program reverts to standard SnB operation using trial structures with randomly positioned atoms.

Figure 1.

Figure 1

A flow chart illustrating the SnB algorithm after incorporation of the triplet sieving method.

The modified SnB program was applied to the 15 centrosymmetric test data sets listed in Table 1. First, using the relevant deposited CIF for the compound, basic crystallographic information, including space group, cell parameters and chemical formula, was input to SnB. Then, using the DREAR package (Blessing & Smith, 1999) in SnB, E values were generated from the observed intensity data. As a final initialization step, reflection and invariant files were generated containing 10N reflections and 100N triplets, respectively. Next, the sieving procedure was carried out, and the three sieve parameters were varied in order to find a combination of values that would optimize the efficiency of the Shake-and-Bake procedure for these structures. In all cases, a small number (0.1N) of conventional SnB refinement cycles were added to expand the set of phased reflections and to improve the quality of the phases. Finally, the best values of the sieve parameters were chosen, and a final SnB job was run for each data set in order to measure the time required to obtain a solution. Solutions were identified on the basis of mean phase errors when compared with correct phase sets computed using the known atomic coordinates.

Table 1. Test structures and crystallographic information.

Ratio is the time required to achieve a solution without sieving divided by the time required with sieving.

Sample Chemical formula Atoms (ASU) Space group Ratio Reference
1 C12H10O3 15 Inline graphic 5.4 Howie et al. (2001)
2 C30H32N2O6 19 Inline graphic 14.4 Sun et al. (2002)
3 C34H42B2N2O4 21 Inline graphic 25.6 Kliegel, Amt et al. (2002)
4 C42H56N2O2 23 Inline graphic 12.4 Lynch (2002)
5 C44H38O4 24 Inline graphic 20.8 Vande Velde et al. (2002)
6 C44H52N4·C2H5OH 26 Inline graphic 98.5 Camiolo et al. (2001)
7 C36H62 36 Inline graphic 0.47 Bragg et al. (2002)
8 C30H22O6S 37 Inline graphic 4.1 Krishnakumar et al. (2002)
9 C34H26N2O 37 Inline graphic 81.8 Zhuang et al. (2002)
10 C24H12N6·4CHCl3 46 Inline graphic 16.0 Alfonso & Stoeckli-Evans (2001)
11 C33H49B6N3O11·0.5C4H10O 57.5 Inline graphic 7.9 Kliegel, Drückler et al. (2002)
12 C41H78O11Si8 60 Inline graphic 70.0 Arnold & Blake (2001)
13 C50H66O6·C3H7NO 61 Inline graphic 6.4 Bryan & Levitskaia (2002)
14 3C40H32O2·4C6H6 75 Inline graphic 44.5 Ohba et al. (2002)
15 C60H102N6O18 84 Inline graphic 5.6 Pletnev et al. (1980)

Structures that crystallize with a crystallographic center of symmetry in the middle of the molecule. In such cases, the number of atoms in the asymmetric unit is half of the molecule.

3. Results

As a result of the test jobs, the parameter values given in Table 2 (all a function of the size of the structure) were chosen as default values. The results of the final jobs, showing a comparison of the time required to yield SnB solutions with and without the sieve procedure, are presented in Table 1. This comparison shows that the computing time required for 14 of the 15 test structures is reduced by a factor of 4.1 to 98.5 when sieving is included. The average reduction factor is 29.5. The modified version of the SnB program including the sieve procedure with the default parameter values determined in this study is now available as version 2.3 from the SnB website, http://www.hwi.buffalo.edu/SnB/. Unlike earlier versions of SnB, version 2.3 also contains a tool for automatic solution detection that permits calculations to be terminated as soon as a solution is found. Thus, full advantage can be taken of the new sieving feature.

Table 2. Parameters for the combined SnB/sieve procedure.

Parameter SnB alone SnB with sieving
Total reflections (phases) 10N 10N
Triplets 100N 100N
Peaks to select N N
SnB cycles 0.5N 0.1N
Number of sieve phases (NSP) NA 1.5N–4N
Sieve phase reduction (SPR) NA 0.15N
Minimum number of sieve phases (NSPmin) NA 0.5N

The initial set of sieve reflections is chosen utilizing a binary search for the first instance of non-origin degrees of freedom, as described in §5 of Smith et al. (2007).

Acknowledgments

This work was supported in part by the Joint NSF/NIGMS Initiative to Support Research in the Area of Mathematical Biology under NIH award GM072023.

References

  1. Alfonso, M. & Stoeckli-Evans, H. (2001). Acta Cryst. E57, o242–o244.
  2. Arnold, P. L. & Blake, A. J. (2001). Acta Cryst. E57, o131–o133.
  3. Blessing, R. H. & Smith, G. D. (1999). J. Appl. Cryst. 32, 664–670.
  4. Bragg, S., Johnson, J. E. B., Graziano, G. M., Balaich, G. J. & Heimer, N. E. (2002). Acta Cryst. E58, o1010–o1012.
  5. Bryan, J. C. & Levitskaia, T. G. (2002). Acta Cryst. E58, o240–o242.
  6. Buerger, M. J. (1959). Vector Space New York: John Wiley and Sons.
  7. Camiolo, S., Coles, S. J., Gale, P. A., Hursthouse, M. B. & Paver, M. A. (2001). Acta Cryst. E57, o258–o260.
  8. Debaerdemaeker, T. & Woolfson, M. M. (1983). Acta Cryst. A39, 193–196.
  9. DeTitta, G. T., Weeks, C. M., Thuman, P., Miller, R. & Hauptman, H. A. (1994). Acta Cryst. A50, 203–210. [DOI] [PubMed]
  10. Germain, G. & Woolfson, M. M. (1968). Acta Cryst. B24, 91–96.
  11. Howie, R. A., Skakle, J. M. S. & Wardell, S. M. S. V. (2001). Acta Cryst. E57, o72–o74.
  12. Karle, J. & Hauptman, H. (1956). Acta Cryst. 9, 635–651.
  13. Kliegel, W., Amt, H., Patrick, B. O., Rettig, S. J. & Trotter, J. (2002). Acta Cryst. E58, o473–o475.
  14. Kliegel, W., Drückler, K., Patrick, B. O., Rettig, S. J. & Trotter, J. (2002). Acta Cryst. E58, o393–o395.
  15. Krishnakumar, R. V., Nandhini, M. S., Renuga, S., Natarajan, S., Selvaraj, S. & Perumal, S. (2002). Acta Cryst. E58, o1174–o1176.
  16. Lynch, D. E. (2002). Acta Cryst. E58, o1025–o1027.
  17. Miller, R., Gallo, S. M., Khalak, H. G. & Weeks, C. M. (1994). J. Appl. Cryst. 27, 613–621.
  18. Nordman, C. E. (1966). Trans. Am. Crystallogr. Assoc. 2, 29–38.
  19. Ohba, S., Hiratsuka, T. & Tanaka, K. (2002). Acta Cryst. E58, o1013–o1015.
  20. Pletnev, V. Z., Galitskii, N. M., Smith, G. D., Weeks, C. M. & Duax, W. L. (1980). Biopolymers, 19, 1517–1534.
  21. Schneider, T. R. & Sheldrick, G. M. (2002). Acta Cryst. D58, 1772–1779. [DOI] [PubMed]
  22. Smith, A. B., Xu, H. & Sahinidis, N. V. (2007). Acta Cryst. A63, 164–171. [DOI] [PubMed]
  23. Sun, G.-C., Li, Y.-Z., He, Z.-H., Li, Z.-J., Qu, J.-Q., Liu, C.-R. & Wang, L.-F. (2002). Acta Cryst. E58, o417–o418.
  24. Vande Velde, C. M. L., Hoefnagels, R. & Geise, H. J. (2002). Acta Cryst. E58, o454–o455.
  25. Weeks, C. M., Blessing, R. H., Miller, R., Mungee, R., Potter, S. A., Rappleye, J., Smith, G. D., Xu, H. & Furey, W. (2002). Z. Kristallogr. 217, 686–693.
  26. Weeks, C. M., DeTitta, G. T., Hauptman, H. A., Thuman, P. & Miller, R. (1994). Acta Cryst. A50, 210–220. [DOI] [PubMed]
  27. Weeks, C. M. & Miller, R. (1999). J. Appl. Cryst. 32, 120–124.
  28. Zhuang, J.-P., Zheng, Y. & Zhang, W.-Q. (2002). Acta Cryst. E58, o720–o722.

Articles from Journal of Applied Crystallography are provided here courtesy of International Union of Crystallography

RESOURCES