Abstract
A new version of the direct-methods program SnB has been developed. This version incorporates the triplet sieve method for phasing centrosymmetric structures in a way that is transparent to users. The triplet sieve procedure may decrease significantly the time required to achieve a solution for such structures.
1. Introduction
SnB is a computer program (Miller et al., 1994; Weeks & Miller, 1999; Weeks et al., 2002) that implements the direct-methods phasing algorithm known as Shake-and-Bake (DeTitta et al., 1994; Weeks et al., 1994). Shake-and-Bake is an example of a ‘multi-solution’ or ‘multi-trial’ procedure (Germain & Woolfson, 1968). First, multiple trial structures are created by a random-number generator that is used to assign initial atomic coordinates. Then, these trial structures are subjected to a dual-space refinement procedure that automatically and repetitively alternates reciprocal-space phase refinement, either by using the tangent formula (Karle & Hauptman, 1956) or by reducing the value of the minimal function (Debaerdemaeker & Woolfson, 1983), with complementary peak picking in real-space to impose physical constraints. Potential solutions are identified on the basis of figures of merit such as the minimal function (Rmin) itself or a crystallographic R factor (Rcryst) calculated at the end of SnB refinement.
The time required to achieve a solution depends on (1) the computational time of an individual SnB refinement cycle and (2) the success rate or percentage of trial structures that refine to solutions. Success rate can be increased by providing a better-than-random set of starting atoms or phases. For example, the phasing program SHELXD (Schneider & Sheldrick, 2002), which is also based on the Shake-and-Bake algorithm, uses Patterson minimum functions (Buerger, 1959; Nordman, 1966) to derive sets of starting atoms that are, in some way, consistent with the Patterson function. Alternatively, the triplet sieve method (Smith et al., 2007) uses an integer minimal principle to provide a subset of perfect, or nearly perfect, initial phases that can be expanded using standard Shake-and-Bake refinement.
Direct methods rely on the fact that the structure invariants or triplet phases,
(1) |
are approximately equal to 0 if the corresponding values of AHK = 2N−1/2|EHEKE−H−K| are large. (N is the number of non-H atoms in the unit cell, and the |E|s are normalized structure factors.) In the centrosymmetric case, φHK equals 0 or 180 degrees only, and, given a subset of φHK which are all equal to 0, it is possible to solve the system of homogeneous equations (eqn. 1) with the triplet sieve technique and to obtain the desired subset of perfectly correct phases. However, two complications exist. First, the number of phases (NSP) appearing in the triplets involved in the sieving process must be limited to a small number with the very largest |E| values in order to avoid inclusion of φHK with values of 180 in the set used for sieving. If such triplets are included, it cannot be guaranteed that the correct set of phases can be found. Consequently, NSP is significantly smaller than the total number of reflections that need to be phased in an SnB job. In some cases, it will be necessary to iteratively reduce NSP in order to find a solution, but there is also a minimum value of NSP below which solutions will never be found. To avoid inclusion of triplets with values of 180 degrees, use of the sieving technique should also be limited to structures with fewer than ∼100 atoms in the asymmetric unit. The second complication to the sieving process is that, depending on the number of phases required to fix the origin in the particular space group as well as the nature of the triplet interactions among the NSP phases, the homogeneous system of equations will have a variable number of degrees of freedom leading to the generation of a variable number of sieve phase sets or trial structures.
2. Materials and Methods
The sieving process can be incorporated into the Shake-and-Bake procedure, as illustrated in Figure 1, with the addition of three new operational parameters. These parameters are the number of sieve phases (NSP), the reduction in number of phases (SPR) considered in each successive sieving step, and the minimum number of phases to be used for sieving (NSPmin). A subroutine implementing the procedure described by Smith et al. (2007) was added to the SnB program, and the additional steps introduced by triplet sieving are indicated by a gray background in the flow chart. If the structure is centrosymmetric and the number of atoms in the asymmetric unit is less than 100, the variable ‘UseSieve’ is set to TRUE, and the phases of the trial structures are generated by the new subroutine. If the degrees of freedom (DF) are too large, the number of phases used for sieving is reduced, and the trial phase sets are regenerated. If satisfactory trial structures can not be generated, the variable ‘UseSieve’ is set to FALSE, and the program reverts to standard SnB operation using trial structures with randomly positioned atoms.
The modified SnB program was applied to the 15 centrosymmetric test data sets listed in Table 1. First, using the deposited CIF file, basic crystallographic information including space group, cell parameters, and chemical formula was input to SnB. Then, using the DREAR package (Blessing & Smith, 1999) in SnB, E-values were generated from the observed intensity data. As a final initialization step, reflection and invariant files were generated containing 10N reflections and 100N triplets, respectively. Next, the sieving procedure was carried out, and the three sieve parameters were varied in order to find a combination of values that would optimize the effciency of the Shake-and-Bake procedure for these structures. In all cases, a small number (0.1N) of conventional SnB refinement cycles was added to expand the set of phased reflections and to improve the quality of the phases. Finally, the best values of the sieve parameters were chosen, and a final SnB job was run for each data set in order to measure the time required to obtain a solution. Solutions were identified on the basis of mean phase errors when compared to correct phase sets computed using the known atomic coordinates.
Table 1.
ID | Chemical Formula | Atoms (ASU) | Space Group | Ratio | Reference |
---|---|---|---|---|---|
1 | C12H10O3 | 15 | P21/n | 5.4 | Howie et al. (2001) |
2 | C30H32N2O6 | 19* | P21/c | 14.4 | Sun et al. (2002) |
3 | C34H42B2N2O4 | 21* | P21/n | 25.6 | Kliegel et al. (2002a) |
4 | C42H56N2O2 | 23* | P21/n | 12.4 | Lynch (2002) |
5 | C44H38O4 | 24* | 20.8 | Vande Velde et al. (2002) | |
6 | C44H52N4 · C2H5OH | 26* | C2/c | 98.5 | Camiolo et al. (2001) |
7 | C36H62 | 36 | P21/c | 0.47 | Bragg et al. (2002) |
8 | C30H22O6S | 37 | P21/c | 4.1 | Krishnakumar et al. (2002) |
9 | C34H26N2O | 37 | P21/c | 81.8 | Zhuang et al. (2002) |
10 | C24H12N6 · 4CHCl3 | 46 | P21/n | 16.0 | Alfonso & Stoeckli-Evans (2001) |
11 | C33H49B6N3O11 · 0.5C4H10O | 57.5 | 7.9 | Kliegel et al. (2002b) | |
12 | C41H78O11Si8 | 60 | 70.0 | Arnold & Blake (2001) | |
13 | C50H66O6 · C3H7NO | 61 | P21/c | 6.4 | Bryan & Levitskaia (2002) |
14 | 3C40H32O2 · 4C6H6 | 75* | 44.5 | Ohba et al. (2002) | |
15 | C60H102N6O18 | 84 | 5.6 | Pletnev et al. (1980) |
: Structures which crystallize with a crystallographic center of symmetry in the middle of the molecule. In such cases, the number of atoms in the asymmetric unit is half of the molecule.
3. Results
As a result of the test jobs, the parameter values (all a function of the size of the structure) given in Table 2 were chosen as default values. The results of the final jobs showing a comparison of the time required to yield SnB solutions with and without the sieve procedure is presented in Table 1. This comparison shows that the computing time required for 14 of the 15 test structures is reduced by a factor of 4.1 to 98.5 when sieving is included. The average reduction factor is 29.5.
Table 2.
Parameter | SnB alone | SnB with sieving |
---|---|---|
Total reflections (phases) | 10N | 10N |
Triplets | 100N | 100N |
Peaks to select | N | N |
SnB cycles | 0.5N | 0.1N |
No. sieve phases (NSP) | NA | 1.5N-4N† |
Sieve phase reduction (SPR) | NA | 0.15N |
Min. sieve phases (NSPmin) | NA | 0.5N |
: The initial set of sieve reflections is chosen utilizing a binary search for the first instance of non-origin degrees of freedom as described in section 5 of Smith et al. (2007).
The modified version of the SnB program with the sieve procedure included with the default parameter values determined in this study is now available as version 2.3 from the SnB website, http://www.hwi.buffalo.edu/SnB/. Unlike earlier versions of SnB, version 2.3 also contains a tool for automatic solution detection that permits calculations to be terminated as soon as a solution is found. Thus, full advantage can be taken of the new sieving feature.
Acknowledgments
This work was supported in part by the Joint NSF/NIGMS Initiative to Support Research in the Area of Mathematical Biology under NIH award GM072023.
References
- Alfonso M, Stoeckli-Evans H. Acta Cryst. 2001;E57:o242–o244. [Google Scholar]
- Arnold PL, Blake AJ. Acta Cryst. 2001;E57:o131–o133. [Google Scholar]
- Blessing RH, Smith GD. J. Appl. Cryst. 1999;32:664–670. [Google Scholar]
- Bragg S, Johnson JEB, Graziano GM, alaich GJ, Heimer NE. Acta Cryst. 2002;E58:o1010–o1012. [Google Scholar]
- Bryan JC, Levitskaia TG. Acta Cryst. 2002;E58:o240–o242. [Google Scholar]
- Buerger MJ. Vector Space. John Wiley & Sons; New York: 1959. [Google Scholar]
- Camiolo S, Coles SJ, Gale PA, Hursthouse MB, Paver MA. Acta Cryst. 2001;E57:o258–o260. [Google Scholar]
- Debaerdemaeker T, Woolfson MM. Acta Cryst. 1983;A39:193–196. [Google Scholar]
- DeTitta GT, Weeks CM, Thuman P, Miller R, Hauptman HA. Acta Cryst. 1994;A50:203–210. doi: 10.1107/s0108767393008980. [DOI] [PubMed] [Google Scholar]
- Germain G, Woolfson MM. Acta Cryst. 1968;B24:91–96. [Google Scholar]
- Howie RA, Skakle JMS, Wardell SMSV. Acta Cryst. 2001;E57:o72–o74. [Google Scholar]
- Karle J, Hauptman HA. Acta Cryst. 1956;9:635–651. [Google Scholar]
- Kliegel W, Amt H, Patrick BO, Rettig SJ, Trotter J. Acta Cryst. 2002a;E58:o473–o475. [Google Scholar]
- Kliegel W, Drückler K, Patrick BO, Rettig SJ, Trotter J. Acta Cryst. 2002b;E58:o393–o395. [Google Scholar]
- Krishnakumar RV, Nandhini MS, Renuga S, Natarajan S, Selvaraj S, Perumal S. Acta Cryst. 2002;E58:o1174–o1176. [Google Scholar]
- Lynch DE. Acta Cryst. 2002;E58:o1025–o1027. [Google Scholar]
- Miller R, Gallo SM, Khalak HG, Weeks CM. J. Appl. Cryst. 1994;27:613–621. [Google Scholar]
- Nordman CE. Trans. Am. Crystallogr. Assoc. 1966;2:29–38. [Google Scholar]
- Ohba S, Hiratsuka T, Tanaka K. Acta Cryst. 2002;E58:o1013–o1015. [Google Scholar]
- Pletnev VZ, Galitskii NM, Smith GD, eeks CM, Duax WL. Biopolymers. 1980;19:1517–1534. [Google Scholar]
- Schneider TR, Sheldrick GM. Acta Cryst. 2002;D58:1772–1779. doi: 10.1107/s0907444902011678. [DOI] [PubMed] [Google Scholar]
- Smith AB, Xu H, Sahinidis NV. Acta Cryst. 2007;A63:164–171. doi: 10.1107/S0108767307000621. [DOI] [PubMed] [Google Scholar]
- Sun G-C, Li Y-Z, He Z-H, Li Z-J, Qu J-Q, Liu C-R, Wang L-F. Acta Cryst. 2002;E58:o417–o418. [Google Scholar]
- Vande Velde CML, Hoefnagels R, Geise HJ. Acta Cryst. 2002;E58:o454–o455. [Google Scholar]
- Weeks CM, DeTitta GT, Hauptman HA, Thuman P, Miller R. Acta Cryst. 1994;A50:210–220. doi: 10.1107/s0108767393008992. [DOI] [PubMed] [Google Scholar]
- Weeks CM, Miller R. J. Appl. Cryst. 1999;32:120–124. [Google Scholar]
- Weeks CM, Blessing RH, Miller R, Mungee R, Potter SA, Rappleye J, Smith GD, Xu H, Furey W. Z. Kristallogr. 2002;217:686–693. [Google Scholar]
- Zhuang J-P, Zheng Y, Zhang W-Q. Acta Cryst. 2002;E58:o720–o722. [Google Scholar]