Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Jul 31.
Published in final edited form as: J Am Chem Soc. 2008 Jul 4;130(30):9695–9701. doi: 10.1021/ja8013538

X-ray Structure of Snow Flea Antifreeze Protein Determined by Racemic Crystallization of Synthetic Protein Enantiomers

Brad L Pentelute †,, Zachary P Gates †,, Valentina Tereshko †,§, Jennifer L Dashnau , Jane M Vanderkooi , Anthony A Kossiakoff †,§, Stephen B H Kent †,‡,§,*
PMCID: PMC2719301  NIHMSID: NIHMS122408  PMID: 18598029

Abstract

Chemical protein synthesis and racemic protein crystallization were used to determine the X-ray structure of the snow flea antifreeze protein (sfAFP). Crystal formation from a racemic solution containing equal amounts of the chemically synthesized proteins d-sfAFP and l-sfAFP occurred much more readily than for l-sfAFP alone. More facile crystal formation also occurred from a quasi-racemic mixture of d-sfAFP and l-Se-sfAFP, a chemical protein analogue that contains an additional -SeCH2- moiety at one residue and thus differs slightly from the true enantiomer. Multiple wavelength anomalous dispersion (MAD) phasing from quasi-racemate crystals was then used to determine the X-ray structure of the sfAFP protein molecule. The resulting model was used to solve by molecular replacement the X-ray structure of l-sfAFP to a resolution of 0.98 A. The l-sfAFP molecule is made up of six antiparallel left-handed PPII helixes, stacked in two sets of three, to form a compact brick-like structure with one hydrophilic face and one hydrophobic face. This is a novel experimental protein structure and closely resembles a structural model proposed for sfAFP. These results illustrate the utility of total chemical synthesis combined with racemic crystallization and X-ray crystallography for determining the unknown structure of a protein.

Introduction

Antifreeze proteins are a structurally diverse class of macromolecules that have in common the abilities to effect thermal hysteresis of water (nonequilibrium freezing point depression) and to inhibit ice recrystallization (growth of large ice crystals at the expense of small ones).1 These proteins are critical to the existence of life at subzero temperatures2,3 and also have potential practical applications, e.g. for the storage of organs for use in transplantation. Both antifreeze proteins and antifreeze peptides function by binding to the surface of ice crystals and preventing further crystal growth.4 The exact molecular basis of the binding of antifreeze proteins to ice crystals is still subject to investigation;5 it was originally thought that antifreeze peptides bind to the surface of an ice crystal through H-bonding of polar side chain moieties, for example in Thr or Lys residues, suitably disposed on the peptide molecule. More recent studies of antifreeze proteins have pointed toward key roles for shape complementarity between a flat hydrophobic surface of the protein and the surface of the ice crystal, with both van der Waals-type forces and H-bonds contributing to keep the hydrophobic protein face bound tightly to the ice crystal.1

Two years ago, Davies and co-workers reported the discovery of a novel antifreeze protein from the Canadian snow flea.6 The snow flea antifreeze protein (sfAFP) studied here is the smaller of two protein isoforms that they isolated by ice-affinity purification of snow flea homogenates. The predicted 81-residue amino acid sequence of sfAFP is glycine rich, and is characterized by a repetitive Gly-Xaa-Xaa motif, the presence of several proline residues, and by two intramolecular disulfides, the connectivity of which has not been experimentally determined (Scheme 1). sfAFP has no known homologues, antifreeze protein or otherwise, and no experimental structure has been reported for the folded protein molecule. Recently, Davies and coworkers have proposed a structural model for sfAFP.7 Experimental structure determination has been precluded in the first instance by a simple lack of material—isolation of the natural product gives only microgram quantities, and recombinant expression is reportedly difficult and does not yield significant amounts of protein.7

Scheme 1.

Scheme 1

Predicted 81-Residue Glycine-Rich Amino Acid Sequence of sfAFP 6

Chemical protein synthesis, based on modern ligation methods,8,9 provides a route to the preparation of protein molecules up to 200 or more amino acid residues in size.10 Recent refinements have made total chemical synthesis an increasingly practical way of obtaining useful amounts of protein for study. 1114 In an accompanying paper, we report the total chemical synthesis of the mirror image forms of the snow flea antifreeze protein: l-sfAFP and d-sfAFP;15 the antifreeze activities of both of these folded, synthetic proteins were verified by ice recrystallization inhibition assays.15 With a ready supply of sfAFP to hand, we set out to determine the experimental structure of sfAFP using racemic protein crystallography, a technique that is made possible only by using chemical protein synthesis.

While X-ray structures have been determined for many thousands of distinct protein molecules, it is not infrequently observed that a particular protein may be difficult to crystallize or may form crystals unsuitable for the collection of single-crystal X-ray diffraction data. This has become one of the principal obstacles to the high throughput determination of protein X-ray structures. As will be described below, we encountered severe difficulties in the crystallization of sfAFP, which we were able to overcome as follows.

Proteins found in nature (i.e., proteins made up of l-amino acids and glycine) are inherently chiral entities and can therefore crystallize only in one of the 65 “biological” space groups that lack an inversion center or mirror plane, out of the total of 230 unique space groups that describe all possible crystal symmetries.16 The frequency of protein crystallization in different space groups is markedly nonuniform, with most protein crystals belonging to one of just a few space groups. P212121 is by far the most commonly observed space group for globular protein molecules. Yeates has explained this preference for certain space groups based on an entropic model: “the favored space groups are simply less restrictive than others… they allow the molecules more rigid-body degrees of freedom and can therefore be realized in a greater number of ways.”16 This same model predicts that, for globular proteins, a racemic protein mixture will crystallize more readily. This is so because the space group most favored for globular proteins on theoretical grounds, P1̄, contains inversion symmetry and is thus not accessible to natural l-proteins.16,17 However, the P1̄ space group would in principle be accessible to the achiral entity made up of a d-protein and the corresponding l-protein as a molecular pair. The more facile crystallization of racemic protein mixtures has never been experimentally verified. Only two proteins have been crystallized as racemates; in both cases centrosymmetric racemic crystals in space group P1̄ were obtained.18,19 These pioneering studies showed that it is possible to obtain centrosymmetric crystals from a racemic protein mixture; however, in neither case was crystallization of the natural l-protein limiting, so the studies did not speak to whether or not racemic crystallization could be used to facilitate obtaining crystals from a difficult-to-crystallize protein target.

In this paper we report the use of racemic crystallization1622 to obtain high-quality crystals of the protein sfAFP for X-ray diffraction studies. Crystal formation occurred much more readily from a racemic mixture of d-sfAFP and l-sfAFP than with the l-protein alone. More facile crystal formation also occurred from a quasi-racemic mixture of d-sfAFP and l-Se-sfAFP, a chemical protein analogue that contains an additional -SeCH2- moiety at one residue and thus differs slightly from the true enantiomer. Multiple wavelength anomalous dispersion (MAD) phasing of diffraction data from a quasi-racemate crystal was then used to solve the X-ray structure of the sfAFP protein molecule. The resulting model was used to solve by molecular replacement the structure of l-sfAFP to a resolution of 0.98 Å. The X-ray structure of the true racemate made up of the protein enantiomers l-sfAFP and d-sfAFP in space group P1̄ was also analyzed.

Results and Discussion

Crystallization of l-sfAFP

Using sfAFP prepared by total chemical synthesis,15 extensive crystallization trials were carried out over a period of six months with l-sfAFP alone, varying temperature, protein concentration, and the concentration of (NH4)2SO4 or other precipitants. Crystals were eventually obtained, with difficulty. By screening selected sections of ∼30 crystals, grown at room temperature (∼23 °C) by vapor diffusion, we were able to collect a synchrotron diffraction data set to 0.98 Å resolution. These data were indexed in the P21 space group and had unit cell parameters a = 16.7 Å, b = 74.3 Å, c = 17.7 Å, β = 102.2°, and had one sfAFP molecule per asymmetric unit. The solvent content for this crystal was unusually low at 25%. Exhaustive attempts to crystallize heavy atom derivatives of l-sfAFP for use in phase determination failed, and no structures of homologous protein molecules were available, so we were unable to solve the X-ray structure.

Racemic Crystallization

The total chemical synthesis of the two enantiomeric molecules, d-sfAFP and l-sfAFP, was carried out as previously described.15 In trials using a racemic solution containing equal amounts of d-sfAFP and l-sfAFP we found that crystals were obtained in ∼50% of the 194 conditions initially explored using the commercially available Hampton index, crystal screen and crystal screen II. This was in marked contrast to what we had observed in attempts to crystallize l-sfAFP. Figure 1 shows results obtained from simultaneous crystallization experiments under representative conditions for l-sfAFP alone, and for a racemic solution of d-sfAFP and l-sfAFP under the same conditions.

Figure 1.

Figure 1

A direct comparison of the results for attempted crystallization of the racemic protein mixture [d-sfAFP + l-sfAFP] or the single enantiomer l-sfAFP under three sets of conditions. Protein crystals appeared rapidly from a solution containing a racemic mixture of d-sfAFP and l-sfAFP, but not for l-sfAFP alone, when screened under a standard set of crystallization conditions. Representative examples are shown in this Figure. X-ray diffraction was used to verify that the crystals were in fact protein. The crystallization conditions shown are a subset of the main findings reported here and are as follows: (a) 0.1 M bicine pH = 9.0, 2% v/v 1,4-dioxane, 10% w/v polyethylene glycol 20,000; (b) 0.1 M bis-Tris pH = 6.5, 1.15 M ammonium sulfate; (c) 0.1 M Tris pH = 8.5, 25% w/v polyethylene glycol 3350. All crystallization trials were carried out at room temperature (∼23 °C).

Diffraction data from a sampling of the racemic crystals formed in our initial screen revealed that, under all conditions examined, the protein racemate crystallized with triclinic symmetry and had unit cell parameters a = 28.6 Å, b = 32.4 Å, c = 59.8 Å, α = 88.7°, β = 89.2°, γ = 73.4° with two sfAFP molecules in the asymmetric unit. The 〈|E2 − 1|〉 statistic calculations in program XPREP indicated the centrosymmetric space group P

Crystallization of a Quasi-Racemate for Phase Determination

The next step was to solve the structure of the sfAFP molecule. It was suggested by Berg that, because of the quantization of phases in diffraction data obtained from a centrosymmetric racemic protein crystal, direct methods should be more applicable.22 For the sfAFP protein, our preliminary attempts to solve the structure by direct methods failed. To speed up the structure determination we chose to solve the structure using experimental phases obtained using the multiple wavelength anomalous dispersion (MAD) method.23 We collected synchrotron X-ray data from a selenium-containing quasiracemate in which only one enantiomer contained a selenium atom. (For centrosymmetric structures, such as a protein racemate [pair of enantiomers], Friedel’s law holds even in the presence of an anomalous scattering atom, and it is not possible to measure anomalous differences.24) The quasi-racemate was obtained by the cocrystallization of a mixture of d-sfAFP and l-Se-sfAFP. The l-Se-sfAFP was a chemically synthesized sfAFP analogue in which Asn11 was replaced by seleno-Cys alkylated with bromoacetamide (“pseudo-Se-Gln”; see Experimental Section for details). The more facile production of crystals by racemic crystallization was critical for obtaining crystals—we had made extensive efforts to grow crystals of the l-Se-sfAFP protein alone, with no success. By contrast, a quasiracemic solution containing equal amounts of d-sfAFP and l-Se-sfAFP crystallized readily under a variety of conditions.

Synchrotron X-ray diffraction data to a resolution of 1.2 Å were collected from crystals of the d-sfAFP and l-Se-sfAFP quasi-racemate. The quasi-racemate had unit cell parameters nearly identical to those of the true racemate. The structure was solved in space group P1 using MAD data collected at three wavelengths (see Experimental Section). Figure 2 shows the arrangement of d-sfAFP and l-Se-sfAFP molecules in the quasiracemate.

Figure 2.

Figure 2

Quasi-racemate crystal packing in space group P1, showing the d-sfAFP molecule (purple ribbon) and the l-Se-sfAFP (green ribbon) with the “pseudo-Se-Gln” at position 11 shown as CPK solid spheres colored by atom type (C, green; O, red; N, cyan; Se, yellow). (a) Unit cell with two d-sf AFP/l-Se-sfAFP pairs located at positions 0, 1/2b, 1/2c, and 0,0,0, in the asymmetric unit. The d-sfAFP contains right-handed polyproline type II (PPII) helices; the l-Se-sfAFP contains left-handed PPII helices. (b) View of one d-sfAFP/l-Se-sfAFP pair located at 0, 1/2b, 1/2c. Side chains are included as stick representations.

X-ray Structure of sfAFP

The resulting high resolution X-ray structure of sfAFP is shown in Figure 3. The protein fold observed in the X-ray structure of sfAFP is similar to the structural model recently proposed by Davies et al.7 It consists of six left-handed PP-II helices, stacked in two antiparallel groups to form an oblong brick-shaped molecule. The turns linking the helices are in general defined by the presence of Pro residues, and the fold of the N-terminal half of the polypeptide chain is stabilized by two intramolecular disulfide bonds between residues Cys1-Cys28 and Cys13-Cys43. One face of the molecule is hydrophobic, and the other, polar/hydrophilic. The implications of this novel protein fold for antifreeze activity have been discussed.7 It has been hypothesized that in antifreeze proteins a flat, hydrophobic face interacts snugly with the highly ordered water molecules found at the surface of an ice crystal, presenting the polar (hydrophilic) face to the surrounding solution and inhibiting further growth of the ice crystal.25,26 Interestingly, we observed an array of highly ordered water molecules on the flat, hydrophobic face of the sfAFP protein (Figure 3). This observation is consistent with the proposed mechanism for the binding of an sfAFP molecule to an ice crystal.7

Figure 3.

Figure 3

The X-ray structure of snow flea antifreeze protein (sfAFP). Atomic resolution (1.2 A) X-ray structure of sfAFP, obtained from quasi-racemate crystals of d-sfAFP and l-Se-sf AFP in the P1 space group. (a) SigmaA-weighted 2FoFc electron density map (blue, 1.5 sigma level) of the proposed ice-binding surface encompassing residues 17–21, 47–51, 74–78. Five ordered water molecules are shown (magenta spheres) with electron density shown at 1σ. (b) Cartoon of the backbone fold of sfAFP. Amino acid side chains are shown as sticks. The dashed box indicates the region shown in (a) above. (c) The main structural elements of sfAFP are six antiparallel polyproline type II (PPII) (left-handed) helices, stacked in two groups and joined by five reverse turns and interlocked by a complex hydrogen bond network. The N-terminal half of the protein contains two intramolecular disulfide bonds between residues Cys1-Cys28 and Cys13-Cys43. (d) Surface map highlighting sfAFP’s amphiphilic character. A 180° rotation (along the long axis) shows the contrasting surfaces: hydrophobic/apolar (left); hydrophilic/polar (right). (e) Two views related by a 90° rotation of highly ordered first shell water molecules (shown in magenta) interacting with the sfAFP hydrophobic face backbone (dashed magneta lines). Water molecules (magenta spheres) are within hydrogen-bonding distance (2.7–3.0 Å) of the sfAFP backbone amide carbonyls and nitrogens, and of one another (solid magnenta lines). The spacing between water molecules is 4.5 Å and 9.2 Å along the short and long sfAFP axes, respectively. X-ray statistics are given in Table 1. Backbone rmsd deviations between molecules of l-sfAFP and d-sfAFP are given in Table 2.

The molecular model from the quasi-racemate was used to solve by molecular replacement the structure of l-sfAFP using the original 0.98 Å diffraction data (Table 3). The l-sfAFP molecular structure in the P21 crystal is identical to that observed in the quasi-racemate. A portion of the 2FoFc electron density map obtained at 0.98 Å is shown in Figure 4(a).

Table 3.

X-ray Data Collection and Refinement Statistics for l-sfAFP

Space group P21
Cell parameters a = 16.70 Å
b = 74.28 Å
c = 17.69 Å
β = 102.2°
Data Collection Statistics:
Beamline/Detector APS 19BM/SBC2
Wavelength (Å) 0.9792
Resolution range (Å) 50.00 − 0.95 (0.98 − 0.95)a
Reflections measured 241662
Unique reflections measured 23972
Completeness (%) 90.6 (46.0)
Average redundancy 3.3 (1.3)
<I>/<σ(I)> 17.9 (3.0)
Rmerge 0.059 (0.25)
Refinement Statistics:
Resolution range (Å) 17.29 − 0.98(1.01 − 0.98)
Reflections used (working set) 21784
Reflections used (test set) 1173
Completeness (working + test set, %) 95.5 (83.6)
Rwork 0.140 (0.303)
Rfree 0.164 (0.255)
Rms deviations
  Bonds (Å) 0.017
  Angles (degrees) 1.7
Average B factor (overall, Å2) 6.6
Ramachandran Plot Statistics:
Most favored (%) 94.6
Additionally allowed (%) 5.4
Generally allowed (%) 0
a

Values for highest resolution shell are given in parentheses.

Figure 4.

Figure 4

SigmaA-weighted 2FoFc electron density map (blue, 1.5σ level) of the proposed ice-binding surface encompassing residues 17–21, 47–51, 74–78 for: (a) P21 l-sfAFP; (b) P 1̄ {d-sfAFP + l-sfAFP}. Ordered water molecules with electron density at 1σ are shown (magenta spheres). Five ordered water molecules are conserved in all structures. An additional water molecule is observed in the P21 structure; this water is absent in the racemic crystals because of packing interactions.

Diffraction data were also collected from crystals of the true racemate {d-sfAFP + l-sfAFP}. Table 4 gives the X-ray statistics for the {d-sfAFP + l-sfAFP} data set. A portion of the 2FoFc electron density map obtained for the true racemate {d-sfAFP + l-sfAFP} at 1.0 Å is shown in Figure 4 (b). The crystals of the true racemate were refined in the P1̄ centrosymmetric space group.

Table 4.

X-ray Data Collection and Refinement Statistics for Racemic sfAFP

D, l-sfAFP
Space group P1<bar>
Cell parameters a = 28.60 Å
b = 32.40 Å
c = 59.85 Å
α = 88.69°
β = 89.19°
γ = 73.41°
Data Collection Statistics:
Beamline/Detector APS 23ID-B/MARCCD300
Wavelength (Å) 0.9792
Resolution range (Å) 50.00 − 0.95 (0.98 − .95)a
Reflections measured 974696
Unique reflections measured 101192
Completeness (%) 78.2(18.9)
Average redundancy 4.3 (3.3)
<I>/<σ(I)> 17.4(3.6)
Rmerge 0.084 (0.42)
Refinement Statistics:
Resolution range (Å) 25.00 − 1.00(1.03 − 1.00)
Reflections used (working set) 92050
Reflections used (test set) 4909
Completeness (working + test set, %) 87.1 (43.7)
Rwork 0.218(0.229)
Rfree 0.233 (0.297)
Rms deviations
  Bonds (Å) 0.014
  Angles (degrees) 1.6
Average B factor (overall, Å2) 7.9
Ramachandran Plot Statistics:
Most favored (%) 93.2
Additionally allowed (%) 6.8
Generally allowed (%) 0
a

Values for highest resolution shell are given in parentheses.

Conclusions

Crystallization of a racemic mixture of enantiomeric protein molecules prepared by total chemical synthesis has been used to experimentally determine the X-ray structure of the snow flea antifreeze protein (sfAFP). This protein was found to have a tertiary structure consisting only of stacked PPII helices, which represents a unique protein fold. The experimental structure of sfAFP is similar to a molecular model recently proposed, and has implications for the molecular basis of the antifreeze activity of sfAFP.7

In this paper we have experimentally demonstrated, for the first time, that a racemic protein mixture can more readily form highly ordered crystals suitable for X-ray diffraction studies. This can greatly facilitate the X-ray structure determination of a synthetically accessible protein, as was the case for the snow flea antifreeze protein studied here. Moreover, we have shown that, at least for sfAFP, the more facile crystallization also applies to quasi-racemic mixtures in which one protein molecule contains an anomalous scattering atom for phase determination. Including the work reported here, only a handful of racemic protein crystallizations have been described. Because a mixture of enantiomers can crystallize either as a racemate or the individual enantiomers can crystallize as a conglomerate, it is too early to say whether more facile crystallization of racemic protein mixtures is a general phenomenon, as predicted by Yeates.16 If protein racemates crystallize as a rule, then the chemical synthesis of mirror image proteins for racemic crystallization, as suggested by Yeates,16,17 may prove to be a powerful tool in structural biology.

Experimental Section

Total Chemical Synthesis of sfAFP and Analogues

The 81-residue sfAFP polypeptide chain (Scheme 1) was synthesized by native chemical ligation of four unprotected peptides (Thz1–12-thioester; Thzl3–27-thioester; Thz28–42-thioester; Cys43–81) as described in an accompanying paper.15 l-Se-sfAFP was prepared by incorporation of “pseudo-Se-Gln” into the Thz1–12- thioester peptide during stepwise solid-phase synthesis.27 Boc-“[4Se]-pseudo-Gln”-OH for use in peptide synthesis was prepared by reducing l-selenocystine with sodium borohydride, alkylating at selenium with 2-bromoacetamide, and N-acylating with Boc2O. The LC—MS analysis of chemically synthesized l-Se-sfAFP synthetic protein is shown in Figure 5.

Figure 5.

Figure 5

Analytical LC-MS data for synthetic folded l-Se-sfAFP. The chromatographic separations were carried out on a Vydac C4 2.1 mm × 150 mm column using a linear gradient of 1–61% buffer B over 15 min (buffer A = 0.1% TFA in H2O; buffer B = 0.08% TFA in acetonitrile). The inset is the ESMS summed over the entire LC peak. Calculated masses were based on average isotope composition.

Protein Crystallization

Lyophilized sfAFP was dissolved in water at 10–40 mg/mL and was used directly for crystallizations. Crystallization screening conditions were based on the commercially available Hampton index, crystal screen and crystal screen II. Crystals were grown by vapor diffusion in hanging drops at room temperature (∼23 °C). The drops were generated by mixing 1 µL of protein solution with 1 µL of reservoir solution and placed against 1 mL of reservoir solution. Racemic solutions of sfAFP were prepared in water at 38 mg/mL (19 mg of d-sfAFP and 19 mg of l-sfAFP) and 19 mg/mL (9.5 mg of d-sfAFP and 9.5 mg of l-sfAFP). Simultaneous sparse matrix room temperature screens were carried out for the racemic protein solutions at both concentrations. Crystals appeared after 1 day, and after 10 days, 93 of the 194 sets of the conditions explored had crystals suitable for use in diffraction experiments. No crystals were formed from the l-sfAFP solutions using the same sets of conditions. The same approach was used for crystallization of the quasi-racemate in which crystals suitable for use in diffraction experiments appeared after 1 day in a number of conditions. Diffraction data for the quasi-racemate were collected using a crystal grown at room temperature from 0.2 M sodium malonate, pH 7, and 20% w/v polyethylene glycol 3350. Diffraction data for l-sfAFP were collected using a crystal grown at room temperature from 0.1 M bis-Tris pH = 5.5, 1.8 M ammonium sulfate. Diffraction data for the racemate was collected using a crystal grown at room temperature from 0.1 M bis-Tris pH = 6.5, 1.15 M ammonium sulfate.

Diffraction Data Collection for l-sfAFP

For low temperature data collection, selected crystals of l-sfAFP were transferred to the cryoprotectant (reservoir solution plus 20% (v/v) glycerol) for a few seconds and flash-frozen in liquid nitrogen. Synchrotron X-ray data were collected at a temperature of 100 K at the Advanced Photon Source, Argonne National Laboratory, beamline 19BM equipped with a SBC2 detector. Images were processed and scaled with HKL2000.28 Data were collected to a resolution of 0.98 Å (Table 3).

Diffraction Data Collection for the Racemate

For low temperature data collection, selected crystals were transferred to the cryoprotectant (reservoir solution plus 20% (v/v) glycerol) for a few seconds and flash-frozen in liquid nitrogen. X-ray data were collected at 100 K at the Advanced Photon Source, Argonne National Laboratory, beamline 23ID equipped with a MARCCD 300 detector). Images were processed and scaled with HKL2000.28 Data were collected to a resolution of 1.0 Å (Table 4).

Diffraction Data Collection for the Quasi-Racemate

For low temperature data collection, selected crystals of the l-Se-sfAFP/d-sfAFP quasi-racemate were transferred to the cryoprotectant (reservoir solution plus 20% (v/v) glycerol) for a few seconds and flash-frozen in liquid nitrogen. Three-wavelengh anomalous dispersion X-ray data were collected at 100K at the Advanced Photon Source, Argonne National Laboratory, beamline 23ID equipped with a MARCCD 300 detector). Images were processed and scaled with HKL2000.28 Data were collected to a resolution of 1.2 Å (Table 1).

Table 1.

X-ray Data Collection and Refinement Statistics for the Quasi-Racemate [d-sfAFP+l-Se-sfAFP]

Crystal 1
Space group P1
Cell parameters a = 28.64
b = 32.36
c = 59.70
α = 88.62°
β = 89.31°
γ = 72.84°
Peak Inflection Remote
Data Collection Statistics:
Beamline/Detector APS 23ID-D/MARCCD APS 23ID-D/MARCCD APS 23ID-D/MARCCD
Wavelength (Å) 0.9793 0.9794 0.9493
Resolution range (Å) 50.00 − 1.20(1.24 − 1.20)a 50.00 − 1.20 (1.24 − 1.20) 50.00 − 1.20 (1.24 − 1.20)
Reflections measured 856855 426808 422666
Unique reflections measuredb 60583 60655 60754
Completeness (%) 94.7(91.7) 94.7(91.4) 95.0 (92.0)
Average Redundancy 8.0 (7.9) 4.0 (3.9) 4.0 (4.0)
<I>/<σ(I)> 37.9 (9.9) 33.8 (6.7) 39.6(11.4)
Rmerge 0.068(0.21) 0.048 (0.18) 0.041(0.11)
Refinement:
Resolution range (Å) 27.37 − 1.20(1.23 − 1.20)
Reflections used (working set) 57496 (4052)
Reflections used (test set) 3064 (208)
Completeness (working + test set, %) 94.7 (90.5)
Rwork 0.160(0.137)
Rfree 0.192(0.195)
Rms deviations
  Bond lengths (Å) 0.016
  Bond angles (°) 1.6
Average B factor (overall, Å2) 7.8
Ramachandran Plot Statistics:
Most favored (%) 93.2 (L), 91.9(D)
Additionally allowed (%) 6.8 (L), 8.1 (D)
Generally allowed (%) 0 (L), 0 (D)
a

Values for highest resolution shell are given in parentheses.

b

Friedel mates merged.

Determination of the sfAFP X-ray Structure from the Quasi-Racemate

The l-Se-sfAFP/d-sfAFP quasi-racemate structure was solved by MAD phasing.23 The heavy atom search was performed in SHELXD29 using X-ray data collected at 12.6607 keV peak energy and 3.5 Å resolution cutoff. MAD phasing and density modification were performed with use of SOLVE and RESOLVE, respectively.30 The model building performed with the RESOLVE basic script was complete with use of ARP/WARP program.31 The electron density and model examinations were done using TURBO-FRODO.32 The restrained positional and anisotropic B-factor refinement was done in REFMAC5.33 Rfree was monitored by setting aside 5% of the reflections as a test set. Crystal data and X-ray data statistics for the quasi-racemate data set are listed in Table 1. Backbone rmsd (Å) deviations between the enantiomeric protein molecules in the quasi-racemate are presented in Table 2. The coordinates and structure factors have been deposited in the Protein Data Bank with entry code 3BOG.

Table 2.

Backbone rmsd (Å) Deviations between Molecules in the Quasi-Racemate: Molecules A and B are l-Se-sfAFP, and Molecules C and D Are Inverted d-sfAFP

A B C-inverted D-inverted
A 0.520 0.165 0.514
B 0.564 0.203
C-inverted 0.540

Determination of the X-ray Structure of l-sfAFP

The structure was determined by molecular replacement using the program MOLREP34 with the l-Se-sfAFP molecule from the quasiracemate as a starting model. The rigid body refinement, restrained positional and anisotropic B-factor refinement, was done in REFMAC5.33 The electron density and model examinations were done using TURBO-FRODO.32 Rfree was monitored by setting aside 5% of the reflection as a test set. Crystal data and X-ray data statistics for the l-sfAFP data set are listed in Table 3. The coordinates and structure factors have been deposited in the Protein Data Bank with entry code 2PNE.

Determination of the X-ray Structure of the {l-sfAFP+d-sfAFP} True Racemate

Since the crystal packing of the quasi-racemate is the same as the true racemate, we proceeded directly to rigid-body refinement with two l-fAFP monomers in the asymmetric unit (chains A and B) oriented as for the quasi-racemate. The rigid body refinement, restrained positional and anisotropic B-factor refinement, was done in REFMAC5.33 The electron density and model examinations were done using TURBO-FRODO.32 Rfree was monitored by setting aside 5% of the reflection as a test set. Crystal data and X-ray data statistics for the racemate data set are listed in Table 4. The coordinates and structure factors have been deposited in the Protein Data Bank with entry code 3BOI.

Acknowledgment

We thank W. DeGrado (U. Penn) for suggesting that we work on the snow flea antifreeze protein. We thank T. Sosnick, V. Pingali, V. Torbeev, Li-Wei Hung, C. Weeks, S. Anderson, N. Sanishvili, M. Becker, and A. Moglich for useful discussions and technical advice. Z.G. was supported by the University of Chicago NIH Training Program in Physical and Chemical Biology. This research was supported by the Office of Science (BER), U.S. Department of Energy, Grant No. DEFG02-07ER64501 to S.B.H.K., and by the National Institutes of Health, Grant No. R01 GM075993 to SBHK. The use of the Advanced Photon Source was supported by the U.S. Department of Energy, Basic Energy Sciences, Office of Science, under Contract No. DE-AC02-06CH11357. Portions of this work were performed at the SBC-CAT and GM/CA CAT located at Sectors 19BM and 23ID, respectively, of the Advanced Photon Source. SBC-CAT is supported by the U.S. Department of Energy, Office of Biological and Environmental Research, under Contract No. W-31-109-ENG-38. Use of the GM/CA CAT Sector 23-ID was funded with Federal funds from the National Cancer Institute (Y1-CO-1020) and the National Institute of General Medical Science (Y1-GM-1104).

References

  • 1.Davies PL, Baardsnes J, Kuiper MJ, Walker VK. Philos. Trans. R. Soc. London, Ser. B. 2002;357:927–935. doi: 10.1098/rstb.2002.1081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.DeVries AL, Wohlschlag DE. Science. 1969;163:1073–1075. doi: 10.1126/science.163.3871.1073. [DOI] [PubMed] [Google Scholar]
  • 3.DeVries AL, Komatsu SK, Feeney RE. J. Biol. Chem. 1970;245:2901–2908. [PubMed] [Google Scholar]
  • 4.DeVries AL. Science. 1971;172:1152–1155. doi: 10.1126/science.172.3988.1152. [DOI] [PubMed] [Google Scholar]
  • 5.Prabhu N, Sharp K. Chem. Rev. 2006;106:1616–1623. doi: 10.1021/cr040437f. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Graham LA, Davies PL. Science. 2005;310:461. doi: 10.1126/science.1115145. [DOI] [PubMed] [Google Scholar]
  • 7.Lin FH, Graham LA, Campbell RL, Davies PL. Biophys. J. 2007;92:1717–1723. doi: 10.1529/biophysj.106.093435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Schnolzer M, Kent SBH. Science. 1992;256:221–225. doi: 10.1126/science.1566069. [DOI] [PubMed] [Google Scholar]
  • 9.Dawson PE, Muir TW, Clark-Lewis I, Kent SBH. Science. 1994;266:776–779. doi: 10.1126/science.7973629. [DOI] [PubMed] [Google Scholar]
  • 10.Dawson PE, Kent SB. Annu. Rev. Biochem. 2000;69:923–960. doi: 10.1146/annurev.biochem.69.1.923. [DOI] [PubMed] [Google Scholar]
  • 11.Bang D, Kent SB. Angew. Chem., Int. Ed. Engl. 2004;43:2534–2538. doi: 10.1002/anie.200353540. [DOI] [PubMed] [Google Scholar]
  • 12.Johnson EC, Kent SB. J. Am. Chem. Soc. 2006;128:6640–6646. doi: 10.1021/ja058344i. [DOI] [PubMed] [Google Scholar]
  • 13.Durek T, Torbeev VY, Kent SB. Proc. Natl. Acad. Sci. U.S.A. 2007;104:4846. doi: 10.1073/pnas.0610630104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Torbeev VY, Kent SB. Angew. Chem., Int. Ed. 2007;46(10):1667–1670. doi: 10.1002/anie.200604087. [DOI] [PubMed] [Google Scholar]
  • 15.Pentelute BL, Gates ZP, Dashnau JL, Vanderkooi JM, Kent SBH. J. Am. Chem. Soc. 2008;130:9702–9707. doi: 10.1021/ja801352j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wukovitz SW, Yeates TO. Nat. Struct. Biol. 1995;2(12):1062–1067. doi: 10.1038/nsb1295-1062. [DOI] [PubMed] [Google Scholar]
  • 17.Pellegrini M, Wukovitz SW, Yeates TO. Proteins. 1997;28:515. [PubMed] [Google Scholar]
  • 18.Zawadzke LE, Berg JM. Proteins. 1993;16:301–305. doi: 10.1002/prot.340160308. [DOI] [PubMed] [Google Scholar]
  • 19.Hung LW, Kohmura M, Ariyoshi Y, Kim SH. J. Mol. Biol. 1999;285:311–321. doi: 10.1006/jmbi.1998.2308. [DOI] [PubMed] [Google Scholar]
  • 20.Toniolo C, Peggion C, Crisma M, Formaggio F, Shui X, Eggleston DS. Nat. Struct. Biol. 1994;1:908–914. doi: 10.1038/nsb1294-908. [DOI] [PubMed] [Google Scholar]
  • 21.Patterson WR, Anderson DH, DeGrado WF, Cascio D, Eisenberg D. Protein Sci. 1999;8:1410. doi: 10.1110/ps.8.7.1410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Berg JM, Goffeney NW. Methods Enzymol. 1997;276:619–627. [PubMed] [Google Scholar]
  • 23.Hendrickson WA, Ogata CM. Macromol. Crystallogr., Pt A. 1997;276:494–523. doi: 10.1016/S0076-6879(97)76074-9. [DOI] [PubMed] [Google Scholar]
  • 24.Stout GH, Jensen LH. X-Ray Structure Determination: A Practical Guide. 2nd ed. New York: Wiley; 1989. [Google Scholar]
  • 25.Liou YC, Tocilj A, Davies PL, Jia Z. Nature. 2000;406:322–324. doi: 10.1038/35018604. [DOI] [PubMed] [Google Scholar]
  • 26.Leinala EK, Davies PL, Jia Z. Structure. 2002;10:619–627. doi: 10.1016/s0969-2126(02)00745-1. [DOI] [PubMed] [Google Scholar]
  • 27.Schnolzer M, Alewood P, Jones A, Alewood D, Kent SBH. Int. J. Peptide Res. Ther. 2007;13:31–44. [Google Scholar]
  • 28.Otwinowski Z, Minor W. Mocromol. Crystallogr., Pt A. 1997;276:307–326. doi: 10.1016/S0076-6879(97)76066-X. [DOI] [PubMed] [Google Scholar]
  • 29.Schneider TR, Sheldrick GM. Acta Crystallogr., Sect. D. 2002;58:1772–1779. doi: 10.1107/s0907444902011678. [DOI] [PubMed] [Google Scholar]
  • 30.Terwilliger TC, Berendzen J. Acta Crystallogr., Sect. D. 1999;55:849–861. doi: 10.1107/S0907444999000839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Perrakis A, Harkiolaki M, Wilson KS, Lamzin VS. Acta Crystallogr, Sec. D. 2001;57:1445–1450. doi: 10.1107/s0907444901014007. [DOI] [PubMed] [Google Scholar]
  • 32.Cambillau C, Roussel A. Turbo Frodo. Marseille, France: Université Aix-Marseille II; 1997. Version OpenGL.1. [Google Scholar]
  • 33.Murshudov GN, Vagin AA, Dodson EJ. Acta Crystallogr, Sect. D. 1997;53:240–255. doi: 10.1107/S0907444996012255. [DOI] [PubMed] [Google Scholar]
  • 34.Vagin A, Teplyakov A. J. Appl. Crystallogr. 1997;30:1022–1025. [Google Scholar]

RESOURCES