Abstract
Total chemical synthesis was used to prepare the mirror image (D-protein) form of the angiogenic protein vascular endothelial growth factor (VEGF-A). Phage display against D-VEGF-A was used to screen designed libraries based on a unique small protein scaffold in order to identify a high affinity ligand. Chemically synthesized D- and L- forms of the protein ligand showed reciprocal chiral specificity in surface plasmon resonance binding experiments: The L-protein ligand bound only to D-VEGF-A, whereas the D-protein ligand bound only to L-VEGF-A. The D-protein ligand, but not the L-protein ligand, inhibited the binding of natural VEGF165 to the VEGFR1 receptor. Racemic protein crystallography was used to determine the high resolution X-ray structure of the heterochiral complex consisting of {D-protein antagonist + L-protein form ofVEGF-A}. Crystallization of a racemic mixture of these synthetic proteins in appropriate stoichiometry gave a racemic protein complex of more than 73 kDa containing six synthetic protein molecules. The structure of the complex was determined to a resolution of 1.6 Å. Detailed analysis of the interaction between the D-protein antagonist and the VEGF-A protein molecule showed that the binding interface comprised a contact surface area of approximately 800 Å2 in accord with our design objectives, and that the D-protein antagonist binds to the same region of VEGF-A that interacts with VEGFR1-domain 2.
Keywords: chemical protein synthesis, mirror image proteins, protein engineering, racemic X-ray crystallography
One of the most remarkable aspects of the natural world is the homochirality of the protein macromolecules found in living systems (1). All protein molecules found in nature contain ribosomally translated polypeptide chains that are comprised exclusively of L-amino acids and the achiral amino acid glycine. Starting about 20 years ago, chemists and biochemists became interested in the properties of unnatural mirror image D-protein molecules; i.e., proteins with polypeptide chains of the same sequence as natural proteins, but made from D-amino acids and glycine. In pioneering work, Zawadzke and Berg showed that a racemic mixture of the enantiomeric forms of the small protein rubredoxin crystallized in a centrosymmetric space group and that the folded protein molecules were mirror images of one another (2). More recently, racemic crystallography of chemically synthesized protein enantiomers has been used to facilitate the crystallization of recalcitrant proteins and to solve protein structures that had not previously been determined by X-ray crystallography (3–7).
Chirality, the handedness of molecules (8), is fundamental to biological interactions. This aspect of chirality was first systematically demonstrated by Emil Fischer, who—based on his studies of sugar stereochemistry—formulated the ‘lock and key’ principle of enzyme action on chiral substrate molecules (9). Since then, it has been repeatedly demonstrated that natural proteins preferentially bind to only one enantiomer of chiral small molecule ligands. The principles of molecular chirality dictate that the inverse also be true—i.e., that the mirror image form of a natural protein molecule will preferentially bind to the opposite enantiomer of a chiral ligand (10). Chirality is believed to be of particular importance in protein-protein interactions—it is considered a self-evident conclusion of symmetry arguments that the enantiomer of a protein ligand will not bind to the same natural protein target, and that enantiomeric forms of a protein-(protein ligand) pair will have the same affinity for one another. Nonetheless, experimental demonstration of this principle may be instructive.
In 1996, Peter Kim and his colleagues at the Whitehead Institute at MIT used peptide phage display against a D-protein target as a way to systematically develop D-peptide ligands of natural proteins (11). As originally conceived, this ‘mirror image (peptide) phage display’ method involved the chemical synthesis of the mirror image form of a natural protein molecule, after which peptide phage libraries were screened to identify L-peptide ligands to the D-protein. Then, chemical synthesis was used to make the corresponding D-peptide ligand, which obligately bound with the same affinity to the natural protein target. Such D-peptide ligands would be resistant to proteolytic digestion in vivo, and for that reason they have excited a great deal of interest. Although mirror image phage display has been used in a number of academic studies (12–14), it has not yet led to the development of D-peptides as therapeutics.
We set out to apply the mirror image phage display approach to esigned libraries of a unique protein scaffold in order to develop high affinity D-protein antagonist(s) for biologically active protein targets. A properly engineered D-protein molecule would have near optimal properties as a therapeutic: A small D-protein can be produced by chemical manufacture, will resist proteolytic degradation, and is expected to be nonimmunogenic (15). Further, it should be possible to engineer a small D-protein to have affinity and specificity similar to that of antibodies for a therapeutic target molecule. Development of improved antagonists of growth factor activity is an important current objective in medicinal chemistry. The angiogenic factor vascular endothelial growth factor (VEGF-A) is the target of engineered monoclonal antibodies that inhibit angiogenesis by interfering with the interaction of VEGF-A with its receptors (16), and several of these engineered antibodies have become important human therapeutics.
The objective of the work reported here was to develop a D-protein ligand that binds to VEGF-A and acts as an antagonist of VEGF-A binding to its receptor. Our target was the covalent homodimer VEGF[8–109] protein molecule; this form of VEGF-A retains full biological activity and was used in the development of the monoclonal antibodies bevacizumab and ranibizumab (16–18). Here we report the total chemical synthesis of the mirror image D-VEGF-A protein molecule, the use of mirror image protein phage display of designed libraries based on a unique protein scaffold, and the consequent generation of a specific D-protein ligand for VEGF-A that blocks binding to the VEGFR1 receptor. The high resolution X-ray structure of the {D-Protein Antagonist plusVEGF-A} protein complex was determined by racemic protein crystallography.
Results and Discussion
Chemical Synthesis of D-VEGF-A.
The essential first step was to prepare the mirror image form of the VEGF[8–109] target protein. Recently we reported the total synthesis of the natural enantiomer of this form of the VEGF-A protein molecule with full biological activity (19). Here, we used total synthesis enabled by modern chemical ligation methods to prepare D-VEGF-A, the mirror image form of VEGF-A (Fig. 1). Three unprotected synthetic peptide segments were condensed by native chemical ligation (20) in a series of ‘one pot’ reactions; i.e., without purification of intermediate products (Fig. 1A). The purified synthetic 102 residue D-polypeptide chain (Fig. 1B) was folded in the presence of a redox couple, with concomitant formation of native disulfide bonds, to give the D-protein molecule (Fig. 1C). Synthetic D-VEGF-A was characterized by electrospray mass spectrometry, and had an observed mass (23,849.3 ± 0.7 Da) in agreement with the expected mass (23,849.1 Da, average isotope composition) for the covalent homodimer containing eight disulfide bonds (19). Synthetic D-VEGF-A was crystallized, and X-ray diffraction data was acquired to a resolution of 1.9 Å. The structure was solved by molecular replacement, using inverted coordinates of the previously reported VEGF-A structure (PDB code: 3QTK) as a search model. A portion of the resulting 2Fo-Fc electron density map is shown in SI Appendix, Fig. S1C, and the structure of synthetic D-VEGF-A is shown in Fig. 1D. The chemically synthesized D-VEGF-A had a structure that was the mirror image of natural recombinant VEGF-A (17, 18), within experimental uncertainty.
Fig. 1.
Total chemical synthesis of D-VEGF-A. The amino acid sequence is given in (19). (A) Analytical HPLC data for the total chemical synthesis of the D-VEGF[8–109] polypeptide chain. (B) LCMS data for the synthetic D-VEGF[8–109] polypeptide chain [observed mass: 11,932.7 ± 0.3 Da; calculated mass 11,932.5 Da (average isotopes)]. (C) HPLC and direct infusion electrospray MS data for the folded, homodimeric synthetic D-VEGF-A protein [observed mass: 23,849.3 ± 0.7 Da; calculated mass 23,849.1 Da (average isotopes)]. (D) Cartoon representation of the 1.9 Å resolution X-ray structure of synthetic D-VEGF-A protein. (E) (left) Cartoon of the protein GB1 scaffold; (right) representation of the surface of GB1 with residues randomized in the designed library colored red.
A Protein Ligand for D-VEGF-A
We used the B1 domain of streptococcal protein G (GB1) (21) as a scaffold for the development of small protein ligands. Natural protein GB1 is a well studied small protein that folds stably and reversibly and that binds to the Fc region of immunoglobulins with high affinity (22). GB1 consists of a polypeptide chain of 56 amino acids, small enough to be made by total chemical synthesis and yet large enough to provide a sufficient surface area for a strong binding interface between the D-protein antagonist and its target protein. While there are several miniprotein scaffolds that have been developed (23), they typically do not present enough solvent exposed surface area to match the affinity and specificity of antibodies. We displayed GB1 on M13 filamentous phage as a fusion with coat protein p3, and with an N-terminal FLAG tag. A library of GB1 mutants was constructed from 15 contiguous residues on the surface of the protein, within the region spanning residues 21–41 and with the insertion of one or two additional residues between positions 41 and 42 (Fig. 1E), in order to generate a binding interface with an area comparable to that of antibodies. The selected residues were randomized by a KHT codon, which allows the amino acids Y, A, D, S, F, and V at each position. This codon was chosen because one of us (Sidhu) has previously shown that minimalist libraries of aromatic amino acids and small residues are sufficient for generating high affinity antibodies (24, 25). This strategy allows better sampling of the potential sequence space in a library of approximately 1010 variants compared to randomizing with all 20 amino acids.
The library of GB1 mutants was screened against the chemically synthesized D-VEGF-A and after four rounds of panning 20-fold enrichment was observed by phage pool ELISA. Single clones were analyzed for binding-specificity (Fig. 2A), and then sequenced. SI Appendix, Table S1 shows the sequences of selected clones that bound to D-VEGF-A. SI Appendix, Fig. S2 shows the binding specificity of selected clones. Phage containing the clone RFX.T1.4 had the highest affinity for D-VEGF-A as determined by competitive ELISA. RFX.T1.4 phage bound specifically to D-VEGF-A (Fig. 2A), and the binding could be competed by the mirror image form of a known antagonistic peptide v114 (Fig. 2B) (26), suggesting that the GB1 mutants encoded by these clones would be likely to have antagonistic activity. Further affinity maturation was performed on clone RFX.T1.4, by relaxing codon constraints to allow all 20 amino acids at each of the randomized positions. Four libraries, each mutating a set of four residues (Fig. 2C), were constructed and three rounds of phage selections were performed. Twenty-four clones were selected and sequenced from each library, and from the unique sequences a binding profile was generated as shown in Fig. 2D. The binding profiles indicated that hydrophobic residues were preferred at positions 22, 23, and 40 and were conserved at positions 26, 30, and 37. Negatively charged residues were conserved in positions 27 and 42. Of the 16 randomized residues, these eight residues seemed to be important for binding to D-VEGF-A. From the binding profiles it appeared that the dominant mutations obtained from affinity maturation were Y23F and F40I. Preliminary experiments showed that only the Y23F mutation increased the affinity (twofold), while the F40I mutation did not improve affinity. Therefore a point mutant [Y23F]RFX.T1.4 was constructed and phage containing this point mutant had high affinity for D-VEGF-A. A GB1-derived protein molecule with this amino acid sequence (RFX001) was chosen for further study (Fig. 2E).
Fig. 2.
Specificity and sequence diversity of selected phage-displayed protein ligands. (A) Phage-displayed protein ligand RFX.T1.4 specifically binds to only D-VEGF (blue) and does not bind to L-VEGF (red), GST (green) and Neutravidin (purple); (B) Competitive binding of phage-displayed protein ligand RFX.T1.4 in the presence of a saturating concentration (100 μM) of enantiomeric forms of the peptide competitor v114 (26): (red) D-v114 competitively inhibits binding; (green) L-v114 has no effect on binding. (C) Sequences of the best hit RFX.T1.4 (top) and the affinity maturation libraries used. (D) Sequence profile logos for selected clones (8–12 unique sequences); the size of the single letter code represents the frequency of occurrence of that amino acid at a given position. (E) Sequence of the optimized point mutant RFX001 (bottom); the parent sequence of GB1 (top). Underlined/bold residues are optimized in RFX001 compared to GB1; a residue was inserted between residues 41 and 42 as discussed in the text.
Chiral Specificity of VEGF-A Interactions with Mirror Image RFX001 Protein Ligands.
The D-amino acid and L-amino acid forms of the RFX001 polypeptide chain were chemically synthesized and were each separately folded by dissolution in PBS at pH 7.4, and used to evaluate affinity for the enantiomeric forms of VEGF-A.
The mirror image RFX001 protein ligands showed reciprocal chiral specificity in surface plasmon resonance experiments as shown in Fig. 3A: D-RFX001 bound only to L-VEGF-A and did not bind to D-VEGF-A, while L-RFX001 bound only to D-VEGF-A and did not bind to L-VEGF-A. Using data from the two highest concentrations of protein ligand, the observed dissociation constant values Kd were 85 ± 12 nM for the D-RFX001 and 95 ± 8 nM for the L-RFX001 (SI Appendix, Table S2). The enantiomeric forms of the RFX001 protein molecule were also evaluated for their ability to inhibit the binding of VEGF165, the most abundant form of VEGF-A found in vivo (16), to its receptor VEGFR1 (Flt1). Only D-RFX001 inhibited the binding of VEGF165 to the VEGF receptor Flt1 (Fig. 3B).
Fig. 3.
Chiral specificity of protein ligand/VEGF-A interactions. (A) Surface plasmon resonance sensorgrams showing the interaction of enantiomeric forms of the protein ligand RFX001 with enantiomeric forms of VEGF-A: L-RFX001 bound to D-VEGF-A, but did not bind to L-VEGF-A; in contrast, D-RFX001 bound to L-VEGF-A, but did not bind to D-VEGF-A. (B) As measured by biolayer interferometry (ForteBio Octet) using the wavelength shift on binding, D-RFX001 is an antagonist of VEGF165 binding to VEGFR1: D-RFX001 inhibited the binding of VEGF165 to VEGFR1 (lower curve), while added L-RFX001 had no effect even at 2 μM (upper curves).
Structure of the {VEGF-A + D-Protein Antagonist} Complex by Racemic Crystallography.
We next set out to determine the molecular structure of the complex of VEGF-A with the protein antagonist D-RFX001. We attempted to crystallize a mixture of synthetic L-VEGF-A with two equivalents of synthetic D-RFX001 using 96 index conditions of the Quiagen Pro-complex suite; only two conditions produced micro crystals. Optimization of these two conditions did not lead to the formation of diffraction-quality crystals. Recently we showed that a racemic protein mixture crystallizes more readily than the natural L-protein alone*, and this has enabled us to use racemic crystallography to determine a number of protein structures that had not previously been determined by X-ray crystallography (3, 4, 7). However, to date racemic protein crystallography has not been used to determine the structure of a heterochiral protein complex; i.e., a complex that consists of two different protein molecules of opposite chirality that bind to one another in solution. Here, we obtained crystals from a solution of chemically synthesized protein molecules that contained one equivalent each of L-VEGF-A and D-VEGF-A, and two equivalents each of the enantiomeric forms of the protein antagonist; i.e., D-RFX001 and L-RFX001†. Crystals were readily obtained, and synchrotron X-ray diffraction data were collected to 1.6 Å resolution. Data reduction statistics revealed that the protein complex had crystallized in the centrosymmetric space group P21/n. Matthews cell content analysis (27) suggested that there were one VEGF-A protein molecule and two RFX001 protein molecules in the P21/n asymmetric unit. The structure was solved in P21/n by molecular replacement using inverted and noninverted coordinates of the previously reported crystal structures of VEGF (PDB code: 3QTK) and GB1 (PDB code: 2QMT), as shown in Fig. 4.
Fig. 4.
X-ray structure of the {VEGF-A + D-RFX001} complex. Racemic crystallization gave crystals that diffracted to a resolution of 1.6 Å, and the structure was solved in P21 and P21/n (21). (A) Unit cell representation of the {VEGF-A + D-RFX001} complex crystallized as a racemate. The unit cell contains a total of 12 synthetic protein molecules {2 × L-VEGF-A,2 × D-VEGF-A,4 × D-RFX001,4 × L-RFX001}. (B) (upper): Asymmetric unit in the space group P21; (lower): Surface representation of the asymmetric unit in the space group P21. (C) (upper): Asymmetric unit in the space group P21/n; (lower): Surface representation of the asymmetric unit in space group P21/n. Color coding: green and cyan indicate the two chains of the homodimeric D-VEGF-A protein molecule; gray and salmon indicate the two chains of the homodimeric L-VEGF-A protein; yellow and orange indicate the two D-protein antagonist molecules; purple and mauve indicate the two L-protein antagonist molecules.
The Binding Interface.
Each L-VEGF-A protein molecule bound two D-RFX001 protein molecules. The structure of the VEGF protein molecule in the {VEGF-A + D-RFX001} complex is closely similar to that found in previously reported structures (17, 18). The structure of RFX001 in comparison to wild-type GB1 (PDB code: 2QMT) revealed that the major differences in the structure occur in the loop 2 region which has one amino acid inserted between residues 41 and 42 of the wild-type sequence. The Cα atoms in the rest of the two proteins align with a rmsd of 0.75 Å. Note that Phe37 and Phe40 in loop 2 form part of the RFX001 protein core and help order loop 2 to present a suitable surface for VEGF-A binding (SI Appendix, Fig. S5). Apart from the mutated residues, the side chains of the residues in the RFX001 molecule align well with those of GB1, except for Trp42 that shows a shift in position most likely caused by the insertions of Phe37 and Phe40 into the core.
It is of particular note that D-RFX001 binds to the same region of VEGF-A that interacts with VEGFR1-domain 2 (17), consistent with the ability of D-RFX001 to act as an antagonist of the VEGF-A/VEGFR1 interaction. The structural basis for this antagonism is vividly illustrated in Fig. 5 (top). Despite its modest size, the protein D-RFX001 exhibits a contact surface area of approximately 800 Å2 at the binding interface with VEGF-A and covers much of the contact surface that VEGF-A uses to interact with domain 2 of the VEGFR1 receptor molecule (17).
Fig. 5.
Interactions at the binding interface between VEGF-A and D-RFX001. (Upper) Comparison of binding surfaces. (Left box) Recombinant VEGF-A interacts with VEGFR1-domain 2 (light magenta) with buried surface area of 1,300 Å2. The interaction interface is shown in dark blue. Taken from Wiesmann, et al. (17). (Right box) Interaction interface of synthetic VEGF-A and D-RFX001 (orange). The buried surface area (shown in red) between VEGF and D-protein antagonist is ∼800 Å2. (Bottom) Contact residues in the D-RFX001–VEGF-A complex. (Left) A distinctive feature of the D-protein ligand-binding surface is the presence of numerous aromatic side chains originating from the N-terminal alpha-helix of VEGF and the helix/third β-strand of the D-protein ligand. Residues that are involved in the interactions are colored in light blue. (Middle) Direct hydrogen bonding interactions involving the side chains of the residues of VEGF and D-protein ligand are shown as dotted lines. (Right) Water-mediated hydrogen bonding networks at the interface. Water molecules are shown as green spheres. In all the three boxes the D-protein ligand is colored in orange, and the interacting VEGF chains are colored in silver-black and salmon.
The interaction between the two protein molecules is dominated by a total of ten aromatic residues (six from D-RFX001 and four from the VEGF-A molecule) (Fig. 5, bottom, left). In the D-RFX001 ligand the helix (α1), the third β-strand (β3), and the loop spanning residues 37 to 41 all participate in contacts with VEGF-A. Each residue in the loop region (residue 37 to 41) of the D-RFX001 molecule makes direct polar contact with VEGF-A at the binding interface: Residue Ser38 makes a direct backbone-backbone H-bond contact with Gln82 of VEGF-A; and, residue 40 in D-RFX001makes two additional direct H-bonds involving the main chain amide bonds of residues 82 and 84 of VEGF-A. Similarly, the backbone amide oxygen atom of residue 23 of the D-RFX001 molecule makes a direct polar contact with the side chain of Asn55 residue of VEGF. In addition to the direct H-bonding network, another feature of the D-RFX001–VEGF-A interaction is the presence of a salt bridge at the binding interface between the side chain of Asp39 of D-RFX001 and the side chain of His83 of VEGF-A (Fig. 5, bottom, middle). There are several other D-RFX001–VEGF-A hydrogen bonding networks mediated by well defined water molecules as shown in Fig. 5, bottom, right.
Comparison of Protein Enantiomers.
In order to compare the enantiomeric forms of the proteins present in the unit cell, we also solved the structure in the space group P21‡. This procedure allows the enantiomeric protein molecules present in the complex to vary independently of one another in optimizing the fit of the molecular structures to the experimental electron densities. The P21 asymmetric unit contained a total of six synthetic protein molecules: Two D-RFX001 protein molecules bound to the opposite poles of one L-VEGF-A molecule; and, two L-RFX001 protein molecules bound to the opposite poles of one D-VEGF-A molecule (Fig. 4B). Because the complex crystallized in the centrosymmetric monoclinic space group P21/n, it possesses an inversion center. Comparison of the coordinates of main chain nonhydrogen atoms between enantiomeric VEGF-A molecules which are related by inversion gave an rmsd value of 0.2 Å. Comparisons between enantiomers for the four RFX001 protein molecules found in the P21 asymmetric unit are more complicated: Two enantiomeric pairs are related by inversion (L1-D2 or L2-D1 in Fig. 6), and two enantiomeric pairs are NOT related by inversion (L1-D1 or L2-D2 in Fig. 6). Comparison of pairs of enantiomeric RFX001 protein molecules that are related by inversion gave a main chain rmsd value of just 0.1 Å. Comparison of pairs of enantiomers that are NOT related by inversion gave a main chain rmsd value of 0.5 Å; this larger value is not unexpected, because in this case the enantiomers are in distinct environments within the crystal.
Fig. 6.
Comparison of the structures of the four RFX001 protein molecules in the {VEGF-A + D-RFX001} racemic complex solved in P21. The asymmetric unit (inset above) contained a total of six synthetic protein molecules. Backbone rmsd values for comparison of all possible pairs of RFX001 protein molecules are shown. The four RFX001 protein molecules are related as follows: Two enantiomeric pairs are related by inversion (L1-D2 or L2-D1), and two enantiomeric pairs are NOT related by inversion (L1-D1 or L2-D2). For the enantiomeric forms of RFX001 related by inversion, main chain rmsd value was just 0.1 Å. For the nonsymmetry related RFX001 pairs, the main chain rmsd value was 0.5 Å.
Conclusions
Total chemical synthesis of the mirror image form of VEGF-A has enabled the identification of an engineered protein ligand with high affinity and specificity for the D-VEGF-A protein molecule. The small protein GB1 was used as a unique scaffold to display a designed diversity library of mutants. Screening against D-VEGF-A, followed by a round of affinity maturation, identified mutants with good affinity and enabled the design of a point mutant with high affinity for D-VEGF-A. Chemical synthesis of the mirror image form of the point mutant gave a D-protein ligand that bound to L-VEGF-A but as expected did not bind to D-VEGF-A. The D-protein ligand acted as an antagonist of natural VEGF binding to its cognate receptor Flt1. The high resolution X-ray structure of the heterochiral {L-VEGF-A + D-protein antagonist} was determined by racemic protein crystallography. Detailed analysis of the interaction between the D-protein antagonist and VEGF-A showed that the binding interface comprised a contact surface area of approximately 800 Å2 in accord with our design objectives, and that the D-protein antagonist binds to the same region of VEGF-A that interacts with VEGFR1-domain 2.
Using the D-protein ligand designed by phage display and the corresponding L-protein target molecule, we have shown that the determination of the high resolution X-ray structures of even large protein complexes can be facilitated by racemic crystallography. Experimentally, this work involved the total chemical synthesis of the enantiomeric forms of protein molecules of 6.4 kDa and 23.8 kDa. Crystallization of a racemic mixture of these synthetic proteins in appropriate stoichiometry gave a racemic protein complex of more than 73 kDa containing six synthetic protein molecules. The structure shown in Fig. 4 is the first example of a new class of protein complex, one consisting of two protein molecules of different amino acid sequence and with opposite chiralities that specifically interact with one another. Based on the principles of stereochemistry and associated nomenclature, we have termed this class a ‘heterochiral protein complex’ (28).
We believe that this work illustrates the great potential of a systematic chemical protein synthesis plus protein phage display approach to the development of D-proteins as a previously unexplored class of molecules for antagonizing the action of natural protein molecules. Such D-protein antagonists may have significant advantages as human therapeutics.
Methods
Chemical Protein Synthesis.
Total chemical synthesis of D-VEGF-A was performed as reported for the synthesis of biologically active VEGF-A (19). Final HPLC purification of the full length synthetic 102 residue polypeptide, folding with concomitant formation of disulfides gave synthetic D-VEGF protein with correct mass of 23,849.3 ± 0.7 Da by direct infusion electrospray ionization. Details are given in the SI Appendix.
Phage Display.
The gene encoding the wild-type streptococcal protein GB1 domain sequence (22) was cloned into a display vector as N-terminal fusion to truncated protein 3 of M13 filamentous phage. A subset of 15 contiguous solvent exposed residues was chosen for randomization. Oligonucleotides with degenerate codon KHT (encoding Y, A, D, S, F, V) were used to construct a library of 8 × 109 transformants by previously described protocols (29, 30). Four rounds of selection against D-VEGFA were carried out following essentially the same protocols previously described (30). Because limited diversity (Y, A, D, S, F, V) was used in the initial library, we prepared affinity maturation libraries to allow all 20 amino acids to occur at each randomized position. A library of 1 × 109 transformants was obtained and selections were performed as described in the SI Appendix.
Racemic Protein Crystallography.
The heterochiral protein complex was crystallized from the racemic mixture using 1∶2 stoichiometry of protein∶ligand. Diffraction data sets were collected to a resolution of 1.6 Å at the Advanced Photon Source, Argonne National Laboratory. The structures were solved by molecular replacement with the program PHASER (31) using the inverted and noninverted coordinates of previously reported X-ray structures of synthetic L-VEGF(8–109) (PDB code 3QTK) and GB1 (PDB code 2QMT) as search models. Full details are given in the SI Appendix.
Supplementary Material
ACKNOWLEDGMENTS.
Use of NE-CAT beamline 24-ID at the Advanced Photon Source is supported by award RR-15301 from the National Center for Research Resources at the National Institutes of Health. Use of the Advanced Photon Source is supported by the Department of Energy, Office of Basic Energy Sciences, under contract no. DE-AC02-06CH11357. This work was supported by funds from the University of Chicago, the University of Toronto, and by Reflexion Pharmaceuticals.
Footnotes
Conflict of interest statement: This research has been carried out at the University of Chicago and the University of Toronto as part of a research program funded by the two universities under agreements with a start up company, Reflexion Pharmaceuticals, Incorporated. Both universities have minor equity positions in Reflexion. Ault-Riché, Kent, and Sidhu are founders of Reflexion. With the exception of Joshua Lowitz, all the authors of this paper own equity in Reflexion, and thus each of these authors declares a conflict of interest.
*This Direct Submission article had a prearranged editor.
Data deposition: Crystallography, atomic coordinates, and structure factors have been deposited in the Protein Data Bank, www.pdb.org [PDB ID codes 4GLU (D-VEGF-A), 4GLS (racemic complex in space group P21), and 4GLN (racemic complex in space group P21/n)].
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1210483109/-/DCSupplemental.
*Other potential advantages of racemic protein crystallography include: Facilitated crystallization to give well-ordered racemic crystals that diffract to high resolution; and, in the centrosymmetric space groups that can only be formed from a racemic mixture, phases of the reflections are quantized (e.g. for P1
or P21/c it is 0 or π radians), which can simplify structure solution (2, 5, 32).
†There is a two-fold axis of symmetry in the homodimeric VEGF-A protein molecule (17,18); hence, one molecule of VEGF-A was expected to bind two molecules of the D-protein antagonist.
‡Solving a structure in the centrosymmetric space group P21/n involves a mathematical inversion that averages the electron densities of the protein enantiomers, and thus obscures any potential differences that may exist.
References
- 1.Siegel JS. Homochiral imperative of molecular evolution. Chirality. 1998;10:24–27. [Google Scholar]
- 2.Zawadzke LE, Berg JM. The structure of a centrosymmetric protein crystal. Proteins Struct Funct Genet. 1993;16:301–305. doi: 10.1002/prot.340160308. [DOI] [PubMed] [Google Scholar]
- 3.Pentelute BL, et al. X-ray structure of snow flea antifreeze protein determined by racemic crystallization of synthetic protein enantiomers. J Am Chem Soc. 2008;130:9695–9701. doi: 10.1021/ja8013538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Mandal K, et al. Racemic crystallography of synthetic protein enantiomers used to determine the X-ray structure of plectasin by direct methods. Protein Sci. 2009;18:1146–1154. doi: 10.1002/pro.127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Matthews BW. Racemic crystallography—easy crystals and easy structures: What’s not to like? Protein Sci. 2009;18:1135–1138. doi: 10.1002/pro.125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mortenson DE, Satyshur KA, Guzei IA, Forest KT, Gellman SH. Quasiracemic crystallization as a tool to assess the accommodation of noncanonical residues in nativelike protein conformations. J Am Chem Soc. 2012;134:2473–2476. doi: 10.1021/ja210045s. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mandal K, et al. Design, total chemical synthesis, and X-ray structure of a protein having a novel polypeptide chain topology. Angew Chem Int Ed. 2012;51:1481–1486. doi: 10.1002/anie.201107846. [DOI] [PubMed] [Google Scholar]
- 8.Pasteur L. Mémoire sur la relation qui peut exister entre la forme cristalline et la composition chimique, et sur la cause de la polarisation rotatoire. Compt Rend. 1848;26:535–538. [Google Scholar]
- 9.Fischer E. Einfluss der Konfiguration auf die Wirkung der Enzyme. Ber Dtsch Chem Ges. 1894;27:2985–2993. [Google Scholar]
- 10.de Lisle Milton RC, Milton SCF, Kent SBH. Total chemical synthesis of a D-enzyme: The enantiomers of HIV-1 protease demonstrate reciprocal chiral substrate specificity. Science. 1992;256:1445–1448. doi: 10.1126/science.1604320. [DOI] [PubMed] [Google Scholar]
- 11.Schumacher TNM, et al. Identification of D-peptide ligands through mirror-image phage display. Science. 1996;271:1854–1857. doi: 10.1126/science.271.5257.1854. [DOI] [PubMed] [Google Scholar]
- 12.Welch BD, et al. Design of a potent D-peptide HIV-1 entry inhibitor with a strong barrier to resistance. J Virol. 2010;84:11235–11244. doi: 10.1128/JVI.01339-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Liu M, et al. D-peptide inhibitors of the p53-MDM2 interaction for targeted molecular therapy of malignant neoplasms. Proc Nat Acad Sci USA. 2010;107:14321–14326. doi: 10.1073/pnas.1008930107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Funke SA, Willbold D. Mirror image phage display—a method to generate D-peptide ligands for use in diagnostic or therapeutical applications. Mol BioSyst. 2009;5:783–786. doi: 10.1039/b904138a. [DOI] [PubMed] [Google Scholar]
- 15.Dintzis HM, Symer DE, Dintzis RZ, Zawadzke LE, Berg JM. A comparison of the Immunogenicity of a pair of enantiomeric proteins. Proteins Struct Funct Genet. 1993;16:306–308. doi: 10.1002/prot.340160309. [DOI] [PubMed] [Google Scholar]
- 16.Ferrara N, Mass RD, Campa C, Kim R. Targeting VEGF-A to treat cancer and age-related macular degeneration. Annu Rev Med. 2007;58:491–504. doi: 10.1146/annurev.med.58.061705.145635. [DOI] [PubMed] [Google Scholar]
- 17.Wiesmann C, et al. Crystal structure at 17 Å resolution of VEGF in complex with domain 2 of the Flt-1 receptor. Cell. 1997;91:695–704. doi: 10.1016/s0092-8674(00)80456-0. [DOI] [PubMed] [Google Scholar]
- 18.Wiesmann C, et al. Crystal structure of the complex between VEGF and a receptor-blocking peptide. Biochemistry. 1998;37:17765–17772. doi: 10.1021/bi9819327. [DOI] [PubMed] [Google Scholar]
- 19.Mandal K, Kent SBH. Total chemical synthesis of biologically active vascular endothelial growth factor. Angew Chem Int Ed. 2011;50:8029–8033. doi: 10.1002/anie.201103237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dawson PE, Muir TW, Clark-Lewis I, Kent SBH. Synthesis of proteins by native chemical ligation. Science. 1994;266:776–779. doi: 10.1126/science.7973629. [DOI] [PubMed] [Google Scholar]
- 21.Gallagher T, Alexander P, Bryan P, Gilliland GL. Two crystal structures of the B1 immunoglobulin-binding domain of Streptococcal protein G and comparison with NMR. Biochemistry. 1994;33:4721–4729. [PubMed] [Google Scholar]
- 22.Gronenborn AM, et al. A novel, highly stable fold of the immunoglobulin binding domain of streptococcal protein G. Science. 1991;253:657–661. doi: 10.1126/science.1871600. [DOI] [PubMed] [Google Scholar]
- 23.Zoller F, Haberkorn U, Mier W. Miniproteins as phage display-scaffolds for clinical applications. Molecules. 2011;16:2467–2485. doi: 10.3390/molecules16032467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Fellouse FA, et al. Molecular recognition by a binary code. J Mol Biol. 2005;348:1153–1162. doi: 10.1016/j.jmb.2005.03.041. [DOI] [PubMed] [Google Scholar]
- 25.Fellouse FA, et al. High-throughput generation of synthetic antibodies from highly functional minimalist phage-displayed libraries. J Mol Biol. 2007;373:924–940. doi: 10.1016/j.jmb.2007.08.005. [DOI] [PubMed] [Google Scholar]
- 26.Fairbrother WJ, et al. Novel peptides selected to bind vascular endothelial growth factor target the receptor-binding site. Biochemistry. 1998;37:17754–17764. doi: 10.1021/bi981931e. [DOI] [PubMed] [Google Scholar]
- 27.Matthews BW. Solvent content of protein crystals. J Mol Biol. 1968;33:491–497. doi: 10.1016/0022-2836(68)90205-2. [DOI] [PubMed] [Google Scholar]
- 28.Mislow K, Bickart P. An epistemological note on chirality. Isr J Chem. 1976/77;15:1–6. [Google Scholar]
- 29.Kunkel TA, Roberts JD, Zakour RA. Rapid and efficient site-specific mutagenesis without phenotypic selection. Methods Enzymol. 1987;154:367–382. doi: 10.1016/0076-6879(87)54085-x. [DOI] [PubMed] [Google Scholar]
- 30.Fellouse FA, Sidhu SD. In: Making and Using Antibodies. Howard GC, Kaser MR, editors. Boca Raton, FL: CRC Press; 2007. pp. 157–180. [Google Scholar]
- 31.McCoy AJ, et al. Phaser crystallographic software. J Appl Crystallogr. 2007;40:658–674. doi: 10.1107/S0021889807021206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Mackay AL. Crystal enigma. Nature. 1989;342:133. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






