Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Jul 14.
Published in final edited form as: Prog Biophys Mol Biol. 2014 Aug 10;116(0):92–100. doi: 10.1016/j.pbiomolbio.2014.08.004

Advantages of Crystallographic Fragment Screening: Functional and Mechanistic Insights from a Powerful Platform for Efficient Drug Discovery

Disha Patel 1, Joseph D Bauman 1, Eddy Arnold 1
PMCID: PMC4501029  NIHMSID: NIHMS706120  PMID: 25117499

Abstract

X-ray crystallography has been an under-appreciated screening tool for fragment-based drug discovery due to the perception of low throughput and technical difficulty. Investigators in industry and academia have overcome these challenges by taking advantage of key factors that contribute to a successful crystallographic screening campaign. Efficient cocktail design and soaking methodologies have evolved to maximize throughput while minimizing false positives/negatives. In addition, technical improvements at synchrotron beamlines have dramatically increased data collection rates thus enabling screening on a timescale comparable to other techniques. The combination of available resources and efficient experimental design has resulted in many successful crystallographic screening campaigns. The three-dimensional crystal structure of the bound fragment complexed to its target, a direct result of the screening effort, enables structure-based drug design while revealing insights regarding protein dynamics and function not readily obtained through other experimental approaches. Furthermore, this “chemical interrogation” of the target protein crystals can lead to the identification of useful reagents for improving diffraction resolution or compound solubility.

Keywords: X-ray crystallography, fragment screening, HIV, influenza

1. Introduction

Fragment-based drug discovery (FBDD) has gained popularity as an effective and efficient approach for identifying pharmacophores for lead development. FBDD entails screening of small-molecule libraries against a target protein to identify weakly potent, bioactive molecules. The smaller, less complex nature of fragments increases the probability of binding to a target protein, resulting in higher hit rates and efficient search of diverse chemical space (Hadjuk et al., 2007; Hesterkamp et al., 2008; Erlanson et al., 2004). Hits identified from fragment screening do not require deconstruction and can be efficiently developed for specificity and inhibitory activity.

Today, an array of biophysical methods has been developed to rapidly identify weakly binding fragments to a target protein. Nuclear magnetic resonance (NMR) and X-ray crystallography were among the first applied and, more recently, surface plasmon resonance (SPR) and differential scanning fluorimetry (thermal shift) have become popular as primary screening methods. X-ray crystallography provides structural information that enables rapid and efficient assessment of hits with respect to synthetic tractability for structure-based drug design. With respect to FBDD, high-resolution X-ray crystal structures provide a three-dimensional perspective of the binding landscape for potential fragment expansion, merging, or linking. Compared with other screening methods, X-ray crystallographic fragment screening is unmatched in terms of the range of ligand binding affinity (sub-nanomolar to millimolar, limited only by solubility of the ligand) and the size of the target macromolecule (up to megadaltons, limited only by crystal robustness and resolution limit). However, application of X-ray crystallography as a primary screening method has been under-appreciated and often considered to be impractical due to its relatively low throughput and highly resource-intensive nature.

Recent advances in crystal transport, robotic crystal mounting, powerful detectors, and automated data collection software have significantly improved the throughput and reduced the amount of labor associated with this approach. The technical improvements combined with an efficient strategy for data collection and rapid hit identification makes X-ray crystallography an attractive approach for primary screening, in that it not only identifies fragment binding but also provides a three-dimensional structure that can enable and facilitate rapid structure-based optimization.

2. Historical Perspective

Mattos and Ringe (1996) were among the first to demonstrate the potential for systematic exploration of potential small molecule binding sites by soaking organic solvents into crystals that diffract X-rays to high resolution. This approach, referred to as multiple solvent crystallographic structures (MSCS), uses organic solvents as probes for locating and characterizing potential binding sites for ligand design (Allen et al., 1996; Mattos and Ringe, 1996; English et al., 1999). Superimposing crystal structures from soaking with different organic solvents can provide insight regarding the conformational landscape, chemical complementarity, plasticity, and surface hydration for a binding site of interest. In addition, the structural information, with the aid of molecular modeling, can be used to develop pharmacophoric models or improve binding affinity and specificity for lead optimization (Mattos et al., 2006; Behnen et al., 2012). However, difficulties in reliably predicting optimal binding modes and overcoming exquisitely demanding stereochemical requirements for linked molecules have limited the application of MSCS for drug discovery.

In 1998, Stout et al. developed a deconstruction approach using E. coli thymidylate synthase (TS) and its natural substrate, deoxyuridine monophosphate (dUMP), to understand ligand binding. Structural analysis of small molecule fragments of dUMP in complex with TS revealed the relative contributions of each individual chemical component to the overall binding of dUMP. This modular approach to deconvolution of a ligand hinted that the opposite methodology where compatible fragments can be merged in an additive manner might be promising.

Christophe Verlinde and Wim Hol were among the first to utilize X-ray crystallography to identify small molecule binding from a fragment library of 384 compounds to triose phosphate isomerase from Trypanosoma brucei (Verlinde et al., 1997). For high-throughput purposes, crystallographic screening was conducted by soaking pre-formed crystals into three mixtures, commonly referred to as cocktails, of 128 compounds (Verlinde et al., 2009). Electron density from one cocktail soak indicated fragment binding; however, compound identification was not readily possible due to a limited resolution of 2.8 Å. Subsequent deconvolution indicated that the observed electron density was likely from a product of a chemical reaction between fragments within the cocktail. Based on the experimental results, it was clear that X-ray crystallography in combination with cocktails could be used for fragment hit identification. However, design of these cocktails should take into consideration chemical reactivity, solubility, and shape diversity.

Initial studies conducted by both academic and industrial groups pioneered the use of X-ray crystallography as a primary screening method for fragment-based drug discovery. Along the way, several important experimental factors were found to be critical for a successful crystallographic screening campaign. This review will describe some of the experimental challenges and approaches associated with X-ray crystallographic fragment screening. In addition, interesting biological and technical insights gained through crystallographic fragment screening campaigns will also be highlighted.

3. Experimental Considerations

3.1 Cocktail Design

The approach to cocktail crystallography devised by Nienaber and colleagues (2000) at Abbott Laboratories led to the development of the CrystaLEADS system. To enable rapid hit identification, cocktails were designed to maximize the structural diversity of the fragments. Urokinase was used to demonstrate the applicability of CrystaLEADS for hit identification. Nine crystals of urokinase were used to screen a fragment library of 61 compounds grouped into cocktails of 6-8 fragments. Screening identified five promising fragments hits, of which 8-hydroxy-2-aminoquinoline was found to be the most potent. With the availability of a crystal structure combined with previously established SAR, a single modification of the fragment hit allowed for extension into an adjacent pocket leading to a 151-fold improvement in potency. More commonly, the CrystaLEADS approach entails grouping a 10,000 compound library into 100 fragments per cocktail. The application of this cocktail design was demonstrated by crystallographic screen of 7,8-dihydroneopterin aldolase (DHNA). Screening identified two fragment hits, which were subsequently developed into lead compounds with μM activity against DHNA (Sanders et al., 2004).

Astex Pharmecuticals pioneered a successful approach for cocktail crystallography as part of their proprietary Pyramid platform. Cocktail design limited the number of fragments per cocktail to four and strongly emphasized chemical diversity within a cocktail (Hartshorn et al., 2005). This allowed for efficient deconvolution and reduced the likelihood of multiple fragment binding. In contrast, Johnson and Johnson’s approach to cocktail design focused on designing cocktails of five compounds with similar shape (Spurlino, 2011). This approach took advantage of multiple-fragment binding within one cocktail to strengthen the electron density for a fragment hit.

The Biomolecular Structure Center at University of Washington as part of the Medical Structural Genomics for Protozoan Parasites Consortium (MSGPPC) reported using a computational approach for cocktail design. A 680 compound library was designed from an initial library of 9,486 compounds using a series of filters to limit the size, number of heteroatoms, hydrophobicity, and number of rotatable bonds as well as to remove compounds that were chemically reactive and expensive. A shape fingerprint analysis was conducted to design 68 cocktails of 10 structurally diverse compounds. This not only takes advantage of shape diversity to facilitate deconvolution but also allows for rapid screening.

3.2 Protein crystallization

Verlinde et al. (2009) described the importance of protein crystallization towards the success of a fragment screening campaign. The MSGPPC screened a total of 26 protein targets, of which 19 were found to be impervious to fragment binding. The high percentage of nonproductive targets was due to poor resolution (lower than 2.8 Å), reduced crystal stability or robustness in the soaking conditions, or simply no observed fragment binding. The number of cocktails that could be successfully screened was limited by either poor resolution or crystal robustness for certain protein targets. For instance, only 42 useful datasets were obtained from soaking 66 cocktails into 147 crystals of Leishmania major coproporphyrinogen III oxidase. Similarly, only 42 of the 68 cocktails screened against Leishmania naiffi uracil-DNA glycosylase produced datasets useful for fragment identification.

Thus, the success and throughput of crystallographic fragment screening is heavily dependent on resolution and robustness of the protein crystal. Ideally, protein crystals that diffract X-rays to high resolution (better than 2.5 Å) help to reduce ambiguities that may occur upon ligand binding. Crystal engineering and proteolytically stable protein constructs can be used to improve resolution for proteins that are either highly flexible or poorly diffracting. Crystal robustness in a high concentration fragment soaking solution is essential as it allows for rapid screening and reduces the amount of labor associated with this approach. Either alternative crystal forms or co-crystallization will be necessary if the crystals are either too fragile to handle or behave poorly during soaking.

Prior to conducting a fragment screening campaign for HIV-1 reverse transcriptase (RT), crystal engineering was used to improve the resolution quality and reproducibility of protein crystals by introducing point mutations and C-terminal truncations to reduce the surface entropy of the protein (Bauman et al., 2008). Despite an improvement in resolution, crystal robustness proved to be a major challenge during our fragment screening campaign against HIV-1 RT (Bauman et al., 2013a). Originally, a high-resolution construct of the apo HIV-1 RT (RT69A) was selected for screening. However, the RT69A crystals were not able to withstand the soaking conditions, resulting in a decrease in the throughput of the method. The high sensitivity of these crystals was speculated to be a consequence of fragment binding to the allosteric non-nucleoside reverse transcriptase inhibitor (NNRTI) binding pocket. The formation of this binding site requires large conformational changes, resulting in collapse of the crystal lattice. Of the crystals that survived the cocktail soaks, only one out of ten crystals diffracted to useful resolution. To improve throughput and data collection, an alternate protein construct, RT52A, in complex with a potent NNRTI, rilpivirine, was used to block access to the NNRTI-binding pocket and limit protein flexibility. These crystals were extremely robust with ~90% of crystals surviving cocktail soaks and maintaining high quality X-ray diffraction (1.8-2.0 Å resolution).

It is necessary for the protein in the selected crystal form to be in a biologically relevant conformation and binding sites of interest should not be occluded by protein-protein crystal contacts. This helps to maintain a correlation with the enzymatic assays that are commonly used to characterize the inhibitory activity of the fragment hits. Prior to conducting a fragment screening campaign against the influenza A endonuclease (PAN), it was necessary to identify a new crystal form because the two published crystal forms were not useful for structure-based drug design. One crystal form was very difficult to reproduce while other published structure of PAN had an active site occluded by a crystallographically symmetric copy of the protein (Dias et al., 2009; Yuan et al., 2009; Bauman et al., 2013b). Additionally, binding sites should not be occupied by chemical components from the crystallization condition. This can result in false negatives since the fragments, at a lower concentration, are competing with the crystallization components for binding.

3.3 Optimization of Fragment Soaking

Crystallographic fragment screening is typically conducted at high compound concentrations (10 mM and higher) to ensure full occupancy for low-affinity ligands. The most common method for screening entails soaking pre-formed crystals into a solution of fragment(s) for a period of time, which can range from as short as several seconds to days. To ensure crystal stability and longevity during the soak, diluting the fragment or cocktail using the crystallization buffer is necessary. However, this may come at the expense of fragment solubility and therefore may limit the concentration of fragment available for binding. Unlike most biophysical methods, it is not required for the fragment to be fully soluble at the tested concentration as it is possible to recover crystals from soaking drops that are only partially transparent due to precipitation. During the screening campaign against HIV-1 RT, fragment solubility in the crystallization buffer was improved using L-arginine as an additive (Bauman et al., 2013a). Initial soaking experiments revealed electron density for some hydrophobic fragments only in the presence of L-arginine. We speculate that this improvement is most likely due to the guanidinium functionality of L-arginine, which can form hydrogen bonding and/or π-stacking interactions (Flocco et al., 1994).

The solvent used for solubilizing fragments can have a tremendous impact on crystal stability during soaking. For instance, many protein crystals can only tolerate up to 10% (v/v) dimethylsulfoxide (DMSO), a commonly used solvent for fragment solutions. Soaking at higher concentrations of DMSO can either cause the crystal lattice to collapse or have a detrimental effect on X-ray diffraction. Incorporating DMSO into the protein crystallization condition can circumvent sensitivity of protein crystals to DMSO (Perryman et al., 2010). The cryoprotective properties of DMSO (Atwell et al., 2010) can be advantageous as it allows for a single solution for fragment soaking and cryoprotection (Bauman et al., 2013 a,b). This minimizes crystal damage by mechanical stress and reduces the amount of time needed for soaking and freezing.

The solvent, due to its higher concentration, may also bind throughout the protein. Initial soaking experiments during the HIV-1 RT fragment screening campaign revealed electron density for DMSO molecules throughout the protein regardless of which cocktail was soaked. A crystal structure obtained from a soak with DMSO without any fragment present revealed 16 binding sites throughout HIV-1 RT (Figure 1). This structure proved to be valuable as a blank reference to avoid misinterpretation of solvent binding as fragment binding (Bauman et al., 2013a). In the event that DMSO should significantly affect crystal stability or fragment binding, an alternate solvent, such as water or methanol, can be used for solubilizing fragments.

Figure 1.

Figure 1

Crystal structure of HIV-1 reverse transcriptase-rilpivirine complex soaked with 20% (v/v) d6-DMSO. Color-coding: p66 subdomains as fingers (blue), palm (red), thumb (green), connection (yellow), and RNase H (orange), and the p51 subunit (cyan). Rilpivirine is shown as yellow and blue spheres, ordered waters are shown as blue dots, and d6-DMSO molecules are shown as green, yellow, and red spheres. Crystal structure revealed a total of 16 d6-DMSO binding sites. This crystal form was very robust to fragment soaking leading to ~90% of crystals yielding high-resolution datasets.

The crystallization condition itself can prove to be problematic for soaking by altering the fragment solubility in solution. Davies et al. (2012) described a fragment screening campaign against a proprietary target PRP4 that was productive but failed to identify carboxylic acid-containing fragments, which were predicted to have activity in an independent assay. Lack of binding for these fragments, was attributed to 200 mM calcium chloride present in the crystallization condition. Specifically, the calcium ion was believed to reduce the solubility of the acid containing fragments and thereby prevent their binding. Replacing the calcium chloride with an alternate chloride salt in the soaking solution enabled fragment binding to be visualized. Thus, optimization of the soaking conditions to ensure protein stability, fragment solubility, and technical ease is a necessary pre-requisite prior to initiating a crystallographic fragment-screening campaign.

3.4 Fragment Screening by Co-crystallization

An alternative approach for fragment screening is the co-crystallization of the protein in the presence of the fragment. This method is resource-intensive since each compound requires a separate crystallization experiment. Thus, co-crystallization is commonly used when fragment soaking either affects crystal stability or fails to identify any hits. Co-crystallization with fragment cocktails can be challenging due to issues with fragment solubility and long-term stability. Furthermore, ligand binding can alter the mobility of the protein thus requiring a change in the crystallization condition. This approach cannot identify weak fragment binders since high concentration of fragments can interfere with crystallization. Perryman et al. (2010) used co-crystallization approach to identify fragment binding to HIV-1 protease after soaking individual fragments into two different crystal forms failed. Co-crystallization screening at 10 and 2.5 mM fragment concentrations using a crystallization condition for a P6122 crystal form of HIV-1 protease produced two new crystal forms, resulting in the identification of three fragment hits.

3.5 Quality Control

Quality control is often considered as an afterthought mainly because the fragments purchased are of nominally high purity. However, it is possible that the silent impurities, such as metal ions, synthetic byproducts, or products of compound degradation, can produce false positives, which can often result in a tremendous waste of time and resources. The high fragment concentrations used in crystallographic screening can exacerbate these issues because even minor contaminants can be present at sufficient concentrations leading to significant electron density changes and false positives.

During the screening campaign against HIV-1 RT, electron density for binding of a commercially obtained fragment, 1, was observed; however, fitting the compound into the density proved to be challenging (Figure 2A). Promising RNase H inhibitory activity in the enzymatic assays for “1” fueled excitement, however, subsequent crystallization experiments by both soaking and co-crystallization failed to demonstrate sensible electron density for this fragment. The next sample of “1” purchased showed a dramatic loss of inhibitory activity and difference electron density. Subsequent crystallographic analyses using anomalous difference maps indicated that the observed electron density was unexpectedly due to contaminating metal ions (Figure 2B). The inhibition was most likely by competition of the contaminating metal with Mg2+ ions used in the RNase H assay. Our and others’ (often anecdotal) experiences suggest that compound quality control can vary widely among commercial vendors. Given the frequent waste of considerable time, effort, and resources pursuing contaminants, rigorous quality control of fragments and cocktails should be an integral component of all fragment-screening platforms.

Figure 2.

Figure 2

Importance of quality control of compounds used for crystallographic screening. A.) Electron density from a difference map (contoured at 3.0σ) observed upon soaking of 1 into crystals of HIV-1 RT-rilpivirine complex. Electron density envelope does not accommodate the ligand in question. B.) Electron density with metal ions and coordinating water molecules modeled in. Anomalous density indicating probable metal binding is also shown (cyan mesh contoured at 3.0σ). The metal atoms were unexpectedly present in a “pure” commercial sample; apparent high inhibition (RNase H) led to excitement that was mitigated when fractionation and purification of the compound resulted only in loss of activity.

3.6 High-Throughput Crystallography

Application of X-ray crystallography as a primary method for fragment screening can be limited by the lack of high throughput and its highly specialized nature. However, recent technological improvements in crystal freezing and synchrotron resources have significantly reduced these hurdles. For our screening campaigns, use of a cassette or puck system facilitated the transport of larger quantities of crystals to the synchrotron. A total of seven ALS-style pucks with each puck holding 16 crystals can be transported in a CP100 Dewar. This allowed for the transport of 112 crystals per dewar in comparison to only 56 crystals using the traditional cane transport system. The pucks allowed for rapid freezing of the crystals and reduced handling error compared to the cane system. Additionally, use of the puck system significantly reduced ice buildup around the crystals, thus reducing the time spent at the beam rinsing the crystal with liquid nitrogen to remove the ice. The compatibility of puck systems with the robotic crystal automounters available at synchrotrons further facilitated rapid data collection by reducing the time, labor, and the risk to the samples associated with manual mounting and dismounting a crystal. Rapid identification of potential problems with the automounter system is critical because malfunctions can lead to crystal loss through temperature fluctuations and frosting. These problems, though common in the early days of these systems, are now rare and are more than compensated for by the extraordinary increases in throughput and reliability enabled by this instrumentation.

Taking advantage of beamline resources, we developed a high-speed data collection strategy aimed at maximizing data collection and enabling rapid hit identification. Initial screening consisted of low-resolution data collection by limiting the exposure time and increasing the oscillation range of the frames. This reduced the duration of dataset collection with a minimal impact on the diffraction resolution of the crystals. Fragment binding was determined from a visual inspection of difference Fourier maps with Fo-Fo coefficients derived from subtracting the structure factor magnitudes of the DMSO reference dataset from those of the fragment/cocktail soaked crystal. An advantage of the Fo-Fo difference map is that phases from the reference structure can be used without requiring refinement of the fragment-containing dataset. Crystals with changes in the electron density due to fragment binding and/or repositioning of protein residues or solvent molecules were prioritized for further studies. Although this can be done through standard refinement software such as CCP4 (Winn et al., 2011) and PHENIX (Adams et al., 2010), specialized software, such as AutoSolve (developed at Astex), has been developed to automate this process (Blundell et al., 2002). Crystals for which fragment binding was detected were remounted and subjected to high-resolution data collection. Both the high and low-resolution datasets were then merged to improve the electron density for the bound fragment for subsequent refinement and analysis.

Advancements in detector technology have also contributed to the improvement in efficiency and throughput of X-ray crystallography as a screening method. The ultra-rapid readout and high quantum efficiency associated with the Pilatus 6M detector allows for continuous crystal rotation without opening and closing of the shutter after each frame (Henrich et al., 2009). The Pilatus 6M detector reduced the average time for high-resolution data collection for an influenza endonuclease crystal from six to two minutes. When combined with an automounter, we were able to collect 15 datasets per hour with approximately half of the time used for mounting/unmounting and crystal centering. As a result, complete datasets for a total of 112 influenza endonuclease crystals were collected within a 12-hour X-ray beam allocation. Due to the rapid data collection with the Pilatus detector, it was no longer necessary to utilize the high-speed data collection strategy previously described. These improvements in data collection and processing have dramatically reduced the difficulties commonly associated with crystallographic fragment screening.

4. Biological Applications of Fragment Screening

4.1 Probing Protein Dynamics

Despite the constrained environment within a crystal lattice, proteins and other macromolecules still possess significant degrees of flexibility, which can be investigated through crystallographic fragment screening. The extent of protein dynamics within a crystal depends on the intrinsic conformational flexibility of the protein target, the presence of bound ligands, the crystal form under investigation, and conditions including temperature, pH, and buffer and salt concentrations. The HIV-1 RT screening campaign revealed not only fragment binding to existing crevices but also successfully trapped previously unseen protein conformations. In particular, fragment binding to the Knuckles pocket, which is normally a buried cavity in previously reported RT structures, was successfully characterized despite a 1.2 Å backbone shift in the fingers subdomains (Figure 3). Time-based soaking experiments ranging from 10 seconds to 30 minutes revealed fragments binding to the Knuckles, demonstrating the dynamic nature of RT in crystallo. Visualization of these dynamically accessible states is possible when fragment binding stabilizes a transient protein conformation. Similar observations were made from a crystallographic screening campaign targeting glutaryl-CoA dehydrogenase from B. pseudomallei (BpGCDH) (Begley et al., 2011). Fragment binding to the apo BpGCDH induces a conformational change in which the binding site for FAD, a co-factor, becomes occupied with protein residues. Presumably, the fragment can inhibit the enzyme through stabilization of an alternative co-factor pocket conformation.

Figure 3.

Figure 3

Crystal structure of RT-rilpivirine with a fragment bound at the Knuckles site, revealing a novel allosteric inhibitory site for HIV-1 RT that is potentially druggable. This pocket is not observed to be open in any of hundreds of published RT structures, nor in the many hundreds of structures solved during our fragment screening campaign, unless occupied by similar fragments. Residues that shift in presence of bound ligand are shown in cyan and in their non-bound positions as green. Y115 and F116 form part of the substrate dNTP-binding site, suggesting that these conformational changes may account for the observed inhibition by fragments binding to this site.

Probing protein dynamics using fragments is not limited to small-scale, transient conformations, larger domain interactions can also be explored. Saalau-Bethell and colleagues (2012) at Astex Therapeutics altered the equilibrium between the active and inactive conformations of hepatitis C virus (HCV) NS3-NS4A protein. Crystallographic screening identified fragment binding to a new allosteric site located between the protease and helicase domains. Crystal structure of a chemical lead developed from the original hit bound to HCV NS3-NS4A suggests that compound binding to the allosteric site stabilizes the closed conformation, thereby preventing the domain movement necessary to form the active conformation, which results in a loss of protease activity necessary for HCV polyprotein processing.

4.2 Functional Studies

Fragments are not solely limited to probing protein flexibility but can also enable a better understanding of enzymatic function. Two of the eight fragments identified from a crystallographic fragment screening against influenza PAN revealed a previously undetected third metal ion within the endonuclease active site (Bauman et al., 2013b). This was unexpected as a two metal ion-binding model had been established based on previous structural and calorimetric studies. To determine whether this third metal binding was simply a non-biologically relevant artifact of fragment binding, crystals of apo EDTA-treated PAN were soaked with a range of metal ions (Mn2+, Zn2+, Ca2+, and Ni2+) at high concentrations. Of the metals soaked, the Ca2+ soak revealed strong electron density for the third metal (Figure 4) in the absence of fragment. Based on structural studies with related endonucleases, we hypothesize that this third metal ion may stabilize the negatively charged transition state during catalysis (Horton et al., 2004). Thus, this third metal ion binding to the endonuclease active site is not simply a by-product of fragment binding and warrants further investigation to fully understand its functional and mechanistic relevance.

Figure 4.

Figure 4

Cartoon representation of apo PAN active site from a 100 mM CaCl2 soak (inspired by observation of a third Mn2+ during fragment screening) showing electron density (light blue mesh contoured at 1.5σ) for the metal ions and coordinating waters. The locations of the metal ions from a fragment soak are displayed in green whereas the locations of the calcium cations from a 100 mM soak are in yellow. The third metal had not been observed in previously reported structures and may have functional significance in stabilization of the reaction products.

Small-molecule fragments can also be used to characterize the enzymatic function of unknown proteins as demonstrated by Shumilin et al. (2012). The natural substrates for two unknown proteins, YjeF_N and PF01256, were identified from a crystallographic screening of a small molecule library of metabolites followed by a focused screen of structurally similar molecules. By visualizing metabolites, reactants or reaction products bound directly in the enzyme active site, specific follow-up experiments could be prioritized to confirm the putative enzymatic function of the protein. Thus, fragment screening by X-ray crystallography has proven to be a powerful tool for probing the biology of enzyme structure and function.

5. Technical Insights

5.1 Halogens

Halogenated compounds are often under-represented in fragment libraries due to the arbitrary molecular mass cutoffs. However, these fragments are highly versatile allowing for rapid hit identification, specifically anomalous X-ray diffraction (Blaney et al., 2006) and isotope enrichment for NMR spectroscopy (Dalvit et al., 2002) can be used to detect binding of brominated and fluorinated fragments, respectively. Furthermore, halogenated fragments can serve as molecular probes for binding through favorable halogen-bonding interactions. Wilcken et al. (2012) assembled and screened a halogen-enriched fragment library to take advantage of predicted halogen-bonding interactions based on docking studies to develop small molecule stabilizers of a mutant Y220C p53.

Analysis of fragment hits from the HIV-1 RT fragment screening campaign revealed a preference for halogenated fragments. Specifically, seven out of 29 fluorinated fragments and four out of 17 brominated fragments screened were identified as hits. The higher hit rates for halogenated fragments indicated that a halogen-focused fragment library could be beneficial.

Based on the HIV-1 RT results, Tiefenbrunn et al. (2013) demonstrated that a brominated library could be useful in targeting recalcitrant crystal forms, which have yielded low hit rates during screening. A library of 68 brominated fragments was screened to identify binding to two sites of interest—exosite and flap site—within HIV protease. At least one bromine anomalous peak was detected for 23 of the 68 compounds. However, most of these compounds were not of interest as either electron density for only the bromine atom or binding to interstitial or the TL-3 sites was observed. Clear binding was observed for two fragments—each binding to the exosite and flap site— yielding a hit rate of 3%. The hit rate increased to 6% if compound binding at the TL-3 site is taken into account. Compared to the initial ActiveSite screen (Tiefenbrunn et al., 2013), a four- to eight-fold improvement in hit rate was observed for the brominated library. Thus, the focused brominated fragment library proved to be valuable not only for hit identification, but also for improving hit rates, presumably through halogen-bonding interactions.

5.2 Trimethylamine-N-oxide: A Useful Cryocrystallography Additive

An unexpected observation from soaking crystals of the HIV-1 RT/rilpivirine complex with a wide variety of solutions was the identification of trimethylamine-N-oxide (TMAO) as a useful reagent for improving the diffraction quality of crystals. Soaking at a concentration of 6% (w/v) TMAO improved the resolution of the HIV-1 RT crystals from 1.80 Å to 1.51 Å (Kuroda et al., 2013). A similar improvement in diffraction resolution with TMAO was also observed for crystals of influenza endonuclease and other HIV-1 RT complexes. Inspired by our observations, crystals of alphavirus polyprotein were treated with 5% (w/v) TMAO during cryoprotection, resulting in a similar improvement in resolution (Shin et al., 2012). Improvement in diffraction resolution for both the HIV-1 RT and alphavirus polyprotein crystals was observed solely through soaking of TMAO, not through co-crystallization.

TMAO has been explored previously as a useful biological or crystallographic reagent. At very high concentrations (around 4 M), TMAO by itself can act as an excellent cryoprotective agent (Mueller-Dieckmann et al., 2011). Biological studies have demonstrated that TMAO can improve protein folding and stability. Specifically, TMAO destabilizes unfavored protein conformations (Yancey and Somero, 1979; Yancey et al., 1982; Lin et al., 1994; Wang et al., 1997). Molecular dynamics simulations indicate that TMAO stabilizes the protein structure in solution by enhancing favorable water interactions (Zou et al., 2002). Thus, we hypothesize that the improvement in resolution quality from TMAO soaking is due to stabilization of the protein within the crystal lattice in a manner similar to dehydration of the crystals without any significant changes in the unit cell dimensions.

A fortuitous benefit of the high-resolution crystal structure of HIV-1 RT in complex with rilpivirine was the discovery and clear assignment of a water molecule that makes hydrogen-bonding interactions with the cinnamonitrile group of rilpivirine (Figure 5). Through collaboration with the lab of Robin Hochstrasser, we identified this hydrogen bond to be an important contributing factor for the observed potency of rilpivirine using a combination of two-dimensional vibrational echo spectroscopy and molecular dynamics (Kuroda et al., 2013). The location of this water molecule as initially predicted by the molecular dynamics simulation was not concordant with the 1.80 Å crystal structure of HIV-1 RT in complex with rilpivirine. However, the 1.51 Å resolution crystal structure resulting from the TMAO soak revealed electron density for this water molecule, thus confirming the results from 2D-IR and molecular dynamics studies.

Figure 5.

Figure 5

Crystal structure (1.51 Å resolution) of HIV-1 RT-rilpivirine complex (RT52A construct), showing two water molecules making hydrogen-bonding interactions (dashed lines) with the bound drug molecule. The previously undetected water molecule (upper left corner) may account for some of the increased inhibitory potency of rilpivirine (in yellow) conferred by the cyanovinyl substituent vs. related diarylpyrimidine analogs. The crystal used for data collection was treated with 6% (w/v) TMAO in the cryoprotective solution; without TMAO this crystal form usually yields 1.8 Å resolution datasets.

Although TMAO improved X-ray diffraction quality of crystals, fragment soaking with TMAO as an additive proved to be challenging. Previously identified fragment hits from both HIV-1 RT and influenza PAN screening campaigns no longer bound in the presence of 6% (w/v) TMAO. This may be due to a reduction in fragment solubility in the presence of TMAO, consistent with observation of precipitate formation in fragment/cocktail soaking solutions.

5.3 Multiple Binding

Fragment screening campaigns using biophysical screening methods, such as NMR and SPR, favor single molecule binding to a target protein and discard multiple binders as non-specific. When multiple binders are detected, aggregation rather than specific interactions is usually assumed to be responsible for the observed binding. However, it is possible for viable fragment hits to be discarded as false positives. X-ray crystallography, to a certain degree, can reduce this occurrence by facilitating rapid determination of molecular specificity. Structural characterization of a promiscuous fragment bound to multiple sites on a target protein can guide compound derivatization to improve specificity and potency. This was successfully demonstrated by the development of a new class of influenza endonuclease (PAN) inhibitors (Bauman et al., 2013b). A fragment hit, 5-chloro-3-hydroxypyridin-2(1H)-one, identified as binding to three separate sites, was pursued for structure-based drug design despite its multiple binding modes (Figure 6). Docking using the available crystal structure allowed for rapid and efficient fragment expansion to improve specificity and potency. Further optimization produced a single-binding-mode inhibitor of PAN with significant antiviral activity.

Figure 6.

Figure 6

Structure-guided lead development of a fragment hit, 5-chloro-2,3-dihydroxypyridine, identified from a screening campaign targeting influenza endonuclease. Electrostatic surface (APBS) is shown for each ligand bound structure. The availability of numerous crystal structures allowed for molecular modeling to be used, resulting in not only an increase in the specificity of binding to the active site but also an improvement in the inhibitory activity of the compounds. Merger of substitutions at the 5- and 6-positions yielded compounds with greater than a thousand-fold increase in potency compared to the initial hit.

6. Conclusion

When suitable crystals of the target protein are obtainable, fragment screening by X-ray crystallography has been established to be a very powerful tool for drug discovery and development. In addition, it can also provide exceptional insights into functional aspects of the target protein. Crystallographic fragment screening has been successful in stabilizing previously unseen protein conformations as well as discovery of functional aspects of enzymes. The continual development of high-throughput crystallography and its unique benefits for structure-based drug design make crystallographic fragment screening an excellent tool for characterizing a protein and its interactions.

Acknowledgements

Data collection for unpublished work presented here was conducted at the Cornell High Energy Synchrotron Source (CHESS) and the National Synchrotron Light Source (NSLS) at Brookhaven National Laboratory. CHESS is supported by the NSF & NIH/NIGMS via NSF award DMR-0225180, and the MacCHESS resource is supported by NIH/NCRR award RR-01646. Use of NSLS is supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under Contract No. DE-AC02-98CH10886. We thank Michael Parniak (University of Pittsburgh) for the RNase H inhibition assay for compound 1. We are also thankful to Kalyan Das and other members of the Arnold laboratory for helpful discussions and inspiration for the figures. We are extremely grateful to NIH MERIT Award R37AI027690 and grant P50GM103368 for generous support of these efforts.

Abbreviations

BpGCDH

B. pseudomallei glutaryl-CoA dehydrogenase

DHNA

7,8-dihydroneopterin aldolase

DMSO

dimethylsulfoxide

FBDD

fragment-based drug discovery

HCV

hepatitis C virus

HIV

human immunodeficiency virus

NMR

nuclear magnetic resonance spectroscopy

NNRTI

non-nucleoside reverse transcriptase inhibitor

PAN

influenza endonuclease

PR

protease

RT

reverse transcriptase

TMAO

trimethylamine-N-oxide

TS

thymidylate synthase

References

  1. Adams PD, Afonine PV, Bunkóczi G, Chen VB, Davis IW, Echols N, Headd JJ, Hung LW, Kapral GJ, Grosse-Kunstleve RW, McCoy AJ, Moriarty NW, Oeffner R, Read RJ, Richardson DC, Richardson JS, Terwilliger TC, Zwart PH. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallographica Section D Biological Crystallography. 2010;66:213–221. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Allen KN, Bellamacina CR, Ding X, Jeffery CJ, Mattos C, Petsko GA, Ringe D. An experimental approach to mapping the binding surfaces of crystalline proteins. Journal of Physical Chemistry. 1996;100:2605–2611. [Google Scholar]
  3. Atwell S, Brouillette CG, Conners K, Emtage S, Gheyi T, Guggino WB, et al. Structures of a minimal human CFTR first nucleotide-binding domain as a monomer, head-to-tail homodimer, and pathogenic mutant. Protein Engineering, Design & Selection. 2010;23:375–384. doi: 10.1093/protein/gzq004. [DOI] [PubMed] [Google Scholar]
  4. Bauman JD, Das K, Ho W. C, Baweja, M., Himmel DM, Clark AD, Oren DA, Boyer PL, Hughes SH, Shatkin AJ, Arnold E. Crystal engineering of HIV-1 reverse transcriptase for structure-based drug design. Nucleic Acids Research. 2008;36:5083–5092. doi: 10.1093/nar/gkn464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bauman JD, Patel D, Baker S, Vijayan RSK, Xiang A, Parhi A, Martinez-Sobrido L, LaVoie EJ, Das K, Arnold E. Crystallographic fragment screening and structure-based optimization yields a new class of influenza endonuclease inhibitors. ACS Chemical Biology. 2013;8:2501–2508. doi: 10.1021/cb400400j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bauman J, Patel D, Dharia C, Fromer MW, Ahmed S, Frenkel Y, Vijayan RSK, Eck TJ, Ho WC, Das K, Shatkin AJ, Arnold E. Detecting allosteric sites of HIV-1 reverse transcriptase by X-ray crystallographic fragment screening. Journal of Medicinal Chemistry. 2013;56:2738–2746. doi: 10.1021/jm301271j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Begley DW, Davies DR, Hartley RC, Hewitt SN, Rychel AL, Myler PJ, Van Voorhis WC, Staker BL, Stewart LJ. Probing conformation stats of glutaryl-CoA dehydrogenase by fragment screening. Acta Crystallographic Section F. 2011;67:1060–1069. doi: 10.1107/S1744309111014436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Behnen J, Köster H, Neudert G, Craan T, Heine A, Klebe G. Experimental and computational active site mapping as a starting point to fragment-based lead discovery. ChemMedChem. 2012;7:248–261. doi: 10.1002/cmdc.201100490. [DOI] [PubMed] [Google Scholar]
  9. Blaney J, Nienaber V, Burley SK. Fragment-based lead discovery and optimisation using X- ray crystallography, computational chemistry, and high-throughput organic synthesis. In: Jahnke W, Erlanson DA, editors. Fragment-based Approaches in Drug Discovery. Methods and Principles in Medicinal Chemistry. Wiley–VCH; Weinheim, Germany: 2006. pp. 215–248. [Google Scholar]
  10. Blundell TL, Jhoti H, Abell C. High-throughput crystallography for lead discovery in drug design. Nature Reviews Drug Discovery. 2002;1:45–54. doi: 10.1038/nrd706. [DOI] [PubMed] [Google Scholar]
  11. Dalvit C, Flocco M, Veronesi M, Stockman BJ. Fluorine-NMR competition binding experiments for high-throughput screening of large compound mixtures. Combinatorial Chemistry High Throughput Screening. 2002;5:605–611. doi: 10.2174/1386207023329923. [DOI] [PubMed] [Google Scholar]
  12. Davies TG, Tickle IJ. Fragment screening using X-ray crystallography. Topics in Current Chemistry. 2012;317:33–59. doi: 10.1007/128_2011_179. [DOI] [PubMed] [Google Scholar]
  13. Dias A, Bouvier D, Crepin T, McCarthy AA, Hart DJ, Baudin F, Cusack S, Ruigrok RW. The cap-snatching endonuclease of influenza virus polymerase resides in the PA subunit. Nature. 2009;458:914–918. doi: 10.1038/nature07745. [DOI] [PubMed] [Google Scholar]
  14. English AC, Done SH, Caves LS, Groom CR, Hubbard RE. Locating interaction sites on proteins: the crystal structure of thermolysin soaked in 2% to 100% isopropanol. Proteins: Structure, Function, and Genetics. 1999;37:628–640. [PubMed] [Google Scholar]
  15. Erlanson DA, Wells JA, Braisted AC. Tethering: fragment-based drug discovery. Annual Review of Biophysics Biomolecular Structure. 2004;33:199–223. doi: 10.1146/annurev.biophys.33.110502.140409. [DOI] [PubMed] [Google Scholar]
  16. Flocco MM, Mowbray SL. Planar stacking interactions of arginine and aromatic-side chains in proteins. Journal of Molecular Biology. 1994;235:709–717. doi: 10.1006/jmbi.1994.1022. [DOI] [PubMed] [Google Scholar]
  17. Hadjuk PJ, Greer J. A decade of fragment-based drug design: strategic advances and lessons learned. Nature Reviews Drug Discovery. 2007;6:211–219. doi: 10.1038/nrd2220. [DOI] [PubMed] [Google Scholar]
  18. Hartshorn MJ, Murray CW, Cleasby A, Frederickson M, Tickle IJ, Jhoti H. Fragment-based lead discovery using X-ray crystallography. Journal of Medicinal Chemistry. 2005;48:403–413. doi: 10.1021/jm0495778. [DOI] [PubMed] [Google Scholar]
  19. Henrich B, Bergamaschi A, Broennimann C, Dinapoli R, Eikenberry EF, Johnson I, Kobas M, Kraft P, Mozzanica A, Schmitt B. PILATUS: A single photon counting pixel detector for X-ray applications. Nuclear Instruments and Methods in Physics Research Section A. 2009;607:247–249. [Google Scholar]
  20. Hesterkamp T, Whittaker M. Fragment-based activity space: smaller is better. Current Opinion in Chemical Biology. 2008;12:260–268. doi: 10.1016/j.cbpa.2008.02.005. [DOI] [PubMed] [Google Scholar]
  21. Horton NC, Perona JJ. DNA cleavage by EcoRV endonuclease: two metal ions in three metal ion binding sites. Biochemistry. 2004;43:6841–6857. doi: 10.1021/bi0499056. [DOI] [PubMed] [Google Scholar]
  22. Kuroda DG, Bauman JD, Challa R, Patel D, Troxler T, Das K, Arnold E, Hochstrasser RM. Snapshot of the equilibrium dynamics of a drug bound to HIV-1 reverse transcriptase. Nature Chemistry. 2013;5:174–181. doi: 10.1038/nchem.1559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Lin TY, Timasheff SN. Why do some organisms use a urea-methylamine mixture as osmolyte? Thermodynamic compensation of urea and trimethylamine N-oxide interactions with protein. Biochemistry. 1994;33:12695–12701. doi: 10.1021/bi00208a021. [DOI] [PubMed] [Google Scholar]
  24. Mattos C, Bellamacina CR, Peisach E, Pereira A, Vitkup D, Petsko GA, Ringe D. Multiple solvent crystal structures: probing binding sites, plasticity, and hydration. Journal of Molecular Biology. 2006;57:1471–1482. doi: 10.1016/j.jmb.2006.01.039. [DOI] [PubMed] [Google Scholar]
  25. Mattos C, Ringe D. Locating and characterizing binding sites on proteins. Nature Biotechnology. 1996;14:595–599. doi: 10.1038/nbt0596-595. [DOI] [PubMed] [Google Scholar]
  26. Mueller-Dieckmann C, Kauffmann B, Weiss MS. Trimethylamine N-oxide as a versatile cryoprotective agent in macromolecular crystallography. Journal of Applied Crystallography. 2011;44:433–436. [Google Scholar]
  27. Nienaber VL, Richardson PL, Klighofer V, Bouska JJ, Giranda VL, Greer J. Discovering novel ligands for macromolecules using X-ray crystallographic screening. Nature Biotechnology. 2000;18:1105–1108. doi: 10.1038/80319. [DOI] [PubMed] [Google Scholar]
  28. Perryman AL, Zhang Q, Soutter HH, Rosenfeld R, McRee DE, Olson AJ, Elder JE, Stout CD. Fragment-based screen against HIV protease. Chemical Biology Drug Design. 2010;75:257–268. doi: 10.1111/j.1747-0285.2009.00943.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Saalau-Bethell S, Woodhead AJ, Chessari G, Carr MG, Coyle J, Graham B, Hiscock SD, Murray CW, Pathuri P, Rich SJ, Richardson CJ, Williams PA, Jhoti H. Discovery of an allosteric mechanism for the regulation of HCV NS3 protein function. Nature Chemical Biology. 2012;8:920–925. doi: 10.1038/nchembio.1081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Sanders WJ, Nienaber VL, Lerner CG, McCall JO, Merrick SM, Swanson SJ, Harlan JE, Stoll VS, Stamper GF, Betz SF, Condroski KR, Meadows RP, Severin JM, Walter KA, Magdalinos P, Jakob CG, Wagner R, Beutel BA. Discovery of potent inhibitors of dihydroneopterin aldolase using CrystaLEAD high-throughput X-ray crystallographic screening and structure-directed lead optimization. Journal of Medicinal Chemistry. 2004;47:1709–1718. doi: 10.1021/jm030497y. [DOI] [PubMed] [Google Scholar]
  31. Shumlin IA, Cymborowski M, Chertihin O, Jha KN, Herr JC, Lesley SA, Joachimiak A, Minor W. Identification of unknown protein function using metabolite cocktail screening. Structure. 2012;20:1715–1725. doi: 10.1016/j.str.2012.07.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Spurlino JC. Fragment screening purely with protein crystallography. Methods in Enzymology. 2011;493:321–356. doi: 10.1016/B978-0-12-381274-2.00013-3. [DOI] [PubMed] [Google Scholar]
  33. Stout TJ, Schellenber U, Santi DV, Stroud RM. Crystal structures of a unique thermal-stable thymidylate synthase from Bacillus subtilis. Biochemistry. 1998;37:14736–14747. doi: 10.1021/bi981270l. [DOI] [PubMed] [Google Scholar]
  34. Tiefenbrunn T, Forli S, Happer M, Gonzalez A, Tsai Y, Soltis M, Elder JH, Olson AJ, Stout CD. Crystallographic fragment-based drug discovery: use of a brominated fragment library targeting HIV protease. Chemical Biology and Drug Design. 2013;83:141–148. doi: 10.1111/cbdd.12227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Verlinde CLMJ, Fan E, Shibata S, Zhang Z, Sun Z, Deng W, Ross J, Kim J, Xiao L, Arakaki T, Bosch J, Caruthers JM, Larson ET, LeTrong I, Napuli A, Kelley A, Mueller N, Zucker F, Van Voorhis WC, Buckner FS, Merritt EA, Hol WGJ. Fragment-based cocktail crystallography by the medical structural genomics of pathogenic protozoa consortium. Current Topics in Medicinal Chemistry. 2009;9:1678–1687. doi: 10.2174/156802609790102383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Verlinde C, Kim H, Bernstein BE, Mande SC, Hol WGJ. Antitrypanosomiasis drug development based on structures of glycolytic enzymes. In: Veerapandian P, editor. Structure-based Drug Design. Marcel Dekker, Inc.; 1997. pp. 365–394. [Google Scholar]
  37. Wang A, Bolen DW. A naturally occurring protective system in urea-rich cells: Mechanism of osmolyte protection of proteins against urea denaturation. Biochemistry. 1997;36:9101–9108. doi: 10.1021/bi970247h. [DOI] [PubMed] [Google Scholar]
  38. Wilcken R, Liu X, Zimmermann MO, Rutherford TJ, Fersht AR, Joerger AC, Boeckler FM. Halogenated-enriched fragment libraries as leads for drug rescue of mutant p53. Journal of American Chemical Society. 2012;134:6810–6818. doi: 10.1021/ja301056a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Winn MD, Ballard CC, Cowtan KD, Dodson EJ, Emsley P, Evans PR, et al. Overview of the CCP4 suite and current developments. Acta Crystallographica Section D Biological Crystallography. 2011;67:235–242. doi: 10.1107/S0907444910045749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Yancey PH, Clark ME, Hand SC, Bowlus RD, Somero GN. Living with water stress: evolution of osmolyte systems. Science. 1982;217:1214–1222. doi: 10.1126/science.7112124. [DOI] [PubMed] [Google Scholar]
  41. Yancey PH, Somero GN. Counteraction of urea destabilization of protein structure by methylamine osmoregulatory compounds of elasmobranch fishes. Biochemistry Journal. 1979;183:317–323. doi: 10.1042/bj1830317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Yuan P, Bartlam M, Lou Z, Chen S, Zhou J, He X, Lv Z, Ge R, Li X, Deng T, Fodor E, Rao Z, Liu Y. Crystal structure of an avian influenza polymerase PA(N) reveals an endonuclease active site. Nature. 2009;458:909–913. doi: 10.1038/nature07720. [DOI] [PubMed] [Google Scholar]
  43. Zou Q, Bennion BJ, Daggett V, Murphy KP. The molecular mechanism of stabilization of proteins by TMAO and its ability to counteract the effects of urea. Journal of the American Chemical Society. 2002;124:1192–1202. doi: 10.1021/ja004206b. [DOI] [PubMed] [Google Scholar]

RESOURCES