Abstract
19F NMR of labeled proteins is a sensitive method for characterizing structure, conformational dynamics, higher-order assembly, and ligand binding. Fluorination of aromatic side chains has been suggested as a labeling strategy for small molecule ligand discovery for protein-protein interaction interfaces. Using a model transcription factor binding domain of the CREB binding protein (CBP)/p300, KIX, we report the first full small molecule screen using protein-observed 19F NMR. Screening of 508 compounds and validation by 1H-15N HSQC NMR led to the identification of a minimal pharmacaphore for the MLL-KIX interaction site. Hit rate analysis for the CREB-KIX and MLL-KIX sites provided a metric to assess the ligandability or “druggability” of each interface informing future medicinal chemistry efforts. The structural information from the simplified spectra and data collection speed, affords a new screening tool for analysis of protein interfaces and discovery of small molecules.
Keywords: protein-protein Interactions, fragment screening, fluorine, biomolecular NMR, transcription
Protein-protein interactions (PPIs) are essential nodes in almost all biological processes within the cell. Modulation of these biomolecular interactions via small molecules is now a validated approach for therapeutic intervention[1] and chemical probe development to improve our understanding of biological systems.[2] Despite this validation, disruption of these interactions is difficult due to the large and plastic interfaces at the protein surface and lack of deep hydrophobic binding sites for accommodating traditional drug-like molecules.[3] This is particularly the case for the transient and dynamic interactions found for many transcription factor-protein binding events that occur with modest affinity (Kd = 0.1-100 µM).
We recently reported on a protein-observed fluorine NMR method (PrOF NMR) for characterizing ligand binding at PPI sites.[4] We employed singly-labeled fluorinated aromatic amino acids due to the enrichment of aromatic amino acids at PPI interfaces.[5] In many cases, the modest protein structural perturbations from a single aryl hydrogen to fluorine substitution, the fast data acquisition for small to medium sized proteins, and the orthogonality of fluorine with biological nuclei, lead to a simplified and readily obtainable 1D-NMR spectrum reporting directly on protein structure.[6] We used this method to characterize small molecule binding to three different transcription factor domains: bromodomains Brd4, BrdT, and BPTF.[4b] We also studied the protein interaction domain of CBP/p300, KIX, by PrOF NMR and identified small molecule ligands 1–10 and 1G7.[4a] Beyond model studies, PrOF NMR has yet to be tested in a full discovery format. Herein, we report the first full small molecule screen using PrOF NMR, evaluate its effectiveness for assessing protein druggability at two different binding sites within the same protein, KIX, and report structure activity relationships (SAR) from several newly discovered small molecules.
Molecules of low molecular weight and complexity, called fragments, have helped increase successful screening outcomes against difficult protein targets.[7] NMR has become a preferred method for screening for identifying low affinity ligands, in many cases using 1D-NMR ligand-based methods.[8] Analysis of fragment screen hit rates provides valuable information into the ligandability or “druggability” of the proteins,[9] and further development of fragments has led to molecules with more efficacious binding affinity and improved physicochemical properties than molecules obtained from traditional high-throughput screening.[10] To test the feasibility of using PrOF NMR in a fragment screening format, we chose the KIX domain.
KIX is a useful model protein for characterizing protein- small molecule interactions and studying the conformational plasticity of this dynamic protein domain. KIX binds to over a dozen transcription factors (e.g., MLL, E2A-PBX1, and CREB) through two binding sites.[11] These sites as well as new cryptic sites cannot readily be distiguished in a direct binding 1D-NMR ligand-based experiment. Prior screening efforts offer valuable ligand information for comparison with our PrOF NMR experiments. Two natural product depsides, sekikaic acid and lobaric acid, are reported as the most potent KIX inhibitors at the MLL-KIX PPI site (IC50 = 34 and 17 µM respectively). These compounds isolated from natural products were discovered after a screen of 50,000 commercial library compounds had resulted in no hits.[12] The only HSQC NMR screen resulted in two lead compounds, KG-122 (pamoic acid) and KG-501 from a 762 member peptidomimetic library (Fig. 1).[13] Isoxazolidine derivatives, discovered first as transcriptional activation domain mimics,[14] were also later shown to be KIX ligands.[15] Despite the paucity of potent ligands, both CREB-KIX, and E2A-KIX interactions have been proposed as important PPI targets for treating blood cancers due to their regulatory role in blood cell proliferation further motivating our research.[11b, 16]
Figure 1.
Solution structure of KIX and known ligands. A) NMR solution structure of KIX (PDB: 2LXT) with peptides MLL and pKID (CREB) in green. Tyrosine residues are red. B) KIX ligands and their corresponding methods of discovery.
PPIs involving the KIX domain occur through two distinct and allosterically-coupled binding sites on this 3-helical bundle protein. These sites are often referred to as the MLL and CREB/c-Myb sites.[12] To conduct the NMR screen, we replaced native tyrosine residues of KIX with 3-fluorotyrosine (3FY). The MLL site contains one tyrosine (Y631), which makes a critical contact with the native protein. A sequence analysis by Näär et al. of KIX domains shows this residue is conserved in humans, C. elegans and drosophila.[17] Y649 is evolutionarily conserved in all KIX domains. Fluorine resonances from Y649, Y650, and Y658 experience significant chemical shifts when CREB binds to KIX as well as Y631 from allostery.[4a] Consistent with natural ligands, our computed druggability analysis using SiteMap identified two druggable sites near the tyrosine residues at both the MLL (10.8 Å from Y631) and CREB/Myb sites (4–10 Å from Y649, Y650, and Y658).[18] Y640, which structurally stabilizes KIX via a cation-π interaction with R600,[19] is found in 97% of KIX domains, and may represent a new small molecule site for regulating protein conformation.[17] Importantly, singly fluorinated aromatic residues in KIX only modestly perturb secondary structure and ligand binding.[4a]
We expressed 3FY-labeled KIX (12 kDa) in good yield (70 mg/L) with high labeling efficiency (> 98%) for the screen.[18] Our fragment library was generated from the Maybridge rule of 3 commercial set, combining compounds into 85 mixtures of five or six compounds at a total stock concentration of 33.3 mM per compound in DMSO. As a positive control, we tested a known ligand, KG-501 (833 µM), and identified a binding interaction both in isolation, consistent with prior results,[4a] and in a mixture based on chemical shift perturbation of Y631 (Fig. 2B). For the NMR screen we used 40 µM KIX (~20 mg total). Chemical shift information was acquired in five minutes yielding approximately 510 minutes of experiment time for screening the 85 mixtures, including additional short reference experiments for each mixture. This experiment time is faster than 1H-15N SOFAST HMQC NMR experiments for similar sized proteins.[20] All mixtures were screened at 833 µM small molecule and 2.5% DMSO. Statistical cut-offs for chemical shift perturbation were set to two standard deviations from the average perturbations from the screen yielding 15 mixtures (Fig. 2C). Each mixture was subsequently deconvoluted by individual analysis of each compound leading to four verified ligands (1–40.8% hit rate). The reduction in hits was due to apparent additive effects of fragments which prevented the identification of a sole compound responsible for chemical shift perturbations in a given mixture.
Figure 2.
Fragment screening by PrOF NMR A) Flowchart for screening of fragment library for KIX ligands. 508 fragments were screened using PrOF NMR, yielding 15 mixtures hits and seven final ligand after deconvolution and SAR studies. B) KG-501 was used to test for KIX ligand identification in isolation or a mixture of six total compounds. C) Statistical cut-offs for chemical shift perturbations found in the screen. Samples with pertubations greater than two standard deviations above were identified as hits.
Ligand titration experiments via PrOF NMR were performed to determine the dissociation constants for the small molecule obtained by monitoring changes in chemical shift (Δδ). Three of the four ligands (23and 4) discovered from the screen were found to have low mM binding affinities for KIX. These compounds contained either an aryl or phenylacetic acid group (Table 1). Small molecules 3 and 4 exhibited a binding isotherm consistent with one-to-one binding while 2 potentially exhibited higher-order binding above 2 mM based on the binding isotherm generated. Consequently the Kd for 2 was estimated based on fitting the data up to 2 mM. We ruled out activity due to small molecule aggregation above 2 mM based on well resolved small molecule resonances in the presence and the absence of detergent.[18] Given that KG-122 is a diacid compound that binds to KIX in the MLL site, it is possible that multiple copies of the monoacid 2 could bind in the same region on the protein. Y631, which is featured in the MLL site, was the most sensitive reporter for binding and was therefore used to calculate the Kd for 23and 4. In contrast, Y649, Y650, and Y658, which are featured in the CREB binding site, were significantly less responsive to ligand binding with the exception of 1 which perturbed resonances for Y649, Y650, and Y631, potentially indicating binding at the CREB site.
Table 1.
Small molecule ligands discovered by PrOF NMR
| Compound |
1H-15N HSQC |
Kd (PrOF) (mM) |
Ligand Efficiency (kcal mol−1 NHA−1) |
|
|---|---|---|---|---|
| 1 | ![]() |
Yes | >10 | >0.13 |
| 2 | Yes | 1.4 ± 0.3* | 0.32 | |
| 3 | ![]() |
Yes | 5.3 ± 0.9 | 0.24 |
| 4 | ![]() |
Yes | 1.7 ± 0.6 | 0.29 |
| 5 | ![]() |
ND | 1.6 ± 0.3 | 0.27 |
| PAA5 | ND | 6.9 ± 0.7 | 0.18 | |
| Flurbiprofen | ![]() |
ND | 5.9 ± 0.6 | 0.17 |
Kd estimate based on data fit up to 2 mM, not accounting for higher order binding at greater concentrations, ND: Not Determined
As a follow-up to the screen, the SAR from the four hits were examined to identify structural motifs important for binding. Four trifluoromethylquinoline derivatives of 1 were purchased. These compounds were of interest due to small but significant chemical shift perturbations from 1which were similar to the chemical shift effects observed upon CREB binding to KIX suggesting a possible allosteric binding mode.[4a] One of the four compounds (KM07508) yielded comparable chemical shift perturbations as 1.[18] However, as 1 displayed poor binding affinity in titration experiments (Kd > 10 mM) due to lack of saturated binding and in 2D NMR experimental results explained below, SAR studies with this ligand class were not pursued further in lieu of the investigation into aryl and phenylacetic acids.
The other three discovered ligands contained an aryl or phenylacetic acid motif as a potential pharmacaphore. The presence of a carboxylic acid has been found in other KIX ligands that bind to the MLL site (sekikaic acid, lobaric acid, KG-122, and 1G7, Fig 1). To evaluate specificty over other acidic molecules and to probe for false negatives, several compounds from library mixtures were retested. One of the four compounds (9B11, 5), a biaryl carboxylic acid, was anticipated to have been a false negative due to its similar structure to KIX ligand 1G7. The other three carboxylic acids were selected to test if these aryl and phenylacetic acids represented a class of ligands that bound KIX while other carboxylic acids would not (i.e., a phenethyl, a thiazole, and an aliphatic carboxylic acid). Only 5 yielded significant changes in chemical shift, while the other three exhibited minimal to no apparent binding. Titration of 5 yielded a Kd of 1.6 ± 0.3 mM (Table 1). Low mM Kd values and ligand efficiencies from 0.19 to 0.32 kcal mol−1 non-hydrogen atom−1 were calculated for each fragment. As a comparison, the reported median ligand efficiency was 0.32 for a sample set of 480 ligand-target pairs.[21] Our ligand efficiencies are equal to or slightly lower than this value, supporting the difficulty of this target site.[12]
To expand on the investigation of aryl + acid SAR, nine arylacetic and benzoic acid derivatives were tested as analogs of molecule 2 to inform optimization studies, based on the affinity of 2 and the readily modifiable structure of phenylacetic acids. Of the nine compounds, only the biphenyl (PAA5) yielded a pronounced chemical shift perturbation close to one standard deviation above the average at 5 mM,[18] despite the structural similarity of these analogues to 2, e.g., 3-chloro-4-hydroxy phenylacetic acid (PAA2), phenylacetic acid (PAA3), and 4-hydroxyphenylacetic acid (PAA4) (Fig. 3). Each analog was also analyzed at 2 mM and 833 µM, due to the potential for higher order binding at high concentrations (> 2 mM). The trend remained consistent with all compounds aside from PAA5 yielding insignificant changes in chemical shift. The Kd for PAA5 binding to KIX was 6.9 ± 0.7 mM. These results suggest that hydrophobic groups at the para position on the phenylacetic acid play a key role in ligand binding.
Figure 3.
Representative SAR analysis of phenylacetic acids A) Chemical shift perturbation plot at three different concentrations of small molecule. B) Phenylacetic acid structures.
Phenylacetic acids are a common scaffold found in several nonsteroidal anti-inflammatory drugs. As an additional analysis of SAR, naproxen, ibuprofen, and flurbiprofen were used to further support the model that para-substituted phenylacetic acid derivatives would bind to KIX. All three compounds were analyzed by PrOF NMR. The chemical shift perturbations from enantiopure naproxen were under the statistical cutoff at low concentrations. Racemic ibuprofen yielded significant chemical shift perturbations (Δδ), while racemic flurbiprofen yielded the largest Δδ for Y631 observed thus far (0.38 ppm at 5 mM, Kd = 5.9 ± 0.6 mM, Fig. 3). Subsitution at the α-position affords an additional site of diversity and both stereochemistry and elaboration of side chains can be investigated as starting points for structure activity studies.
As a complementray method to evaluate the ligand binding site on KIX and rule out concerns of generating false positives from fluorine incorporation, we studied the binding of the original four ligands (1–4) by 1H-15N HSQC NMR using uniformly 15N-labeled KIX. Experiments were performed at 0, 1, and 5 mM ligand concentration for 1, 2, and 3 while 4 was characterized at 0, 0.5, and 1 mM due to low solubility above 2 mM. Residues whose chemical shifts were perturbed by one and two standard deviations above the average perturbation in a residue were mapped onto the protein structure to infer the corresponding binding site of each ligand. Consistent with PrOF NMR, perturbed residues in each case showed concentration dependent shifts and localized near the MLL binding site.[18] Additional residues were perturbed near an unstructured loop connecting helices α2 and α3 of KIX near the N-terminal portion of the protein. These perturbations may result from a conformational change induced by ligand in the MLL site rather than direct ligand binding near this loop region but will need further investigation. We conclude from this study that PrOF NMR was able to identify a related class of ligands all targetting the MLL site.
Despite few hits, the hit rates from the PrOF NMR screen could be used to assess the “druggability“ of the MLL and CREB sites on KIX. Consistent with the literature, the CREB interface is a challenging target while the MLL site is more amenable to ligand binding. For example, disulfide tethering screens reported fewer and less efficacious ligands for the CREB versus the MLL site.[22] These results support designing larger peptidomimetic inhibitors against the CREB site rather than using small molecules.[23] Both naturally occurring acidic transcription factors as well as synthetic ligands (Fig. 1) have a higher propensity for interacting at or near the MLL region, providing a starting point for MLL-site ligand development. The KIX-interacting peptide of MLL contains the sequence DIMDFVL where both aspartic acid residues and phenylalanine are important for binding.[11c] The aryl and phenylacetic acid derivatives reported here, may be minimal mimics of these motifs.
PrOF NMR has several advantages as a screening method due to its sensitivity to detect changes in chemical environment including dynamic structural information of the protein, low background signal, and ease of use. Here we demonstrated the first full small molecule screen using PrOF NMR, evaluated its efficacy for druggability determination, and reported on a class of small molecules (biaryl and phenylacetic acids) as KIX ligands. Additional SAR analysis guided our investigation to discover three additional KIX-binding small molecules. The biarylacid and phenylacetic acid structural motif is corroborated by several of the reported KIX ligands (Fig. 1) providing a starting point for small molecule derivitization and development. This study also demonstrates preferential targetability for the MLL site over the CREB site. While additional optimization is required to develop the discovered ligands into potent inhibitors of KIX interactions, this study has demonstrated the applicability of PrOF NMR as a useful tool for library screening, ligand discovery, and druggability assessment.
Supplementary Material
Acknowledgements
This project was supported in part by the Univeristy of Minnesota, the NSF-CAREER Award CHE-1352091 and the NIH training grant T32-GM08700 (C.T.G).
Footnotes
Supporting information for this article is given via a link at the end of the document.
References
- 1.a) Souers AJ, Leverson JD, Boghaert ER, Ackler SL, Catron ND, Chen J, Dayton BD, Ding H, Enschede SH, Fairbrother WJ, et al. Nat. Med. 2013;19:202–208. doi: 10.1038/nm.3048. [DOI] [PubMed] [Google Scholar]; b) Mirguet O, Gosmini R, Toum J, Clément CA, Barnathan M, Brusq J-M, Mordaunt JE, Grimes RM, Crowe M, Pineau O, et al. J. Med. Chem. 2013;56:7501–7515. doi: 10.1021/jm401088k. [DOI] [PubMed] [Google Scholar]
- 2.Arkin MR, Tang Y, Wells JA. Chem. Biol. 2014;21:1102–1114. doi: 10.1016/j.chembiol.2014.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Thompson AD, Dugan A, Gestwicki JE, Mapp AK. ACS Chem. Biol. 2012;7:1311–1320. doi: 10.1021/cb300255p. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.a) Pomerantz WC, Wang N, Lipinski AK, Wang R, Cierpicki T, Mapp AK. ACS Chem. Biol. 2012;7:1345–1350. doi: 10.1021/cb3002733. [DOI] [PMC free article] [PubMed] [Google Scholar]; b) Mishra NK, Urick AK, Ember SW, Schonbrunn E, Pomerantz WC. ACS Chem. Biol. 2014;9:2755–2760. doi: 10.1021/cb5007344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.a) Bogan AA, Thorn KS. J. Mol. Biol. 1998;280:1–9. doi: 10.1006/jmbi.1998.1843. [DOI] [PubMed] [Google Scholar]; b) Watkins AM, Arora PS. ACS Chem. Biol. 2014;9:1747–1754. doi: 10.1021/cb500241y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.a) Kitevski-LeBlanc JL, Prosser RS. Prog. Nucl. Magn. Reson. Spectrosc. 2012;62:1–33. doi: 10.1016/j.pnmrs.2011.06.003. [DOI] [PubMed] [Google Scholar]; b) Marsh EN, Suzuki Y. ACS Chem. Biol. 2014;9:1242–1250. doi: 10.1021/cb500111u. [DOI] [PubMed] [Google Scholar]
- 7.Hajduk PJ, Greer J. Nat. Rev. Drug. Discov. 2007;6:211–219. doi: 10.1038/nrd2220. [DOI] [PubMed] [Google Scholar]
- 8.a) Lepre CA. Methods Enzymol. 2011;493:219–239. doi: 10.1016/B978-0-12-381274-2.00009-1. [DOI] [PubMed] [Google Scholar]; b) Wu B, Zhang Z, Noberini R, Barile E, Giulianotti M, Pinilla C, Houghten RA, Pasquale EB, Pellecchia M. Chem Biol. 2013;20:19–33. doi: 10.1016/j.chembiol.2012.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]; c) Dalvit C, Fagerness PE, Hadden DT, Sarver RW, Stockman BJ. J. Am. Chem. Soc. 2003;125:7696–7703. doi: 10.1021/ja034646d. [DOI] [PubMed] [Google Scholar]
- 9.Hajduk PJ, Huth JR, Fesik SW. J. Med. Chem. 2005;48:2518–2525. doi: 10.1021/jm049131r. [DOI] [PubMed] [Google Scholar]
- 10.Siegal G, Ab E, Schultz J. Drug Discov. Today. 2007;12:1032–1039. doi: 10.1016/j.drudis.2007.08.005. [DOI] [PubMed] [Google Scholar]
- 11.a) Radhakrishnan I, Pérez-Alvarado GC, Parker D, Dyson HJ, Montminy MR, Wright PE. Cell. 1997;91:741–752. doi: 10.1016/s0092-8674(00)80463-8. [DOI] [PubMed] [Google Scholar]; b) Denis CM, Chitayat S, Plevin MJ, Wang F, Thompson P, Liu S, Spencer HL, Ikura M, LeBrun DP, Smith SP. Blood. 2012;120:3968–3977. doi: 10.1182/blood-2012-02-411397. [DOI] [PubMed] [Google Scholar]; c) Ernst P, Wang J, Huang M, Goodman RH, Korsmeyer SJ. Mol. Cell Biol. 2001;21:2249–2258. doi: 10.1128/MCB.21.7.2249-2258.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Majmudar CY, Højfeldt JW, Arevang CJ, Pomerantz WC, Gagnon JK, Schultz PJ, Cesa LC, Doss CH, Rowe SP, Vásquez V, Tamayo-Castillo G, Cierpicki T, Brooks CL, Sherman DH, Mapp AK. Angew. Chem. Int. Ed. 2012;51:11258–11262. doi: 10.1002/anie.201206815. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Best JL, Amezcua CA, Mayr B, Flechner L, Murawsky CM, Emerson B, Zor T, Gardner KH, Montminy M. Proc. Natl. Acad. Sci. U.S.A. 2004;101:17622–17627. doi: 10.1073/pnas.0406374101. b) Further studies indicate that the active form of Naphthol AS-E phosphate is its dephosphrylated form. Li BX, Xiao X. Chembiochem. 2009;10:2721–2724. doi: 10.1002/cbic.200900552.
- 14.Rowe SP, Casey RJ, Brennan BB, Buhrlage SJ, Mapp AK. J. Am. Chem. Soc. 2007;129:10654–10655. doi: 10.1021/ja0736865. [DOI] [PubMed] [Google Scholar]
- 15.Buhrlage SJ, Bates CA, Rowe SP, Minter AR, Brennan BB, Majmudar CY, Wemmer DE, Al-Hashimi H, Mapp AK. ACS Chem. Biol. 2009;4:335–344. doi: 10.1021/cb900028j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Shankar DB, Cheng JC, Sakamoto KM. Cancer. 2005;104:1819–1824. doi: 10.1002/cncr.21401. [DOI] [PubMed] [Google Scholar]
- 17.Yang F, Vought BW, Satterlee JS, Walker AK, Jim Sun ZY, Watts JL, DeBeaumont R, Saito RM, Hyberts SG, Yang S, Macol C, Iyer L, Tjian R, van den Heuvel S, Hart AC, Wagner G, Naar AM. Nature. 2006;442:700–704. doi: 10.1038/nature04942. [DOI] [PubMed] [Google Scholar]
- 18.Please See Supporting Information
- 19.Wei Y, Horng J-C, Vendel AC, Raleigh DP, Lumb KJ. Biochemistry. 2003;42:7044–7049. doi: 10.1021/bi0343976. [DOI] [PubMed] [Google Scholar]
- 20.Harner MJ, Chauder BA, Phan J, Fesik SW. J. Med. Chem. 2014;57:9687–9692. doi: 10.1021/jm501035j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Hopkins AL, Keseru GM, Leeson PD, Rees DC, Reynolds CH. Nat. Rev. Drug Discov. 2014;13:105–121. doi: 10.1038/nrd4163. [DOI] [PubMed] [Google Scholar]
- 22.Lodge JM, Justin Rettenmaier T, Wells JA, Pomerantz WC, Mapp AK. Med. Chem. Comm. 2014;5:370–375. doi: 10.1039/C3MD00356F. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bullock BN, Jochim AL, Arora PS. J. Am. Chem. Soc. 2011;133:14220–14223. doi: 10.1021/ja206074j. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.








