Abstract
The productive exploration of chemical space is an enduring challenge in chemical biology and medicinal chemistry. Natural products are biologically relevant, and their frameworks have facilitated chemical tool and drug discovery. A “top‐down” synthetic approach is described that enabled a range of complex bridged intermediates to be converted with high step efficiency into 26 diverse sp3‐rich scaffolds. The scaffolds have local natural product‐like features, but are only distantly related to specific natural product frameworks. To assess biological relevance, a set of 52 fragments was prepared, and screened by high‐throughput crystallography against three targets from two protein families (ATAD2, BRD1 and JMJD2D). In each case, 3D fragment hits were identified that would serve as distinctive starting points for ligand discovery. This demonstrates that frameworks that are distantly related to natural products can facilitate discovery of new biologically relevant regions within chemical space.
Keywords: chemical biology, fragments, molecular diversity, natural products, proteins
Introduction
Small molecules continue to dominate Man's ability to treat disease, and can transform our understanding of fundamental biology. Yet historically, the exploration of chemical space has been highly uneven,1, 2 in part because a narrow toolkit of reliable reactions has underpinned molecular discovery.3 Natural products can facilitate the identification of biologically relevant chemical space4 since they have arisen through the function‐driven evolution of biosynthetic pathways.5 Indeed, around a third of recent small molecule drugs have been inspired by natural products.6 In biology‐oriented synthesis,7 natural product frameworks4 inform the design of productive small molecule screening collections and fragment sets that can be exploited in the discovery of ligands for unrelated protein targets.8, 9 In addition, synthetic approaches have been developed to convert specific natural products into alternative complex frameworks.10 Natural product‐inspired compounds can provide highly distinctive starting points for discovery that contrast starkly with most synthetic screening compounds:11 in particular, the typically high fraction of sp3‐hybridised carbons (Fsp3) is attractive since it correlates with the successful translation of clinical candidates.12
We envisaged a “top‐down” synthetic approach in which alternative complex, yet readily accessible, intermediates would be converted into many natural product‐like scaffolds (Scheme 1). Initially, bridged scaffolds of general structure 2 would be prepared using intramolecular [5+2] cycloaddition reactions13, 14 (e.g. 1→2): ring cleavage (red; for example, 2→3), ring expansion (magenta; for example, 2→4), annulation (blue; for example, 2→5), or functional group modification (green; for example, 2→6) would then yield diverse scaffolds. The approach contrasts with diversity‐oriented strategies15, 16 in which building blocks are prepared (“built”) and linked (“coupled”) to yield intermediates which are then cyclised (“paired”) to yield alternative scaffolds. Although diversity‐oriented approaches to sp3‐rich fragments have been developed,17 their biological relevance has only rarely17b been demonstrated. To undertake a preliminary assessment of biological relevance, we have therefore prepared and screened a fragment set that is based on many of the scaffolds that are accessible using our synthetic approach.
Results and Discussion
Synthesis of natural product‐like scaffolds
The bridged scaffolds 2 a–f were prepared using intramolecular [5+2] cycloaddition chemistry (Scheme 2). The oxygen‐bridged cycloadducts were prepared by silyl transfer‐induced [5+2] cycloaddition of 3‐tert‐butyldimethylsilyloxy 4H‐pyran‐4‐ones (→2 a–c).14 In a similar vein, the nitrogen‐bridged cycloadducts were prepared by intramolecular cycloaddition of 3‐oxidopyridinium ylides (→2 d–f).18
The complex bridged intermediates 2 were transformed into diverse molecular scaffolds (Scheme 3 and Supporting Information). Cleavage of specific bonds enabled scaffold simplification. Specifically, we exploited 1,2‐diol cleavage (e.g.→the fused bicyclic scaffold 7); alkene ozonolysis (e.g.→the spiro‐fused scaffold 3; and reductive allylic ether cleavage (→the fused bicyclic scaffold 12). Interception of dialdehydes formed by 1,2‐diol cleavage enabled overall ring expansion. Thus, cleavage of the regioisomeric 1,2‐diols formed from 2 a and 2 c, followed by cyclative double reductive amination, yielded the tricyclic scaffolds 4 and 15, respectively.
Annulation enabled more complex scaffolds to be prepared. For example, fusion of alternative heterocycles was possible either by condensation–aromatisation of the masked diketone of 2 a, 2 b or 2 c (e.g. → the pyrazine 9, the quinoxalines 10 and 14 or the imidazole 11); or regioselective [3+2] annulation to the enone 2 d (→the pyrrole 17). Alternatively, treatment of the regioisomeric intermediates 2 a and 2 c with MeLi effected 1,2‐addition; reductive amination and oxazolidinone formation then gave the regioisomeric tetracycles 8 and 16. Furthermore, intramolecular reductive Heck reaction19 of 2 e gave the complex tetracycle 19. Finally, substituted analogues were prepared in which the framework of the initial cycloadduct had been retained (e.g. 13). Overall, the 26 scaffolds20 were prepared from commercially available starting materials in a total of just 64 steps (processes conducted in a single reaction vessel).
An hierarchical tree was constructed to capture the relationship between the scaffolds (Figure 1). Here, scaffolds were systematically simplified using an established protocol by removal of rings until a parent monocyclic ring system was ultimately obtained.21 Twenty two different graph‐node‐bond frameworks (capturing atom and bond type) were represented, which were then simplified to give nine parent monocycles. The scaffolds are based on a wide range of parent ring systems and there is thus significant diversity at each level of hierarchy of the scaffold tree. The exploitation of several different complex intermediates 2 was critical to the diversity that was possible, for example by enabling variation of regiochemistry (e.g. 4/15, 8/16 and 10/14) and heteroatom position and identity (e.g. 11/17). Such an approach would unlikely be possible by modification of natural products, since several related starting materials would be required in multi‐gram quantities.
To compare with other screening sets, we determined natural product likeness scores11 for the deprotected scaffolds, a natural product screening library (4,460 compounds) and a commercial screening collection (278,365 largely synthetic compounds) (Figure 2, Panel A). The distribution of the scores for the scaffolds was broadly similar to that of the natural product screening library but highly distinctive from that of the large screening collection. The local structural features of our scaffolds are thus reminiscent of those found in natural products. Despite the high natural product likeness, however, only one of the 22 graph‐node‐bond frameworks is actually a substructure in the roughly 281,000 compounds in the Dictionary of Natural Products22 (Figure 1 and Supporting Information). Indeed, significant simplification to mono‐ or bicyclic frameworks is required before sub‐structures of natural products are found. Our scaffolds thus have high natural product likeness, but their frameworks lie in branches that augment the scaffold trees of natural product frameworks.
Synthesis of a fragment set and high‐throughput crystallographic screens
To enable preliminary assessment of biological relevance, we prepared 52 racemic fragments based on 23 of the scaffolds. Here, a fragment‐based approach was exploited to enable efficient exploration of chemical space accessible using our synthetic approach. The set was designed to have high shape diversity, and to comprise fragments with controlled23 size (13 to 19 heavy atoms) and lipophilicity (−1.5<clogP<3) (Supporting Information). The fragment set was significantly more three‐dimensional (Supporting Information),24 and more natural product‐like (Figure 2, Panel B), than commercially available fragments with the same heavy atom range.
The fragment set was screened against three protein targets from two different mechanistic classes involved in epigenetic biology: the ATAD2 and BRD1 (also known as BRPF2) bromodomains, and the histone demethylase JMJD2D (also known as KDM4D). Here, the objective was to perform a preliminary assessment of biological relevance rather than to provide specific starting points for discovery. High expression levels of ATAD225 and JMJD2 family members26 correlate with poor outcomes in several cancers, whilst BRD1 is a member of the BRPF family of scaffolding proteins whose role in acute myeloid leukemia is now emerging.27 The two bromodomains are contrasting targets: ATAD2 has a shallow N‐acetyl lysine binding site and has been suggested to have particularly low druggability.28
The three target proteins were all amenable to high‐throughput protein crystallography. Protein crystals were soaked with the 52 individual racemic fragments,29 picked and then subjected to automated X‐ray diffraction. Fragment hits were identified through detection of additional electron density,30 and inspection for polar interactions with the protein. Fragment hits were successfully identified for each of the target proteins: two hits for JMJD2D, eight hits for the BRD1 bromodomain and seven hits for the ATAD2 bromodomain (Figure 3 and Supporting Information).
The fragment screen against JMJD2D revealed two hits, both of which targeted a peripheral binding site (Panel B1, Figure 3 and Supporting Information). The hits complement those found in previous fragment screens against JMJD2D:26, 31 specifically, X‐ray crystallography had revealed 23 fragments all target the enzyme active site. The functional importance of this peripheral binding site could now be investigated, for example to reveal opportunities for allosteric modulation of the enzyme.
For the BRD1 bromodomain, the eight fragment hits targeted the N‐acetyl lysine binding site (Panel B2, Figure 3 and Supporting Information; see Ref. [27] and references therein for known ligands). The fragment hits contained several different N‐acetyl mimetics that interacted with both N110 and, either directly or via a bridging water, Y67.
For the ATAD2 bromodomain, the fragment screen yielded seven hits (based on six distinct frameworks) that targeted the N‐acetyl lysine binding site (Panel B3, Figure 3 and Supporting Information). Six of the hits mirrored the binding mode of the N‐acetyl lysine side chain,28a interacting directly with N1064 and, via a bridging water, with Y1021. Four hits were in common with BRD1 although, in three cases, the interaction networks were more extensive: for example ent‐23 made water‐mediated interactions with V1008 and D1014 (Panel B3, top, Figure 3) and ent‐25 interacted directly with E1017 (Figure 4).
To compare directly with a more conventional fragment set, we also screened 700 commercially available fragments against ATAD2 by high‐throughput crystallography and identified nine hits that targeted N‐acetyl lysine binding site (Figure 4 and Supporting Information). As a group, the interactions of these fragments parallel those of other sp2‐rich fragments: they interact directly with N1064 and/or, via a bridging water, with Y1021 but make few additional polar contacts.25, 28, 32 For screens against many targets, flatter fragments have been observed to have higher hit rates.33 With ATAD2, however, a significantly higher hit rate was observed with our shape‐diverse natural product‐like fragments (7/52) than with more conventional34 flatter fragment sets (9/700). This outcome is consistent with the low hit rate observed in a previous fragment screen by NMR (65/13800, subsequently triaged to yield 12 hits with K d<1 mm).28b
Conclusion
We have developed a “top‐down” synthetic approach in which alternative complex, yet readily accessible, intermediates were converted into many diverse scaffolds. These scaffolds have local natural product‐like features, but are only distantly related to specific natural product frameworks. A set of 52 fragments based on 23 of the scaffolds was screened against three epigenetic targets from two distinct protein families. In each case, hits were obtained that may provide distinctive opportunities for subsequent fragment growth. We have therefore demonstrated that frameworks that are distantly related to natural products can facilitate identification of novel regions of biologically relevant chemical space. Synthetic approaches to such frameworks may thus help identify fertile chemical space for bioactive small molecule discovery that is inaccessible to existing compound collections and biosynthetic pathways.
Conflict of interest
The authors declare no conflict of interest.
Supporting information
Acknowledgements
We thank EPSRC for a Doctoral Prize Fellowship (to DJF) and grants (EP/J00894X; EP/N025652/1); GSK for funding; Aleksandra Szykowska and Michael Fairhead for protein production and purification; and support from the Innovative Medicines Initiative Joint Undertaking under grant number 115489, resources of which are composed of financial contribution from the European Union's Seventh Framework Programme (F97/2008–2013), EFPIA companies’ in kind contribution. The SGC is a registered charity (number 1097737) that receives funds from AbbVie, Bayer Pharma AG, Boehringer Ingelheim, Canada Foundation for Innovation, Eshelman Institute for Innovation, Genome Canada, Innovative Medicines Initiative (EU/EFPIA) [ULTRA‐DD grant no. 115766], Janssen, Merck & Co., Novartis Pharma AG, Ontario Ministry of Economic Development and Innovation, Pfizer, São Paulo Research Foundation‐FAPESP, Takeda, and Wellcome Trust [092809/Z/10/Z].
D. J. Foley, P. G. E. Craven, P. M. Collins, R. G. Doveston, A. Aimon, R. Talon, I. Churcher, F. von Delft, S. P. Marsden, A. Nelson, Chem. Eur. J. 2017, 23, 15227.
Contributor Information
Prof. Stephen P. Marsden, Email: s.p.marsden@leeds.ac.uk.
Prof. Adam Nelson, Email: a.s.nelson@leeds.ac.uk.
References
- 1. Lipkus A. H., Yuan Q., Lucas K. A., Funk S. A., W. F. Bartelt III , Schenck R. J., Trippe A. J., J. Org. Chem. 2008, 73, 4443–4451. [DOI] [PubMed] [Google Scholar]
- 2. Walters W. P., Green J., Weiss J. R., Murcko M. A., J. Med. Chem. 2011, 54, 6405–6416. [DOI] [PubMed] [Google Scholar]
- 3.
- 3a. Roughley S. D., Jordon A. M., J. Med. Chem. 2011, 54, 3451–3479; [DOI] [PubMed] [Google Scholar]
- 3b. Cooper T. W. J., Campbell I. B., Macdonald S. J. F., Angew. Chem. Int. Ed. 2010, 49, 8082–8091; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2010, 122, 8258–8267; [Google Scholar]
- 3c. Brown D. G., Boström J., J. Med. Chem. 2016, 59, 4443–4458. [DOI] [PubMed] [Google Scholar]
- 4.
- 4a. Rodrigues T., Reker D., Schneider P., Schneider G., Nat. Chem. 2016, 8, 531–541; [DOI] [PubMed] [Google Scholar]
- 4b. Koch M. A., Schuffenhauer A., Scheck M., Wetzel S., Casaulta M., Odermatt A., Ertl P., Waldmann H., Proc. Natl. Acad. Sci. USA 2005, 102, 17272–17277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.
- 5a. Maplestone R. A., Stone M. J., Williams D. H., Gene 1992, 115, 151–157; [DOI] [PubMed] [Google Scholar]
- 5b. Firn R. D., Jones C. G., Nat. Prod. Rep. 2003, 20, 382–391. [DOI] [PubMed] [Google Scholar]
- 6. Newman D. J., Cragg G. M., J. Nat. Prod. 2007, 70, 461–477. [DOI] [PubMed] [Google Scholar]
- 7.
- 7a. Wetzel S., Bon R. S., Kumar K., Waldmann H., Angew. Chem. Int. Ed. 2011, 50, 10800–10826; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2011, 123, 10990–11018; [Google Scholar]
- 7b. van Hattum H., Waldmann H., J. Am. Chem. Soc. 2014, 136, 11853–11859. [DOI] [PubMed] [Google Scholar]
- 8. Over B., Wetzel S., Grütter C., Nakai Y., Renner S., Rauh D., Waldmann H., Nat. Chem. 2013, 5, 21–28. [DOI] [PubMed] [Google Scholar]
- 9. Rodrigues T., Reker D., Kunze J., Schneider P., Schneider G., Angew. Chem. Int. Ed. 2015, 54, 10516–10520; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2015, 127, 10662–10666. [Google Scholar]
- 10.
- 10a. R. W. Huigens III , Morrison K. C., Hicklin R. W., T. A. Flood Jr. , Richter M. F., Hergenrother P. J., Nat. Chem. 2013, 5, 195–202; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10b. Rafferty R. J., Hicklin R. W., Maloof K. A., Hergenrother P. J., Angew. Chem. Int. Ed. 2014, 53, 220–224; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2014, 126, 224–228; [Google Scholar]
- 10c. Masarwa A., Weber M., Sarpong R., J. Am. Chem. Soc. 2015, 137, 6327–6334; [DOI] [PubMed] [Google Scholar]
- 10d. Weber M., Owens K., Masarwa A., Sarpong R., Org. Lett. 2015, 17, 5432–5435. [DOI] [PubMed] [Google Scholar]
- 11. Ertl P., Roggo S., Schuffenhauer A., J. Chem. Inf. Model. 2008, 48, 68–74. Implemented in RDKit v2015.09.2 (Greg Landrum; Open Source Cheminformatics; http://www.rdkit.org; last accessed 27-Apr-2016).18034468 [Google Scholar]
- 12. Lovering F., Bikker J., Humblet C., J. Med. Chem. 2009, 52, 6752–6756. [DOI] [PubMed] [Google Scholar]
- 13. Mascareñas J. L., in Advances in Cycloaddition, Volume 6, pp. 1–54 (Ed. Harmata, M.) JAI Press Inc., Stamford CT, 1999. [Google Scholar]
- 14. Mascareñas J. L., Rumbo A., Castedo L. A., J. Org. Chem. 1997, 62, 8620–8621. [Google Scholar]
- 15. Nielsen T., Schreiber S. L., Angew. Chem. Int. Ed. 2008, 47, 48–56; [DOI] [PMC free article] [PubMed] [Google Scholar]; Angew. Chem. 2008, 120, 52–61. [Google Scholar]
- 16.For key reviews, see:
- 16a. Diversity-Oriented Synthesis: Basics and Applications in Organic Synthesis Discovery and Chemical Biology (Ed. Trabocchi, A.) Wiley, Hoboken NJ, 2013; [Google Scholar]
- 16b. Galloway W. R. J. D., Isidro-Llobet A., Spring D. R., Nat. Commun. 2010, 1, 80; [DOI] [PubMed] [Google Scholar]
- 16c. Burke M. D., Schreiber S. L., Angew. Chem. Int. Ed. 2004, 43, 46–58; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2004, 116, 48–60. [Google Scholar]
- 17.
- 17a. Hung A. W., Ramek A., Wang Y., Kaya T., Wilson J. A., Clemons P. A., Young D. W., Proc. Natl. Acad. Sci. USA 2011, 108, 6799–6804; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17b. Wang Y., ACS Med. Chem. Lett. 2016, 7, 852–856; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17c. Twigg D. G., Kondo N., Mitchell S. L., Galloway W. R. J. D., Sore H. F., Madin A., Spring D. R., Angew. Chem. Int. Ed. 2016, 55, 12479–12483; [DOI] [PMC free article] [PubMed] [Google Scholar]; Angew. Chem. 2016, 128, 12667–12671. [Google Scholar]
- 18. Rumbo A., Mouriño A., Castedo L., Mascareñas J. L., J. Org. Chem. 1996, 61, 6114–6120. [DOI] [PubMed] [Google Scholar]
- 19. Amorese A., Arcadi A., Bernocchi E., Cacchi S., Cerrini S., Fedeli W., Ortar G., Tetrahedron 1989, 45, 813–828. [Google Scholar]
- 20.Pairs of diastereomers were prepared for two scaffolds.
- 21.
- 21a. Wetzel S., Klein K., Renner S., Rauh D., Oprea T. I., Mutzel P., Waldmann H., Nat. Chem. Biol. 2009, 5, 581–583; [DOI] [PubMed] [Google Scholar]
- 21b. Renner S., Nat. Chem. Biol. 2009, 5, 585–592. [DOI] [PubMed] [Google Scholar]
- 22.Dictionary of Natural Products Online, http://dnp.chemnetbase.com/intro/ (Accessed: 2nd November 2016).
- 23. Congreve M., Carr R., Murray C., Jhoti H. A., Drug Discovery Today 2003, 8, 876–877. [DOI] [PubMed] [Google Scholar]
- 24.I. Colomer, C. J. Empson, P. Craven, Z. Owen, R. G. Doveston, I. Churcher, S. P. Marsden, A. Nelson, Chem. Commun 2016, 52, 7209–7212. LLAMA is available at https://llama.leeds.ac.uk. [DOI] [PubMed]
- 25. Bamborough P., Angew. Chem. Int. Ed. 2016, 55, 11382–11386; [DOI] [PMC free article] [PubMed] [Google Scholar]; Angew. Chem. 2016, 128, 11554–11558. [Google Scholar]
- 26. Korczynska M., J. Med. Chem. 2016, 59, 1580–1598. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Igoe N., J. Med. Chem. 2017, 60, 668–680. [DOI] [PubMed] [Google Scholar]
- 28.
- 28a. Chaikuad A., Petros A. M., Fedorov O., Xu J., Knapp S., MedChemComm 2014, 5, 1843–1848; [Google Scholar]
- 28b. Harner M. J., Chauder B. A., Phan J., Fesik S. W., J. Med. Chem. 2014, 57, 9687–9692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Collins P. M., Ng J. T., Talon R., Nekrosiute K., Krojer T., Douangamath A., Brandao-Neto J., Wright N., Pearce N. M., von Delft F., Acta Crystallogr. Sect. D 2017, 73, 246–255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.
- 30a. Pearce N. M., Bradley A. R., Krojer T., Marsden B. D., Deane C. M., von Delft F., Struct. Dyn. 2017, 4, 032104; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30b. Pearce N. M., Nat. Commun. 2017, 8, 15123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.
- 31a. Bavetsias V., J. Med. Chem. 2016, 59, 1388–1409; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31b. Westaway S. M., J. Med. Chem. 2016, 59, 1357–1369. [DOI] [PubMed] [Google Scholar]
- 32.
- 32a. Demont E. H., J. Med. Chem. 2015, 58, 5649–5673; [DOI] [PubMed] [Google Scholar]
- 32b. Poncet-Montange G., Biochem. J. 2015, 466, 337–346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Hall R. J., Mortenson P. N., Murray C. W., Prog. Biophys. Mol. Biol. 2014, 116, 82–91. [DOI] [PubMed] [Google Scholar]
- 34. Morley A. D., Drug Discovery Today 2013, 18, 1221–1227. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.