Skip to main content
Nature Portfolio logoLink to Nature Portfolio
. 2024 Jun 17;43(29):2229–2243. doi: 10.1038/s41388-024-03077-2

Impact of structural biology and the protein data bank on us fda new drug approvals of low molecular weight antineoplastic agents 2019–2023

Stephen K Burley 1,2,3,4,, Amy Wu-Wu 1, Shuchismita Dutta 1,2, Shridar Ganesan 2, Steven X F Zheng 2
PMCID: PMC11245395  PMID: 38886570

Abstract

Open access to three-dimensional atomic-level biostructure information from the Protein Data Bank (PDB) facilitated discovery/development of 100% of the 34 new low molecular weight, protein-targeted, antineoplastic agents approved by the US FDA 2019–2023. Analyses of PDB holdings, the scientific literature, and related documents for each drug-target combination revealed that the impact of structural biologists and public-domain 3D biostructure data was broad and substantial, ranging from understanding target biology (100% of all drug targets), to identifying a given target as likely druggable (100% of all targets), to structure-guided drug discovery (>80% of all new small-molecule drugs, made up of 50% confirmed and >30% probable cases). In addition to aggregate impact assessments, illustrative case studies are presented for six first-in-class small-molecule anti-cancer drugs, including a selective inhibitor of nuclear export targeting Exportin 1 (selinexor, Xpovio), an ATP-competitive CSF-1R receptor tyrosine kinase inhibitor (pexidartinib,Turalia), a non-ATP-competitive inhibitor of the BCR-Abl fusion protein targeting the myristoyl binding pocket within the kinase catalytic domain of Abl (asciminib, Scemblix), a covalently-acting G12C KRAS inhibitor (sotorasib, Lumakras or Lumykras), an EZH2 methyltransferase inhibitor (tazemostat, Tazverik), and an agent targeting the basic-Helix-Loop-Helix transcription factor HIF-2α (belzutifan, Welireg).

Subject terms: Structure-based drug design, Drug development

Introduction

X-ray protein crystallography and structure-guided approaches have been mainstays for drug discovery for more than two decades [1, 2]. Atomic-level, three-dimensional (3D) structures of biological macromolecules inform our understanding of target biology (reviewed in [3]), and provide important insights into target druggability for both small-molecule and/or biologic agents (reviewed in [4]). Today, macromolecular crystallography (MX) and 3D electron microscopy (3DEM) are routinely used in most large and many small biopharmaceutical companies for structure-guided optimization of affinity of small-molecule screening hits and lead compounds [1]. 3D biostructure data can also aid in surmounting some of the myriad challenges (e.g., avoiding unwanted off-target binding) inherent in turning biochemically active compounds into potent, drug-like molecules suitable for safety and efficacy testing in animals and humans [5]. Finally, starting points for medicinal chemistry campaigns (i.e., selectively binding chemical scaffolds) can be identified via fragment screening using nuclear magnetic resonance spectroscopy or NMR [6], MX [7], and 3DEM [8].

Public-domain 3D biostructure information generated using MX, 3DEM, or NMR is distributed on an open-access basis by a singular global data resource, known as the Protein Data Bank (PDB [9]). When the PDB was established in 1971 as the first open-access digital data resource in biology, it housed only seven protein structures [9]. Today, the PDB is regarded as a global public good vital to basic and applied research and education/training across the biological and biomedical sciences. In the spring of 2024, the PDB housed >220,000 experimentally determined, atomic-level 3D structures of biological macromolecules (i.e., proteins, DNA, and RNA), many of which have been visualized in the act of binding one or more small-molecule ligands, including United States Food and Drug Administration (US FDA) approved drugs. Since 2003, the PDB has been managed jointly according to the FAIR Principles of Findability-Accessibility-Interoperability-Reusability [10] by the Worldwide Protein Data Bank (wwPDB) partnership [11, 12], including the US Research Collaboratory for Structural Bioinformatics Protein Data Bank or RCSB PDB [1315], Protein Data Bank in Europe [16], Protein Data Bank Japan [17], Protein Data Bank China [18], Biological Magnetic Resonance Data Bank [19], and the Electron Microscopy Data Bank [20].

The RCSB PDB (RCSB.org) headquartered at Rutgers, The State University of New Jersey (with additional performance sites at the University of California San Diego and the University of California San Francisco) serves as the US wwPDB data center and as the wwPDB-designated Archive Keeper for the PDB. On two previous occasions, we have reviewed the impact of structural biologists and PDB structures on US FDA drug approvals. Initially, we examined 210 new drugs approved by US FDA 2010–2016 across all therapeutic areas [21], and determined that open access to nearly 6000 atomic-level 3D structures of molecular targets stored in the PDB archive facilitated discovery and development of 185 (~88%) of these new medical entities (NMEs). Subsequently, we focused on 79 new antineoplastic agents with known molecular targets approved by US FDA 2010–2018 [22], and determined that open access to PDB data facilitated discovery and development of >90% of these life-changing drugs. More detailed analyses of the 54 low molecular weight (LMW) NMEs for cancer treatment approved by US FDA 2010–2018 revealed that >70% were the product of structure-guided drug discovery (SGDD) efforts at biopharmaceutical companies.

Since these analyses of the impact of PDB data on drug discovery and development were published by RCSB PDB, the world of structural biology underwent a seismic shift with emergence of new software tools that rely on artificial intelligence/machine learning (AI/ML) methods to predict protein structure from amino acid sequence alone at accuracies comparable to lower-resolution experimental structures. Development of AlphaFold2 [23], RosettaFold2 [24], etc. would not have been possible without open access to complete, rigorously validated, expertly biocurated 3D biostructure data archived in the PDB [25]. More recently, AI/ML methods, similarly dependent on open access to PDB data, have been developed to predict how proteins bind small molecules and other proteins (e.g., RoseTTAFold All-Atom [26], AlphaFold3 [27]). These new tools for predicting how small molecules bind to proteins go beyond a plethora of previously developed computational approaches to structure-based drug discovery based on PDB data (e.g., docking methods used in virtual screening and lead optimization, free energy perturbation methods for predicting ligand affinity, and statistical and data-driven tools for analyzing and designing protein ligand complexes) many of which are used across the biopharmaceutical industry.

Herein, we review the ways that structural biologists and open access to PDB data facilitated discovery and development of 34 new antineoplastic LMW-NMEs approved by US FDA 2019–2023. In addition to an aggregate review of PDB impact on new drug approvals, illustrative case studies are presented for six first-in-class small-molecule anti-cancer drugs, including a selective nuclear export Exportin 1 inhibitor (selinexor, Xpovio), an ATP-competitive CSF-1R receptor tyrosine kinase inhibitor (pexidartinib,Turalia), a non-ATP-competitive “STAMP” inhibitor of the BCR-Abl fusion protein (asciminib, Scemblix), a covalently-acting G12C mutant KRAS inhibitor (sotorasib, Lumakras or Lumykras), an EZH2 methyltransferase inhibitor (tazemostat, Tazverik), and a transcription factor (HIF-2α) targeting agent (belzutifan, Welireg).

Low-molecular-weight anti-neoplastic drugs approved by US FDA 2019–2023

In aggregate, 38 antineoplastic LMW-NMEs were approved by US FDA 2019–2023. Three of these newly approved drugs [umbralisib (Ukoniq), melphalan flufenamide (Pepaxto), and mobocertinib (Exkivity)] were omitted from this review, because each one was withdrawn from clinical use before the end of 2023. Lurbinectedin (Zepzelca), which alkylates guanine residues in the minor groove of DNA, was also omitted from our analyses. All 34 of the remaining antineoplastic LMW-NMEs approved by US FDA during this period target human proteins for which the PDB houses one or more atomic-level 3D structure. All PDB structures are freely available at no charge and with no limitations on data usage.

Impact of PDB structures on anti-neoplastic drug approvals

We searched the PDB archive using corresponding reference amino acid sequences from UniProt (www.uniprot.org) to identify 3D biostructures that include all or part of the known macromolecular target for each of the 34 protein-targeting antineoplastic LMW-NMEs (Table 1). As of February 2024, the archive contained one or more target protein structures for all of these drug targets (34/34, 100%). More than 80% of the target protein structures for the 34 LMW-NMEs were deposited to the PDB at least a decade before the drug was approved for clinical use by US FDA. The median time between the first PDB deposition of each LMW-NME target protein structure and US FDA approval of the LMW-NME exceeded 17 years (Table 2). The LMW-NMEs themselves are also well represented in the PDB. For ~74% (25/34) of the LMW-NMEs, one or more PDB structures reveal at the atomic level precisely how the drug binds to the corresponding target protein (Table 2).

Table 1.

Overview of PDB holdings for antineoplastic LMW-NMEs and their known molecular targets approved by US FDA 2019–2023.

LMW-NME protein target class NMEs in protein target class NMEs with protein target structures in PDB Total unique PDB IDs for NME protein target structures (>99% Identity) Number with protein target/NME co-complex structure(s) in PDB
Enzyme: Protein Kinases 24 24 (100%) 1388 20 (~83%)
Enzyme: IDH1 1 1 (100%) 48 1 (100%)
Enzyme: EZH2 1 1 (100%) 27 0 (0%)
Enzyme: γ-secretase 1 1 (100%) 19 1 (100%)
Nuclear Hormone Receptors: ERα, AR 2 2 (100%) 527 1 (50%)
GPCR: GnRHR 1 1 (100%) 1 0 (0%)
GPTase: G12C KRAS 2 1 (100%) 328 1 (50%)
Transcription Factor: HIF-2α 1 1 (100%) 32 1 (100%)
Karyopherin: XPO1 1 1 (100%) 8 1 (100%)
All 34 34 (100%) 2378 26 (~76%)

Table 2.

Evidentiary summary for structure-guided drug (SGDD) discovery of anti-cancer LMW-NMEs approved by US FDA 2019–2023. (N.B: PDB holdings reported in Column 5 were assessed in March 2024).

NME Generic Name
(TradeName)
Sponsor
Company
Target
Class
Target
UniProt ID
Oncology
Indication
Earliest >95% Amino Acid Sequence
Identical PDB ID for Target Protein/Year
(Literature Citation)
Number (PDB IDs containing human target protein)
US FDA Approval
Yeara
Delta (Years from 1st PDB ID public data release to US FDA Approval)
Target Protein/LMW-NME
Complex PDB ID
(Literature Citation)
Year of PDB Public
Data Release
Source of
Target-NME
Complex
PDB ID
SGDD
(Yes, Prob,
Poss,
Unl)

erdafitinib

(Balversa)

Janssen Protein Kinase

FGFRa

P11362

Bladder Cancer

1fgk/1996

[90]

Number 70

2019

Delta 23

5ew8

[33]

2016 (Before Approval)

Academia Yes

alpelisib

(Piqray)

Novartis Protein Kinase

PI3K

P42336

Breast Cancer

2rd0/2007

[91]

Number 103

2019

Delta 12

4jps

[34]

2013 (Before Approval)

Industry Yes

pexidartinib

(Turalio)

Plexxikon→

Daichi Sankyo

Protein Kinase

CSF-1R

P07333

TGCT

2i0v/2006

[66]

Number 24

2019

Delta 13

4r7h

[35]

2015 (Before Approval)

Academia Yes

entrectinib

(Rozlytrek)

Genentech Protein Kinase

ALK

Q9UM73

NSCLC

2xb7/2010

[92]

Number 77

2019

Delta 9

5fto

[36]

2016 (Before Approval)

Industry Yes

zanubrutinib

(Brukinsa)

BeiGene USA Protein Kinase

BTK

Q06187

Mantle Cell

Lymphoma

1k2p/2001

[93]

Number 129

2019

Delta 18

6j6m

[37]

2019 (At Approval)

Industry Yes

avapritinib

(Avyakit)

Blueprint Medicines Protein Kinase

PDGFRα

P16234

GIST

5k5x/2016

[94]

Number 13

2020

Delta 5

8pqh

[46]

2024 (After Approval)

Academia Poss

tucatinib

(Tukysa)

Oncothyreon→

Seattle Genetics

Protein Kinase

Her2

P04626

Breast Cancer

3pp0/2011

[95]

Number 49

2020

Delta 9

N/A N/A Poss

pemigatinib

(Pemazyre)

Incyte Protein Kinase

FGFR2

P21802

Cholangio

Carcinoma

1gjo/2003

10.2210/pdb1gjo/pdb

Number 52

2020

Delta 17

7wcl

[47]

2022 (After Approval)

Academia Prob

capmatinib

(Tabrecta)

Incyte→

Novartis

Protein Kinase

MET

P08581

NSCLC

1r0p/2003

[96]

Number 116

2020

Delta 17

N/A N/A Poss

selpercatinib

(Retevmo)

Lilly→

Loxo Oncology

Protein Kinase

RET

P07949

NSCLC

2ivu/2006

[97]

Number 34

2020

Delta 14

7ju6

[48]

2021 (After Approval)

Academia Prob

ripretinib

(Qinlock)

Deciphera Pharmaceuticals Protein Kinase

KIT

P10721

GIST

1pkg/2003

[98]

Number 42

2020

Delta 17

6mob

[38]

2019 (Before Approval)

Industry Yes

pralsetinib

(Gavreto)

Blueprint Medicines Protein Kinase

RET

P07949

NSCLC

2ivu/2006

[97]

Number 34

2020

Delta 14

7ju5

[48]

2020 (At Approval)

Academia Prob

tepotinib

(Tepmetko)

Merck Protein Kinase

MET

P08581

NSCLC

1r0p/2003

[96]

Number 116

2021

Delta 18

4r1v

[39]

2015 (Before Approval)

Industry Yes

trilaciclib

(Cosela)

G1 Therapeutics Protein Kinase (For Bone Marrow Protection)

CDK4-CKD6

P11802-

Q00534

Breast Cancer

2w96/2009-1bi7/1998

[99, 100]

Number 14–19

2021

Delta 12–23

N/A N/A Prob

tivozanib

(Fotivda)

Kyowa Kirin→

Aveo Pharmaceuticals

Protein Kinase

VEGFR

P35968

Renal Cell Carcinoma

1vr2/2000

[101]

Number 54

2021

Delta 21

4ase

[49]

2012 (Before Approval)

Industry Yes

infigratinib

(Truseltiq)

Novartis→

Bridge Biopharma

Protein Kinase

FGFR2

P21802

Cholangio carcinoma

1gjo/2003

10.2210/pdb1gjo/pdb

Number 52

2021

Delta 18

3tt0

[40]

2011 (Before Approval)

Industry Yes

asciminib

(Scemblix)

Novartis

Protein Kinase

Non-ATP-Competitive

BCR-Abl

P00519

Ph+ CML

1fpu/2000

[69]

Number 81

2021

Delta 21

5mo4

[41]

2017 (Before Approval)

Industry Yes

futibatinib

(Lytgobi)

Lilly Protein Kinase

FGFR2

P21802

Cholangio carcinoma

1gjo/2003

10.2210/pdb1gjo/pdb

Number 52

2022

Delta 19

6mzq

[42]

2019 (Before Approval)

Academia Yes

pirtobrutinib

(Jaypirca)

Lilly Protein Kinase

BTK

Q06187

Mantle Cell Lymphoma

1k2p/2001

[93]

Number 129

2023

Delta 22

8fll

[43]

2023 (At Approval)

Industry Yes

quizartinib

(Vanflyta)

Ambit→

Daichi Sankyo

Protein Kinase

FLT3

P36888

AML

1rjb/2004

[102]

Number 10

2023

Delta 19

4xuf

[50]

2015 (Before Approval)

Academia Prob

repotrectinib

(Augtyro)

Turning Point Therapeutics→

BristolMyers Squibb

Protein Kinase

ROS1

P08922

ROS1-positive NSLC

3zbf/2013

[103]

Number 4

2023

Delta 10

7vkn

[44]

2021 (Before Approval)

Industry Yes

momelotinib

(Ojjaara)

GSK Protein Kinase

Jak1

P23458

Myelofibrosis

3eyg/2009

[104]

Number 50

2023

Delta 14

7nns

(Unpublished, see doi.org/10.2210/pdb7nns/pdb)

2021 (Before Approval)

Academia Prob

fruquintinib

(Fruzaqla)

Takeda Protein Kinase

VEGFR

P35968

Colorectal Cancer

1vr2/1999

[101]

Number 54

2023

Delta 24

N/A N/A Prob

capivasertib

(Truqap)

AstraZeneca Protein Kinase

AKT2

P31751

Breast Cancer

1o6k/2002

[105]

Number 18

2023

Delta 21

4gv1

[45]

2013 (Before Approval)

Industry Yes

olutasidenib

(Rezlidhia)

Forma

Therapeutics

Isocitrate Dehydrogenase

IDH1

O75874

AML

1t09/2004

[29]

Number 48

2022

Delta 18

6u4j

[54]

2020 (Before Approval)

Industry Yes

tazemetostat

(Tazverik)

Epizyme Methyl transferase

EZH2

Q15910

Epithelioid sarcoma

4mi0/2013

[79]

Number 27

2020

Delta 7

N/A N/A Poss

nirogacestat

(Ogsiveo)

Pfizer→

Springworks

Therapeutics

Multi-subunit

Protease

Nicrastin subunit of γ-secretase

Q92542

Desmoid Tumors

5a63/2015

[106]

Number 19

2023

Delta 8

N/A N/A Unl

darolutamide

(Nubeqa)

Orion Pharma→

Bayer

Nuclear Hormone

Receptor

AR

P10275

Prostate Cancer

1e3g/2001

[107]

Number 92

2019

Delta 18

N/A N/A Prob
elacestrant (Orserdu)

Esai→

Takeda

Nuclear Hormone Receptor:

Selective Estrogen Receptor Degrader

ERα

P03372

Breast Cancer

1a52/1998

[108]

Number 436

2023

Delta 25

7te7

[56]

2022 (Before Approval)

Academia Prob

relugolix

(Orgovyx)

Takeda GPCR

GnRHR

P30968

Prostate Cancer

7br3/2020

[57]

Number 1

2020

Delta 0

N/A N/A Unl

sotorasib

(Lumakras)

Amgen GTPase

KRAS G12C

P01116

NSCLC

1d8d/2000

[109]

Number 329

2021

Delta 21

6oim

[77]

2019 (Before Approval)

Industry Yes

adagrasib

(Krazati)

Mirati Therapeutics GTPase

KRAS G12C

P01116

NSCLC

1d8d/2000

[109]

Number 329

2022

Delta 22

N/A N/A Yes

belzutifan

(Welireg)

Merck Transcription Factor

HIF-2α

Q99814

VHL Disease

3f1n/2009

[83]

Number 32

2021

Delta 12

7w80

[55]

2022 (After Approval)

Academia Prob

selinexor

(Xpovio)

Karyopharm Therapeutics Karyopherin

XPO1

O14980

Multiple Myeloma

3gb8/2009

[58]

Number 8

2019

Delta 20

7l5e

[52]

2021 (After Approval)

Academia Yes

aFGFR fibroblast growth factor receptor, PI3K phosphoinositide 3 kinase, CSF-1R colony stimulating factor-1 receptor, TGCT tenosynovial giant cell tumor, ALK anaplastic lymphoma kinase, BTK Bruton’s tyrosine kinase, PDGFRα platelet derived growth factor receptor α, HER2 human epidermal growth factor receptor 2, GIST gastrointestinal stromal tumor, MET hepatocyte growth factor receptor, NSCLC non-small-cell lung cancer, RET rearranged during transfection receptor tyrosine kinase, KIT cluster of differentiation 117, CDK-4 cyclin-dependent kinase-4, CDK-6 cyclin-dependent kinase-6, VEGFR vascular endothelial growth factor receptor, Ph+ CML Philadelphia chromosome positive chronic myeloid leukemia, AML acute myeloid leukemia, FLT3 feline McDonough sarcoma-like tyrosine kinase 3, ROS1 proto-oncogene tyrosine-protein kinase ROS encoded by the ROS1 gene, AKT2 AKT2 serine-threonine kinase, IDH1 isocitrate dehydrogenase 1, EZH2 enhancer of zeste homolog 2, AR androgen receptor, ERα estrogen receptor α, GPCR G-protein coupled receptor, GnRHR type 1 gonadotropin-releasing hormone receptor, GTPase nucleoside guanine triphosphate hydrolase, KRAS Kirsten rat sarcoma virus protein, G12C Glycine 12 to Cysteine, HIF-2α Hypoxia-inducible factor-2α, VHL Von Hippel Lindau, XPO1 exportin 1.

The 34 LMW-NMEs target 9 distinct classes of proteins (Table 1), including protein kinases, three other enzymes [isocitrate dehydrogenase (IDH1), a methyltransferase (EZH2), and the nicrastin subunit of γ-secretase], Exportin 1, two nuclear hormone receptors [estrogen receptor α (ER α) and androgen receptor (AR)], a G-protein coupled receptor (GPCR:GnRHR), a GTPase (G12C KRAS), and a transcription factor (HIF-2α). In total (as of February 2024), we identified 2,378 “Relevant Structures” housed within the PDB, which include unique PDB IDs containing the following: (a) a reference or a mutant/variant form of the target protein; (b) a LMW-NME bound to a reference or mutant/variant form of its target protein; (c) a LMW-NME bound to a potential alternative target protein; or (d) a LMW-NME bound to a possible off-target protein. The number of Relevant Structures identified for each target or target class ranges from 1 for the GPCR:GnRHR to 1388 for the protein kinases.

Review of PDB archival holdings and the scientific literature pertaining to each NME target/LMW-NME combination summarized in Table 2 revealed that public domain 3D structure data facilitated discovery and development of all 34 LMW-NMEs in the following ways:

  • (i)

    Target Biology: Atomic-level 3D structures provide functional insights that are not always apparent from amino acid sequence (reviewed in [3, 22]). Maximizing understanding target biology can help avoid failures in costly Phase 3 clinical trials, wherein the biological biochemical activity of the target protein is inhibited yet the desired clinical benefit is not realized.

    In every case, the PDB houses one or more experimentally-determined atomic-level 3D structure of each NME target.

  • (ii)

    Target Druggability: Atomic-level 3D structures enable visualization of surface features deemed likely to bind small organic compounds for inhibition of enzymatic action or other interdiction of biochemical/biological function (reviewed in [4, 28]).

    In every case (34/34), PDB structures revealed one or more potential small-molecule binding sites, either on the surface of a target protein or within a protein-protein interface (e.g., the homodimeric IDH1 enzyme PDB ID 1t09 [29]). Target druggability is also informed by atomic-level, 3D structures housed in the PDB that reveal how small-molecule ligands bind to target proteins. For many of the 34 LMW-NME targets, the PDB houses co-crystal structures of the target bound to non-proprietary tool compounds (data not shown).

  • (iii)

    Structure-Guided Lead Optimization: Co-crystal structures of target protein-ligand complexes are widely used across the biopharmaceutical industry to guide optimization of potency (reviewed in [1]). In the most favorable cases, knowledge of co-complex structures with potential off targets can also be employed to help ensure the desired selectivity profile and reduce the likelihood of off-target toxicity. (For example, incidental inhibition of glycogen synthase kinase-3β (GSK-3β) causes hyperglycemia. The PDB archive houses more than 100 atomic-level 3D structures of human GSK-3β, many of which include bound small-molecules that interfere with substrate binding.) In the absence of experimental co-crystal structures of the target protein with compounds from the medicinal chemistry lead series, in silico docking tools can be used to help guide optimization of potency and selectivity (reviewed in [30]). Since the advent of the advent of the “Resolution Revolution” [31] in cryo-electron microscopy, 3DEM structures are increasingly being used as a source of information for SGDD. For cases in which an experimentally-determined 3D structure of the target protein is not available, computed structure models [14] can be combined with these same in silico docking tools. Machine learning approaches are also being used with increasing frequency to drive medicinal chemistry campaigns (reviewed in [32]).

In 28/34 ( ~ 82%) of cases, there is either direct or indirect evidence from the PDB archive (e.g., co-crystal structures), the scientific literature (e.g., mention of use of 3D structures and computational docking methods in publications), and/or private communications with industry experts to the effect that structure-guided lead optimization with the target protein reliant on experimental and/or computational tools with public domain PDB structures were used by the sponsor biopharmaceutical company or its predecessor when prosecuting the NME target (Table 2).

While it impossible to ascertain with certainty that structural data previously present in the PDB were used by the drug discovery project team, we think it to be true for every one of the new small-molecule anti-cancer agents approved by US FDA 2019–2203. In private communications with industry structural biologists, one of us (S.K.B., Director of the RCSB Protein Data Bank) has been appreciatively told on many occasions that every new drug discovery project begins with a review of relevant structures housed in the PDB. Given the sophistication of biopharmaceutical company researchers, it seems highly unlikely that public domain information with direct bearing on the task at hand would be willfully ignored when speed is of the essence and success is vital to the future of the organization.

In 25/34 (~74%) of cases, the PDB archive contains a co-complex structure of the NME bound to its target protein (Table 2), with13 coming from structural biologists based in industry and 12 coming from those based in academia (Table 2).

Not surprisingly, most of the 28 LMW-NMEs identified as confirmed or probable products of SGDD correspond to the LMW-NMEs targeting one or more protein kinase (Table 2).

Fourteen of the protein kinase inhibitors were confirmed as products of SGDD (“Yes” in Table 2) on the basis of direct evidence from the scientific literature (or private communications with industry experts) that the sponsor company or its predecessor (for acquired programs) or a competitor company used experimental and/or computational methods to understand and/or optimize how each LMW-NME bound to its target protein [3345].

Seven protein kinase inhibitors were identified as probable products of SGDD (“Prob” in Table 2) on the basis of indirect evidence, including (a) PDB housed a structure of the target protein 10 or more years prior to drug approval; and/or (b) structural biologists based in either academia or industry deposited a co-complex structure of the LMW-NME bound to its target protein to the PDB; and/or (c) the target had been prosecuted successfully using SGDD previously by another company. We classified these less clear-cut cases as probable because we think it highly likely that the sponsor company was in possession of the same or similar data given the ubiquity of expert structural biology and computational chemistry teams across the biopharmaceutical industry today [33, 4650].

Three kinase inhibitors were identified as possible products of SGDD (“Poss” in Table 2), because of a paucity of information. Notwithstanding lack of evidence confirming use of experimental and/or computational structural biology tools to discover these three kinase inhibitors, we think it more likely than not that SGDD played at least supporting roles during medicinal chemistry optimization of each compound. As of February 2024, the PDB housed at least 5440 structures of protein kinases, including 4817 proteins of human origin. Moreover, many sponsor companies are highly experienced in using SGDD to accelerate discovery and development of potent, selective protein kinase inhibitors. For example, nilotinib (Tasigna), the second ATP-competitive BCR-Abl inhibitor to be approved by US FDA, was the product of a SGDD campaign at Novartis [51]. (N.B.: Nilotinib’s predecessor imatinib, the first kinase inhibitor to be approved by US FDA (in 2001), was not a product of SGDD.)

The remaining seven LMW-NMEs identified as confirmed (“Yes”) or probable (“Prob”) products of SGDD target other classes of proteins (Table 2).

Three LMW-NMEs, including two GTPase inhibitors (sotorasib, Lumakras or Lumykras; adagrasib, Krazati), a nuclear export protein inhibitor (selinexor, Xpovio), and an isocitrate dehydrogenase inhibitor (olutasidinib, Rezlidhia), were confirmed as products of structure-guided drug discovery (“Yes” in Table 2) on the basis of direct evidence from the scientific literature (or private communications with experts in academia or industry) that the sponsor company used structural biology tools to study how medicinal chemistry compounds and the LMW-NME bound to its target protein [5254].

Three LMW-NMEs, including an anti-androgen (darolutamide, Nubeqa), a transcription factor-targeting agent (belzutifan, Welireg), and an anti-estrogen (elacestrant, Orserdu), were identified as probable products of SGDD (“Prob” in Table 2) on the basis of indirect evidence [55, 56]. The evidentiary record for these drugs met identical criteria to those used above to identify the seven protein kinase inhibitors enumerated in Table 2 as probable products of SGDD.

Finally, two LMW-NMEs, nirogacestat (Ogsiveo) and relugolix (Orgovyx), were identified as unlikely to be products of SGDD (“Unl” in Table 2). For nirogacestat, a γ-secretase inhibitor, a single 3.4Å resolution 3DEM structure (PDB ID 5a63) was released only eight years prior to US FDA approval. (N.B.: There is no PDB structure containing is small-molecule.) Relugolix targets a GPCR (Gonadotropin-Releasing Hormone Receptor GNRH1R). The only human GNRH1R structure housed in the archive (PDB ID 7br3 [57]) was released in 2020 more or less coincident with NME approval. It revealed how elagolix, a compound that is structurally-similar to relugolix, binds to the same GPCR target.

The breadth and depth of PDB structures and publications coming from academia and industry revealed by our analyses reaffirms that 3D biostructure information impacts discovery of LMW-NMEs in real time. Conservative estimates suggest that MX structures of drug target proteins held as trade secrets inside company firewalls across the biopharmaceutical industry are comparable in total number to PDB archival holdings (i.e., ~220,000 structures in spring 2024). Willingness on the part of industry to share a subset of these data with academic researchers is essential for the long-term health of the experimental and computational chemistry eco-systems supporting SGDD. It is encouraging that >50% (13/25) of the PDB structures of the antineoplastic LMW-NMEs bound to their targets enumerated in Table 2 were deposited by industrial structural biology teams.

Given the highly competitive nature of biopharmaceutical industry, PDB deposition of 3D biostructures determined inside biopharmaceutical companies often lags the actual research and may not be permitted by internal policies until after drug discovery and development campaigns have succeeded. Findings summarized in Table 2 document that co-complex structures for 17 the LMW-NMEs bound to their target proteins were released into the PDB one or more years prior to US FDA approval, whereas three were released during the year of new drug approval and five were not released until at least one year following approval.

Case studies

Going beyond these aggregate analyses, we present in brief six case studies illustrating the impact of structural biology and PDB data on discovery and development of six first-in-class antineoplastic LMW-NMEs approved by US FDA 2019–2023.

Selinexor blockade of the exportin 1 cargo-binding groove

Exportin 1 (XPO1), also known as Chromosomal Maintenance 1 protein (CRM1), is a key nuclear export protein responsible for nucleocytoplasmic transport of numerous proteins and RNAs. XPO1 was identified as a promising therapeutic target in various cancers (e.g., multiple myeloma) on the strength of its role in nuclear export of tumor suppressor proteins. The first MX structure of near-full-length human XPO1 was determined at 2.9Å resolution in 2009 (PDB ID 3gb8 [58]). This structure also included the Nuclear Export Signal (NES) of snurportin, enabling significant advances in our understanding of XPO1 function. Further atomic-level insights into the function of XPO1 were provided by five related MX structures of a heterotrimeric complex of S. cerevisiae XPO1, human Ran, and S. cerevisiae RanBP1 bound to various cargo protein oligopeptides ranging in length from 20 to 22 amino acid residues (PDB IDs 5dhf, 5dif, 5di9, 5dha, 5dh9 [59]). Selinexor (Xpovio, Fig. 1A left), a first-in-class, orally-bioavailable covalent inhibitor of XPO1 was discovered and developed by Karyopharm Therapeutics Inc. (hereafter Karyopharm). It interferes with nuclear export of cargo proteins by occupying the cargo binding groove. Aberrant accumulation of proteins within the nucleus induces apoptosis of certain malignant cells. Karyopharm utilized structure-guided approaches to discover and develop selinexor. Karyopharm’s initial discovery efforts commenced shortly before publication of the first XPO1 structures in 2009 [58, 60]. Following public release of PDB ID 3gb8 [58], the sponsor company began using the MX structure for computational docking of small molecules. Binding of some of the compounds from the resulting lead series was visualized experimentally by Chook and co-workers, including KPT-185/PDB ID 4gmx [61], KPT-251/PDB ID 4gpt [62], and KPT-276/PDB ID 4wvf [63]. Importantly, these structures revealed a common mechanism of action (MOA) for members of the lead series distinct from that of the natural product Leptomycin B (PDB ID 4hat [64]), which failed in early stage clinical trials due toxicity concerns in the 1990s. In 2019, selinexor was granted Accelerated Approval from US FDA for use in combination with dexamethasone for treatment of adults with relapsed refractory multiple myeloma who have received at least four prior therapies, etc. The precise mechanism of action (MOA) of the drug was not revealed in 3D at the atomic level until the 2021 publication of a 1.9Å resolution co-crystal structure of selinexor (previously known as KPT-330) bound to a heterotrimeric complex of an engineered form of S. cerevisiae XPO1 (with a humanized cargo binding groove), human RAN, and S. cerevisiae RAN GPTase activating protein 1 (XPO1-RAN-RANBP1, PDB ID 7l5e [52], Fig. 1B and 1B Inset). At the time of writing, selinexor had been approved for use in two additional oncologic indications, and a related compound (eltanexor, Fig. 1A right, PDB ID 5jlj [65]) was under investigation by Karyopharm in clinical trial NCT02649790 (Study of the Safety, Tolerability and Efficacy of KPT-8602 in Patients With Relapsed/refractory Cancer Indications).

Fig. 1. Selinexor blockade of the cargo-binding groove of XPO1.

Fig. 1

A Chemical structure of selinexor (left) and eltanexor or KPT-8602 (right). B Ribbon representation of the co-crystal structure of selinexor (Atom type color coding: C-gray; N-blue; O-red; S-yellow; F-green; Cl-green) bound to the XPO1 (pink), Ran GTP (purple), and Ran BP1 (green) heterotrimer (PDB ID 7l5e [52]). Inset Zoomed in view of selinexor bound to the XPO1 cargo binding groove. All 3D figures were generated using the Mol* Viewer [110]. Dashed line ribbons denote parts of the polypeptide chain that could not be visualized using MX. All chemical structures were drawn using the RCSB PDB Chemical Sketch Tool available at www.rcsb.org/chemical-sketch [14]. All ribbon representation drawings were prepared using the Mol* Viewer [110].

Pexidartinib inhibition of CSF-1R

The Colony-Stimulating Factor-1 Receptor (CSF-1R) tyrosine kinase plays a crucial role in regulating macrophage and osteoclast production and has been implicated in tenosynovial giant cell tumor (TGCT) a rare, locally aggressive neoplasm of the joint or tendon sheath. The earliest atomic-level 3D structure of the CSF-1R kinase catalytic domain, determined at 2.8Å resolution and contributed to the PDB by industry structural biologists, was made publicly available in 2006 (PDB ID 2i0v [66]). This pioneer structure provided valuable insights into target druggability and tool compound binding (i.e., co-complex with a non-proprietary quinolone inhibitor). To target CSF-1R, Plexxikon, a pioneer in SGDD for protein kinases [67], used their approach to discover and develop pexidartinib (Fig. 2A). The drug is an ATP-competitive inhibitor of the protein tyrosine kinase activity of CSF-1R that disrupts proliferative signals contributing to uncontrolled growth of malignant cells in TGCT. A 2.8Å resolution crystal structure of the CSF-1R/pexidartinib co-complex (PDB ID 4r7h [35], Fig. 2B and 2B Inset), contributed by an academic research group, explained its MOA in 3D at the atomic level. Like many protein kinase inhibitors licensed for clinical use in oncology and other therapeutic areas, pexidartinib binds within the hinge region between the N- and C-terminal sub-domains of the kinase catalytic domain (Fig. 2B and 2B Inset), where it blocks entry of the ATP substrate (reviewed in [68]). US FDA approval of pexidartinib in 2019 marked a landmark in therapeutic management of TGCT, offering a non-surgical treatment option that could significantly improve patient outcomes.

Fig. 2. Pexidartinib inhibition of CSF-1R.

Fig. 2

A Chemical structure of pexidartinib. B Ribbon representation of the CSF-1R (pink)/pexidartinib co-crystal structure (PDB ID 4r7h [35]). Inset Zoomed in view of pexidartinib occupying the active site of the kinase catalytic domain. (Drug atom color coding and dashed line ribbons as in Fig. 1).

Asciminib Inhibition of BCR-Abl

The BCR-Abl fusion protein is the product of a balanced chromosomal translocation involving chromosomes 9 and 22, generating the so-called Philadelphia chromosome (Ph) plus the T(9;22) translocation. The resulting BCR-Abl fusion protein (BCR denoting breakpoint cluster region) is a constitutively active non-receptor protein tyrosine kinase that is the cause of Philadelphia chromosome–positive chronic myeloid leukemia (Ph+ CML). The first atomic-level MX structure of the Abl kinase catalytic domain (bound to a variant of imatinib, an ATP-competitive inhibitor) was determined in 2000 and made public by Kuriyan and co-workers (PDB ID 1fpu [69]). The same academic group determined the co-crystal structure of imatinib itself (Gleevec, an ATP-competitive inhibitor, Fig. 3A left) bound to Abl (PDB ID 1iep [70], Fig. 3B) shortly before Novartis obtained accelerated approval of the drug by US FDA. Novartis subsequently used Abl kinase domain co-crystal structures and computational chemistry tools to discover a second-generation BCR-ABL inhibitor nilotinib (or Tasigna, Fig. 3A center). Later, SGDD was also used by Novartis to discover and develop a mechanistically distinct BCR-Abl inhibitor that does not target the enzyme active site. Instead, their LMW-NME asciminib (Fig. 3A right, approved by US FDA for treatment of Ph+ CML in 2021) blocks the myristoyl-binding site within the Abl kinase domain, thereby reducing enzyme activity [71]. This approach yielded a new treatment option for patients who have developed resistance to ATP-competitive BCR-Abl inhibitors or were unable to tolerate the side effects of such agents (e.g., commonly reported imatinib side effects include edema, nausea, vomiting, muscle cramps, musculoskeletal pain, diarrhea, rash, fatigue, and abdominal pain). A 2.2Å resolution co-crystal structure of asciminib bound to the Abl kinase domain (Fig. 3C, PDB ID 5mo4 [41], contributed to PDB by structural biologists at the Genome Institute Novartis in 2016) revealed the MOA of the drug in 3 C Inset at the atomic level. This structure also explains the MOA of nilotinib, which is ATP-competitive. Asciminib has been described as a “STAMP inhibitor” (specifically targeting the Abl myristoyl pocket) that reduces enzyme activity by binding to an allosteric pocket within the kinase catalytic domain. Unlike Abl protein, which has an N-terminal myristoyl group, the BCR-Abl fusion protein encoded by the Philadelphia chromosome lacks this post-translational modification (PTM) and is not autoinhibited. A 3.4Å resolution structure of full-length Abl (PDB ID 1opl [72]), determined in 2003 revealed in atomic detail how the N-terminal myristoyl group binds to an allosteric pocket in the kinase catalytic domain. This effect stabilizes the tertiary structure of the enzyme, such that the SH2 domain (occurring near the N-terminus of the polypeptide chain) and the SH3 domain interact with the C-terminal portion kinase catalytic domain, thereby autoinhibiting enzyme activity. Asciminib received Accelerated Approval from US FDA in 2022 for treatment of Ph+ CML in chronic phase, previously treated with two or more tyrosine kinase inhibitors.

Fig. 3. Asciminib inhibition of BCR-Abl.

Fig. 3

A Chemical structures of imatinib (left), nilotinib (center), and asciminib (right). B Ribbon drawing of the co-crystal structure of imatinib bound to murine Abl (pink) (PDB ID 1iep [70]). C Ribbon drawing of the co-crystal structure of human Abl (pink) bound to both nilotinib and asciminib (PDB ID 5mo4 [41]). Inset Zoomed in view of asciminib occupying the myristoyl-binding site within the kinase catalytic domain. (Drug atom color coding and dashed line ribbons as in Fig. 1).

Sotorasib inhibition of G12C KRAS

KRAS is one of the most frequently mutated oncogenes in human cancers, playing a critical role in regulating cell growth, differentiation, and apoptosis. Missense mutations in KRAS lead to uncontrolled cellular proliferation and tumorigenesis. Such mutations are found in ~15% of all human cancers, highlighting the pivotal role of KRAS in oncogenesis [73]. Among the many distinct KRAS mutations that have detected during tumor DNA sequencing, the Glycine12→Cysteine (G12C) change is particularly noteworthy. This somatic mutation constitutively activates KRAS and promotes uncontrolled cell growth. G12C KRAS is locked in an active conformation that drives oncogenesis. It is found in about ~30% of non-small cell lung cancer (NSLC), >85% of pancreatic cancer, and ~40% of colorectal cancer [73, 74]. The first MX structure of a closely-related HRAS protein (PDB ID 5p21 [75], a rat HRAS structure highly similar to human KRAS dating from 1990) proved critical for understanding how the human RAS enzymes (HRAS, KRAS, and NRAS) function as molecular switches that cycle between an inactive GDP-bound state and an active GTP-bound state. Normally, this cycle of activation followed by inactivation is tightly regulated by guanine nucleotide exchange factors (GEFs) like Son of Sevenless (SOS, which facilitate exchange of GDP for GTP) and GTPase-activating proteins (GAPs, which promote hydrolysis of GTP to GDP), returning KRAS to its inactive state [74]. Somatic mutations disrupt enzyme regulation resulting in continuous signaling through the RAS/RAF/MAPK/ERK signaling pathway. Notwithstanding the attractiveness of KRAS as an anti-cancer drug target, it was thought by many to be “undruggable” after multiple biopharmaceutical companies tried and failed. In 2013, however, SGDD efforts led by Shokat revolutionized KRAS inhibitor discovery and development with design of compounds that bind irreversibly to a pocket below the switch II region by engaging with the acquired cysteine of G12C KRAS, proving that KRAS activity can be inhibited with small molecules [64]. Subsequently, SGDD by Amgen yielded sotorasib (Lumakras or Lumykras, Fig. 4A left), a compound that binds selectively and irreversibly to the Cys residue of G12C KRAS [76]. PDB ID 6o8m [77] revealed the MOA of sotorasib (previously known as AMG 510) in 3D at the atomic level (Fig. 4B and 4B Inset). This first-in-class LMW-NME received Accelerated Approval from US FDA in 2021 for treatment of adults with KRAS G12C-mutated locally advanced or metastatic non-small cell lung cancer, as determined by an FDA-approved test, who have received at least one prior systemic therapy. A second covalently-acting LMW-NME targeting G12C KRAS (adagrasib, Krazat; Fig. 4A right) was approved by US FDA in 2022.

Fig. 4. Sotorasib inhibition of G12C KRAS.

Fig. 4

A Chemical structure of sotorasib. B Chemical structure of adagrasib. C Ribbon representation of the co-crystal structure of sotorasib covalently bound to the G12C KRAS (pink)/GDP complex (PDB ID 6oim [77]). Inset Zoomed in view of the sotorasib binding site, showing the covalent bond (half green/half yellow) between the drug and Cysteine 12 (yellow atomic ball-and-stick figure). (Drug and GDP atom color coding and dashed line ribbons as in Fig. 1).

Tazemetostat Inhibition of EZH2

The Enhancer of Zeste Homolog 2 (EZH2) methyltransferase is part of the Polycomb repressive complex 2 (PRC2), which includes other essential proteins such as Embryonic Ectoderm Development EED. It plays a critical role in epigenetic regulation of gene expression through methylation of Lysine 27 within the N-terminal tail of the H3 nucleosomal histone (H3K27). This PTM serves as a key signal for epigenetic gene silencing. EZH2 plays a crucial role in maintaining the balance of gene expression patterns necessary for normal cellular function. Aberrant activity of EZH2, marked by dysregulated H3K27 methylation, has been implicated in development of various cancers (reviewed [78]). In 2020, Epizyme received US FDA approval for use of tazemetostat for treatment of epithelioid sarcoma, marking a significant improvement in therapeutic options available for individuals diagnosed with this rare malignancy. The first atomic-level 3D structure of the EZH2 methyltransferase catalytic domain was released into the PDB in 2013 (ID 4mi0 [79]). Epizyme almost certainly leveraged this information during discovery and development of tazemetostat (Fig. 5A left), a first-in-class EZH2 inhibitor. Tazemetostat blocks binding of S-Adenosylmethionine (SAM, the requisite methyl donor for H3K27 modification by EZH2), reducing proliferation of malignant cells dependent on dysregulated methyltransferase catalytic activity. At the time of writing (March 2024), no public-domain structures of EZH2 with tazemetostat occupying the SAM-binding site were available from the PDB. Given public availability of atomic-level 3D structures of EZH2 seven years prior to US FDA approval of tazemetostat, we think it possible that Epizyme used SGDD during their medicinal chemistry campaign. A co-crystal structure of GSK126 (Fig. 5A right) bound to the EZH2/EED heterodimer, made public in 2018, provides detailed insights into the likely MOA of tazemetostat at the atomic level (Fig. 5B and 5B Inset, PDB ID 5wg6 [80]).

Fig. 5. Tazemetostat inhibition of EZH2.

Fig. 5

A Chemical structure of tazemetostat (left) and GSK126 (right). B Ribbon representation of the co-crystal structure of the EZH2 (pink)/EED (purple) heterodimer bound to GSK126 (PDB ID 5wg6 [80]). Inset Zoomed in view of the SAM-binding pocket of EZH2 (PDB ID 5wg6 [80]), which is also the binding site for tazemetostat. (Drug atom color coding and dashed line ribbons as in Fig. 1).

Belzutifan inhibition of HIF-2α

Hypoxia-inducible factors (HIFs) are transcription factors that regulate cellular responses to reduced oxygen availability. Among them, HIF-2α plays a significant role in various physiological processes and pathogenesis of certain cancers. It is the protein target of Merck’s belzutifan (Fig. 6A), which received US FDA approval in 2021 for treatment of adult patients with Von Hippel-Lindau (VHL) disease [81, 82], who require therapy for associated renal cell carcinoma, central nervous system hemangioblastomas, or pancreatic neuroendocrine tumors, not requiring immediate surgery. The first public-domain atomic-level 3D structure of human HIF-2α was released into the PDB in 2009 (ID 3f1n [83]) more than a decade before regulatory approval. Since then, 32 MX structures of human HIF-2α and 8 structures of the closely related murine HIF-2α have been contributed to the PDB. The mechanism by which HIF-2α, in a heterodimeric partnership with HIF-1β (also known as ARNT), modulates gene expression is closely tied to cellular oxygen levels. Under conditions of normoxia (normal oxygen levels), HIF-2α is hydroxylated by prolyl hydroxylase domain (PHD) enzymes. This PTM marks HIF-2α for recognition and ubiquitination by the VHL protein, leading to proteasomal degradation. Under conditions of hypoxia (reduced oxygen levels), down regulated PHD enzymatic activity results in HIF-2α stabilization and nuclear translocation. Within the nucleus it assembles into an obligate heterodimer with HIF-1β. The resulting complex engages Hypoxia-Responsive Elements (HREs) within DNA promoter regions of certain genes, initiating their transcriptional activation in response to reduced oxygen availability. The products of these genes mediate critical adaptive responses to hypoxia, including angiogenesis (via Vascular Endothelial Growth Factor or VEGF), erythropoiesis (via Erythropoietin), and metabolic reprogramming [84]. Both HIF-2α and HIF-1β are multi-domain proteins, each consisting of an N-terminal basic Helix-Loop-Helix (bHLH) segment (responsible for DNA binding), followed by two Per-Arnt-Sim or PAS domains (PAS-A, PAS-B). HRE recognition by the HIF-2α/HIF-1β heterodimer was revealed in 3D at the atomic level in a 3.6Å resolution co-crystal structure of the murine proteins bound to duplex DNA bearing an HRE (PDB ID 4zpk [85]), made public in 2015. The basic regions of each bHLH segment engage the major groove on opposite faces of the double helix (Fig. 6B), utilizing intermolecular interactions between amino acid sidechains and nucleotide base edges for HRE recognition. Targeting of HIF-2α within the heterodimer by belzutifan abrogates DNA binding, thereby disrupting signaling pathways that would otherwise be exploited by malignant cells to support growth under hypoxic conditions. The MOA of belzutifan was revealed in 3D at the atomic level in a 2.75Å resolution co-crystal structure of the drug bound to murine HIF-2α/HIF-1β (PDB ID 7w80 [55], Fig. 6C and 6C Inset) contributed by an academic research group in 2021. The NME binds in a groove on the surface of the PAS-B domain of HIF-2α (Fig. 6C and 6C Inset). Disruption of interdomain interactions within the HIF-2α/HIF-1β heterodimer would appear to explain why the drug-bound form of the heterodimer is no longer able to recognize HREs within promoter DNA and turn on the hypoxia transcriptional program necessary for tumor cell survival. Merck’s approach provides a novel therapeutic strategy for VHL-disease-related cancers, wherein the HIF-2α/HIF-1β heterodimer contributes significantly to survival and proliferation of malignant cells under hypoxic conditions. While we do not have definitive evidence that Merck used SGDD to discover and develop belzutifan, we think it probable, given the longstanding productivity of Merck structural biologists, that the company was in possession of a co-crystal structure comparable to PDB ID 7w80 during their medicinal chemistry campaign. At a minimum, open access to PDB IDs 3f1n, 4zpk, 7w80, etc. will facilitate SGDD efforts focused on discovery and development of second-generation NMEs targeting HIF-2α.

Fig. 6. Belzutifan binding to HIF-2α.

Fig. 6

A Chemical structure of belzutifan. B Ribbon representation of the co-crystal structure of the HIF-2α (pink)/HIF- 1β (purple) heterodimer bound to duplex DNA (green and orange strands) containing an HRE (PDB ID 4zpk [85]) viewed along the helix axis (Fig. 6B left) and rotated left by ~90o degree (Fig. 6B right). C Ribbon representation of the co-crystal structure of the heterodimer of murine HIF-2α (pink)/HIF-1β (purple) bound to belzutifan (PDB ID 7w80 [55]). Inset Belzutifan bound to HIF-2α into the PAS-B domain pocket. (Drug atom color coding and dashed line ribbons as in Fig. 1).

Conclusion

This review documents that public-domain 3D biostructure data stored in the PDB contribute broadly to oncology drug discovery/development across the biopharmaceutical industry. For the 34 LMW-NMEs approved by US FDA 2019–2023, there is evidence from the PDB, industry experts, and/or the scientific literature that discovery and development of every one of these new drugs was facilitated by open access to experimentally-determined, atomic-level 3D structures of their protein targets housed in the PDB. In >80% of cases, the LMW-NMEs were the product of biopharmaceutical company SGDD efforts, involving co-crystal structure studies and/or computational docking using experimentally-determined crystal structures, etc.

With year-on-year growth in the number of structures in the PDB approaching 10%, the impact of the resource and structure-guided approaches to drug discovery/development is destined to remain significant. Moreover, the growing number of PDB structures coming from 3DEM since the advent of the cryo-electron microscopy “Resolution Revolution” [31], promises even broader 3D structural coverage of the human proteome. Every year at the RCSB Protein Data Bank, we witness deposition of exciting new 3D structures of integral membrane proteins and other macromolecular machines, many of which are sub-optimally targeted with relatively non-specific agents or have been held to be undruggable [86].

The long-standing requirement for PDB deposition of 3D atomic coordinates and experimental data and metadata upon journal publication ensures that this valuable information is made immediately available to basic and applied researchers around the world without limitations on usage. Moreover, expert biocuration and rigorous validation of experimental data and atomic coordinates across the PDB help to ensure that the archive as a whole can be mined for new knowledge using statistical tools [87, 88] or machine learning approaches [89].

As custodian of the PDB Archive, the wwPDB partnership is committed to the FAIR Principles [10], which help ensure the broadest possible use of public domain biomedical research data. The PDB has been recognized as a Core Certified Repository by CoreTrustSeal (coretrustseal.org). In 2022, the PDB was further recognized by the Global Biodata Coalition (https://globalbiodata.org) as a Global Core Biodata Resource, of “fundamental importance to the wider biological and life sciences community and the long-term preservation of biological data.” These two international, community-based, non-governmental, non-profit organizations promote investment in sustainable, trustworthy data infrastructure. The PDB is universally regarded as a gold-standard exemplar and a vanguard in the open access data movement in the biological and biomedical sciences.

Acknowledgements

We thank the more than 60,000 structural biologists who have deposited structures to the PDB Core Archive since 2000. We also thank Drs. Sebastian Bittrich, Jose Duarte, David S. Goodsell, Joan Segura, and Christine Zardecki for their insightful comments. We gratefully acknowledge contributions from all members of the Research Collaboratory for Structural Bioinformatics Protein Data Bank past and present and our Worldwide Protein Data Bank partners. RCSB PDB core operations are jointly funded by the National Science Foundation (DBI-1832184), the US Department of Energy (DE-SC0019749), and the National Cancer Institute, National Institute of Allergy and Infectious Diseases, and National Institute of General Medical Sciences of the National Institutes of Health under grant R01GM133198.

Author contributions

S.K.B. and A.W.W. assembled and analyzed the data. All authors contributed to drafting and editing of the manuscript.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Blundell TL. Protein crystallography and drug discovery: recollections of knowledge exchange between academia and industry. IUCrJ. 2017;4:308–21. doi: 10.1107/S2052252517009241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Klebe G Drug design: methodology, concepts, and mode-of-action: Berlin: Springer; 2013. xv, 901 pages p.
  • 3.Burley SK, Berman HM, Christie C, Duarte JM, Feng Z, Westbrook J, et al. RCSB Protein Data Bank: Sustaining a living digital data resource that enables breakthroughs in scientific research and biomedical education. Protein Sci. 2018;27:316–30. doi: 10.1002/pro.3331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Brown KK, Hann MM, Lakdawala AS, Santos R, Thomas PJ, Todd K. Approaches to target tractability assessment - a practical perspective. Medchemcomm. 2018;9:606–13. doi: 10.1039/C7MD00633K. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Stoll F, Goller AH, Hillisch A. Utility of protein structures in overcoming ADMET-related issues of drug-like compounds. Drug Discov Today. 2011;16:530–8. doi: 10.1016/j.drudis.2011.04.008. [DOI] [PubMed] [Google Scholar]
  • 6.Harner MJ, Frank AO, Fesik SW. Fragment-based drug discovery using NMR spectroscopy. J Biomol NMR. 2013;56:65–75. doi: 10.1007/s10858-013-9740-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hartshorn MJ, Murray CW, Cleasby A, Frederickson M, Tickle IJ, Jhoti H. Fragment-based lead discovery using X-ray crystallography. J Med Chem. 2005;48:403–13. doi: 10.1021/jm0495778. [DOI] [PubMed] [Google Scholar]
  • 8.Saur M, Hartshorn MJ, Dong J, Reeks J, Bunkoczi G, Jhoti H, et al. Fragment-based drug discovery using cryo-EM. Drug Discov Today. 2020;25:485–90. doi: 10.1016/j.drudis.2019.12.006. [DOI] [PubMed] [Google Scholar]
  • 9.Protein Data Bank. Crystallography: Protein Data Bank. Nature (London) N. Biol. 1971;233:223-. [Google Scholar]
  • 10.Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016;3:1–9. doi: 10.1038/sdata.2016.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Berman HM, Henrick K, Nakamura H. Announcing the worldwide Protein Data Bank. Nat Struct Biol. 2003;10:980. doi: 10.1038/nsb1203-980. [DOI] [PubMed] [Google Scholar]
  • 12.wwPDB consortium Protein Data Bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Res. 2019;47:D520–D8. doi: 10.1093/nar/gky949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, et al. The Protein Data Bank. Nucleic Acids Res. 2000;28:235–42. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Burley SK, Bhikadiya C, Bi C, Bittrich S, Chao H, Chen L, et al. RCSB Protein Data Bank (RCSB.org): Delivery of Experimentally-Determined PDB Structures Alongside One Million Computed Structure Models of Proteins from Artificial Intelligence/Machine Learning. Nucleic Acids Res. 2023;51:D488–D508. doi: 10.1093/nar/gkac1077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Burley SK, Bhikadiya C, Bi C, Bittrich S, Chao H, Chen L, et al. RCSB protein data bank: tools for visualizing and understanding biological macromolecules in 3D. Protein Sci. 2022;31:e4482. doi: 10.1002/pro.4482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Mir S, Alhroub Y, Anyango S, Armstrong DR, Berrisford JM, Clark AR, et al. PDBe: towards reusable data delivery infrastructure at protein data bank in Europe. Nucleic Acids Res. 2018;46:D486–D92. doi: 10.1093/nar/gkx1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kinjo AR, Bekker GJ, Suzuki H, Tsuchiya Y, Kawabata T, Ikegawa Y, et al. Protein Data Bank Japan (PDBj): updated user interfaces, resource description framework, analysis tools for large structures. Nucleic Acids Res. 2017;45:D282–D8. doi: 10.1093/nar/gkw962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Xu W, Velankar S, Patwardhan A, Hoch JC, Burley SK, Kurisu G. Announcing the launch of Protein Data Bank China as an Associate Member of the Worldwide Protein Data Bank Partnership. Acta Crystallogr Sect D Struct Biol. 2023;79:792–5. doi: 10.1107/S2059798323006381. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Hoch JC, Baskaran K, Burr H, Chin J, Eghbalnia HR, Fujiwara T, et al. Biological magnetic resonance data bank. Nucleic Acids Res. 2023;51:D368–D76. doi: 10.1093/nar/gkac1050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.wwPDB Consortium. EMDB-the electron microscopy data bank. Nucleic Acids Res. 2023;52:D456–D65. doi: 10.1093/nar/gkad1019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Westbrook JD, Burley SK. How structural biologists and the protein data bank contributed to recent FDA new drug approvals. Structure. 2019;27:211–7. doi: 10.1016/j.str.2018.11.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Westbrook JD, Soskind R, Hudson BP, Burley SK. Impact of protein data bank on antineoplastic approvals. Drug Discov Today. 2020;25:837–50. doi: 10.1016/j.drudis.2020.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–9. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Baek M, Anishchenko I, Humphreys I, Cong Q, Baker D, DiMaio F. Efficient and accurate prediction of protein structure using RoseTTAFold2. bioRxiv; 2023; 10.1101/2023.05.24.542179.
  • 25.Burley SK, Berman HM. Open-access data: A cornerstone for artificial intelligence approaches to protein structure prediction. Structure. 2021;29:515–20. doi: 10.1016/j.str.2021.04.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Krishna R, Wang J, Ahern W, Sturmfels P, Venkatesh P, Kalvet I, et al. Generalized biomolecular modeling and design with RoseTTAFold All-Atom. Science. 2024;384:eadl2528. doi: 10.1126/science.adl2528. [DOI] [PubMed] [Google Scholar]
  • 27.Abramson J, Adler J, Dunger J, Evans R, Green T, Pritzel A, et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature. 2024;630:493–500. [DOI] [PMC free article] [PubMed]
  • 28.Agoni C, Olotu FA, Ramharack P, Soliman ME. Druggability and drug-likeness concepts in drug design: are biomodelling and predictive tools having their say? J Mol Model. 2020;26:120. doi: 10.1007/s00894-020-04385-6. [DOI] [PubMed] [Google Scholar]
  • 29.Xu X, Zhao J, Xu Z, Peng B, Huang Q, Arnold E, et al. Structures of human cytosolic NADP-dependent isocitrate dehydrogenase reveal a novel self-regulatory mechanism of activity. J Biol Chem. 2004;279:33946–57. doi: 10.1074/jbc.M404298200. [DOI] [PubMed] [Google Scholar]
  • 30.Lin JH. Review structure- and dynamics-based computational design of anticancer drugs. Biopolymers. 2016;105:2–9. doi: 10.1002/bip.22744. [DOI] [PubMed] [Google Scholar]
  • 31.Kuhlbrandt W. Biochemistry. The resolution revolution. Science. 2014;343:1443–4. doi: 10.1126/science.1251652. [DOI] [PubMed] [Google Scholar]
  • 32.Lo YC, Rensi SE, Torng W, Altman RB. Machine learning in chemoinformatics and drug discovery. Drug Discov Today. 2018;23:1538–46. doi: 10.1016/j.drudis.2018.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Patani H, Bunney TD, Thiyagarajan N, Norman RA, Ogg D, Breed J, et al. Landscape of activating cancer mutations in FGFR kinases and their differential responses to inhibitors in clinical use. Oncotarget. 2016;7:24252–68. doi: 10.18632/oncotarget.8132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Furet P, Guagnano V, Fairhurst RA, Imbach-Weese P, Bruce I, Knapp M, et al. Discovery of NVP-BYL719 a potent and selective phosphatidylinositol-3 kinase alpha inhibitor selected for clinical evaluation. Bioorg Med Chem Lett. 2013;23:3741–8. doi: 10.1016/j.bmcl.2013.05.007. [DOI] [PubMed] [Google Scholar]
  • 35.Tap WD, Wainberg ZA, Anthony SP, Ibrahim PN, Zhang C, Healey JH, et al. Structure-guided blockade of CSF1R kinase in tenosynovial giant-cell tumor. N Engl J Med. 2015;373:428–37. doi: 10.1056/NEJMoa1411366. [DOI] [PubMed] [Google Scholar]
  • 36.Menichincheri M, Ardini E, Magnaghi P, Avanzi N, Banfi P, Bossi R, et al. Discovery of entrectinib: a new 3-aminoindazole as a potent anaplastic lymphoma kinase (ALK), c-ros Oncogene 1 Kinase (ROS1), and Pan-Tropomyosin Receptor Kinases (Pan-TRKs) inhibitor. J Med Chem. 2016;59:3392–408. doi: 10.1021/acs.jmedchem.6b00064. [DOI] [PubMed] [Google Scholar]
  • 37.Guo Y, Liu Y, Hu N, Yu D, Zhou C, Shi G, et al. Discovery of Zanubrutinib (BGB-3111), a Novel, Potent, and Selective Covalent Inhibitor of Bruton’s Tyrosine Kinase. J Med Chem. 2019;62:7923–40. doi: 10.1021/acs.jmedchem.9b00687. [DOI] [PubMed] [Google Scholar]
  • 38.Smith BD, Kaufman MD, Lu WP, Gupta A, Leary CB, Wise SC, et al. Ripretinib (DCC-2618) is a switch control kinase inhibitor of a broad spectrum of oncogenic and drug-resistant KIT and PDGFRA variants. Cancer Cell. 2019;35:738–51.e9. doi: 10.1016/j.ccell.2019.04.006. [DOI] [PubMed] [Google Scholar]
  • 39.Dorsch D, Schadt O, Stieber F, Meyring M, Gradler U, Bladt F, et al. Identification and optimization of pyridazinones as potent and selective c-Met kinase inhibitors. Bioorg Med Chem Lett. 2015;25:1597–602. doi: 10.1016/j.bmcl.2015.02.002. [DOI] [PubMed] [Google Scholar]
  • 40.Guagnano V, Furet P, Spanka C, Bordas V, Le Douget M, Stamm C. et al. Discovery of 3-(2,6-dichloro-3,5-dimethoxy-phenyl)-1-6-[4-(4-ethyl-piperazin-1-yl)-phenylamino]-pyrimidin-4-yl-1-methyl-urea(NVP-BGJ398), a potent and selective inhibitor of the fibroblast growth factor receptor family of receptor tyrosine kinase. J Med Chem. 2011;54:7066–83. [DOI] [PubMed]
  • 41.Wylie AA, Schoepfer J, Jahnke W, Cowan-Jacob SW, Loo A, Furet P, et al. The allosteric inhibitor ABL001 enables dual targeting of BCR-ABL1. Nature. 2017;543:733–7. doi: 10.1038/nature21702. [DOI] [PubMed] [Google Scholar]
  • 42.Kalyukina M, Yosaatmadja Y, Middleditch MJ, Patterson AV, Smaill JB, Squire CJ. TAS-120 cancer target binding: defining reactivity and revealing the first fibroblast growth factor receptor 1 (FGFR1) irreversible structure. ChemMedChem. 2019;14:494–500. doi: 10.1002/cmdc.201800719. [DOI] [PubMed] [Google Scholar]
  • 43.Gomez EB, Ebata K, Randeria HS, Rosendahl MS, Cedervall EP, Morales TH, et al. Preclinical characterization of pirtobrutinib, a highly selective, noncovalent (reversible) BTK inhibitor. Blood. 2023;142:62–72. doi: 10.1182/blood.2022018674. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Murray BW, Rogers E, Zhai D, Deng W, Chen X, Sprengeler PA, et al. Molecular characteristics of repotrectinib that enable potent inhibition of TRK fusion proteins and resistant mutations. Mol Cancer Ther. 2021;20:2446–56. doi: 10.1158/1535-7163.MCT-21-0632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Addie M, Ballard P, Buttar D, Crafter C, Currie G, Davies BR, et al. Discovery of 4-amino-N-[(1S)-1-(4-chlorophenyl)-3-hydroxypropyl]-1-(7H-pyrrolo[2,3-d]pyrimidin-4-yl)piperidine-4-carboxamide (AZD5363), an orally bioavailable, potent inhibitor of Akt kinases. J Med Chem. 2013;56:2059–73. doi: 10.1021/jm301762v. [DOI] [PubMed] [Google Scholar]
  • 46.Teuber A, Schulz T, Fletcher BS, Gontla R, Muhlenberg T, Zischinsky ML, et al. Avapritinib-based SAR studies unveil a binding pocket in KIT and PDGFRA. Nat Commun. 2024;15:63. doi: 10.1038/s41467-023-44376-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Lin Q, Chen X, Qu L, Guo M, Wei H, Dai S, et al. Characterization of the cholangiocarcinoma drug pemigatinib against FGFR gatekeeper mutants. Commun Chem. 2022;5:100. doi: 10.1038/s42004-022-00718-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Subbiah V, Shen T, Terzyan SS, Liu X, Hu X, Patel KP, et al. Structural basis of acquired resistance to selpercatinib and pralsetinib mediated by non-gatekeeper RET mutations. Ann Oncol. 2021;32:261–8. doi: 10.1016/j.annonc.2020.10.599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.McTigue M, Murray BW, Chen JH, Deng YL, Solowiej J, Kania RS. Molecular conformations, interactions, and properties associated with drug efficiency and clinical performance among VEGFR TK inhibitors. Proc Natl Acad Sci USA. 2012;109:18281–9. doi: 10.1073/pnas.1207759109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Zorn JA, Wang Q, Fujimura E, Barros T, Kuriyan J. Crystal structure of the FLT3 kinase domain bound to the inhibitor Quizartinib (AC220) PLoS One. 2015;10:e0121177. doi: 10.1371/journal.pone.0121177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Manley PW, Stiefl N, Cowan-Jacob SW, Kaufman S, Mestan J, Wartmann M, et al. Structural resemblances and comparisons of the relative pharmacological properties of imatinib and nilotinib. Bioorg Med Chem. 2010;18:6977–86. doi: 10.1016/j.bmc.2010.08.026. [DOI] [PubMed] [Google Scholar]
  • 52.Walker JS, Hing ZA, Harrington B, Baumhardt J, Ozer HG, Lehman A, et al. Recurrent XPO1 mutations alter pathogenesis of chronic lymphocytic leukemia. J Hematol Oncol. 2021;14:17. doi: 10.1186/s13045-021-01032-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Janes MR, Zhang J, Li LS, Hansen R, Peters U, Guo X, et al. Targeting KRAS mutant cancers with a covalent G12C-specific inhibitor. Cell. 2018;172:578–89.e17. doi: 10.1016/j.cell.2018.01.006. [DOI] [PubMed] [Google Scholar]
  • 54.Caravella JA, Lin J, Diebold RB, Campbell AM, Ericsson A, Gustafson G, et al. Structure-based design and identification of FT-2102 (olutasidenib), a potent mutant-selective IDH1 inhibitor. J Med Chem. 2020;63:1612–23. doi: 10.1021/acs.jmedchem.9b01423. [DOI] [PubMed] [Google Scholar]
  • 55.Ren X, Diao X, Zhuang J, Wu D Structural basis for the allosteric inhibition of hypoxia-inducible factor (HIF)-2 by belzutifan. Mol Pharmacol. 2022;102:240–7. [DOI] [PubMed]
  • 56.Hancock GR, Young KS, Hosfield DJ, Joiner C, Sullivan EA, Yildiz Y, et al. Unconventional isoquinoline-based SERMs elicit fulvestrant-like transcriptional programs in ER+ breast cancer cells. NPJ Breast Cancer. 2022;8:130. doi: 10.1038/s41523-022-00497-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Yan W, Cheng L, Wang W, Wu C, Yang X, Du X, et al. Structure of the human gonadotropin-releasing hormone receptor GnRH1R reveals an unusual ligand binding mode. Nat Commun. 2020;11:5287. doi: 10.1038/s41467-020-19109-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Dong X, Biswas A, Suel KE, Jackson LK, Martinez R, Gu H, et al. Structural basis for leucine-rich nuclear export signal recognition by CRM1. Nature. 2009;458:1136–41. doi: 10.1038/nature07975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Fung HY, Fu SC, Brautigam CA, Chook YM Structural determinants of nuclear export signal orientation in binding to exportin CRM1. eLife. 2015;4:e10034. [DOI] [PMC free article] [PubMed]
  • 60.Monecke T, Guttler T, Neumann P, Dickmanns A, Gorlich D, Ficner R. Crystal structure of the nuclear export receptor CRM1 in complex with Snurportin1 and RanGTP. Science. 2009;324:1087–91. doi: 10.1126/science.1173388. [DOI] [PubMed] [Google Scholar]
  • 61.Lapalombella R, Sun Q, Williams K, Tangeman L, Jha S, Zhong Y, et al. Selective inhibitors of nuclear export show that CRM1/XPO1 is a target in chronic lymphocytic leukemia. Blood. 2012;120:4621–34. doi: 10.1182/blood-2012-05-429506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Etchin J, Sun Q, Kentsis A, Farmer A, Zhang ZC, Sanda T, et al. Antileukemic activity of nuclear export inhibitors that spare normal hematopoietic cells. Leukemia. 2013;27:66–74. doi: 10.1038/leu.2012.219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Haines JD, Herbin O, de la Hera B, Vidaurre OG, Moy GA, Sun Q, et al. Nuclear export inhibitors avert progression in preclinical models of inflammatory demyelination. Nat Neurosci. 2015;18:511–20. doi: 10.1038/nn.3953. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Sun Q, Carrasco YP, Hu Y, Guo X, Mirzaei H, Macmillan J, et al. Nuclear export inhibition through covalent conjugation and hydrolysis of Leptomycin B by CRM1. Proc Natl Acad Sci USA. 2013;110:1303–8. doi: 10.1073/pnas.1217203110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Hing ZA, Fung HY, Ranganathan P, Mitchell S, El-Gamal D, Woyach JA, et al. Next-generation XPO1 inhibitor shows improved efficacy and in vivo tolerability in hematological malignancies. Leukemia. 2016;30:2364–72. doi: 10.1038/leu.2016.136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Schubert C, Schalk-Hihi C, Struble GT, Ma HC, Petrounia IP, Brandt B, et al. Crystal structure of the tyrosine kinase domain of colony-stimulating factor-1 receptor (cFMS) in complex with two inhibitors. J Biol Chem. 2007;282:4094–101. doi: 10.1074/jbc.M608183200. [DOI] [PubMed] [Google Scholar]
  • 67.Bollag G, Tsai J, Zhang J, Zhang C, Ibrahim P, Nolop K, et al. Vemurafenib: the first drug approved for BRAF-mutant cancer. Nat Rev Drug Discov. 2012;11:873–86. doi: 10.1038/nrd3847. [DOI] [PubMed] [Google Scholar]
  • 68.Amoretti M, Amsler C, Bonomi G, Bouchta A, Bowe P, Carraro C, et al. Production and detection of cold antihydrogen atoms. Nature. 2002;419:456–9. doi: 10.1038/nature01096. [DOI] [PubMed] [Google Scholar]
  • 69.Schindler T, Bornmann W, Pellicena P, Miller WT, Clarkson B, Kuriyan J. Structural mechanism for STI-571 inhibition of abelson tyrosine kinase. Science. 2000;289:1938–42. doi: 10.1126/science.289.5486.1938. [DOI] [PubMed] [Google Scholar]
  • 70.Nagar B, Bornmann WG, Pellicena P, Schindler T, Veach DR, Miller WT, et al. Crystal structures of the kinase domain of c-Abl in complex with the small molecule inhibitors PD173955 and imatinib (STI-571) Cancer Res. 2002;62:4236–43. [PubMed] [Google Scholar]
  • 71.Qiang W, Antelope O, Zabriskie MS, Pomicter AD, Vellore NA, Szankasi P, et al. Mechanisms of resistance to the BCR-ABL1 allosteric inhibitor asciminib. Leukemia. 2017;31:2844–7. doi: 10.1038/leu.2017.264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Nagar B, Hantschel O, Young MA, Scheffzek K, Veach D, Bornmann W, et al. Structural basis for the autoinhibition of c-Abl tyrosine kinase. Cell. 2003;112:859–71. doi: 10.1016/S0092-8674(03)00194-6. [DOI] [PubMed] [Google Scholar]
  • 73.Parikh K, Banna G, Liu SV, Friedlaender A, Desai A, Subbiah V, et al. Drugging KRAS: current perspectives and state-of-art review. J Hematol Oncol. 2022;15:152. doi: 10.1186/s13045-022-01375-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Huang L, Guo Z, Wang F, Fu L. KRAS mutation: from undruggable to druggable in cancer. Signal Transduct Target Ther. 2021;6:386. doi: 10.1038/s41392-021-00780-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Pai EF, Krengel U, Petsko GA, Goody RS, Kabsch W, Wittinghofer A. Refined crystal structure of the triphosphate conformation of H-ras p21 at 1.35 A resolution: implications for the mechanism of GTP hydrolysis. EMBO J. 1990;9:2351–9. doi: 10.1002/j.1460-2075.1990.tb07409.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Lanman BA, Allen JR, Allen JG, Amegadzie AK, Ashton KS, Booker SK, et al. Discovery of a Covalent Inhibitor of KRAS(G12C) (AMG 510) for the Treatment of Solid Tumors. J Med Chem. 2020;63:52–65. doi: 10.1021/acs.jmedchem.9b01180. [DOI] [PubMed] [Google Scholar]
  • 77.Canon J, Rex K, Saiki AY, Mohr C, Cooke K, Bagal D, et al. The clinical KRAS(G12C) inhibitor AMG 510 drives anti-tumour immunity. Nature. 2019;575:217–23. doi: 10.1038/s41586-019-1694-1. [DOI] [PubMed] [Google Scholar]
  • 78.Kim KH, Roberts CW. Targeting EZH2 in cancer. Nat Med. 2016;22:128–34. doi: 10.1038/nm.4036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Wu H, Zeng H, Dong A, Li F, He H, Senisterra G, et al. Structure of the catalytic domain of EZH2 reveals conformational plasticity in cofactor and substrate binding sites and explains oncogenic mutations. PLoS One. 2013;8:e83737. doi: 10.1371/journal.pone.0083737. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Bratkowski M, Yang X, Liu X. An evolutionarily conserved structural platform for PRC2 inhibition by a class of Ezh2 inhibitors. Sci Rep. 2018;8:9092. doi: 10.1038/s41598-018-27175-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Steinberger KJ, Eubank TD. The underexplored landscape of hypoxia-inducible factor 2 alpha and potential roles in tumor macrophages: a review. Oxyg (Basel) 2023;3:45–76. doi: 10.3390/oxygen3010005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Lee JW, Ko J, Ju C, Eltzschig HK. Hypoxia signaling in human diseases and therapeutic targets. Exp Mol Med. 2019;51:1–13. doi: 10.1038/s12276-019-0299-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Scheuermann TH, Tomchick DR, Machius M, Guo Y, Bruick RK, Gardner KH. Artificial ligand binding within the HIF2alpha PAS-B domain of the HIF2 transcription factor. Proc Natl Acad Sci USA. 2009;106:450–5. doi: 10.1073/pnas.0808092106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Wu D, Su X, Lu J, Li S, Hood BL, Vasile S, et al. Bidirectional modulation of HIF-2 activity through chemical ligands. Nat Chem Biol. 2019;15:367–76. doi: 10.1038/s41589-019-0234-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Wu D, Potluri N, Lu J, Kim Y, Rastinejad F. Structural integration in hypoxia-inducible factors. Nature. 2015;524:303–8. doi: 10.1038/nature14883. [DOI] [PubMed] [Google Scholar]
  • 86.Goodsell DS, Zardecki C, Di Costanzo L, Duarte JM, Hudson BP, Persikova I, et al. RCSB Protein Data Bank: Enabling biomedical research and drug discovery. Protein Sci. 2020;29:52–65. doi: 10.1002/pro.3730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Shao C, Liu Z, Yang H, Wang S, Burley SK. Outlier analyses of the Protein Data Bank archive using a probability-density-ranking approach. Sci Data. 2018;5:180293. doi: 10.1038/sdata.2018.293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Shao C, Yang H, Westbrook JD, Young JY, Zardecki C, Burley SK. Multivariate analyses of quality metrics for crystal structures in the protein data bank archive. Structure. 2017;25:458–68. doi: 10.1016/j.str.2017.01.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Elbadawi M, Gaisford S, Basit AW. Advanced machine-learning techniques in drug discovery. Drug Discov Today. 2021;26:769–77. doi: 10.1016/j.drudis.2020.12.003. [DOI] [PubMed] [Google Scholar]
  • 90.Mohammadi M, Schlessinger J, Hubbard SR. Structure of the FGF receptor tyrosine kinase domain reveals a novel autoinhibitory mechanism. Cell. 1996;86:577–87. doi: 10.1016/S0092-8674(00)80131-2. [DOI] [PubMed] [Google Scholar]
  • 91.Huang CH, Mandelker D, Schmidt-Kittler O, Samuels Y, Velculescu VE, Kinzler KW, et al. The structure of a human p110alpha/p85alpha complex elucidates the effects of oncogenic PI3Kalpha mutations. Science. 2007;318:1744–8. doi: 10.1126/science.1150799. [DOI] [PubMed] [Google Scholar]
  • 92.Bossi RT, Saccardo MB, Ardini E, Menichincheri M, Rusconi L, Magnaghi P, et al. Crystal structures of anaplastic lymphoma kinase in complex with ATP competitive inhibitors. Biochemistry. 2010;49:6813–25. doi: 10.1021/bi1005514. [DOI] [PubMed] [Google Scholar]
  • 93.Mao C, Zhou M, Uckun FM. Crystal structure of Bruton’s tyrosine kinase domain suggests a novel pathway for activation and provides insights into the molecular basis of X-linked agammaglobulinemia. J Biol Chem. 2001;276:41435–43. doi: 10.1074/jbc.M104828200. [DOI] [PubMed] [Google Scholar]
  • 94.Forouhar F, Yang Y, Kumar D, Chen Y, Fridman E, Park SW, et al. Structural and biochemical studies identify tobacco SABP2 as a methyl salicylate esterase and implicate it in plant innate immunity. Proc Natl Acad Sci USA. 2005;102:1773–8. doi: 10.1073/pnas.0409227102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Aertgeerts K, Skene R, Yano J, Sang BC, Zou H, Snell G, et al. Structural analysis of the mechanism of inhibition and allosteric activation of the kinase domain of HER2 protein. J Biol Chem. 2011;286:18756–65. doi: 10.1074/jbc.M110.206193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Schiering N, Knapp S, Marconi M, Flocco MM, Cui J, Perego R, et al. Crystal structure of the tyrosine kinase domain of the hepatocyte growth factor receptor c-Met and its complex with the microbial alkaloid K-252a. Proc Natl Acad Sci USA. 2003;100:12654–9. doi: 10.1073/pnas.1734128100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Knowles PP, Murray-Rust J, Kjaer S, Scott RP, Hanrahan S, Santoro M, et al. Structure and chemical inhibition of the RET tyrosine kinase domain. J Biol Chem. 2006;281:33577–87. doi: 10.1074/jbc.M605604200. [DOI] [PubMed] [Google Scholar]
  • 98.Mol CD, Lim KB, Sridhar V, Zou H, Chien EY, Sang BC, et al. Structure of a c-kit product complex reveals the basis for kinase transactivation. J Biol Chem. 2003;278:31461–4. doi: 10.1074/jbc.C300186200. [DOI] [PubMed] [Google Scholar]
  • 99.Day PJ, Cleasby A, Tickle IJ, O’Reilly M, Coyle JE, Holding FP, et al. Crystal structure of human CDK4 in complex with a D-type cyclin. Proc Natl Acad Sci USA. 2009;106:4166–70. doi: 10.1073/pnas.0809645106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Russo AA, Tong L, Lee JO, Jeffrey PD, Pavletich NP. Structural basis for inhibition of the cyclin-dependent kinase Cdk6 by the tumour suppressor p16INK4a. Nature. 1998;395:237–43. doi: 10.1038/26155. [DOI] [PubMed] [Google Scholar]
  • 101.McTigue MA, Wickersham JA, Pinko C, Showalter RE, Parast CV, Tempczyk-Russell A, et al. Crystal structure of the kinase domain of human vascular endothelial growth factor receptor 2: a key enzyme in angiogenesis. Structure. 1999;7:319–30. doi: 10.1016/S0969-2126(99)80042-2. [DOI] [PubMed] [Google Scholar]
  • 102.Griffith J, Black J, Faerman C, Swenson L, Wynn M, Lu F, et al. The structural basis for autoinhibition of FLT3 by the juxtamembrane domain. Mol Cell. 2004;13:169–78. doi: 10.1016/S1097-2765(03)00505-7. [DOI] [PubMed] [Google Scholar]
  • 103.Awad MM, Katayama R, McTigue M, Liu W, Deng YL, Brooun A, et al. Acquired resistance to crizotinib from a mutation in CD74-ROS1. N. Engl J Med. 2013;368:2395–401. doi: 10.1056/NEJMoa1215530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Williams NK, Bamert RS, Patel O, Wang C, Walden PM, Wilks AF, et al. Dissecting specificity in the Janus kinases: the structures of JAK-specific inhibitors complexed to the JAK1 and JAK2 protein tyrosine kinase domains. J Mol Biol. 2009;387:219–32. doi: 10.1016/j.jmb.2009.01.041. [DOI] [PubMed] [Google Scholar]
  • 105.Yang J, Cron P, Good VM, Thompson V, Hemmings BA, Barford D. Crystal structure of an activated Akt/protein kinase B ternary complex with GSK3-peptide and AMP-PNP. Nat Struct Biol. 2002;9:940–4. doi: 10.1038/nsb870. [DOI] [PubMed] [Google Scholar]
  • 106.Bai XC, Yan C, Yang G, Lu P, Ma D, Sun L, et al. An atomic structure of human gamma-secretase. Nature. 2015;525:212–7. doi: 10.1038/nature14892. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Matias PM, Donner P, Coelho R, Thomaz M, Peixoto C, Macedo S, et al. Structural evidence for ligand specificity in the binding domain of the human androgen receptor. Implications for pathogenic gene mutations. J Biol Chem. 2000;275:26164–71. doi: 10.1074/jbc.M004571200. [DOI] [PubMed] [Google Scholar]
  • 108.Tanenbaum DM, Wang Y, Williams SP, Sigler PB. Crystallographic comparison of the estrogen and progesterone receptor’s ligand binding domains. Proc Natl Acad Sci USA. 1998;95:5998–6003. doi: 10.1073/pnas.95.11.5998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Long SB, Casey PJ, Beese LS. The basis for K-Ras4B binding specificity to protein farnesyltransferase revealed by 2 A resolution ternary complex structures. Structure. 2000;8:209–22. doi: 10.1016/S0969-2126(00)00096-4. [DOI] [PubMed] [Google Scholar]
  • 110.Sehnal D, Bittrich S, Deshpande M, Svobodova R, Berka K, Bazgier V, et al. Mol* Viewer: modern web app for 3D visualization and analysis of large biomolecular structures. Nucleic Acids Res. 2021;49:W431–W7. doi: 10.1093/nar/gkab314. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Oncogene are provided here courtesy of Nature Publishing Group

RESOURCES