Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2021 May 18;16(16):2524–2534. doi: 10.1002/cmdc.202100246

Ligand Accessibility Insights to the Dengue Virus NS3‐NS2B Protease Assessed by Long‐Timescale Molecular Dynamics Simulations

Thales Kronenberger 1,2,+,, Mateus Sá Magalhães Serafim 3,+, Arun Kumar Tonduru 2, Vinícius Gonçalves Maltarollo 4, Antti Poso 1,2
PMCID: PMC8453957  PMID: 33899341

Abstract

Dengue is a tropical disease caused by the dengue virus (DENV), with an estimate of 300 million new cases every year. Due to the limited vaccine efficiency and absence of effective antiviral treatment, new drug candidates are urgently needed. DENV NS3‐NS2B protease complex is essential for viral post‐translational processing and maturation, and this enzyme has been extensively studied as a relevant drug target. Crystal structures often underestimate NS3‐NS2B flexibility, whereas they can adopt different conformational states depending on the bound substrate. We conducted molecular dynamics simulations (∼30 μs) with a non‐ and covalently bound inhibitor to understand the conformational changes in the DENV‐3 NS3‐NS2B complex. Our results show that the open‐closing movement of the protease exposes multiple druggable subpockets that can be investigated in later drug discovery efforts.

Keywords: Dengue virus, Molecular dynamics simulation, NS3-NS2B protease


The open/closing movement of the NS3‐NS2B protease exposes multiple druggable subpockets that can be investigated in later drug discovery efforts. Considering the conformational shift is key for further drug discovery, since induced‐fit docking would be only relevant for fine tuning.

graphic file with name CMDC-16-2524-g003.jpg

Introduction

Dengue virus (DENV), a member of the Flavivirus genus and Flaviviridae family, is an arthropod‐borne pathogen responsible for a large disease burden worldwide. Estimates indicate that over 3 billion people are at risk of DENV infections worldwide with approximately 284–528 million new infections annually, 500,000 severe cases, and 20,000 deaths per year.[1, 2, 3] These DENV infections primarily occur in tropical and subtropical climates, where the transmitting vector, the Aedes aegypti mosquito, is prevalent. DENV infections may develop into a haemorrhagic fever in severe cases of the disease. As there are no efficient vaccines or antiviral treatments available for this disease, new treatment strategies and identification of potential drug candidates is urgently needed.[2, 4]

The DENV genome is composed of a positive‐sense single‐stranded RNA that is directly translated into a single polyprotein, in which its complex topology becomes embedded into endoplasmic reticulum (ER) membranes.[5, 6] The polyprotein cleavage occurs co‐ and post‐translationally resulting in three structural proteins (capsid protein C, prM and protein E) and seven non‐structural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B, and NS5).[7]

Non‐structural proteins are mostly multifunctional, and for example, NS3 contains both RNA helicase and RNA annealing activities. Structurally, NS3 is composed of a protease (NS3PRO), a helicase (NS3HEL) as well as an RNA triphosphatase domains, with the protease being responsible for cleavage of the polyprotein, together with cellular peptidases. The protease and helicase domains interact with the viral genome, potentially bridging the replication and capsid assembly steps for nucleocapsid formation.[8, 9, 10] It is also well established that uncoupling the sequential order of these two proteolytic domains reduces the nucleocapsid incorporation into the virus particle. This indicates that polyprotein processing does regulate the viral assembly.[11, 12, 13]

The NS3PRO domain, herein referred to only as NS3 for short, belongs to the S7 family of serine proteases, with a conserved catalytic triad; His51, Asp75 and Ser135.[14] In terms of mechanism, the reactive Ser135 triggers the proteolytic activity by generating a tetrahedral intermediate with His51, which is stabilized by the interaction with Asp75. This intermediate conformation then attacks the electrophilic atoms of the substrate, followed by the cleavage and release of substrate fragments.[15]

It is worth mentioning that DENV's NS3, similarly to other flaviviruses proteases, requires a peptide cofactor (NS2B) to be fully active. Structural studies have revealed that NS2B participates in the catalytically active conformation of the DENV protease, as a β‐hairpin region wraps around the NS3PRO core[14] (Figure 1A illustrates this complex organization with the NS3 in grey and NS2B in cyan).

Figure 1.

Figure 1

DENV‐3 crystal structure illustrating the closed active conformation (PDB 3U1I) (A) and the open conformation (PDB 3U1J) (B), where the C‐terminal of the NS2B cofactor becomes flexible and decouples from the active site. NS3 is represented as a grey cartoon, NS2B is represented as a cyan cartoon, the active site residues are highlighted as sticks and the covalently bound NDL inhibitor is depicted in red/blue. C) Inset of the active site bound to the covalent inhibitor NDL and its main interactions (PDB 3U1I). D) Example of peptide inhibitor NDL. From left to right, the following residues are represented Bz‐Nle‐Lys‐Arg‐Arg‐H and numbered P1–P5, accordingly. In this representation residues displaying hydrogen bond interactions in the original NS3‐NS2B‐NDL co‐crystallized structure (PDB 3U1I) are depicted as spheres and the hydrogen bonds with the main‐chain amino acids are displayed as cyan lines, while interactions with the side chains are represented as dashed cyan lines. Ser135’s covalent bond in the P1 moiety is represented as a dark line.

The NS3‐NS2B complex prefers substrates with basic residues (Arg or Lys) at the P1 and P2 core sites, followed by amino acids with short side chains (Gly, Ser, or Ala) at the P1′ site[16] (Figure 1). This preference can only be explained by the substrate interaction with residues from both NS3 and NS2B proteins, which underlies the relevance of their interaction.

Illustrating this fact, the inhibitor peptide NDL (Bz‐Nle‐Lys‐Arg‐Arg‐H) has been employed to study the binding mode of covalently bound inhibitors in the NS3‐NS2B protease from DENV‐3 and WNV (Figure 1A,C). NDL covalently binds to Ser135 and it is further stabilized by a series of interactions between its positively charged moieties and residues in the NS3 (Figure 1C,D).

Interestingly, the NS3‐NS2B complex exhibits either an open or closed configuration, which is believed to change upon ligand binding. For example, NDL in the crystal structure (3U1I) displays a closed conformation where the NS2B beta‐hairpin covers the active site (Figure 1A). However, the trypsin substrate‐mimetic inhibitor aprotinin binds an open conformation of the NS3PRO (3UIJ),[14] in which the NS2B hairpin does not cover the active site. This indicates the so‐called “induced‐fit” mechanism of catalysis, where substrates and substrate‐mimicking inhibitors induce conformational changes in the protein leading to the active conformation.[17] Differences between both conformations are not limited to changes in the NS2B conformation and previous comparative work discusses that associating “open” and “closed” to activity might be an oversimplification.[18]

The crucial role of processing viral polyproteins into their final functional entities highlights this protease potential as a drug target.[19] However, most of the DENV NS3‐NS2B inhibitors described in the literature display high molecular weight peptide‐mimetic structures with poor bioavailability,[20, 21] which prevents their clinical use.

To identify new NS3‐NS2B inhibitors, it is essential to understand the motion of the protease domain as well as the drug‐protein interactions. Here, by simulating the apostructure of NS3‐N2SB like also both a covalently and non‐covalently bound inhibitors (∼10 μs each), we aim to compare the conformational landscape of NS3 system. Analysis of the trajectories allowed us to clarify subtle differences in ligand binding profiles, highlighting the relevance of NS2B residues for its stabilization. Furthermore, we were able to identify putative druggable pockets for further drug discovery work.

Results

Structure selection and simulation setup

During the DENV maturation process, the N‐terminal portion of NS3 is anchored in the vicinity of the endoplasmic reticulum and non‐covalently interacting with the NS2B.[22] In order to mimic this behaviour, several studies have utilized an artificial construct linking NS3 and NS2B via a glycine‐rich loop to experimentally evaluate their activity.[23] Although previously co‐crystallized flavivirus’ NS3PRO structures are available, most of them are either incomplete or do present this artificial glycine‐rich loop (indicated as GlyL, Table 1).

Table 1.

Crystallographic structures analysed for conformational changes, co‐crystallized with peptides or peptide inhibitors. NS3PRO refers to the protease domain of the NS3 protein, ignoring the helicase part of the protein, while NS2BCF refers only to the hydrophilic loop of NS2B, which acts as a cofactor for the NS3PRO activation.

Virus

PDB ID

Structural Information

Resolution (Å)

DENV‐1

3L6P[a]

NS2BCF‐GlyL‐NS3PROΔ10

2.20

DENV‐2

2FOM

NS2BCF‐GlyL‐NS3PRO

1.50

DENV‐3

3U1I[b]

NS2BCF‐NS3PRO+NDL

2.30

DENV‐3

3U1J

NS2BCF‐NS3PRO+Aprotinin

1.80

DENV‐4

2WHX[c]

NS2B18‐GlyL‐NS3full

2.20

DENV‐4

2VBC[c]

NS2B18‐GlyL‐NS3full

3.15

WNV

2FP7

NS2BCF‐GlyL‐NS3PRO+NDL

1.68

WNV

2IJO

NS2BCF‐GlyL‐NS3PRO+BPTI

2.30

WNV

3E90

NS2BCF‐GlyL‐NS3PRO+NKK

2.45

[a] The construct has a 10 residues deletion in the N‐terminus of NS3PRO, to facilitate crystallization. [c] Missing NS3 Pro10‐Ala16. [b] Instead of using the full hydrophilic loop from NS2B, a short sequence of 18 residues was used; this was done only to provide solubility to the expressed complex. NDL: Bz‐Nle‐Lys‐Arg‐Arg‐H, BPTI: bovine pancreatic trypsin inhibitor, NKK: Naph‐Lys‐Lys‐Arg‐H, DENV: dengue virus, WNV: West Nile virus Bz‐benzyl, Nle‐norleucine, Naph‐naphthalene

As a solution this issue decided to use homology modelling. We observed that structures from other viruses could be used as a template for the homology model. However, these structures either do not harbour an inhibitor (e. g. PDB: 2FOM, 2VBC and 2WHX) or harbour one not specifically bound to its catalytic site (e. g. 6MO0, 6MO1 and 6MO2), present mutations (e. g. 2WZQ, 3LKW, 4M9F, 4M9I and 4M9T) or were produced in different pH (e. g. 4M9K and 4M9M). In particular, DENV‐3 protease bound to aprotinin (3U1J) does not form a closed conformation (β‐hairpin region of NS2B wrapped around the NS3 core)[14] and, therefore it was not selected.

The best structure for our purposes was 3U1I, as it was co‐crystallized with a covalently bound inhibitor (NDL, Figure 1C,D) connected to Ser135. This structure has a closed conformation[14] (Figure 1A), where the NS2B beta‐sheet folds over the active site and is further stabilized by interactions at the P4′ to P3 sites of the substrate. The NS3‐N2SB structure 3U1I[14] was simulated in three different modes: i) with the NDL inhibitor covalently bound to Ser135, representing the inhibited state; ii) with the NDL in position for inhibition but non covalently bound; and iii) the apostructure with no inhibitor.

Inhibitor binding mode assessed by MD depends on the interaction with NS3 and NS2B

As an initial approach, we investigated the stability of the interactions between NS3‐NS2B and the ligand along the entire MD trajectory. Slightly different interaction profiles (residues, bond types and frequencies) are observed when comparing the simulations with the non‐covalently and covalently bound ligands (Figure 2A). In both systems, NDL is mainly stabilized by a pair of ionic interactions and hydrogen bonds between its basic side chains (P1–P2, Figure 2), and between the catalytic Asp75 (NS3) and Asp79 (NS2B) (Figure 2B). Simulations with the non‐covalently bound ligand also features a prominent interaction of the P2’s arginine moiety with Asp81 from the NS2B (Figure 2B).

Figure 2.

Figure 2

Interactions of DENV‐3 protease with consensus peptide, both covalently and non‐covalently bound. A) Interaction pattern comparing simulations with the covalently bound peptide against the non‐covalently bound peptide, bars represent the frequency of interaction during the simulation. Residues belonging to the P1 site are labelled in blue. Other P‐sites were not significantly enriched in the simulations. The non‐covalently bound ligand mainly manifests a single stable conformation (B), while the covalently bound system explores two different binding modes (C,D). Dashed yellow lines designate polar interactions. The protease's residues are coloured according to the atom types (protein carbons, light grey for the NS3 and cyan for the NS2B; nitrogen, blue; oxygen, red).

Further, simulations with covalently bound ligand reveal that the ligand is stabilized by an interaction between the hydrogen of P2 main‐chain and the side chains of Asn152 and His51, as well as π‐π interactions and hydrophobic contacts between the P4–P5 moieties and the Tyr161 residue (Figure 2A,C,D). These interactions are slightly different from the ones originally proposed in the crystal structure, which highlights the possibility of multiple binding modes for the ligand. Our data also suggests that Asn152 stabilizes the hydroxyl group of non‐covalently bound peptide by forming a hydrogen bond (Figure 2A,B), and the hydrogen bond becomes more prevalent in the covalently bound systems (Figures 2C,D). It is worth mentioning that Asn152Ala NS3 mutants remain active in terms of proteolysis, but exhibit a significant decrease in their binding affinity for some ligands,[24] which supports the relevance of this residue for stabilization even if not directly involved in the catalytic reaction.

This interaction profile is in agreement with the enzyme's preference for basic moieties in the P1/P2, as a result of its interactions with the Asp75 (NS3, interacting with the P1) and Asp129 (NS2B, with P2).[24] These observations are consistent with the results of Kouretova et al., (2017). It has been postulated that effective inhibitors in biochemical assays would therefore be positively charged, which in turn would lead to poor in vivo bioavailability.

To a further investigation, whether these differences in the ligand interaction were a result from the conformation changes in the protein, we decided to analyse the MD trajectories using principal component analyses.

Conformational changes in the NS3‐NS2B dynamics upon ligand binding

It is known from the available crystal structures (Supporting Information, Figure S2) that, in the absence of a substrate or active site inhibitor, the C‐terminal portion of NS2B can adopt an “open” conformation and become disordered.[25, 26, 27] This conformational change in NS2B after ligand binding has been described by short MD simulations and by several NMR studies using a linked construct.[25, 26, 27] After substrate binding to the NS3 active site, the C‐terminal portion of NS2B interacts with the NS3 core, closing the NS3 active site with the so‐called active “closed” conformation. However, in our simulations, we observed that the protease can adopt four major states (based on the K‐means clusterization, Supporting Information: Figure S3): State I – a tightly closed state which is exclusive of the system simulated with the covalently bound ligand (Figure 3A); State II – a semi‐closed state where the NS3’s N‐terminal is unfolded (Figure 3B); State III – a second semi‐closed state where NS3’s N‐terminal loses the beta‐sheet fold but remains near the core protein (Figure 3C); and State IV – a fully open state, where the NS2B's C‐terminal uncovers the active site (Figure 3D). With exception of state I, which seems almost exclusive for systems with the covalently bound ligand, the occurrence of the states in the simulations happens independently from the presence of ligand binding, though with different frequencies (Figure 3A–D and Supporting Information Figure S3). These conformations are distinct, where the closed‐ones are stabilized by the interaction between the negatively charged amino acids of NS2B's C‐terminal loop, and the NS3’s lysine residues (Lys73 and Lys74, Figure 3E).

Figure 3.

Figure 3

DENV‐3 protease conformational states. A) State I, a tightly closed state, is only seen with the ligand covalently bound system. B) State II, a semi‐closed state, where the NS3’s N‐terminal is unfolded. C) State III, the semi‐closed state where NS3’s N‐terminal is losing the beta‐sheet fold, but remains near the core protein D) State IV, fully open with the NS2B's C‐terminal uncovers the active site. The frequency of each conformation for covalent bound ligan (Cov), non‐covalent bound ligand (Non‐Cov) and apostructure (Apo) is shown below the structural representation. The grey cartoon represents the NS3 protease domain and light blue represents the NS2B, yellow spheres represent the position of the active site and relevant residues are depicted as sticks. E) box plot representation of the distance variation along the analysed trajectory and plotted by each cluster for relevant amino‐acid pairs. Lys73‐Asp80 was chosen to represent the charge‐clamp holding the NS2B in the closed conformation. Arg85‐Asp81 distance variation represents the NS2B's hairpin bending and conformational changes and Tyr150‐Tyr161 describes hydrophobic pair relevant for the NS3 folding.

In a non‐covalently bound system, the active site covering the beta‐sheet lid of the C‐terminal region of NS2B (highlighted in purple in Figure 4E), has a lower root mean square fluctuation (RMSF) in comparison to the simulations with the apostructure and covalently bound ligands (Figure 4A–E). According to the PCA analysis, the high flexibility in this region is in agreement with the opening and exposure of the active site (Figure 4F).

Figure 4.

Figure 4

DENV protease core (NS3) mobility influences the interaction partner (NS2B) and vice‐versa. Comparison between the protein b‐factor from the crystal structure DENV‐3 protease (3U1I, in grey on the right axis) and the average residue fluctuations obtained from the RMSF of DENV's backbone atoms. RMSF values were compared between the average of the replicates of the system with (blue) and without peptide (purple, in the left axis). Dashed lines represent the standard deviating limits for the NS2B (A) and NS3 (B), while orange lines represent results from simulations with non‐covalently bound ligand (C, NS2B) and (D, NS3). E) The region that deviates in the presence of the peptide are highlighted in purple in the NS2B co‐activator beta‐sheet between (residues 75 and 85). F) The extreme movements of PC1 common for systems in the presence of ligands. In E) and F) NS3 is coloured as grey/black and NS2B is coloured as cyan, while golden arrows indicate the extreme movements related to each principal component. Yellow spheres represent the position of the catalytic triad.

Assessment of binding pockets and sites occupancy: druggability potential

Our microsecond‐scale MD simulations make it possible to study transient binding site properties. We decided to utilize SiteMap analyses along the simulated trajectories to identify changes in the average site volume (Figure 5A) between the states. The SiteMap can also predict the potential druggability of the pockets, ranking them by the SiteScore values. These values range from 0–1.5, with values higher than 0.8 suggesting druggable pockets. Interestingly, the differences in volume seem to inversely affect the predicted site's druggability, where more compact sites had higher SiteScore values (Figure 5B). Accordingly, State II and IV display a more accessible and exposed surface, resulting in lower SiteScore values for these states.

Figure 5.

Figure 5

Large variation in the catalytic site volume influences its druggability and most prominent druggable region. DENV‐3’s NS2B anionic pocket. Box plot representation of the volume (A) and druggability (B) along the analysed trajectory (plotted by each state cluster). Druggability variation for the active site pocket was assessed by the SiteScore calculated with the SiteMap. Solid lines represent the average and the boxes amplitude the quartiles, and outliers are highlighted, red asterisks represent significance in comparison to the respective apostructure distribution, as calculated by Mann‐Whitney test, p‐value<0.01 is represented by *. The identified pockets from the PrankWeb server and SiteMap analyses, represented as spheres (C–F) for the different protein states discussed on the PCA. Amino acid residues found in common between the different predictions are depicted as sticks.

We then proceeded to verify pockets using representative conformations for each state/ligand combination using the website PrankWeb. This server searches for potential ligand‐binding points on a protein surface, predicting the chemical propensity of these points. A feature vector is calculated by physicochemical and geometric properties of surrounding atoms and residues, which are then clustered, resulting in an individual ranking score.[28] The PrankWeb server predictions varied among the clusters and conformations, but at least one pocket was predicted to all states evaluated (Table 2). The results with the P2Rank method from PrankWeb also showed higher scores for covalent bound states. Herein they were visually depicted by one representative conformation of each state (I, II, III and IV, represented as spheres in Figure 5C–F). Pockets with higher scores were shown for the covalent (1.57 to 7.98) and non‐covalently bound (2.02 to 5.22) in comparison to the apostructure states (2.37 to 4.54).

Table 2.

Potential pocket's scores and binding sites predicted by the P2Rank method in PrankWeb to the three different modes assessed (apostructure, covalent bond to Ser135 and non‐covalently bound). Ligands were manually removed, and each conformation state was individually evaluated.

Conformation state

Best predicted pocket score

Total of potential binding sites (residues)

Apostructure (State IV)

4.54

n=37

Apostructure (State II)

2.37

n=16

Apostructure (State II)

2.59

n=12

Covalent (State IV)

1.57

n=16

Covalent (State III)

3.71

n=11

Covalent (State II)

2.49

n=14

Covalent (State I)

7.98

n=24

Non‐covalently bound (State IV)

5.22

n=24

Non‐covalently bound (State III)

5.08

n=30

Non‐covalently bound (State II)

3.66

n=20

Non‐covalently bound (State I)

2.02

n=11

In addition to the importance in the states’ differences, pocket analyses identified similar concomitant residues predicted as potential binding sites, including Asn152 for all states evaluated; His51, Pro132, Gly133, Ser135, Gly151 and Asn152 for all apostructure states; Asn152 for all non‐covalently bound states; and Ile123, Asn152, Ile165 and Ala166 for all covalent states (Supporting Information, Table 1).

Taken together, these results suggest that the druggable active site presents a unique residue (Asn152) shared among all conformational states. Also, this binding site is not found directly in the covalent or apostructure protein states. Residue Asn152 is a potential target in the protease active site, as it was conserved between the open and closed states, and corresponding P2Rank scores were similar in between all binding modes and protease conformations. Even so, it is important to mention that the structural accessibility of an inhibitor to the active site can also play a key role in the complex.[17] To the best of our knowledge, this is the first time that this approach has been applied to comparatively evaluate the pocket sites that have importance to the different DENV‐3 protease conformations.

Discussion

The NS3 proteases are highly flexible proteins, whose flexibility is often underestimated when only a single crystal structure is analysed. Especially the DENV's NS3 can adopt multiple substrate/inhibitor specific conformational states. Here we have used extensive MD simulations to study the DENV‐3 NS3‐NS2B complex, as a model for DENV proteases, to identify conformational changes occurring in the substrate/inhibitor binding. We sampled the conformational landscape by conducting simulations with a ligand (NDL) covalently and non‐covalently bound to the catalytic Ser135 and compared these results with apostructure simulations. While there are clear differences between each system, the most striking feature is the identification of distinct protein conformations.

From the start, the detected interplay between the NDL ligand interaction with residues from both NS3‐NS2B and protein changes upon the ligand binding led us to systematically investigate conformational changes in the complex. We used a combination of principal component analyses and K‐means clustering to classify the conformations explored during the simulation into four main States (I–IV). These states structurally vary in the NS2B conformation (open or closed), due to changes in the interaction between the NS2B's C‐terminal beta‐hairpin (Asp80 and Asp81), rich in negatively charged amino acids, and the NS3’s lysine residues (Lys73 and Lys74). Open conformations where the NS2B hairpin is misplaced or unfolded have been previously observed in other crystal structures (Supporting Information, Figure S2).

Previously reported simulations conducted with the NS3‐NS2B from WNV,[29, 30] starting from an initial open conformation, reported the movement of NS2B hairpin's closing as an irreversible process. Interestingly, the WNV open conformation displayed an improperly formed oxyanion hole (Thr132‐Gly133), which flipped towards the Ser135, concomitantly with the closing of the NS2B hairpin. This resulted in the formation of the catalytically competent oxyanion hole in a closed conformation, which then remained stable until the end of the simulations (of around 80 ns).[30] This observation is consistent with the mechanism of action of the catalytic triad in the flaviviruses’ proteases, where the reactive serine oxyanion generates a tetrahedral intermediate necessary for the substrate cleavage.[15] In agreement with that observation, the work of De Almeida et al., (2013) also detected the involvement of a hydrogen bond with residue Gly133 between its main chain atoms and P1 and P2 in all conformations evaluated, and with the same NS3 structure assessed here (3U1I).[31]

We then proceeded on studying the potential druggability of these new conformations using both SiteMap and PrankWeb server. Interestingly the closed states (I and III) have, on average, smaller pocket volumes and higher druggability values. Studying the amino acid composition of the overlapping pockets suggests that the involvement of NS2B residues increases its druggability. The site prediction analyses also detected conserved ligand‐binding sites on the protein's surface, extending it beyond the classical active site (e. g. Asn152). In PrankWeb, data showed only one residue (Asn152) concomitant to all conformation states, as well as specific residues concomitant to all covalent states (e. g. Ile123), and for the apostructure states (e. g. Gly133 and Gly151) (Supporting Information, Table 1). These results also corroborate previous findings which highlight Lys73, Lys74, Ile123 and Asn152,[32] as well as Gly151 and Gly153,[17] Gly133,[31] and Asp75 as important binding sites on the protein's surface and are also consistent with the WNV protease interaction previously described.[33, 34] The work from Behnam et al. (2020), comparing different NS3‐NS2B protease structures, describes the so‐called subpocket B that extends beyond His51/Asp75 connecting it with the P2 pocket.[18]

Similarly, the work from De Almeida et al., (2013) aimed to account for the possible influence of N‐ and C‐terminus domains in the NS3 protease's active site in terms of conformational changes and/or substrate accessibility. They studied short simulations (30 ns) of a full‐length homology model, whose conformations were then clustered into four families.[31] However, due to the short simulation time, one can speculate whether those conformational differences were due to the initial system equilibration or whether the authors had underestimated the possibility of an even more flexible system.

Brecher et al., (2016,[24]) postulated the presence of an allosteric pocket in DENV‐2 protease, composed of NS2B's 67–95 hairpin loop and the NS3PRO 117–120 loop. This pocket would exist only in the closed‐active conformation when the NS2B would be less flexible. In the open‐inactive conformation, the active site would behave as a cleft, composed of NS3 residues such as Val146, Gly148, Leu149, Tyr150, Gly151 and Asn152, without support from the NS2B residues, which requires to be induced to a closed conformation to bind peptide inhibitors.

Based on our results, we suggest that DENV‐3 protease would be a difficult target for classically designed drug‐like small molecules, which would bind to a single protein conformation. One of the issues is the low druggability of open states, such as the State IV. Although the open conformation in State IV makes the active site more accessible, drug binding is hindered due to the strong desolvation effect. On the other hand, the more constrained States II and III are not very much better, as binding sites are too small for drug‐like compounds. We believe that the best alternative is to have a covalent‐binding, resembling the State I conformation. This is probably only possible if the initial binding occurs at State IV and, after complex formation, initiates conformational change to a more compact State.

The search of relevant conformations, among the heterogeneous set of structures available in the literature, have substantial implications on the understanding of this enzyme.[18] We suggest that understanding this conformational shift is key for further drug discovery, since even induced‐fit and flexible docking would be only relevant for fine tuning.

It would be also interesting to have a crystal structure of DENV protease bound to a lead compound with interactions in both P sites, since this would help in the further optimization in the design of inhibitors, especially considering P2 and P3 moieties, such as those available in this same structure (3U1I).[35] For instance, Yao et al., (2019)[36] proposed pan‐flavivirus inhibitors binding to an allosteric pocket of NS3. Their lead compound held the protease in a catalytically inactive open conformation, as confirmed by crystallization. This novel type of inhibitor could be expanded to allow more interactions with the NS2B cofactor, which may lead to conformational changes in the NS3‐NS2B complex. Finally, we hope that our current study would enable the expansion of the DENV protease inhibitor discovery by providing novel interesting conformations to target as well as the dynamic description of the open‐closing transition.

Conclusion

The DENV's NS3 protease crystal structures often underestimate its high flexibility, as this protein can adopt multiple conformational states depending on the ligand bound. Here we expand on the conformational changes occurring in the DENV‐3 NS3‐NS2B complex obtained from a long‐timescale molecular dynamics simulation. We sampled the ligand‐binding site conformational landscape by conducting MD simulations with the non‐ and covalently bound ligand states, as well as the apostructure.

We assessed which residues would compose potential druggable binding sites (Tyr161‐NS3, Asp75 and Asp129, in the NS2B), in addition to evaluating which residues were involved in the establishment, stability and activity of the protease complex (e. g. Asn152). We observed that the interaction with Asp81 (NS2B) can be relevant for the recognition of the inhibitor and the transition towards the catalytically active closed conformation. This emphasizes the importance of the co‐factor NS2B in the conformational switch that leads to activity.

Furthermore, we propose that inhibitors targeting the open NS3‐NS2B conformations could benefit from the higher accessibility, despite the poorer druggability of these sites. Complementarily, inhibitors inducing closing conformational changes would be relevant for the protease inhibition.

Experimental Section

The N2SB‐NS3 protease bound to a covalent inhibitor (the NDL inhibitor found in the PDB ID: 3U1I[14]) was simulated in three different modes: i) with the NDL inhibitor covalently bound to Ser135, representing the inhibited state; ii) with the NDL in position for inhibition but non covalently bound; and iii) the apostructure with no inhibitor. This study aimed to identify different protease conformations in each system and evaluate their druggability potential for the development of new inhibitors.

Structure selection and system preparation

DENV‐3 protease (wild‐type PDB ID: 3U1I[14]) was selected, among other available structures, for the molecular dynamics simulations based on the resolution (2.3 Å) and the presence of a peptide inhibitor (NDL), Bz‐Nle‐Lys‐Arg‐Arg‐H, where Bz represents a benzyl group and Nle stands for norleucine.

The selected structure was prepared using Epik,[37] by adjusting the ionization states of amino acid residues, especially taking into account the catalytic triad (Ser135, His51 and Asp75), in addition to fixing missing side chain atoms, and then further subjecting it to restrained energy minimization (PrepWiz, Maestro v2019.1). The missing segments Ser1‐Lys15 from NS3PRO were modelled using Prime[38] with standard options. Incomplete sequence N‐ and C‐termini were capped and sulphate ions from the crystallization buffer were removed. Water molecules with more than two hydrogen bonds were retained. For the covalently bound system simulations, the bond with Ser135 was maintained. Non‐covalently bound systems were generated by breaking the covalent bond with Ser135. Final structure was minimized using the PrepWizard using the OPLS3e force‐field, with movements restrained to the maximum of 0.3 Å variation.

Molecular dynamics simulation

MD simulations were carried out using the Desmond engine[39] with the OPLS3e force‐field.[40] The simulated system encompassed the protein‐ligand complex, a predefined water model (TIP3P[41]) as explicit solvent and counter‐ions (Na+ or Cl adjusted to neutralize the overall system charge, around 4 or 5 Na+ atoms depending on the system). The entire system was treated in a cubic box with periodic boundary conditions, specifying the shape and the size of the box as 13 Å distance from the box edges to any atom of the protein (totalizing around 40,000 atoms between protein, ligand, solvent and ions). We used a time step of 1 fs and the short‐range coulombic interactions were treated using a cut‐off value of 9.0 Å, while the smooth Particle Mesh Ewald method (PME) handled the long‐range coulombic interactions.[42]

Initially, the system was minimized using Steepest Descent, followed by equilibration. All simulations were run in NPT ensemble (T=310 K, Nosé‐Hoover method; p=1.01325 bar, Martyna‐Tobias‐Klein method) with default Desmond settings. Reversible reference system propagator algorithms (RESPA) integrator with 2 fs, 2 fs and 6 fs timesteps were used for bonded, near and far, respectively. A constant temperature of 310 K was maintained throughout the simulation using the Nose‐Hoover thermostat algorithm, and the Martyna‐Tobias‐Klein Barostat algorithm was used to maintain 1 atm of pressure. After minimization and relaxation of the system, we proceeded with an independent production step of at least 1 μs for each system, with some particular systems being extended after evaluation. All MD simulations were performed at least in five independent runs with randomly generated seeds. In total, we studied 12 μs for DENV protease covalently bound to its ligand, 9 μs for the system with unbound ligand and 10 μs for the apostructure simulations. The representative structural frames were selected based on a visual inspection of the conserved interactions and principal component analyses (PCA) followed by cluster analysis.

Simulation trajectory analyses and principal component analyses (PCA)

In the PCA, the backbone of each frame was extracted and aligned using trj_selection_dl.py and trj_align.py scripts from Schrodinger (2019v4). Individual simulations from all the runs were merged using trj_merge.py into a final trajectory and CMS files, which were further used for the generation of the principal components. PCA was carried out employing protein C‐alpha atoms as variables using trj_essential_dynamics.py script. The first principal component was found to explain 20.4 %, the second 15.3 % and the third 12.6 %, of the total information (48.3 %).

By using the first three principal components results from the previous analysis, we carried out K‐means clustering analysis employing the KNIME analytics platform.[43] Initially, aiming to define the optimal number of clusters, the Elbow method (also known as scree‐plot) was carried out.[44] In this step, ten clustering analysis were generated varying the number of clusters from 1 to 10. Then, the sum of squared distances between samples (frames from MD simulations) and the centroid of the defined cluster was calculated for each run. Then, the optimal number of clusters was defined by the point with a drastic decrease in the distance variance (therefore, “elbow‐like” curve, as displayed in the scree‐plot). Finally, a K‐means analysis was carried out using the obtained optimal number of clusters.

Interactions and distances were determined using the Simulation Event Analysis pipeline implemented in Maestro (Maestro 2019v4) with standard bond definitions. Molecular dynamics trajectories were visualized, and figures were produced using PyMOL (Schrödinger LCC, version 2.4). Trajectories and interaction data are available in the Zenodo repository (under the code: 10.5281/zenodo.3732941).

Identification of potential druggable Pockets

We used SiteMap[45] to analyse the structures representing different conformational states of the DENV protease, aiming to identify and evaluate potential ligand‐binding sites. SiteMap works by creating a grid of points separated by 1 Å. Each of those points is evaluated to check for one of the three cases: i) if they overlap with any protein atoms, ii) if they are insufficiently enclosed by the protein or iii) if they have a too small van der Waals interaction energy with the protein. If so, in any of those cases, this point is not considered for further analyses. The remaining points are then grouped into sites, and the enclosure, size, and hydrophilicity of the sites are calculated. These values are combined into a single scoring function called the SiteScore, that ranges from 0 to 1.5 (most likely to be druggable). SiteScore values higher than 0.8 were previously associated with druggable pockets.[45]

Finally, a P2Rank analysis on the webserver PrankWeb was performed to compare different conformations states and map binding residues.[28, 46] Briefly, the method consists in a machine learning (ML) technique to predict binding sites. Employing a random forest (RF), the method predicts a chemical propensity bind of points (potential locations of binding ligand contact atoms) on a solvent‐accessible protein surface. A feature vector describes these locations, which consists of physicochemical and geometric properties of surrounding atoms and residues (e. g. aromaticity and hydrophobicity). High propensity points are then clustered (i. e. pockets), and ranked based on the cumulative score of the cluster. Here, covalent and non‐covalently bound conformations’ ligands were manually removed before P2Rank analyses.

Conflict of interest

The authors declare no conflict of interest.

Supporting information

As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.

Supporting Information

Acknowledgements

This work was supported by the UFMG intramural funds, CAPES (88887.595578/2020‐00), CNPq (132732/2018‐1) and FAPEMIG (APQ‐03058‐18) for the fellowship of M.S.M.S. The authors thank CSC – Finland for the very generous computational resources provided. The authors also thank Ewen MacDonald for critical reading and language corrections. Open access funding enabled and organized by Projekt DEAL.

T. Kronenberger, M. Sá Magalhães Serafim, A. Kumar Tonduru, V. Gonçalves Maltarollo, A. Poso, ChemMedChem 2021, 16, 2524.

References

  • 1.“Dengue control. WHO | Epidemiology,” can be found under http://www.who.int/denguecontrol/epidemiology/en/ 2016, accessed on February 2021.
  • 2.Bhatt S., Gething P. W., Brady O. J., Messina J. P., Farlow A. W., Moyes C. L., Drake J. M., Brownstein J. S., Hoen A. G., Sankoh O., Myers M. F., George D. B., Jaenisch T., Wint G. R. W., Simmons C. P., Scott T. W., Farrar J. J., Hay S. I., Nature 2013, 496, 504–507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Stanaway J. D., Shepard D. S., Undurraga E. A., Halasa Y. A., Coffeng L. E., Brady O. J., Hay S. I., Bedi N., Bensenor I. M., Castañeda-Orjuela C. A., Chuang T.-W., Gibney K. B., Memish Z. A., Rafay A., Ukwaja K. N., Yonemoto N., Murray C. J. L., Lancet Infect. Dis. 2016, 16, 712–723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wilder-Smith A., Ooi E.-E., Horstick O., Wills B., The Lancet. 2019, 393, 350–363. [DOI] [PubMed] [Google Scholar]
  • 5.Xie X., Gayen S., Kang C., Yuan Z., Shi P.-Y., J. Virol. 2013, 87, 4609–4622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Miller S., Kastner S., Krijnse-Locker J., Bühler S., Bartenschlager R., J. Biol. Chem. 2007, 282, 8873–8882. [DOI] [PubMed] [Google Scholar]
  • 7.Lindenbach B. D., Thiel H.-J., Rice C. M., Fields Virology 2006, 52, 1101–1152. [Google Scholar]
  • 8.Pijlman G. P., Kondratieva N., Khromykh A. A., J. Virol. 2006, 80, 11255–11264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Patkar C. G., Kuhn R. J., J. Virol. 2008, 82, 3342–3352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gebhard L. G., Iglesias N. G., Byk L. A., Filomatori C. V., De Maio F. A., Gamarnik A. V., J. Virol. 2016, 90, 5451–5461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Amberg S. M., Rice C. M., J. Virol. 1999, 73, 8083–8094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Lee E., Stocks C. E., Amberg S. M., Rice C. M., Lobigs M., J. Virol. 2000, 74, 24–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lobigs M., Lee E., J. Virol. 2004, 78, 178–186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Noble C. G., Seh C. C., Chao A. T., Shi P. Y., J. Virol. 2012, 86, 438–446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Timiri A. K., Sinha B. N., Jayaprakash V., Eur. J. Med. Chem. 2016, 117, 125–143. [DOI] [PubMed] [Google Scholar]
  • 16.Gouvea I. E., Izidoro M. A., Judice W. a. S., Cezari M. H. S., Caliendo G., Santagada V., dos Santos C. N. D., Queiroz M. H., Juliano M. A., Young P. R., Fairlie D. P., Juliano L., Arch. Biochem. Biophys. 2007, 457, 187–196. [DOI] [PubMed] [Google Scholar]
  • 17.Aleshin A. E., Shiryaev S. A., Strongin A. Y., Liddington R. C., Protein Sci. 2007, 16, 795–806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Behnam M. A. M., Klein C. D. P., Biochimie 2020, 174, 117–125. [DOI] [PubMed] [Google Scholar]
  • 19.Steuber H., Hilgenfeld R., Curr. Top. Med. Chem. 2010, 10, 323–345. [DOI] [PubMed] [Google Scholar]
  • 20.Tian Y.-S., Zhou Y., Takagi T., Kameoka M., Kawashita N., Chem. Pharm. Bull. 2018, 66, 191–206. [DOI] [PubMed] [Google Scholar]
  • 21.Wang Q.-Y., Patel S. J., Vangrevelinghe E., Xu H. Y., Rao R., Jaber D., Schul W., Gu F., Heudi O., Ma N. L., Poh M. K., Phong W. Y., Keller T. H., Jacoby E., Vasudevan S. G., Antimicrob. Agents Chemother. 2009, 53, 1823–1831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Luo D., Vasudevan S. G., Lescar J., Antiviral Res. 2015, 118, 148–158. [DOI] [PubMed] [Google Scholar]
  • 23.Phoo W. W., El Sahili A., Zhang Z., Chen M. W., Liew C. W., Lescar J., Vasudevan S. G., Luo D., Antiviral Res. 2020, 182, 104900. [DOI] [PubMed] [Google Scholar]
  • 24.Brecher M., Li Z., Liu B., Zhang J., Koetzner C. A., Alifarag A., Jones S. A., Lin Q., Kramer L. D., Li H., PLoS Pathog. 2017, 13, e1006411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Su X.-C., Ozawa K., Qi R., Vasudevan S. G., Lim S. P., Otting G., PLoS Neglected Trop. Dis. 2009, 3, e561. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Su X.-C., Ozawa K., Yagi H., Lim S. P., Wen D., Ekonomiuk D., Huang D., Keller T. H., Sonntag S., Caflisch A., Vasudevan S. G., Otting G., FEBS J. 2009, 276, 4244–4255. [DOI] [PubMed] [Google Scholar]
  • 27.Zhu L., Yang J., Li H., Sun H., Liu J., Wang J., Biochem. Biophys. Res. Commun. 2015, 461, 677–680. [DOI] [PubMed] [Google Scholar]
  • 28.Jendele L., Krivak R., Skoda P., Novotny M., Hoksza D., Nucleic Acids Res. 2019, 47, W345–W349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kang C., Gayen S., Wang W., Severin R., Chen A. S., Lim H. A., Chia C. S. B., Schüller A., Doan D. N. P., Poulsen A., Hill J., Vasudevan S. G., Keller T. H., Antiviral Res. 2013, 97, 137–144. [DOI] [PubMed] [Google Scholar]
  • 30.Ekonomiuk D., Caflisch A., Protein Sci. 2009, 18, 1003–1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.de Almeida H., Bastos I. M. D., Ribeiro B. M., Maigret B., Santana J. M., PLoS One 2013, 8, e72402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Wu H., Bock S., Snitko M., Berger T., Weidner T., Holloway S., Kanitz M., Diederich W. E., Steuber H., Walter C., Hofmann D., Weißbrich B., Spannaus R., Acosta E. G., Bartenschlager R., Engels B., Schirmeister T., Bodem J., Antimicrob. Agents Chemother. 2015, 59, 1100–1109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Erbel P., Schiering N., D'Arcy A., Renatus M., Kroemer M., Lim S. P., Yin Z., Keller T. H., Vasudevan S. G., Hommel U., Nat. Struct. Mol. Biol. 2006, 13, 372–373. [DOI] [PubMed] [Google Scholar]
  • 34.Leonel C. A., Lima W. G., dos Santos M., Ferraz A. C., Taranto A. G., de Magalhães J. C., dos Santos L. L., Ferreira J. M. S., Arch. Virol. 2018, 163, 575–586. [DOI] [PubMed] [Google Scholar]
  • 35.Lin K.-H., Ali A., Rusere L., Soumana D. I., Yilmaz N. K., Schiffer C. A., J. Virol. 2017, 91, DOI 10.1128/JVI.00045–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Yao Y., Huo T., Lin Y.-L., Nie S., Wu F., Hua Y., Wu J., Kneubehl A. R., Vogt M. B., Rico-Hesse R., Song Y., J. Am. Chem. Soc. 2019, DOI 10.1021/jacs.9b02505. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Shelley J. C., Cholleti A., Frye L. L., Greenwood J. R., Timlin M. R., Uchimaya M., J. Comput.-Aided Mol. Des. 2007, 21, 681–691. [DOI] [PubMed] [Google Scholar]
  • 38.Jacobson M. P., Pincus D. L., Rapp C. S., Day T. J. F., Honig B., Shaw D. E., Friesner R. A., Proteins. 2004, 55, 351–367. [DOI] [PubMed] [Google Scholar]
  • 39.K. J. Bowers, E. Chow, H. Xu, R. O. Dror, M. P. Eastwood, B. A. Gregersen, J. L. Klepeis, I. Kolossvary, M. A. Moraes, F. D. Sacerdoti, J. K. Salmon, Y. Shan, D. E. Shaw, in Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, ACM, New York, NY, USA, 2006.
  • 40.Harder E., Damm W., Maple J., Wu C., Reboul M., Xiang J. Y., Wang L., Lupyan D., Dahlgren M. K., Knight J. L., Kaus J. W., Cerutti D. S., Krilov G., Jorgensen W. L., Abel R., Friesner R. A., J. Chem. Theory Comput. 2016, 12, 281–296. [DOI] [PubMed] [Google Scholar]
  • 41.Jorgensen W. L., Chandrasekhar J., Madura J. D., Impey R. W., Klein M. L., J. Chem. Phys. 1983, 79, 926–935. [Google Scholar]
  • 42.Darden T., York D., Pedersen L., J. Chem. Phys. 1993, 98, 10089–10092. [Google Scholar]
  • 43.Berthold M. R., Cebron N., Dill F., Gabriel T. R., Kötter T., Meinl T., Ohl P., Thiel K., Wiswedel B., ACM SIGKDD Explorations Newsletter 2009, 11, 26. [Google Scholar]
  • 44.“Topic Extraction – kilian.thiel,” can be found under https://hub.knime.com/knime/spaces/Examples/latest/08 Other Analytics Types/01 Text Processing/17 TopicExtraction with the ElbowMethod, n.d.
  • 45.Halgren T. A., J. Chem. Inf. Model. 2009, 49, 377–389. [DOI] [PubMed] [Google Scholar]
  • 46.Krivák R., Hoksza D., J. Cheminf. 2018, 10, 39. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.

Supporting Information


Articles from Chemmedchem are provided here courtesy of Wiley

RESOURCES