Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2022 Aug 13;12:13792. doi: 10.1038/s41598-022-17920-7

Fly casting with ligand sliding and orientational selection supporting complex formation of a GPCR and a middle sized flexible molecule

Junichi Higo 1,2,, Kota Kasahara 3, Gert-Jan Bekker 4, Benson Ma 4, Shun Sakuraba 5, Shinji Iida 6,8, Narutoshi Kamiya 1, Ikuo Fukuda 1, Hidetoshi Kono 5, Yoshifumi Fukunishi 7, Haruki Nakamura 4
PMCID: PMC9376114  PMID: 35963875

Abstract

A GA-guided multidimensional virtual-system coupled molecular dynamics (GA-mD-VcMD) simulation was conducted to elucidate binding mechanisms of a middle-sized flexible molecule, bosentan, to a GPCR protein, human endothelin receptor type B (hETB). GA-mD-VcMD is a generalized ensemble method that produces a free-energy landscape of the ligand-receptor binding by searching large-scale motions accompanied with stable maintenance of the fragile cell-membrane structure. All molecular components (bosentan, hETB, membrane, and solvent) were represented with an all-atom model. Then sampling was conducted from conformations where bosentan was distant from the binding site in the hETB binding pocket. The deepest basin in the resultant free-energy landscape was assigned to native-like complex conformation. The following binding mechanism was inferred. First, bosentan fluctuating randomly in solution is captured using a tip region of the flexible N-terminal tail of hETB via nonspecific attractive interactions (fly casting). Bosentan then slides occasionally from the tip to the root of the N-terminal tail (ligand–sliding). During this sliding, bosentan passes the gate of the binding pocket from outside to inside of the pocket with an accompanying rapid reduction of the molecular orientational variety of bosentan (orientational selection). Last, in the pocket, ligand–receptor attractive native contacts are formed. Eventually, the native-like complex is completed. The bosentan-captured conformations by the tip-region and root-region of the N-terminal tail correspond to two basins in the free-energy landscape. The ligand-sliding corresponds to overcoming of a free-energy barrier between the basins.

Subject terms: Biophysics, Computational biology and bioinformatics

Introduction

So-called G protein-coupled receptors (GPCRs) are membrane proteins constructing a large evolutionarily related protein family with various molecular functions1. Generally speaking, GPCRs have seven transmembrane helices (TM1–TM7) embedded in the membrane. GPCRs activate cellular responses via detection of molecules outside the cell. Because GPCRs are related to many diseases, they have been important targets for drugs2. Endothelin-1 (ET1) is a peptide (21 residues long) with a strong vasoconstrictor action discovered in humans3. To exert its activity, ET1 transmits signals by interacting with two GPCRs: endothelin type A (ETA)4 and endothelin type B (ETB)5 receptors. The complex structure of human endothelin type-B receptor (hETB) and ET1 was resolved using X-ray crystallography6. In this complex, ET1 is bound to a binding pocket of hETB on the cytoplasmic side.

Bosentan competes with ET1 when binding to human ETA (hETA) and human hETB (hETB). It inhibits ET1’s vasoconstriction effect as an antagonist79. The tertiary structure of the bosentan-hETB complex was solved using X-ray crystallography10. In this structure, bosentan binds to the binding pocket of hETB more deeply than ET1. However, interaction patterns between the C-terminal segment of ET1 and amino-acid residues in the binding pocket of hETB are similar to those in the bosentan–hETB complex.

Bosentan is a medium-sized (551.6 Da) drug molecule that is actually large compared to other commercial drug molecules. This molecule is flexible because it has long flexible sidechains (Fig. 1a), whereas the central ring, to which the sidechains are connected, is stiff. Consequently, bosentan will adopt temporal complex conformations (encounter complexes) before reaching the deep position of the hETB pocket (i.e., the native-complex position). A special molecular dynamics (MD) simulation that can sample various conformations in the complex-formation process is suitable for investigating those temporal conformations.

Figure 1.

Figure 1

(a) Chemical structure of bosentan. Two atom groups GβB and GγB are surrounded by dotted-line rectangles, which are used to set reaction coordinates (RCs) λβ and λγ. (b) The first RC, λα, defined by distance between centroids of atom groups GαA (red segment in the N-terminal tail of hETB surrounded by dotted line; residues 85–89) and GαB (magenta-colored segment in β-hairpin of hETB surrounded by another dotted line; residues 243–254). Binding pocket of hETB is indicated by blue broken-line circle. Bound bosentan is shown at the bottom of the pocket. Green and cyan regions are defined in panel (c). (c) The second and third RCs are, respectively, λβ and λγ. Also, λβ is defined by the distance between centroids of GβA (a cyan-colored part of the fifth transmembrane helix TM5; residues 273–281 (5 × 39–5 × 461), in which the first and last residues are indicated by “273(5 × 39)” and “281(5 × 461)”, respectively, in the panel) and GβB (a part of bosentan surrounded by dotted line). λγ is defined by GγA (a green part of the seventh transmembrane helix TM7; residues 372–379 (7 × 37–7 × 46), in which the first and last residues are indicated by “372(7 × 37)” and “379(7 × 46)”, respectively, in the panel) and GγB (other part of bosentan surrounded by the other dotted line). Bound and unbound bosentans are displayed. Yellow residues are cysteines that form disulfide bonds at the roots of the N-terminal tail, GαA, and the β-hairpin, GαB. Transmembrane helices (TM1–7) are labeled in panel b. Transmembrane helices (TM1–7) are labeled for panel (b and c).

Recently, we introduced a generalized ensemble method, a multi-dimensional virtual-system coupled molecular dynamics (mD-VcMD) simulation11, which enhances conformational sampling with repetition of simulations (iteration) in a multi-dimensional (mD) reaction-coordinate (RC) space. By repeating the iterations, the sampled volume in the mD-RC space is expanded. The sampled conformations (snapshots) are then collected from the iterations. Importantly, mD-VcMD assigns a statistical weight (thermodynamic weight equilibrated at a simulation temperature) to each snapshot. Therefore, the ensemble of snapshots is equilibrated. Any equilibrated physicochemical quantity is computable from the ensemble if the quantity is expressed by the system’s coordinates. We designate this simulation method as the “original mD-VcMD method” or simply the “original method” in this paper. The original method was applied to a large and complicated system consisting of ET1 and hETB, where hETB was embedded in an explicit membrane and the membrane was immersed in an explicit solvent12. A benefit of mD-VcMD against other generalized ensemble methods is that RCs can be set arbitrarily to distances among atom groups. Consequently, this method can treat the molecular binding/unbinding process straightforwardly when RCs are set to distances between the two molecules (or between portions of the molecules). That is to say, enhancement of the motions of those RCs is related directly to molecular binding and unbinding.

Although the original mD-VcMD was applicable to the large and complicated system, we encountered an important difficulty: Once a poorly sampled region emerges in the mD-RC space in an iterative simulation, the conformation might be trapped in this region in subsequent iterations. Consequently, a more robust method is necessary to proceed with the sampling. To overcome this difficulty, we introduced a genetic algorithm-guided multi-dimensional virtual-system coupled molecular dynamics method (GA-mD-VcMD)13, which is an extension of the original mD-VcMD. We applied this method to some biological systems that consist of a protein and ligand in an explicit solvent1416.

From the original mD-VcMD applied to binding of ET1 to hETB, two binding mechanisms were identified: Fly casting1719 and conformational selection20. The receptor hETB has an intrinsically disordered N-terminal tail. The ET1 floating in solution was captured by the fluctuating N-terminal tail (fly casting mechanism) because the interactions between ET1 and the hETB’s N-terminal were attractive12. In fact, the root of the N-terminal tail is located at the gate of the binding pocket of hETB, which indicates that once ET1 is captured by the N-terminal tail, the ligand is restricted in a narrow volume around the gate of the binding pocket, resulting in an increased possibility that ET1 contacts the gate of the binding pocket. Next, only a fraction of various orientations of ET1 were allowed to enter the hETB pocket (conformational selection). This fractional entrance suggests that the pocket size is not sufficiently large for ET1 to rearrange the molecular orientation in the pocket, and that only ET1 with molecular orientations that are advantageous for the native complex formation is accepted.

For this study, we used GA-mD-VcMD simulation to sample interactions between the medium-sized molecule bosentan and hETB. Aside from the difference of ligand and simulation method, two differences exist between earlier studies and the present study. (1) We prolonged the intrinsically disordered N-terminal tail of hETB of the X-ray structure6,10, which were five and ten residues long for the earlier and the current studies, respectively12. This tail extension varies the features of the fly casting mechanism, as shown later. Because GPCRs have a long N-terminal tail in general21, this extension of the tail might support experimentation studies elucidating the role of the N-terminal tail of GPCRs22,23. Second, the initial simulation conformations for the present study were those in which bosentan was distant from hETB, whereas those for the previous study were the native-complex conformations of ET1 and hETB. In general, the difference of initial conformations influences the resultant conformational ensemble considerably. The reliability of findings obtained from the current study depends on the capacity of our method to sample native conformations. The native complex should be discovered as the lowest free-energy conformation out of many possible conformations. By contrast, in earlier studies, the native complex is always involved in the resultant ensemble, which means that the native complex might not necessarily correspond to the lowest free-energy conformation because a lower free-energy conformation might be overlooked in the sampling.

Our GA-mD-VcMD simulation produced a free-energy landscape of the bosentan–hETB system. Probability distribution functions of some quantities were calculated from the resultant conformational ensemble. Because the free-energy landscape covered both unbound conformations and the native-complex structures, and because the lowest free-energy conformation corresponded to the native-complex structure, we were able to analyze the complex formation comprehensively. We discuss the generality of the binding mechanism found in bosentan-hETB to ligand–GPCR binding. Because bosentan is a drug molecule, the present study helps drug development research not only by proposing the lowest free-energy conformations but also by investigating the binding process.

Methods

The aim of this work is investigation of the complex-formation process of bosentan and hETB using GA-mD-VcMD method. First, we briefly explain this method, which provides a conformational ensemble of the system and a thermodynamic weight at the simulation temperature T (300K in this study), which is assigned to conformations stored in the ensemble. Details of GA-mD-VcMD have been explained elsewhere1316. Next, we explain the molecular system which consists of hETB, bosentan, membrane (cholesterol and POPC lipid molecules), and solvent (water molecules and ions). Then, we describe multiple reaction coordinates that are introduced, along with the initial conformations used for simulation. Finally, we account for the method used to calculate the spatial density of bosentan around hETB.

We frequently use the term “native-like complex”. In general, an MD simulation is influenced by some force-field errors and sampling errors. Therefore, when a simulation is initiated from a completely unbound conformation, the simulation does not perfectly reproduce the experimentally observed conformation. For this reason, we designate the computed lowest free-energy conformation as “native-like complex conformation”, even when the conformation closely resembles the experimentally observed one.

However, we use the word “native complex conformation” for a computed conformation when a short simulation is done starting from the experimentally observed conformation because this simulation can be regarded as refinement of the experimentally observed conformation.

GA-mD-VcMD and a conformational ensemble

To sample both the unbound and bound conformations of the system extensively, GA-mD-VcMD controls the system motions in a multi-dimensional (mD) space constructed by multiple reaction coordinates (RCs). The definition of an RC is presented in Section 1 of Supplementary Information (SI) and in Figure S1. We introduce three RCs denoted as λ(h) (h=α,β,orγ), whereas the dimensionality of the RC space is arbitrary in theory13. Therefore, the word “multi-dimensional (mD)” designates three-dimensional (3D) for the present study. The actual definition for the three RCs is given later.

Conformational sampling of the system by GA-mD-VcMD at 300K produces a distribution (density) of the system in the mD-RC space (not in the real space). Because the conformational space of a biological molecular system is wide, the GA-mD-VcMD simulation is executed iteratively, by which the sampled volume of 3D-RC space is expanded. Here, the density at a 3D-RC position [λα,λβ,λγ] is denoted as Qcanoλ(α),λ(β),λ(γ), with the density computed from the M-th iteration expressed as Qcano(M)λ(α),λ(β),λ(γ) when necessary.

The simulation is terminated when a convergence criterion is satisfied for Qcanoλ(α),λ(β),λ(γ). The convergence is discussed in the “Results and Discussion” section. Details of the convergence are discussed in Section 5 of SI. Snapshots of the system are saved from all iterations. After convergence, a thermodynamic weight at simulation temperature T is assigned to each saved snapshot based on Qcanoλ(α),λ(β),λ(γ). Therefore, the snapshots construct a thermally equilibrated conformational ensemble (canonical ensemble) at T (300K).

Molecular system studied

We generated the molecular system consisting of bosentan, hETB, cholesterols surrounding hETB, the POPC bilayer where hETB is embedded, and an explicit solvent (water molecules, Na+ and Cl- ions). All of them were packed in a periodic boundary box. Details of system preparation are presented in Section 2 of SI. Figure S2 is a diagram outlining generation of the molecular system.

Two reference crystal structures are available for generating the molecular system for the simulation: The bosentan–hETB (PDB ID 5xpr; 3.6 Å resolution) and K8794–hETB (PDB ID 5x93; 2.2 Å) complex structures. Importantly, the two crystal structures are mutually similar to a considerable degree (Figure S3a) because K8794 is an analog of bosentan (Figure S3b). Because K8794–hETB complex has better resolution than bosentan–hETB complex, we used the K8794–hETB complex to prepare the molecular system.

First, we replaced K8794 with bosentan. This complex conformation is designated as “Complex structure of bosentan–hETB” in Figure S2. For crystallography, some mutations were applied to the amino-acid sequence of hETB to increase the diffraction data quality10. Then, we back-mutated these amino acids to those in the wild-type sequence of hETB. Finally, the computed hETB consists of residues 80–403. The N-terminal and C-termini of hETB were capped respectively by the acetyl and N-methyl groups denoted as Ace and Nme. Section 2 of SI presents details of system preparation.

Then the complex was embedded in the POPC bilayer. After four cholesterols were introduced into the hETB–membrane interface (Section 2 of SI), the system was immersed in a periodic boundary box filled with water molecules. The x-, y-, and z-coordinate axes for specifying the atomic positions were set parallel to the sides of the periodic box (the x–y plane was parallel to the membrane surface). The box dimensions were 71.33 Å (x-axis), 71.33 Å (y-axis), and 132.17 Å (z-axis). Finally, sodium and chlorine ions were introduced by replacing water molecules randomly with ions to neutralize the system at a physiological ion concentration of 130 mM. This conformation is shown as the “Complex structure in periodic box” in Figure S2. The total number of atoms of the system was 69,062, with the numbers of protein and bosentan atoms as 5289 and 68, respectively, and with four cholesterols (296 atoms), 127 POPC molecules (17,018 atoms), 15,440 water molecules (46,320 atoms), and 28 sodium and 43 chlorine ions.

After energy minimization of the above system, with subsequent NVT and NPT simulations, we obtained the complex structure shown in Figure S4, for which the resultant periodic-box size was 69.37 Å × 69.37 Å × 140.29 Å. This structure is represented as “Complex structure in Figure S4 (native-complex structure)” in Figure S2. Because this complex structure is a relaxed conformation of the X-ray complex structure, bosentan was located at a position close to that in the X-ray structures (PDB IDs: 5xpr and 5x93). Therefore, we designate this conformation as the “native complex structure” in the present study. It is noteworthy that this native complex structure is not the initial conformation of the current GA-mD-VcMD simulations. As explained later, we generated conformations in which bosentan was separated completely from the binding pocket of hETB. They were the conformations that we used as the initial simulation conformations of GA-mD-VcMD.

The residue ordinal numbers used for this paper are based on two sources: the original PDB numbering (PDB ID 5xpr) and the GPCRdb generic numbering24,25. The GPCRdb generic residue numbers are presented in parentheses. When a residue is not numbered in the GPCRdb generic method, only the original PDB residue number is shown. For instance, residues of the N-terminal tail of hETB are not numbered in the GPCRdb generic method.

Set of three RCs introduced to control system motions

Before describing generation of the initial conformations of GA-mD-VcMD, the set of three RCs (λh; h{α,β,γ}) introduced for the current study must be explained concretely. The setup of RCs is fundamentally important to increase the sampling efficiency, although RCs can be set arbitrarily in theory. We imposed three conditions on the RCs: the RCs can control (1) the gate opening and closing of the hETB binding pocket, (2) the ligand approaching to and departing from the receptor, and (3) the ligand orientation relative to the receptor.

Although various RCs can satisfy the three conditions presented above, we set them in a straightforward manner as shown in Fig. 1b,c. The exact specifications for the atom groups are given in the figure caption. We briefly explain the roles of the RCs. For condition (1), two atom groups GαA and GαB (Fig. 1b) are introduced to define the first RC λ(α), where the increase and decrease of λ(α) correspond to the opening and closing of the gate of the hETB’s binding pocket, respectively. We note that disulfide bonds are formed at the roots of the N-terminal tail, GαA, and the β-hairpin, GαB, as shown in Fig. 1b,c. This means that the roots of the N-terminal tail and β-hairpin are locked by the disulfide bonds and that other regions of hETB are only weakly influenced by the gate motions. Figure 1c shows atom groups GβA and GβB to define λ(β), as well as GγA and GγB to define λ(γ). Groups GβA and GβB are mutually close in the native complex; GγA and GγB are as well. Likewise, those atom groups are distant from each other in an unbound conformation. The increase and decrease of λ(β) and λ(γ) result in approaching and departing of bosentan to the hETB’s binding pocket: Condition (2) is established. Finally, we note that bosentan rotates with respect to hEETB when λ(β) decreases concomitantly with increasing λ(γ): Condition (3) is established. Figure 1a portrays a closeup image of bosentan to clarify GβB and GγB.

These atom groups are set as independent of the chemical properties of the ligand and receptor. The atom groups are set conveniently to establish the three conditions (1), (2), and (3) above. As described later, this set of RCs produced various gate–widths of the hETB’s binding pocket as well as various bosentan conformations relative to hETB.

One might infer that enhancement of the conformational motions in the 3D RC space affect the resultant conformational distribution and snapshots because a bias is introduced for the enhancement13. This possible effect of bias is true for conventional MD simulation. In GA-mD-VcMD, however, the bias effect is removed entirely from the resultant ensemble using a reweighting technique13.

Initial conformations for GA-guided mD-VcMD

We explain generation of the initial conformations of GA-mD-VcMD. Our aim is to explore the conformational space from unbound to bound states. Consequently, the initial conformations should not be the native complex structure but should be a completely unbound conformation. Because we had only the native complex structure (Figure S4) at this stage, it was necessary to generate the unbound conformations from the complex structure. For this purpose, we performed ten simulations (called “separation simulations”) starting from the native complex structure with imposition of three requirements to RCs to the system’s conformation: 24.0 Å λα 25.0 Å, 60.0 Å λβ 65.0 Å, and 60.0 Å λγ 65.0 Å. Ten separation simulations were run at 300 K for 5×106 steps, with a time step of 2 fs. The three RCs for the native complex structure are λ(α)=8.22 Å, λ(β)=8.35 Å, and λ(γ)=6.21 Å. During the separation simulations, bosentan moved from the binding pocket to unbound conformations. The resultant conformations are presented in Figure S5, where bosentans are separated completely from hETB, which are presented as “Unbound conformations in figure S5” in Figure S2.

Next, for additional relaxation of those unbound conformations, we performed 2000 runs: 200 (1×106 steps with a time step of 2 fs at 300 K) runs starting from each of those ten unbound conformations with different random seeds to the initial atomic velocities. For the 2000 runs, we applied the following three restrictions to RCs: λα25.0 Å, λβ65.0 Å, and λγ65.0 Å. The upper limits were introduced to prevent bosentan from flying further away from hETB in the periodic box. We refer to those 2000 runs as “diffusion simulations”. The last snapshots from the 2000 runs were then used as the initial conformations of the GA-mD-VcMD simulations. Figure S6 shows the positions of the last snapshots of the simulations in the 3D-RC space and the native complex structure in the 3D-RC space. Those conformations are represented as “Distributed conformations in figure S6” in figure S2.

It is noteworthy that Figure S6 demonstrates that the conformations sampled in the diffusion simulations did not reach the native complex structure. We emphasize that none of the current results was influenced by the structural features of the native complex because the 2000 initial conformations are those diffused freely from the ten completely unbound conformations (Figure S5).

GA-mD-VcMD simulations

As described in the “Introduction” section, GA-mD-VcMD enhances conformational sampling in the 3D-RC space. The thermodynamic weights of sampled conformations at the simulation temperature (300 K) are computed using the converged distribution Qcanoλ(α),λ(β),λ(γ).

In the original mD-VcMD12 and GA-mD-VcMD13, each RC axis λh (h{α,β,γ}) is divided into small 1D zones (originally designated as “virtual zones”13). Accordingly, the 3D-RC space is divided into small 3D zones. Table S2 lists the actual division of each 1D axis into zones. During a short time-interval Δτtrns of simulation, the system’s conformation freely moves only in a single 3D zone with a restoring force outside the zone. Then, an inter-zone transition from the zone to one of neighboring zones is attempted at every time interval. For the present study, we set Δτtrns=20ps, which corresponds to 1×104 simulation steps, with a time step of 2fs. The inter-zone transition probabilities are determined using Qcano12,13. In this way, the conformation moves in a wide 3D-RC space by inter-zone transitions.

We performed Nrun (= 2000) runs at 300K in parallel starting from different conformations in each iteration to increase the sampling efficiency further. A simple integration of the Nrun trajectories can be regarded as a single, long trajectory by connecting the trajectories in arbitrary order26,27. The Nrun initial conformations for the first iteration of GA-mD-vcMD are the last conformations from the diffusion simulations (Figure S6). We obtained snapshots from the first to M-th iterations when the M-th iteration had finished. The Nrun initial conformations for the (M+1)-th iteration were selected from those snapshots so that they distribute as evenly as possible in the 3D-RC space. However, when a certain RC region was poorly sampled, we selected more snapshots from or near the poorly sampled RC region than from well-sampled RC regions13.

We set the simulation length of each run to 1×106 steps (1×106×2fs=2.0ns). Consequently, the total length for an iteration was 4.0μs (= 2.0ns×2000). We performed 55 iterations for the present study, which corresponds to 220.0μs (4.0μs×55) in all. A snapshot was stored every 1×105 steps (0.2ns) in each run. Therefore, 1.1×106 (220.0μs/0.2ns) snapshots were stored in all. We calculated the distribution functions of various quantities at the simulation temperature (300K) from the resultant ensemble of snapshots.

The GA-mD-VcMD algorithm13, which was initially available in a MD simulation program omegagene/myPresto28, has been implemented for the current study in GROMACS29 using the PLUMED plug-in30. The simulation conditions were the following: LINCS31 to constrain the covalent-bond lengths related to hydrogen atoms, the Nosé–Hoover thermostat32,33 to control the simulation temperature, the zero-dipole summation method3436 to compute the long-range electrostatic interactions accurately and quickly, a time-step of 2 fs (Δt=2 fs), and simulation temperature of 300K. All simulations were performed on the TSUBAME3.0 supercomputer at the Tokyo Institute of Technology using GP-GPU.

The force fields used are AMBER99SB-ILDN37 for hETB and AMBER LIPID1438 for the cholesterol and POPC lipid molecules, with TIP3P39 for water molecules and Joung–Cheatham model40 for ions (chloride and sodium), respectively. The bosentan force field was prepared as follows: First, the bosentan atomic partial charges were calculated quantum-chemically using Gaussian0941 at the HF/6-31G* level, followed by RESP fitting42. Then, the obtained partial charges were incorporated into a general amber force field (GAFF) force-field file43, which was designed to be compatible with conventional AMBER force-fields. The chemical structure of bosentan is shown in Fig. 1A.

Spatial density of bosentan around hETB

GA-mD-VcMD simulation produces a canonical ensemble of conformations (snapshots) and assigns a thermodynamic weight (300K) to each snapshot. Therefore, using the ensemble, one can calculate a spatial density of the bosentan centroid around hETB at 300 K using the ensemble. Details of methods used to calculate the spatial density are presented in Subsection 3.1 of SI.

We briefly explain the notation used to express the spatial density. The 3D real space (not the 3D-RC space) is divided into cubes (3 Å × 3 Å × 3 Å), with centers denoted as rcube. The spatial density ρCMb(rcube) is the probability of detecting the bosentan centroid rCMb in the cube centered at rcube.

Results and discussion

We repeated 55 iterations of GA-mD-VcMD while saving 1.1×106 snapshots from the first to 55th iterations in all, as described in Methods. First, we show that the GA-mD-VcMD produced a distribution of the system that covered both the unbound and native-like complex conformations. We next show that the highest-density spot (the lowest free-energy spot) corresponds to the native-like complex structure. We then analyze the binding process from the unbound to intermediates, and from the intermediates to the native-like complex. For this purpose, the binding process is divided into three stages: the early stage of binding, where bosentan is outside the binding pocket; the second stage, where bosentan is passing the gate of the binding pocket; and the last stage, where bosentan is moving in the binding pocket.

Conformational distribution in 3D-RC space

GA-mD-VcMD produced the distribution Qcanoλ(α),λ(β),λ(γ) equilibrated at 300K in the 3D-RC space (Figure S8). Importantly, the distribution covered both the unbound and bound conformations. The native-complex structure was located at the periphery of the lowest free-energy basin. Section 4 of SI explains the features of this distribution.

The convergence check of the distribution is argued in Section 5 of SI with a function, Elocal(λα,λβ,λγ). This function was introduced originally in an earlier report of a study13. The value of this function is calculated at zones (λα,λβ,λγ) in each iteration. Simply, when an inequality Elocal(λα,λβ,λγ)<0.25 is satisfied at a zone, it is judged that Qcano is determined accurately at the zone. The validity of this threshold value has been checked repeatedly1416.

Figure S9 shows that iterations 1–10 (Figures S9a–c) sampled a large volume of the 3D-RC space involving the unbound and bound conformations. Therefore, the simulation could be halted at the tenth iteration. However, to obtain snapshots sufficient for analysis, we continued the simulation up to the 55th iteration. A snapshot of every 0.2ns (100,000 steps) of the simulation was stored.

Next, we compared the distribution Qcanoλ(α),λ(β),λ(γ) with that obtained from an earlier study of the ET1–hETB system12. Section 6 of SI presents details of the comparative method. Figure S10 shows that the free-energy slope along the λ(α) axis (the motion along the gate opening and closing of the hETB’s binding pocket) is gentler than those along the λ(β) or λ(γ) axis (bosentan’s approaching and departing motions to hETB). This result agrees with that obtained from the earlier study. However, the slope in the current bosentan–hETB landscape was about one-tenth of that in the previous ET1-hETB landscape.

From the earlier study12, we inferred that the steep slope is probably caused by a shortcoming of the update method of the probability distribution Qcano in the 3D RC space (i.e., Qcano(M)Qcano(M+1)) and that the steep slope would be weakened if two pieces of QcanoM(λ(α),λ(β),λ(γ)) and QcanoM(λα,λβ,λγ) were connected adequately, where (λα,λβ,λγ) and (λα,λβ,λγ) are mutually neighboring zones in the RC space. GA-mD-VcMD provides such a method to connect Qcano(M) smoothly13. In fact, the free-energy slope decreased in the present study significantly.

Three-dimensional (3D) spatial distribution of bosentan around hETB

Whereas GA-mD-VcMD samples the mD-RC space, the conformational distribution in the mD-RC space does not always provide a concrete or intuitive image for the molecular conformation of the system. Therefore, we converted the distribution Qcanoλ(α),λ(β),λ(γ) (Figure S8) to the conformational distribution ρCMb(rcube) defined in the 3D real space using a thermodynamic weight assigned to all the sampled snapshots using Eq. 31 of that earlier study13. Subsection 3.1 of SI presents the method to calculate ρCMb(rcube). Figure 2 portrays the spatial patterns of ρCMb(rcube) of the bosentan centroid rCMb around hETB. Importantly, the highest-density spot (the red contours; ρCMb(rcube)0.5) well agreed with the position of bosentan centroid in the native complex structure. This result is important because the reliability of the simulation data decreases unless the highest-density spot is assigned to the native bosentan position.

Figure 2.

Figure 2

The spatial density ρCMbrcube of the bosentan centroid around hETB. The iso-density map is presented at three contour levels with different colors, as shown in the inset. The molecular structure is the native complex (Figure S4) omitting solvent. Bosentan is shown as a cyan-colored stick model, with its centroid shown as a small black sphere. N-terminal tail and β-hairpin of hETB are indicated with labels. Red-colored contours are named “highest-density spots”. Blue-colored ones (ρCMbrcube0.1) well specify the binding pocket of hETB. Magenta-colored regions above the pocket are designated as “hat”. Cyan-colored broken-lines represent the boundary, the hat–pocket boundary, which is called “gate” of the binding pocket. Figure S11, presents ρCMbrcube with addition of two low-density contours of ρCMbrcube0.001 and 0.0001. Transmembrane helices (TM1–7) are labeled.

The binding pocket corresponds to the region of ρCMb(rcube)0.1 (blue contours), which involved the highest-density spot. Outside of the binding pocket was characterized by a lower density of ρCMb(rcube)0.01 (magenta-colored ones), which was named a “density hat” or simply “hat”. We denote the hat–pocket boundary (cyan-colored broken line in Fig. 2) as the “gate” of the binding pocket. We emphasize that the gate is not defined by residues of hETB but rather assigned to the boundary between the low- and high-density regions of ρCMb(rcube). In other words, the gate is a quantity that is dependent on the density field.

Figure S11 depicts the spatial density for the whole sampled region. Bosentan was distributed even in regions that are distant from hETB, as evidenced by the low density: ρCMb(rcube)0.0001. The bosentan density increased rapidly with approach to the binding pocket from the far region. This figure demonstrates that GA-mD-VcMD is powerful for sampling the wide conformational space efficiently.

RMSD of bosentan between the snapshot and the native complex

As presented above, we showed visual agreement of the computed highest-density spot and the bosentan native-complex position. Here, to analyze the agreement quantitatively, we calculated the root-mean-square-deviations RMSD of bosentan of two types between a snapshot and the native complex: RMSD=RMSDwhole or RMSD=RMSDcore. RMSDwhole is calculated using all the heavy atoms in bosentan. Also, RMSDcore is calculated using the heavy atoms in the bosentan core region. The definition of the core region is presented in Fig. 3a.

Figure 3.

Figure 3

(a) Bosentan’s “core region”, which is surrounded by a red broken line. The core region located at the center of bosentan can be regarded approximately as a rigid body. Therefore, the core region is a representative part to express the bosentan position and orientation. (b) Distance distribution function PDDFRMSD and (c) radial distribution function PRDFRMSD are presented for both RMSDwhole and RMSDcore. Definitions of RMSD, PDDF, and PRDF are given in Subsection 3.2 of SI.

The ten heavy atoms comprising the core region are located on a plane and are connected firmly by covalent bonds. Consequently, this region can be regarded as a rigid body approximately. Furthermore, this region is located at the bosentan center surrounded by four sidechains. Therefore, the core region is a representative part for expressing the bosentan position and orientation.

We calculated two probability distribution functions to analyze RMSD: the distance distribution function PDDF(RMSD) and the radial distribution function PRDF(RMSD). Subsection 3.2 of SI presents details of PDDF and PRDF. The resultant distributions are presented in Fig. 3. For the whole bosentan, the highest-peak positions of PRDF and PDDF are, respectively, at RMSDwhole3.25 Å and RMSDwhole5.75 Å. Similarly, for the core region, those are respectively at RMSDcore2.25 Å and RMSDcore2.75 Å. The peak positions for the core region were smaller than those for the whole bosentan because of the disorder of the longest sidechain of bosentan, as explained later.

Bosentan–tail contact ratio

The structure of the N-terminal tail is not determined in the X-ray structures (PDB IDs: 5x93 and 5xpr) because the tail is intrinsically disordered. We presume that the tail might affect the binding process because the root of the tail is located at the entrance of the hETB binding pocket. Results of some experimentation studies have indicated that an extracellular loop of GPCRs plays a role as a lid to block the gate of the binding pocket44,45.

Figure S7b highlights the gate of the hETB binding pocket. As this figure shows, half of the gate, which is covered by the white solid-line rectangle, is closed by the contact of the N-terminal tail and the β-hairpin. The other half of the gate, which is covered by the white broken-line rectangle, is open. The definition of the N-terminal tail and the β-hairpin is given in the caption of Figure S7b. One can readily imagine from this figure that the gate situation can vary during tail fluctuations. Below, we investigate the tail fluctuations and the bosentan–tail contact.

First, to visualize fluctuations of the N-terminal tail, we calculated a spatial density function ρCMNt(rcube) for the N-terminal tail tip. We also calculated the spatial density function ρCMβh(rcube) for the tip of the β-hairpin because the β-hairpin is the contact partner of the N-terminal tail to open and close the gate of the binding pocket. Figure S12a exhibits that the N-terminal tail fluctuates widely with varying its orientation, which is a natural property of the intrinsically disordered segment. In contrast, the β-hairpin fluctuation is highly limited in a narrow volume, as shown in Figure S12b. Therefore, the gate opening and closing is induced by the large motions of the N-terminal tail (not by the fluctuation of β-hairpin).

Next, to investigate the bosentan–tail contact, we introduced a “cube-based bosentan–tail contact ratio” denoted as c¯b-N(rcube). It is noteworthy the bosentan centroid rCMb in a snapshot was assigned to a cube at rcube, which involved rCMb. Then c¯b-N(rcube) is the ratio by which bosentan in rcube contacts to the N-terminal tail. The maximum and minimum of c¯b-N(rcube) are, respectively, 1.0 and 0.0. Detailed procedures used to calculate c¯b-N(rcube) are presented in Subsection 3.3 of SI.

Figure 4a presents the spatial patterns of c¯b-Nrcube at three contour levels: c¯b-Nrcube0.1, 0.7, and 0.9. The low-contact contours, c¯b-N0.1, extended to regions distant from the gate of the binding pocket, mean that the N-terminal tail contacted bosentan even when bosentan was floating in solution. With bosentan approaching the pocket, the contact ratio increased quickly and reached c¯b-N0.7. Because the region of c¯b-N0.7 involved the density-hat region (ρCMb0.01; magenta-colored contour regions of Fig. 2), the contact might act as an attractive interaction. The high-contact region of c¯b-N0.9 (magenta-colored contours in Fig. 4a) was located immediately above the pocket, not in the hETB binding pocket. It is particularly interesting that c¯b-Nrcube was considerably small in the pocket. The high contact between the tail and bosentan right outside the pocket, followed by loss of contact once bosentan is in the pocket, indicates that the role of the N-terminal tail is the recruiting of bosentan into the pocket.

Figure 4.

Figure 4

(a) Spatial patterns of cube-based bosentan–tail contact ratio c¯b-Nrcube at three contour levels presented in the inset. The “hat” region, “pocket”, and “bosentan in native complex” are also shown. (b) Spatial patterns viewed from above the pocket. The structure which is presented is the native complex. Transmembrane helices (TM1–7) are labeled.

It is worth recalling that the gate-closing occurs because of the contact of the N-terminal tail and the β-hairpin (Figure S7b). In other words, gate-opening is induced by the N-terminal tail fluctuating in solution away from the gate. Figure 4 shows that the regions of c¯b-N0.7 are widely extended far from the binding pocket. The gate-opening and closing, the large fluctuations of the N-terminal tail, and the bosentan–tail contact formation are mutually related.

Bosentan slides along the N-terminal tail upon binding

As shown in Fig. 4, the cube-based bosentan–tail contact ratio depends on the relative position of bosentan to hETB in the 3D real space. Here, we quantify the bosentan position relative to hETB by quantity rbb, which is the distance between the bosentan centroid, rCMb,i, in snapshot i and that in the native complex rCMbref: rbb=|rCMb,i-rCMbref|. Figure S7c presents rbb in a snapshot. Smaller rbb is associated with closer bosentan to its natively binding position. Then, we divided the rbb axis into 13 slices with thickness of 5 Å, which are designated respectively as Δk (k=1,,13): The lower and upper boundaries of Δk are given respectively as 5(k-1) and 5k (unit is Å). Table S3 presents the slice range exactly. As explained in Section 7 of SI and in Figure S13, the boundary between slices Δ4 and Δ5 coincides well with the gate of the binding pocket, which was introduced as the boundary between the blue contour region (ρCMb(rcube)0.1) and the magenta-colored contour region (ρCMb(rcube)0.01) in Fig. 2. By this arrangement, four slices Δ1,…, Δ4 are in the binding pocket, and slices Δ5,…, Δ13 are outside the pocket.

Figures S14S17 present conformations selected from some slices. Section 8 of SI explains some features of those selected conformations. These figures visually imply that the contact site on the N-terminal tail to bosentan tends to move with rbb motion. To verify this tendency quantitatively, we introduced a “residue-based bosentan–tail contact ratio” ρcntΔkj, which is the contact ratio of residue j of the N-terminal tail to bosentan in a slice Δk. Then, we converted ρcntΔkj to a potential of mean force (PMF) FΔkj (equation S14 of Section 9 of SI). One can imagine a 2D plane constructed by the residue ordinal number j and the slice Δk. The lower the FΔkj at a site [j,Δk] in the 2D plane, the stabler the contact becomes.

Figure 5 shows FΔkj in the 2D plane as the free-energy landscape. This figure indicates the existence of two free-energy basins, which are denoted as H1 and H2 in the figure. We discuss the binding process assuming that bosentan is approaching hETB from a far solvent region to the binding pocket. In Δ13 (60  Å rbb<65 Å), the N-terminal tail does not touch bosentan: ρcntΔ13j=0 (j). In Δ12 (55 Å rbb<60 Å), bosentan first touches Ace (N-terminal capping group) of the N-terminal tail. Because the system moves generally toward a low free-energy region, possible transition paths are indicated by black arrows in Fig. 5, which lead the conformation to basin H1. Those motions decrease rbb and the bosentan–tail contact firm, thereby decreasing the free energy.

Figure 5.

Figure 5

Residue-based bosentan–tail contact ratio, ρcntΔkj, converted to PMF: FΔkj=-RTlnρcntΔkj (see Section 9 of SI), where j is the residue ordinal number (x-axis) of hETB’s N-terminal tail, and where Δk is the k-th rbb slice (y-axis), where the respective lower and upper boundaries are 5k-1 and 5k. Unit is Å. PMF value is presented by color (see color bar). White is assigned to unsampled regions or regions for which the PMF is larger than 5 kcal/mol. The sequence of the N-terminal tail is Ace, Ser80, Pro81, Pro82, Arg83, Thr84, Ile85, Ser86, Pro87, Pro88, Pro89, where Ace is an acetyl group introduced to cap the N-terminal tail and where the residue number of 79 is assigned to Ace in the X-axis of the figure. Also, Δk is the range of the rbb slice given in Table S3. Region H1 ranges from Ser80 to Arg83 in x-axis and from Δ7 to Δ5 in y-axis. H2 does from Ile85 to Pro89 and from Δ4 to Δ3. The meanings of arrows are described in the text. The gate of the binding pocket is defined by boundary (thin line) between Δ4 and Δ5 and the binding pocket is the region below the gate (Δ4, Δ3, Δ2, and Δ1).

With further decrease of Δk, the stable region moved from the basin H1 to H2, i.e., the system moves from the tip region to the root region of the N-terminal tail. This move is natural because basin H2 is lower than basin H1. One can recall that the boundary between Δ5 and Δ4 corresponds to the gate of binding pocket. Conformational changes from H1 to H2 can occur via multiple pathways indicated by the red arrows in Fig. 5. Because the sites to which the red arrows are assigned had a larger free energy than either H1 or H2, the H1-H2 basin switch requires that the free-energy barrier be overcome.

In Δ2 and Δ1, which are located at the bottom of the binding pocket, the bosentan–tail contact is disrupted. This disruption does not mean that the complex itself is dissociated but that the complex is stabilized without the bosentan–tail contact because the native-like complex conformations are stabler than the other state, as shown in Figs. 2 and 3. We also observed a specific contact at Arg83 in Δ2 and Δ1. Nevertheless, this specific contact represents a weak interaction because the depth of free energy at Arg83 is shallow (Fig. 5).

The importance of the long N-terminal tail is demonstrated by comparing the current simulation with the simulations from GPCRmd46, which is a database of various MD simulations of GPCRs (https://submission.gpcrmd.org/home/). In the unbound-form MD simulation of hETB in GPCRmd (https://submission.gpcrmd.org/view/886), the N-terminal tail of hETB fluctuates greatly in solvent without forming a specific conformation. This result agrees with those obtained from our study. By contrast, in the bound-form simulation of hETB from GPCRmd (https://submission.gpcrmd.org/view/887), the N-terminal tail contacted frequently to a ligand, which bound in the deep binding pocket of hETB. Figure 5 shows that the bosentan-tail contact disappeared when bosentan moved to the bottom of the binding pocket in our simulation. This discrepancy of the tail conformation in the bound form between our simulation and the simulation from GPCRmd is caused simply by a difference in the length of the tail used for the two simulations. The N-terminal tail in the current simulation is six residues longer than that of the GPCRmd simulation. The longer N-terminal tail cannot be packed into the binding pocket of GPCR. We presume that our simulation is more realistic than the bound-form simulation of GPCRmd because GPCRs have a long N-terminal tail in general. Therefore, this comparison of the two simulations demonstrates the importance of the length of the N-terminal tail.

Bosentan–tail non-specific contacts

Figure 5 does not show whether the bosentan–tail contact is specific or non-specific. To investigate the bosentan atom specificity, we calculated an “atom–residue contact ratio” θcntΔki,j which is the contact ratio of a heavy atom i of bosentan contacting residue j of the N-terminal tail in each slice Δk. The exact definition of θcntΔki,j is presented in Section 9 of SI. Figure 6 shows θcntΔki,j in each slice. Normalization is done for each panel of Fig. 6 to highlight the atom-specificity (color graduation) in each slice. Consequently, the existence of basins H1 or H2 is not recognized in Fig. 6.

Figure 6.

Figure 6

Atom–residue contact ratio θcntΔki,j, which is the contact ratio of heavy atom i of bosentan to residue j of N-terminal tail in each slice Δk. Its exact definition is given in equation S15 of SI. Panels (al) respectively show θcntΔ12i,j,,θcntΔ1i,j. (m) Atom ordinal numbers assigned to bosentan's heavy atoms.

Figure 6a shows that a few atoms of bosentan rarely contact Ace of the N-terminal tail in Δ12. Figures 6b–d depict that all the heavy atoms of bosentan contact non-specifically to the tip of the N-terminal tail in Δ11-Δ9. Figures 6e–h again show that bosentan contact is non-specific in Δ8-Δ5, where all residues of the N-terminal tail contact to bosentan. Up to Δ12-Δ5, bosentan is outside of the binding pocket. The non-specific contact is still found in Δ4 (Fig. 6i) and Δ3 (Fig. 6j), although Δ4 and Δ3 are in the binding pocket.

In Δ2 and Δ1 (Fig. 6k,l, respectively), by contrast, Arg83 of the N-terminal tail contacts specifically to atoms 1–13 of bosentan, as shown in Fig. 6m. This specificity is consistent with the depiction in Fig. 5. Atoms 1–10 and 11–13 are associated respectively with the hydrophobic and hydrophilic groups. Figure S18 presents some conformations that involve the specific bosentan–tail contacts. Generally, the hydrophobic sidechain stem of Arg83 of the tail and the hydrophobic group (atoms 1–10) of bosentan share some contact, as do the nitrogen atoms of the sidechain tip of Arg83 and oxygen atoms of sulfonamide of bosentan.

As shown in Figs. 4 and 5, the bosentan–tail contact tends to be dissociated when bosentan proceeds to the bottom of the binding pocket (Δ1 and Δ2). Consequently, in many snapshots depicting Δ1 and Δ2, the N-terminal tail does not contact bosentan. If the bosentan–tail contact has a role to capture bosentan in solution, then we consider that the contact in the binding pocket has no role for molecular binding.

Fly casting with ligand sliding

Here we check whether the bosentan–tail contact acts as an attractive interaction. First, we calculated a probability distribution function PΔk(rb-N) regarding the minimum heavy atomic distance rb-N from bosentan to the N-terminal tail for snapshots, which are involved in a slice Δk. Then, PMF regarding rb-N is defined as PMFΔkrb-N=-RTln[PΔk(rb-N)]. In fact, the effective interaction between two molecules is contributed by many elemental interactions. To estimate this interaction comprehensively, PMF is the best quantity. This quantity involves not only direct intermolecular interactions but also indirect interactions (ligand–solvent and solvent–receptor interactions). This quantity also involves effects of molecular motions (i.e., conformational variety, or entropy).

Figure 7 presents PMF for each slice. Apparently, the bottom of the lowest free-energy basin is assigned to a noncontact state in slices Δ12Δ10 (rbb45 Å) (Fig. 7a–c). Actually, PMF for Δ13 is not shown because a bosentan–tail contact was not detected in Δ13. In contrast, the lowest free-energy basin is assigned to a contact state (rb-N3.5 Å) in Δ9Δ2 (Fig. 7d–k), which indicates that the bosentan–tail contact acts effectively as an attractive interaction in the range of 45 Å >rbb 5 Å.

Figure 7.

Figure 7

Potential of mean force PMFΔkrb-N for the bosentan–tail distance rb-N in slice Δk (k=1,12). PMFΔkrb-N is defined in the main text. Panels (al) respectively show PMFΔ12rb-N,,PMFΔ1rb-N. Also, PMFΔ13rb-N is omitted because PMFΔ13rb-N is similar with PMFΔ12rb-N.

The analysis described above showed that a bosentan–tail attractive interaction exerted from Δ9 (45 Å =rbb) between the N-terminal tail and bosentan. Together with the residue sliding from the tip side to the root side with decreasing rbb, we designate this binding mechanism “Fly Casting with ligand sliding”.

Bosentan–membrane contact

Bosentan occasionally contacts the membrane (Figures S14e, S15d and S15e). To investigate the bosentan–membrane contact, we introduced a quantity, a “bosentan–membrane contact ratio” c¯b-mrcube, that is calculated with a similar procedure with c¯b-Nrcube. Subsection 3.3 of SI provides details of the calculation procedure. The bosentan–membrane contact was formed commonly on the membrane surface (Figure S19). Some GPCR ligands are known to diffuse from the lipid bilayer to reach the binding pocket of GPCRs rather than approaching the pocket from the solvent side47,48. From the present simulation, however, we were unable to find such a ligand-diffusive motion of bosentan in the membrane, whereas we observed that bosentan sank in the membrane (Figure S14e). One might consider that the absence of bosentan diffusive motions in membrane might be due to insufficient setting of the upper boundaries of λ(β) and λ(γ) (68.47 Å and 67.47 Å, respectively; Table S2). However, Figure S11 of SI shows that the upper boundaries were sufficiently large for bosentan to move in the membrane if no interaction is applied between bosentan and the membrane. Therefore, we conclude that bosentan mainly approaches the pocket from solvent. Because the membrane prevents bosentan from moving to the cytoplasmic side, the membrane has a role in restricting the phase volume for bosentan to move. This volume restriction might facilitate binding of bosentan to the receptor’s binding pocket49.

Orientational selection when bosentan enters the binding pocket

We investigated the molecular orientation of bosentan around hETB. First, we defined the bosentan molecular orientation by two vectors (orientation vectors) v and v, which are fixed in the core region of bosentan (Figure S20a), which are approximately perpendicular to one another. Figure 3a gives the exact definition of the core region. Then, we normalized the orientation vectors as |vα|=1 (α=or). Because the bosentan core region is structurally stiff, the two orientation vectors are convenient quantities for defining the bosentan molecular orientation. In fact, in general, two nonparallel vectors can specify the orientation of a plane (the central ring of the core region) in 3D real space. Figure S20b presents an example of the vectors in the native complex and in a snapshot. Next, we assigned the orientation vectors to a cube rcube, in which the bosentan centroid was detected. Then, Ω in equation S7 of Subsection 3.4 of SI is replaced by vα. The resultant quantity Ω¯rcube is designated as v¯αrcube in the equation. We designated v¯αrcube as an “averaged orientation vector”. Because vα in a snapshot was normalized, the maximum of v¯α is unity when all the vectors detected in the cube rcube are perfectly ordered. The minimum is 0.0 when the vectors are randomized completely.

Figures 8a,b portray spatial patterns of v¯αrcube, in which orientation vectors with large norms (v¯α>0.3) are shown. Importantly, the large-norm vectors were found mainly in the binding pocket. Therefore, the bosentan orientation is ordered in the pocket, although it is disordered outside the pocket. This finding suggests that configurational entropy of bosentan decreases quickly when bosentan passes the gate of the binding pocket. It is worth recalling that the free-energy basin of FΔkj also switched quickly from H1 to H2 at the gate (Fig. 5). In fact, the decrease of entropy results in increased free energy. This entropic decrease should be compensated by another thermodynamic factor. Otherwise, bosentan in the binding pocket is destabilized, and the density ρCMb(rcube) in the binding pocket decreases. We show in the next section that intermolecular native contacts, which act as attractive interactions between bosentan and hETB, are formed in the binding pocket. Consequently, those intermolecular attractive interactions (enthalpy decrease) compensate the entropy loss.

Figure 8.

Figure 8

Spatial patterns of averaged orientation vectors (a) v¯rcube and (b) v¯rcube. A vector is colored differently by its norm v¯αrcube (α= or α=) (inset). Vectors with v¯αrcube<0.3 are omitted. (c) Spatial patterns of scalar product SPrcube (= ev¯rcube·vref) and (d) those of SPrcube (= ev¯rcube·vref), where ev¯αrcube and vαref are defined in main text. Regions of SPαrcube0.5 and SPαrcube-0.5 are colored differently by their values (inset). For all panels, the hETB structure is from the native complex. Small black spheres are bosentan centroid in the native complex. The binding pocket is shown as a broken-line circle in all panels. Transmembrane helices (TM1–7) are labeled for panel c. Positions of the transmembrane helices for the other panels are presumable from panel c because hETB is shown from similar directions for all panels.

However, Fig. 8a,b do not necessarily mean that the bosentan orientations resemble those in the native-complex position. Then, we check the orientation similarity as follows: First, we calculated v and v for the native complex, which are designated respectively as v(ref) or v(ref). Next, we calculated a scalar product as SPαrcube=ev¯αrcube·vα(ref), where ev¯αrcube is a unit vector that is parallel to v¯αrcube. If ev¯αrcube is exactly parallel to vα(ref), then SPαrcube=1; if v¯αrcube is exactly anti-parallel to vα(ref), then SPαrcube=-1. If ev¯αrcube and vα(ref) are mutually perpendicular, then SPαrcube=0. Therefore, the spatial pattern of SPαrcube is a useful measure to assess similarity between v¯αrcube and vα(ref).

Figure 8c demonstrates that the major volume of the binding pocket was occupied by SPrcube0.5, which indicates similar orientations of v¯ and v(ref). For v¯, a large volume of the pocket was also characterized by SPrcube0.5 , whereas a minor part (cyan-colored contours) of the pocket was characterized by SPrcube-0.5 (Fig. 8d). This minor part does not cover the bosentan native position at all. Therefore, we can infer that bosentan enters the pocket with limited orientations that are beneficial for reaching the native-complex position because the pocket is too narrow for bulky bosentan to flip. We designate this possible mechanism as “orientational selection”.

Bosentan–hETB contacts and ring-C packing at the bottom of binding pocket

In the above section, we suggested that the entropy loss (decrease in the orientiational variety of bosentan) should be compensated by enthalpy decrease for the stabilization of bosentan in the binding pocket. To investigate this stabilization effect, we analyze attractive interactions (atomic contacts) of bosentan with amino-acid residues of hETB in the binding pocket.

First, we observe common intermolecular interactions in the three X-ray structures: the bosentan–hETB10, ET1–hETB6, and the K8794–hETB10 complexes. In fact, the same binding scheme can be observed in the three complex structures (Figure S21). By viewing the three complexes closely, we identified four intermolecular native atomic contacts (interatomic distance less than 5 Å in the X-ray structures), which exist commonly in the three complex structures as attractive interactions, as shown in Section 10 of SI, Figures S22a, S22b, and Table S4: contact (1) between oxygen atoms in the sulfonyl group (SOO) of the bosentan’s sulfonamide and the nitrogen (Nζ) of Lys 182 (3 × 33); (2) between the oxygens in bosentan’s SOO and the nitrogen of Lys 273 (5 × 39); (3) between the oxygens in bosentan’s SOO and nitrogen atoms of Arg 343 (6 × 55); and (4) between the nitrogen atom (N14) of the bosentan’s sulfonamide and the nitrogen (Nζ) of Lys 182 (3 × 33). The actual interatomic distances for the four contacts are listed in Table S4 for the three X-ray structures: Most of contacts are around 3 Å, with the sole exception of 4.56 Å for (2) in the bosentan–hETB complex. Figure S22c enumerates all intermolecular interactions in the bosentan–hETB complex.

Contacts (1)–(3) are associated with salt bridges. Contact (4) can be associated either with a salt bridge or with a hydrogen bond because we cannot infer whether a hydrogen atom is bound to N14 of the bosentan’s sulfonamide in the X-ray crystallography, or not. Nevertheless, it is noteworthy that both the salt bridge and the hydrogen bond are attractive interactions. For the present study, we used the neutral form for bosentan, in which a hydrogen is attached to N14 (Fig. 1a). This ambiguity of protonation is discussed at the end of the “Results and Discussion” section.

The native atomic contacts introduced above are appropriate quantities to analyze the stabilization effect (enthalpic effect) that compensates the entropic disadvantage (decrease of molecular orientation). As more native contacts form in the complex, the attractive interactions become stronger while bosentan’s structural entropy decreases.

To analyze the intermolecular distances of snapshots, we introduce the notation of rα;β, which is a distance between atoms α and β belonging respectively to the ligand (bosentan, K8794, or ET1) and hETB. We use the original PDB residue numbering to specify atom β. The four distances related to the four native contacts are designated as rN14;Lys182Nζ, rSOO;Lys182Nζ, rSOO;Lys273Nζ, and rSOO;Arg343Nη. When multiple distances were possible in rα;β, the minimum of the distances was selected for rα;β. For instance, four distances are possible for rSOO;Arg343Nη because two oxygen atoms and two nitrogen atoms are involved respectively in the sulfonyl group and Arg 343 (6 × 55). Also, Table S4 presents distances rα;β in the X-ray structures, in which three distances are approximately 3 Å, except for rSOO;Lys182Nζ (= 4.56 Å) from the bosentan-hETB complex. We presume that rSOO;Lys182Nζ, rSOO;Lys273Nζ, and rSOO;Arg343Nη are related to salt-bridges, and that rN14;Lys182Nζ might be done to either a salt-bridge or a hydrogen bond.

We judged that the heavy atoms α and β are contacting if rα;β<5.0 Å is satisfied in a snapshot. The value of 5.0 Å is about 0.5 Å larger than the largest 4.56 Å) of the experimentally obtained values (Table S4). Considering thermal fluctuations of the complex structure and the likelihood of error in experiment measurements (3.6 Å resolution for the bosentan–hETB complex), we added a tolerance figure of 0.5 Å to the inequality. Furthermore, not only does a heavy atom have a radius of approximately 2.0 Å, but the remaining distance between α and β is 1.0 Å (= 5.0  Å − 2 × 2.0 Å), which does not allow a water molecule to penetrate because the water molecule diameter is about 3.0 Å. Therefore, the 5.0 Å threshold is rational to check the atomic contacts.

Based on the criteria above, we calculated the number of contacts Ncnt (0Ncnt4) regarding the four distances for all snapshots. Then, picking snapshots falling in an rbb-range of Rrbb=rbb-Δrbb;rbb+Δrbb, we calculated the average of Ncnt for the selected snapshots and denoted it as Ncntrbb. Section 11 of SI presents the exact method to calculate Ncntrbb and its standard deviation SDcntrbb. Ideally, Δrbb should be sufficiently small (Δrbb0) to calculate Ncntrbb exactly at rbb. However, the number of snapshots in Rrbb decreases concomitantly with decreasing Δrbb because the simulation length is finite. To maintain statistical significance in Ncntrbb, we set Δrbb=0.25 Å for the present simulation. The value of Ncntrbb is four if all snapshots in Rrbb always have four contacts and zero if no snapshot has any contact.

Figure 9 depicts the rbb dependence of Ncntrbb, where rbb starts from 0.25 Å and where Ncnt (0.25 Å) is calculated for the range of R(0.25 Å) = [0.0 Å; 0.5 Å]. Note thatR(rbb < 0.25 Å) is impossible to be calculated. We set the maximum of rbb at 15.25 Å, where the used range is R(15.25 Å) = [15.0 Å; 15.5 Å]. Also, R(15.25 Å) is involved in the slice Δ4, immediately below the gate of the hETB’s binding pocket (Figure S13). Therefore, Fig. 9 shows that almost no atom contact is formed at the gate: Ncnt (15.25 Å) = 0.005. Also, Ncntrbb increased concomitantly with decreasing rbb (with bosentan approaching the native-complex position): The number of contacts exceeded the value of 1 at rbb=3.25 Å: Ncnt (3.25 Å) =1.03±0.85. Subsequently, the growth rate of Ncntrbb rose, and Ncntrbb reached the maximum at rbb = 0.25 Å: Ncnt (0.25 Å) =2.74±0.79. We further investigated formation of each contact, as described in Section 12 of SI. Figure S23 shows that each native contact is formed with decreasing rbb.

Figure 9.

Figure 9

Dependence of Ncntrbb on rbb and its standard deviation (error bars). Lower limits of some error bars are smaller than 0 on the x-axis because the standard deviation was calculated using the usual form: Ncntrbb2-Ncntrbb21/2.

Because the four intermolecular native contacts act as attractive interactions (Section 10 of SI), Fig. 9 shows that the complex structure is stabilized enthalpically by the native contacts when bosentan goes to the bottom of the binding pocket. We presume that the instability induced by the decrease of the entropy when bosentan passing the gate of the binding pocket is compensated by formation of the native contacts.

We checked the reproducibility of the packing of ring C to the bottom of the binding pocket (see Figure S21c for ring C of bosentan). For this purpose, we calculated the density map ρCMCrcube of the centroid of ring C using equation S3 of Subsection 3.1 of SI with replacement of the centroid of portion X by the centroid of ring C. Figure S24 presents the high-density spot (ρCMCrcube0.8) for ring C. Importantly, the high-density spot is localized in a region that overlaps with the ring-C position in the native complex. However, the contours of ρCMCrcube0.4 show deviation from the native-complex position. We discuss this discrepancy between the computation and the native complex later.

Bosentan conformations in the highest-density spot

As shown in Fig. 2, the bosentan centroid in the native complex structure was located on the fringe of the computed highest-density spot (red contours in Fig. 2). This section presents an explanation of bosentan conformations in the highest-density spot.

As stated earlier, the highest-peak position of PRDFRMSDwholeheavy or PDDFRMSDwholeheavy of Fig. 3 is larger than that of PRDFRMSDcoreheavy or PDDFRMSDcoreheavy, which suggests that the sidechains of bosentan were somewhat disordered. To examine this disorder in greater detail, we collected snapshots from 3.5 Å RMSDwholeheavy 4.0 Å, which involve the highest peak of PRDFRMSDwholeheavy (Fig. 3b).

Figure 10a displays four randomly selected conformations from the snapshots collected above. This figure demonstrates that the conformations from the highest density spot have some similarity with the native-complex structure (the X-ray structure). Therefore, we referred to the complex conformations in highest density spot as “native-like complex”. However, Fig. 10a shows that the longest sidechain of bosentan, ring A (Figure S21c), is deviated from the native-complex position with a conformational diversity, whereas the core region was converged to a position close to the native-complex position. For this reason, the highest-peak position of PRDFRMSDwholeheavy or PDDFRMSDwholeheavy was larger than that of PRDFRMSDcoreheavy or PDDFRMSDcoreheavy in Fig. 3, respectively. We discuss this imperfection in the next section.

Figure 10.

Figure 10

Snapshots sampled in the highest density spot (ρCMbrcube=0.5 in Fig. 3). In both panels, bosentan conformations from simulation are shown as magenta, and that in the native complex is shown as green. (a) Four conformations selected randomly from the range of 3.5 Å RMSDwholeheavy<4.0 Å. The bosentan core region and ring A are shown in the figure. (b) The conformation with RMSDwholeheavy=1.11 Å. hETB in the native complex is also shown. Although superimposition was not applied to bosentan, it was applied to trans-membrane helices of hETB (see Figure S7a). Transmembrane helices (TM1–7) are labeled for panel (b).

Figure 10b depicts a snapshot with a small RMSDwholeheavy (1.11 Å). Such small RMSDwholeheavy conformations were the minority in the ensemble of snapshots because there was no peak at RMSDwholeheavy1 Å in Fig. 3. This small RMSDcoreheavy was a result obtained because the ring A of this conformation overlapped well on that of the native-complex structure.

Comparison of bosentan at the highest-density spot with that in the crystal complex structure

We note several discrepancies between bosentan in the crystal complex structure and that at the computed highest-density spot. Actually, GA-guided mD-VcMD is sufficiently powerful to explore the ligand–receptor conformational space starting from completely unbound conformations (Figure S5). However, we remark that reproduction of the native-like complex was not perfect. As discussed, the bosentan’s longest sidechain (ring A; Figure S21c) exhibited structural diversity in native-like complex conformations, although the core region agreed well with the native-complex position.

We infer two possibilities for the imperfectness: Accuracy of classical-mechanical force-fields and/or treatment of the bosentan’s sulfonamide. Despite multiple attempts to improve force fields, current force fields are well known to be imperfect. In future studies, novel force-field parameters might resolve details of conformational inconsistency with experimentation.

Regarding the second possibility, we used the bosentan’s sulfonamide in a normal (neutral) form in that a hydrogen atom is attached to the nitrogen atom (N14) of the sulfonamide. In the neutral form, the electronic state of N14 is the sp2 hybridization with planar three-chemical bonds (S-NH-C angle of approximately 130 degrees). In the deprotonated form, the electronic state is also the sp2 hybridization with planar two-chemical bonds and one electron-lone pair (S=N¨-C angle of approximately 131°). Therefore, we cannot distinguish the electronic state of sulfonamide merely by viewing the X-ray structure of bosentan.

We might treat the N14 atom deprotonated because the pKa value of bosentan is 5.150 or 5.7951 in solution: 0.5% and 99.5% of bosentan molecules are in the neutral and ionic form, respectively, in human blood (pH=7.4). However, the form of sulfonamide in the hETB binding pocket is unknown. Reportedly52, the pKa of sulfonamide in various compounds, which are immersed in a mixture of organic solvent and water, increases greatly with increasing fraction of the organic solvent. In addition, as reported53, the electronic state of amino acids in the protein interior might be different from that in solution. Furthermore, the cell membrane is negatively charged. The bosentan molecules in the negatively charged form (the major fraction in blood) should only slightly access the cell surface by the repulsive electrostatic force between the cell membrane and the negatively charged bosentan. In contrast, the neutral form (the minor fraction in blood) can reach the cell surface and the receptor. Finally, we decided to use the neutral form of bosentan with protonated N14.

As described earlier, an attractive interaction can act for either the neutral or charged form of bosentan to the nitrogen (Nζ) of Lys 182 (3 × 33). In fact, we demonstrated that the distribution function ρbs(rN14;Lys182Nζ;rbb=0.25 Å) exhibited a remarkable peak at rN14;Lys182Nζ=3 Å (Figure S23f), which reproduces the experimentally obtained observations (Table S4). This result constitutes evidence that the currently used form of sulfonamide is plausible. To analyze the structure diversity of the ring A further, hugely numerous computations must be made in future studies, with varying of the force-fields and the charge state of sulfonamide.

Binding mechanism from the current and other simulations

Recent advances in MD simulations have enabled investigation of ligand–GPCR interactions54. Various inter-molecular interactions and semi-stable complexes (poses) were investigated when the ligands reach the GPCR’s binding pocket55. Energetically favored binding pathways and stable states (poses) in a conformational space were investigated, and information transfer from the ligand binding pocket to the intracellular region of the receptor was analyzed from the obtained conformational ensemble5658. A previous MD simulation study of ligands and the β2-adrenergic receptor (β2 AR) showed an energy barrier at the extracellular entrance of binding pocket59. The study suggested that the barrier is induced by dehydration of the ligand to enter the binding pocket. It is likely that this extracellular entrance may correspond to the “gate” of the binding pocket in our study because our simulation also showed the existence of a free-energy barrier, though this one is related to the ligand sliding on the hETB’s N-terminal tail and to the decrease of the ligand orientational variety (i.e., entropy loss) at the binding pocket gate. This difference of the origin for the free-energy barrier between these two studies implies the existence of a variety of GPCR binding pathways.

Most studies have focused on the ligand-binding pathways after the ligands have passed the gate of the GPCR’s binding pocket5659. On the other hand, there has been little investigation where ligands being far from the binding pocket gate are concerned. One main outcome from the current study is that the disordered N-terminal tail acts as an attracter to capture the ligand. Therefore, these studies and ours complement each other other with respect to understanding different aspects of the ligand–GPCR binding process.

The mechanism through which N-terminal tail of a GPCR assists with ligand binding can also vary depending on the tail’s structure. For example, the N-terminal segment of a GPCR, sphingosine-1-phosphate receptor 1 (S1P1R), is not disordered but adapts an α-helix in the crystal structure (PDB ID: 3v2y). This segment covers the gate of the binding pocket of the GPCR. A previous MD study60 reported that the ligand ML056 diffuses within the membrane to reach the entrance of the S1P1R's binding pocket. As the ligand passes the entrance, the α-helix unfolds, increasing the width of the entrance, which allows the ligand to diffuse into the pocket more readily. This simulation study and ours demonstrate that there is variety in the ligand–GPCR binding process depending on the structure of the system being studied.

Conclusions

Prediction of the ligand–receptor complex structure is crucially important for structure-based drug design. This strategy is reasonable to develop a new drug (especially antagonist) because the molecular system finally reaches the most thermodynamically stable structure irrespective of the binding pathways. However, the binding pathways are also important to elucidate the complex. As mentioned above, MD simulations showed ligand binding pathways of various GPCRs and semi-stable complex structures (poses) when the ligands enter into the binding pocket and reach the bottom of the binding pocket5,7,5456,5860. The current simulation study showed that an attractive interaction acts effectively between bosentan and the N-terminal tail of hETB via the fly casting mechanism1719, even when bosentan is located distant from the binding pocket of hETB. Some experiments reported in the related literature have indicated that the N-terminal tails of GPCRs have important physiological functions61, whereas the N-terminal tails of GPCRs have been studied insufficiently in contrast to the binding pocket. Because the tail is conformationally disordered, a simulation like the one performed for this study is helpful for elucidating the functions or roles of the N-terminal tail in the binding process.

In general, GPCRs have a long and disordered N-terminal tail (40–200 amino-acid residues long)21. Results of an MD study revealed that an amino-acid substitution to the N-terminal tail of a GPCR (human β2-adrenergic receptor) varies the tail dynamics, and that accessibility of the ligand to the binding pocket also varies by substitution62. This study, in addition to our present work, suggests that the ligand–receptor interaction should be studied even when the ligand is outside the binding pocket of GPCR.

The bosentan–hETB complex formation obtained from the present study is summarized as follows: First the tip-side of the N-terminal tail of hETB captures bosentan when bosentan is fluctuating in solution via nonspecific intermolecular contacts. The bosentan–capturing conformation is identified as the free-energy basin H1 in the free-energy landscape (Fig. 5). Then ligand–sliding occurs occasionally from the tip-side to the root-side of the N-terminal tail when bosentan passes the gate of the binding pocket, which corresponds to basin switching from H1 to the deeper basin H2 with overcoming of the inter-basin free-energy barrier. We designated this mechanism as “fly-casting with ligand–sliding”. Importantly, the molecular orientational variety of bosentan decreases quickly when bosentan passes the gate (Fig. 8). This decrement of the orientational variety has a role of screening out bosentan conformations for which the molecular orientations are improper to reach the binding site (orientational selection). In fact, this orientational selection is a variant of conformational selection20. Because the decrement of the bosentan orientational variety leads to a loss of configurational entropy of bosentan, another thermodynamic quantity should compensate the free-energetic shortcoming by the decrement of the orientational variety. Otherwise, the density of the bosentan centroid, ρCMbrcube, should decrease in the binding pocket. Our results indicate that the intermolecular native contacts (electronically attractive interactions) between bosentan and hETB are formed in the binding pocket (Fig. 9). It is likely that this native contact formation is the compensation factor, and that, eventually, the native-complex conformation is completed.

Results obtained from the present study suggest that a longer N-terminal tail adds a new property to the binding mechanism. Moreover, it presents the possibility that the binding speed might be enhanced by designing the length and sequence of the N-terminal tail. In fact, earlier reports describe that long N-terminal tails exert various physiological functions of GPCRs such as ligand recognition, receptor activation, and signaling22,23. Therefore, a simulation with a longer N-terminal tail is the next challenge for ligand–GPCR simulation.

We uploaded important snapshots obtained from the current simulation to the Biological Structure Model Archive (BSM-Arc)63. The entry for this study is https://bsma.pdbj.org/entry/35.

Supplementary Information

Acknowledgements

This work was supported by JSPS KAKENHI Grants No. 21K06052 (J.H.), 20H03229 (N.K.), and JP18H05534 (H.K.), and was performed in part under the Cooperative Research Program of the Institute for Protein Research, Osaka University, CR-21-05. J.H., K.K. and K.N. acknowledge support by the HPCI System Research Project (Project IDs: hp200025, hp200063, hp200090, hp210002, hp210005, and hp210008). J.H., S.I., N.H., I.F., N.K., and Y.F. are supported by the Development of Innovative Drug Discovery Technologies for Middle-Sized Molecules project from the Japan Agency for Medical Research and Development (AMED). J.H., N.K., I.F., and Y.F. are supported by Project Focused on Developing Key Technology for Discovering and Manufacturing Drugs for Next-Generation Treatment and Diagnosis (2018–2021 and 2021–) from AMED. J.H., K.K., S.S. and H.K. were partially supported by the Platform Project for Supporting Drug Discovery and Life Science Research (Basis for Supporting Innovative Drug Discovery and Life Science Research (BINDS)) from AMED under Grant Number JP21am0101106. The authors thank Dr. Isao Yasumatsu from Daiichi Sankyo RD Novare Co., Ltd. (Japan) for helpful discussion. GA-mD-VcMD was performed on TSUBAME3.0 supercomputers at the Tokyo Institute of Technology. We used UCSF Chimera ver. 1.15 for drawing molecular structures and gnuplot for graphs. Other analyses were done using homemade programs.

Author contributions

H.N. proposed the research plan; J.H., N.K., I.F., Y.F., and H.N. designed the actual research plan; K.K. and J.H. developed the GA-mD-VcMD algorithm; K.K., S.S. and H.K. implemented the GA-mD-VcMD algorithm to gromacs using the PLUMED plug-in; G.J.B., N.K. and B.M. generated the molecular system for simulation; B.M., G.J.B. and N.K. generated the gromacs input files and performed preparatory simulations; J.H. performed the main part of the simulation and analyses; S.I., I.F. and Y.F. performed some important parts of analyses; J.H. wrote the paper with acceptance of critical comments raised by coauthors.

Data availability

The datasets used or analyzed during the current study are available from the corresponding author on reasonable request. Several important snapshots during the current simulation were available in the BSM-Arc (https://bsma.pdbj.org/entry/35).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-022-17920-7.

References

  • 1.Syrovatkina V, Alegre KO, Dey R, Huang XY. Regulation, signaling, and physiological functions of G proteins. J. Mol. Biol. 2016;428:3850–3868. doi: 10.1016/j.jmb.2016.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Filmore D. It’s a GPCR world. Mod. Drug Discov. 2004;7:24–27. [Google Scholar]
  • 3.Yanagisawa M, Kurihara H, Kimura S, Tomobe Y, Kobayashi M, Mitsui Y, Yazaki Y, Goto K, Masaki T. A novel potent vasoconstrictor peptide produced by vascular endothelial cells. Nature. 1988;332:411–415. doi: 10.1038/332411a0. [DOI] [PubMed] [Google Scholar]
  • 4.Arai H, Hori S, Aramori I, Ohkubo H, Nakanishi S. Cloning and expression of a cDNA encoding an endothelin receptor. Nature. 1990;348:730–732. doi: 10.1038/348730a0. [DOI] [PubMed] [Google Scholar]
  • 5.Sakurai T, Yanagisawa M, Takuwat Y, Miyazaki H, Kimura S, Goto K, Masaki T. Cloning of a cDNA encoding a non-isopeptide-selective subtype of the endothelin receptor. Nature. 1990;348:732–735. doi: 10.1038/348732a0. [DOI] [PubMed] [Google Scholar]
  • 6.Shihoya W, Nishizawa T, Okuta A, Tani K, Dohmae N, Fujiyoshi Y, Nureki O, Doi T. Activation mechanism of endothelin ETB receptor by endothelin-1. Nature. 2016;537:363–368. doi: 10.1038/nature19319. [DOI] [PubMed] [Google Scholar]
  • 7.Clozel M, Breu V, Burri K, Cassal J-M, Fischli W, Gray GA, Hirth G, Löffier B-M, Müller M, Neidhart W, Ramuz H. Pathophysiological role of endothelin revealed by the first orally active endothelin receptor antagonist. Nature. 1993;365:759–761. doi: 10.1038/365759a0. [DOI] [PubMed] [Google Scholar]
  • 8.Clozel M, Breu V, Gray GA, Kalina B, Löffler BM, Burri K, Cassal JM, Hirth G, Müller M, Neidhart W. Pharmacological characterization of bosentan, a new potent orally active nonpeptide endothelin receptor antagonist. J. Pharmacol. Exp. Ther. 1994;270:228–235. [PubMed] [Google Scholar]
  • 9.Neidhart W, Breu V, Bur D, Burri K, Clozel M, Hirth G, MillIer M, Wessel HP, Ramuz H. The discovery of nonpeptide endothelin receptor antagonists Progression towards bosentan. Chimia. 1996;50:519–524. doi: 10.2533/chimia.1996.519. [DOI] [Google Scholar]
  • 10.Shihoya W, Nishizawa T, Yamashita K, Inoue A, Hirata K, Kadji FMN, Okuta A, Tani K, Aoki J, Fujiyoshi Y, Doi T, Nureki O. X-ray structures of endothelin ETB receptor bound to clinical antagonist bosentan and its analog. Nat. Struct. Mol. Biol. 2017;24:758–764. doi: 10.1038/nsmb.3450. [DOI] [PubMed] [Google Scholar]
  • 11.Hayami T, Higo J, Nakamura H, Kasahara K. Multidimensional virtual-system coupled canonical molecular dynamics to compute free-energy landscapes of peptide multimer assembly. J. Comput. Chem. 2019;40:2453–2463. doi: 10.1002/jcc.26020. [DOI] [PubMed] [Google Scholar]
  • 12.Higo J, Kasahara K, Wada M, Dasgupta B, Kamiya N, Hayami T, Fukuda I, Fukunishi Y, Nakamura H. Free-energy landscape of molecular interactions between endothelin 1 and human endothelin type B receptor: Fly casting mechanism. Protein Eng. Des. Sel. 2019;32:297–308. doi: 10.1093/protein/gzz029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Higo J, Kusaka A, Kasahara K, Kamiya N, Hayato I, Qilin X, Takahashi T, Fukuda I, Mori K, Hata Y, Fukunishi Y. GA-guided mD-VcMD: A genetic-algorithm-guided method for multi-dimensional virtual-system coupled molecular dynamics. Biophys. Physicobiol. 2020;17:161–176. doi: 10.2142/biophysico.BSJ-2020008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Higo J, Kawabata T, Kusaka A, Kasahara K, Kamiya N, Fukuda I, Mori K, Hata Y, Fukunishi Y, Nakamura H. Molecular interaction mechanism of a 14-3-3 protein with a phosphorylated peptide elucidated by enhanced conformational sampling. J. Chem. Inf. Model. 2020;60:4867–4880. doi: 10.1021/acs.jcim.0c00551. [DOI] [PubMed] [Google Scholar]
  • 15.Higo J, Takashima H, Fukunishi Y, Yoshimori A. Generalized-ensemble method study: A helix-mimetic compound inhibits protein–protein interaction by long-range and short-range intermolecular interactions. J. Comput. Chem. 2021;42:956–969. doi: 10.1002/jcc.26516. [DOI] [PubMed] [Google Scholar]
  • 16.Hayami T, Kamiya N, Kasahara K, Kawabata T, Kurita J-I, Fukunishi Y, Nishimura Y, Nakamura H, Higo J. Difference of binding modes among three ligands to a receptor mSin3B corresponding to their inhibitory activities. Sci. Rep. 2021;11:6178. doi: 10.1038/s41598-021-85612-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Shoemaker BA, Portman JJ, Wolynes PG. Speeding molecular recognition by using the folding funnel: The fly-casting mechanism. Proc. Natl. Acad. Sci. USA. 2000;97:8868–8873. doi: 10.1073/pnas.160259697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Sugase K, Dyson HJ, Wright PE. Mechanism of coupled folding and binding of an intrinsically disordered protein. Nature. 2007;447:1021–1025. doi: 10.1038/nature05858. [DOI] [PubMed] [Google Scholar]
  • 19.Arai M. Unified understanding of folding and binding mechanisms of globular and intrinsically disordered proteins. Biophys. Rev. 2018;10:163–181. doi: 10.1007/s12551-017-0346-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Spolar RS, Record MT., Jr Coupling of local folding to site-specific binding of proteins to DNA. Science. 1994;263:777–784. doi: 10.1126/science.8303294. [DOI] [PubMed] [Google Scholar]
  • 21.Wallin E, von Heijne G. Properties of N-terminal tails in G-protein coupled receptors: A statistical study. Prot. Eng. Des. Sel. 1995;8:693–698. doi: 10.1093/protein/8.7.693. [DOI] [PubMed] [Google Scholar]
  • 22.Ersoy BA, Pardo L, Zhang S, Thompson DA, Millhauser G, Govaerts C, Vaisse C. Mechanism of N-terminal modulation of activity at the melanocortin-4 receptor GPCR. Nat. Chem. Biol. 2012;8:725–730. doi: 10.1038/nchembio.1008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Uddin MS, Hauser M, Naider F, Becker JM. The N-terminus of the yeast G protein-coupled receptor Ste2p plays critical roles in surface expression, signaling, and negative regulation. Biochem. Biophys. Acta. 2016;1858:715–724. doi: 10.1016/j.bbamem.2015.12.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Isberg V, de Graaf C, Bortolato A, Cherezov V, Katritch V, Marshall FH, Mordalski S, Pin JP, Stevens RC, Vriend G, Gloriam DE. Generic GPCR residue numbers—Aligning topology maps while minding the gaps. Trends Pharmacol. Sci. 2015;36:22–31. doi: 10.1016/j.tips.2014.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Pandy-Szekeres G, Esguerra M, Hauser AS, Caroli J, Munk C, Pilger S, Keseru GM, Kooistra AJ, Gloriam DE. The G protein database, GproteinDb. Nucleic Acids Res. 2022;50:D518–D525. doi: 10.1093/nar/gkab852. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Higo J, Kamiya N, Sugihara T, Yonezawa Y, Nakamura H. Verifying trivial parallelization of multicanonical molecular dynamics for conformational sampling of a polypeptide in explicit water. Chem. Phys. Lett. 2009;473:326–329. doi: 10.1016/j.cplett.2009.03.077. [DOI] [Google Scholar]
  • 27.Ikebe J, Umezawa K, Kamiya N, Sugihara T, Yonezawa Y, Takano Y, Nakamura H, Higo J. Theory for trivial trajectory parallelization of multicanonical molecular dynamics and application to a polypeptide in water. J. Comput. Chem. 2011;32:1286–1297. doi: 10.1002/jcc.21710. [DOI] [PubMed] [Google Scholar]
  • 28.Kasahara K, Ma B, Goto K, Dasgupta B, Higo J, Fukuda I, Mashimo T, Akiyama Y, Nakamura H. myPresto/Omegagene: A GPU-accelerated molecular dynamics simulator tailored for enhanced conformational sampling methods with a non-ewald electrostatic scheme. Biophys. Physicobiol. 2016;13:209–216. doi: 10.2142/biophysico.13.0_209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kutzner C, Páll S, Fechner M, Esztermann A, de Groot BL, Grubmüller H. More bang for your buck: Improved use of GPU nodes for GROMACS 2018. J. Comput. Chem. 2019;40:2418–2431. doi: 10.1002/jcc.26011. [DOI] [PubMed] [Google Scholar]
  • 30.Bonomi M, Branduardi D, Bussi G, Camilloni C, Provasi D, Raiteri P, Donadio D, Marinelli F, Pietrucci F, Broglia R, Parrinello M. PLUMED: A portable plugin for free-energy calculations with molecular dynamics. Comput. Phys. Commun. 2009;180:1961–1972. doi: 10.1016/j.cpc.2009.05.011. [DOI] [Google Scholar]
  • 31.Hess B, Bekker H, Berendsen HJC, Fraaije JGEM. LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem. 1997;18:1463–1472. doi: 10.1002/(SICI)1096-987X(199709)18:12&#x0003c;1463::AID-JCC4&#x0003e;3.0.CO;2-H. [DOI] [Google Scholar]
  • 32.Nosé S. A unified formulation of the constant temperature molecular dynamics methods. J. Chem. Phys. 1984;81:511–519. doi: 10.1063/1.447334. [DOI] [Google Scholar]
  • 33.Hoover WG. Canonical dynamics: Equilibrium phase-space distributions. Phys. Rev. A. 1985;31:1695–1697. doi: 10.1103/PhysRevA.31.1695. [DOI] [PubMed] [Google Scholar]
  • 34.Kamiya N, Fukuda I, Nakamura H. Application of zero-dipole summation method to molecular dynamics simulations of a membrane protein system. Chem. Phys. Lett. 2013;568–569:26–32. doi: 10.1016/j.cplett.2013.03.014. [DOI] [Google Scholar]
  • 35.Fukuda I, Kamiya N, Yonezawa Y, Nakamura H. Simple and accurate scheme to compute electrostatic interaction: Zero-dipole summation technique for molecular system and application to bulk water. J. Chem. Phys. 2012;137:054314. doi: 10.1063/1.4739789. [DOI] [PubMed] [Google Scholar]
  • 36.Fukuda I, Yonezawa Y, Nakamura H. Molecular dynamics scheme for precise estimation of electrostatic interaction via zero-dipole summation principle. J. Chem. Phys. 2011;134:164107. doi: 10.1063/1.3582791. [DOI] [PubMed] [Google Scholar]
  • 37.Lindorff-Larsen K, Piana S, Palmo K, Maragakis P, Klepeis JL, Dror RO, Shaw DE. Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins Struct Funct. Genet. 2010;78:1950–1958. doi: 10.1002/prot.22711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Dickson CJ, Madej BD, Skjevik ÅA, Betz RM, Teigen K, Gould IR, Walker RC. Lipid14: The Amber lipid force field. J. Chem. Theory Comput. 2014;10:865–879. doi: 10.1021/ct4010307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–936. doi: 10.1063/1.445869. [DOI] [Google Scholar]
  • 40.Joung IS, Cheatham TE., III Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations. J. Phys. Chem. B. 2008;112:9020–9041. doi: 10.1021/jp8001614. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Frisch M. J. et al. Gaussian 09, Revision D.01.
  • 42.Bayly CI, Cieplak P, Cornell W, Kollman PA. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges: The RESP model. J. Phys. Chem. 1993;97:10269–10280. doi: 10.1021/j100142a004. [DOI] [Google Scholar]
  • 43.Wang JM, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general amber force field. J. Comput. Chem. 2004;25:1157–1174. doi: 10.1002/jcc.20035. [DOI] [PubMed] [Google Scholar]
  • 44.Wacker D, Wang S, McCorvy JM, Betz RM, Venkatakrishnan AJ, Levit A, Lansu K, Schools ZL, Che T, Nichols DE, Shoichet BK, Dror RO, Roth BL. Crystal structure of an LSD-bound human serotonin receptor. Cell. 2017;168:377–389. doi: 10.1016/j.cell.2016.12.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Che T, English J, Krumm BE, Kim K, Pardon E, Olsen RHJ, Wang S, Zhang S, Diberto JF, Sciaky N, Carroll FI, Steyaert J, Wacker D, Roth BL. Nanobody-enabled monitoring of kappa opioid receptor states. Nat. Commun. 2020;11:1145–1145. doi: 10.1038/s41467-020-14889-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Rodríguez-Espigares I, Torrens-Fontanals M, Tiemann JKS, et al. GPCRmd uncovers the dynamics of the 3D-GPCRome. Nat. Methods. 2020;17:777–787. doi: 10.1038/s41592-020-0884-y. [DOI] [PubMed] [Google Scholar]
  • 47.Szlenk CT, Gc JB, Natesan S. Does the lipid bilayer orchestrate access and binding of ligands to transmembrane orthosteric/allosteric sites of G protein-coupled receptors? Mol. Pharmacol. 2019;96:527–541. doi: 10.1124/mol.118.115113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Toyoda Y, Morimoto K, Suno R, Horita S, Yamashita K, Hirata K, Sekiguchi Y, Yasuda S, Shiroishi M, Shimizu T, Urushibata Y, Kajiwara Y, Inazumi T, Hotta Y, Asada H, Nakane T, Shiimura Y, Nakagita T, Tsuge K, Yoshida S, Kuribara T, Hosoya T, Sugimoto Y, Nomura N, Sato M, Hirokawa T, Kinoshita M, Murata T, Takayama K, Yamamoto M, Narumiya S, Iwata S, Kobayashi T. Ligand binding to human prostaglandin E receptor EP4 at the lipid-bilayer interface. Nat. Chem. Biol. 2019;15:18–26. doi: 10.1038/s41589-018-0131-3. [DOI] [PubMed] [Google Scholar]
  • 49.McCloskey MA, Poo MM. Rates of membrane-associated reactions: Reduction of dimensionality revisited. J. Cell Biol. 1986;102:88–96. doi: 10.1083/jcb.102.1.88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Gatfield J, Grandjean CM, Bur D, Bolli MH, Nayler O. Distinct ETA receptor binding mode of macitentan as determined by site directed mutagenesis. PLoS ONE. 2014;9:e107809. doi: 10.1371/journal.pone.0107809. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Krupa A, Majda D, Mozgawa W, Szlęk J, Jachowicz R. Physicochemical properties of bosentan and selected PDE-5 inhibitors in the design of drugs for rare diseases. AAPS PharmSciTech. 2017;18:1318–1331. doi: 10.1208/s12249-016-0599-7. [DOI] [PubMed] [Google Scholar]
  • 52.Şanli N, Şanli S, Özkan G, Denizlic A. Determination of pKa values of some sulfonamides by LC and LC-PDA methods in acetonitrile-water binary mixtures. J. Braz. Chem. Soc. 2010;21:1952–1960. [Google Scholar]
  • 53.Isom DG, Castañeda CA, Cannon BR, García-Moreno EB. Large shifts in pKa values of lysine residues buried inside a protein. Proc. Natl. Acad. Sci. USA. 2011;108:5260–5265. doi: 10.1073/pnas.1010750108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Marino KA, Filizola M. Investigating small-molecule ligand binding to G protein-coupled receptors with biased or unbiased molecular dynamics simulations. Methods Mol. Biol. 2018;1705:351–364. doi: 10.1007/978-1-4939-7465-8_17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Dror RO, Green HF, Valant C, Borhani DW, Valcourt JR, Pan AC, Arlow DH, Canals M, Lane JR, Rahmani R, Baell JB, Sexton PM, Christopoulos A, Shaw DE. Structural basis for modulation of a G-protein-coupled receptor by allosteric drugs. Nature. 2013;503:295–299. doi: 10.1038/nature12595. [DOI] [PubMed] [Google Scholar]
  • 56.Schneider S, Provasi D, Filizola M. How oliceridine (TRV-130) binds and stabilizes a μ-opioid receptor conformational state that selectively triggers G protein signaling pathways. Biochemistry. 2016;55:6456–6466. doi: 10.1021/acs.biochem.6b00948. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Shang Y, Yeatman HR, Provasi D, Alt A, Christopoulos A, Canals M, Filizola M. Proposed mode of binding and action of positive allosteric modulators at opioid receptors. ACS Chem. Biol. 2016;11:1220–1229. doi: 10.1021/acschembio.5b00712. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Kruse AC, Hu J, Pan AC, Arlow DH, Rosenbaum DM, Rosemond E, Green HF, Liu T, Chae PS, Dror RO, Shaw DE, Weis WI, Wess J, Kobilka BK. Structure and dynamics of the M3 muscarinic acetylcholine receptor. Nature. 2012;482:552–556. doi: 10.1038/nature10867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Dror RO, Pan AC, Arlow DH, Borhani DW, Maragakis P, Shan Y, Xu H, Shaw DE. Pathway and mechanism of drug binding to G-protein-coupled receptors. Proc. Natl. Acad. Sci. USA. 2011;108:13118–13123. doi: 10.1073/pnas.1104614108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Stanley N, Pardo L, Fabritiis GD. The pathway of ligand entry from the membrane bilayer to a lipid G protein-coupled receptor. Sci. Rep. 2016;6:22639. doi: 10.1038/srep22639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Coleman JLJ, Ngo T, Smith NJ. The G protein-coupled receptor N-terminus and receptor signalling: N-tering a new era. Cell Signal. 2017;33:1–9. doi: 10.1016/j.cellsig.2017.02.004. [DOI] [PubMed] [Google Scholar]
  • 62.Shahane G, Parsania C, Sengupta D, Joshi M. Molecular insights into the dynamics of pharmacogenetically important N-terminal variants of the human β2-adrenergic receptor. PLoS Comput. Biol. 2014;10:e1004006. doi: 10.1371/journal.pcbi.1004006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Bekker G, Kawabata T, Kurisu G. The biological structure model archive (BSM-Arc): An archive for in silico models and simulations. Biophys. Rev. 2020;12:371–375. doi: 10.1007/s12551-020-00632-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The datasets used or analyzed during the current study are available from the corresponding author on reasonable request. Several important snapshots during the current simulation were available in the BSM-Arc (https://bsma.pdbj.org/entry/35).


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES