Abstract
Background:
Chemokine GPCRs play key roles in biology and medicine. Particularly, CXCR4 promotes cancer metastasis and facilitate HIV entry into host cells. Plerixafor (PLX) is a CXCR4 drug, but the pathway and binding site of PLX in CXCR4 remain unknown.
Results & methodology:
We have performed molecular docking and all-atom simulations using Gaussian accelerated molecular dynamics (GaMD), which are consistent with previous mutation experiments, suggesting that PLX binds to the orthosteric site of CXCR4 as an antagonist. The GaMD simulations further revealed an intermediate allosteric binding site at the extracellular mouth of CXCR4.
Conclusion:
The newly identified allosteric site can be targeted for novel drug design targeting CXCR4 and other chemokine receptors.
Keywords: : chemokine receptors, drug binding, GPCRs, Gaussian accelerated molecular dynamics, HIV, plerixafor
Chemokine receptors are key GPCRs, which control cell migration during immune system responses and development of cardiovascular and central nervous systems and in numerous diseases (including inflammation and cancer) [1,2]. Importantly, the CCR5 and CXCR4 chemokine receptors also function as co-receptors that facilitate HIV entry into host immune cells [3–5]. Maraviroc and vicriviroc, two clinic drugs of HIV entry inhibitors, are antagonists of the CCR5 receptor [6]. These drugs could block HIV replication, but ultimately resistance develops, due to emergence of viruses that can utilize the CXCR4 co-receptor.
Unfortunately, the development of the CXCR4 antagonists as effective drugs of HIV infection has been greatly hindered. This is due to the facts that CXCR4 is widely expressed in many different human tissues and the primary endogenous chemokine-binding (orthosteric) site is conserved across different subtypes of the chemokine receptors. Off-target binding often occurs for the prototypical CXCR4 antagonists [7]. Alternatively, it is appealing to develop allosteric modulators, which selectively bind to a topographically distant (allosteric) site with divergent sequences. They are promising to regulate the responsiveness of CXCR4 to endogenous chemokine with reduced side effects [8–10].
X-ray crystal structures have been determined for the CXCR4 receptor, in complex with an antagonist compound IT1t [4], a cyclic peptide CVX15 [4] and vMIP-II that is a viral chemokine encoded by Kaposi’s Sarcoma-associated herpesvirus [3]. These ligands are antagonists that stabilize the CXCR4 in the inactive state. Homodimers are formed for the CXCR4 in the x-ray structures with interface involving the transmembrane (TM) helices 5 and 6. While the CVX15 cyclic peptide binds to a major subpocket, the IT1t compound and vMIP-II viral chemokine occupies a minor subpocket at the receptor orthosteric site. In addition, the vMIP-II viral chemokine forms extensive interactions with the N-terminus of the CXCR4 receptor. These structures provide important insights into antagonist binding of the CXCR4. They also serve as an excellent starting point for computational modeling and structure-based drug design.
Plerixafor (PLX) or AMD3100 is an organic compound with two positively charged cyclam rings that are connected by a 1,4-phenylenebis(methylene) linker [11]. It has been identified as the first selective small-molecule inhibitor of the CXCR4 receptor. Previous studies suggested multiple binding modes of PLX in the CXCR4, raising the question of whether it is an antagonist or a negative allosteric modulator [11–13]. The target binding site of known drugs in the CXCR4 such as PLX [11–13] remains elusive. Furthermore, the pathways and mechanism of drug binding to the CXCR4 receptor are still poorly understood, which has greatly hindered effective drug design of this important target.
Gaussian accelerated molecular dynamics (GaMD) is a computational technique that provides simultaneous unconstrained enhanced sampling and free energy calculations of large biomolecules [14]. GaMD works by adding a harmonic boost potential to smooth the potential energy surface of biomolecules to reduce the system energy barriers [14]. A Gaussian distribution is followed by the added harmonic boost potential. Proper energetic reweighting of GaMD simulations is achieved through cumulant expansion to the second order (‘Gaussian approximation’). While accurate free energy profiles have been obtained from GaMD simulations for small biomolecules such as alanine dipeptide compared with long-timescale conventional MD simulations [14,15], it is exceedingly difficult to calculate convergent free energy profiles for large systems like GPCRs. Nevertheless, relatively low-energy conformational states of biomolecules can be identified from the GaMD reweighted free energy profiles [16,17]. Without the requirement of carefully predefined collective variables, GaMD is advantageous to study complex biological processes such as drug binding to proteins. GaMD simulations have been successfully demonstrated on ligand binding [14–18], protein folding [14,18], GPCR activation [19] and the protein-membrane [20], protein–protein [21,22] and protein–nucleic acid [23,24] interactions.
In this study, we have combined molecular docking and all-atom GaMD simulations to determine the binding mode of the PLX drug in the CXCR4 receptor. The docking and GaMD simulations are consistent with previous mutation experimental data [13,25], suggesting that PLX binds to the orthosteric site of CXCR4 as an antagonist. The GaMD simulations have also identified important intermediate states of the drug and an intermediate allosteric site of the CXCR4 receptor. This opens a new platform to design novel allosteric modulators as selective drugs of CXCR4 for the treatment against HIV and cancers associated with this receptor.
Materials & methods
Gaussian accelerated molecular dynamics
GaMD is an unconstrained enhanced sampling approach that works by adding a harmonic boost potential to smooth the potential energy surface of biomolecules to reduce energy barriers [14]. Brief description of the method is provided here.
Consider a system with N atoms at positions . When potential energy of the system is less than a threshold energy E, a boost potential is added to the system as follows:
| (Eq. 1) |
| (Eq. 2) |
where k is the harmonic force constant. The two adjustable parameters E and k can be determined by application of three enhanced sampling principles. First, for any two arbitrary potential values and found on the original energy surface, if , should be a monotonic function that does not change the relative order of the biased potential values, in other words, . Second, if , the potential difference observed on the smoothed energy surface should be smaller than that of the original, in other words, . By combining the first two criteria and plugging in the formula of and , we obtain:
| (Eq. 3) |
where and are the system minimum and maximum potential energies. To ensure that (Equation 3) is valid, k has to satisfy: . Let us define , then . Third, the standard deviation (SD) of ΔV needs to be small enough (i.e., narrow distribution) to ensure accurate reweighting using cumulant expansion to the second order: , where and are the average and SD of ΔV with as a user-specified upper limit (e.g., ) for accurate reweighting. When E is set to the lower bound according to equation 3, can be calculated as:
| (Eq. 4) |
Alternatively, when the threshold energy E is set to its upper bound , is set to:
| (Eq. 5) |
if is calculated between 0 and 1. Otherwise, is calculated using (Equation 4).
The original GaMD method provides schemes to add only the total potential boost , only dihedral potential boost or the dual potential boost (both and ) [14]. The dual-boost GaMD (GaMD_Dual) simulation generally provides higher acceleration than the other two types of simulations [26]. The simulation parameters comprise the threshold energy E for applying boost potential and the effective harmonic force constants, and for the total and dihedral potential boost, respectively. Since ligand binding mainly involves nonbonded interactions, another scheme of nonbonded dual-boost GaMD (GaMD_Dual_NB) is introduced in this study of drug binding to the CXCR4 chemokine receptor, for which boost potential is applied to the system dihedral energy and nonbonded potential energy terms.
Energetic reweighting of GaMD simulations
For energetic reweighting of GaMD simulations to calculate potential mean force (PMF), the probability distribution along a reaction coordinate is written as . Given the boost potential of each frame, can be reweighted to recover the canonical ensemble distribution , as:
| (Eq. 6) |
where M is the number of bins, and is the ensemble-averaged Boltzmann factor of for simulation frames found in the jth bin. The ensemble-averaged reweighting factor can be approximated using cumulant expansion:
| (Eq. 7) |
where first two cumulants are given by
| (Eq. 8) |
The boost potential obtained from GaMD simulations usually follows near-Gaussian distribution [27]. Cumulant expansion to the second order thus provides a good approximation for computing the reweighting factor [14,28]. The reweighted free energy is calculated as
| (Eq. 9) |
where is the modified free energy obtained from GaMD simulation and is a constant.
System setup
The highest-resolution (2.5 Å) x-ray structure of CXCR4 co-crystalized with antagonist IT1t (Protein Data Bank [PDB]: 3ODU) [4] was used for setting up the simulation system. The structure included 293 residues (27–319) of the 352 residues of the CXCR4. The T4-lysozyme residues that were added to facilitate crystallization and the IT1t antagonist were removed. Structure of the PLX drug that was refined with molecular dynamics was downloaded from the Automated Topology Builder and Repository (http://compbio.biosci.uq.edu.au/atb).
PLX (AMD3100; Mozobil®, Genzyme, MA USA) is an organic small molecule having a 1,4-phenylenebis(methylene) linker that connects two cyclam rings [11]. With its symmetric structure, it has been suggested to exhibit a unique binding mode to CXCR4 unlike other ligands [11]. At physiological pH, each cyclam ring (1,4,8,11-tetrazacyclotetradecane) has two positive charges, giving an overall charge of +4 to the entire bicyclam PLX molecule [11]. Maestro in Schrodinger [29] was used to prepare the ligand at physiological pH 7.4, which generated only one configuration of the ligand with four positive charges on atoms N1, N3, N4 and N7 of the bicyclam rings. This configuration was used to carry out GaMD simulations.
Simulation systems were prepared for the unbound and the bound state of the ligand. The unbound state was prepared by placing a total of ten PLX molecules at a distance >15 Å from the receptor (Figure 1A). The bound state was prepared by rigid-body docking of PLX to the receptor using AutoDock [30]. The lowest energy conformation (-7.48 kcal/mol) was selected from docking for performing GaMD simulations. The simulation systems were prepared with the CHARMM-GUI [31–33] web server for using the membrane protein input generator. All chain termini were capped with neutral patches (acetyl and methylamide). The systems were solvated in 0.15 M NaCl solution at temperature 310 K. The CHARMM36m [34] parameter set was used for the receptor and lipids and the CGenFF 2.2.0 parameters [35,36] for the ligand. The output files from CHARMM-GUI were used to perform GaMD simulations with AMBER as summarized in Table 1.
Figure 1. . Gaussian accelerated molecular dynamics simulation (Sim2 in Table 1) successfully captured spontaneous drug binding to the CXCR4 chemokine receptor.

(A) Computational model used for simulation of the CXCR4 receptor (blue ribbons) with ten PLX drug molecules (red spheres) placed away in the solvent. The receptor was inserted in a POPC lipid bilayer (cyan sticks) and solvated in an aqueous solution (cyan) of 0.15 M NaCl, (B) structure of PLX with numbered nitrogen atoms and the symmetry-corrected RMSD of PLX relative to its bound conformation plotted as a function of simulation time, (C) the distance between the Cγ atom of receptor residue D2626.58 of the binding pocket and the N7 atom of PLX, (D) the distance between the Cγ atom of receptor residue D972.63 of the binding pocket and the N3 atom of PLX, (E) the distance between the Cδ atom of receptor residue E2887.39 of the binding pocket and the N4 atom of PLX and (F) GaMD predicted binding pose of PLX (red sticks) at the orthosteric site of CXCR4 (blue ribbons) with their interacting residues labeled and highlighted in green sticks. Antagonist IT1t (orange) and docking conformation of PLX (yellow) are shown for reference. The seven TM helices I–VII and three extracellular loops (ECL) 1–3 are labeled in the CXCR4 receptor.
GaMD: Gaussian accelerated molecular dynamics; PLX: Plerixafor; RMSD: Root-mean-square deviation; TM: Transmembrane.
Table 1. . Summary of Gaussian accelerated molecular dynamics simulations performed on the CXCR4 chemokine receptor in the presence of the plerixafor drug.
| System | Natoms† | Method‡ | ID | Length (ns) | § (kcal/mol) | ¶ (kcal/mol) |
|---|---|---|---|---|---|---|
| CXCR4 + PLX (unbound) | 120275 | GaMD_Dual_NB | Sim1 | 800 | 19.81 | 4.49 |
| Sim2 | 1000 | 19.85 | 4.49 | |||
| Sim3 | 800 | 19.76 | 4.48 | |||
| Sim4 | 800 | 19.83 | 4.49 | |||
| Sim5 | 800 | 19.77 | 4.48 | |||
| CXCR4 + PLX (unbound) | 120275 | GaMD_Dual | Sim6 | 800 | 14.93 | 4.32 |
| Sim7 | 800 | 14.91 | 4.31 | |||
| Sim8 | 800 | 14.92 | 4.32 | |||
| Sim9 | 800 | 14.96 | 4.32 | |||
| Sim10 | 800 | 15.01 | 4.33 | |||
| CXCR4 + PLX (bound) | 76625 | GaMD_Dual | Sim11 | 300 | 15.22 | 4.37 |
| Sim12 | 300 | 15.46 | 4.40 | |||
| Sim13 | 300 | 15.19 | 4.37 |
Natoms: number of atoms in the system.
GaMD_Dual is dual boost GaMD and GaMD_Fual_NB is nonbonded dual-boost GaMD.
: average of the GaMD boost potential.
: SD of the GaMD boost potential.
GaMD: Gaussian accelerated molecular dynamic; PLX: Plerixafor; SD: Standard deviation.
Simulation protocol
The output files of CHARMM-GUI web server were used for initial energy minimization, equilibration and conventional molecular dynamics (cMD) to prepare the systems for GaMD simulations. The systems were energy minimized and equilibrated at 310 K using default parameters given by CHARMM-GUI. Energy minimization of the systems was done for 5000 steps using steepest-descent algorithm at constant number, volume and temperature at 310 K. The systems were further equilibrated for 375 ps at 310 K at constant number, pressure and temperature. The systems were simulated using cMD for 10 ns at 1 atm pressure and 310 K temperature. GaMD implemented in GPU version of AMBER 18 [14,37] was applied to simulate the CXCR4 systems. The simulations involved an initial short cMD of 2.5 ns to calculate GaMD acceleration parameters and GaMD equilibration of added boost potential for 40 ns.
Five independent 800–1000 ns production GaMD simulations with boost potential applied to the system nonbonded potential and dihedral energy terms (GaMD_Dual_NB) were performed on the CXCR4 receptor with PLX unbound. In this case, the GaMD_Dual_NB algorithm was developed for more efficient sampling of the nonbonded ligand binding process. For comparison, the same system was also simulated using the previous dual-boost GaMD (GaMD_Dual) in five independent 800 ns production runs, for which boost potential was applied to the dihedral energy and the system total potential energy. In the case of the drug bound state, simulations were performed to refine the docked conformation of PLX inside the receptor, for which GaMD_Dual was generally considered to be better for biomolecular conformational sampling and thus applied for the structural refinement. Therefore, three independent 300 ns GaMD_Dual simulations were performed on the CXCR4 receptor with PLX docked obtained with AutoDock rigid-body docking. The GaMD simulations are summarized in Table 1.
Simulation analysis
Simulation analysis was carried out using VMD [38] and CPPTRAJ [39]. The hierarchical agglomerative clustering algorithm was used to cluster snapshots of the diffusing PLX using its center ring with all production GaMD simulations combined for each system. The root-mean-square deviation (RMSD) cutoff was set to 2.0 Å. The combined GaMD simulations of PLX docked to CXCR4 were clustered to obtain five clusters for which the simulation trajectories showed stable binding of PLX at the orthosteric site of CXCR4. The top ranked cluster was selected as a reference, termed the ‘bound conformation’, to calculate the RMSD of PLX during simulations of its binding to the receptor from the solvent.
Since PLX is symmetric with two cyclam rings, its RMSD relative to the bound conformation was calculated twice, once with the original configuration and another with the two cyclam rings flipped. The minimum of the two RMSD values of each frame (i.e., symmetry-corrected RMSD) was used for further analysis.
The RMSD of PLX relative to the bound conformation and distances of important interactions between the drug and receptor residues were identified to calculate PMF free energy profiles using the PyReweighting toolkit [28]. A bin size of 2 Å was used for distances and RMSD. Free energy values were also reweighted for each of the ligand structural clusters. The cutoff was set to 500 frames in a bin or cluster for reweighting.
Results
Three independent 300 ns dual-boost GaMD (GaMD_Dual) simulations on the PLX docked CXCR4 obtained from AutoDock rigid-body docking showed that the drug maintained stable binding at the orthosteric site. Five structural clusters of PLX were generated from clustering snapshots of the drug with all three GaMD simulations combined (Supplementary Figure 1A). The top ranked cluster, which exhibited a similar conformation compared with the docking pose, was defined the ‘bound conformation’ of PLX (Supplementary Figure 1B). In one of the nonbonded dual-boost GaMD simulations (GaMD_Dual_NB) of CXCR4, spontaneous binding of PLX from free diffusion in the solvent to the orthosteric site of the CXCR4 receptor was captured (Figure 1 & ‘Sim2’ in Table 1). The minimum RMSD of PLX relative to the bound conformation was 2.76 Å. The other four GaMD simulations revealed important interactions of the drug with the receptor surface, although no complete binding of PLX to the CXCR4 orthosteric site was observed (Supplementary Figure 2). The five GaMD simulations showed similar average and SD of the added boost potential as 19.8 kcal/mol and 4.48 kcal/mol, respectively.
The GaMD_Dual simulations of CXCR4 with unbound PLX showed lower boost potential of 14.9 ± 4.32 kcal/mol. No complete binding of PLX to the CXCR4 was captured in any of the five simulations (Supplementary Figure 3). Moreover, GaMD_Dual simulations of CXCR4 with PLX bound showed stable binding of PLX at the CXCR4 orthosteric site with similar GaMD boost potential of 15.2 ± 4.38 kcal/mol.
GaMD simulations revealed PLX binding to the CXCR4 receptor as an antagonist
In Sim2 of the GaMD_Dual_NB simulation trajectory, out of the ten PLX drug molecules freely diffusing in the solvent, PLX1 approached the extracellular mouth of the CXCR4 receptor and bound to the orthosteric site (Figure 1). Stable binding was observed after approximately 480 ns with an RMSD of approximately 2.8 Å relative to the GaMD refined docking conformation of PLX (Figure 1B).
Within approximately 180 ns of simulation time, a salt bridge was formed between the receptor residue D2626.58 and N7 of PLX (Figure 1C). The protein residue D972.63 formed a salt bridge with atom N3 of PLX at approximately 240 ns despite fluctuations (Figure 1D). Residue E2887.39 that was located deep inside the receptor-binding pocket formed another salt bridge with atom N4 of PLX much later (∼480ns) during the GaMD simulation (Figure 1E). In the final bound conformation, the positively charged PLX formed stable salt bridges with residues D972.63, D2626.58 and E2887.39 as shown in Figure 1F. While the IT1t antagonist bound to only the minor subpocket of CXCR4 (PDB: 3ODU) [4], our GaMD simulations revealed binding of PLX to both the minor and major subpockets of the receptor (Figure 1F).
Free energy profiles of PLX binding to the CXCR4 receptor
We combined all five GaMD_Dual_NB simulations to calculate reweighted free energy profiles of the drug binding process to characterize the binding of PLX to the CXCR4 receptor. The RMSD of PLX relative to the bound conformation and the salt bridge distances formed between the protein and ligand were selected as reaction coordinates. From the free energy profiles as shown in Figure 2, we identified four low-energy conformational states: unbound, intermediate 1 (I1), intermediate 2 (I2) and bound. In the unbound state, the distances between the charge center of D2626.58 (the CG atom) and PLX atom N7 (Figure 2A), D972.63 (the CG atom) and PLX atom N3 (Figure 2B); E2887.39 (the CD atom) and PLX atom N4 (Figure 2C) exhibited a broad energy well centered around approximately 60 Å, suggesting that the drug diffused in bulk solution. In the Intermediate I1 state, the distances between the charge center CG atom of D2626.58 and PLX atom N7 (Figure 2A) and D972.63 (the CG atom) and PLX atom N3 (Figure 2B) were 12.65 and 15.06 Å, respectively. The RMSD of PLX relative to the bound conformation was 11.42 Å. In the intermediate I2 state, the distances between the charge center CG atom of D2626.58 and PLX atom N7 (Figure 2A) and E2887.39 (the CD atom) and PLX atom N4 (Figure 2C) were 7.38 and 21.49 Å, respectively. The RMSD of PLX relative to the bound conformation was 11.14 Å. In the bound state, the distances between the charge center CG atom of D2626.58 and PLX atom N7 (Figure 2A), D972.63 (the CG atom) and PLX atom N3 (Figure 2B) and E2887.39 (the CD atom) and PLX atom N4 (Figure 2C) were approximately 3.0 Å and the RMSD of PLX relative to the bound conformation was approximately 2.8 Å. Further PMF calculations showed that the distances between the Cα atoms of receptor residues R1343.50-T2416.37 and S1313.47-I2456.41 maintained the IT1t antagonist-bound CXCR4 x-ray structural values in the free energy minima despite PLX movements in the GaMD simulations (Supplementary Figure 4). Therefore, the CXCR4 receptor remained in the inactive state during the PLX drug binding to the orthosteric site.
Figure 2. . 2D potential of mean force free energy profiles of the CXCR4–plerixafor interactions.

(A) The distance between the Cγ atom of CXCR4 residue D2626.58 of the binding pocket and the N7 atom of PLX as plotted in Figure 1D, (B) the distance between the Cγ atom of CXCR4 residue D972.63 of the binding pocket and the N3 atom of PLX as plotted in Figure 1E, (C) the distance between the Cδ atom of CXCR4 residue E2887.39 of the binding pocket and the N4 atom of PLX as plotted in Figure 1F. (D) A 2D PMF of the distance between receptor residue D2626.58 and PLX atom N7 versus the distance between receptor residue E2887.39 and PLX atom N4. The I1 and I2 states at energy minima were identified at (D262:CG-PLX:N7, E288:CD-PLX:N4) distances of (13.01 Å, 8.60 Å) and (7.38 Å, 21.49 Å), respectively. (E) A 2D PMF of the distance between receptor residue D2626.58 and PLX atom N7 versus the distance between receptor residue D972.63 and PLX atom N3. (F) Zoomed view of the 2D PMF as plotted in Figure 2E. The I1 and I2 states at energy minima (∼0.47 kcal/mol and 0.0 kcal/mol) were identified at (D262:CG-PLX:N7 and D97:CG-PLX:N3) distances of (13.01 Å and 16.40 Å) and (7.38 Å and 19.54 Å), respectively. Low-energy unbound, intermediate I1 and I2 and bound conformational states are labeled in the PMF profiles.
GaMD: Gaussian accelerated molecular dynamic; PLX: Plerixafor; PMF: Potential of mean force; RMSD: Root-mean-square deviation.
In addition, we calculated 2D PMF profiles of the D2626.58:CG-PLX:N7 distance versus the D972.63:CG-PLX:N3 distance (Figure 2E & F) and the E2887.39:CD-PLX:N4 distance (Figure 2D) to further characterize the intermediate states I1 and I2 conformational states. In the intermediate I1 state, the D2626.58:CG-PLX:N7, D972.63:CG-PLX:N3 and E2887.39:CD-PLX:N4 distances were 13.01, 16.40 and 8.60 Å, respectively (Figure 2D–F). In the intermediate I2 state, the D2626.58:CG-PLX:N7, D972.63:CG-PLX:N3 and E2887.39:CD-PLX:N4 distances were 7.38, 19.54 and 21.49 Å, respectively (Figure 2D–F).
A novel intermediate drug binding site was identified in the CXCR4
The intermediate conformational states I1 and I2 identified from the free energy profiles of GaMD simulations were shown in Figures 3A & B, respectively. Remarkably, polar and charged residues in the ECL2-TM6-TM6 region of the CXCR4 (including residues D187ECL2, D1935.32 and D2626.58) formed favorable interactions with the positively charged nitrogen atoms of PLX. These interactions played a significant role in the recognition and binding of PLX to the CXCR4 receptor.
Figure 3. . Novel intermediate drug binding site in the CXCR4 receptor. Intermediate I1 and I2 conformational states of PLX (red sticks) bound to the CXCR4 receptor (blue ribbons).

The charged nitrogen atoms of PLX are numbered as 1, 3, 4 and 7. (A) Intermediate I1 state with interacting residues highlighted in green sticks, including D187ECL2 and D2626.58 that formed ionic interactions with PLX atoms N4 and N3, respectively. (B) Intermediate I2 state with interacting residues highlighted in green sticks, including D187ECL2, D1935.32 and D2626.58 that formed salt bridges with positively charged N7, N4 and N1 of PLX, respectively.
GaMD: Gaussian accelerated molecular dynamic; PLX: Plerixafor.
In the intermediate I1 conformation, residues D187ECL2 and D2626.58 formed ionic interactions with PLX atoms N4 and N3, respectively (Figure 3A). In the intermediate I2 conformation, residues D187ECL2, D1935.32 and D2626.58 formed salt bridges with positively charged N7, N4 and N1 atoms of PLX, respectively (Figure 3B). Therefore, the two intermediate states of drug–receptor complex share the common recognizing determinants that involve key residues D187ECL2, D1935.32 and D2626.58 of the receptor. Two distinct low-energy states were obtained in the free energy profiles as the symmetric ligand flips its conformation at the same site of the receptor in the ECL2–TM5–TM6, which therefore was considered as a single intermediate drug binding site of the CXCR4.
Pathway of drug binding to the CXCR4 receptor
Next, structural clustering was performed on GaMD simulation frames of the PLX drug to identify the representative binding pathway of PLX to the CXCR4 receptor (see ‘Materials & methods’ section). The structural clusters were also reweighted to obtain their original free energy values which ranged from 0 to approximately 5.5 kcal/mol. The top 35 reweighted clusters were selected to represent the binding pathway of PLX to the CXCR4 receptor as shown in Figure 4. Starting from diffusion in the solvent, PLX binds to an intermediate site between ECL2–TM5–TM6 and then the final orthosteric site in the TM domain. In the top ranked structural cluster, PLX bound to the target orthosteric site. The second lowest-energy cluster was located at the opening of the receptor in the region between ECL2 and the TM5 and TM6 helices, whereas few ligand clusters were observed on the receptor surface near ECL2.
Figure 4. . Binding pathway of the plerixafor drug to the CXCR4 chemokine receptor revealed from Gaussian accelerated molecular dynamic simulations.

Starting from diffusion in the solvent, PLX bound to the target site of the CXCR4 receptor via an intermediate site located between ECL2 and TM V–VI helices. The CXCR4 is shown in blue ribbons. The PLX structural clusters (sticks) are colored by the reweighted PMF free energy values in a green (0 kcal/mol)-white-red (2.5 kcal/mol) color scale.
PLX: Plerixafor; PMF: Potential of mean force; TM: Transmembrane.
Discussion
In this study, molecular docking and all-atom GaMD simulations were applied to determine the target site and mechanism of the PLX drug binding in the CXCR4 receptor. The GaMD simulations successfully captured spontaneous binding of the PLX drug to the receptor orthosteric site. However, it is important to note that the free energy profiles presented here were not converged since the binding of PLX to the receptor orthosteric site was observed only once in one of the five GaMD simulations. It is exceedingly difficult to calculate accurate free energy profiles for such large system of this study even with enhanced sampling. Nevertheless, we observed multiple events of the PLX drug binding to the CXCR4 in the extracellular mouth region (Figure 1B; Supplementary Figure 2 & 3), which has been identified as an allosteric site of many class A GPCRs [40–43]. It has proven challenging to simulate complete binding of GPCR antagonists, especially with a relatively large size like the PLX drug molecule. Previous tens-of-microsecond molecular dynamics (MD) simulations using the Anton supercomputer [44] and accelerated MD simulations [45] were able to capture binding of a similar sized antagonist tiotropium to only the extracellular mouth of the M3 muscarinic receptor, but not to the final ‘orthosteric’ target site deeply buried in the receptor TM domain. Therefore, the GaMD simulations in this study presented an advance for us to capture complete binding of PLX to the orthosteric site of CXCR4. Furthermore, energetic reweighting of the GaMD simulations allowed us to obtain ‘semi-quantitative’ PMF profiles to identify relatively low-energy conformational states of the PLX drug binding to CXCR4. Consistent results were obtained from molecular docking and GaMD simulations, showing that the PLX drug bound to the same IT1t antagonist-binding orthosteric site of CXCR4 (being an antagonist). The docking and simulation findings were further supported by previous mutation experiments, which suggested PLX interacts particularly with residues D2626.58 and E2887.39 in the orthosteric site of CXCR4 [13,25]. Taken together, the computational modeling and experimental data strongly supported our finding that the PLX drug molecule bound to the deeply buried orthosteric site of CXCR4 via an intermediate allosteric site in the receptor extracellular mouth.
The orthosteric binding pocket of the CXCR4 receptor largely consists of negatively charged residues [46] that form salt bridges with the positively charged PLX (Figure 1F). The negatively charged binding pocket is known to have two distinguished major and minor subpockets [46]. The crystal structures of CXCR4 with small molecule antagonist IT1t (PDB: 3ODU) and viral chemokine antagonist vMIP-II (PDB: 4RWS) revealed similar binding sites in the minor subpocket [3,4]. The IT1t and the N-terminus of vMIP-II make polar contacts with residues D972.63 and E2887.39 in the minor subpocket [3]. In contrast, a moderate spatial overlap observed between the N-terminus of vMIP-II and small cyclic peptide antagonist CVX15 (PDB: 3OE0), however, showed polar contacts with the same residues such as D187 from ECL2 and D2626.58 [4]. Our GaMD simulations revealed simultaneous occupancy of PLX in both the major and minor subpockets of the receptor with its two cyclam rings.
PLX is clinically used for hematopoietoc stem cells mobilization to treat cancer patients, however, being a selective inhibitor of CXCR4, has not yet been approved to treat HIV infection due to its side effects [47,48]. In this study, we have explored atomistic drug interactions with the CXCR4 through GaMD simulations. Detailed analysis of the intermediate states I1 and I2 revealed a significant role of the receptor ECL2 and TM5–TM6 region in the recognition and binding of PLX to the CXCR4. Residues D187ECL2, D1935.32 and D2626.58 that formed salt bridge interactions with the PLX were suggested to be the recognizing determinants in the intermediate states of PLX binding.
The bound conformation of PLX captured in this study involved interactions with important negatively charged residues D972.63, D2626.58 and E2887.39 in the orthosteric pocket of the CXCR4 as also seen in the receptor crystal structures bound by the small molecule (IT1t) [4], viral chemokine (vMIP-II) [3] and small cyclic peptide (CVX15) [4] antagonists. Therefore, through the GaMD simulations, we have elucidated the role of PLX to be an antagonist of the CXCR4 receptor.
Conclusion
We have captured spontaneous binding of the PLX drug to the CXCR4 receptor through nonbonded dual-boost GaMD simulations. The GaMD simulations have revealed the mechanism and pathway of drug binding to the CXCR4. The newly identified intermediate binding site involving residues D187ECL2, D1935.32 and D2626.58 in the ECL2–TM5–TM6 region will serve as a novel target site for designing allosteric modulators of the CXCR4. The simulation findings have provided a molecular basis for rational computer-aided drug design against HIV and other diseases associated with the CXCR4 chemokine receptor.
Future perspective
Chemokine receptors are key GPCRs involved in controlling cell migration during immune response, cardiovascular and central nervous system development and are associated with inflammation and cancer. The CXCR4 chemokine receptors facilitate HIV entry into host immune cells and serve as an important therapeutic target against HIV infection. Development of antagonists against the CXCR4 has often caused off-target side effects. Discovery of novel allosteric sites of the receptor has become promising for structure-based design of selective drugs. Together with previous mutation experiments, the molecular docking and GaMD enhanced simulations in this study strongly suggest that the PLX drug molecule is bound to the deeply buried orthosteric site of CXCR4 via an intermediate allosteric site at the receptor extracellular mouth.
While GaMD simulations have successfully captured complete binding of PLX to the deeply buried orthosteric site of CXCR4, the calculated free energy profile is not converged. Further studies are still needed to simulate multiple events of drug binding and even slower unbinding in the future, which is expected to enable accurate free energy calculations. In addition, the newly identified allosteric site will be targeted for computer-aided structure-based design of potent allosteric modulators of the CXCR4. Virtual screening of large compound libraries to identify novel allosteric modulators, followed by in vitro and in vivo experimental validation, will serve as a promising approach for developing effective therapeutics against the CXCR4.
Summary points.
Molecular docking and all-atom simulations using Gaussian accelerated molecular dynamics were combined to determine the mechanism and target site of the plerixafor drug binding in the CXCR4 receptor.
The docking and simulation findings were consistent with previous mutation experiments, suggesting that plerixafor binds to the orthosteric site of CXCR4 as an antagonist.
The Gaussian accelerated molecular dynamics; simulations further revealed an intermediate allosteric binding site at the extracellular mouth involving residues from the ECL2–TM5–TM6 of CXCR4.
The newly identified allosteric site can be targeted for novel drug design targeting CXCR4 and other chemokine receptors.
Supplementary Material
Acknowledgments
We appreciate the preliminary simulations of chemokine receptors by A Podgorny and R Feehan and thank T Handel, I Kufareva and Å Skjevik for valuable discussions.
Footnotes
Supplementary data
To view the supplementary data that accompany this paper please visit the journal website at: www.future-science.com/doi/suppl/10.4155/fmc-2020-0044
Financial & competing interests disclosure
This work used supercomputing resources with allocation award TG-MCB180049 through the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation (grant no. ACI-1548562), and project M2874 through the National Energy Research Scientific Computing Center (NERSC), which is the USA Department of Energy Office of Science User Facility operated under Contract number DE-AC02-05CH11231 and the Research Computing Cluster at the University of Kansas. This work was supported by the National Institutes of Health (R01GM132572) and the startup funding in the College of Liberal Arts and Sciences at the University of Kansas. The authors have no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.
No writing assistance was utilized in the production of this manuscript.
References
- 1.Balkwill F. The significance of cancer cell expression of the chemokine receptor CXCR4. Semin. Cancer Biol. 14(3), 171–179 (2004). [DOI] [PubMed] [Google Scholar]
- 2.Koelink PJ, Overbeek SA, Braber S. et al. Targeting chemokine receptors in chronic inflammatory diseases: an extensive review. Pharmacol. Ther. 133(1), 1–18 (2012). [DOI] [PubMed] [Google Scholar]
- 3.Qin L, Kufareva I, Holden LG. et al. Crystal structure of the chemokine receptor CXCR4 in complex with a viral chemokine. Science 347(6226), 1117–1122 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wu B, Chien EY, Mol CD. et al. Structures of the CXCR4 chemokine GPCR with small-molecule and cyclic peptide antagonists. Science 330(6007), 1066–1071 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zheng Y, Han GW, Abagyan R. et al. Structure of CC chemokine receptor 5 with a potent chemokine antagonist reveals mechanisms of chemokine recognition and molecular mimicry by HIV. Immunity 46(6), 1005–1017.e5 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tan Q, Zhu Y, Li J. et al. Structure of the CCR5 chemokine receptor–HIV entry inhibitor maraviroc complex. Science 341(6152), 1387–1390 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Isberg V, Mordalski S, Munk C. et al. GPCRdb: an information system for G protein-coupled receptors. Nucleic Acids Res. 44(D1), D356–D364 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lolis E, Sachpatzidis A, Dohlman H, Manfredi J. Identification of allosteric peptide agonists of CXCR4. J. Biol. Chem. 278(2), 896–907 (2005). [DOI] [PubMed] [Google Scholar]
- 9.Ehrlich A, Ray P, Luker KE, Lolis EJ, Luker GD. Allosteric peptide regulators of chemokine receptors CXCR4 and CXCR7. Biochem. Pharmacol. 86(9), 1263–1271 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Christopoulos A. Allosteric binding sites on cell-surface receptors: novel targets for drug discovery. Nat. Rev. Drug Discov. 1(3), 198–210 (2002). [DOI] [PubMed] [Google Scholar]
- 11.Fricker SP. Physiology and pharmacology of plerixafor. Transfus. Med. Hemother. 40(4), 237–245 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Allegretti M, Cesta MC, Locati M. Allosteric modulation of chemoattractant receptors. Front. Immunol. 7, 170 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Rosenkilde MM, Gerlach L-O, Jakobsen JS, Skerlj RT, Bridger GJ, Schwartz TW. Molecular mechanism of AMD3100 antagonism in the CXCR4 receptor transfer of binding site to the CXCR3 receptor. J. Biol. Chem. 279(4), 3033–3041 (2004). [DOI] [PubMed] [Google Scholar]
- 14.Miao Y, Feher VA, McCammon JA. Gaussian accelerated molecular dynamics: unconstrained enhanced sampling and free energy calculation. J. Chem. Theory Comput. 11(8), 3584–3595 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Pang YT, Miao Y, Wang Y, McCammon JA. Gaussian accelerated molecular dynamics in NAMD. J. Chem. Theory Comput. 13(1), 9–19 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hamelberg D, Mongan J, McCammon JA. Accelerated molecular dynamics: a promising and efficient simulation method for biomolecules. J. Chem. Phys. 120(24), 11919–11929 (2004). [DOI] [PubMed] [Google Scholar]
- 17.Shen T, Hamelberg D. A statistical analysis of the precision of reweighting-based simulations. J. Chem. Phys. 129(3), 034103 (2008). [DOI] [PubMed] [Google Scholar]
- 18.Pang YT, Miao Y, Wang Y, McCammon JA. Gaussian accelerated molecular dynamics in NAMD. J. Chem. Theory Comput. 13(1), 9–19 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Miao Y, McCammon JA. Graded activation and free energy landscapes of a muscarinic G-protein–coupled receptor. PNAS 113(43), 12162–12167 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Bhattarai A, Wang J, Miao Y. G‐protein‐coupled receptor–membrane interactions depend on the receptor activation state. J. Comput. Chem. 41(5), 460–471 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wang J, Miao Y. Mechanistic insights into specific G protein interactions with adenosine receptors. J. Phys. Chem. 123(30), 6462–6473 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Miao Y, McCammon JA. Mechanism of the G-protein mimetic nanobody binding to a muscarinic G-protein-coupled receptor. PNAS 115(12), 3036–3041 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.East KW, Newton JC, Morzan UN. et al. Allosteric motions of the CRISPR-Cas9 HNH nuclease probed by NMR and molecular dynamics. JACS 142(3), 1348–1358 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ricci CG, Chen JS, Miao Y. et al. Deciphering off-target effects in CRISPR-Cas9 through accelerated molecular dynamics. ACS Cent. Sci. 5(4), 651–662 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Gerlach LO, Skerlj RT, Bridger GJ, Schwartz TW. Molecular interactions of cyclam and bicyclam non-peptide antagonists with the CXCR4 chemokine receptor. J. Biol. Chem. 276(17), 14153–14160 (2001). [DOI] [PubMed] [Google Scholar]
- 26.Miao Y. Acceleration of biomolecular kinetics in Gaussian accelerated molecular dynamics. J. Chem. Phys. 149(7), 072308 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Miao Y, McCammon JA. Gaussian accelerated molecular dynamics: theory, implementation, and applications. : Annual Reports in Computational Chemistry. David AD Robert RC. (). Elsevier, KS, USA, 231–278 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Miao Y, Sinko W, Pierce L, Bucher D, Walker RC, McCammon JA. Improved reweighting of accelerated molecular dynamics simulations for free energy calculation. J. Chem. Theory Comput. 10(7), 2677–2689 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Release 2020–1. Maestro. Schrödinger, LLC. NY, USA: (2020). [Google Scholar]
- 30.Morris GM, Huey R, Lindstrom W. et al. AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J. Comput. Chem 30(16), 2785–2791 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lee J, Cheng X, Swails JM. et al. CHARMM-GUI input generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM simulations using the CHARMM36 additive force field. J. Chem. Theory Comput. 12(1), 405–413 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Jo S, Kim T, Iyer VG, Im W. CHARMM-GUI: a web-based graphical user interface for CHARMM. J. Comput. Chem 29(11), 1859–1865 (2008). [DOI] [PubMed] [Google Scholar]
- 33.Wu EL, Cheng X, Jo S. et al. CHARMM-GUI membrane builder toward realistic biological membrane simulations. J. Comput. Chem 35(27), 1997–2004 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Huang J, Rauscher S, Nawrocki G. et al. CHARMM36m: an improved force field for folded and intrinsically disordered proteins. Nat. Methods 14(1), 71–73 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Vanommeslaeghe K, Raman EP, MacKerell Jr AD. Automation of the CHARMM general force field (CGenFF) II: assignment of bonded parameters and partial atomic charges. J. Chem. Inf. Model. 52(12), 3155–3168 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Vanommeslaeghe K, MacKerell Jr AD. Automation of the CHARMM general force field (CGenFF) I: bond perception and atom typing. J. Chem. Inf. Model. 52(12), 3144–3154 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Case D, Ben-Shalom I, Brozell S. et al. AMBER 18. (2018). https://ambermd.org/CiteAmber.php
- 38.Humphrey W, Dalke A, Schulten K. VMD: visual molecular dynamics. J. Mol. Graph. 14(1), 33–38 (1996). [DOI] [PubMed] [Google Scholar]
- 39.Roe DR, Cheatham III TE. PTRAJ and CPPTRAJ: software for processing and analysis of molecular dynamics trajectory data. J. Chem. Theory Comput. 9(7), 3084–3095 (2013). [DOI] [PubMed] [Google Scholar]
- 40.Dror RO, Green HF, Valant C. et al. Structural basis for modulation of a G-protein-coupled receptor by allosteric drugs. Nature 503(7475), 295–299 (2013). [DOI] [PubMed] [Google Scholar]
- 41.Kruse AC, Ring AM, Manglik A. et al. Activation and allosteric modulation of a muscarinic acetylcholine receptor. Nature 504(7478), 101–106 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Miao Y, Bhattarai A, Nguyen ATN, Christopoulos A, May LT. Structural basis for binding of allosteric drug leads in the adenosine A1 receptor. Sci. Rep. 8(1), 16836 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Miao Y, McCammon JA. G-protein coupled receptors: advances in simulation and drug discovery. Curr. Opin. Struc. Biol. 41, 83–89 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kruse AC, Hu J, Pan AC. et al. Structure and dynamics of the M3 muscarinic acetylcholine receptor. Nature 482(7386), 552 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Kappel K, Miao Y, McCammon JA. Accelerated molecular dynamics simulations of ligand binding to a muscarinic G-protein coupled receptor. Q. Rev. Biophys. 48(04), 479–487 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Roumen L, Scholten D, de Kruijf P, de Esch I, Leurs R, de Graaf C. C (X) CR in silico: computer-aided prediction of chemokine receptor–ligand interactions. Drug Discovery Today: Technol. 9(4), e281–e291 (2012). [DOI] [PubMed] [Google Scholar]
- 47.Woollard SM, Kanmogne GD. Maraviroc: a review of its use in HIV infection and beyond. Drug Des. Dev. Ther. 9, 5447 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Grande F, Giancotti G, Ioele G, Occhiuzzi MA, Garofalo A. An update on small molecules targeting CXCR4 as starting points for the development of anti-cancer therapeutics. Eur. J. Med. Chem. 139, 519–530 (2017). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
