Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Jun 1.
Published in final edited form as: J Comput Aided Mol Des. 2019 May 23;33(6):573–595. doi: 10.1007/s10822-019-00196-x

A Benchmark Study of Loop Modeling Methods Applied to G Protein-Coupled Receptors

Lee H Wink 1, Daniel L Baker 1, Judith A Cole 2, Abby L Parrill-Baker 1,*
PMCID: PMC6628340  NIHMSID: NIHMS1039997  PMID: 31123958

Abstract

G protein-coupled receptors (GPCR) are important drug discovery targets. Despite progress, many GPCR structures have not yet been solved. For these targets, comparative modeling is used in virtual ligand screening to prioritize experimental efforts. However, the structure of extracellular loop 2 (ECL2) is often poorly predicted. This is significant due to involvement of ECL2 in ligand binding for many Class A GPCR. Here we examine the performance of loop modeling protocols available in the Rosetta (Cyclic Coordinate Descent [CCD], KIC with Fragments [KICF] and Next Generation KIC [NGK]) and Molecular Operating Environment (MOE) software suites (de novo search). ECL2 from GPCR crystal structures served as the structure prediction targets and were divided into four sets depending on loop length. Results suggest that KICF and NGK sampled and scored more loop models with sub-angstrom and near-atomic accuracy than CCD or de novo search for loops of 24 or fewer residues. None of the methods were able to sample loop conformations with near-atomic accuracy for the longest targets ranging from 25 – 32 residues based on 1000 models generated. For these long loop targets, increased conformational sampling is necessary. The strongly conserved disulfide bond between Cys3.25 and Cys45.50 in ECL2 proved an effective filter. Setting an upper limit of 5.1 Å on the S-S distance improved the lowest RMSD model included in the top 10 scored structures in Groups 1–4 on average between 0.33 and 1.27 Å. Disulfide bond formation and geometry optimization of ECL2 provided an additional incremental benefit in structure quality.

Keywords: Comparative modeling, homology modeling, loop modeling, GPCR

Introduction

G protein-coupled receptors (GPCR) comprise one of the largest families of integral membrane proteins in eukaryotes. GPCR serve central roles in amplifying and regulating a wide range of intracellular responses to extracellular stimuli. In response to ligand activation, GPCR undergo conformational changes that influence coupling with intracellular partners, including heterotrimeric G proteins (Gα, Gβ, Gγ subunits), β-arrestins, G protein-coupled receptor kinases (GRK), and other effectors.13 It is estimated that between 27–50% of FDA-approved drugs interact with GPCR targets including classes of drugs such as beta-blockers, antihistamines, and antipsychotics.4,5

All GPCR share a common topology featuring a core bundle of seven transmembrane (TM) α-helices with the N-termini and C-termini located on the extracellular and intracellular sides of the cell membrane, respectively. The extracellular and intracellular loops (ECL and ICL) are the protein segments that connect adjacent TM domains (ECL1–3, ICL1–3). The ECL and ICL of GPCR have lower sequence and structural conservation than the TM domains. With respect to the available known GPCR structures (mostly Class A GPCR), ECL2 is generally the longest and most diverse in terms of amino acid identity and three-dimensional structure (Fig 1). Despite low sequence conservation, an overwhelming majority of GPCR contain a disulfide bond between highly conserved cysteine residues in ECL2 and the extracellular end of TM3. Based on the analysis of 367 GPCR sequences representing members from Class A, B1, B2, C, and F that were downloaded using the GPCRdb alignment tools,6,7 89% (327 out of 367) of sequences contain the conserved cysteine residues in ECL2/TM3. Also, the disulfide bond between the cysteine sidechains is observable in 94% (47 out of 50) of representative crystal structures of unique GPCR (as of May 2018). Three lipid receptors (LPAR1, S1PR1, CB1) that have known crystal structures lack the conserved disulfide bond between ECL2/TM3. Instead, they contain an intra-loop disulfide bond that constrains the loop conformation

Fig. 1. Extracellular loop comparison of seventeen superposed GPCR crystal structures.

Fig. 1

The PDB IDs of the superposed GPCR crystal structures above are as follows: Bovine RHO (1GZM, green), B2AR (2RH1, orange), B1AR(2VT4, magenta), Squid RHO (2Z73, gold), AA2AR (3EML, cyan), DRD3 (3PBL, pink), H1R (3RZE, maroon), ACM2 (3UON, light blue), CXCR4 (3ODU, brown), S1PR1 (3V2Y, dark green), PAR1 (3VW7, yellow), NTR1 (4GRV, blue), ACM3 (4DAJ, light brown), OPRK (4DJH, purple), OPRM (4DKL, dark blue), NOP (4EA3, salmon), OPRD (4EJ4, dark brown).

For many GPCR, ECL2 plays important roles in GPCR activation, orthosteric ligand binding, and allosteric ligand interactions.8,9 Mutagenesis experiments on ECL2 of the complement C5a receptor and thrombin receptor resulted in constitutively active GPCR.10,11 These findings suggest that in some GPCR, ECL2 functions as a negative regulator that dampens signaling by restricting the transition to active receptor states in the absence of endogenous ligand. However, ECL2 mutagenesis doesn’t uniformly confer constitutive GPCR activity. For example, the A204E mutation in ECL2 of the ghrelin receptor resulted in diminished constitutive activity.12 Thus, it is generally understood that ECL2 plays a role in GPCR function, but many of the details are receptor-dependent.

Given that the binding pocket features of closely-related GPCR are relatively similar, there must be other structural aspects that give rise to observed differences in receptor-ligand specificity. Indeed, the diversity among ECL2 amino acid sequences and structural features even between closely-related GPCR contributes to receptor-specific ligand interactions.13 Table 1 shows examples of GPCR crystal structures that display a range of receptor-ligand interactions with a variable number of ECL2 residues.

Table 1. GPCR structures with ECL2 contacts to the crystallized ligand.

Cys 45.50 indicates the conserved ECL2 residue that forms a disulfide bond with Cys 3.25. ECL2 residue interactions are indicated by the following abbreviations: Hydrophobic (HYD), Backbone H-bond (BBH), Sidechain H-bond (SCH) Aromatic (ARO), Charge-assisted H-bond (CAH+/−). Each set of ECL2 interactions were compiled from the pre-calculated GPCR-ligand interaction tool on GPCRdb.14

GPCR PDB ID Cys 45.50 ECL2 Residue Interactions with Crystallized Ligand
DRD3 3PBL15 C181 1183, HYD
CXCR4 3ODU16 C186 C186, BBH
LPAR1 4Z3517 N/A M198, HYD
S1PR1 3V2Y18 N/A V194, BBH
CB1 5U0919 N/A F268, HYD
P2Y12R 4PXZ20 C175 C175, BBH; K179, CAH+
SMO 4JKV21 C390 V386, HYD; Y394, ARO
RHO 2Z7322 C186 F188, HYD
B2AR 2RH123 C191 F193, ARO
PAR2 5NDD24 C226 C226, HYD; H227, ARO; D228, CAH−
PAR1 3VW725 C254 L258, BBH; L262, HYD
APJR 5VBL26 C181 L173, HYD; N175, N177, BBH; T176, SCH
FFAR1 5TZR27 C170 L158, HYD; W174, HYD
AA1R 5UEN28 C169 F171, ARO
AA2AR 4EIY29 C166 F168, ARO; E169, CAH−

Template-based modeling methods rely on an underlying structural similarity which is often lacking when comparing ECL2 between members of the GPCR family. GPCR loop segments display low sequence conservation relative to the TM domains and tend to exhibit variable lengths which inevitably introduces gaps in GPCR sequence alignments. Furthermore, there are instances of loops with low structural similarities despite having relatively high sequence identities by homology modeling standards. For reliable homology-based prediction of ligand-receptor interactions, the pairwise percentage identity of target-template sequence identity is suggested to be around 35–40% based on the GPCR Dock assessments from 2008 and 2010.30,31 For example, the dopamine receptor D2 (DRD2) homology modeling target shares 66.9% and 36.8% sequence identity overall with the GPCR templates, DRD3 (PDB: 3PBL) and 5HT2C (PDB: 6BQH), respectively. While these templates certainly meet or exceed the suggested 35–40% sequence identity threshold for reliable homology modeling, the DRD3 and 5HT2C crystal structures would be poor templates for the ECL2 segment of the DRD2 target (Fig. 2).

Fig. 2. ECL2 comparison of GPCR that share high percentage sequence identity.

Fig. 2

The DRD3 (3PBL, orange), DRD2 (6CM4, cyan), and 5-HT2C (6BQH, magenta) crystal structures are shown to highlight the structurally variable ECL2 segments.

Although not a comprehensive list, Table 2 shows several GPCR with more than two crystal structures available at the beginning of this study. The dynamic nature of these loops is apparent in these sets of superposed crystal structures. The ECL2 Cα atoms in different structures of a single protein have root mean square deviation (RMSD) values ranging from 0.3 – 1.5 Å. Traditionally the gold standard of structure prediction is achieving top-ranking models with sub-angstrom accuracy (Cα RMSD under 1.0 Å) to the reference “native” structure. However, ECL2 experimental variability in different crystal structures of the same GPCR can exceed this value. For the ECL2 targets in this benchmark, it is more reasonable to consider methods that produce top-scoring models with near-atomic accuracy (Cα RMSD within 2.5 Å) as the threshold for success.115 Often, models with near-atomic accuracy are sufficient for applications downstream of modeling.116

Table 2: GPCR with more than 2 crystal structures.

The ECL2 residues of each set of GPCR PDBs were superposed and the Cα RMSD values were calculated.

GPCR PDB IDs ECL2
Residues
ECL2
RMSD
GLR 4L6R, 5EE7, 5XEZ, 5XF13234 F289 - N300 1.29 Å
GLP1R 5NX2, 5VEW, 5VEX, 6B3J3537 Y291 - N302 1.52 Å
ACM2 3UON, 4MQS, 4MQT38,39 V168 - S182 0.76 Å
ACM3 4U14, 4U15, 4U16, 4DAJ40,41 K212 - S226 0.46 Å
CXCR4 3ODU, 3OE0, 3OE6, 3OE8, 3OE9, 4RWS16,42 N176 - N192 0.81 Å
LPAR1 4Z34, 4Z35, 4Z3617 W186 - S203 0.28 Å
CB1 5XRA, 5XR8, 5TGZ, 5U0919,43,44 W255 - D272 0.64 Å
P2Y12R 4NTJ, 4PXZ, 4PY020,45 T163 - S180 0.93 Å
SMO 4JKV, 4O9R, 4QIN, 4QIM, 4N4W, 5L7I, 5L7D21,4648 A379 - N396 0.40 Å
OPRD 4N6H, 4RWD, 4RWA49,50 A187 - P205 0.43 Å
NOP 4EA3, 5DHG, 5DHH51,52 T207 - P224 0.75 Å
AT2R 5UNG, 5UNF, 5UNH53 R182 - E202 0.59 Å
NTR1 3ZEV, 4GRV, 4XEE, 4XES, 4BWB, 4BV0, 4BUO, 5T045457 M208 - D230 0.75 Å
B1AR 2Y00, 2Y01, 2Y02, 2Y03, 2Y04, 2YCW, 2YCX, 2YCY, 2YCZ, 2VT4, 3ZPR, 3ZPQ, 4AMI, 4AMJ, 4BVN, 4GPO, 5F8U, 5A8E5865 W181 - T203 0.43 Å
B2AR 2RH1, 2R4R, 2R4S, 3D4S, 3NY8, 3NYA, 3NY9, 3KJ6, 3P0G, 3PDS, 3SN6, 4LDE, 4LDL, 4LDO, 4GBR, 4QKX, 5JQH, 5D5B, 5D5A, 5D6L23,6678 W173 - N196 0.40 Å
ETBR 5GLI, 5GLH, 5X93, 5XPR79,80 F240 - T263 1.14 Å
RHO 1U19, 1GZM, 1L9H, 1F88, 1HZX, 2G87, 2PED, 2J4Y, 2HPY, 2I35, 2I36, 2I37, 2X72, 3OAX, 3CAP, 3DQB, 3PQR, 3PXO, 3C9M, 3C9L, 4X1H, 4J4Q, 4BEZ, 4BEY, 4PXF, 4A4M, 5TE3, 5TE5, 5DYS, 5EN081103 0.65 Å
AA2AR 2YDV, 2YDO, 3REY, 3PWH, 3RFM, 3EML, 3QAK, 3UZC, 3UZA, 3VGA, 3VG9, 4EIY, 4UHR, 4UG2, 5IUB, 5IU8, 5IUA, 5IU4, 5IU7, 5G53, 5UIG, 5K2A, 5K2B, 5K2C, 5K2D104114 G142 - P173 0.95 Å

Ab initio loop modeling can be described as a mini-protein folding problem with success largely depending on two general components: sampling and scoring. An extensive search to sample loop conformational space is implemented with the target sequence. Also, a method to evaluate or score the loop model conformations in order to select near-native structures (close to reference structure) is necessary. The prediction involves generating model loop structures and ranking them based on some criteria (i.e. energy functions as in Rosetta and MOE) that desirably correlate with experimentally determined structures. Our primary research question at the start of this benchmark study was, which of the available modeling software methods can accurately sample and score near-native loop models?

Methods

The reference GPCR crystal structures used in this benchmark for ECL2 modeling are listed in Table 3. Crystal structures with poorly resolved or completely missing residues in the ECL2 segment were excluded from the benchmark. The highest resolution structures were chosen as references for individual GPCR that had multiple crystal structures available. The preferred chain of each benchmark target PDB file was extracted using the GPCRdb structure browser application for further preparation.

Table 3. GPCR reference structures used in loop modeling benchmark.

The reference structures were divided into four categories, Groups 1–4, of increasing expected difficulty based on loop lengths.

Group GPCR PDB ID /
Resolution (Å)
ECL2
Residues
ECL2
Length
ECL2 2° Structural
Features
1 GLR 5XF134 / 3.2 F289 - N300 12 none
GLP1R 5NX235 / 3.7 Y291 - N302 12 3–10 helix
DRD3 3PBL15 / 2.9 F172 - N185 14 none
CRFR1 4Z9G117 / 3.2 D254 - T268 15 3–10 helix
ACM4 5DSG118 / 2.6 K177 - N192 16 3–10 helix
US28 4XT1119 / 2.9 V166 - E181 16 β-hairpin
CXCR4 3ODU16 / 2.5 N176 - N192 17 β-hairpin
2 LPAR1 4Z3517 /2.9 W186 - S203 18 3–10 helix
S1PR1 3V2Y18 / 2.8 W182 - H199 18 3–10 helix
CB1 5U0919 / 2.6 W255 - D272 18 α-helix
P2Y12R 4PXZ20 / 2.5 T163 - S180 18 3–10 helix
SMO 4JKV21 / 2.5 A379 - N396 18 β-hairpin
OPRD 4N6H49 / 1.8 A187 - P205 19 β-hairpin
CCR5 5UIW120 / 2.2 T167 - S185 19 β-hairpin
3 P2Y1R 4XNV121 / 2.2 S190 - E209 20 β-hairpin
AT2R 5UNG53 / 2.8 R182 - E202 21 β-hairpin
RHO 2Z7322 / 2.5 G173 - D194 22 β-hairpin
GRM1 4OR2122 / 2.8 E728 - S749 22 β-hairpin
NTR1 4XES56 / 2.6 M208 - D230 23 β-sheet, 3–10 helix
B2AR 2RH123 / 2.4 W173 - N196 24 α-helix
ETBR 5GLI79 / 2.5 F240 - T263 24 β-hairpin
4 PAR2 5NDD24 / 2.8 V211 - L235 25 β-hairpin
PAR1 3VW725 / 2.2 L239 - E264 26 β-hairpin
APJR 5VBL26 / 2.6 R168 - S193 26 β-hairpin
OX1R 4ZJ8123 / 2.8 M183 - D209 27 β-hairpin
FFAR1 5TZR27 / 2.2 A146 - D175 30 β-sheet
AA1R 5UEN28 / 3.2 G145 - S176 32 (2) α-helices
AA2AR 4EIY29 / 1.8 G142 - P173 32 (2) α-helices

The Molecular Operating Environment (MOE version 2016.08) software package was used for GPCR structure preparation and visualization. The Rosetta software suite (Rosetta version 3.8) and MOE were used for modeling the ECL2 segment of the benchmark reference structures. For each of the GPCR in the benchmark, the Robetta fragment server was used to generate the 9mer and 3mer fragment library files that are necessary for Rosetta CCD and KICF loop modeling methods.124 Fragment generation for each reference GPCR excluded any PDB data from the GPCR crystal structure itself, as verified by manual inspection of PDB ID codes in each fragment file.

Structure preparation for GPCR targets using MOE

The benchmark GPCR structures listed in Table 3 were downloaded from the PDB125 and prepared for loop modeling with MOE. Ligands and water molecules were deleted from the structure files. For each GPCR structure, the “QuickPrep” process was used to streamline the structure preparation process. QuickPrep corrects any structural issues (i.e. residues with alternate locations, missing atoms, chain breaks, etc.) that often accompany structural data, adds explicit hydrogens and partial charges with “Protonate 3D,” and performs a tethered-receptor energy minimization using the Amber12EHT126128 forcefield (RMS gradient of 0.1 kcal/mol/Å). The final energy minimization step was performed to improve any inaccurate geometries derived from the crystallographic data. During the minimization process, the receptor atoms were tethered to ensure that changes to the initial positions are modest.

Description of loop modeling software

Rosetta loop modeling protocols can be used for loop refinement or loop reconstruction. This study only implements modeling in the context of loop reconstruction – ab initio/de novo prediction of the “native” loop conformation based on amino acid sequence, but the initial backbone and sidechain conformations were discarded prior to modeling. Loop refinement, on the other hand, is utilized in finding lower energy conformations starting from a given loop conformation that is potentially close to the “native” structure.

The sampling process was implemented in two stages with iterations of Monte Carlo simulated annealing: An initial low-resolution/coarse-grained stage where the sidechain atoms were represented as “centroids” and a high-resolution/full-atom stage where the sidechain atoms were explicitly represented. This process was coupled with one of several scoring functions. Fig. 1 shows a schematic overview of the general Rosetta loop modeling process. The available loop modeling algorithms in Rosetta differ in conformational search (sampling) strategy and solutions to the loop closure problem.

The Cyclic Coordinate Descent (CCD) algorithm proceeds by optimizing the dihedral angles through consecutive loop residues from the N- to C-terminus where the goal is to minimize the distance between the free C-terminus end of the loop and the fixed anchor position.129,130 The CCD algorithm in Rosetta uses experimentally-derived fragment libraries to guide the conformational search during loop modeling. The fragment libraries contain the coupled phi/psi dihedrals of peptide segments with 9 and 3 residues (9mers, 3mers) from the PDB.

The Kinematic Closure (KIC) method selects 3 pivot atoms (remaining loop backbone atoms are designated non-pivot) and divides the loop into two segments for conformational sampling of the non-pivot phi, psi dihedral angles. Subsequently, the pivot dihedral angles (6 phi, psi angles for 3 pivots) were analytically solved to position each rigid segment for loop closure.131 The standard KIC protocol for loop modeling has subsequently been replaced by the Next Generation KIC (NGK) and KIC with Fragments (KICF) methods.

The Next Generation KIC (NGK) algorithm employs intensification strategies during non-pivot conformational sampling in both low- and high-resolution stages of the loop modeling process.132 In the high-resolution stage, NGK implements additional annealing strategies that modulate the energy function to overcome large energy barriers. The intensification strategies involve 1) using neighbor-dependent Ramachandran distributions (Rama2b term) to select phi/psi dihedral combinations during sampling and 2) independently sampling ω angles based on observations in high-resolution crystal structures. Traditionally, the planar character of the peptide bond restricts the ω dihedral angle to either 180° for the common trans-configuration or 0° for the less common cis-configuration. However, analyses of high-resolution protein structures concluded that trans peptide ω values can vary by more than 25° from planarity in some cases, and that the non-planar character of peptide bonds are more common than previously known.133 During the NGK method, ω sampling is performed independently of the phi/psi dihedrals from a Gaussian around the observed mean of 179.1° ±6.3.132 The annealing strategies implemented in the NGK method involve ramping the weights of 1) the repulsive component of the Lennard-Jones potential and 2) the Rama score (distinct from Rama2b term used in intensification strategy), which is the likelihood of a phi/psi combination occurring given an amino acid type. While the intensification strategies (Rama2b and ω sampling) are applied in both low- and high-resolution stages of loop modeling, the annealing strategies are only implemented in the high-resolution stage. Overall, these intensification and annealing strategies were found to greatly improve loop modeling accuracy compared to the standard KIC method in a previous benchmark.132

The KIC with Fragments (KICF) method combines the fragment library sampling strategy from the CCD method with the KIC loop closure method. The main difference between this method and the NGK method is the way in which loop backbone conformations are sampled. The fragment-based sampling of phi/psi/omega dihedral angles consists of four major steps: (1) one of the given fragment libraries is selected at random and searched for alignment frames where fragments overlap with subsegments of the loop; (2) one of the alignment frames and fragments within that frame is selected at random; (3) the phi/psi/omega dihedral angles of that fragment are applied to the loop subsegment; and finally (4) kinematic closure (KIC) calculations are performed to achieve loop closure.

The Rosetta all-atom energy function to evaluate/score biomolecular structures and models has evolved over many versions (Score12, Talaris2013, Talaris2014), with the Rosetta Energy Function 2015 (REF2015) becoming the default scoring function as of July 2017.134137 However, the most recently available scoring function (REF2015) and loop modeling algorithm, (KICF), had not been tested on the 12-residue, or 14–17 residue loop modeling benchmark sets used by Rosetta developers (as of May 2018).

The MOE Loop Modeler application has a de novo search method and a PDB search method for sampling potential loop backbone conformations. For this study, only the de novo search method was used to model the ECL2 of the benchmark GPCR. MOE loop modeling protocols also consisted of distinct low-/high-resolution stages for loop modeling. The initial de novo search stage only deals with the loop backbone atoms, generating potential loop conformations that were ranked by an initial coarse scoring function before advancing to the full-atom stage. Fig. 4 shows a schematic overview of the general MOE loop modeling process.

Fig. 4. Overview schematic of MOE loop modeling process.

Fig. 4

MOE loop modeling uses an extension of the CCD algorithm, Full CCD (FCCD).138 This method differs from CCD by solely operating on the Cα backbone atoms with pseudo bond angles and dihedral angles, optimizing both terms to achieve loop closure. Probability densities calculated from high-resolution PDB structures resulted in specific profiles for the Cα pseudo bond and dihedral angles. These profiles were used for random sampling of loop Cα conformational space during the de novo search stage, followed by FCCD loop closure.

After loop closure by FCCD, it is necessary to optimize the Cα backbone atoms. This is accomplished by using a component of the PULCHRA (Protein Chain Reconstruction Algorithm) method which employs a steepest descent gradient minimization and a simple harmonic potential to optimize the Cα positions before full backbone reconstruction.139 To reconstruct backbone atom positions from the Cα loop traces generated, MOE uses the BBQ (Backbone Building from Quadrilaterals) method which is based on proximal distance geometries for sets of four sequential Cα atoms in the loop. Additionally, MOE backbone packer performs a minimization to relieve any strained backbone geometries and atom clashes. This step is followed by a final geometry and duplicate check to ensure that non-redundant backbone conformations with reasonable bond and dihedral angles are being evaluated by the coarse scoring function. The top-ranking loop backbone conformations are advanced to the full-atom stage.

In the full-atom stage, sidechain atoms were added to the loop and optimized with respect to the sidechain orientations. The entire loop segment was energy minimized through multiple steps before the final scoring step. The full-atom loop conformations generated were scored using GBVI (Generalized-Born Volume Integral). The potential energy of the system using GBVI has been shown to recover loop conformations close to the native from the Jacobson Loop Decoy Dataset.140

Rosetta and MOE loop modeling methods

A total of 1000 ECL2 models were generated for the GPCR benchmark targets after structure preparation using each loop modeling method. All 28 reference structures in the benchmark were reconstructed with the Rosetta methods discussed previously, but only 7 of the 28 reference structures were modeled with MOE. This was due to non-competitive performance on the shorter ECL2 targets which will be discussed in the results section.

The following descriptions are the main Rosetta command line options used to perform NGK, KICF, and CCD loop modeling methods in low-resolution and full-atom stages. The remodel stage was the term for the initial coarse-grained modeling step. The remodel stage samples loop backbone conformations using a reduced representation of amino acid sidechains and a Rosetta low-resolution scoring function. This was initiated by the following options associated with the ‘-loops:remodel’ command: ‘perturb_kic’, ‘perturb_kic_with_fragments’, ‘perturb_ccd’. A loop definition file was separately generated for each reference GPCR to define the residues of the loop (ECL2) to be remodeled. By enabling the “extend loop” field in the loop files, the target loop segment’s bond lengths, angles, and omega torsions were idealized, and all phi/psi values were replaced randomly from Ramachandran space to give an initial closed conformation at the start of remodel stage. This was to ensure that loop reconstruction wasn’t influenced by the initial loop conformation. Subsequently, the loop phi/psi dihedrals were sampled using the fragment data described previously, followed by KIC or CCD calculations to achieve loop closure. Finally, the loop underwent minimization using the BFGS (Broyden-Fletcher-Goldfarb-Shanno) algorithm and a Metropolis criterion acceptance test using the Rosetta low-resolution scoring function, score4L. The number of Monte Carlo steps in both stages of loop modeling was determined by the number of outer and inner cycles, (outer_cycles * inner_cycles). The default number of outer (5) and inner cycles (evaluated by: min(1000, number_of_loop_residues * 20)) were used for all Rosetta loop modeling jobs in this benchmark study. Loop poses were set to the lowest energy conformation evaluated at the end of each outer cycle. The temperature decreased exponentially from 1.5 KT to 0.5 KT from the first step to the last. The refine stage was the term for the all-atom loop modeling step and was activated by the following options associated with the ‘-loops:refine’ command: ‘refine_kic’, ‘refine_kic_with_fragments’, ‘refine_ccd’. This stage implements a similar scheme to the perturb stage, with major differences in the all-atom treatment of the loop during conformational sampling and the scoring function used to guide and evaluate loop conformations during model production. Out of the three Rosetta loop modeling algorithms used in this benchmark, NGK was the quickest among the more accurate methods at generating a given set of loop models. Therefore, if subsequent sets of ECL2 models of benchmark targets were produced, loop modeling was performed initially with the NGK algorithm.

The following MOE loop modeling options were used during the de novo search stage, full-atom model generation, and final model scoring. The loop sequence for each GPCR ECL2 was selected in the sequence editor window and the following options were provided in the main MOE Loop Modeler window: only the de novo search method was enabled, the default RMSD limit of 0.50 was decreased to 0.25 Å, the max iterations and energy window were set to 1000 and 10, respectively. The number of de novo search runs and final models built were set to 1000 total. Subsequently, SVL batch files were created to run the loop modeler jobs on a high-performance computing cluster. Due to the molecular database (.mdb files) storage limitations, 10 sets of 100 final models or 20 sets of 50 final models were generated.

Filtering and Optimization

The majority of GPCR share a conserved disulfide bond between Cys45.50 in ECL2 and Cys3.25 at the top of the third transmembrane domain. In the 28-receptor benchmark set used in this study, 25 of the receptors share this structural constraint. We assessed the utility of using the S-S distance as a filter for improving ECL2 models. Distance-based filtering used a cut-off of 5.1 Å. This distance is approximately double the van der Waals contact distance for sulfer and was selected to avoid the possibility of selecting loop models in which atoms occur between the two sulfur atoms expected to form a disulfide bond. Disulfide bonds were constructed in the top 10 scored models meeting this distance cutoff followed by geometry optimization of the loop using the MOE software.

Results and Discussion

Loop modeling performance throughout this benchmark study was assessed by comparing de novo models of ECL2 for proteins from Table 3 to the crystallographic reference structures indicated in the same table. Superpositions were performed for residues not modeled de novo before calculation of (Cα) RMSD values. The metrics reported throughout this section include: lowest RMSD model (LRM), top scored model (T1), lowest RMSD model in the top 10 scored (T10), and lowest RMSD model in the top 25 scored (T25).

Rosetta Scoring Function Comparison

To decide which scoring function to use with Rosetta loop modeling, the NGK loop modeling algorithm was used to sample ECL2 conformations for the first 4 targets in Group 1 (Table 3) using both the Rosetta Energy Function 2015 (REF2015) and its predecessor, Talaris 2014. The LRM results show that models meeting the near-atomic accuracy metric of 2.5 Å were sampled in each run (Fig. 5). However, when modeling loops of unknown structure, the T1 is more relevant than the LRM. In 3 out of 4 cases, REF2015 improved reference structure prediction accuracy (lower RMSD) for the top scored models. These data suggest that the REF2015 scoring function is more suitable for identifying models closer to the reference ECL2 structures in the benchmark. Furthermore, the most recent energy function has been parametrized to estimate energies in units of kilocalories per mole, whereas all previous Rosetta energy functions used arbitrary units.137 Therefore, REF2015 was used in the Rosetta loop modeling protocols for all benchmark targets and method comparisons.

Fig. 5. Energy function comparison with Rosetta NGK loop modeling.

Fig. 5

Comparisons of the energy function influence on loop modeling performance is shown for the shortest GPCR ECL2 targets in the benchmark. The lowest RMSD models (LRM), top scored models (T1), lowest RMSD models in the top 10 scored (T10), and lowest RMSD models in the top 25 scored (T25) are shown from 1000 models generated for each ECL2 target using NGK loop modeling. The number in parentheses represents the ECL2 length.

Group 1 Loop Modeling Results

Loop modeling was performed using three algorithms in Rosetta and one algorithm in MOE for the Group 1 ECL2 targets from Table 3. The results from Group 1 in Fig. 6a show that the NGK and KICF methods were able to sample models with better accuracy than either CCD or MOE based on the RMSD of the LRM to the reference ECL2 structures. Notably, ECL2 of the muscarinic acetylcholine receptor M4 (ACM4, Fig. 7) and dopamine receptor D3 (DRD3) targets were modeled with sub-angstrom accuracy using both the NGK (LRM = 0.34, 0.71 Å, respectively) and KICF (LRM = 0.35, 0.76 Å, respectively) algorithms. For the ECL2 of CXCR4 (Fig. 8), loop modeling using KICF also displayed sub-angstrom accuracy for the LRM (0.50 Å). However, none of the targets were modeled with sub-angstrom accuracy using the CCD algorithm or MOE. Six out of the seven ECL2 targets in Group 1 were modeled with near-atomic accuracy (RMSD ≤ 2.5 Å) using the NGK and CCD algorithms. Additionally, all seven of the targets were modeled with near-atomic accuracy using the KICF algorithm. From the Group 1 targets, there were only two cases where the LRM from MOE had RMSD values below 2.5 Å to the reference structure. The average LRM values for the NGK, KICF, and CCD algorithms were 1.66, 1.09, and 2.07 Å, respectively. In comparison, the average LRM using the MOE loop modeling algorithm was much higher, 3.37 Å.

Fig. 6. Group 1 Loop Modeling Benchmark set.

Fig. 6

(a) The RMSD values of the LRM and T1 are shown out of 1000 models generated for Group 1 ECL2 targets using NGK, KICF, CCD, and MOE loop modeling algorithms. In total, the LRM had sub-atomic accuracy in two and three cases when using the NGK and KICF algorithms, respectively. Additionally, the LRM had near-atomic accuracy in seven cases when using the KICF algorithm and six cases when using the NGK or CCD algorithms. (b) The RMSD values of the T1 is compared to the T10 and T25 using NGK, KICF, CCD, and MOE.

Fig. 7. ECL2 models of ACM4 superposed with reference structure.

Fig. 7

(a) The LRM (orange) and T1 (magenta) out of 1000 total models generated using NGK loop modeling had RMSD values of 0.34 Å and 0.40 Å to the reference structure (green). (b) The LRM and T1 out of 1000 total models generated using KICF loop modeling had RMSD values of 0.35 Å and 0.62 Å to the reference structure.

Fig. 8. ECL2 models of CXCR4 superposed with reference structure.

Fig. 8

(a) The LRM (orange) and T1 (magenta) out of 1000 total models generated using NGK loop modeling had RMSD values of 3.08 Å and 7.02 Å to the reference structure (green). (b) The LRM and T1 out of 1000 total models generated using KICF loop modeling had RMSD values of 0.50 Å and 4.19 Å to the reference structure.

While generating ECL2 models with sub-angstrom or near-atomic accuracy overall is desirable, sampling loop conformations is only one aspect of structure prediction when the target structure is unknown. Loop models with low RMSD values to the target must also be scored or ranked favorably so they can be distinguished from the rest of the generated models. To evaluate the scoring component of the loop modeling protocols, the lowest RMSD within the top 1, 10, and 25 scored models (T1, T10, T25) compared to the reference ECL2 structure is illustrated for Group 1 in Fig. 6b. Overall, the MOE loop modeling algorithm had a substantially higher average RMSD value for the T1 (7.96 Å) compared to the algorithms used within Rosetta. The NGK, KICF, and CCD algorithms had average T1 values of 3.61, 3.40, and 4.56 Å, respectively. The T1 produced by CCD or MOE loop modeling algorithms had RMSD values outside of the near-atomic accuracy threshold for all seven Group 1 ECL2 targets. The NGK and KICF algorithms were both able to produce T1 with RMSD values below the near-atomic accuracy threshold for two of the targets. For a majority of the Group 1 targets, selecting higher-quality loop models than the T1 was necessary to reliably study receptor-ligand interactions through docking experiments. Therefore, we sought to establish selection guidelines for the number of final loop models to retain from the top scored models. Specifically, we wanted to assess the accuracy of final models within subsets of the top 10 or 25 scored models produced. For every ECL2 target in Group 1, the RMSD values for T10 and T25 are lower than the RMSD value for the T1, regardless of sampling methodology. However, expanding the number of retained ECL2 models from the top 10 to the top 25 scored did not consistently result in substantially lower RMSD models. In Group 1, there were only three of twenty-eight total cases in which the T10 had an RMSD value above the 2.5 Å threshold, but the corresponding T25 had an RMSD below that threshold. Models with near-atomic accuracy were generated and scored within the top 25 for the targets CRFR1 (using KICF), ACM4 (using CCD), and US28 (using CCD) that were not scored within the top 10. However, for most of the sampling algorithm and target combinations, there was not a significant advantage in retaining the top 25 scored models rather than the top 10 out of 1000 models total.

Group 2 Loop Modeling Results

All Group 2 ECL2 targets from Table 3 were modeled using the three loop modeling algorithms in Rosetta. Loop modeling results for Group 2 targets (Fig. 9) show that the NGK and KICF methods were able to sample loop conformations of the cannabinoid receptor type 1 (CB1) ECL2 with sub-angstrom accuracy overall (NGK/KICF LRM = 0.85/0.82 Å, Fig. 10). Loop modeling with the KICF method also achieved sub-angstrom accuracy with the longest loop in Group 2, CCR5 (LRM = 0.75 Å). In terms of models with near-atomic accuracy (including models with sub-angstrom accuracy), there were three and four cases where the NGK and KICF algorithms sampled loop conformations with RMSD values ≤ 2.5 Å, respectively. There were three cases where the CCD algorithm sampled loop conformations with near-atomic accuracy, but no models with sub-angstrom accuracy were generated.

Fig. 9. Group 2 Loop Modeling Benchmark Set.

Fig. 9

(a) The RMSD values of the LRM and T1 are shown out of 1000 models generated for Group 2 ECL2 targets using NGK, KICF, and CCD algorithms. In total, the LRM had sub-atomic accuracy in one and two cases when using the NGK and KICF algorithms, respectively. Additionally, the LRM had near-atomic accuracy in four cases when using the KICF algorithm and in three cases when using the NGK or CCD algorithms. (b) The RMSD values of the T1 is compared to the T10 and T25 using NGK, KICF, and CCD. The upper and lower dotted lines represent the near-atomic and sub-angstrom accuracy thresholds, respectively.

Fig. 10. ECL2 models of CB1 superposed with reference structure.

Fig. 10

(a) The LRM (orange) and T1 (magenta) out of 1000 models total using NGK loop modeling had RMSD values of 0.85 Å and 4.32 Å to the reference structure (green). (b) The LRM and T1 when KICF loop modeling was used had RMSD values of 0.82 Å and 1.93 Å to the reference structure.

The T1 for a majority of Group 2 targets had a substantially larger RMSD value than the LRM which is consistent with the results from Group 1. However, for the CB1 ECL2 target the T1 using KICF displayed near-atomic accuracy to the reference structure with an RMSD of 1.93 Å and the LRM was scored within the top 10 models (Figures 7 and 8b). On the other hand, the T1 found using NGK had an RMSD of 4.32 Å to the reference structure and the LRM was not scored within the top 10 or 25 scored models. The lowest RMSD model found in the top 10 scored models using the NGK method displayed near-atomic accuracy to the reference structure with an RMSD of 1.39 Å. For the CCR5 ECL2 target, both methods that use fragment assembly (KICF and CCD) outperformed the NGK method in all 4 metrics. Notably, the LRM and T10 values from the KICF method with CCR5 were 0.75 and 0.81 Å.

In Group 2, the smoothened receptor (SMO) ECL2 was the most troublesome target for all three Rosetta loop modeling algorithms. The KICF algorithm yielded the most accurate LRM with an RMSD of 1.58 Å to the reference structure. The LRM found using the NGK and CCD algorithms had RMSD values of 6.22 and 5.84, respectively. However, the top scored model from this method had an RMSD of ~20 Å. While the ECL2 is situated just above the center of the TM bundle in the reference structure, SMO (Class F GPCR) differs from the other benchmark structures in many ways. Particularly, SMO has a longer ECL1 than other known GPCR structures (mostly Class A) and an extracellular domain (ECD) linker region that essentially form a lid over ECL2 and the TM bundle center (Fig. 11). The long ECL1 and ECD linker regions might sterically hinder a search for ECL2 loop conformations that are close to the reference structure. In other words, loop models that position ECL2 away from the TM bundle center, ECL1, and the ECD linker regions may be scored better. Since the SMO reference structure contained a co-crystallized ligand with contacts to the ECLs (ECL2 contacts shown in Table 1), it is also plausible that the conformation of the reference ECL2 is not as energetically favorable when the ligand is absent. A second set of ECL2 models (n = 1000) was generated for SMO using the NGK algorithm, but the ECD linker domain on the N-terminus was deleted prior to loop modeling. Out of the second set, the T1 and LRM were the same and had an RMSD value of 3.30 Å to the reference ECL2. This is a significant improvement over the T1 and LRM RMSD values from the initial set of models produced by NGK which were 17.0 Å and 6.22 Å, respectively. This demonstrates that the steric hindrance provided by the ECD linker domain is one impediment to sampling loop conformations similar to the reference SMO structure.

Fig. 11. ECL2 models of SMO superposed with reference structure.

Fig. 11

Fig. 11

(a) Side view of SMO crystal structure (PDB:4JKV) highlighting the native ECL2 (green) buried underneath ECL1 (gray) and the ECD linker (salmon). (b) Top view of the extracellular side of the SMO crystal structure (c) The LRM (orange), T1 (magenta), and T10 (cyan) using the NGK algorithm had RMSD values of 6.22 Å, 17.0 Å, and 12.7 Å to the reference structure, respectively. (d) The LRM, T1, and T10 using the KICF algorithm had RMSD values of 1.58 Å, 19.9 Å, and 6.89 Å to the reference structure, respectively. The ECD linker and ECL1 regions were hidden in panels C and D to visualize ECL2 and models clearly.

Group 3 Loop Modeling Results

Group 3 targets from Table 3 were modeled using the Rosetta loop modeling algorithms. The results from modeling the ECL2 of Group 3 targets (Fig. 12) show that KICF was the only algorithm capable of sampling ECL2 models with sub-angstrom accuracy relative to the reference structures (2 of 7 targets). The LRM generated by KICF for the P2Y1R (Fig. 13b) and AT2R ECL2 targets had sub-angstrom accuracy with RMSD values of 0.63 and 0.54 Å, respectively. The LRM of P2Y1R had near-atomic accuracy (RMSD ≤ 2.5 Å) to the reference structure for all three loop modeling algorithms, suggesting it is a relatively ‘easy’ case.

Fig. 12. Group 3 Loop Modeling Benchmark Set.

Fig. 12

(a) The RMSD values of the LRM and T1 are shown out of 1000 models generated for Group 3 ECL2 targets using the NGK, KICF, and CCD algorithms. In total, the LRM had sub-atomic accuracy in two cases when using the KICF algorithm. Additionally, the LRM had near-atomic accuracy in three cases when using the KICF algorithm and in single cases when using the NGK or CCD algorithms. (b) The RMSD values of the T1 is compared to the T10 and T25 using NGK, KICF, and CCD. The upper and lower dotted lines represent the near-atomic and sub-angstrom accuracy thresholds, respectively.

Fig. 13. ECL2 models of P2YR1 superposed with reference structure.

Fig. 13

(a) The LRM (orange), T1 (magenta), and T10 (cyan) out of 1000 models generated using NGK loop modeling had RMSD values of 2.44 Å, 6.53 Å, and 2.55 Å to the reference structure (green), respectively. (b) In this case, the LRM was also the T1 (magenta) when KICF loop modeling was used (LRM = T1 = 0.63 Å RMSD to reference structure).

In the case of the P2Y1R ECL2 target, the LRM produced by the KICF algorithm was also evaluated as the T1. This was not the case for the AT2R ECL2 target, but the top scored model produced by the KICF algorithm still had sub-angstrom accuracy (RMSD = 0.81 Å) when compared to the reference structure. On the other hand, the KICF and NGK algorithms produced top scored models with high RMSD values (~18 Å for both methods) compared to the ECL2 reference structure of β2AR (Fig. 14). For the top 25 scored β2AR ECL2 models, none of the methods were able to generate loop models with RMSD values below 4 Å. Since this target had one of the longer loops in the benchmark, it is possible that increased sampling was necessary to produce models closer to the reference structure. To determine if increased sampling would substantially improve models, a second set of 4000 ECL2 models was generated for β2AR using the NGK loop modeling algorithm. From the larger set of models, the LRM had an RMSD of 2.93 Å to the reference target. While this is only a slight improvement from the LRM from the initial set of 1000 models, the T1 from the set of 4000 models had a RMSD value of 5.46 Å which is substantially lower than the T1 from the initial set (18.1 Å). Out of the 4000-model set, the T10 and T25 both had an RMSD of 3.67 Å to the reference structure which was also lower relative to the T10 and T25 from the initial set (6.81 and 5.83 Å).

Fig. 14. ECL2 models of B2AR superposed with reference structure.

Fig. 14

(a) The LRM (orange), T1 (magenta), and T10 (cyan) out of 1000 models generated using NGK loop modeling had RMSD values of 3.84 Å, 18.1 Å, and 6.81 Å to the reference structure (green), respectively. (b) The LRM, T1, and T10 generated using KICF loop modeling had RMSD values of 2.60 Å, 18.6 Å, and 5.04 Å to the reference structure, respectively.

Group 4 Loop Modeling Results

Group 4 targets from Table 3 were also modeled using the Rosetta loop modeling algorithms (Fig. 15). The average LRM values resulting from modeling with the NGK, KICF, and CCD algorithms were 4.55, 4.98, and 5.38 Å, respectively. None of the loop modeling methods used was able to generate models with sub-angstrom or near-atomic accuracy to the reference structures. Based on these results, an increase in conformational sampling is likely necessary. Potentially due to inadequate sampling of conformational space, deficiencies in the scoring function to distinguish accurate models were evident for the longer loops in this benchmark set. The average ECL2 RMSD values for the T1 using the NGK, KICF and CCD algorithms were 13.18, 11.01, and 11.70 Å, respectively. A decrease in the average ECL2 RMSD values was observed for the T10 using the same three algorithms (7.65, 7.37, and 7.72 Å), but were still substantially higher than the 2.5 Å threshold for useful models.

Fig. 15. Group 4 Loop Modeling Benchmark Set.

Fig. 15

(a) The RMSD values of the LRM and T1 are shown out of 1000 models generated for Group 4 ECL2 targets using the NGK, KICF, and CCD algorithms. The T10 and T25 are shown out of 1000 models generated for Group 4 ECL2 targets using NGK, KICF, and CCD. (b) The RMSD values of the T1 is compared to the T10 and T25 using NGK, KICF, and CCD. The upper and lower dotted lines represent the near-atomic and sub-angstrom accuracy thresholds, respectively.

Filtering and Optimization Results

Many GPCR structures characterized to date share a conserved disulfide bond between Cys3.25 at the top of TM3 and Cys45.50 in ECL2. The models generated using the KICF algorithm for the 25 of 28 receptors studied here were therefore filtered based on the distance between the sulfur atoms in these residues to see if such filtering selected an improved set of 10 models. Additionally, formation of the disulfide bond followed by geometry optimization of ECL2 was examined as a potential loop optimization strategy. Table 4 compares T10 results in the absence of filtering/optimization (T10), after filtering (T10SSdist), and after filtering/disulfide formation/optimization (T10SSbond) across all four groups of loop modeling targets.

Table 4. Cys3.25-Cys45.50 distance and disulfide bond formation as filtering/optimization strategies within the KICF results for all four target groups.

The lowest RMSD within the top 10 scored structures (T10), within the top 10 scored structures exhibiting S-S distances ≤ 5.1 Å without (T10SSdist) or within the top 10 scored structures exhibiting S-S distances ≤ 5.1 Å after disulfide bond formation and loop geometry optimization (T10SSbond) are compared.

Receptor (loop length) T10 (Å) T10SSdist (Å) T10SSbond (Å)
Group 1 Targets Group average=1.90 Group average=1.57 Group average=1.54
 ACM4 (16) 0.64 0.64 0.57
 CRFR1 (15) 2.94 1.84 1.91
 CXCR4 (17) 0.52 0.52 0.40
 DRD3 (14) 1.04 0.86 0.81
 GLP1R (12) 2.78 2.85 2.70
 GLR (12) 1.29 1.24 0.87
 US28 (16) 4.11 3.07 3.50
Group 2 Targets Group average=3.44 Group average=2.46 Group average=2.23
 P2Y12 (18) 3.73 3.97 2.91
 SMO (18) 7.15 1.71 1.57
 OPRD (19) 5.68 4.04 3.99
 CCR5 (19) 1.39 1.39 1.05
 LPAR1 (18) 2.48 2.48* 2.48*
 S1PR1 (18) 2.79 2.79* 2.79*
 CB1 (18) 0.84 0.84* 0.84*
Group 3 Targets Group average=4.10 Group average=3.55 Group average=3.25
 AT2R (21) 0.68 0.68 0.79
 GRM1 (22) 5.02 5.06 4.31
 P2Y1R (20) 0.64 0.64 0.63
 RHO (22) 4.00 4.05 4.25
 B2AR (24) 5.15 4.67 3.86
 ETBR (24) 5.21 3.94 3.35
 NTR1 (23) 8.01 5.81 5.53
Group 4 Targets Group average=7.50 Group average=6.23 Group average=6.09
 AA1R (32) 8.29 6.72 6.98
 AA2AR (32) 7.05 9.88 9.73
 APJR (26) 3.52 3.52 3.61
 FFAR1 (30) 6.99 8.39 7.28
 OX1R (27) 6.99 6.23 6.49
 PAR1 (26) 4.19 4.19 3.48
 PAR2 (25) 15.49 4.68 5.06
*

Receptor lacks conserved Cys3.25 and/or Cys45.50, no filtering or disulfide bond formation performed.

The results in Table 4 demonstrate that the use of the Cys3.25-Cys45.50 S-S distance as filter produced modest improvements for most receptors in the lowest RMSD found within the top ten scoring models. The average improvement in the group 1, 2, 3, and 4 targets was 0.33, 1.71, 0.55 and 1.27 Å, respectively. Formation of the disulfide bond followed by ECL2 geometry optimization provided slightly larger improvements of 0.37, 2.11, 0.86, and 1.41 Å across the same groups. A few targets showed more substantial changes, including PAR2 for which T10 dropped from 15.49 Å to 4.68 Å upon filtering and AA2AR for which T10 changed in the opposite direction from 7.05 Å to 9.88 Å upon filtering. Figure 16 compares the PAR2 structures selected solely based on score (panel a) and those selected based on score after filtering based on S-S distance (panel c) with the reference crystal structure (panel b). This figure illustrates that the ten top scored loop models consistently show interactions between ECL2 and the transmembrane domain, with ECL2 intruding into the lipid bilayer, which was absent during loop modeling. Use of the S-S distance filter eliminated many of the models in which ECL2 occupies space that should be reserved for the lipid bilayer, producing the > 10 Å improvement in the lowest RMSD found within the top ten scored structures (Table 4). It is likely that the models in Figure 16a scored well due to burial of hydrophobic sidechains, an important driver for folding of soluble proteins that have been used in parameterizing typical energy functions. Alternative filters (such as setting a minimum distance between atoms in ECL2 and those in the membrane-embedded sidechains of the TM segments) or hybrid scoring functions that appropriately treat the membrane-embedded region of transmembrane proteins could also effectively filter out unwanted structures like those in Figure 16a. In contrast to the substantial benefit filtering provided in selecting better models of PAR2 relative to scores alone, filtering lowered the quality of the best model in the top ten scored structures for AA2AR. As shown in Figure 17, the majority of the top ten AA2AR ECL2 models did not exhibit an ECL2 position overlapping with the position of the lipid bilayer. In this case and for others in the Group 3 target set, ECL2 models with the desired accuracy (< 2.5 Å RMSD were not present in the 1000 models initially sampled, and more sampling would be required in order to obtain improved models.

Figure 16. PAR2 ECL2 Models (KICF) and Reference Structure.

Figure 16.

(a) Top ten scored PAR2 ECL2 models produced within 1000 structures generated using KICF in Rosetta. (b) PAR2 crystallographic reference structure. (c) Top ten scored PAR2 ECL2 models after filtering 1000 structures generated using KICF in Rosetta for Cys3.25-Cys45.50 S-S distances 5.1 ≤ Å.

Figure 17. AA2AR ECL2 Models (KICF) and Reference Structure.

Figure 17.

(a) Top ten scored AA2AR ECL2 models produced within 1000 structures generated using KICF in Rosetta. (b) AA2AR crystallographic reference structure. (c) Top ten scored AA2AR ECL2 models after filtering 1000 structures generated using KICF in Rosetta for Cys3.25-Cys45.50 S-S distances 5.1 ≤ Å.

Comparison to Other GPCR Structure Prediction Benchmarks

Advances in GPCR structure prediction have been assessed in the community-wide GPCR DOCK experiments in 2008, 2010, 2013.30,31,141 In these experiments, researchers were tasked with modeling a target GPCR with a bound ligand prior to the publication of the target crystal structures. In GPCR DOCK 2010, two of the targets utilized were the crystal structures of the DRD3/eticlopride (PDB: 3PBL) and CXCR4/IT1t (PDB: 3ODU) receptor-ligand complexes.31 Overall, there were no models submitted for either target where the ECL2 had a backbone RMSD within 2.5 Å in comparison to the crystal structure. While the best DRD3 models had ECL2 RMSD values of 2.69 Å, CXCR4 was a more difficult modelling target where the two best models had ECL2 RMSD values of 4.32 and 6.61 Å. Based on all submitted models of DRD3 and CXCR4, the median RMSD values for ECL2 were 4.11 and 9.19 Å, respectively. While the RMSD values presented in this benchmark study show significant improvement in modeling the ECL2 of DRD3 and CXCR4 in terms of sampling, it is remarkable that the LRM obtained from KICF loop modeling with CXCR4 and DRD3 were ranked within the top 10 scored models after Cys3.25-Cys45.50 S-S distance filtering and had RMSD values of 0.52 and 0.86 Å, respectively. However, it should be noted that there is a significant advantage in modeling loops starting with a GPCR crystal structure versus a homology model. In general, a template-based GPCR model (without additional refinement) has an equivalent backbone structure to the aligned segments of the template. However, GPCR structures tend to diverge at the extracellular ends, and thus homology models matched to a template structure may not reflect real structural differences between the target and template structure. Arora et al. showed that variations in loop anchor positions can have significant influence on modeling accuracy for GPCR loops.142 Therefore, the results obtained from this benchmark study represent a best-case scenario for modeling GPCR loops where the starting structure contained no errors in the anchor positions or in the structures of the geometrically proximal ECL1 or ECL3.

Conclusion

Overall, the results from modeling Group 1 and 2 ECL2 targets showed that KICF sampled the most loop conformations with sub-angstrom and near-atomic accuracy to the reference structure. The NGK algorithm followed just behind KICF in terms of building loop models with sub-angstrom accuracy, but both algorithms had the same number of cases of modeling loops with near-atomic accuracy. Although the CCD algorithm was not able to build any loop models with sub-angstrom accuracy for the targets in Group 1 and 2, CCD produced models with near-atomic accuracy for nine of the fourteen targets. The results from modeling Group 3 ECL2 targets showed that only the KICF algorithm was able to sample loop models with sub-angstrom accuracy. Overall, these data suggest that the KIC with Fragments and Next Generation KIC methods within Rosetta perform better than the Cyclic Coordinate Descent method or the de novo search method within MOE Loop Modeler for loops with up to 21 residues (Groups 1 & 2). For the longer loops in Group 3 (20–24 residues), KIC with Fragments outperforms all the other methods. Out of all 28 GPCR loops modeled, KICF generated the most models under 2.5 Å out of 1000 produced total. For loop targets analogous to those in Group 4 (25–32 residues) where no models were within 2.5 Å RMSD regardless of loop modeling method, it is recommended that a greater number of models be produced (i.e. > 4000) to increase sampling of ECL2 conformational space. Application of loop modeling to generate unknown structures requires that models from the sampled set be selected. Regardless of loop length, the RMSD of the T1 was much higher than the LRM in most cases. This was also observed in another benchmark study targeting 13 GPCR ECL2 using the CABS modeling software.143 Scores alone were less effective than the combination of scores and use of the conserved disulfide bond between Cys3.25 in TM3 and Cys45.50 in ECL2 as a filter. Notably, selection of ten structures provided substantial improvement in model quality within the set over selection of only one structure, but inclusion of additional structures up to twenty-five did not provide significant additional gains. Overall, eleven of the twenty-one targets in Groups 1–3 included at least one model with an RMSD ≤ 2.5 Å within the top ten scored models meeting the S-S distance filter. Four targets in Groups 1–3 sampled a model with RMSD was ≤ 2.5 Å that did not score in the top 10 models after S-S distance filtering. In these cases, adjustments to the scoring function, refinement methods, or loop structure environment would be needed improve loop structure prediction.

Fig. 3. Overview schematic of Rosetta loop modeling process.

Fig. 3

References

  • 1.Ribas C; Penela P; Murga C; Salcedo A; Garcia-Hoz C; Jurado-Pueyo M; Aymerich I; Mayor F The G protein-coupled receptor kinase (GRK) interactome: Role of GRKs in GPCR regulation and signaling. Biochimica et Biophysica Acta (BBA) - Biomembranes 2007, 1768, 913–922. [DOI] [PubMed] [Google Scholar]
  • 2.Lagerstrӧm MC; Schiӧth HB Structural diversity of G protein-coupled receptors and significance for drug discovery. Nat Rev Drug Discov 2008, 7, 339–357. [DOI] [PubMed] [Google Scholar]
  • 3.Rosenbaum DM; Rasmussen SGF; Kobilka BK The structure and function of G-protein-coupled receptors. Nature 2009, 459, 356–363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Gudermann T; Nurnberg B; Schultz G Receptors and G proteins as primary components of transmembrane signal transduction. Part 1. G-protein-coupled receptors: structure and function. J. Mol. Med 1995, 73, 51–63. [DOI] [PubMed] [Google Scholar]
  • 5.Overington JP; Al-Lazikani B; Hopkins AL How many drug targets are there? Nat Rev DrugDiscov 2006, 5, 993–996. [DOI] [PubMed] [Google Scholar]
  • 6.van der Kant R; Vriend G Alpha-Bulges in G Protein-Coupled Receptors. Int J M ol Sci. 2014, 15, 7841–7864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Isberg V; de Graaf C; Bortolato A; Cherezov V; Katritch V; Marshall FH; Mordalski S; Pin J; Stevens RC; Vriend G; Gloriam DE Generic GPCR Residue Numbers - Aligning Topology Maps While Minding The Gaps. Trends Pharmacol Sci 2015, 36(1), 22–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Avlani VA; Gregory KJ; Morton CJ; Parker MW; Sexton PM; Christopoulos A Critical Role for the Second Extracellular Loop in the Binding of Both Orthosteric and Allosteric G Protein-coupled Receptor Ligands. J. Biol. Chem 2007, 282, 25677–25686. [DOI] [PubMed] [Google Scholar]
  • 9.Peeters MC; van Westen GJP; Li Q; IJzerman AP Importance of the extracellular loops in G protein-coupled receptors for ligand recognition and receptor activation. Trends in Pharmacological Sciences 2011, 32, 35–42. [DOI] [PubMed] [Google Scholar]
  • 10.Klco JM; Wiegand CB; Narzinski K; Baranski TJ Essential role for the second extracellular loop in C5a receptor activation. Nature Structural & Molecular Biology 2005, 12, 320. [DOI] [PubMed] [Google Scholar]
  • 11.Nanevicz T; Wang L; Chen M; Ishii M; Coughlin SR Thrombin Receptor Activating Mutations: Alteration of an Extracellular Agonist Recognition Domain Causes Constitutive Signaling. J. Biol. Chem 1996, 271, 702–706. [DOI] [PubMed] [Google Scholar]
  • 12.Pantel J; Legendre M; Cabrol S; Hilal L; Hajaji Y; Morisset S; Nivot S; Vie-Luton M; Grouselle D; Kerdanet M. d.; Kadiri A; Epelbaum J; Bouc YL; Amselem S Loss of constitutive activity of the growth hormone secretagogue receptor in familial short stature. J Clin Invest 2006, 116, 760–768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wheatley M; Wootten D; Conner M; Simms J; Kendrick R; Logan R; Poyner D; Barwell J Lifting the lid on GPCRs: the role of extracellular loops. British Journal o f Pharmacology 2012, 165, 1688–1703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Pándy-Szekeres G; Munk C; Tsonkov TM; Mordalski S; Harpsøe K; Hauser AS; Bojarski AJ; Gloriam DE GPCRdb in 2018: adding GPCR structure models and ligands. Nucleic Acids Res 2018, 46, D446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Chien EYT; Liu W; Zhao Q; Katritch V; Won Han G; Hanson MA; Shi L; Newman AH; Javitch JA; Cherezov V; Stevens RC Structure of the Human Dopamine D3 Receptor in Complex with a D2/D3 Selective Antagonist. Science 2010, 330, 1091–1095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wu B; Chien EYT; Mol CD; Fenalti G; Liu W; Katritch V; Abagyan R; Brooun A; Wells P; Bi FC; Hamel DJ; Kuhn P; Handel TM; Cherezov V; Stevens RC Structures of the CXCR4 Chemokine GPCR with Small-Molecule and Cyclic Peptide Antagonists. Science 2010, 330, 1066–1071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chrencik JE; Roth CB; Terakado M; Kurata H; Omi R; Kihara Y; Warshaviak D; Nakade S; Asmar-Rovira G; Mileni M; Mizuno H; Griffith MT; Rodgers C; Han GW; Velasquez J; Chun J; Stevens RC; Hanson MA Crystal Structure of Antagonist Bound Human Lysophosphatidic Acid Receptor 1. Cell 2015, 161, 1633–1643. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Hanson MA; Roth CB; Jo E; Griffith MT; Scott FL; Reinhart G; Desale H; Clemons B; Cahalan SM; Schuerer SC; Sanna MG; Han GW; Kuhn P; Rosen H; Stevens RC Crystal Structure of a Lipid G Protein-Coupled Receptor. Science 2012, 335, 851–855. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Shao Z; Yin J; Chapman K; Grzemska M; Clark L; Wang J; Rosenbaum DM High-resolution crystal structure of the human CB1 cannabinoid receptor. Nature 2016, 540, 602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhang J; Zhang K; Gao Z; Paoletta S; Zhang D; Han GW; Li T; Ma L; Zhang W; Müller CE; Yang H; Jiang H; Cherezov V; Katritch V; Jacobson KA; Stevens RC; Wu B; Zhao Q Agonist-bound structure of the human P2Y12 receptor. Nature 2014, 509, 119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wang C; Wu H; Katritch V; Han GW; Huang X; Liu W; Siu FY; Roth BL; Cherezov V; Stevens RC Structure of the human smoothened receptor bound to an antitumour agent. Nature 2013, 497, 338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Murakami M; Kouyama T Crystal structure of squid rhodopsin. Nature 2008, 453, 363. [DOI] [PubMed] [Google Scholar]
  • 23.Cherezov V; Rosenbaum DM; Hanson MA; Rasmussen SGF; Thian FS; Kobilka TS; Choi H; Kuhn P; Weis WI; Kobilka BK; Stevens RC High-Resolution Crystal Structure of an Engineered Human Beta2- Adrenergic G Protein-Coupled Receptor. Science 2007, 318, 1258–1265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Cheng RKY; Fiez-Vandal C; Schlenker O; Edman K; Aggeler B; Brown DG; Brown GA; Cooke RM; Dumelin CE; Doré AS; Geschwindner S; Grebner C; Hermansson N; Jazayeri A; Johansson P; Leong L; Prihandoko R; Rappas M; Soutter H; Snijder A; Sundstrӧm L; Tehan B; Thornton P; Troast D; Wiggin G; Zhukov A; Marshall FH; Dekker N Structural insight into allosteric modulation of protease-activated receptor 2. Nature 2017, 545, 112. [DOI] [PubMed] [Google Scholar]
  • 25.Zhang C; Srinivasan Y; Arlow DH; Fung JJ; Palmer D; Zheng Y; Green HF; Pandey A; Dror RO; Shaw DE; Weis WI; Coughlin SR; Kobilka BK High-resolution crystal structure of human protease-activated receptor 1. Nature 2012, 492, 387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Ma Y; Yue Y; Ma Y; Zhang Q; Zhou Q; Song Y; Shen Y; Li X; Ma X; Li C; Hanson MA; Han GW; Sickmier EA; Swaminath G; Zhao S; Stevens RC; Hu LA; Zhong W; Zhang M; Xu F Structural Basis for Apelin Control of the Human Apelin Receptor. Structure 2017, 25, 866.e4. [DOI] [PubMed] [Google Scholar]
  • 27.Lu J; Byrne N; Wang J; Bricogne G; Brown FK; Chobanian HR; Colletti SL; Salvo JD; Thomas-Fowlkes B; Guo Y; Hall DL; Hadix J; Hastings NB; Hermes JD; Ho T; Howard AD; Josien H; Kornienko M; Lumb KJ; Miller MW; Patel SB; Pio B; Plummer CW; Sherborne BS; Sheth P; Souza S; Tummala S; Vonrhein C; Webb M; Allen SJ; Johnston JM; Weinglass AB; Sharma S; Soisson SM Structural basis for the cooperative allosteric activation of the free fatty acid receptor GPR40. Nature Structural & Molecular Biology 2017, 24, 570. [DOI] [PubMed] [Google Scholar]
  • 28.Glukhova A; Thal DM; Nguyen AT; Vecchio EA; Jorg M; Scammells PJ; May LT; Sexton PM; Christopoulos A Structure of the Adenosine A1 Receptor Reveals the Basis for Subtype Selectivity. Cell 2017, 168, 877.e13. [DOI] [PubMed] [Google Scholar]
  • 29.Liu W; Chun E; Thompson AA; Chubukov P; Xu F; Katritch V; Han GW; Roth CB; Heitman LH; IJzerman AP; Cherezov V; Stevens RC Structural Basis for Allosteric Regulation of GPCRs by Sodium Ions. Science 2012, 337, 232–236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Michino M; Abola E; Brooks CL; Dixon JS; Moult J; Stevens RC Community-wide assessment of GPCR structure modeling and docking understanding. Nat Rev Drug Discov 2009, 8, 455–463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Kufareva I; Rueda M; Katritch V; Stevens RC; Abagyan R Status of GPCR modeling and docking as reflected by community-wide GPCR Dock 2010 assessment. Structure 2011, 19, 1108–1126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Siu FY; He M; Graaf C. d.; Han GW; Yang D; Zhang Z; Zhou C; Xu Q; Wacker D; Joseph JS; Liu W; Lau J; Cherezov V; Katritch V; Wang M; Stevens RC Structure of the human glucagon class B G-protein-coupled receptor. Nature 2013, 499, 444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Jazayeri A; Doré AS; Lamb D; Krishnamurthy H; Southall SM; Baig AH; Bortolato A; Koglin M; Robertson NJ; Errey JC; Andrews SP; Teobald I; Brown AJH; Cooke RM; Weir M; Marshall FH Extra-helical binding site of a glucagon receptor antagonist. Nature 2016, 533, 274–277. [DOI] [PubMed] [Google Scholar]
  • 34.Zhang H; Qiao A; Yang D; Yang L; Dai A; Graaf C. d.; Reedtz-Runge S; Dharmarajan V; Zhang H; Han GW; Grant TD; Sierra RG; Weierstall U; Nelson G; Liu W; Wu Y; Ma L; Cai X; Lin G; Wu X; Geng Z; Dong Y; Song G; Griffin PR; Lau J; Cherezov V; Yang H; Hanson MA; Stevens RC; Zhao Q; Jiang H; Wang M; Wu B Structure of the full-length glucagon class B G-protein-coupled receptor. Nature 2017, 546, 259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Jazayeri A; Rappas M; Brown AJH; Kean J; Errey JC; Robertson NJ; Fiez-Vandal C; Andrews SP; Congreve M; Bortolato A; Mason JS; Baig AH; Teobald I; Dore AS; Weir M; Cooke RM; Marshall FH Crystal structure of the GLP-1 receptor bound to a peptide agonist. Nature 2017, 546, 254. [DOI] [PubMed] [Google Scholar]
  • 36.Song G; Yang D; Wang Y; de Graaf C; Zhou Q; Jiang S; Liu K; Cai X; Dai A; Lin G; Liu D; Wu F; Wu Y; Zhao S; Ye L; Han GW; Lau J; Wu B; Hanson MA; Liu Z; Wang M; Stevens RC Human GLP-1 receptor transmembrane domain structure in complex with allosteric modulators. Nature 2017, 546, 312–315. [DOI] [PubMed] [Google Scholar]
  • 37.Liang Y; Khoshouei M; Glukhova A; Furness SGB; Zhao P; Clydesdale L; Koole C; Truong TT; Thal DM; Lei S; Radjainia M; Danev R; Baumeister W; Wang M; Miller LJ; Christopoulos A; Sexton PM; Wootten D Phase-plate cryo-EM structure of a biased agonist-bound human GLP-1 receptor-Gs complex. Nature 2018, 555, 121–125. [DOI] [PubMed] [Google Scholar]
  • 38.Haga K; Kruse AC; Asada H; Yurugi-Kobayashi T; Shiroishi M; Zhang C; Weis WI; Okada T; Kobilka BK; Haga T; Kobayashi T Structure of the human M2 muscarinic acetylcholine receptor bound to an antagonist. Nature 2012, 482, 547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kruse AC; Ring AM; Manglik A; Hu J; Hu K; Eitel K; Hubner H; Pardon E; Valant C; Sexton PM; Christopoulos A; Felder CC; Gmeiner P; Steyaert J; Weis WI; Garcia KC; Wess J; Kobilka BK Activation and allosteric modulation of a muscarinic acetylcholine receptor. Nature 2013, 504, 101–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Thorsen TS; Matt R; Weis WI; Kobilka BK Modified T4 Lysozyme Fusion Proteins Facilitate G Protein-Coupled Receptor Crystallogenesis. Structure 2014, 22, 1657–1664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Kruse AC; Hu J; Pan AC; Arlow DH; Rosenbaum DM; Rosemond E; Green HF; Liu T; Chae PS; Dror RO; Shaw DE; Weis WI; Wess J; Kobilka BK Structure and dynamics of the M3 muscarinic acetylcholine receptor. Nature 2012, 482, 552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Qin L; Kufareva I; Holden LG; Wang C; Zheng Y; Zhao C; Fenalti G; Wu H; Han GW; Cherezov V; Abagyan R; Stevens RC; Handel TM Structural biology. Crystal structure of the chemokine receptor CXCR4 in complex with a viral chemokine. Science 2015, 347, 1117–1122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Hua T; Vemuri K; Nikas SP; Laprairie RB; Wu Y; Qu L; Pu M; Korde A; Jiang S; Ho J; Han GW; Ding K; Li X; Liu H; Hanson MA; Zhao S; Bohn LM; Makriyannis A; Stevens RC; Liu Z Crystal structures of agonist-bound human cannabinoid receptor CB1. Nature 2017, 547, 468–471. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 44.Hua T; Vemuri K; Pu M; Qu L; Han GW; Wu Y; Zhao S; Shui W; Li S; Korde A; Laprairie RB; Stahl EL; Ho J; Zvonok N; Zhou H; Kufareva I; Wu B; Zhao Q; Hanson MA; Bohn LM; Makriyannis A; Stevens RC; Liu Z Crystal Structure of the Human Cannabinoid Receptor CB1. Cell 2016, 167, 762.e14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Zhang K; Zhang J; Gao Z; Zhang D; Zhu L; Han GW; Moss SM; Paoletta S; Kiselev E; Lu W; Fenalti G; Zhang W; Müller CE; Yang H; Jiang H; Cherezov V; Katritch V; Jacobson KA; Stevens RC; Wu B; Zhao Q Structure of the human P2Y12 receptor in complex with an antithrombotic drug. Nature 2014, 509, 115–118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Byrne EFX; Sircar R; Miller PS; Hedger G; Luchetti G; Nachtergaele S; Tully MD; Mydock-McGrane L; Covey DF; Rambo RP; Sansom MSP; Newstead S; Rohatgi R; Siebold C Structural basis of Smoothened regulation by its extracellular domains. Nature 2016, 535, 517–522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Weierstall U; James D; Wang C; White TA; Wang D; Liu W; Spence JCH; Bruce Doak R; Nelson G; Fromme P; Fromme R; Grotjohann I; Kupitz C; Zatsepin NA; Liu H; Basu S; Wacker D; Han GW; Katritch V; Boutet S; Messerschmidt M; Williams GJ; Koglin JE; Marvin Seibert M; Klinker M; Gati C; Shoeman RL; Barty A; Chapman HN; Kirian RA; Beyerlein KR; Stevens RC; Li D; Shah STA; Howe N; Caffrey M; Cherezov V Lipidic cubic phase injector facilitates membrane protein serial femtosecond crystallography. Nat Commun 2014, 5, 3309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Wang C; Wu H; Evron T; Vardy E; Han GW; Huang X; Hufeisen SJ; Mangano TJ; Urban DJ; Katritch V; Cherezov V; Caron MG; Roth BL; Stevens RC Structural basis for Smoothened receptor modulation and chemoresistance to anticancer drugs. Nat Commun 2014, 5, 4355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Fenalti G; Giguere PM; Katritch V; Huang X; Thompson AA; Cherezov V; Roth BL; Stevens RC Molecular control of δ-opioid receptor signalling. Nature 2014, 506, 191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Fenalti G; Zatsepin NA; Betti C; Giguere P; Han GW; Ishchenko A; Liu W; Guillemyn K; Zhang H; James D; Wang D; Weierstall U; Spence JCH; Boutet S; Messerschmidt M; Williams GJ; Gati C; Yefanov OM; White TA; Oberthuer D; Metz M; Yoon CH; Barty A; Chapman HN; Basu S; Coe J; Conrad CE; Fromme R; Fromme P; Tourwe D; Schiller PW; Roth BL; Ballet S; Katritch V; Stevens RC; Cherezov V Structural basis for bifunctional peptide recognition at human 5-opioid receptor. Nat. Struct. Mol. Biol 2015, 22, 265–268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Thompson AA; Liu W; Chun E; Katritch V; Wu H; Vardy E; Huang X; Trapella C; Guerrini R; Calo G; Roth BL; Cherezov V; Stevens RC Structure of the nociceptin/orphanin FQ receptor in complex with a peptide mimetic. Nature 2012, 485, 395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Miller RL; Thompson AA; Trapella C; Guerrini R; Malfacini D; Patel N; Han GW; Cherezov V; Caló G; Katritch V; Stevens RC The Importance of Ligand- Receptor Conformational Pairs in Stabilization: Spotlight on the N/OFQ G Protein-Coupled Receptor. Structure 2015, 23, 2291–2299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Zhang H; Han GW; Batyuk A; Ishchenko A; White KL; Patel N; Sadybekov A; Zamlynny B; Rudd MT; Hollenstein K; Tolstikova A; White TA; Hunter MS; Weierstall U; Liu W; Babaoglu K; Moore EL; Katz RD; Shipman JM; Garcia Calvo M; Sharma S; Sheth P; Soisson SM; Stevens RC; Katritch V; Cherezov V Structural basis for selectivity and diversity in angiotensin II receptors. Nature 2017, 544, 327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Egloff P; Hillenbrand M; Klenk C; Batyuk A; Heine P; Balada S; Schlinkmann KM; Scott DJ; Schütz M; Pluckthun A Structure of signaling-competent neurotensin receptor 1 obtained by directed evolution in Escherichia coli. Proc. Natl. Acad. Sci. U. S. A 2014, 111, 655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.White JF; Noinaj N; Shibata Y; Love J; Kloss B; Xu F; Gvozdenovic-Jeremic J; Shah P; Shiloach J; Tate CG; Grisshammer R Structure of the agonist-bound neurotensin receptor. Nature 2012, 490, 508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Krumm BE; White JF; Shah P; Grisshammer R Structural prerequisites for G-protein activation by the neurotensin receptor. Nature Communications 2015, 6, 7895. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Krumm BE; Lee S; Bhattacharya S; Botos I; White CF; Du H; Vaidehi N; Grisshammer R Structure and dynamics of a constitutively active neurotensin receptor. Sci Rep 2016, 6, 38564. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Warne T; Moukhametzianov R; Baker JG; Nehme R; Edwards PC; Leslie AGW; Schertler GFX; Tate CG The structural basis for agonist and partial agonist action on a β(1)-adrenergic receptor. Nature 2011, 469, 241–244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Moukhametzianov R; Warne T; Edwards PC; Serrano-Vega MJ; Leslie AGW; Tate CG; Schertler GFX Two distinct conformations of helix 6 observed in antagonist-bound structures of a beta1-adrenergic receptor. Proc. Natl. Acad. Sci. U. S. A 2011, 108, 8228–8232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Warne T; Serrano-Vega MJ; Baker JG; Moukhametzianov R; Edwards PC; Henderson R; Leslie AGW; Tate CG; Schertler GFX Structure of a β1-adrenergic G-protein-coupled receptor. Nature 2008, 454, 486. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Christopher JA; Brown J; Doré AS; Errey JC; Koglin M; Marshall FH; Myszka DG; Rich RL; Tate CG; Tehan B; Warne T; Congreve M Biophysical fragment screening of the β1-adrenergic receptor: identification of high affinity arylpiperazine leads using structure-based drug design. J. Med. Chem 2013, 56, 3446–3455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Warne T; Edwards PC; Leslie AGW; Tate CG Crystal structures of a stabilized β1 adrenoceptor bound to the biased agonists bucindolol and carvedilol. Structure 2012, 20, 841–849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Miller-Gallacher JL; Nehmé R; Warne T; Edwards PC; Schertler GFX; Leslie AGW; Tate CG The 2.1 Å resolution structure of cyanopindolol-bound β1-adrenoceptor identifies an intramembrane Na+ ion that stabilises the ligand-free receptor. PLoS ONE 2014, 9, e92727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Huang J; Chen S; Zhang JJ; Huang X Crystal structure of oligomeric β1-adrenergic G protein-coupled receptors in ligand-free basal state. Nat. Struct. Mol. Biol 2013, 20, 419–425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Sato T; Baker J; Warne T; Brown GA; Leslie AGW; Congreve M; Tate CG Pharmacological Analysis and Structure Determination of 7-Methylcyanopindolol-Bound β1-Adrenergic Receptor. Mol. Pharmacol 2015, 88, 1024–1034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Rasmussen SGF; Choi H; Rosenbaum DM; Kobilka TS; Thian FS; Edwards PC; Burghammer M; Ratnala VRP; Sanishvili R; Fischetti RF; Schertler GFX; Weis WI; Kobilka BK Crystal structure of the human beta2 adrenergic G-protein-coupled receptor. Nature 2007, 450, 383–387. [DOI] [PubMed] [Google Scholar]
  • 67.Hanson MA; Cherezov V; Griffith MT; Roth CB; Jaakola V; Chien EYT; Velasquez J; Kuhn P; Stevens RC A specific cholesterol binding site is established by the 2.8 A structure of the human beta2-adrenergic receptor. Structure 2008, 16, 897–905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Wacker D; Fenalti G; Brown MA; Katritch V; Abagyan R; Cherezov V; Stevens RC Conserved binding mode of human beta2 adrenergic receptor inverse agonists and antagonist revealed by X-ray crystallography. J. Am. Chem. Soc 2010, 132, 11443–11445. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Bokoch MP; Zou Y; Rasmussen SGF; Liu CW; Nygaard R; Rosenbaum DM; Fung JJ; Choi H; Thian FS; Kobilka TS; Puglisi JD; Weis WI; Pardo L; Prosser RS; Mueller L; Kobilka BK Ligand-specific regulation of the extracellular surface of a G-protein-coupled receptor. Nature 2010, 463, 108–112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Rasmussen SGF; Choi H; Fung JJ; Pardon E; Casarosa P; Chae PS; Devree BT; Rosenbaum DM; Thian FS; Kobilka TS; Schnapp A; Konetzki I; Sunahara RK; Gellman SH; Pautsch A; Steyaert J; Weis WI; Kobilka BK Structure of a nanobody-stabilized active state of the β1(2) adrenoceptor. Nature 2011, 469, 175–180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Rosenbaum DM; Zhang C; Lyons JA; Holl R; Aragao D; Arlow DH; Rasmussen SGF; Choi H; Devree BT; Sunahara RK; Chae PS; Gellman SH; Dror RO; Shaw DE; Weis WI; Caffrey M; Gmeiner P; Kobilka BK Structure and function of an irreversible agonist-β(2) adrenoceptor complex. Nature 2011, 469, 236–240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Rasmussen SGF; DeVree BT; Zou Y; Kruse AC; Chung KY; Kobilka TS; Thian FS; Chae PS; Pardon E; Calinski D; Mathiesen JM; Shah STA; Lyons JA; Caffrey M; Gellman SH; Steyaert J; Skiniotis G; Weis WI; Sunahara RK; Kobilka BK Crystal structure of the β2 adrenergic receptor-Gs protein complex. Nature 2011, 477, 549–555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Ring AM; Manglik A; Kruse AC; Enos MD; Weis WI; Garcia KC; Kobilka BK Adrenaline-activated structure of β2-adrenoceptor stabilized by an engineered nanobody. Nature 2013, 502, 575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Zou Y; Weis WI; Kobilka BK N-terminal T4 lysozyme fusion facilitates crystallization of a G protein coupled receptor. PLoS ONE 2012, 7, e46039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Weichert D; Kruse AC; Manglik A; Hiller C; Zhang C; Hübner H; Kobilka BK; Gmeiner P Covalent agonists for studying G protein-coupled receptor activation. Proc. Natl. Acad. Sci. U. S. A 2014, 111, 10744–10748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Staus DP; Strachan RT; Manglik A; Pani B; Kahsai AW; Kim TH; Wingler LM; Ahn S; Chatterjee A; Masoudi A; Kruse AC; Pardon E; Steyaert J; Weis WI; Prosser RS; Kobilka BK; Costa T; Lefkowitz RJ Allosteric nanobodies reveal the dynamic range and diverse mechanisms of G-protein-coupled receptor activation. Nature 2016, 535, 448–452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Huang CY; Olieric V; Ma P; Howe N; Vogeley L; Liu X; Warshamanage R; Weinert T; Panepucci E; Kobilka B; Diederichs K; Wang M; Caffrey M In meso in situ serial X-ray crystallography of soluble and membrane proteins at cryogenic temperatures. Acta Crystallogr D Struct Biol 2016, 72, 93–112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Ma P; Weichert D; Aleksandrov LA; Jensen TJ; Riordan JR; Liu X; Kobilka BK; Caffrey M The cubicon method for concentrating membrane proteins in the cubic mesophase. Nat Protoc 2017, 12, 1745–1762. [DOI] [PubMed] [Google Scholar]
  • 79.Shihoya W; Nishizawa T; Okuta A; Tani K; Dohmae N; Fujiyoshi Y; Nureki O; Doi T Activation mechanism of endothelin ETB receptor by endothelin-1. Nature 2016, 537, 363. [DOI] [PubMed] [Google Scholar]
  • 80.Shihoya W; Nishizawa T; Yamashita K; Inoue A; Hirata K; Kadji FMN; Okuta A; Tani K; Aoki J; Fujiyoshi Y; Doi T; Nureki O X-ray structures of endothelin ETB receptor bound to clinical antagonist bosentan and its analog. Nat. Struct. Mol. Biol 2017, 24, 758–764. [DOI] [PubMed] [Google Scholar]
  • 81.Okada T; Sugihara M; Bondar A; Elstner M; Entel P; Buss V The Retinal Conformation and its Environment in Rhodopsin in Light of a New 2.2Å Crystal Structure††This paper is dedicated to Dr Yoshimasa Kyogoku. Journal o f Molecular Biology 2004, 342, 571–583. [DOI] [PubMed] [Google Scholar]
  • 82.Li J; Edwards PC; Burghammer M; Villa C; Schertler GFX Structure of bovine rhodopsin in a trigonal crystal form. J. Mol. Biol 2004, 343, 1409–1438. [DOI] [PubMed] [Google Scholar]
  • 83.Okada T; Fujiyoshi Y; Silow M; Navarro J; Landau EM; Shichida Y Functional role of internal water molecules in rhodopsin revealed by X-ray crystallography. Proc. Natl. Acad. Sci. U. S. A 2002, 99, 5982–5987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Palczewski K; Kumasaka T; Hori T; Behnke CA; Motoshima H; Fox BA; Le Trong I; Teller DC; Okada T; Stenkamp RE; Yamamoto M; Miyano M Crystal structure of rhodopsin: A G protein-coupled receptor. Science 2000, 289, 739–745. [DOI] [PubMed] [Google Scholar]
  • 85.Teller DC; Okada T; Behnke CA; Palczewski K; Stenkamp RE Advances in determination of a high-resolution three-dimensional structure of rhodopsin, a model of G-protein-coupled receptors (GPCRs). Biochemistry 2001, 40, 7761–7772. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Nakamichi H; Okada T Crystallographic analysis of primary visual photochemistry. Angew. Chem. Int. Ed. Engl 2006, 45, 4270–4273. [DOI] [PubMed] [Google Scholar]
  • 87.Nakamichi H; Buss V; Okada T Photoisomerization mechanism of rhodopsin and 9-cis rhodopsin revealed by x-ray crystallography. Biophys. J 2007, 92, 106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Standfuss J; Xie G; Edwards PC; Burghammer M; Oprian DD; Schertler GFX Crystal structure of a thermally stable rhodopsin mutant. J. Mol. Biol 2007, 372, 1179–1188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Nakamichi H; Okada T Local peptide movement in the photoreaction intermediate of rhodopsin. Proc. Natl. Acad. Sci. U. S. A 2006, 103, 12729–12734. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Salom D; Lodowski DT; Stenkamp RE; Le Trong I; Golczak M; Jastrzebska B; Harris T; Ballesteros JA; Palczewski K Crystal structure of a photoactivated deprotonated intermediate of rhodopsin. Proc. Natl. Acad. Sci. U. S. A 2006, 103, 16123–16128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Standfuss J; Edwards PC; D’Antona A; Fransen M; Xie G; Oprian DD; Schertler GFX The structural basis of agonist-induced activation in constitutively active rhodopsin. Nature 2011, 471, 656–660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Makino CL; Riley CK; Looney J; Crouch RK; Okada T Binding of more than one retinoid to visual opsins. Biophys. J 2010, 99, 2366–2373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Park JH; Scheerer P; Hofmann KP; Choe H; Ernst OP Crystal structure of the ligand-free G-protein-coupled receptor opsin. Nature 2008, 454, 183–187. [DOI] [PubMed] [Google Scholar]
  • 94.Scheerer P; Park JH; Hildebrand PW; Kim YJ; Krauss N; Choe H; Hofmann KP; Ernst OP Crystal structure of opsin in its G-protein-interacting conformation. Nature 2008, 455, 497–502. [DOI] [PubMed] [Google Scholar]
  • 95.Choe H; Kim YJ; Park JH; Morizumi T; Pai EF; Krauss N; Hofmann KP; Scheerer P; Ernst OP Crystal structure of metarhodopsin II. Nature 2011, 471, 651–655. [DOI] [PubMed] [Google Scholar]
  • 96.Stenkamp RE Alternative models for two crystal structures of bovine rhodopsin. Acta Crystallogr. D Biol. Crystallogr 2008, D64, 902–904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Blankenship E; Vahedi-Faridi A; Lodowski DT The High-Resolution Structure of Activated Opsin Reveals a Conserved Solvent Network in the Transmembrane Region Essential for Activation. Structure 2015, 23, 2358–2364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Park JH; Morizumi T; Li Y; Hong JE; Pai EF; Hofmann KP; Choe H; Ernst OP Opsin, a structural model for olfactory receptors? Angew. Chem. Int. Ed. Engl 2013, 52, 11021–11024. [DOI] [PubMed] [Google Scholar]
  • 99.Singhal A; Ostermaier MK; Vishnivetskiy SA; Panneels V; Homan KT; Tesmer JJG; Veprintsev D; Deupi X; Gurevich VV; Schertler GFX; Standfuss J Insights into congenital stationary night blindness based on the structure of G90D rhodopsin. EMBO Rep. 2013, 14, 520–526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Szczepek M; Beyrière F; Hofmann KP; Elgeti M; Kazmin R; Rose A; Bartl FJ; von Stetten D; Heck M; Sommer ME; Hildebrand PW; Scheerer P Crystal structure of a common GPCR-binding interface for G protein and arrestin. Nat Commun 2014, 5, 4801. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Deupi X; Edwards P; Singhal A; Nickle B; Oprian D; Schertler G; Standfuss J Stabilized G protein binding site in the structure of constitutively active metarhodopsin-II. Proc. Natl. Acad. Sci. U. S. A 2012, 109, 119–124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Gulati S; Jastrzebska B; Banerjee S; Placeres ÁL; Miszta P; Gao S; Gunderson K; Tochtrop GP; Filipek S; Katayama K; Kiser PD; Mogi M; Stewart PL; Palczewski K Photocyclic behavior of rhodopsin induced by an atypical isomerization mechanism. Proc. Natl. Acad. Sci. U. S. A 2017, 114, E2615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Singhal A; Guo Y; Matkovic M; Schertler G; Deupi X; Yan EC; Standfuss J Structural role of the T94I rhodopsin mutation in congenital stationary night blindness. EMBO Rep. 2016, 17, 1431–1440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Lebon G; Warne T; Edwards PC; Bennett K; Langmead CJ; Leslie AGW; Tate CG Agonist-bound adenosine A2A receptor structures reveal common features of GPCR activation. Nature 2011, 474, 521–525. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Doré AS; Robertson N; Errey JC; Ng I; Hollenstein K; Tehan B; Hurrell E; Bennett K; Congreve M; Magnani F; Tate CG; Weir M; Marshall FH Structure of the adenosine A(2A) receptor in complex with ZM241385 and the xanthines XAC and caffeine. Structure 2011, 19, 1283–1293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Jaakola V; Griffith MT; Hanson MA; Cherezov V; Chien EYT; Lane JR; IJzerman AP; Stevens RC The 2.6 Angstrom Crystal Structure of a Human A2A Adenosine Receptor Bound to an Antagonist. Science 2008, 322, 1211–1217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Xu F; Wu H; Katritch V; Han GW; Jacobson KA; Gao Z; Cherezov V; Stevens RC Structure of an agonist-bound human A2A adenosine receptor. Science 2011, 332, 322–327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Congreve M; Andrews SP; Doré AS; Hollenstein K; Hurrell E; Langmead CJ; Mason JS; Ng IW; Tehan B; Zhukov A; Weir M; Marshall FH Discovery of 1,2,4-triazine derivatives as adenosine A(2A) antagonists using structure based drug design. J. Med. Chem 2012, 55, 1898–1903. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Hino T; Arakawa T; Iwanari H; Yurugi-Kobayashi T; Ikeda-Suno C; Nakada-Nakura Y; Kusano-Arai O; Weyand S; Shimamura T; Nomura N; Cameron AD; Kobayashi T; Hamakubo T; Iwata S; Murata T G-protein-coupled receptor inactivation by an allosteric inverse-agonist antibody. Nature 2012, 482, 237–240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Lebon G; Edwards PC; Leslie AGW; Tate CG Molecular Determinants of CGS21680 Binding to the Human Adenosine A2A Receptor. Mol. Pharmacol 2015, 87, 907–915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Segala E; Guo D; Cheng RKY; Bortolato A; Deflorian F; Doré AS; Errey JC; Heitman LH; IJzerman AP; Marshall FH; Cooke RM Controlling the Dissociation of Ligands from the Adenosine A2A Receptor through Modulation of Salt Bridge Strength. J. Med. Chem 2016, 59, 6470–6479. [DOI] [PubMed] [Google Scholar]
  • 112.Carpenter B; Nehmé R; Warne T; Leslie AGW; Tate CG Structure of the adenosine A(2A) receptor bound to an engineered G protein. Nature 2016, 536, 104–107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Sun B; Bachhawat P; Chu ML; Wood M; Ceska T; Sands ZA; Mercier J; Lebon F; Kobilka TS; Kobilka BK Crystal structure of the adenosine A2A receptor bound to an antagonist reveals a potential allosteric pocket. Proc. Natl. Acad. Sci. U. S. A 2017, 114, 2066–2071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Batyuk A; Galli L; Ishchenko A; Han GW; Gati C; Popov PA; Lee M; Stauch B; White TA; Barty A; Aquila A; Hunter MS; Liang M; Boutet S; Pu M; Liu Z; Nelson G; James D; Li C; Zhao Y; Spence JCH; Liu W; Fromme P; Katritch V; Weierstall U; Stevens RC; Cherezov V Native phasing of x-ray free-electron laser data for a G protein-coupled receptor. Sci Adv 2016, 2, e1600292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Chen KM; Sun J; Salvo JS; Baker D; Barth P High-resolution modeling of transmembrane helical protein structures from distant homologues. PLoS Comput. Biol 2014, 10, e1003636. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Baker D; Sali A Protein structure prediction and structural genomics. Science 2001, 294, 93–96. [DOI] [PubMed] [Google Scholar]
  • 117.Dore AS; Bortolato A; Hollenstein K; Cheng RKY; Marshall*, Randy J Read and Fiona H Decoding Corticotropin-Releasing Factor Receptor Type 1 Crystal Structures. http://www.eurekaselect.com/149116/article (accessed May 1, 2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Thal DM; Sun B; Feng D; Nawaratne V; Leach K; Felder CC; Bures MG; Evans DA; Weis WI; Bachhawat P; Kobilka TS; Sexton PM; Kobilka BK; Christopoulos A Crystal structures of the M1 and M4 muscarinic acetylcholine receptors. Nature 2016, 531, 335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Burg JS; Ingram JR; Venkatakrishnan AJ; Jude KM; Dukkipati A; Feinberg EN; Angelini A; Waghray D; Dror RO; Ploegh HL; Garcia KC Structural basis for chemokine recognition and activation of a viral G protein-coupled receptor. Science 2015, 347, 1113–1117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Zheng Y; Han GW; Abagyan R; Wu B; Stevens RC; Cherezov V; Kufareva I; Handel TM Structure of CC Chemokine Receptor 5 with a Potent Chemokine Antagonist Reveals Mechanisms of Chemokine Recognition and Molecular Mimicry by HIV. Immunity 2017, 46, 1017.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Zhang D; Gao Z; Zhang K; Kiselev E; Crane S; Wang J; Paoletta S; Yi C; Ma L; Zhang W; Han GW; Liu H; Cherezov V; Katritch V; Jiang H; Stevens RC; Jacobson KA; Zhao Q; Wu B Two disparate ligand-binding sites in the human P2Y1 receptor. Nature 2015, 520, 317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Wu H; Wang C; Gregory KJ; Han GW; Cho HP; Xia Y; Niswender CM; Katritch V; Meiler J; Cherezov V; Conn PJ; Stevens RC Structure of a Class C GPCR Metabotropic Glutamate Receptor 1 Bound to an Allosteric Modulator. Science 2014, 344, 58–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Yin J; Babaoglu K; Brautigam CA; Clark L; Shao Z; Scheuermann TH; Harrell CM; Gotter AL; Roecker AJ; Winrow CJ; Renger JJ; Coleman PJ; Rosenbaum DM Structure and ligand-binding mechanism of the human OX1 and OX2 orexin receptors. Nature Structural & Molecular Biology 2016, 23, 293. [DOI] [PubMed] [Google Scholar]
  • 124.Kim DE; Chivian D; Baker D Protein structure prediction and analysis using the Robetta server. Nucleic Acids Res. 2004, 32, 526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Berman HM; Westbrook J; Feng Z; Gilliland G; Bhat TN; Weissig H; Shindyalov IN; Bourne PE The Protein Data Bank. Nucleic Acids Research 2000, 28, 235–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Gerber PR; Muller K MAB, a generally applicable molecular force field for structure modelling in medicinal chemistry. J Computer-Aided M ol Des 1995, 9, 251–268. [DOI] [PubMed] [Google Scholar]
  • 127.Jakalian A; Jack DB; Bayly CI Fast, efficient generation of high-quality atomic charges. AM1-BCC model: II. Parameterization and validation. J Com put Chem 2002, 23, 1623–1641. [DOI] [PubMed] [Google Scholar]
  • 128.Case D; Darden T; Cheatham T; Simmerling C; Wang J; Duke RE; Luo R; Walker RC; Zhang W; Merz K; Roberts B; Hayik S; Roitberg A; Seabra G; Swails J; Gӧtz A; Kolossvary I; Wong KF; Paesani F; Kollman PA AMBER 12, University of California, San Francisco; 2012; Vol. 79, pp 926–935. [Google Scholar]
  • 129.Canutescu AA; Dunbrack RL Cyclic Coordinate Descent: A Robotics Algorithm for Protein Loop Closure. Protein Science 2003, 12, 963–972. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Wang C; Bradley P; Baker D Protein-protein docking with backbone flexibility. J. Mol. Biol 2007, 373, 503–519. [DOI] [PubMed] [Google Scholar]
  • 131.Mandell DJ; Coutsias EA; Kortemme T Sub-Angstrom Accuracy in Protein Loop Reconstruction by Robotics-Inspired Conformational Sampling. Nature Methods 2009, 6, 551–552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Stein A; Kortemme T Increased sampling of near-native protein conformations. PLoS ONE 2013, 8(5). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 133.Berkholz DS; Driggers CM; Shapovalov MV; Dunbrack RL; Karplus PA Nonplanar peptide bonds in proteins are common and conserved but not biased toward active sites. Proc Natl Acad Sci U S A 2012, 109, 449–453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Leaver-Fay A; O’Meara MJ; Tyka M; Jacak R; Song Y; Kellogg EH; Thompson J; Davis IW; Pache RA; Lyskov S; Gray JJ; Kortemme T; Richardson JS; Havranek JJ; Snoeyink J; Baker D; Kuhlman B Scientific Benchmarks for Guiding Macromolecular Energy Function Improvement. Methods Enzymol 2013, 523, 109–143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.O’Meara MJ; Leaver-Fay A; Tyka MD; Stein A; Houlihan K; DiMaio F; Bradley P; Kortemme T; Baker D; Snoeyink J; Kuhlman B Combined Covalent-Electrostatic Model of Hydrogen Bonding Improves Structure Prediction with Rosetta. J. Chem. Theory Comput 2015, 11, 609–622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.Park H; Bradley P; Greisen P; Liu Y; Mulligan VK; Kim DE; Baker D; DiMaio F Simultaneous Optimization of Biomolecular Energy Functions on Features from Small Molecules and Macromolecules. J. Chem. Theory Comput 2016, 12, 6201–6212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137.Alford RF; Leaver-Fay A; Jeliazkov JR; O’Meara MJ; DiMaio FP; Park H; Shapovalov MV; Renfrew PD; Mulligan VK; Kappel K; Labonte JW; Pacella MS; Bonneau R; Bradley P; Dunbrack RL; Das R; Baker D; Kuhlman B; Kortemme T; Gray JJ The Rosetta All-Atom Energy Function for Macromolecular Modeling and Design. J. Chem. Theory Comput 2017, 13, 3031–3048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 138.Boomsma W; Hamelryck T Full cyclic coordinate descent: solving the protein loop closure problem in Ca space. BMC Bioinformatics 2005, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Rotkiewicz P; Skolnick J Fast procedure for reconstruction of full-atom protein models from reduced representations. J Comput Chem 2008, 29, 1460–1465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140.Jacobson MP; Pincus DL; Rapp CS; Day TJF; Honig B; Shaw DE; Friesner RA A hierarchical approach to all-atom protein loop prediction. Proteins 2004, 55, 351–367. [DOI] [PubMed] [Google Scholar]
  • 141.Kufareva I; Katritch V; Stevens RC; Abagyan R Advances in GPCR modeling evaluated by the GPCR Dock 2013 assessment: meeting new challenges. Structure 2014, 22, 1120–1139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 142.Arora B; Coudrat T; Wooten D; Christopoulos A; Noronha SB; Sexton PM Prediction of Loops in G Protein-Coupled Receptor Homology Models: Effect of Imprecise Surroundings and Constraints. J. Chem. Inf. Model 2016, 56, 671–686. [DOI] [PubMed] [Google Scholar]
  • 143.Kmiecik S; Jamroz M; Kolinski M Structure Prediction of the Second Extracellular Loop in G Protein-Coupled Receptors. Biophysical Journal 2014, 106, 2408–2416. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES