Abstract
Two-thirds of human hormones and one-third of clinical drugs activate ~350 G-protein-coupled receptors (GPCR) belonging to four classes: A, B1, C and F. Whereas a model of activation has been described for class A, very little is known about the activation of the other classes, which differ by being activated by endogenous ligands bound mainly or entirely extracellularly. Here we show that, although they use the same structural scaffold and share several ‘helix macroswitches’, the GPCR classes differ in their ‘residue microswitch’ positions and contacts. We present molecular mechanistic maps of activation for each GPCR class and methods for contact analysis applicable for any functional determinants. This provides a superfamily residue-level rationale for conformational selection and allosteric communication by ligands and G proteins, laying the foundation for receptor-function studies and drugs with the desired modality.
Subject terms: Structural biology, Computational biology and bioinformatics, Molecular biology, Drug discovery
Comparative analysis of inactive/active-state structures reveals molecular mechanistic maps of activation of the major GPCR classes. The findings and new approaches lay the foundation for targeted receptor-function studies and drugs with desired modalities.
Main
GPCRs are nature’s primary transmembrane transducers for carrying signals from extracellular ligands to intracellular effectors, to regulate numerous physiological processes. The most widely used nomenclature designates GPCR classes A–F and was introduced in the first version of the GPCR database, GPCRdb1, on the basis of conserved sequence fingerprints2,3. The human GPCRs have also been classified on the basis of phylogenetic analysis into families (classes): glutamate (C), rhodopsin (A), adhesion (B2), Frizzled (F), secretin (B1)4 and Taste 2 (T, reclassified as a separate family in ref. 5). The human GPCRs are activated from different endogenous ligand-binding sites in the transmembrane (class A) or extracellular domain (class C) or both (classes B and F), and have very low sequence similarity (cross-class pairs, mean 23%). This raises the question as to what extent the GPCR superfamily utilizes universal or unique activation mechanisms. Considering that GPCRs mediate the actions of two-thirds of endogenous hormones and neurotransmitters6, and over one-third of drugs7–9, mapping their activation mechanisms is important to understand human physiology, disease etiology and for rational drug design.
Comparisons of inactive and active structures of class A GPCRs have uncovered common activation mechanisms within the seven transmembrane helices (7TM), which have been shown to tilt, rotate, elongate or switch residue side chain rotamers to create contact networks that stabilize the receptors in a specific state10–13. These contacts converge near the G-protein site14 and are conserved regardless of the subtypes of intracellular effector11. Site-directed mutagenesis of contact residues has revealed differences in pharmacological responses, including constitutive activity11 and signaling bias15, and naturally occurring mutations have been associated with disease or change of response11,16,17. However, the classes B, C and F are largely unexplored with respect to common activation mechanisms, and until now such studies have not been possible due to a lack of structures across the activation states.
In this work, we conducted a comprehensive comparative structural analysis of inactive/active-state structures in each GPCR class by analyzing all 488 available structures from different GPCR classes (Fig. 1). We present a GPCR superfamily-wide molecular mechanistic map of activation, and link determinants to ligand-binding, G-protein coupling, transduction and allosteric sites. This can serve as a roadmap to assess the activation of any GPCR and to understand how common or distinct determinants can fine-tune physiological signaling responses and contribute to a desired drug efficacy.
Results
Dataset and methods for comparative structure analysis
To present the best maps possible to date, we made a comprehensive annotation of all available class A, B1, C and F GPCR structures (the 406 class A GPCR structures are twice as many as in any previous report11). Stringent quality filters were applied to these structures to select the representative inactive/active-state structures from each class (Methods and Supplementary Data 1). Out of 68 templates used, 45 have a resolution of 3.0 Å or better. It should be noted that in lower resolution structures (for example, above 3.0 Å) it is mainly backbone movements that can be discerned, while residue contacts can only be under-represented (that is, not lead to false-positive contacts). For classes A and B1, 22 out of 58 and 5 out of 5 receptor families (based on a shared endogenous ligand), respectively, have a structural template. This diversity ensures that any artifacts observed in a specific position or structure will only have a marginal effect on the overall frequency of movements and contacts.
We introduce a definition of state-specific contacts on the basis of their relative frequency (%) in a given inactive/active-state structure. Similarly to the scores calculated in ref. 11, this allows for the identification of many more determinants than requiring 100% presence and absence, respectively, in opposite states. An example is the four inactive- and two active-state stabilizing contacts identified in the landmark study comparing five class A GPCRs across the states14 (in which only two and three active-state templates were G protein- and agonist-bound, respectively). Importantly, the use of frequencies with (different) sets of inactive- and active-state structures opens up not only the same, but also different receptors across the states, for analysis, since the same helix backbone movements and contacts can be mediated by different amino acids. This is critical to the identification of a comparable number of residues with state-specific contacts (‘state-determinant residues’) in classes C and F, for which the structural coverage is limited so far: five and three receptors fulfill our cut-offs, representing 23% and 27% of all members of these relatively small classes, respectively. Furthermore, we also apply residue-pair conservation cut-offs (Methods). Together, the combined contact frequencies and conservation cut-offs address the class representativeness, allowing analyses across the GPCR superfamily.
GPCR activation engages all TMs with unique contact patterns
To investigate how the seven transmembrane helices TM1–7 rearrange upon activation, we compared the 13 GPCRs for which both an inactive- and active-state structure are available (all active templates are in the G-protein-bound state, Supplementary Table 1). We find that all transmembrane helices rearrange at least one end by >1.0 Å in most receptors, thus demonstrating unappreciated structural dynamics (Fig. 2, with individual receptor plots in Extended Data Figs. 1 and 2). Notably, in class B1, all seven transmembrane helices relocate their extracellular ends where the N-terminal domain restricts the conformation of the 7TM before activation18. Glucagon-like peptide 1 (GLP-1), which is the best template having a full-length inactive structure18, has a 2.5–10-Å relocation of TM1–7 in this region; all of which also move in corticotropin releasing factor type 1 (CRF1). Furthermore, we analyzed the conformational change of each class at the membrane mid, extracellular end and cytosolic end. Here the rearrangements total 6, 12 and 28 Å, respectively, and involve on average 1.7, 4.4 and 4.5 transmembrane helices per receptor. This presents a quantitated characterization of the magnitude and abundance of helix rearrangements at the mid, extracellular and cytosolic regions of the transmembrane domain. It supports opposite functional roles in: (1) maintaining a stable GPCR fold, (2) adapting to ligands of diverse size or (3) coupling to the much larger G proteins, respectively. This connects structure and function and reveals a wide engagement of the GPCR fold across its helices and domains.
We next investigated how the rearrangements of the transmembrane helix bundle change the residue contact networks between receptor segments by comparing all 42 inactive-state and 27 active-state representative structures (all active templates are G-protein bound, Supplementary Table 1). We find that, in addition to TM1–7, state-specific contacts are also formed to helix 8 (H8) in classes A and F, intracellular loop 1 (ICL1) in classes A and C and extracellular loop 2 (ECL2) in classes C and F (Figs. 3 and 4). Classes A, C and F have two-thirds to three-quarters of inactivating contacts (75%, 67% and 71%, respectively). Class B1 has fewer (39%) inactivating contacts, but this figure is close to half (namely, 48%) when considering only the contacts spanning different segments (excluding four intrahelical contacts). Sequence conservation analysis of each GPCR class shows that most residue-residue contacts can be formed in at least 30% of receptors. These findings provide a structural rationale of why most receptors have no or little activity without prior stimulation by an agonist. Furthermore, it demonstrates that most TM helices can switch from mainly inactive- to active-state contacts (Fig. 3b). Notably, this rewiring leads to markedly different patterns of segment contacts across GPCR classes, which combine several unique and some common contacts (below), as even high-homology receptors sharing endogenous ligands, such as the A1 and A2A adenosine or muscarinic acetylcholine M1 and M2 receptors, display unique helix movements (Fig. 2).
The common movements and contacts (see below) provide a structural rationale for shared overall functions throughout the GPCR superfamily: for example, ligand-dependent activation, signal transduction across the cell membrane and G-protein coupling. These are complemented by a larger number of unique structural features that allow the receptors to be diversified with respect to the specific ligand scaffold, G-protein profile and functional response kinetics and efficacy.
TM6 universal helix ‘macroswitch’ differs mechanistically
We next investigated single helix rearrangements across 13 receptor inactive/active-state structure pairs. We find that outward movement and rotation of TM6 on the cytosolic side—opening for G-protein coupling—is a universal feature of activation throughout the GPCR superfamily (Fig. 2a), as first suggested for class A19. Intracellular TM6 movement is observed in all 13 receptors investigated and is largest in classes A (7–13 Å), B1 (14–19 Å) and F (7 Å), where it is combined with a substantial average rotation: 38, 39 and 38°, respectively. The extracellular end of TM6 in these classes also moves by on average 2.0, 3.4 and 6.2 Å, respectively. The movement of TM6 gives it the largest (B1 and F) or second largest (A) number of contacts stabilizing an inactive or active state in these classes (Fig. 3); however, TM6 also displays large mechanistic differences across classes. Class B1 has a unique unwinding of the extracellular-facing half of TM6 in all three activated receptors20–22. This corroborates a recent study comparing the class B1 glucagon and class A β2 receptors, linking the weaker ability of the agonist to induce the outward movement of cytoplasmic TM6 to a slower G-protein activation21. Furthermore, in class C, TM6 uniquely has no extracellular movement and only a small cytosolic movement (2.8 Å) and rotation (13°). Consequently, TM6 has only two state-specific contacts and stabilizes only the inactive state, while TM1, TM3 and TM5 have 3–5 contacts across both states (Fig. 3b). The smaller role of TM6 in activation throughout the class is supported by markedly lower conservation of its proline, 55% compared to 93–100% in the other classes A–C, while the importance of TM7 is emphasized by a 95% conserved proline (Fig. 2b). The noncanonical small TM6 tilt, with a less conserved kink and atypical contacts to the adjacent TM5 and TM7, is associated with a different overall mechanism. This is because class C GPCRs work as an asymmetric homo- or heterodimer and the activated transmembrane domain is mainly characterized by dimer reorientation rather than rearrangements within the receptor monomers, of which only one couples to a G protein.
These findings show that the GPCR activation machinery utilizes TM6 as a universal switch in each class, although undergoing different rearrangements, such as helix toggling, rotation and/or unwinding. Such commonality across classes in the activation mechanism at the level of transmembrane helices, but diversity and nuances of the type of movements, provides an important structural rationale for drug discovery and in the future design of experiments to elucidate the effects of mutations, ligand efficacy and G-protein selectivity23 mechanisms.
TM5 is a common switch and TM3 is a hub for stabilization
TM5, like TM6, can be considered a universal switch for GPCR activation, as it moves on the intracellular side in all four classes (with, on average, A: 2.1, B1: 2.4, C: 2.0 and F: 1.8 Å) (Fig. 2b and Extended Data Fig. 2). TM7 also moves in all classes except class C, either on the cytosolic side (all class A receptors), the extracellular side (for example, 10-Å movement and 100° rotation in GLP-1, the class B1 receptor with full-length templates) or on both sides (class F). These movements are possible due to several conserved proline and glycine residues that induce helix plasticity (Fig. 2b). Class A GPCRs have triple proline kinks in TM5–7, which allow TM5 and TM7 to close in on and stabilize TM6. Class B1 TM6 unwinding is facilitated by both a proline kink (P6×47) and a glycine (G6×50). Class B1 GPCRs also feature a proline kink (P5×42) near the extracellular end of TM5, which moves 3.3 Å in GLP-1, and a glycine kink (G7×50) in TM7. Class F combines the TM6 switch with movements of cytosolic TM5 (with one Pro and two Gly residues) and TM7 on both sides. This demonstrates that TM6 does not act on its own but is supported by TM5 and TM7 in a concerted movement and that the determinants of this plasticity are conserved throughout the classes.
We find that TM3 has the largest number of state-specific contacts to other receptor segments in each GPCR class (3–5, Fig. 3). This reveals that TM3, previously shown to be a stabilization hub maintaining the common transmembrane fold24, also plays a central role in stabilization of distinct states across the GPCR superfamily. Furthermore, whereas an early report based on two class A GPCRs suggested an activation mechanism involving an upward movement of TM3 (ref. 25), our analysis of TM helix movements across classes instead points to rotation as the main mechanism (Extended Data Fig. 1). In 11 out of 13 of the investigated receptor pairs (all except A2A and CRF1), TM3 rotates at either the cytosolic (most frequent for class A, average 16°) or extracellular end (classes B1, C and F, average 19°, 12° and 17°, respectively), while no receptor rotates at both ends or at the membrane mid (Extended Data Figs. 1 and 2). Lateral movement of TM3 contributes to a different extent across classes, being mainly at: both ends (B1), the cytosolic end (C), either end (a minority of receptors in A) or no movement (F). In contrast, TM4, which is peripherally located in the transmembrane helix bundle, has few or no contacts (class A:1, B1:0, C:3 and F:0). This shows that the abundant helix packing has not immobilized TM3. Instead, it contributes to GPCR activation in several places through an array of mainly local rotations or movements.
Residue ‘microswitches’ expand the class A activation model
To investigate state determinants at the residue ‘microswitch’ level, we indexed topologically corresponding receptor positions with generic residue numbers26 and classified them into ‘inactivators’, ‘activators’ and ‘switches’ on the basis of frequent contacts in inactive, active and both states, respectively. We uncovered a comparable number of state determinants across classes (Fig. 4a). Across the classes, these span 94 distinct residue positions, 67 inactivators, 37 activators and only 11 switches (Fig. 4b and Supplementary Table 2). Only nine switches undergo side chain rotamer shifts, revealing that the rotamer microswitches—described as major state determinants for class A GPCRs12,13—play a small role in the GPCR superfamily. Importantly, many determinant positions contain the same highly conserved amino acid (Fig. 4a), and the amino acid pairs observed for each contact are conserved in at least 40% of all receptors. Notably, this includes the reference positions for generic residue numbers (index ×50) in five out of seven TMs, H8 and ICL1, all major previously known class A state determinants11,12 and a class F switch, 6×32 (refs. 27,28) (sequence motifs and microswitches with magenta border in Fig. 4a). This demonstrates that our approach (Discussion) identifies both known and new conserved determinants, even where the structural coverage is limited, including the active state of classes C and F.
In class A, W6×48, earlier suggested to be a rotamer toggle switch12, does not itself undergo a rotamer shift but a helix rotational shift (of 10°) and is approached by I3×40, which has both types of rotation. This provides an alternative to a reported ‘PIF’ motif29–31, which is here found to consist most frequently of ‘PIW’ (P5×50 being the third residue). The PIF motif’s last residue, F6×44, rather acts as a switch in a new triplet ‘LLF’ located in the same helices: TM3, TM5 and TM6. The two residues D2×50 and N7×49, which coordinate a sodium ion32, are here found to have a direct interaction that stabilizes the inactive state. Notably, N7×49 contacts P7×50, uncovering a concerted stabilization across the sodium ion site and TM7 helix kink around the ‘NPxxY’ motif. The final residue of the NPxxY motif, a Y7×53 switch11–13, has three inactivating and two activating state-specific contacts with over 40% frequency difference, including to another switch: I3×46. The TM3 ‘DRY’ motif includes an intrahelical ionic lock from D3×49 to R3×50, which, upon activation, swings to interact with the G protein as well as the switch Y5×58 on TM5 (ref. 13). Of note, the restraining of R3×50 is strengthened by contacts to TM2 and TM6 (A6×34)—however, typically not to E6×30, which was part of the first ionic lock reported in rhodopsin33 but less conserved (E: 25% or D/E: 31% compared to D: 65% or D/E: 86% for position 4×49). On the intracellular side, H8 and ICL1 contain three and two determinants, respectively, including the novel switch F8×50 contacting another new switch on TM7, A7×54. These findings corroborate the findings of previous studies on class A10–14. They also, together with the concerted movement of TM5–7 and the role of TM3 as a state-stabilization hub (see above), substantially expand the activation model of class A receptors.
To further substantiate the importance of inactivating and activating state determinants (collectively, predicted state-changing residue positions), we performed mutagenesis experiments and measured epinephrine-induced β2-adrenoceptor activation of Gs and G15 using bioluminescence energy transfer (BRET)-based biosensors. We mutated six predicted state-changing (contact frequency difference >80% across states) and six nonstate-changing residues to alanine. To isolate effects due to intrareceptor conformational stabilization, these excluded residues interacting with ligands or G proteins in structure complexes. We found that state-changing positions are more prone than nonstate-changing mutations to alanine mutation-induced potency reductions for Gs (mean log(EC50) from wild type (WT) 1.07 versus 0.22 (EC50, half-maximum effective concentration); Wilcoxon rank-sum test: P = 0.0193) and G15 (mean log(EC50) from WT 1.25 versus 0.25; Wilcoxon rank-sum test: P = 0.0049) (Fig. 5a and Supplementary Table 2). In contrast, the mutations did not have a statistically significant differential effect on efficacy (Wilcoxon rank-sum test; Gs: P = 0.6991; G15: P = 0.3095). Five out of six state-changing mutations cluster tightly in the transduction pathway between the ligand and G-protein pockets, and this group of mutations more frequently form intrahelical receptor contacts, whereas several nonstate-changing mutations instead face the membrane (Fig. 5b). This confirms the correlation between state-specific structural residue-residue contacts and ligand-induced pharmacological receptor activity, and points to reduction in potency (not efficacy) and differential intrahelical contacts as underlying determinants.
Class B1 state-determinant residues shared with class A
The comparison of unique and common state determinants across the GPCR superfamily shows that nearly half of the B1 determinants (16 out of 33) map to equivalent topological positions as in class A, compared to 10 for class F and 8 for class C (leftmost in Fig. 4b). Furthermore, the classes B1 and A have four common switches (second to rightmost in Fig. 4b). Notably, this includes the two known microswitches (A/B1 residue number): Y5×58/F5×54 and Y7×53/Y7×57 as well as the pair I3×46/E3×50 and F6×44/L6×49 (magenta in Extended Data Fig. 3). We also find that the class A ‘toggle switch’ activator W6×48 is substituted by a conserved smaller aromatic residue Y6×53 in class B1(Extended Data Fig. 3). Together, these commonalities between classes B1 and A are indicative of partially shared activation mechanisms.
However, there are also markedly unique features in class B1. Strikingly, 14 out of 33 of the state determinants in class B1 are located in TM6, which engages 10 additional determinants (mostly in TM5 and TM7) (Fig. 4) that move together with TM6 (Fig. 2). The high concentration of state determinants in TM6, and packing to adjacent helices, may provide a plausible structural rationale for a recent report demonstrating a higher energy barrier for the formation of the kinked and partially unwound TM6 in the class B1 glucagon receptor compared to the class A β2-adrenoceptor21. Another unique feature in class B1 is two additional switches in TM7 (Q7×49 and L7×56). Class B1, like class A, has a large structural coverage (10 out of 15 receptors) and contains several major drug targets7. The map of state determinants in class B1 gives a better understanding of the basic receptor-activation mechanisms and presents a foundation for targeting determinant networks in structure-based design of new drugs that stabilize receptors in the desired state.
No universal residue-level microswitch mechanism
Class C has the fewest residue state-determinant ‘microswitches’ and no activation switch with high-frequency contacts in both states. This is in concordance with its smallest (by far) conformational change (Fig. 2). Another characteristic of class C is that it has no determinants that form contacts within the same transmembrane helix, whereas each of classes A, B1 and F has four such microswitches. This reflects that class C GPCRs uniquely bind the endogenous ligand entirely in the N terminus and transduce signals across the membrane by dimers that reorient. In addition to the 7TM domain, class C GPCRs have one determinant in ECL2 and two microswitches in ICL1 (as does class A). Class F has 40 residue microswitches, of which one is in H8 (8×54) and two in ECL2 (Y45×51 and V45×52). The two ECL2 positions follow a conserved cysteine C45×50, which forms a covalent disulfide bridge, also to TM3, across the GPCR classes34. Class F has the highest conservation of determinant consensus amino acids (on average 82%) showing that most contacts identified in the three structural templates: FZD4, FZD7 and Smoothened, are probably shared by the remaining eight receptors. Class F has one switch, Y45×51.
By comparing all GPCR classes, we find that 60 out of 94 determinant positions are unique to one class, whereas 27 are found in two classes and 7 in three classes (Fig. 4b). The determinants common for at least two classes are even fewer when considering their type: ten inactivators (maximum five in A–F), five activators (maximum two in A–B1) and four switches (all between A and B1) (Fig. 4b and Extended Data Fig. 3). These findings demonstrate that although they belong to the same superfamily, use the same structural scaffold and share several macroswitches (helices), the GPCR classes differ in their microswitches (residues). This suggests that there is an ensemble of structural/mechanistic solutions that are available for the GPCR superfamily during evolution and that different receptor classes have explored. It also means that while thinking about developing drugs with different modalities—especially for classes C and F—one should aim to interact with or modulate class-specific state determinants and residue-level microswitches rather than use the same microswitches as in class A.
State determinants and functional interface sites
To obtain topological and functional mapping of the state determinants, we mapped their location in relation to ligand and Gα protein-interacting positions from structures (Fig. 6 and Supplementary Data 2). We find that 2%, 10%, 10% and 30% of determinants in classes class A, C, F and B1, respectively, map to ligand-interacting positions in the upper part of the transmembrane helix domain or ECL2 (the orthosteric binding pocket in classes A, B1 and F). The single such determinant in class A is the helix rotation switch W6×48. Furthermore, 0%, 10%, 29% and 36% of determinants in classes class C, F, A and B1, respectively, map to G-protein-interacting positions. Together, this shows that while most determinants are in the transduction pathway between ligand and G-protein sites, except for in class B1, ligands and G proteins can sense and stabilize receptor states by directly interacting with state determinants.
Given their important role in modulating GPCR activity, we specifically mapped switches, that is, determinant residues, that alternate contacts across the inactive/active states. In class A, two switches (R3×50 and 5×58) have direct G-protein interactions, whereas five switches are in transmembrane helices and between ligand and G-protein positions—thereby facilitating signal transduction across the membrane. Class B1 switches are distributed across orthosteric ligand (6×49), G protein (3×50, 7×56 and 7×57) and two other (5×54 and 7×49) positions. Class C uniquely has no switches and the single switch of class F is located in position 45×51, which is a ligand-interacting position in ECL2. These findings show that switches are spread throughout the 7TM bundle. Altogether, the functional mapping may help to explain observed effects on ligand or G-protein affinity from remote mutations, and presents a residue-level rationale for conformational selection35 and allosteric modulation36.
Discussion
We present molecular mechanistic maps for activation across the GPCR superfamily and helix macro/microscale residues while extending beyond the transmembrane region to H8 and structurally conserved loop segments. This study has shown that activation of other classes cannot be modeled on the basis of class A. Our findings demonstrate that, although they belong to the same superfamily, use the same structural scaffold and share several helix macroswitches, the GPCR classes differ in their microswitch residue positions, contacts and amino acids. This applies also to class B1, which shares about half of the determinant positions in class A, as the similarity is very small if considering the type of determinant—stabilizing an inactive, active or both states—and their specific consensus amino acids. This highlights the need to elucidate restraints and diverse activation mechanisms separately for each GPCR class and in more detail, to adequately capture structure-function relationships. Determinants of receptor activity are closely tied to the molecular mechanisms of conformational selection35 and allosteric modulation36, and influence an array of functions and responses, including ligand affinity, basal activity, efficacy and G-protein coupling. Therefore, the contact maps (Fig. 4) presented here provide an actionable foundation for the field, to design and interpret experiments across structural, biophysical (for example, fluorescence and double electron–electron resonance (DEER)), molecular dynamics and mutagenesis studies. The maps also inform drug discovery of which inactivating and activating state determinants are in the orthosteric and allosteric ligand sites and may therefore be exploited to design inverse agonists, neutral antagonists or agonists for different receptors.
The extensive engineering and limited resolution of some structures is an inherent limitation for all structure-based studies. Therefore, more native-like structures and advances in their determination are of utmost value and will serve to continuously refine structure-function relationships generally. For example, the universal TM5 switch reported herein was not discernible until cryo-EM allowed for more structures with a native TM5–ICL3–TM6 region (often subjected to deletion and protein fusion specifically in crystallography). It has furthermore been suggested that there are sequential conformational changes during GPCR activation and G-protein coupling, with transient intermediate states facilitating the transition of the extensive conformational rearrangement, which is not captured by currently available complex structures37,38. The proposed intermediate-state complexes may require additional state determinants beyond the ones identified here. Hence, going forward, it will be important to combine structural studies with biophysical investigations, such as fluorescence resonance energy transfer (FRET)-based systems39, DEER40, NMR41 or even mass spectroscopy42, for monitoring specific interactions in more infrequent conformations.
Our combined contact frequency and residue-pair conservation cut-offs uniquely address the class representativeness and allow the GPCR superfamily to be described. Classes A, B1, C and F have a 60, 42, 81 and 82% average conservation of determinant consensus amino acids, respectively. For these reasons, the common conserved determinants identified herein should still apply as our knowledge expands from new structures, which could also allow additional determinants to fulfill these cut-offs. Furthermore, although specific receptors and subsets thereof could have additional activation mechanisms not conserved in the whole class, such specific and general mechanisms, respectively, could act in concert.
Methods
Structure annotation and selection of representative template dataset
We annotated 510 GPCR structures from the Protein Data Bank49. These were all class A and B1 GPCR structures released before 1 November 2020, and all class C and F GPCR structures released by July 2021 (due to the relatively few templates, the two latter classes were updated during manuscript revision). For the two GABAB1–2 heterodimer structures (PDB 7C7Q and 7C7S), two additional artificial structures (PDB YZ01 and YZ02, respectively) were used herein to separate the GABAB1 monomer from the GABAB12 monomer. We selected a representative structure of each GPCR and inactive or active state by applying comprehensive filter criteria spanning completeness of receptor and G protein (≥83% and ≥43% of generic residue positions, respectively), sequence identity >90% to human, resolution ≤3.6 Å, degree active50 (≤20% and ≥90% for inactive and active structures, respectively) and consistent ligand modality and state50 (inverse agonist/antagonist inactive and agonist active). For representative templates of the active state, a G-protein complex was required. For GABAB2–Gi1 the PDB entry (7C7Q) does not include the G protein and the complex structure model was instead received from the authors (the EM data are deposited in the EMDB database under EMD-30300) (Supplementary Data 1)46.
Transmembrane helix rearrangement analysis
Two-dimensional (2D) plots for TM1–7 segment movement at the extracellular end, cytosolic end and membrane mid, respectively, were produced using our webserver for comparative structure analysis (http://review.gpcrdb.org/structure_comparison/comparative_analysis and ref. 50). Class counts of moving TM1–7 were calculated as the sum of consensus movements above 1.0 Å.
Segment (TM1–7, H8 and loops) contact analysis
Activation-dependent changes in segment networks were determined using 2D network plots of segment contacts50. Segment switches (or helix switches) were defined as the segments with distinct contacts (see ‘Generic residue numbering’) in both states.
Generic residue numbering
Corresponding residue positions in each class were indexed with the structure-based GPCRdb generic residue numbering system26. This builds on the sequence-based generic residue numbering systems for classes A (Ballesteros–Weinstein), B1 (Wootten), C (Pin) and F (Wang), but preserves gaps from a structural alignment of two receptors caused by a unique helix bulge or constriction in the sequence alignment, thereby avoiding offset of these and the following residues. All schemes assign residue numbers relative to the most conserved amino acid residue, which is given the number 50, and prefixed with the TM helix number (for example, 3×32 is on TM3 and 18 positions before the reference). This generic residue numbering scheme also uniquely indexes H8 and structurally conserved loop segments, which are numbered by the preceding and following TM helix (for example, 45 is ECL2 located between TM4 and TM5).
State-stabilizing contact identification
State-stabilizing contacts were identified using the Structure comparison tool50. The most distinct contact in class A (1×49–7×50 contact) has 80% higher frequency in the inactive than in the active state, and the specificity of state-specific contacts depends on the number of members and templates in each class. To obtain comparable numbers of networks and contacts, we therefore adjusted the class frequency difference thresholds accordingly (no. members and inactive/active-state templates): A: 40% (285, 33/14), B1: 67% (15, 3/10), C: 75% (22, 4/2) and F: 100% (11, 2/2). To ensure that the identified determinants have a wide role in each class, we also applied a sequence conservation cut-off. This cut-off requires the amino acids pairs that make up a state-specific contact to be conserved, and therefore able to be formed in at least 30% of all receptors in the given GPCR class. The remaining residues, referred to as ‘state determinants’ or ‘state stabilizers’, were further classified into ‘inactivators’, ‘activators’ and ‘switches’ on the basis of whether their most frequent contact occurs in inactive, active and both states, respectively.
State-determinant topological mapping in relation to ligand and G-protein sites
We mapped state stabilizers to functional sites to ligand and G-protein sites using the comparative structure analysis tool of GPCRdb50. Allosteric ligand-interacting positions outside the upper part of the transmembrane helix domain and ECL2 (the orthosteric binding pocket in classes A, B1 and F) were omitted.
Mutagenesis and BRET-based signaling assay
Codon-optimized human β2-adrenoceptor (β2) was cloned into pcDNA3.1 with an N-terminal signal sequence, Twin-Strep-tag and SNAP-tag. All biosensor constructs were in pcDNA3.1 (G15 biosensor51 and Gs sensor:52,53). Mutations were made as described in ref. 54. Biosensor and receptor DNA was transiently transfected into HEKSL cells (a gift from S. Laporte). Cell culture and transfection was performed as described in ref. 55. After incubation for two days at 37 °C with 5% CO2, DMEM was replaced with Tyrode’s buffer (137 mM NaCl, 0.9 mM KCl, 1 mM MgCl2, 11.9 mM NaHCO3, 3.6 mM NaH2PO4, 25 mM HEPES, 5.5 mM glucose,1 mM CaCl2, pH 7.4), followed by incubation for at least 30 min at 37 °C. Ligand was added 10 min before the measurement and the luciferase substrate coelenterazine 400a (Nanolight Technology) was added 5 min before the measurement. Coelenterazine 400a was added to a final concentration of 5 μM and ligand concentrations ranged from 31.6 nM to 3.16 mM in half-log steps. In addition, a buffer control was included. BRET was read in a Synergy Neo (Biotek) plate reader at 410 and 515 nm. All signaling experiments were done in biological triplicates.
Statistics
For the experimental alanine mutations of predicted state-changing and nonstate-changing residues (Fig. 5), statistical significance has been assessed by a two-sided Wilcoxon rank-sum test (n = 6 for each category, individual data points in Supplementary Table 2).
Reporting Summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Research reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41594-021-00674-7.
Supplementary information
Acknowledgements
This work was supported by the Lundbeck Foundation (grants R163-2013-16327 and R218-2016-1266), the Novo Nordisk Foundation (grant NNF18OC0031226) and Independent Research Fund Denmark, Natural Sciences (grant 8021-00173B) to D.E.G., the American Lebanese Syrian Associated Charities (ALSAC) to M.M.B., a Marie Skłodowska-Curie Individual Fellowship from the European Union’s Horizon 2020 research and innovation programme (grant 844622) and the American Heart Association (grant 19POST34380839) to F.M.H., the Canadian Institute for Health Research (grant FDN-148431) to M.B. and the Swiss National Science Foundation (grant 159748) to D.B.V.
Extended data
Author contributions
D.E.G. provided conceptualization of the project. A.J.K. and F.M.H. undertook data curation. Formal analysis was performed by A.S.H., D.E.G. and F.M.H. Funding acquisition was by D.B.V., D.E.G., M.B. and M.M.B. Investigations were carried out by D.E.G. and F.M.H. D.E.G., C.M. and A.J.K. designed the methodology. Project administration was by D.E.G. Resources were provided by M.B. Software was designed by A.J.K. and C.M. D.E.G., D.B.V. and M.B. supervised the project. Validation was performed by A.J.K., D.E.G. and F.M.H. Visualization was by D.E.G., A.S.H. and A.J.K. D.E.G. wrote the original draft and D.E.G., F.M.H. and M.M.B. reviewed and edited the manuscript. The manuscript was read and approved by all authors.
Data availability
All data are available in GPCRdb (https://review.gpcrdb.org), GitHub (https://github.com/protwis/gpcrdb_data) and Supplementary Data 1 and 2.
Code availability
No code was developed for this manuscript, which instead used the existing GPCRdb56 resources, including a new comparative structure analysis platform50. All open-source code can be obtained from GitHub (https://github.com/protwis/protwis) under the permissive Apache 2.0 License (https://www.apache.org/licenses/LICENSE-2.0). A complete list of the software used for data analysis is available from the Nature Research Reporting Summary.
Competing interests
M.B. is the president of Domain Therapeutics scientific advisory board. D.B.V. is a founder and a director of Z7 Biotech Ltd. After the completion of this study, C.M. moved to become an employee of Novozymes A/S. The other authors declare no competing interests.
Footnotes
Peer review information Nature Structural & Molecular Biology thanks Ka Young Chung and Brian Shoichet for their contribution to the peer review of this work. Peer reviewer reports are available. Florian Ullrich was the primary editor on this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Alexander S. Hauser, Albert J. Kooistra.
Extended data
is available for this paper at 10.1038/s41594-021-00674-7.
Supplementary information
The online version contains supplementary material available at 10.1038/s41594-021-00674-7.
References
- 1.Kolakowski LF., Jr. GCRDb: a G-protein-coupled receptor database. Recept. Channels. 1994;2:1–7. [PubMed] [Google Scholar]
- 2.Attwood TK, Findlay JB. Design of a discriminating fingerprint for G-protein-coupled receptors. Protein Eng. 1993;6:167–176. doi: 10.1093/protein/6.2.167. [DOI] [PubMed] [Google Scholar]
- 3.Attwood TK, Findlay JB. Fingerprinting G-protein-coupled receptors. Protein Eng. 1994;7:195–203. doi: 10.1093/protein/7.2.195. [DOI] [PubMed] [Google Scholar]
- 4.Fredriksson R, Lagerstrom MC, Lundin LG, Schioth HB. The G-protein-coupled receptors in the human genome form five main families. Phylogenetic analysis, paralogon groups, and fingerprints. Mol. Pharmacol. 2003;63:1256–1272. doi: 10.1124/mol.63.6.1256. [DOI] [PubMed] [Google Scholar]
- 5.Nordstrom KJ, Sallman Almen M, Edstam MM, Fredriksson R, Schioth HB. Independent HHsearch, Needleman–Wunsch-based, and motif analyses reveal the overall hierarchy for most of the G protein-coupled receptor families. Mol. Biol. Evol. 2011;28:2471–2480. doi: 10.1093/molbev/msr061. [DOI] [PubMed] [Google Scholar]
- 6.Foster SR, et al. Discovery of human signaling systems: pairing peptides to G protein-coupled receptors. Cell. 2019;179:895–908.e21. doi: 10.1016/j.cell.2019.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hauser AS, Attwood MM, Rask-Andersen M, Schioth HB, Gloriam DE. Trends in GPCR drug discovery: new agents, targets and indications. Nat. Rev. Drug Discov. 2017;16:829–842. doi: 10.1038/nrd.2017.178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Sriram K, Insel PA. G protein-coupled receptors as targets for approved drugs: how many targets and how many drugs? Mol. Pharmacol. 2018;93:251–258. doi: 10.1124/mol.117.111062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Santos R, et al. A comprehensive map of molecular drug targets. Nat. Rev. Drug Discov. 2017;16:19–34. doi: 10.1038/nrd.2016.230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Flock T, et al. Selectivity determinants of GPCR–G-protein binding. Nature. 2017;545:317–322. doi: 10.1038/nature22070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Zhou Q, et al. Common activation mechanism of class A GPCRs. eLife. 2019;8:e50279. doi: 10.7554/eLife.50279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Nygaard R, Frimurer TM, Holst B, Rosenkilde MM, Schwartz TW. Ligand binding and micro-switches in 7TM receptor structures. Trends Pharmacol. Sci. 2009;30:249–259. doi: 10.1016/j.tips.2009.02.006. [DOI] [PubMed] [Google Scholar]
- 13.Carpenter B, Tate CG. Active state structures of G protein-coupled receptors highlight the similarities and differences in the G protein and arrestin coupling interfaces. Curr. Opin. Struct. Biol. 2017;45:124–132. doi: 10.1016/j.sbi.2017.04.010. [DOI] [PubMed] [Google Scholar]
- 14.Venkatakrishnan AJ, et al. Diverse activation pathways in class A GPCRs converge near the G-protein-coupling region. Nature. 2016;536:484–487. doi: 10.1038/nature19107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Schonegge AM, et al. Evolutionary action and structural basis of the allosteric switch controlling β2AR functional selectivity. Nat. Commun. 2017;8:2169. doi: 10.1038/s41467-017-02257-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hauser AS, et al. Pharmacogenomics of GPCR drug targets. Cell. 2018;172:41–54.e19. doi: 10.1016/j.cell.2017.11.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Schoneberg T, Liebscher I. Mutations in G protein-coupled receptors: mechanisms, pathophysiology and potential therapeutic approaches. Pharmacol. Rev. 2021;73:89–119. doi: 10.1124/pharmrev.120.000011. [DOI] [PubMed] [Google Scholar]
- 18.Wu F, et al. Full-length human GLP-1 receptor structure without orthosteric ligands. Nat. Commun. 2020;11:1272. doi: 10.1038/s41467-020-14934-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Schoneberg T, Schultz G, Gudermann T. Structural basis of G protein-coupled receptor function. Mol. Cell. Endocrinol. 1999;151:181–193. doi: 10.1016/s0303-7207(99)00017-9. [DOI] [PubMed] [Google Scholar]
- 20.Liang YL, et al. Toward a structural understanding of class B GPCR peptide binding and activation. Mol. Cell. 2020;77:656–668.e5. doi: 10.1016/j.molcel.2020.01.012. [DOI] [PubMed] [Google Scholar]
- 21.Hilger D, et al. Structural insights into differences in G protein activation by family A and family B GPCRs. Science. 2020;369:eaba3373. doi: 10.1126/science.aba3373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zhang X, et al. Differential GLP-1R binding and activation by peptide and non-peptide agonists. Mol. Cell. 2020;80:485–500.e7. doi: 10.1016/j.molcel.2020.09.020. [DOI] [PubMed] [Google Scholar]
- 23.Capper MJ, Wacker D. How the ubiquitous GPCR receptor family selectively activates signalling pathways. Nature. 2018;558:529–530. doi: 10.1038/d41586-018-05503-4. [DOI] [PubMed] [Google Scholar]
- 24.Venkatakrishnan AJ, et al. Molecular signatures of G-protein-coupled receptors. Nature. 2013;494:185–194. doi: 10.1038/nature11896. [DOI] [PubMed] [Google Scholar]
- 25.Tehan BG, Bortolato A, Blaney FE, Weir MP, Mason JS. Unifying family A GPCR theories of activation. Pharmacol. Ther. 2014;143:51–60. doi: 10.1016/j.pharmthera.2014.02.004. [DOI] [PubMed] [Google Scholar]
- 26.Isberg V, et al. Generic GPCR residue numbers—aligning topology maps while minding the gaps. Trends Pharmacol. Sci. 2015;36:22–31. doi: 10.1016/j.tips.2014.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wright SC, et al. A conserved molecular switch in class F receptors regulates receptor activation and pathway selection. Nat. Commun. 2019;10:667. doi: 10.1038/s41467-019-08630-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Turku A, Schihada H, Kozielewicz P, Bowin CF, Schulte G. Residue 6.43 defines receptor function in class F GPCRs. Nat. Commun. 2021;12:3919. doi: 10.1038/s41467-021-24004-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Rasmussen SG, et al. Structure of a nanobody-stabilized active state of the β2 adrenoceptor. Nature. 2011;469:175–180. doi: 10.1038/nature09648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Wacker D, et al. Structural features for functional selectivity at serotonin receptors. Science. 2013;340:615–619. doi: 10.1126/science.1232808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Huang W, et al. Structural insights into micro-opioid receptor activation. Nature. 2015;524:315–321. doi: 10.1038/nature14886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.White KL, et al. Structural connection between activation microswitch and allosteric sodium site in GPCR signaling. Structure. 2018;26:259–269 e255. doi: 10.1016/j.str.2017.12.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Palczewski K, et al. Crystal structure of rhodopsin: a G protein-coupled receptor. Science. 2000;289:739–745. doi: 10.1126/science.289.5480.739. [DOI] [PubMed] [Google Scholar]
- 34.de Graaf C, Foata N, Engkvist O, Rognan D. Molecular modeling of the second extracellular loop of G-protein coupled receptors and its implication on structure-based virtual screening. Proteins. 2008;71:599–620. doi: 10.1002/prot.21724. [DOI] [PubMed] [Google Scholar]
- 35.Kenakin T. Biased receptor signaling in drug discovery. Pharmacol. Rev. 2019;71:267–315. doi: 10.1124/pr.118.016790. [DOI] [PubMed] [Google Scholar]
- 36.DeVree BT, et al. Allosteric coupling from G protein to the agonist-binding pocket in GPCRs. Nature. 2016;535:182–186. doi: 10.1038/nature18324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Liu, X. et al. Structural insights into the process of GPCR-G protein complex formation. Cell10.1016/j.cell.2019.04.021 (2019). [DOI] [PMC free article] [PubMed]
- 38.Du Y, et al. Assembly of a GPCR-G protein complex. Cell. 2019;177:1232–1242.e11. doi: 10.1016/j.cell.2019.04.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Gregorio GG, et al. Single-molecule analysis of ligand efficacy in β2AR–G-protein activation. Nature. 2017;547:68–73. doi: 10.1038/nature22354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lerch MT, et al. Viewing rare conformations of the β2 adrenergic receptor with pressure-resolved DEER spectroscopy. Proc. Natl Acad. Sci. USA. 2020;117:31824–31831. doi: 10.1073/pnas.2013904117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ma X, et al. Analysis of β2AR-Gs and β2AR-Gi complex formation by NMR spectroscopy. Proc. Natl Acad. Sci. USA. 2020;117:23096–23105. doi: 10.1073/pnas.2009786117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Bolla JR, Fiorentino F, Robinson CV. Mass spectrometry informs the structure and dynamics of membrane proteins involved in lipid and drug transport. Curr. Opin. Struct. Biol. 2021;70:53–60. doi: 10.1016/j.sbi.2021.03.014. [DOI] [PubMed] [Google Scholar]
- 43.Lomize MA, Pogozheva ID, Joo H, Mosberg HI, Lomize AL. OPM database and PPM web server: resources for positioning of proteins in membranes. Nucleic Acids Res. 2012;40:D370–D376. doi: 10.1093/nar/gkr703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Cherezov V, et al. High-resolution crystal structure of an engineered human β2-adrenergic G protein-coupled receptor. Science. 2007;318:1258–1265. doi: 10.1126/science.1150577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Rasmussen SG, et al. Crystal structure of the β2 adrenergic receptor–Gs protein complex. Nature. 2011;477:549–555. doi: 10.1038/nature10361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Mao C, et al. Cryo-EM structures of inactive and active GABAB receptor. Cell Res. 2020;30:564–573. doi: 10.1038/s41422-020-0350-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Wang C, et al. Structure of the human smoothened receptor bound to an antitumour agent. Nature. 2013;497:338–343. doi: 10.1038/nature12167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Qi X, Friedberg L, De Bose-Boyd R, Long T, Li X. Sterols in an intramolecular channel of Smoothened mediate Hedgehog signaling. Nat. Chem. Biol. 2020;16:1368–1375. doi: 10.1038/s41589-020-0646-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Burley SK, et al. RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy. Nucleic Acids Res. 2019;47:D464–D474. doi: 10.1093/nar/gky1004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Gloriam, D., Kooistra, A., Munk, C. & Hauser, A. An online GPCR structure analysis platform. Nat. Struct. Mol. Biol. 10.1038/s41594-021-00675-6 (2021). [DOI] [PubMed]
- 51.Avet, C. et al. Effector membrane translocation biosensors reveal G protein and β-arrestin profiles of 100 therapeutically relevant GPCRs. Preprint at bioRxiv10.1101/2020.04.20.052027 (2020). [DOI] [PMC free article] [PubMed]
- 52.Galés C, et al. Probing the activation-promoted structural rearrangements in preassembled receptor-G protein complexes. Nat. Struct. Mol. Biol. 2006;13:778–786. doi: 10.1038/nsmb1134. [DOI] [PubMed] [Google Scholar]
- 53.Galés C, et al. Real-time monitoring of receptor and G-protein interactions in living cells. Nat. Methods. 2005;2:177–184. doi: 10.1038/nmeth743. [DOI] [PubMed] [Google Scholar]
- 54.Heydenreich FM, et al. High-throughput mutagenesis using a two-fragment PCR approach. Sci. Rep. 2017;7:6787. doi: 10.1038/s41598-017-07010-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Heydenreich, F. M. et al. Vasopressin V2 is a promiscuous G protein-coupled receptor that is biased by its peptide ligands. Preprint at bioRxiv10.1101/2021.01.28.427950 (2021).
- 56.Kooistra AJ, et al. GPCRdb in 2021: integrating GPCR sequence, structure and function. Nucleic Acids Res. 2021;49:D335–D343. doi: 10.1093/nar/gkaa1080. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are available in GPCRdb (https://review.gpcrdb.org), GitHub (https://github.com/protwis/gpcrdb_data) and Supplementary Data 1 and 2.
No code was developed for this manuscript, which instead used the existing GPCRdb56 resources, including a new comparative structure analysis platform50. All open-source code can be obtained from GitHub (https://github.com/protwis/protwis) under the permissive Apache 2.0 License (https://www.apache.org/licenses/LICENSE-2.0). A complete list of the software used for data analysis is available from the Nature Research Reporting Summary.