Abstract
G protein-coupled receptors (GPCRs) form the largest family of membrane receptors in the human genome. Advances in membrane protein crystallization so far resulted in the determination of 24 receptors available as high-resolution atomic structures. We performed the first phylogenetic analysis of GPCRs based on the available set of GPCR structures. We present a new phylogenetic tree of known human rhodopsin-like GPCR sequences based on this structure set. We can distinguish the three separate classes of small-ligand binding GPCRs, peptide binding GPCRs, and olfactory receptors. Analyzing different structural subdomains, we found that small molecule binding receptors most likely have evolved from peptide receptor precursors, with a rhodopsin/S1PR1 ancestor, most likely an ancestral opsin, forming the link between both classes. A light-activated receptor therefore seems to be the origin of the small molecule hormone receptors of the central nervous system. We find hints for a common evolutionary path of both ligand binding site and central sodium/water binding site. Surprisingly, opioid receptors exhibit both a binding cavity and a central sodium/water binding site similar to the one of biogenic amine receptors instead of peptide receptors, making them seemingly prone to bind small molecule ligands, e.g. opiates. Our results give new insights into the relationship and the pharmacological properties of rhodopsin-like GPCRs.
Introduction
G-protein coupled receptors (GPCRs) form the largest group of membrane receptors [1]. Though they exhibit an extraordinary broad scope of recognized stimuli such as peptides, small organic molecules, calcium ions, and even light, all receptors share a heptahelical transmembrane (7TM) motif. Though GPCRs are the target of about 40% of all drugs commercially available [2], the first GPCR structure became available as late as 2000 with the successful crystallization of bovine rhodopsin [3]. Due to the development of new crystallization techniques, GPCR structure determination then had its major breakthrough in 2007 with the crystallization of the β2-adrenergic receptor (ADBR2) [4,5]. So far, 24 receptors have been structurally determined by x-ray protein crystallization [3–28].
In this work, we assess how far this subset of 7TM domain structures can give insight into the evolutionary relationship of the full set of known human GPCRs, with a special focus on rhodopsin-like GPCRs. The phylogeny of GPCRs was first studied based on a set of known receptor sequences [29]. The commonly used systematical classification system, the GRAFS-system by Fredriksson et al. [30], was developed from the data of the human genome project. In this system, GPCRs form the five distinct families of glutamate (G), rhodopsin-like (R), adhesion (A), frizzled/taste (F) and secretin (S) receptors. 90% of the receptors are found in the rhodopsin-like family, which is subdivided into the classes α–δ. However, GPCR phylogeny still is a highly debated field, as the receptors exhibit signs of phylogenetic mosaicism [31], and have developed under multiple overlapping evolutionary pathways [32]. 20 of the crystallized GPCRs belong to the rhodopsin-like class [3–22,26,27]. Furthermore, four receptor structures of non-rhodopsin GPCRs are available as structural models, of which two receptors are from the secretin family [23,25], one receptor from the frizzled/taste family [24], and one from the metabotropic glutamate receptor family [28]. Due to this availability of both rhodopsin-like and non-rhodopsin-like GPCR structures, we are now able to revise this classification based on structural information. While evolution of proteins so far is mostly discussed on a sequence comparison basis, paleontology deduces evolutionary sequences from comparison of structural features of species. With the now available set of GPCR structures, we here attempt to assess the evolution of GPCRs by combining both of these two possible approaches. We used the three non-rhodopsin GPCR structures to find the branching point between non-rhodopsin and rhodopsin-like GPCRs.
A better understanding of the underlying phylogeny can help in the deorphanization of GPCRs, whose ligands are still unknown, and therefore propose to be new druggable targets for pharmaceutical research [33]. Furthermore, we can detect new relations between GPCRs with this understanding, and thus find new insight into their pharmacological properties. The available set of GPCR structures now enables us to make an exact determination of sequence parts belonging to different structural subdomains, which is more precise than results from secondary structure prediction. By restricting our analysis to the 7TM domain, and ligand binding residues, we can gain precise information about the evolutionary development of these structural subfeatures. Especially for comparing ligand-binding residues, it is essential that we now can make a direct match and comparison of residues forming these sites in different structures, and thereby overcome biases introduced by sequence alignment algorithms. We first performed this analysis with the sequences of crystallized GPCRs transmembrane helices together with their loop domains. Here we found that the connection point between rhodopsin-like and non-rhodopsin-like GPCRs is found between peptide-binding receptors and small molecule receptors, close to the position of rhodopsin. We analyzed how this tree-like evolution corresponds to the changes of structural features found within the existing GPCR crystal structures. The development of small molecule receptors seems to have started with a hydrophobic ligand-recognizing receptor, most likely an opsin precursor, by an interplay of small molecule ligands and the opsin extracellular loop 2 (el2). Last, in an analysis of ligand-binding and sodium-binding [34,35] cavity forming residues, opioid and biogenic amine receptors exhibit signs of homoplasy, possibly explaining the affinity of opioid receptors for opiates despite being peptide receptors.
Materials and Methods
Phylogenetic analysis of GPCR structures
For the structural analysis, we considered the human sequences of a set of 24 crystallized GPCRs [3–28]. In case that several crystal structures were available for one receptor, we chose the one with highest available resolution for data analysis [respective PDB IDs: rhodopsin (OPSD): 1U19 [36]; β2 adrenergic receptor (ADBR2): 2RH1 [37]; β1 adrenergic receptor (ADBR1): 2YCW [6]; A2A adenosine receptor (AA2AR): 4EIY [20]; D3 dopamine receptor (D3DR): 3PBL [8]; H1 histamine receptor (H1HR): 3RZE [10]; (S1PR1): 3V2Y [11]; chemokine receptor CXCR4 (CXCR4): 3OE0 [9]; chemokine receptor CCR5 (CCR5): 4MBS [26]; δ-opioid receptor (OPRD): 4EJ4 [17]; μ-opioid (OPRM): 4DKL [14]; κ-opioid (OPRK): 4DJH [15]; N/OFQ opioid (OPRX): 4EA3 [16]; M2 muscarinic receptor (ACM2): 3UON [12]; M3 muscarinic receptor (ACM3): 4DAJ [13],; neurotensin receptor 1 (NTR1): 4GRV [18]; protease-activated receptor 1 (PAR1): 3VW7 [19]; serotonine receptor 1B (5HT1B): 4IAR [21]; serotonine receptor 2B (5HT2B): 4IB4 [21]; purine receptor P2Y12 (P2Y12): 2PXZ [27]; glucagon receptor (GLR): 4L6R [23]; smoothened receptor (SMO): 4JKV [24]; corticotropin-releasing factor receptor 1 (CRFR1): 4K5Y [25]; metabotropic glutamate receptor 1 (GRM1): 4OR2 [28]]. All structures were loaded into PyMol [38], a standard protein structure visualizer, for structural analysis and structure/sequence comparison. In case that a GPCR structure was obtained from a non-human or mutation-stabilized receptor, the respective native human sequence was used for the following sequence comparisons. Structure/sequence comparison was performed directly in PyMol. Rhodopsin-like GPCR helices were aligned according to the most conserved residues from Ballesteros-Weinstein numbering [39]. If this was not applicable (helix V in S1PR1 and P2Y12), we additionally focused on overlaying the sequences to keep Tyr5.58 conserved [40,41]. For the rhodopsin- and non-rhodopsin-like GPCR alignment, we focused on finding a best fit for the full heptahelical helix Cα atom arrangement with the “align” function of PyMol, followed by an individual fit of each transmembrane helix. To additionally take into account the loop domains in this evaluation, we extracted the respective sequences of extra- and intracellular loops 1–3, and aligned each loop of the 24 receptors separately. The resulting aligned loops were then introduced into the structural alignment between the respective helices. The resulting alignment is attached in FASTA format as S1 File. Phylogenetic trees were calculated with PhyML 3.0 [42], using JTT as substitution model. PhyML is a tool for the fast calculation of computationally expensive maximum likelihood [43] phylogenetic trees, and allows different choices of substitution models and tree rearrangement methods (Nearest Neighbor Interchange [NNI] and Subtree Pruning and Regrafting [SPR]). Dendroscope3 [44] was used for tree visualization. Dendroscope is a program for working with rooted phylogenetic trees and networks. It provides a number of methods for drawing and comparing rooted phylogenetic networks, and for computing them from rooted trees. The respective network analysis and visualization was performed using SplitsTree4 [45], using maximum likelihood distances and JTT [46] as substitution model to calculate NeighborNet networks. NeighborNet is a distance-based method to construct circular splits networks [45]. In short, a splits graph is an assembly of several possible evolutionary trees for a given sequence set. Thus it can display multiple possible evolutionary relationships at once, which is necessary to resolve signals of lateral gene transfer, recombination events, mosaicism, and homologous evolution [45]. In contrast to this, the calculation of a maximum likelihood tree only gives the tree-like relationship that is most probable under the assumed model of sequence evolution. If the input distances from the sequence set correspond to a perfect phylogeny, then the network will only contain the splits (“branches”) of the most probable tree, and network and tree analysis will give the same result. SplitsTree4 [45] is an interactive and comprehensive tool for calculating different types of phylogenetic networks from sequences, distances, and trees. We chose the JTT model, as it was the only method implemented in SplitsTree4 capable to calculate maximum likelihood NeighborNets across the full range of sequence lengths we employed. For ligand binding pocket sequences, we calculated a network with uncorrected P distances, as some of the sequences were too dissimilar to compute meaningful maximum likelihood distances. Evolutionary Trace (ET) calculations were performed with the Baylor code [47,48] using the BLOSUM62 matrix [49] and default parameters. ET assesses the evolutionary relationship of protein sequences by calculating a sequence similarity-based phylogenic tree. Secondary structure predictions were performed with Psipred [50,51]. Pispred takes the profile output of PSI-BLAST [52] and takes a consensus secondary structure prediction from four independently trained sets of neural networks. All molecular figures were prepared with PyMol [38].
Phylogenetic analysis of full human GPCR repertoire
For the full non-rhodopsin GPCR set and the rhodopsin-like GPCR set, GPCR amino acid sequences were obtained from the Uniprot database [53] and are named accordingly. From this set of 828 reviewed Uniprot GPCR sequence entries (status of March 21st 2012), we removed the sequences of GPR98, GP112, GP179, CELR1, CELR2, and CELR3, as their length (>1500 amino acids) prolonged the phylogenetic calculations to an unreasonable extend (no completion after 8 days). The sequence set used for alignment and phylogenetic analysis thus contains 822 entries. Sequence alignments were performed using MUSCLE 3.8 [54] with default parameters. MUSCLE is a fast and efficient algorithm for multiple sequence alignment procedures, which includes fast distance estimation using k-mer counting, progressive alignment using a profile function called the “log-expectation” score, and refinement using tree-dependent restricted partitioning. Phylogenetic analysis was performed with PhyML 3.0 [42]. A PhyML analysis of the full GPCR set failed, probably due to too much sequence variation. We therefore developed a hierarchical approach to determine the best fitting crystal structures for 7TM sequence determination. For this, we added the sequences of the 24 structurally known GPCRs [3–17,19,21–25,27,28,38,55] in their full length sequence to the set. Structural analysis for 7TM determination was performed with PyMol [38]. After an initial sequence alignment of the full set with MUSCLE, we constructed an initial Neighbor Joining tree using the BLOSUM62 matrix (calculations were done with Jalview [56,57], a graphical user interface for sequence comparison). We then extracted the leaf with the longest distance from the leaves containing the crystal structure sequences, and repeated sequence alignment and leaf extraction. We performed this approach iteratively, extracting seven leaves (groups 1–7), until the GPCR structural sequences were distributed over the tree of the remaining sequences. The remaining leaves were separated so that each of them contains a minimum of one GPCR structure sequence. Leftover sequences were collected in additional groups. A detailed list of the members of each group is given in S2 File. To all sequence set groups, the GPCR structure sequence set was added. All groups were then separately subjected to a MUSCLE sequence alignment and Neighbor Joining tree construction. For each group, the GPCR structure sequence, which was closest to the majority of sequences, was chosen for the determination of the 7TM helices. In case of ambiguity, the group was iteratively subdivided, realigned, and appearing leaves extracted along with their closest structurally available receptor sequence. The detailed group/structure combinations can be found in S2 File. The 7TM positions of the respective set were then determined from the respective template GPCR structure, and the sequences cut down to the presumed 7TM domain (i.e. transmembrane helices I-VII together with interconnecting loops). All 7TM sequence groups were then reunited into two large sequence sets: the resulting groups 1 to 6b, 8a(c), and vomeronasal receptors in 9a were found to be non-rhodopsin-like receptors, and thus merged into the non-rhodopsin group. The remaining groups (6c to 11c) were merged into the rhodopsin-like group. To find the connecting node between rhodopsin-like and non-rhodopsin-like receptors, the 7TM domain sequences of the four crystallized non-rhodopsin-like GPCRs were added to the rhodopsin-like GPCR set, and the 7TM domain sequences of the rhodopsin-like GPCRs to the non-rhodopsin-like GPCR set, vice versa. With these sets, one final MUSCLE alignment each was performed followed by a PhyML maximum likelihood analysis. The maximum likelihood analysis was performed with an initial Neighbor Joining tree, and JTT as amino acid substitution model. Tree topology, branch length, and rate parameters were optimized. Topology optimization was performed starting with a NNI, or an SPR tree topology search, respectively, followed by a second optimization with the respective other method. To assess the robustness of the resulting tree, we carried out a bootstrap analysis with 528 replicas. The trees can be found in Newick format in S3 and S4 Files, and the respective alignments in FASTA format in S5 and S6 Files. We here display the trees with the highest likelihood from both optimization runs. In the rhodopsin-like GPCR tree, we removed SMO, GP141, GP143, and GP148 from display, as they appeared unreasonably far from the remaining sequences, and thus seem not to be correctly placed within the tree. For finding the connection to the non-rhodopsin GPCRs, we therefore relied on the respective positions of GRM1, GLR, and CRFR1.
Results and Discussion
Phylogenetic analysis of crystal structure subset of human GPCRs
Fig 1 shows a maximum likelihood tree (A) and a maximum likelihood distance-based NeighborNet [58] (B) of the 7TM domain of the 24 crystallized receptors. The sequences of the 7TM helices were aligned according to their structural best fit, and the connecting loops were subjected to a sequence alignment with MUSCLE [54]. We added the helix-connecting loops to the 7TM helices, as a focus on solely the helices would introduce a gap into the protein sequence, and thus create artifacts in the calculated evolutionary trees. The overall bootstrap values of the tree are reasonably high (>400) and show a good overall robustness of the tree, with the exception of the 5HT1B (249) and AA2AR / S1PR1 branching nodes (114). The tree analysis shows a separation of rhodopsin-like GPCRs into small molecule receptors, i.e. amine receptors, AA2AR, and S1PR1, on one side, and peptide/purine receptors on the other side. Rhodopsin (OPSD) branches off between both subgroups. The connection with the non-rhodopsin-like GPCRs is formed in the middle between both branches, too, and is positioned close to rhodopsin and neurotensin receptor 1 (NTR1). With a bootstrap value of 446 out of 1000, we hold this connection point to be reasonably robust. The relative positions of the rhodopsin-like GPCRs are in qualitative agreement with the earlier analysis of Fredriksson et al. [30], while the connection node with non-rhodopsin-like GPCRs is at a previously not reported position. To counter-check the robustness of the tree topology, we performed a NeighborNet analysis. The network and the tree analysis both show a good topological agreement, indicating a general tree-like evolution signal. However, the network exhibits large meshes around the connection point of non-rhodopsin-like and rhodopsin-like GPCRs, indicating a certain ambiguity of their connection node. Furthermore, NTR1 is found at a position with an equal distance towards both the muscarinic acetylcholine receptor 2 (ACM2) and the chemokine receptor CXCR4, and further away from rhodopsin than seen in the tree analysis. S1PR1 and AA2AR, too, differ in their network positions from the ones in the tree analysis. We assume that these differences derive from the relatively small number of protein sequences used in the alignment.
To further evaluate the robustness of our maximum likelihood-based tree, we calculated a Evolutionary Trace (ET) sequence similarity tree [47,48] from our crystal structure-based alignment. The ET method was already successfully applied to find evolutionary important amino acids in biogenic amine receptors [59,60]. Fig 1C shows the resulting sequence similarity-based tree. The ET tree topology is in good agreement with our maximum likelihood tree in Fig 1A: non-rhodopsin receptors, biogenic amine receptors and peptide receptors are well separated, and rhodopsin / NTR1 are found between the three groups. A difference is found for the placement of S1PR1, which is now close to rhodopsin / NTR1. Due to this nice overall agreement of our maximum likelihood tree and the ET tree, we hold the observed subgroup connection and our tree topology to be robust.
Based on these results, it seems that in the subset of crystallized rhodopsin-like GPCRs, two main evolution branches exist: peptide-binding GPCRs and small molecule binding GPCRs. The development observed in the subset implies that small molecule binders seem to have evolved via hydrophobic ligand-binding GPCR precursors. The hydrophobic ligand binders themselves later developed into opsines and sphingosine receptors. An exception from this rule is the purine receptor P2Y12, which seems to have developed out of peptide receptors, too. This is in agreement with the observation that the P2Y12 crystal structure is closer to the structure of the protease-activated receptor PAR1 receptor than to ADRB2 [27]. However, as stated above, the small structure-based sequence set seems to contain artificial positions of NTR1, AA2AR, and S1PR1. To remove these artifacts, we extended our results from crystal structures to the full scope of known human GPCR protein sequences. For this, we used the available crystal structures as templates to determine the 7TM motifs of the GPCRs without available structural model.
We initially set out to assess the evolution of the full scope of the human GPCR repertoire with our structure-based approach. Our analysis reveals the same subgroups like observed by Fredriksson et al. [30], while it additionally includes vomeronasal receptors and several orphan receptor groups. However, as S1 Fig shows, while some subfamilies seem to be well resolved (glutamate receptors, frizzled/smoothened receptors, secretin receptors, taste receptors type 2, vomeronasal receptors, adhesion receptors), others lack a clear separation from the tree basis (orphan families) or are inexplicably separated over different nodes (taste receptors type I, EGF-like receptors). Our analysis thus cannot give a clear picture of the evolution of non-rhodopsin-like receptors yet. We assume that the currently available four non-rhodopsin GPCR structures do not sufficiently cover the full sequence range for a phylogenetic analysis. In the following, we therefore here focus on the analysis of the rhodopsin-like GPCR family of Fredriksson et al. [30]
Phylogenetic analysis of structure-based alignment reveals new ordering of human rhodopsin-like GPCR
To extend our crystal structure-based phylogenetic tree presented in Fig 1A, and remove artifacts from a too small number of sequences, we expand our investigation to the full scope of human rhodopsin-like GPCR sequences. As can be seen in Fig 2A, the newly calculated tree topology is in good agreement with the crystal structure-based tree in Fig 1A. It shows three major branches for the rhodopsin-like GPCR: olfactory receptors, small molecule binders, and peptide binders. The two different mammalian olfactory receptor classes (termed I and II) [61] are well resolved. Surprisingly, they are found apart from the other small molecule binding GPCRs, suggesting that they branched off at an early state of development of rhodopsin-like GPCRs [62,63]. Though no olfactory GPCR structure has been solved yet, we are convinced that our suggested position of this family is correct, as we used the rhodopsin structure to assign helix positions, which proved to be a good template for the homology modeling of olfactory receptors [64]. Furthermore, a new crystal structure of opsin in an active state bound to a lipid molecule suggest a close structural connection between opsin and olfactory receptors [65]. While the general topology of our tree is in overall agreement with the one from Fredriksson et al. [30], the position of the branching point between olfactory receptors, non-rhodopsin GPCRs, and the remaining rhodopsin-like GPCRs is a new finding and the most pronounced difference to this classification.
Fig 2B shows the new tree resolved by ligand properties: The small molecule binding receptors branching off closest to the common node with non-rhodopsin GPCRs are hydrophobic ligand binders, here sphingosine receptors and opsins, together with adenosine receptors. This suggest that their ancestors form the earliest small molecule binding GPCRs. Peptide receptors can be found before (prostaglandin & prostacyclin receptors) and after (e.g. opioid receptors, chemokine receptors) the branching nodes of the hydrophobic ligand receptors. Consequently, the ancestral rhodopsin-like GPCRs seem to have been peptide receptors, which is in agreement with the findings of Pele et al. [32]. This means that while opioid and chemokine receptors retained the property to bind and recognize peptide ligands, as most non-rhodopsin GPCRs do, some structural change must have led to the development of small molecule binding receptors, and this change must have happened with the development of an opsin / sphingosin receptor precursor. A similar structural change must have happened twice, as purine / leukotriene / free fatty acid receptors seem to have evolved out of a protease-activated receptor precursor. This suggests a possible easy interchangeability of peptide and purine binding, which is in agreement with the assumption that GPCRs have evolved from cAMP receptors [63], cAMP receptors essentially are purine receptors, too, but are absent in the human genome. In addition, the ligand property-focussed tree in Fig 2B allows for a tentative characterization of ligand properties of orphan receptors present in the analysis. For example, GPR161, which was recently found to be a new drug target for breast cancer treatment [66], is present in the orphan family II / opsin branch. Therefore, it most likely is recognizing hydrophobic small molecule ligands. However, we have to state clearly that the tree analysis of this large GPCR subset shows a high uncertainty, especially at the basis nodes (bootstrap values of 0 to 50 with 528 replicas). Therefore, we investigate in the following how far structural details of the available crystal structures support this tree.
An opsin precursor forms the loophole in development from peptide to small molecule GPCRs
We first evaluate if ligand binding in GPCRs exhibits conserved features, which we can relate to the protein structure. Fig 3 shows the volume occupied by peptide ligands in peptide-binding GPCRs (PDB IDs 3OE0 [9] and 4GRV [18]) and by organic ligands in small molecule-binding GPCRs (PDB IDs 2YCW [6], 3EML [7], 3PBL [8], 3RZE [10], 3V2Y [11], 3UON [12], 4DAJ [13], 4IAR [21], 4IB4 [21], 4NTY [27], 1U19 [36], and 2RH1 [37]) after superposition of their 7TM helices. Fig 3A displays the position of these volumes in respect to the 7TM bundle. Both peptide ligands and small molecule ligands bind to the targeting receptors between helices II-VII. While peptide ligands bind around the height of the extracellular end of the helical bundle, small molecule ligands bind in a volume within the extracellular half of the helical bundle. Both volumes share a common region, which interestingly is found in the volume occupied by the rhodopsin el2 (see Fig 3B). We cannot tell if this contact volume is important for receptor activation or protein stability, as active state or agonist-bound GPCR structures only exist for four receptors so far [67–72], which is not sufficient for such an analysis. In the following, we examine how the different receptor structures interact with these contact domains. For this, we set the respective structures into the evolutionary order proposed by our rhodopsin-like family tree in Fig 2.
Fig 4 shows the development of GPCR structures into peptide and small molecule receptors proposed by our tree analysis. As stated above, cAMP receptors are the common ancestors of GPCRs (with the exception of glutamate receptors) [63]. From this, peptide-binding GPCRs seem to have evolved first [32]. The peptide binding volume, small organic molecule ligands, and el2 exhibit an interesting relationship along the presumed evolutionary pathway: in all known non-rhodopsin GPCR structures [23–25,28], which so far all represent peptide binders, el2 forms a β–hairpin structure, which surrounds the peptide binding volume of rhodopsin-like GPCRs. We here classify the crystallized glutamate receptor GRM1 as a peptide receptor, too, as glutamate binds within an extracellular protein domain, which transfers its conformational changes into the transmembrane domain. Thus, a protein/protein contact activates the transmembrane domain. In rhodopsin-like peptide receptor structures with bound peptide ligand [9,18], el2 is found in form of a β-hairpin, too, and also flanks the peptide binding volume. This common arrangement suggests a retention of both peptide ligand binding features and el2 loop form and position during peptide binding GPCR evolution. Confirming our assumption from the previous section, both rhodopsin-like and non-rhodopsin peptide binders must have evolved from a common precursor. In rhodopsin, el2 goes right through the peptide-binding domain, and is held there by steric constraints from the N terminus. Retinal binds underneath the contact domain at a position close to the protein center. The length of the rhodopsin el2 is in good agreement with the one observed in peptide binding GPCRs, and its position equals the position of the common peptide / small molecule ligand volume (see Fig 3B). In the sphingosine receptor structure (see Fig 3C), el2 is held in this dual binding volume due to contacts with the N terminus as well. However, el2 here is disordered and has partially retracted from the binding volume in comparison to rhodopsin. Like a compensation for this, the ligand has advanced into the peptide-binding domain, performing contacts there. In polar ligand and biogenic amine GPCRs, el2 is disordered or helical and has retracted from the peptide-binding domain. Small molecule ligands bind in the peptide contact volume at the position of el2 in rhodopsin, and below it.
Combining these findings with our phylogenic tree in Fig 2, we judge that the ancestral rhodopsin-like GPCRs initially possessed an el2 in the form of a β-hairpin. Peptide binders and rhodopsin retained el2 in this form. The first small molecule-binding GPCR seems to have been an opsin precursor. As can be seen in Fig 3B, both N terminus and el2 seem to substitute a bound orthosteric peptide ligand. This finding is particularly interesting, as ligand-free opsin at low pH adopts a conformation, which closely resembles the active meta II state [73] and is capable to bind and activate the rhodopsin-specific G-protein transducin [74]. In agreement with this, the position and conformation of el2 in the meta II—state rhodopsin [72,75], in retinal free [76] and G-protein mimic-bound opsin [74] is very similar, while the arrangement in inactive dark-state rhodopsin [36] differs from them (see S2 Fig). Furthermore, el2 structural changes are associated with formation of the meta II state [77]. Retinal itself does not bind within this peptide binding volume, but below it. This suggests that retinal is not a conventional small molecule ligand. Instead, it seems that the orthosteric binding site of the chromophore has developed from an allosteric binding site in an initially peptide-binding receptor. This peptide binder must have developed the capability to bind retinal covalently via the appearance of Lys296. This model is in agreement with the findings of Feuda et al. [78], and negates a direct evolutionary connection between microbial and animal rhodopsins [79]. In the sphingosine receptor, el2 has partially retracted from the peptide contact domain. Instead, the sphingosine ligand extends from the new small molecule-binding site into the peptide-binding domain, forming polar contacts there, and partially substitutes el2 (compare Fig 3C). This trend continues in biogenic amine receptors and the A2A adenosine receptor, where el2 has completely retracted from the contact domain. As a compensation, small molecule ligands form contacts with the receptor in the new orthosteric small ligand binding site, and within the bottom of the peptide contact domain. We here need to point out that while we deduce this structural evolution from current receptors, the real development happened within ancestors of these receptors. Therefore, they all contain additional changes, e.g. the loss of the el1-el2 disulphide bond in S1PR1 [11], which we do not discuss here. Purine receptors seem to have undergone a similar, separate development from peptide receptors to small molecule receptors. However, as stated above, this might be a development back to cAMP receptors [63].
Concerning olfactory receptors, we currently cannot judge if their origin lies within an ancestral opsin as well, or if they followed a separate evolutionary pathway. On one hand, a development of olfactory receptors out of an opsin ancestor is a valid hypothesis: as stated before, rhodopsin is a suitable structural template for modeling olfactory receptors [64,65], which implies a high structural similarity. Both receptor subclasses bind hydrophobic ligands within their 7TM core. As we show in S3 Fig, el2 exhibits a quite conserved sequence and length (ca. 35 amino acids) in olfactory receptors, which is seven amino acids more than in the rhodopsin el2. There is no obvious sequence similarity between el2 in olfactory receptors and in rhodopsin. However, for the example of 20 randomly chosen olfactory receptors, a secondary structure prediction via Psipred [51] predicts predominantly sheet features for their el2 sequences. We therefore assume that like in rhodopsin, el2 in olfactory receptors forms a β-hairpin. On the other hand, several arguments exist for a separate evolution of small molecule binders and olfactory receptors: our phylogenetic analysis shows a maximum likelihood distance between opsins and olfactory receptors, which is considerably longer than the opsin / small molecule binder distance. Though they share the most conserved helical amino acid residues of rhodopsin-like GPCRs [39], olfactory receptors definitively form a GPCR subfamily of their own at least in humans. Furthermore, olfactory receptors exhibit peculiar ligand binding properties: one receptor can be activated by several ligands, which in turn induce different levels of signaling [80]. Last, it is possible that odorant recognition is not achieved via olfactory receptors directly, but mediated by odorant-binding proteins [81]. These proteins can absorb odorants, and then bind to the extracellular domain of olfactory receptors [82], turning them into peptide binders. Only a crystal structure of an olfactory receptor will give a definitive answer on the relationship between opsins and olfactory receptors.
Summarizing these results based on structural comparison of el2, the structural features within the available GPCR structures are in line with both our calculated maximum likelihood trees for the crystal structure comparison and for rhodopsin-like GPCRs. Small molecule binding ligands in GPCRs seem to have developed as allosteric binders via an opsin ancestor as key intermediate, with el2 substituting a bound peptide ligand and thus probably stabilizing it. We are aware that the current number of 24 GPCR structures is still too small to make a final statement on the evolution of el2 structure. However, we see the current set of structures to be sufficiently consistent with our theory on el2 evolution to support our model. Along the proposed pathway, small molecule ligands then seem to have substituted el2, which retracted from the binding site and lost the β-hairpin conformation. Rhodopsin can therefore be seen as a good model system to study the structures and activation mechanisms of both peptide and small molecule binding GPCRs [83]. This is in so far interesting, as it would render the sense of light detection to be the origin of development of the small molecule-based hormonal response system in our central nervous system (CNS). This coincides with our visual system being an actual part of our CNS instead of being connected to it via a peripheral nerve [84]. It is possible, that during evolution, light-sensing opsin ancestors developed in nervous tissues, which then led to the development of small molecule receptors in the CNS. A good candidate for such an intermediate is a melanopsin precursor, as melanopsin mediates a large variety of physiological responses to light, and from its sequence is closer related to invertebrate than to vertebrate rhodopsins [85].
We here have to state that all so far listed GPCR crystal structures come from mammalian or avian organisms. They therefore represent GPCR structures at a contemporary state, and thus only allow a limited assessment of GPCR evolution due to missing “fossil” GPCRs. However, as the crystal structure of squid (Todarodes pacificus) rhodopsin is available (PDB ID 2Z73) [86], we can verify how well preserved the position of el2 is by a comparison with the structure of bovine rhodopsin [3,36]. Our comparison in S4 Fig shows that the position in both structures is identical, despite the obvious difference in el2 sequences. Consequently, we can make the assumption that the position of el2 is highly conserved in rhodopsins, and needs to have been constant since the last common ancestor of mammals and squids. It thus has to be an evolutionary old feature across the animal kingdom. This supports our hypothesis that the el2 position in rhodopsin represents an evolutionary old transition between peptide and small molecule receptors, which still can be seen in present day structures.
Opioid receptors exhibit small ligand receptor-like binding pockets and sodium-binding sites
In the following, we perform a further analysis on the development of the amino acids flanking the ligand binding pockets in our investigated set of crystallized GPCRs. We found that a NeighborNet analysis with maximum likelihood distances resulted in a network with conflicting groupings, observable by widening of the internal faces and missing singular edges for the proteins (see S5 Fig). To get a better picture of the absolute similarity of the binding sites, we performed an uncorrected P distance analysis. This compares the binding sites by counting matching amino acids having the same sequence (and herein therefore spatial) position. A similar analysis was performed on the residues of the retinal binding pocket [87], which we extend now to the set of contact residues for all ligands in the available structures. Fig 5 shows the resulting NeighborNet. Four dominant categories of ligand binding cavities appear: peptide ligand receptors, containing the chemokine receptors and non-rhodopsin GPCRs; hydrophobic ligand receptors, containing the opsins and sphingosine receptors, hydrophilic ligand receptors, containing adenosine receptors; and ammonium ligand receptors, containing muscarinic acetylcholine receptors, biogenic amine receptors, and opioid receptors. Please note that in this framework, we categorize metabotropic glutamate receptors as peptide receptors (see above). Surprisingly, opioid receptors are found in a group together with the biogenic amine receptors, but not with peptide ligand receptors, sharing the major contact residue Asp3.32 with acetylcholine and biogenic amine receptors. However, this aspartate does not seem to be the major source of this similarity, as its removal from the uncorrected P distance analysis does not change the overall topology of the network (see S6 Fig, left). The same holds true to the glutamate positions at positions 3.28 or 3.29, respectively, in hydrophobic ligand receptors (see S6 Fig, right), which form a salt bridge with the respective bound ligand. Possibly, the placing of glutamates at position 3.28 or 3.29, respectively, was an important step in their evolutionary development to become small molecule binders.
We furthermore analyzed the evolutionary relationship of amino acids forming the central sodium binding site in GPCRs [88]. This site is known from AA2AR [20,34], the δ-opioid receptor (OPRD) [35], ADRB1 [89], and PAR1 [19] as an allosteric regulator for agonist binding. Residues forming contacts with the sodium ion were shown to be highly conserved [88]. However, the first coordination shell of the ion is partially formed by protein-bound water molecules. Such protein-internal water molecules are highly important for the structure and function of heptahelical transmembrane proteins [90–92], and show a conserved pattern of positions in GPCRs, as well [93]. We therefore extended the investigation to all residues being in contact with the sodium ion and water molecules in this central binding site. Fig 6 displays the resulting uncorrected P distance network: Surprisingly, the network shares the same general features like the one of the ligand binding site. As both analyzed networks share eight amino acid positions, we removed these residues from the ligand binding site comparison, and re-analyzed the resulting sequence set. S7 Fig shows the resulting NeighborNet: the overall network topology does not change, as well. The high similarity of the network topology for both ligand binding site and sodium/water binding site suggests a common evolutionary path, which is in good agreement with their functional coupling for small molecule binding [34]. The reason for the special placement of opioid receptors in both networks remains to be elucidated: from the results of our data, it seems that opioid receptors may not be exclusively primed to bind endorphins, i.e. peptide ligands, but to have the potential to interact with small molecule ligands, too.
Summing up, our network analysis proposes a coupled evolution of the GPCR ligand binding sites together with the central sodium/water binding sites. It seems that though opioid receptors developed as peptide receptors, both their ligand binding site and central sodium/water binding site are similar to the ones of biogenic amine receptors, suggesting a high degree of homoplasy between both subclasses. Though the primary ligands in opioid signaling in the brain are endorphins, i.e. peptides, the binding site might be primed for the interaction with small molecule ligands. In agreement with this, opioid and β-adrenergic receptor ligands exhibit common pharmacophores [94]. From our data alone, we cannot determine what is the evolutionary driving force for this development. A co-evolution of opiate receptors together with opiates in humans seems unlikely, as even evolutionarily distant organisms like zebrafish show responses to opiates [95]. Another possibility to explain this similarity would be the direct binding of biogenic amines and their derivatives to opioid receptors as a regulatory element in the central nervous system. Last, it might be a coincidental case of homoplasy due to a common usage of a tyrosine-based pharmacophore [94]. In any case, our own opioid receptors seem to be evolutionary primed to small ligand and thus opiate binding, which may pose a physical explanation to the strong action of such drugs on the central nervous system [14,96].
Conclusions
We here present the first evolutionary analysis of the human GPCR repertoire based on both sequences and structural features of the now available set of 24 GPCR structures, with focus on rhodopsin-like GPCRs. We discriminate them into the subclasses of peptide binding GPCRs, small molecule ligand binding GPCRs, and olfactory receptors. Concerning ligand binding, we propose that the critical step in the evolutionary development of small molecule ligand receptors lies in an opsin ancestor. Crucial therein is the interplay of small molecule ligands and el2 to substitute a bound peptide ligand. The presented evolutionary tree seems to be closely connected to the development of the human central nervous system. In an evolutionary analysis of ligand binding site and central sodium/water binding cavity forming residues, we show that the evolution of these sites is closely coupled. Last, we found that opioid receptors and muscarinic acetylcholine / biogenic amine receptors exhibit homoplasy, possibly explaining the strong affinity of opiates to opioid receptors, despite them being peptide receptors. We hope that these findings will contribute to the overall understanding of the GPCR family, and to neurophysiological, pharmaceutical & biophysical research as well.
Supporting Information
Acknowledgments
The authors would like to thank Raymond C. Stevens for the inspiration for this project and helpful discussion, and Vsevolod Katritch, Lian Gelis, and Carola Schmied for additional helpful discussion. Calculations were performed on the PICB high-performance computing cluster. The Evolutionary Trace Baylor code was kindly provided by Angela Wilkins, Oliver Lichtarge, and Baylor College of Medicine.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
SW is funded by a Chinese Academy of Sciences Fellowship for Young International Scientists. This work was supported by a grant from the National Natural Science Foundation of China (NSFC) Research Fund for International Young Scientists (grant No. 31250110070). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Pierce KL, Premont RT, Lefkowitz RJ (2002) Signalling: Seven-transmembrane receptors. Nat Rev Mol Cell Biol 3: 639–650. [DOI] [PubMed] [Google Scholar]
- 2. Lundstrom K (2006) Latest development in drug discovery on G protein-coupled receptors. Curr Protein Pept Sc 7: 465–470. [DOI] [PubMed] [Google Scholar]
- 3. Palczewski K, Kumasaka T, Hori T, Behnke CA, Motoshima H, et al. (2000) Crystal structure of rhodopsin: AG protein-coupled receptor. Science 289: 739–745. [DOI] [PubMed] [Google Scholar]
- 4. Cherezov V, Rosenbaum DM, Hanson MA, Rasmussen SGF, Thian FS, et al. (2007) High-Resolution Crystal Structure of an Engineered Human β2-Adrenergic G Protein Coupled Receptor. Science 318: 1258–1265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Rasmussen SGF, Choi H-J, Rosenbaum DM, Kobilka TS, Thian FS, et al. (2007) Crystal structure of the human β2 adrenergic G-protein-coupled receptor. Nature 450: 383–387. [DOI] [PubMed] [Google Scholar]
- 6. Warne T, Serrano-Vega MJ, Baker JG, Moukhametzianov R, Edwards PC, et al. (2008) Structure of a β1-adrenergic G-protein-coupled receptor. Nature 454: 486–491. 10.1038/nature07101 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Jaakola VP, Griffith MT, Hanson MA, Cherezov V, Chien EYT, et al. (2008) The 2.6 Angstrom Crystal Structure of a Human A2A Adenosine Receptor Bound to an Antagonist. Science 322: 1211–1217. 10.1126/science.1164772 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Chien EYT, Liu W, Zhao Q, Katritch V, Won Han G, et al. (2010) Structure of the Human Dopamine D3 Receptor in Complex with a D2/D3 Selective Antagonist. Science 330: 1091–1095. 10.1126/science.1197410 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Wu B, Chien EYT, Mol CD, Fenalti G, Liu W, et al. (2010) Structures of the CXCR4 Chemokine GPCR with Small-Molecule and Cyclic Peptide Antagonists. Science 330: 1066–1071. 10.1126/science.1194396 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Shimamura T, Shiroishi M, Weyand S, Tsujimoto H, Winter G, et al. (2011) Structure of the human histamine H1 receptor complex with doxepin. Nature 475: 65–70. 10.1038/nature10236 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Hanson MA, Roth CB, Jo E, Griffith MT, Scott FL, et al. (2012) Crystal Structure of a Lipid G Protein-Coupled Receptor. Science 335: 851–855. 10.1126/science.1215904 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Haga K, Kruse AC, Asada H, Yurugi-Kobayashi T, Shiroishi M, et al. (2012) Structure of the human M2 muscarinic acetylcholine receptor bound to an antagonist. Nature 482: 547–551. 10.1038/nature10753 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Kruse AC, Hu J, Pan AC, Arlow DH, Rosenbaum DM, et al. (2012) Structure and dynamics of the M3 muscarinic acetylcholine receptor. Nature 482: 552–556. 10.1038/nature10867 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Manglik A, Kruse AC, Kobilka TS, Thian FS, Mathiesen JM, et al. (2012) Crystal structure of the μ-opioid receptor bound to a morphinan antagonist. Nature 485: 321–326. 10.1038/nature10954 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Wu H, Wacker D, Mileni M, Katritch V, Han GW, et al. (2012) Structure of the human κ-opioid receptor in complex with JDTic. Nature 485: 327–332. 10.1038/nature10939 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Thompson AA, Liu W, Chun E, Katritch V, Wu H, et al. (2012) Structure of the nociceptin/orphanin FQ receptor in complex with a peptide mimetic. Nature 485: 395–399. 10.1038/nature11085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Granier S, Manglik A, Kruse AC, Kobilka TS, Thian FS, et al. (2012) Structure of the δ-opioid receptor bound to naltrindole. Nature 485: 400–404. 10.1038/nature11111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. White JF, Noinaj N, Shibata Y, Love J, Kloss B, et al. (2012) Structure of the agonist-bound neurotensin receptor. Nature 490: 508–513. 10.1038/nature11558 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Zhang C, Srinivasan Y, Arlow DH, Fung JJ, Palmer D, et al. (2012) High-resolution crystal structure of human protease-activated receptor 1. Science 492: 387–392. 10.1038/nature11701 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Liu W, Chun E, Thompson AA, Chubukov P, Xu F, et al. (2012) Structural Basis for Allosteric Regulation of GPCRs by Sodium Ions. Science 337: 232–236. 10.1126/science.1219218 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Wang C, Jiang Y, Ma J, Wu H, Wacker D, et al. (2013) Structural Basis for Molecular Recognition at Serotonin Receptors. Science 340: 610–614. 10.1126/science.1232807 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Wacker D, Wang C, Katritch V, Han GW, Huang XP, et al. (2013) Structural Features for Functional Selectivity at Serotonin Receptors. Science 340: 615–619. 10.1126/science.1232808 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Siu FY, He M, de Graaf C, Han GW, Yang D, et al. (2013) Structure of the human glucagon class B G-protein-coupled receptor. Nature 499: 444–449. 10.1038/nature12393 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Wang C, Wu H, Katritch V, Han GW, Huang X-P, et al. (2013) Structure of the human smoothened receptor bound to an antitumour agent. Nature 497: 338–343. 10.1038/nature12167 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Hollenstein K, Kean J, Bortolato A, Cheng RKY, Doré AS, et al. (2013) Structure of class B GPCR corticotropin-releasing factor receptor 1. Nature 499: 438–443. 10.1038/nature12357 [DOI] [PubMed] [Google Scholar]
- 26. Tan Q, Zhu Y, Li J, Chen Z, Han GW, et al. (2013) Structure of the CCR5 Chemokine Receptor-HIV Entry Inhibitor Maraviroc Complex. Science 341: 1387–1390. 10.1126/science.1241475 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Zhang K, Zhang J, Gao Z-G, Zhang D, Zhu L, et al. (2014) Structure of the human P2Y12 receptor in complex with an antithrombotic drug. Nature 509: 115–118. 10.1038/nature13083 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Wu H, Wang C, Gregory KJ, Han GW, Cho HP, et al. (2014) Structure of a Class C GPCR Metabotropic Glutamate Receptor 1 Bound to an Allosteric Modulator. Science 344: 58–64. 10.1126/science.1249489 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Kolakowski LF Jr (1994) GCRDb: a G-protein-coupled receptor database. Receptor Channel 2: 1–7. [PubMed] [Google Scholar]
- 30. Fredriksson R (2003) The gpcr in human genome form five families. Mol Pharmacol 63: 1256–1272. [DOI] [PubMed] [Google Scholar]
- 31. Allaby RG, Woodwark M (2007) Phylogenomic Analysis Reveals Extensive Phylogenetic Mosaicism in the Human GPCR Superfamily. Evol Bioinform Online 3: 357–370. [PMC free article] [PubMed] [Google Scholar]
- 32. Pele J, Abdi H, Moreau M, Thybert D, Chabbert M (2011) Multidimensional scaling reveals the main evolutionary pathways of class A G-protein-coupled receptors. Plos One 6: e19094 10.1371/journal.pone.0019094 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Chung S, Funakoshi T, Civelli O (2009) Orphan GPCR research. Brit J Pharmacol 153: S339–S346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Gutiérrez-de-Terán H, Massink A, Rodríguez D, Liu W, Han GW, et al. (2013) The Role of a Sodium Ion Binding Site in the Allosteric Modulation of the A2A Adenosine G Protein-Coupled Receptor. Structure 21: 2175–2185. 10.1016/j.str.2013.09.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Fenalti G, Giguere PM, Katritch V, Huang X- P, Thompson AA, et al. (2014) Molecular control of δ-opioid receptor signalling. Nature 506: 191–196. 10.1038/nature12944 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Okada T, Sugihara M, Bondar A-N, Elstner M, Entel P, et al. (2004) The Retinal Conformation and its Environment in Rhodopsin in Light of a New 2.2 Å Crystal Structure. J Mol Biol 342: 571–583. [DOI] [PubMed] [Google Scholar]
- 37. Rosenbaum DM, Cherezov V, Hanson MA, Rasmussen SGF, Thian FS, et al. (2007) GPCR Engineering Yields High-Resolution Structural Insights into 2-Adrenergic Receptor Function. Science 318: 1266–1273. [DOI] [PubMed] [Google Scholar]
- 38.Schrödinger, LLC (2010) The PyMOL Molecular Graphics System, Version 1.3r1.
- 39. Ballesteros J, Weinstein H (1995) Integrated methods for the construction of three dimensional models and computational probing of structure function relations in G protein-coupled receptors In: Sealfon S, Conn P, editors. San Diego, CA: Methods Neurosci, Vol. 25 pp. 366–428. [Google Scholar]
- 40. Goncalves JA, South K, Ahuja S, Zaitseva E, Opefi CA, et al. (2010) Highly conserved tyrosine stabilizes the active state of rhodopsin. Proc Natl Acad Sci USA 107: 19861–19866. 10.1073/pnas.1009405107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Elgeti M, Kazmin R, Heck M, Morizumi T, Ritter E, et al. (2011) Conserved Tyr223 5.58Plays Different Roles in the Activation and G-Protein Interaction of Rhodopsin. J Am Chem Soc 133: 7159–7165. 10.1021/ja200545n [DOI] [PubMed] [Google Scholar]
- 42. Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, et al. (2010) New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol 59: 307–321. 10.1093/sysbio/syq010 [DOI] [PubMed] [Google Scholar]
- 43. Kuhner MK, Felsenstein J (1994) Simulation Comparison of Phylogeny Algorithms Under Equal and Unequal Evolutionary Rates. Mol Biol Evol 11: 459–468. [DOI] [PubMed] [Google Scholar]
- 44. Huson DH, Scornavacca C (2012) Dendroscope 3: An Interactive Tool for Rooted Phylogenetic Trees and Networks. Syst Biol 61: 1061–1067. d 10.1093/sysbio/sys062 [DOI] [PubMed] [Google Scholar]
- 45. Huson DH, Bryant D (2006) Application of phylogenetic networks in evolutionary studies. Mol Biol Evol 23: 254–267. [DOI] [PubMed] [Google Scholar]
- 46. Jones DT, Taylor WR, Thornton JM (1992) The Rapid Generation of Mutation Data Matrices From Protein Sequences. Comput Appl Biosci 8: 275–282. [DOI] [PubMed] [Google Scholar]
- 47. Lichtarge O, Bourne HR, Cohen FE (1996) An evolutionary trace method defines binding surfaces common to protein families. J Mol Biol 257: 342–358. [DOI] [PubMed] [Google Scholar]
- 48. Mihalek I, Reš I, Lichtarge O (2004) A Family of Evolution–Entropy Hybrid Methods for Ranking Protein Residues by Importance. J Mol Biol 336: 1265–1282. [DOI] [PubMed] [Google Scholar]
- 49. Henikoff S, Henikoff JG (1992) Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci USA 89: 10915–10919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Jones DT (1999) Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol 292: 195–202. [DOI] [PubMed] [Google Scholar]
- 51. Bryson K, McGuffin LJ, Marsden RL, Ward JJ, Sodhi JS, et al. (2005) Protein structure prediction servers at University College London. Nucleic Acids Res 33: W36–W38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. The UniProt Consortium (2013) Activities at the Universal Protein Resource (UniProt). Nucleic Acids Res 42: D191–D198. 10.1093/nar/gkt1140 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Edgar R (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: 1792–1797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Jafurulla M, Chattopadhyay A (2012) Membrane Lipids in the Function of Serotonin and Adrenergic Receptors. CMC 20: 47–55. [PubMed] [Google Scholar]
- 56. Clamp M, Cuff J, Searle SM, Barton GJ (2004) The Jalview Java alignment editor. Bioinformatics 20: 426–427. [DOI] [PubMed] [Google Scholar]
- 57. Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ (2009) Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics 25: 1189–1191. 10.1093/bioinformatics/btp033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Bryant D (2003) Neighbor-Net: An Agglomerative Method for the Construction of Phylogenetic Networks. Mol Biol Evol 21: 255–265. [DOI] [PubMed] [Google Scholar]
- 59. Madabushi S, Gross AK, Philippi A, Meng EC, Wensel TG, et al. (2004) Evolutionary Trace of G Protein-coupled Receptors Reveals Clusters of Residues That Determine Global and Class-specific Functions. J Biol Chem 279: 8126–8132. [DOI] [PubMed] [Google Scholar]
- 60. Rodriguez GJ, Yao R, Lichtarge O, Wensel TG (2010) Evolution-guided discovery and recoding of allosteric pathway specificity determinants in psychoactive bioamine receptors. Proc Natl Acad Sci USA 107: 7787–7792. 10.1073/pnas.0914877107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Mombaerts P (2001) The human repertoire of odorant receptor genes and pseudogenes. Annu Rev Genomics Hum Genet 2: 493–510. [DOI] [PubMed] [Google Scholar]
- 62. Niimura Y (2009) On the origin and evolution of vertebrate olfactory receptor genes: comparative genome analysis among 23 chordate species. Genome Biol Evol 1: 34–44. 10.1093/gbe/evp003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Krishnan A, Almén MS, Fredriksson R, Schioth HB (2012) The Origin of GPCRs: Identification of Mammalian like Rhodopsin, Adhesion, Glutamate and Frizzled GPCRs in Fungi. Plos One 7: e29817 10.1371/journal.pone.0029817 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Gelis L, Wolf S, Hatt H, Neuhaus EM, Gerwert K (2012) Prediction of a ligand-binding niche within a human olfactory receptor by combining site-directed mutagenesis with dynamic homology modeling. Angew Chem Int Ed Engl 51: 1274–1278. 10.1002/anie.201103980 [DOI] [PubMed] [Google Scholar]
- 65. Park JH, Morizumi T, Li Y, Hong JE, Pai EF, et al. (2013) Opsin, a structural model for olfactory receptors? Angew Chem Int Ed Engl 52: 11021–11024. 10.1002/anie.201302374 [DOI] [PubMed] [Google Scholar]
- 66. Feigin ME, Xue B, Hammell MC, Muthuswamy SK (2014) G-protein-coupled receptor GPR161 is overexpressed in breast cancer and is a promoter of cell proliferation and invasion. Proc Natl Acad Sci USA 111: 4191–4196. 10.1073/pnas.1320239111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Rasmussen SGF, Choi H-J, Fung JJ, Pardon E, Casarosa P, et al. (2010) Structure of a nanobody-stabilized active state of the β2 adrenoceptor. Nature 469: 175–180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Rosenbaum DM, Zhang C, Lyons JA, Holl R, Aragao D, et al. (2010) Structure and function of an irreversible agonist-β2 adrenoceptor complex. Nature 469: 236–240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Rasmussen SGF, DeVree BT, Zou Y, Kruse AC, Chung KY, et al. (2011) Crystal structure of the β2 adrenergic receptor-Gs protein complex. Nature 477: 549–555. 10.1038/nature10361 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Lebon G, Warne T, Edwards PC, Bennett K, Langmead CJ, et al. (2011) Agonist-bound adenosine A2A receptor structures reveal common features of GPCR activation. Nature 474: 521–525. 10.1038/nature10136 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Warne T, Moukhametzianov R, Baker JG, Nehmé R, Edwards PC, et al. (2010) The structural basis for agonist and partial agonist action on a β1-adrenergic receptor. Nature 469: 241–244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. Deupi X, Edwards P, Singhal A, Nickle B, Oprian D, et al. (2012) Stabilized G protein binding site in the structure of constitutively active metarhodopsin-II. Proc Natl Acad Sci USA 109: 119–124. 10.1073/pnas.1114089108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Vogel R (2001) Conformations of the Active and Inactive States of Opsin. J Biol Chem 276: 38487–38493. [DOI] [PubMed] [Google Scholar]
- 74. Scheerer P, Park JH, Hildebrand PW, Kim YJ, Krauß N, et al. (2008) Crystal structure of opsin in its G-protein-interacting conformation. Nature 455: 497–502. 10.1038/nature07330 [DOI] [PubMed] [Google Scholar]
- 75. Choe H-W, Kim YJ, Park JH, Morizumi T, Pai EF, et al. (2011) Crystal structure of metarhodopsin II. Nature 471: 651–655. 10.1038/nature09789 [DOI] [PubMed] [Google Scholar]
- 76. Park JH, Scheerer P, Hofmann KP, Choe HW, Ernst OP (2008) Crystal structure of the ligand-free G-protein-coupled receptor opsin. Nature 454: 183–187. 10.1038/nature07063 [DOI] [PubMed] [Google Scholar]
- 77. Ahuja S, Hornak V, Yan ECY, Syrett N, Goncalves JA, et al. (2009) Helix movement is coupled to displacement of the second extracellular loop in rhodopsin activation. Nat Struct Mol Biol 16: 168–175. 10.1038/nsmb.1549 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Feuda R, Hamilton SC, McInerney JO, Pisani D (2012) Metazoan opsin evolution reveals a simple route to animal vision. Proc Natl Acad Sci USA 109: 18868–18872. 10.1073/pnas.1204609109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Gehring WJ (2012) The evolution of vision. WIREs Dev Biol 3: 1–40. [DOI] [PubMed] [Google Scholar]
- 80. Firestein S (2001) How the olfactory system makes sense of scents. Nature 413: 211–218. [DOI] [PubMed] [Google Scholar]
- 81. Bianchet MA, Bains G, Pelosi P, Pevsner J, Snyder SH, et al. (1996) The three-dimensional structure of bovine odorant binding protein and its mechanism of odor recognition. Nat Struct Mol Biol 3: 934–939. [DOI] [PubMed] [Google Scholar]
- 82. Laughlin JD, Ha TS, Jones DNM, Smith DP (2008) Activation of pheromone-sensitive neurons is mediated by conformational activation of pheromone-binding protein. Cell 133: 1255–1265. 10.1016/j.cell.2008.04.046 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83. Ernst OP, Lodowski DT, Elstner M, Hegemann P, Brown LS, et al. (2014) Microbial and Animal Rhodopsins: Structures, Functions, and Molecular Mechanisms. Chem Rev 114: 126–163. 10.1021/cr4003769 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Schuenke M, Schulte E, Schumacher U (2010) THIEME Atlas of Anatomy: Head and Neuroanatomy. Thieme. 414 p.
- 85. Hankins M, Peirson S, Foster R (2008) Melanopsin: an exciting photopigment. Trends Neurosci 31: 27–36. [DOI] [PubMed] [Google Scholar]
- 86. Murakami M, Kouyama T (2008) Crystal structure of squid rhodopsin. Nature 453: 363–367. 10.1038/nature06925 [DOI] [PubMed] [Google Scholar]
- 87. Surgand J- S, Rodrigo J, Kellenberger E, Rognan D (2005) A chemogenomic analysis of the transmembrane binding cavity of human G-protein-coupled receptors. Proteins 62: 509–538. [DOI] [PubMed] [Google Scholar]
- 88. Katritch V, Fenalti G, Abola EE, Roth BL, Cherezov V, et al. (2014) Allosteric sodium in class A GPCR signaling. Trends Biochem Sci 39: 233–244. 10.1016/j.tibs.2014.03.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89. Miller-Gallacher JL, Nehmé R, Warne T, Edwards PC, Schertler GFX, et al. (2014) The 2.1 Å Resolution Structure of Cyanopindolol-Bound β1-Adrenoceptor Identifies an Intramembrane Na+ Ion that Stabilises the Ligand-Free Receptor. Plos One 9: e92727 10.1371/journal.pone.0092727 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Gerwert K, Freier E, Wolf S (2014) The role of protein-bound water molecules in microbial rhodopsins. BBA—Bioenergetics 1837: 606–613. [DOI] [PubMed] [Google Scholar]
- 91. Filipek S, Palczewski K, Yuan S, Vogel H (2014) Activation of G-protein-coupled receptors correlates with the formation of a continuous internal water pathway. Nat Commun 5: 1–10. [DOI] [PubMed] [Google Scholar]
- 92. Yuan S, Hu Z, Filipek S, Vogel H (2015) W2466. 48 Opens a Gate for a Continuous Intrinsic Water Pathway during Activation of the Adenosine A2A Receptor. Angew Chem Int Ed Engl. 127: 566–569. [DOI] [PubMed] [Google Scholar]
- 93. Angel TE, Chance MR, Palczewski K (2009) Conserved waters mediate structural and functional activation of family A (rhodopsin-like) G protein-coupled receptors. Proc Natl Acad Sci USA 106: 8555–8560. 10.1073/pnas.0903545106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94. Kane B, Svensson B, Ferguson D (2006) Molecular recognition of opioid receptor ligands. AAPS J 8: E126–E137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95. Stewart A, Wong K, Cachat J, Gaikwad S, Kyzar E, et al. (2011) Zebrafish models to study drug abuse-related phenotypes. Rev Neuroscience 22: 95 10.1515/RNS.2011.011 [DOI] [PubMed] [Google Scholar]
- 96. Molinari P, Vezzi V, Sbraccia M, Gro C, Riitano D, et al. (2010) Morphine-like Opiates Selectively Antagonize Receptor-Arrestin Interactions. J Biol Chem 285: 12522–12535. 10.1074/jbc.M109.059410 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.