Abstract
Also called the Golgi pH regulator (GPHR), GPR89 is an orphan membrane protein found in nearly all eukaryotic lineages. Despite its broad phylogenetic distribution, the evolutionary history, structural diversity, and function of GPR89 remain poorly understood. In this study, we present a comprehensive bioinformatic analysis of GPR89 in Eukarya by integrating phylogenetic reconstruction, genomic synteny, sequence conservation, and structural modeling. While GPR89 is typically encoded as a single-copy gene, we identified lineage-specific duplications in both vertebrates and vascular plants. In contrast to the large sequence conservation, differences can be observed in whether plants and animals preserve the gene structure flanking GPR89. Structural clustering places GPR89 within the solute carrier (SLC) group, together with LIMR protein family members. Predicted structures reveal a unique intracellular helix hairpin and a conserved transmembrane core compatible with putative transport activity. This work provides a unifying framework for interpreting the existing evidence on the GPR89 function and proposes that GPR89 should be classified within the LIMR superfamily.
Keywords: GPHR, COLD1, GTG, Organelle, Orphan Receptor, Solute Carrier, Transporter
Graphical Abstract
Highlights
-
•
GPR89 is conserved in nearly all major eukaryotic lineages.
-
•
Structural analysis places GPR89 within the LIMR superfamily.
-
•
The predicted structure features a conserved core and a unique intracellular helical hairpin.
1. Introduction
GPR89 is a highly conserved orphan membrane protein found across unicellular eukaryotes, land plants, and animals [13], [16], [43], [53], [9]. Deficient GPR89 function in mammals has been associated with neurodegeneration [56], disrupted brain cholesterol synthesis [57], and compromised epidermal barrier function [60]. Originally annotated as a G protein-coupled receptor (GPCR), GPR89 displays nine predicted transmembrane domains, differing from the canonical seven-transmembrane GPCR structure. Although interactions with the G protein alpha subunit have been reported in plants such as rice, maize, and Arabidopsis thaliana [16], [42], [49], [70], GPR89’s classification as a functional GPCR remains unconfirmed due to the absence of a defined activation mechanism or a preferred signaling pathway [30].
Interestingly, GPR89 has been linked to a variety of molecular functions that might seem incompatible at first. In vascular plants, its localization and proposed roles vary widely; it has been described as a cold-activated calcium-permeable channel [42], an abscisic acid receptor [49], [70], and an ion channel present in the ER, Golgi, and the plasma membrane [30].
In animals, GPR89 has been referred to as the Golgi pH regulator (GPHR) and is predominantly located in the Golgi apparatus and endoplasmic reticulum [43], [56], [9]. A particularly intriguing hypothesis suggests that mammalian GPR89 forms a large-conductance, non-rectifying chloride channel (∼300 pS; PCl-/PK+ ≈ 2.5) in the Golgi membrane, potentially corresponding to the GOLAC-2 current [43], [61]. This chloride conductance may provide the counterion flux necessary to support proton accumulation and maintain luminal pH gradient in the Golgi. Importantly, Golgi pH regulation is associated with several pathologies, including congenital glycosylation disorders, cancer, increased viral susceptibility, and intellectual disabilities [36], [52].
Lastly, in the unicellular eukaryote Trypanosoma brucei, GPR89 is found at the plasma membrane, where it likely functions as an oligopeptide transporter [53]. These divergent functions and localizations suggest that the different orthologs of GPR89 may have evolved distinct physiological roles across taxa, while maintaining a conserved core structure.
In this study, we investigated the evolutionary and structural landscape of GPR89 to better understand its roles across eukaryotic lineages. We show that it is a well conserved protein, retained as a single-copy gene in most taxa, with independent duplication events observed in both metazoa and vascular plants. Our clustering analysis strategy revealed that GPR89 displays strong structural similarity with members of the LIMR protein family, forming a group that is nested between solute carrier transporter (SLC) superfamilies. Moreover, sequence conservation and evolutionary covariance in AlphaFold2-predicted structures highlights a conserved hydrophobic core centered on transmembrane segment 5 (TM5). The latter forms part of a lateral groove that could be potentially involved in substrate binding and transport. Surrounding this core are polymorphic intracellular and extracellular loops, including a prominent cytosolic alpha-alpha hairpin (HP) super-secondary structure. The loop at the tip of the hairpin (HPloop) shows marked sequence variability, especially among unicellular taxa, and may represent a modular domain contributing to the observed functional divergence of GPR89 across eukaryotic lineages. Overall, GPR89 represents a structurally conserved membrane protein that might have acquired lineage-specific subtle functional specializations, while retaining a core architecture associated with membrane permeability and organellar homeostasis.
2. Results
2.1. Repertoire of GPR89 across species
To understand the process of diversification of GPR89, we first performed a Maximum likelihood (ML) phylogenetic analysis using CDS from representative members of diverse taxonomic groups. Following calculation of hydrophobicity profiles, we recovered a total of 130 sequences, annotated as bona fide GPR89 orthologs in model species (Supplementary Table 1). The collected orthologs had an average length of 1368 nucleotides and the final alignment spanned 3913 nucleotides, displaying 248 conserved sites.
The Maximum Likelihood tree obtained shows the presence of the GPR89 gene among Metazoa, Archaeplastida, and Heterotrophic protists corresponding to Amorphea (Holozoa, Ascomycota, and Amoebozoa), Excavata, and SAR groups (Stramenopiles, Alveolata, and Rhizaria lineages) (Fig. 1). The gene tree is in agreement with previous phylogenetic hypotheses obtained by unrooted Neighbor-Joining trees [13], [30], [45]. Our phylogenetic reconstruction, recovers the monophyly among Metazoa, within Vertebrata, and its natural relations as monophyletic clades (Mammalia, Sauropsida, Anura, and Actinopterygii). Furthermore, the tree recovers Chlorophyta species as a sister group of Embryophyta (Fig. 1). Other relationships—such as those established among heterotrophic protists—appear as paraphyletic clades, including organisms from SAR and Opisthokonta (Fig. 1). Therefore, our analysis is consistent with both previous observations and the updated hypotheses for the speciation of Eukaryotic organisms [7].
Fig. 1.
Phylogenetic tree of GPR89 CDS across Eukarya. Best Maximum Likelihood phylogenetic tree displaying large groups in color code, Archaeplastida in green, heterotrophic protist in blue and Metazoa in orange. Numbers under the nodes represent bootstrap support values and red circles denote GPR89 intraspecific functional gene duplication. In parenthesis it indicates the number of sequences for each collapsed clade. Bar indicates nucleotide substitutions per site. The tree was rooted at the midpoint.
The branch lengths within major clades are relatively uniform, which supports a high degree of sequence conservation among closely related taxa (Fig. 1). An exception to this pattern is observed in heterotrophic protists, where significantly longer branch lengths indicate a diversification process, contrasting to the higher conservation observed within Metazoa and Archaeplastida. Among protists, Trypanosoma brucei emerges as the most divergent species.
A paralog of GPR89A, named GPR89B, was identified in both Metazoans and Archaeplastida, but is absent in heterotrophic protists (Fig. 1). Among Metazoa, the presence of more than one copy is relatively rare and, when observed—as in some representatives of Mammalia, Actinopterygii, Nematoda, Cnidaria, and Brachiopoda—it occurs in distant lineages, suggesting it results from lineage-specific, independent gene duplication events rather than a shared ancestral duplication (Fig. 1).
In contrast, multiple copies of the GPR89 gene are more frequently found in plants, particularly in green algae (e.g., Chlorella variabilis) and in a wide range of vascular plants, including Lycophytes, Gymnosperms, and Angiosperms (both Eudicots and Monocots) (Fig. 1). This broader retention of duplicated gene copies in plants aligns with the well-documented prevalence of whole-genome duplication (polyploidy) events in plant evolution and the generally higher tolerance for genetic redundancy and plasticity in plant genomes [39], [41], [48], [68].
To better understand the evolutionary history of GPR89, especially in groups with known duplication events and similar evolutionary time frames, we inspected the genomic organization around GPR89 orthologs. For this purpose, we compared the synteny found from from H. sapiens to Danio rerio (divergence 429 MYA; [37]) (Supplementary Figure 1) and that of A. thaliana to Marchantia polymorpha (divergence 480 MYA; [37]) (Supplementary Figure 1 A & B).
For the case of vertebrates, a comparison of the genomic regions displayed by the mammalian, sauropsid, and amphibian orthologs indicates that the single copy gene is flanked by five conserved genes: GJA8, GJA5, APC6, and BLC9 upstream, and by PDZK1 downstream. In fish, only PDZK1 remains in the syntenic group (Supplementary Figure 1 A). For the case of humans, the two copies of GPR89 arose from a unique duplication event that happened approximately 4.7 MYA [15]. These gene copies are located in close proximity to each other on chromosome 1, suggesting a duplication event that originates from a single locus (Supplementary Figure 1 A). In contrast, Embryophyta displays a similar synteny only at the family taxonomic level (Supplementary Figure 1B). For instance, in Brassicales, A. thaliana GPR89A (known as AtGTG1) on chromosome 1 has similar gene composition than his ortholog in B. oleracea (that is located on chromosome C2). The consistency of the syntenic region is lost over larger evolutionary distances, which can be explained by the more extensive genomic rearrangement reported in plants [55]. Thus, the remarkably conserved synteny in chordates contrasts with the absence of conservation displayed by plants.
Taken together, our findings suggest that the occurrence of multiple functional GPR89 copies across taxa did not stem from a single ancestral duplication, but rather from multiple, independent duplication events—such as segmental duplications and polyploidizations—reflecting distinct evolutionary pressures in Metazoan and Archaeplastid lineages (Supplementary Figure 1).
2.2. Detailed evolutionary trajectory of GPR89 among mammals
In addition to the canonical duplication, we identified a reported human pseudogene annotated as GPR89P. This prompted a broader search for GPR89A-related pseudogenes. Among eukaryotes, we found GPR89P restricted to mammals, particularly in most Anthropoid primates, and sporadically in Lagomorpha (rabbit), Scandentia (treeshrews), Rodentia (naked mole rat, degu, jerboa), Cetartiodactyla (blue whale, dolphin), and Marsupialia (wombat, koala). To further explore the evolutionary trajectory of GPR89 in mammals, we conducted a phylogenetic analysis including sequences for GPR89A, GPR89B, and GPR89P (Supplementary Table 2). The resulting gene tree confirmed one-to-one orthology of GPR89A across mammals (Supplementary Figure 1 C), showing the expected relationships among major lineages. The only exception was the paraphyly of Rodentia, likely due to long-branch attraction involving Jaculus jaculus and Dipodomys ordii [18], [5]. Among mammals, two coding copies of the gene were found exclusively in Homo sapiens and Vombatus ursinus, with each pair exhibiting near perfect CDS similarity (1 bp synonymous substitution in humans, completely identical in wombat). Moreover, our topology recovered monophyletic clades containing both the coding gene and pseudogene for Primate, Scandentia, Lagomorpha, Cetartiodactyla, and Marsupialia. These observations suggest species-specific duplications rather than a shared ancestral event for both coding genes and for the retrospseudogene copies as well (Supplementary Figure 1 C).
2.3. Amino acidic conservation patterns highlight critical regions
In order to identify possible conserved motifs, and regions of high variability in GPR89 orthologs, we translated our set of sequences to their respective protein primary structure (Supplementary table 1). The final set included representatives of Opisthokonta (Metazoa, Fungi, and Choanozoa), Chlorophyta (land plants and single-celled algae), as well as heterotrophic protists, such as members of the groups SAR, Rhodophyta, and Amoebozoa. We aligned the set of translated sequences and constructed an amino acid sequence identity histogram to detect conserved and variable regions within the protein sequence of human GPR89 (Fig. 2). We observed that the regions encoding for TM segments previously suggested in the literature [16], [43], [49], [70], [9] are not only well defined but largely conserved among all taxa (Fig. 2A). A thorough analysis of the amino acid distribution revealed 4 residues with conservation over 95 %, which mapped to their positions in human GPR89 correspond to P91 at TM3 and G154, L161 & G163 at TM5 (Fig. 2A & B).
Fig. 2.
Multiple Sequence Alignment of the translated sequences of curated GPR89 orthologs found across Eukarya. (A) Per-residue frequency histogram for the different amino acids. Structural annotations from the human GPR89A protein, obtained from the AlphaFold DB and DeepTMHMM, are mapped to their corresponding positions within the histogram: transmembrane helices (TM1–9, gold/hatched), hairpin helices (HP1&2, blue/hatched), and their connecting hairpin loop (HPloop, magenta/hatched). Black triangles above the bars mark the positions of highly conserved residues (>95 %). A red rectangle delimits a cluster of highly conserved residues at TM5. (B) Amino acid frequency logo of the TM5 conserved cluster. Black arrows indicate 95 % conserved residues P91, G154, L161 & G163 mapped to the human GPHRA sequence. (C) MSA of the hairpin region of selected GPR89 orthologs from animals, plants, and one choanoflagellate. Residue similarity is indicated at 100 % (black), > 80 % (dark gray) & > 60 % (light gray). A consensus sequence of residues over 90 % conserved is shown on top. A highly variable region is observed surrounding and encompassing the HPloop.
An important degree of polymorphism, including a variety of insertions/deletions and natural mutations, was found in the portion of the intracellular linker located between TM5 and TM6 (Fig. 2A & B). This region is predicted to fold in an alpha-alpha helix hairpin (HP) with a short loop connecting the helices in most collected sequences (HPloop) (Fig. 2A & C). The most dramatic changes in the polymorphic region corresponding to large insertions in the HPloop (Fig. 2A). These are present in orthologs from unicellular eukaryotes such as Amoeba, green algae, Ascomycota, and Excavata. Interestingly, in flowering plants, such as Oryza sativa and A. thaliana, this linker has been implicated in binding the alpha subunit of the G protein heterotrimer [16], [42], [49], [70].
To inspect whether certain polymorphisms are associated with major taxonomic groups, representative sequences of chordates and vascular plants were inspected without the large gaps introduced by most unicellular organisms (Fig. 2C). Notably, both mammals and flowering plants have clear consensus sequences respective to their own groups. For example, a member of an ancient Tracheophyte lineage (Selaginella moellendorffii) shows a variable region closely resembling that of flowering plants, while an Actinopterygii (Danio rerio) displays a sequence that is closer to the one in mammals. On the other hand, the choanoflagellate Monosiga brevicollis, which lacks some of the distinct insertions of unicellular organisms found in this region, shows substitutions that do not clearly align with the distinguishing characteristics of plants or animals (Fig. 2C). Overall, these analyses suggest that putative functional properties arising from this sequence segment, such as Galpha binding reported in plants, might have evolved within their respective taxonomic group.
2.4. Structural alignments identify GPR89 as a member of the LIMR superfamily
Protein structure is closely related to function, and is more sensitive to distant homologies than the primary sequence alone [10], [29]. For this reason, we decided to search for structural similarities instead of primary sequence homology. A preliminary search was performed using the predicted structure of human GPR89 (Uniprot ID: B7ZAQ6) against the full human AlphaFold Protein Structure Database via the Dali server [28], [64]. Surprisingly, across the entire database, only the four members of the poorly studied LIMR family (Pfam PF04791) emerged as the closest structural matches, despite the fact that their primary sequences share less than 15 % identity (Table 1). Further structural examination shows that they all share 9 transmembrane helices, with a characteristic linker between TM5 & 6 (Supplementary Figure 2).
Table 1.
Structural comparison of human GPR89A against the AlphaFold Database using the Dali Server. Top hits (Z-scores > 10) are shown.
The namesake member of this family, LIMR (i.e., LMBRL), has been associated with the endocytic transport of hydrophobic molecule-binding proteins, such as lipocalin-1 and β-lactoglobulin ([66]; Fluckinger et al., 2007). Another member, LMBRD1, participates in the lysosomal cobalamin transport, acting as an escort protein of the ABCD4 transporter [14], [35], [54].
It has been suggested that GPR89 orthologs act either as a transporter or an ion channel (Maeda et al., 2009; [42], [53]). One of the most thorough functional studies was done in Trypanosoma brucei (Rojas et al., 2015), where authors suggest a role as an oligopeptide transporter, similar to SLC15, a member of the Major Facilitator Superfamily (MFS, CATH 1.20.1250.20) of solute carrier transporters (SLC). Members of the SLC superfamily are found throughout the eukaryotic cell in both the plasma membrane and organellar membranes. The SLC group comprises over 400 different transporters. According to their sequence homology and characteristic function, SLC are organized in multiple distinct families [19], [26], [62]. Thus, we focus our efforts on whether the human GPR89 ortholog is part of the large group of transporter proteins including SLCs and ABCs. The multiple structure alignment prediction was performed over 480 human proteins by using TM-Vec, a deep learning approach that efficiently approximates TM-scores from primary sequences without explicit structural superposition [25]. We included most human SLC and ABC transporters, plus representative members of other families, such as chloride and calcium channels, and GPCRs. Explicit structural outgroups were used to root the tree, i.e. beta-barrels (VDAC1 & TOM40) and globins (myo & hemoglobin). From the predicted TM-scores rendered by TM-Vec, we constructed a distance tree (distance = 1 - TM-score) using unweighted pair group method with arithmetic mean (UPGMA) analysis (Fig. 3). The resulting tree can be roughly categorized into two main branches: the MFS/DMT superfamily-containing branch (CATH: 1.20.1250.20 & 1.10.3730.20 respectively) and the APC superfamily-containing branch (CATH: 1.20.1740.10/1.20.1730.10). Consistent with the Dali results, GPR89 groups well with all LIMR family members, with the closest alignment being to LMBD2 (Fig. 3A; tree file in Supplementary Materials). This group lies in the APC-containing branch, between the SLC39 (Zinc-Iron permeases) and SLC42 (Ammonium transporters) families (Fig. 3A).
Fig. 3.
Pairwise structural alignment prediction of GPR89 with SLC transporters and other transmembrane proteins. (A) Expanded unrooted tree depicting the UPGMA groups from the TM-Vec prediction. Branch colors and numbers at the tips indicate the CATH fold classification of the transmembrane domains for each protein; black signifies no annotated CATH classification. Non-transporters are indicated: CAC (*), CLCN (**), CFTR(***) and GPCR (*****). The sky blue triangle indicates GPHRA, and the gray shaded sections correspond to the main structural groups. The black dotted section was selected for the reanalysis shown in panel B. (B) Pairwise structural alignment of the GPR89 neighborhood. The numbers at each node show their UPGMA distance (average 1 - TM-Score). A distance cutoff of 0.4 (dotted line) reflects an 80 % probability of shared CATH classification between random alignments [67]. (C) Structural Alignments showing the citosolic view between GPR89 and representative members of each family are presented at the bottom. Blue: GPR89, Yellow: SLC42 (PDB ID: 8CSX), Orange: SLC4 (PDB ID: 6CAA), Magenta: SLC39 (PDB ID: 7Z6N), Red: LMBRL.
Since TM-Vec analysis considered the entire protein sequence, unwanted noise in the grouping procedure might have been introduced by soluble or intrinsically disordered domains. To address this possibility, we performed a pairwise structural alignment considering the transmembrane domains exclusively, using TM-align [69]. The advantages of this approach include the utilization of explicit three-dimensional coordinates and the computation of inter-query Euclidean distance. Despite some grouping rearrangements, GPR89 still clusters with the LIMR family, being the earliest diverging member of the group, with an average pairwise distance to other members of 0.351 (Fig. 3B). This value falls below the established TM-score threshold of 0.4 for pairwise analysis, which indicates a > 80 % probability of sharing a CATH structural classification [67]. Above this threshold, we may only infer a shared topology at best.
In short, our evidence suggests that GPR89 is a novel member of the LIMR superfamily, sharing their characteristic 9TM and HP domain. Structurally, the transmembrane region of the group resembles SLC transporters more than ABC transporters, Calcium and Chloride Channels, or GPCRs. Nevertheless, taking into account the limitations of performing in silico structural analyses over predicted targets, further experimental data would be necessary to confirm our predictions.
2.5. GPR89 shows a conserved transmembrane groove
To better understand the implications of similarities and differences displayed by GPR89 orthologs, we inspected the predicted structures available in the AlphaFold database [64], accounting for representative organisms of the various species and groups included in the present work. The predicted structure for human GPR89A (Uniprot ID B7ZAQ6), calculated by AlphaFold 2 (AF2) [31], [64], shows the expected 9 transmembrane helices and a long helix hairpin (HP) that shapes the TM5-TM6 cytosolic linker (Fig. 4). These structural features are consistent with other predicted models in the database, such as those from A. thaliana (UniProt: Q9XIP7), D. discoideum (Q54QM5), and T. brucei (Q580H0). In these models, the prediction of a 9TM architecture with a long helical hairpin constituting the majority of the TM5-TM6 linker is a systematic feature, likely constituting a structural signature for GPR89. In general, the large insertions displayed in various organisms occur predominantly in and around the cytosolic HPloop.
Fig. 4.
Predicted structure and conservation of human GPR89A. (A) AlphaFold2-predicted structure of GPR89A, showing 9 transmembrane helices and two cytosolic helices forming a hairpin. Colored TM residues correspond to those predicted by the DeepTMHMM server. (B) Conservation scores from ConSurf, using the MSA from Fig. 3, were mapped onto the structure and colored as a gradient from magenta (most) to cyan (least) conserved.(C) A closer look at the highly conserved TM5 and its vicinity. (D-F) Lateral hydrophilic cavity observed from the cytosolic (D), membrane (E) and extracellular/luminal perspectives (F). The surface is colored by calculated electrostatic potential; red: negative charge, blue: positive charge, white: neutral. TM5 is depicted in yellow.
To better visualize the relative position of the conserved residues, we displayed the conservation score at each position of the MSA previously obtained (Fig. 2), in the context of the predicted structure for GPR89A (Fig. 4 B & C). By doing this structural mapping, we noted that conserved residues of different helices appear in close proximity (<4 Å atom to atom) to the well conserved TM5 helix, which appears tilted relative to the other transmembrane helices (Fig. 4C). A large, partially hydrophilic vestibule, formed by TM5, TM6 & TM7, is accessible from the cytosolic face of the predicted structure (Fig. 4 D & E). This opening displays polar residues (R300, R357, Y390, H423, D427). The cavity appears inaccessible from the extracellular side of the predicted structure (Fig. 4F).
3. Discussion
The present work confirms the broad conservation of GPR89 across eukaryotic lineages and reveals its predominant retention as a single-copy gene, which suggests its expression is under tight regulatory control. This would be consistent with a strong purifying selection and a function of general importance, underscored in recent literature [23], [56]. The strong congruence between the phylogeny of GPR89 and the species tree [7] further indicates evolutionary robustness preserved along speciation events. An exception is found in protists, which exhibit exceptional divergence, suggesting a distinct evolutionary path potentially involving lineage-specific innovations, restricted to specific protein domains.
Despite this conserved evolutionary signature, GPR89 exhibits diverse molecular roles and subcellular localizations across taxa. In vertebrates, it is found at the Golgi and endoplasmic reticulum membranes, where it likely supports the anion flux that is essential for luminal acidification [43]. It has been suggested that GPR89 orthologs in plants mediate calcium permeability and the detection of hormonal and environmental cues [16], [42], [49], [70]. In Trypanosoma brucei, it localizes primarily to the plasma membrane where it supports the transport of small peptides [53]. Collectively, this evidence suggests a general role in membrane transport.
Our data strongly suggest that GPR89 shares a structural topology with the LIMR superfamily. Members of the LIMR protein family have been described as endocytic receptors [20], [66], transporter escort proteins [14], [35], or mediators of GPCR signalling cascades [11], [40], [47], indicating functional plasticity.
A shared feature is the HPloop, a distinct intracellular helix containing a Gα binding site in plant GPR89 [16], [42], [49], [70]. This Gα-binding capacity aligns well with a role in modulating membrane permeability and may represent a common feature within the LIMR family.
The apparent functional diversity of GPR89 across eukaryotes could be explained by a hypothetical role as an escort protein, which interacts with clade-specific transport proteins or membrane receptors. However, a significant challenge to this model arises from the functional recovery demonstrated by Rojas et al., where the Trypanosoma brucei gene complemented a transporter-deficient strain of E. coli. Bacteria lack the complex eukaryotic machinery (e.g., G proteins, clathrin) required for an escort function, therefore, this finding suggests that at least TbGPR89 may operate as a transporter by itself.
Our findings thus support the transporter hypothesis originally proposed by Rojas et al. [53]. The structural classification places GPR89 and the related LIMR group within the SLC transporter superfamily, with closer structural similarity to the APC family. Moreover, AlphaFold2 structural models revealed a conserved transmembrane scaffold that would be compatible with solute translocation. Consistent with known SLC architecture [4], [51], the predicted structure is organized in two sections, displaying a central hydrophobic core surrounding a specific transmembrane segment (TM5) [12], [17]. TM5 is flanked by conserved transmembrane helices, and polymorphic cytoplasmic and extracellular loops. The cytoplasmic loops display kingdom-specific variations that are particularly evident in the intracellular helix hairpin (HP) and the associated HPloop, which may govern regulatory interactions or subcellular targeting. It is important to note that no structure of the putative LIMR superfamily or GPR89 ortholog has been resolved, and that AlphaFold predictions are less reliable for orphan proteins with no close experimental evidence [65].
The inherent modularity of SLCs allows subtle molecular adaptations to have major functional consequences. For instance, prestin (SLC26A5) has lost the capacity to transport and evolved into a molecular actuator for electromotility in mammalian cochlear hair cells, sustaining the hearing process in mammals [38]. Therefore, it would be reasonable to suggest that GRP89 might be an ancient protein that retained transport capacity while other members of the LIMR protein family lost their transport capacity and turned into modulatory proteins.
In summary, phylogenetic stability, conserved transmembrane architecture, and structural classification places GPR89 within a well-supported structural and evolutionary context as part of the LIMR superfamily.
4. Materials and methods
4.1. Phylogenetic analyses
We used the ortholog prediction function of Ensembl database [27] and BLAST on the NCBI platform [2] to search for ortholog and paralog genes of GPR89 on major groups of vertebrates, metazoan model species, major groups of Viridiplantae (including green algae), and unicellular eukaryotic organisms such as heterotrophic protists. After corroborating the number of transmembrane segments with DeepTMHMM and AlphaFold 2 Protein Database [24], [64], the accuracy of the CDS translation was confirmed rendering an initial data set of 130 CDS including GPR89A and GPR89B copies for Archaeplastida, Heterotrophic protist, and Metazoa (See details of the taxonomy sampling and the accession numbers on Supplementary Table 1). In instances where the nucleotide sequence was incomplete, we manually annotated the CDS by retrieving the corresponding genomic segments from Ensembl databases. Exon sequences were subsequently recovered using the exons of a canonical sequence from a closely related species for each case, in Geneious 2023.2.1 (https://www.geneious.com) (referred as “own annotation” in Supplementary Table 1).
For the phylogenetic analyses on Supplementary Figure 1, we looked for orthologs and paralogs of GPR89A and GPR89P among major groups of Angiospermae and Mammals (Supplementary Table 2). In this case, we retrieved GPR89P performing a BLASTN search on the Ensembl database with default settings against the whole genome of model mammals organisms, using the functional CDS of each species as a query sequence. When GRP89P was present, we manually annotated the CDS using the coding exons of the functional gene of each species in Geneious 2023.2.1 (https://www.geneious.com). Therefore, we obtained a new data set, containing 99 Mammalian sequences (including GPR89A, B and P). Sauropsid sequences were used as outgroups for mammals.
For both data sets, the Multiple Sequence Alignment (MSA) was performed using MAFFT [33], [34] on Geneious Prime 2023.2.1, allowing the software to choose the strategy to determine best fit. Then, we performed a visual inspection of the alignments. The conservative sites were obtained on MEGA11 [58], and Model Finder option was used by default on IQ-TREE 2 [32] to select the best fitting model for nucleotide substitutions (Fig. 1: SYM+I+G4, Supplementary Figure 1 C: GTR+F+I+G4). We further used IQ-TREE 2 [44] to conduct a maximum likelihood analysis in order to obtain the best phylogenetic tree. This analysis was run four times to corroborate the topology of this tree. Bootstrapping was performed to obtain the consensus from 1000 replicates. We report the best Maximum Likelihood phylogenetic tree, obtained with the following minimum Log-likelihood values: (Fig. 1: −105917.056, Supplementary Figure 1 C: −21678.235). The corresponding values for bootstrap support were obtained from the consensus tree.
4.2. Synteny assessment
From the Ensembl Compara database [27] we obtained the loci of the genes. From Genomicus 109.1 vertebrate database [46], and PLAZA5.0 Angiosperm database [63] we obtained the genomic context for predicted functional copies of GPR89. We examined upstream and downstream coding genes that are common to model organisms within these major groups. Specifically, for the case of mammals, we also look forward to retro-pseudogenes synteny but due to the fact that we were not able to find an evident pattern, synteny for pseudogenes is not reported.
4.3. Amino acidic frequency analysis
The 128 non-redundant sequences from the CDS were translated in silico, and then the amino acid sequences were aligned using MAFFT (L-INS-i strategy). In order to facilitate the visualization in Fig. 3C, we minimized the number and length of the gaps by removing protists and other sequences.
4.4. Multiple Structural Alignment
We selected a collection of human transporters, defined as such in the IUPHAR/BPS Guide to Pharmacology [1]. Sequences included members of the Solute Carriers (SLC), ABC transporters, human GPR89A, LIMR protein homologs and representatives of other transmembrane protein families, such as GPCRs, Calcium and Chloride Channels. Beta-barrel proteins (VDAC1 and TOM40) & globulins (myoglobin and alpha-hemoglobin) were chosen as structural outgroups. The amino acid sequence of these proteins was obtained directly from Uniprot, via an automatic Python script and manual accession. Transporter subunit OSTB was excluded from the analysis for its short length (128 a.a.). The final list comprises 482 sequences in total.
The structural alignment was carried out using TM-Vec [25]. This Deep-Learning tool predicts the TM-score (structural similarity) of proteins via their primary structure. We used the large CATH dataset-trained model [25] and the full amino acid sequences previously described. The TM-score prediction was turned to a symmetric distance matrix, where structural distance was defined as 1 - TM-score (0 = identical structures, 1 = complete dissimilar structures). Then we ran the UPGMA analysis on MAFFT online (https://mafft.cbrc.jp/alignment/server/phylogeny.html). We used the unrooted ultrametric tree. This tree file can be found in Newick format on our Supplementary Material.
For the pairwise structural alignment of transmembrane domains, experimental structures from representative neighbors of the previous analysis were recovered from the Protein Data Bank [6]. For close neighbors with no structure available, the AlphaFold Protein Structure Database structure was used. The soluble domains were removed from the pdb files, and the structures were aligned using TM-align [69]. Beta-barrel proteins (VDAC1 and TOM40) were selected as structural outgroups.
4.5. Structural predictions
All 3D models for the different GPR89 orthologs and paralogs we found present in Eukarya were obtained from the freely available AlphaFold Protein Structure Database [64]. We present human GPR89A as a representative structural model. All inspected models shared the general 9TM topology and the structural signatures discussed in this paper.
4.6. Figure rendering and accessibility
The trees were visualized using Figtree 1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/) and vectorial silhouettes for phylogenetic figures were taken from https:/phylopic.org. These figures were colored and prepared on Inkscape 1.1.2.
MSA data was plotted with a custom Python script [8] and the sequence logo was made using LogoMaker [59]. For accessibility, Google’s Turbo color palette was chosen for labeling (http://research.google/blog/turbo-an-improved-rainbow-colormap-for-visualization/), and all figures were tested using a Color Blindness Simulator in Python (https://medium.com/@er_95882/colour-blindness-simulator-in-python-detecting-visualisation-problems-in-presentations-and-reports-71595526dc0e).
The degree of conservation of the different residues obtained from the MSA was mapped onto the structure of GPR89 using the web server ConSurf [21], [3]. The mapped structure was then exported to the molecular visualization software UCSF ChimeraX [22], [50], and the conservation score was manually annotated into the PDB file. The transmembrane helices were annotated according to the prediction of the DeepTMHMM server [24]. Coulombic surface potentials were calculated by ChimeraX.
CRediT authorship contribution statement
Luka Robeson: Writing – review & editing, Visualization, Methodology, Formal analysis. Brauchi Sebastian: Writing – review & editing, Writing – original draft, Project administration, Methodology, Formal analysis, Data curation, Conceptualization. Camila A. Quercia-Raty: Writing – original draft, Visualization, Investigation, Formal analysis, Data curation.
Declaration of Generative AI and AI-assisted technologies in the writing process
During the preparation of this work, the authors used ChatGPT by OpenAI to correct the language and improve the readability of the text. After using this tool, the authors reviewed and edited the content as needed and took full responsibility for the content of the published article.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgements
The authors acknowledge Dr. Juan C. Opazo for consultation on annotating pseudogene sequences. This work was supported by Beca ANID #21220073 to LR, Fondo Nacional de Desarrollo Científico y Tecnológico from the Chilean government (FONDECYT 1191868 and 1241753) to SB, and ANID-Millennium Science Initiative Program #NCN19_168 (SB and CQ). The Millennium Nucleus of Ion Channel-Associated Diseases (MiNICAD) is a Millennium Nucleus of the Iniciativa Milenio, National Agency of Research and Development (ANID, Chile). This research was supported by the Patagon supercomputer of Universidad Austral de Chile (FONDEQUIP EQM180042).
Glossary
- APC
Amino acid-polyamine-organocation superfamily
- CDS
Coding DNA Sequence, i.e., the coding region of a gene
- COLD1
Protein chilling tolerance divergence 1, aka GPR89
- DMT
Drug/Metabolite Transporter superfamily
- GPCR
G protein-coupled receptor
- GPHR
Golgi pH regulator, aka GPR89
- GTG
GPCR type G protein, aka GPR89
- HP
alpha helix hairpin structure
- HPloop
Short turn connecting the HP helices.
- LIMR
Lipocalin-1-interacting membrane receptor, aka LMBR1
- LMBR
Limb region 1 protein homolog
- MFS
Major Facilitator Superfamily
- ML
Maximum Likelihood
- MSA
Multiple sequences alignment
- MYA
Million years ago
- SLC
Solute carrier transporter
- SAR
Stramenopiles, Alveolata and Rhizaria lineages. Major clade of eukaryotic organisms
- UPGMA
Unweighted pair group method with arithmetic mean
Footnotes
Supplementary data associated with this article can be found in the online version at doi:10.1016/j.csbj.2025.11.003.
Appendix A. Supplementary material
Supplementary material
Supplementary material
Supplementary material
Supplementary material
Supplementary material
Supplementary material
References
- 1.Alexander S.P., Fabbro D., Kelly E., Mathie A.A., Peters J.A., Veale E.L., et al. The concise guide to PHARMACOLOGY 2023/24: transporters. Br J Phar. 2023;180:S374–S469. doi: 10.1111/bph.16182. [DOI] [PubMed] [Google Scholar]
- 2.Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 3.Ashkenazy H., Abadi S., Martz E., Chay O., Mayrose I., Pupko T., et al. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 2016;44:W344–W350. doi: 10.1093/nar/gkw408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bai X., Moraes T.F., Reithmeier R.A.F. Structural biology of solute carrier (SLC) membrane transport proteins. Mol Membr Biol. 2017;34:1–32. doi: 10.1080/09687688.2018.1448123. [DOI] [PubMed] [Google Scholar]
- 5.Bergsten J. A review of long-branch attraction. Cladistics. 2005;21:163–193. doi: 10.1111/j.1096-0031.2005.00059.x. [DOI] [PubMed] [Google Scholar]
- 6.Berman H.M., Westbrook J., Feng Z., Gilliland G., Bhat T.N., Weissig H., et al. The protein data bank. Nucleic Acids Res. 2000;28(1):235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Burki F., Roger A.J., Brown M.W., Simpson A.G.B. The New Tree of Eukaryotes. Trends Ecol Evol. 2020;35:43–55. doi: 10.1016/j.tree.2019.08.008. [DOI] [PubMed] [Google Scholar]
- 8.Cabezas-Bratesco D., Mcgee F.A., Colenso C.K., Zavala K., Granata D., Carnevale V., et al. Sequence and structural conservation reveal fingerprint residues in TRP channels. eLife. 2022;11 doi: 10.7554/eLife.73645. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Charroux B., Royet J. Mutations in the Drosophila ortholog of the vertebrate Golgi pH regulator (GPHR) protein disturb endoplasmic reticulum and Golgi organization and affect systemic growth. Biol Open. 2013;3:72–80. doi: 10.1242/bio.20137187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Chen Z., Zhang X., Yu W., Luo Q. A comprehensive benchmarking study of protein structure alignment tools based on downstream task performance. bioRxiv. 2025;03:2025. doi: 10.1101/2025.03.11.642719. [DOI] [Google Scholar]
- 11.Choi J.H., Zhong X., McAlpine W., Liao T.C., Zhang D., Fang B., et al. LMBR1L regulates lymphopoiesis through Wnt/β-catenin signaling. Science. 2019;364(6440) doi: 10.1126/science.aau0812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Colas C., Ung P.M.U., Schlessinger A. SLC transporters: structure, function, and drug discovery. Med Chem Comm. 2016;7(6):1069–1081. doi: 10.1039/C6MD00005C. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Deckstein J., van Appeldorn J., Tsangarides M., Yiannakou K., Müller R., Stumpf M., et al. The Dictyostelium discoideum GPHR Ortholog Is an Endoplasmic Reticulum and Golgi Protein with Roles during Development. Eukaryot Cell. 2015;14:41–54. doi: 10.1128/EC.00208-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Deme J.C., Hancock M.A., Xia X., Shintre C.A., Plesa M., Kim J.C., et al. Purification and interaction analyses of two human lysosomal vitamin B12 transporters: LMBD1 and ABCD4. Mol Membr Biol. 2014;31(7–8):250–261. doi: 10.3109/09687688.2014.990998. [DOI] [PubMed] [Google Scholar]
- 15.Dennis M.Y., Harshman L., Nelson B.J., Penn O., Cantsilieris S., Huddleston J., et al. The evolution and population diversity of human-specific segmental duplications. Nat Ecol Evol. 2017;1 doi: 10.1038/s41559-016-0069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Dong H., Yan S., Liu J., Liu P., Sun J. TaCOLD1 defines a new regulator of plant height in bread wheat. Plant Biotechnol J. 2019;17:687–699. doi: 10.1111/pbi.13008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Drew D., Boudker O. Shared Molecular Mechanisms of Membrane Transporters. Annu Rev Biochem. 2016;85:543–572. doi: 10.1146/annurev-biochem-060815-014520. [DOI] [PubMed] [Google Scholar]
- 18.Felsenstein J. Cases in which Parsimony or Compatibility Methods Will be Positively Misleading. Syst Zool. 1978;27:401–410. doi: 10.2307/2412923. [DOI] [Google Scholar]
- 19.Ferrada E., Superti-Furga G. A structure and evolutionary-based classification of solute carriers. iScience. 2022;25 doi: 10.1016/j.isci.2022.105096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Fluckinger M., Merschak P., Hermann M., Haertlé T., Redl B. Lipocalin-interacting-membrane-receptor (LIMR) mediates cellular internalization of β-lactoglobulin. Biochim Biophys Acta (BBA)Biomembr. 2008;1778(1):342–347. doi: 10.1016/j.bbamem.2007.10.010. [DOI] [PubMed] [Google Scholar]
- 21.Glaser F., Pupko T., Paz I., Bell R.E., Bechor-Shental D., Martz E., et al. ConSurf: Identification of Functional Regions in Proteins by Surface-Mapping of Phylogenetic Information. Bioinformatics. 2003;19:163–164. doi: 10.1093/bioinformatics/19.1.163. [DOI] [PubMed] [Google Scholar]
- 22.Goddard T.D., Huang C.C., Meng E.C., Pettersen E.F., Couch G.S., Morris J.H., et al. UCSF ChimeraX: Meeting modern challenges in visualization and analysis. Prot Sci Publ Prot Soc. 2018;27:14–25. doi: 10.1002/pro.3235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Guler Kara H., Dogan E., Bozok V., Aktan C., Cakiroglu E., Eroglu Z., et al. The G protein-coupled receptor GPR89A is a novel potential therapeutic target to overcome cisplatin resistance in NSCLC Calu1 cells. FEBS J. 2025;292(14):3755. doi: 10.1111/febs.70099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Hallgren J., Tsirigos K.D., Pedersen M.D., Armenteros J.J.A., Marcatili P., Nielsen H., et al. DeepTMHMM predicts alpha and beta transmembrane proteins using deep neural networks. bioRxiv 2022. 2022;04(08) doi: 10.1101/2022.04.08.487609. [DOI] [Google Scholar]
- 25.Hamamsy T., Morton J.T., Blackwell R., et al. Protein remote homology detection and structural alignment using deep learning. Nat Biotechnol. 2024;42:975–985. doi: 10.1038/s41587-023-01917-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hediger M.A., Romero M.F., Peng J.B., et al. The ABCs of solute carriers: physiological, pathological and therapeutic implications of human membrane transport proteins. Pflug Arch Eur J Physiol. 2004;447:465–468. doi: 10.1007/s00424-003-1192-y. [DOI] [PubMed] [Google Scholar]
- 27.Herrero J., Muffato M., Beal K., Fitzgerald S., Gordon L., Pignatelli M., et al. Ensembl comparative genomics resources. Database. 2016;2016 doi: 10.1093/database/baw053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Holm L., Laiho A., Törönen P., Salgado M. DALI shines a light on remote homologs: One hundred discoveries. Prot Sci. 2023;32(1) doi: 10.1002/pro.4519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Illergård K., Ardell D.H., Elofsson A. Structure is three to ten times more conserved than sequence—a study of structural response in protein cores. Prot Struct Funct Bioinform. 2009;77(3):499–508. doi: 10.1002/prot.22458. [DOI] [PubMed] [Google Scholar]
- 30.Jaffé F.W., Freschet G.-E.C., Valdes B.M., Runions J., Terry M.J., Williams L.E. G Protein–Coupled Receptor-Type G Proteins Are Required for Light-Dependent Seedling Growth and Fertility in Arabidopsis. Plant Cell. 2012;24:3649–3668. doi: 10.1105/tpc.112.098681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kalyaanamoorthy S., Minh B.Q., Wong T.K.F., von Haeseler A., Jermiin L.S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Katoh K., Rozewicki J., Yamada K.D. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20:1160–1166. doi: 10.1093/bib/bbx108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Katoh K., Standley D.M. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol Biol Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kawaguchi K., Okamoto T., Morita M., et al. Translocation of the ABC transporter ABCD4 from the endoplasmic reticulum to lysosomes requires the escort protein LMBD1. Sci Rep. 2016;6 doi: 10.1038/srep30183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Khosrowabadi E., Kellokumpu S. Golgi pH and ion homeostasis in health and disease. Organe Dis. 2020:1–23. doi: 10.1007/112_2020_49. [DOI] [PubMed] [Google Scholar]
- 37.Kumar S., Suleski M., Craig J.M., Kasprowicz A.E., Sanderford M., Li M., et al. TimeTree 5: An Expanded Resource for Species Divergence Times. Mol Biol Evol. 2022;39 doi: 10.1093/molbev/msac174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kuwabara M.F., Haddad B.G., Lenz-Schwab D., Hartmann J., Longo P., Huckschlag B.M., et al. Elevator-like movements of prestin mediate outer hair cell electromotility. Nat Commun. 2023;14(1):7145. doi: 10.1038/s41467-023-42489-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Levy A.A., Feldman M. Evolution and origin of bread wheat. Plant Cell. 2022;34:2549–2567. doi: 10.1093/plcell/koac130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Liu W., Jiang X., Li X., Sun K., Yang Y., Yang M., et al. LMBR1L regulates the proliferation and migration of endothelial cells through Norrin/β-catenin signaling. J Cell Sci. 2022;135(6) doi: 10.1242/jcs.259468. [DOI] [PubMed] [Google Scholar]
- 41.Liu C., Wu Y., Liu Y., Yang L., Dong R., Jiang L., et al. Genome-wide analysis of tandem duplicated genes and their contribution to stress resistance in pigeonpea (Cajanus cajan) Genomics. 2021;113:728–735. doi: 10.1016/j.ygeno.2020.10.003. [DOI] [PubMed] [Google Scholar]
- 42.Ma Y., Dai X., Xu Y., Luo W., Zheng X., Zeng D., et al. COLD1 Confers Chilling Tolerance in Rice. Cell. 2015;160:1209–1221. doi: 10.1016/j.cell.2015.01.046. [DOI] [PubMed] [Google Scholar]
- 43.Maeda Y., Ide T., Koike M., Uchiyama Y., Kinoshita T. GPHR is a novel anion channel critical for acidification and functions of the Golgi apparatus. Nat Cell Biol. 2008;10:1135–1145. doi: 10.1038/ncb1773. [DOI] [PubMed] [Google Scholar]
- 44.Minh B.Q., Schmidt H.A., Chernomor O., Schrempf D., Woodhams M.D., von Haeseler A., et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol. 2020;37:1530–1534. doi: 10.1093/molbev/msaa015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Mojib N., Kubanek J. Comparative transcriptomics supports the presence of G protein-coupled receptor-based signaling in unicellular marine eukaryotes. Limnol Oceano. 2020;65:762–774. doi: 10.1002/lno.11345. [DOI] [Google Scholar]
- 46.Nguyen N.T.T., Vincens P., Dufayard J.F., Roest Crollius H., Louis A. Genomicus in 2022: comparative tools for thousands of genomes and reconstructed ancestors. Nucleic Acids Res. 2022;50:D1025–D1031. doi: 10.1093/nar/gkab1091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Paek J., Kalocsay M., Staus D.P., Wingler L., Pascolutti R., Paulo J.A., et al. Multidimensional tracking of GPCR signaling via peroxidase-catalyzed proximity labeling. Cell. 2017;169(2):338–349. doi: 10.1016/j.cell.2017.03.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Panchy N., Lehti-Shiu M., Shiu S.-H. Evolution of Gene Duplication in Plants. Plant Physiol. 2016;171:2294–2316. doi: 10.1104/pp.16.00523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Pandey S., Nelson D.C., Assmann S.M. Two Novel GPCR-Type G Proteins Are Abscisic Acid Receptors in Arabidopsis. Cell. 2009;136:136–148. doi: 10.1016/j.cell.2008.12.026. [DOI] [PubMed] [Google Scholar]
- 50.Pettersen E.F., Goddard T.D., Huang C.C., Meng E.C., Couch G.S., Croll T.I., et al. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Prot Sci Publ Prot Soc. 2021;30:70–82. doi: 10.1002/pro.3943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Pizzagalli M.D., Bensimon A., Superti-Furga G. A guide to plasma membrane solute carrier proteins. FEBS J. 2021;288:2784–2835. doi: 10.1111/febs.15531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Rivinoja A., Pujol F.M., Hassinen A., Kellokumpu S. Golgi pH, its regulation and roles in human disease. Ann Med. 2012;44:542–554. doi: 10.3109/07853890.2011.579150. [DOI] [PubMed] [Google Scholar]
- 53.Rojas F., Silvester E., Young J., Milne R., Tettey M., Houston D.R., et al. Oligopeptide Signaling through TbGPR89 Drives Trypanosome Quorum Sensing. Cell. 2019;176:306–317.e16. doi: 10.1016/j.cell.2018.10.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Rutsch F., Gailus S., Miousse I., et al. Identification of a putative lysosomal cobalamin exporter altered in the cblF defect of vitamin B12 metabolism. Nat Genet. 2009;41:234–239. doi: 10.1038/ng.294. [DOI] [PubMed] [Google Scholar]
- 55.Soltis P.S., Soltis D.E. Ancient WGD events as drivers of key innovations in angiosperms. Curr Opin Plant Biol. 2016;30:159–165. doi: 10.1016/j.pbi.2016.03.015. [DOI] [PubMed] [Google Scholar]
- 56.Sou Y., Kakuta S., Kamikubo Y., Niisato K., Sakurai T., Parajuli L.K., et al. Cerebellar Neurodegeneration and Neuronal Circuit Remodeling in Golgi pH Regulator-Deficient Mice. eNeuro. 2019;6 doi: 10.1523/ENEURO.0427-18.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Sou Y., Yamaguchi J., Kameda H., Masuda K., Maeda Y., Uchiyama Y., et al. GPHR-mediated acidification of the Golgi lumen is essential for cholesterol biosynthesis in the brain. FEBS Lett. 2022;596:2873–2888. doi: 10.1002/1873-3468.14491. [DOI] [PubMed] [Google Scholar]
- 58.Stecher G., Tamura K., Kumar S. Molecular Evolutionary Genetics Analysis (MEGA) for macOS. Mol Biol Evol. 2020;37:1237–1239. doi: 10.1093/molbev/msz312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Tareen A., Kinney J.B. Logomaker: Beautiful sequence logos in Python. Bioinformatics. 2020;36:2272–2274. doi: 10.1093/bioinformatics/btz921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Tarutani M., Nakajima K., Uchida Y., Takaishi M., Goto-Inoue N., Ikawa M., et al. GPHR-Dependent Functions of the Golgi Apparatus Are Essential for the Formation of Lamellar Granules and the Skin Barrier. J Invest Dermatol. 2012;132:2019–2025. doi: 10.1038/jid.2012.100. [DOI] [PubMed] [Google Scholar]
- 61.Thompson R.J., Nordeen M.H., Howell K.E., Caldwell J.H. A Large-Conductance Anion Channel of the Golgi Complex. Biophys J. 2002;83:278–289. doi: 10.1016/S0006-3495(02)75168-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Tweedie S., Braschi B., Gray K., Jones T.E.M., Seal R.L., Yates B., et al. Genenames.org: the HGNC and VGNC resources in 2021. Nucleic Acids Res. 2020;49:D939–D946. doi: 10.1093/nar/gkaa980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Van Bel M., Silvestri F., Weitz E.M., Kreft L., Botzki A., Coppens F., et al. PLAZA 5.0: Extending the scope and power of comparative and functional genomics in plants. Nucleic Acids Res. 2022;50:D1468–D1474. doi: 10.1093/nar/gkab1024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Varadi M., Anyango S., Deshpande M., Nair S., Natassia C., Yordanova G., et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022;50:D439–D444. doi: 10.1093/nar/gkab1061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Wang W., Peng Z., Yang J. Single-sequence protein structure prediction using supervised transformer protein language models. Nat Comput Sci. 2022;2(12):804–814. doi: 10.1038/s43588-022-00373-3. [DOI] [PubMed] [Google Scholar]
- 66.Wojnar P., Lechner M., Redl B. Antisense down-regulation of lipocalin-interacting membrane receptor expression inhibits cellular internalization of lipocalin-1 in human NT2 cells. J Biol Chem. 2003;278(18):16209–16215. doi: 10.1074/jbc.M210922200. [DOI] [PubMed] [Google Scholar]
- 67.Xu J., Zhang Y. How significant is a protein structure similarity with TM-score= 0.5? Bioinformatics. 2010;26(7):889–895. doi: 10.1093/bioinformatics/btq066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Zhang Z., Ma X., Liu Y., Yang L., Shi X., Wang H., et al. Origin and evolution of green plants in the light of key evolutionary events. J Integr Plant Biol. 2022;64:516–535. doi: 10.1111/jipb.13224. [DOI] [PubMed] [Google Scholar]
- 69.Zhang Y., Skolnick J. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 2005;33(7):2302–2309. doi: 10.1093/nar/gki524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Zhang X., Zhang Z., Peng H., Wang Z., Li H., Duan Y., et al. GPCR-like Protein ZmCOLD1 Regulate Plant Height in an ABA Manner. Int J Mol Sci. 2024;25:11755. doi: 10.3390/ijms252111755. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary material
Supplementary material
Supplementary material
Supplementary material
Supplementary material
Supplementary material





