Abstract
Uroplakins are a widespread group of vertebrate integral membrane proteins that belong to two different families: UPK1a and UPK1b belong to the large tetraspanin (TSPAN) gene family, and UPK3a, UPK3b, UPK3c, UPK3d, UPK2a and UPK2b form a family of their own, the UPK2/3 tetraspanin-associated family. In a previous study, we reported that uroplakins first appeared in vertebrates, and that uroplakin tetraspanins (UPK1a and UPK1b) should have originated by duplication of an ancestor tetraspanin gene. However, the evolutionary origin of the UPK2/3 family remains unclear. In this study, we provide evidence that the UPK2/3 family originated by gene duplication and domain loss from a protoPTPRQ-like basal deuterostome gene. PTPRQs are members of the subtype R3 tyrosine phosphatase receptor (R3 PTPR) family, which are characterized by having a unique modular composition of extracellular fibronectin (FN3) repeats, a transmembrane helix, and a single intra-cytoplasmic phosphotyrosine phophatase (PTP) domain. Our assumption of a deuterostome protoPTPRQ-like gene as an ancestor of the UPK2/3 family by gene duplication and loss of its PTP and fibronectin (FN3) domains, excluding the one closest to the transmembrane helix, is based on the following: (i) phylogenetic analyses, (ii) the existence of an identical intron/exon gene pattern between UPK2/3 and the corresponding genetic region in R3 PTPRs, (iii) the conservation of cysteine patterns and protein motifs between UPK2/3 and PTPRQ proteins and, (iv) the existence in tunicates, the closest organisms to vertebrates, of two sequences related to PTPRQ; one with the full subtype R3 modular characteristic and another without the PTP domain but with a short cytoplasmic tail with some sequence similarity to that of UPK3a. This finding will facilitate further studies on the structure and function of these important proteins with implications in human diseases.
Introduction
Uroplakins (UPKs) are integral membrane proteins belonging to two families: UPK1a (UPIa) and UPK1b (UPIb) are members of the four-transmembrane domain tetraspanin (TSPAN or TM4SF) family [1,2]; and the single-spanning transmembrane uroplakins UPK2a (also known as UPII, encoded by the UPK2 gene), UPK2b (non-mammalian vertebrates), UPK3a (UPIIIa, UPK3A), UPK3b (UPIIIb, UPK3B), UPK3c (UPK3BL; present in mammals, reptiles and birds) and UPK3d (bony fish) form the tetraspanin-associated uroplakin family, UPK2/3 [3–7].
UPKs play important roles in the expansion and stabilization of the urothelial apical surface and contribute to the urothelium permeability barrier function in mammals [8]. Their protein structure, function and involvement in urinary tract development and malformations are the subject of several reports and reviews [5,7,9–16].
In a previous report, we showed that both the tetraspanin uroplakins and the UPK2/3 families first appeared in the common ancestor of vertebrates [7]. While the origin of UPK1a and UPK1b can be explained by gene duplication from an ancestor invertebrate tetraspanin gene [2,5,17], the origin of the UPK2/3 family is not yet established as it does not appear to be related to any known protein family [7].
PTPRQ is a member of the large family of phospho-tyrosine phosphatases that, along with the protein tyrosine kinases, regulates the level of phospho-tyrosine modifications in cells [18,19]. Specifically, PTPRQ belongs to subtype R3 phospho-tyrosine phosphatase receptors (R3 PTPRs), which like UPK2/3 uroplakins are single-spanning membrane proteins. However they are much larger than UPK2/3 proteins, which consist of an extracellular moiety of less than 200 amino acids and a short cytoplasmic tail, absent in UPK2a and UPK2b [3,7,20]. R3 PTPRs have a single cytoplasmic catalytic phophotyrosine phosphatase (PTP) domain and an extracellular region composed exclusively by fibronectin type III (FN3) domains (9 to 19 in humans). In vertebrates, R3 PTPRs comprise five members: PTPRB (VE-PTP), PTPRH (SAP-1), PTPRJ (DEP-1), PTPRO (GLEPP1) and PTPRQ (PTPS31) [21–23].
In this study, we show that UPK2/3 proteins are related to and are probably derived from an ancestor R3 PTPRQ-like gene in invertebrates. This finding is based on the results of phylogenetic analyses with UPK2/3 and R3 PTPR DNA and protein sequences of vertebrates and invertebrates, and the identity of the intron/exon gene structures and the conservation of several cysteines pairs and protein motifs.
Materials and Methods
Data mining and sequence analysis
Data mining was performed as described [2,21,24]. Sequences with the R3 PTPR characteristics of only FN3 domains and a single PTP catalytic domain were retrieved from major taxonomic groups of organisms by searching in online genomic databases and scientific reports [23–25]. The SMART server [26] and the NCBI CDD database [27] were used to determine the domain composition of the proteins. Since evolutionarily related genes have more conserved intron positions than merely similar sequences [28,29], intron-exon borders were determined as detailed in Garcia-España et al. [30] using the “align two sequences” option of the NCBI BLAST program [31]. Splice consensus signals were then manually annotated. All R3 PTPRs sequences used in this study plus organisms’ scientific names, accession numbers and structural characteristics are shown in S1 File. The UPK2/3 sequences used in this study have been reported elsewhere [5,7].
Protein sequence alignments
All protein sequence alignments were performed using the MAFFT server [32] or the ClustalW and the Multalin programs at the NPS@: Network Protein Sequence Analysis [33]. The NCBI TBLASTN program was used in protein profiling [31].
Phylogenetic analyses
We compared the sequences from representative vertebrates (Homo sapiens, Mus musculus, Gallus gallus, Xenopus tropicalis, and Danio rerio), invertebrate deuterostomes (Strongylocentrotus purpuratus, Saccoglossus kowalevskii, Ciona intestinallis and Ciona savignyi), protostomes (Capitella teleta, Drosophila melanogaster and Caenorhabditis elegans), and a porifera (Amphimedon queenslandica). In addition, we compared the corresponding regions of representative R2A subtype leukocyte common antigen-related (PTPRF) phosphatases and PTPRF from Monosiga brevicollis as outgroup. Alignments were generated using TranslatorX (with the MAFFT option and default settings) [34]. In this way, both DNA and protein sequence matrices were produced.
Examination of the DNA sequence matrices suggested extreme saturation of the data. Hence, we focused on using the amino acid sequences to infer the phylogenetic relationships of the various gene family members. We also coded the molecular motifs and intron positions for these genes as characters. This resulted in six binary characters (all scored as present = G, absent = T) C1–C2 (cysteine 1, 2), C3–C4 (cysteine 3, 4), intron 1, intron 2, intron 3 and intron 4. The present/absent characters were coded as G and T respectively to allow Bayesian and maximum parsimony analysis of combined amino acid sequence and molecular motif data. We generated Bayesian trees using MrBayes [35] for the amino acid sequences alone, the molecular motif (MM) data alone and for the two partitions combined (using a partitioned analysis with parameters for the amino acid sequences as below and the parsimony setting in MrBayes for the MM data). For Bayesian analysis of amino acid sequences, the mixed model using the “prset aamodelpr = mixed” command with 500000 generations (at which point the 2 analyses of MrBayes converged; split standard deviation = 0.0105) was used. Parsimony bootstraps were conducted using 1000 replicates of random taxon addition and TBR branch swapping in PAUP* [36]. The tree we present is from a Bayesian analysis of the combined matrix. Other trees generated using protein sequences alone, MM characters alone and bootstrap maximum parsimony analyses for combined and MM alone and protein alone are shown in S1 Fig. All matrices used in the analyses are given in S2 File.
Tyrosine phophorylation and furin cleavage sites prediction
Putative phosporylation residues and furin cleavage sites were predicted with NetPhos 2.0 and ProP 1.0 Servers [37,38].
Results and Discussion
UPK2/3 uroplakins conserve intron/exon and cysteine patterns
Our earlier phylogenetic analyses showed that UPK2/3 uroplakins probably originated from an unknown ancestor gene by a major duplication event in the first stages of vertebrate evolution [5,7]. UPK2/3 overall amino acid sequence identity is low. Their orthology is also shown by the following: (i) the location of their intron/exon junctions and the phases of the introns are conserved (phases 1, 1, 2, 1, 2 in the marked introns 1 to 5 in Fig 1 and S2 Fig; phase 1 and 2 denotes that the intron is located between the 1st and 2nd or 2nd and 3rd nucleotides of the codon, respectively), with the exception of intron 5, which is lost in UPK2 genes since they lack the exon coding for the cytoplasmic tail (Fig 1 and S2 Fig); and (ii) they share a set of highly conserved extracellular cysteine residues (C1–C4 in Fig 1 and S2 Fig). Thus UPK3a, UPK3b, UPK3c, UPK3d and UPK2b of reptiles and birds contain the four cysteines, but only UPK3c of birds and reptiles contains C1 and C2, UPK2a contains only C1 and C2 and UPK3d contains an additional pair (Fig 1 and S2 Fig). The above observations indicate that UPK2/3 uroplakins are variations on a common structural theme with UPK2a and UPK2b genes losing the cytoplasmic tail, UPK3d gaining a pair of cyteines, and UPK2a and UPK3c losing cysteine pairs C3 and C4 and C1 and C2, respectively.
UPK2/3s and R3 PTPRs share similar extracellular juxtamembrane and transmembrane regions
To study the evolutionary origin of the UPK2/3 genes, we performed searches in protein and genomic databases with several UPK2/3 sequences as bait. We first found that the UPK3a human sequence (except for the N-terminal signal peptide and the C-terminal cytoplasmic tail) was similar to the juxtamembrane long FN3-like and transmembrane domains of the human R3 phosphatase receptor PTPRQ between amino acids 1744 and 1928 (S3 Fig) [39]. This similarity was also evident in PTPRB and to a lesser extent in PTPRJ, but not in the other human R3 PTPR members PTPRO and PTPRH, which lacked the longer FN3-like domain located between introns 1 and 4 (Fig 2).
Moreover, these regions are encoded by 4 exons of similar lengths flanked by introns in phases 1, 1, 2, 1, 2 in PTPRQ, PTPRB and PTPRJ as in UPK2/3 genes (Fig 2) In addition, PTPRQ and PTPRB have also the four characteristic UPK2/3 C1–C4 cysteine residues (Fig 2).
UPK2/3 and R3 PTPR long juxtamembrane FN3 domain is conserved in vertebrate and invertebrate organisms
By searching genomic databases and scientific reports (see Materials and Methods section), we retrieved full sequences of R3 PTPR genes, which can be differentiated from other PTPR receptor subtypes (R1, R2, R4–R8) by their unique domain composition consisting of only FN3 domains in the extracellular moiety and a single catalytic (PTP) domain in the cytoplasmic region [21]. We obtained R3 PTPR sequences from representative organisms covering a broad evolutionary range, from sponges to mammals (S1 File).
Interestingly, in tunicates, the closest living organisms to vertebrates [40], we retrieved one R3-related sequence without a PTP domain, (CionaNocat.1 and CionaNocat.2 from Ciona intestinalis and Ciona savignyi, respectively) in addition to another sequence with the full R3 specific modular characteristics (Ciona.2 and Ciona from Ciona intestinalis and Ciona savignyi, respectively).
All these invertebrate R3 PTPR-related sequences contained similar regions to the UPK2/3 genes with the exception of only Ciona1 and Oikopleura1 sequences (S1 File).
The alignment of these long FN3-like regions of vertebrate and invertebrate R3 sequences revealed several highly conserved features between R3 PTPRs and UPK2/3 uroplakins: (i) the flanking introns 1 and 4 were already present in sponges (porifera), while introns 2 and 3 are a deuterostome acquisition (Fig 3 and S4 Fig); (ii) cysteines C3 and C4 are present in all organisms from sponges to mammals and cysteines C1 and C2 only from tunicates (Fig 3); (iii) seven short fragments (marked 1 to 7 in Fig 3) are conserved in all sequences; of these, motif 5 (C3X(n)C4NGPL signature where X represents any amino acid) is particularly highly conserved and was found only in UPK2/3 and R3 PTPR proteins. These results indicate that the long FN3-like domain close to the TMD, as defined by several highly conserved sequence and genomic features, is present in the R3 PTPR sequence of sponges and in almost all invertebrates including the sequences without PTP domain in tunicates.
Moreover, the cytoplasmic tails of these sequences without PTP domains (CionaNocat.1 and CionaNocat.2) have retained some sequence similarity with those of UPK3a genes (Fig 4), although vertebrates and tunicates diverged over 700 MYA [41]. This similarity includes two putatively phosphorylatable residues highly conserved in UPK3 genes (Fig 4): the Tyr249 in Xenopus that is highly conserved in vertebrate UPK3a genes and is transiently phosphorylated during Xenopus oocyte fertilization [16], and the human Thr244 that is highly conserved as a Ser/Thr in vertebrate UPK3a and 3b genes and is phosphorylated by a serine/threonine kinase following uropathogenic Escherichia coli adhesion to the urothelium [42] (Fig 4).
Phylogenetic analyses indicate that UPK3 genes are evolutionarily related to PTPRQ
In view of the above observations, we examined the evolutionary relationship between UPK2/3 and R3 PTPR genes by phylogenetic analyses using the long FN3-like and TMD domain, shared regions between these proteins. Using the protein matrix, the molecular morphology matrix and a matrix that combines the two and doing bootstrap analysis on all parsimony based analyses resulted in eight separate trees that we present in S4 Fig. Of these eight trees, six support the inclusion of UPKs as embedded in the PTPR family and one is unresolved (protein only with MP and bootstrap). One analysis (protein only with MP) pushed the UPKs outside of the PTPRs. Of all the analyses in the table this one would be most prone to the simplification of the parsimony model. Since the Bayesian analysis of the combined matrix (Fig 5) incorporated a model of sequence change we suggest that for the most part it is the most appropriate type of analysis for the data set we used. Fig 5 and S4 Fig summarize the phylogenetic analyses of these regions of vertebrate PTPRB, PTPRQ, UPK3a, and UPK3b and invertebrate R3 PTPR genes from representative groups of organisms. We present only the analysis performed with protein sequences combined with the molecular motifs (MM) characters since DNA sequences were saturated and could not be resolved in the analysis (see Materials and Methods section). The amino acid sequences combined with MM characters (presence/absence of introns and cysteines) placed the UPK3 and PTPRQ genes as sister groups in Bayesian analysis (Fig 5 and S1 Fig). Other analyses were ambiguous as to which PTPR was sister to the UPK3 genes except for the protein plus MM maximum parsimony analysis. The important result with respect to UPKs is that all trees ruled out PTPRF (from subtype R2A PTPR) as a potential sister group (Fig 5 and S1 Fig) except for the analysis of protein with MP bootsrap.
In addition to showing that UPK3a and UPK3b are most likely sister groups of the PTPRQ genes, the phylogenetic analysis demonstrated that UPK3 genes are also closely related to the Ciona sequence with a PTP catalytic domain (ciona and ciona2) and the Ciona sequence without a PTP domain (CionaNocat.1 and 2). These sequences without the catalytic domain and with only five FN3 domains closely resemble the intermediate stage in the transition of a PTPRQ-like gene to a UPK2/3 gene in the common ancestor of tunicates and vertebrates.
Taken together, the data suggest that UPK2/3 uroplakins originated by gene duplication, a common mechanism for the generation of new genes [43,44], coupled with domain loss from a PTPRQ precursor gene in the common ancestor of vertebrates over 700 MYA when vertebrates radiated from tunicates [41].
Structure and function of the juxtamembarene FN3-like domain
Phosphotyrosine phosphatase receptors (PTPR) play important roles in cell-to-cell interaction and signaling [18,19]. The diversity of these proteins is achieved principally through changes in the extracellular, non-catalytic region, of the receptors [19]. Although the number of the FN3 domains in R3 PTPR genes is variable (8 to 18) [42], the long FN3-like region close to the transmembrane helix in R3 PTPRs is highly conserved from sponges to mammals, suggesting that this domain performs essential functions. Indeed, the biological significance of this domain is demonstrated by the observation that in angiosarcomas, 36% of the inactivating mutations of the recessive cancer gene PTPRB are over-represented in the long FN3-like region, which represents only 7.5% of the protein sequence [25].
An interesting result from these analyses is that since cysteines that form disulfide bridges are almost always gained or lost as in pairs during protein evolution [45], the emergence of C1 and C2 in the R3 PTPR sequences of Ciona (urochordates) coupled to the specific loss of C3 and C4 in UPK2a and of C1 and C2 in UPK3c (Figs 1 and 3 and S2 Fig), suggest that C1 forms a disulfide bridge with C2 and C3 forms a disulfide bridge with C4, in situ. Additionally, an important feature of UPK2a is its prosequence that is cleaved after cysteine C2, at an Rx(R/K)R furin motif. It has been suggested that this prosequence blocks the oligomerization of the six UPK1a/2a-UPK1b/3a heterotetramers, thus preventing them from forming a 16-nm particle [3,11,46]. This UPK2a furin site is also conserved in some UPK2b proteins and in many invertebrate R3 PTPR proteins (ciona, ciona 2, cionanocat, acornworm, acornworm1, sea urchin1, annelid, PTP52 F from Drosophila melanogaster and the sponge protein from Amphimedon queenslandica (S5 Fig). These UPK2a cleaved fragments probably remain bound to the tetraspanin uroplakin UPK1a partner [12]. Proteolytic cleavage of proteins close to the transmembrane domain without the release of the cleaved fragment could introduce flexibility in the protein proximal region and has been observed in FN3 domains of other receptors PTPs including PTPRμ and PTPRκ that are processed in the trans-Golgi network by subtilin-like convertases [47].
Tetraspanins and many kinases involved in cell signaling are enriched in lipid rafts [48]. It is possible that the common ancestral protein of PTPR and UPK2/3 genes were clustered in lipid raft domains where they interacted with a UPK1a/1b tetraspanin precursor, followed by co-evolution between these two uroplakin families. This idea is supported by the reported association of an unknown tyrosine phosphatase with CD53 and CD63 in lymph node cells [49].
Conclusions
The discovery that UPK2/3 proteins are most likely derived from an ancestral PTPRQ gene by gene duplication and domain loss and that UPK2/3 and R3 PTPR genes share a juxtamembrane long FN3 domain with conserved structural characteristics will facilitate further studies on the structure and function of these important proteins with implications in human disease.
Supporting Information
Acknowledgments
This work was supported by MICINN grant CONSOLIDER INGENIO CSD00065 (A.G.E), co-financed by the European Regional Development Fund (ERDF), and by NIH grants DK52206 and DK39753 (T.T.S). A.G.E. is supported by the Research Stabilization Program of the ICSIII-ICS. We thank Sara García-España for assistance in generating some of the figures.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
Support was provided by Ministerio de Economía y Competitividad (MICINN) - CONSOLIDER 2010 CSD00065 (AGE) and the National Institutes of Health DK52206 and DK39753 (TTS). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Yu J, Lin JH, Wu XR, Sun TT. Uroplakins Ia and Ib, two major differentiation products of bladder epithelium, belong to a family of four transmembrane domain (4TM) proteins. J Cell Biol. 1994. April;125(1):171–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Garcia-Espana A, Chung PJ, Sarkar IN, Stiner E, Sun TT, Desalle R. Appearance of new tetraspanin genes during vertebrate evolution. Genomics. 2008. April;91(4):326–34. 10.1016/j.ygeno.2007.12.005 [DOI] [PubMed] [Google Scholar]
- 3.Lin JH, Wu XR, Kreibich G, Sun TT. Precursor sequence, processing, and urothelium-specific expression of a major 15-kDa protein subunit of asymmetric unit membrane. J Biol Chem. 1994. January 21;269(3):1775–84. [PubMed] [Google Scholar]
- 4.Wu XR, Lin JH, Walz T, Haner M, Yu J, Aebi U, et al. Mammalian uroplakins. A group of highly conserved urothelial differentiation-related membrane proteins. J Biol Chem. 1994. May 6;269(18):13716–24. [PubMed] [Google Scholar]
- 5.Garcia-Espana A, Chung PJ, Zhao X, Lee A, Pellicer A, Yu J, et al. Origin of the tetraspanin uroplakins and their co-evolution with associated proteins: implications for uroplakin structure and function. Mol Phylogenet Evol. 2006. November;41(2):355–67. 10.1016/j.ympev.2006.04.023 [DOI] [PubMed] [Google Scholar]
- 6.Sun T-T, Kreibich G, Pellicer A, Konh XP, Wu XR. Uroplakins as a unique tetraspanin network In: Berditchevski F and Rubinstein E editors. Tetraspanins. Springer; 2013; pp 299–320 [Google Scholar]
- 7.Desalle R, Chicote JU, Sun TT, Garcia-Espana A. Generation of divergent uroplakin tetraspanins and their partners during vertebrate evolution: identification of novel uroplakins. BMC Evol Biol. 2014;14:13 10.1186/1471-2148-14-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wu XR, Kong XP, Pellicer A, Kreibich G, Sun TT. Uroplakins in urothelial biology, function, and disease. Kidney Int. 2009. June;75(11):1153–65. 10.1038/ki.2009.73 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Deng FM, Liang FX, Tu L, Resing KA, Hu P, Supino M, et al. Uroplakin IIIb, a urothelial differentiation marker, dimerizes with uroplakin Ib as an early step of urothelial plaque assembly. J Cell Biol. 2002. November 25;159(4):685–94. 10.1083/jcb.200204102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Tu L, Sun TT, Kreibich G. Specific heterodimer formation is a prerequisite for uroplakins to exit from the endoplasmic reticulum. Mol Biol Cell. 2002. December;13(12):4221–30. 10.1091/mbc.E02-04-0211 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hu CC, Liang FX, Zhou G, Tu L, Tang CH, Zhou J, et al. Assembly of urothelial plaques: tetraspanin function in membrane protein trafficking. Mol Biol Cell. 2005. September;16(9):3937–50. 10.1091/mbc.E05-02-0136 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Min G, Wang H, Sun TT, Kong XP. Structural basis for tetraspanin functions as revealed by the cryo-EM structure of uroplakin complexes at 6-A resolution. J Cell Biol. 2006. June 19;173(6):975–83. 10.1083/jcb.200602086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Mitra S, Lukianov S, Ruiz WG, Cianciolo Cosentino C, Sanker S, Traub LM, et al. Requirement for a uroplakin 3a-like protein in the development of zebrafish pronephric tubule epithelial cell function, morphogenesis, and polarity. PLoS One. 2012;7(7):e41816 10.1371/journal.pone.0041816 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Jenkins D, Woolf AS. Uroplakins: new molecular players in the biology of urinary tract malformations. Kidney Int. 2007. February;71(3):195–200. 10.1038/sj.ki.5002053 [DOI] [PubMed] [Google Scholar]
- 15.Khandelwal P, Abraham SN, Apodaca G. Cell biology and physiology of the uroepithelium. Am J Physiol Renal Physiol. 2009. December;297(6):F1477–501. 10.1152/ajprenal.00327.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Mahbub Hasan AK, Hashimoto A, Maekawa Y, Matsumoto T, Kushima S, Ijiri TW, et al. The egg membrane microdomain-associated uroplakin III-Src system becomes functional during oocyte maturation and is required for bidirectional gamete signaling at fertilization in Xenopus laevis. Development. 2014. April;141(8):1705–14. 10.1242/dev.105510 [DOI] [PubMed] [Google Scholar]
- 17.DeSalle R, Mares R, Garcia-Espana A. Evolution of cysteine patterns in the large extracellular loop of tetraspanins from animals, fungi, plants and single-celled eukaryotes. Mol Phylogenet Evol. 2010. July;56(1):486–91. 10.1016/j.ympev.2010.02.015 [DOI] [PubMed] [Google Scholar]
- 18.Alonso A, Sasin J, Bottini N, Friedberg I, Friedberg I, Osterman A, et al. Protein tyrosine phosphatases in the human genome. Cell. 2004. June 11;117(6):699–711. 10.1016/j.cell.2004.05.018 [DOI] [PubMed] [Google Scholar]
- 19.Tonks NK. Protein tyrosine phosphatases—from housekeeping enzymes to master regulators of signal transduction. Febs J. 2013. January;280(2):346–78. 10.1111/febs.12077 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Wu XR, Sun TT. Molecular cloning of a 47 kDa tissue-specific and differentiation-dependent urothelial cell surface glycoprotein. J Cell Sci. 1993. September;106 (Pt 1):31–43. [DOI] [PubMed] [Google Scholar]
- 21.Andersen JN, Mortensen OH, Peters GH, Drake PG, Iversen LF, Olsen OH, et al. Structural and evolutionary relationships among protein tyrosine phosphatase domains. Mol Cell Biol. 2001. November;21(21):7117–36. 10.1128/MCB.21.21.7117-7136.2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hendriks WJ, Elson A, Harroch S, Pulido R, Stoker A, den Hertog J. Protein tyrosine phosphatases in health and disease. Febs J. 2013. January;280(2):708–30. 10.1111/febs.12000 [DOI] [PubMed] [Google Scholar]
- 23.van Eekelen M, Overvoorde J, van Rooijen C, den Hertog J. Identification and expression of the family of classical protein-tyrosine phosphatases in zebrafish. PLoS One. 2010;5(9):e12573 10.1371/journal.pone.0012573 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Camp E, Sanchez-Sanchez AV, Garcia-Espana A, Desalle R, Odqvist L, Enrique O'Connor J, et al. Nanog regulates proliferation during early fish development. Stem Cells. 2009. September;27(9):2081–91. 10.1002/stem.133 [DOI] [PubMed] [Google Scholar]
- 25.Byrum CA, Walton KD, Robertson AJ, Carbonneau S, Thomason RT, Coffman JA, et al. Protein tyrosine and serine-threonine phosphatases in the sea urchin, Strongylocentrotus purpuratus: identification and potential functions. Dev Biol. 2006. December 1;300(1):194–218. 10.1016/j.ydbio.2006.08.050 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Letunic I, Doerks T, Bork P. SMART: recent updates, new developments and status in 2015. Nucleic Acids Res. 2015. January;43(Database issue):D257–60. 10.1093/nar/gku949 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Marchler-Bauer A, Derbyshire MK, Gonzales NR, Lu S, Chitsaz F, Geer LY, et al. CDD: NCBI's conserved domain database. Nucleic Acids Res. 2015. January;43(Database issue):D222–6. 10.1093/nar/gku1221 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Irimia M, Roy SW. Spliceosomal introns as tools for genomic and evolutionary analysis. Nucleic Acids Res. 2008. March;36(5):1703–12. 10.1093/nar/gkn012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Henricson A, Forslund K, Sonnhammer EL. Orthology confers intron position conservation. BMC Genomics. July 02;11:412 10.1186/1471-2164-11-412 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Garcia-Espana A, Mares R, Sun TT, Desalle R. Intron evolution: testing hypotheses of intron evolution using the phylogenomics of tetraspanins. PLoS One. 2009;4(3):e4680 10.1371/journal.pone.0004680 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990. October 5;215(3):403–10. 10.1016/S0022-2836(05)80360-2 [DOI] [PubMed] [Google Scholar]
- 32.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013. April;30(4):772–80. 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Combet C, Blanchet C, Geourjon C, Deleage G. NPS@: network protein sequence analysis. Trends Biochem Sci. 2000. March;25(3):147–50. [DOI] [PubMed] [Google Scholar]
- 34.Abascal F, Zardoya R, Telford MJ. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic Acids Res. 2010. July;38(Web Server issue):W7–13. 10.1093/nar/gkq291 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Ronquist F, Huelsenbeck JP. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003. August 12;19(12):1572–4. [DOI] [PubMed] [Google Scholar]
- 36.Swofford DL. PAUP*. Phylogenetic analysis using parsimony (*and other methods). Version 4 2003. http://paup.csit.fsu.edu/ [Google Scholar]
- 37.Blom N, Gammeltoft S, Brunak S. Sequence and structure-based prediction of eukaryotic protein phosphorylation sites. J Mol Biol. 1999. December 17;294(5):1351–62. 10.1006/jmbi.1999.3310 [DOI] [PubMed] [Google Scholar]
- 38.Duckert P, Brunak S, Blom N. Prediction of proprotein convertase cleavage sites. Protein Eng Des Sel. 2004. January;17(1):107–12. 10.1093/protein/gzh013 [DOI] [PubMed] [Google Scholar]
- 39.Wright MB, Hugo C, Seifert R, Disteche CM, Bowen-Pope DF. Proliferating and migrating mesangial cells responding to injury express a novel receptor protein-tyrosine phosphatase in experimental mesangial proliferative glomerulonephritis. J Biol Chem. 1998. September 11;273(37):23929–37. [DOI] [PubMed] [Google Scholar]
- 40.Delsuc F, Brinkmann H, Chourrout D, Philippe H. Tunicates and not cephalochordates are the closest living relatives of vertebrates. Nature. 2006. February 23;439(7079):965–8. 10.1038/nature04336 [DOI] [PubMed] [Google Scholar]
- 41.Blair JE, Hedges SB. Molecular phylogeny and divergence times of deuterostome animals. Mol Biol Evol. 2005. November;22(11):2275–84. 10.1093/molbev/msi225 [DOI] [PubMed] [Google Scholar]
- 42.Ohno S. Gene duplication and the uniqueness of vertebrate genomes circa 1970–1999. Semin Cell Dev Biol. 1999. October;10(5):517–22. 10.1006/scdb.1999.0332 [DOI] [PubMed] [Google Scholar]
- 43.Thumbikat P, Berry RE, Zhou G, Billips BK, Yaggie RE, Zaichuk T, et al. Bacteria-induced uroplakin signaling mediates bladder response to infection. PLoS Pathog. 2009. May;5(5):e1000415 10.1371/journal.ppat.1000415 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Apic G, Gough J, Teichmann SA. Domain combinations in archaeal, eubacterial and eukaryotic proteomes. J Mol Biol. 2001. July 6;310(2):311–25. 10.1006/jmbi.2001.4776 [DOI] [PubMed] [Google Scholar]
- 45.Kreisberg R, Buchner V, Arad D. Paired natural cysteine mutation mapping: aid to constraining models of protein tertiary structure. Protein Sci. 1995. November;4(11):2405–10. 10.1002/pro.5560041119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hu CC, Bachmann T, Zhou G, Liang FX, Ghiso J, Kreibich G, et al. Assembly of a membrane receptor complex: roles of the uroplakin II prosequence in regulating uroplakin bacterial receptor oligomerization. Biochem J. 2008. September 1;414(2):195–203. 10.1042/BJ20080550 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Aricescu AR, Siebold C, Choudhuri K, Chang VT, Lu W, Davis SJ, et al. Structure of a tyrosine phosphatase adhesive interaction reveals a spacer-clamp mechanism. Science. 2007. August 31;317(5842):1217–20. 10.1126/science.1144646 [DOI] [PubMed] [Google Scholar]
- 48.Simons K, Gerl MJ. Revitalizing membrane rafts: new tools and insights. Nat Rev Mol Cell Biol. 2010. October;11(10):688–99. 10.1038/nrm2977 [DOI] [PubMed] [Google Scholar]
- 49.Carmo AM, Wright MD. Association of the transmembrane 4 superfamily molecule CD53 with a tyrosine phosphatase activity. Eur J Immunol. 1995. July;25(7):2090–5. 10.1002/eji.1830250743 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.