Abstract
Armadillo (ARM) repeat proteins function in various cellular processes including vesicular transport and membrane tethering. They contain an imperfect repeating sequence motif that forms a conserved three-dimensional structure. Recently, structural and functional insight into tethering mediated by the ARM-repeat protein p115 has been provided. Here we describe the p115 ARM motifs to clarify their nomenclature and show that both sequence and structure are highly conserved among ARM-repeat proteins. We argue that there is no need to invoke repeat types other than ARM repeats for a proper description of the structure of the p115 globular head region. Additionally, we propose to define a new subfamily of ARM-like proteins and find no evidence that the ARM motifs found in p115 are present in other long coiled-coil tethering factors of the golgin family.
Introduction
The armadillo (ARM) repeat motif is present in a variety of proteins. It was first described in the Drosophila segment-polarity gene product armadillo [1], the homolog of mammalian β-catenin, which is essential for cadherin-based cell adhesion and Wnt/Wingless growth factor signaling. Furthermore, β-catenin bridges the cytoplasmic domain of cadherins to α-catenin and the actin cytoskeleton [2], [3] and is associated with multiple diseases including cancer [4]–[7].
The presence and arrangement of ARM motifs differ among proteins, and it was suggested that these linked units comprise a structural domain described by a universal consensus sequence (Figure 1) [8]. The number of tandem ARM repeats in an ARM fold ranges from 6 to 12. Based on the organization of their ARM motifs, three major subfamilies of ARM-like proteins are distinguished, namely the classical catenins, the p120ctn-related catenins and the proteins involved in nuclear import [9], [10].
Murine β-catenin (residues 138–664) was the first ARM-repeat protein to have its structure solved [11], revealing that each ARM motif folds into a conserved three-dimensional structure consisting of three helices (H1, H2 and H3) that form a compact helical bundle with distinct features (Figures 1, 2). While H1 is the shortest helix, containing approximately two turns, helices H2 and H3 comprise about three and four turns, respectively. Helices H2 and H3 share extensive hydrophobic interactions and are oriented in an antiparallel fashion, whereas H1 lies almost perpendicular to the remaining helices. Importantly, all H3 helices within the ARM fold line the superhelical groove of the solenoid structure, whereas helices H1 and H2 are located at the cylindrical outer surface [11].
Canonical ARM repeats possess a sequence of about 42 amino acids. Generally, the sequence similarity between repeating ARM motifs within a single protein may be very low, but their similarity at the level of three-dimensional structure tends to be high.
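As a rough plausibility check on these numbers, the helix lengths quoted above can be converted into residue counts. The sketch below assumes the standard α-helix geometry of about 3.6 residues per turn, a textbook value that is not stated in the source; the turn counts and the 42-residue repeat length are the approximate figures given in the text, and all names are illustrative.

```python
RESIDUES_PER_TURN = 3.6  # assumption: ideal alpha-helix geometry, not from the source
REPEAT_LENGTH = 42       # approximate length of a canonical ARM repeat (from the text)

# Approximate number of turns per helix, as quoted in the text.
HELIX_TURNS = {"H1": 2, "H2": 3, "H3": 4}


def helical_residues(turns=HELIX_TURNS):
    """Estimate how many residues of one repeat sit in helices."""
    return sum(turns.values()) * RESIDUES_PER_TURN


if __name__ == "__main__":
    in_helices = helical_residues()
    print(f"~{in_helices:.0f} of {REPEAT_LENGTH} residues in helices")
    print(f"~{REPEAT_LENGTH - in_helices:.0f} residues left for turns and loops")
```

Under these assumptions roughly three quarters of a canonical repeat would be helical, which leaves about ten residues for the connections between helices.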
The ARM-repeat helix H1 contains five highly conserved residues within the universal consensus sequence [8]. Additionally, the Gly residue C-terminal of the ARM-repeat helix H1 is strongly conserved and mediates a distinct kink between H1 and H2 [11], [12]. ARM-repeat helix H2 possesses three highly conserved hydrophobic residues (usually Leu), one at the N-terminus of H2 and two consecutive hydrophobic residues in a block of eight conserved residues. ARM-repeat H3 contains ten conserved residues including a strongly conserved solvent exposed polar residue, most frequently an Asn at the C-terminus of the helix.
Recently, structural insight into vesicle tethering mediated by the ARM-repeat protein p115 has been provided [13], [14]. Although the two independently determined crystal structures are virtually identical, the two publications came to different conclusions regarding the classification of structural repeats present in p115. Whereas Striegl et al. [13] characterized p115 as an ARM-repeat protein, An et al. [14] suggested the presence of novel “tether repeats” (TR) in p115 and proposed that these tether repeats would also occur in a broad spectrum of other tether proteins.
In order to clarify this discrepancy, we here present a proper classification of the p115 ARM-motifs by combining both structural and sequence information. Additionally, in our analysis we observe no significant evidence that the p115 ARM-motif pattern is present in other tethering factors such as golgins GM130 and giantin.
Analysis
The Globular Head Region of p115: An ARM-Like Helical Conserved Structure
The human general vesicular transport factor p115 is a protein of the golgin family that gives identity and structure to the Golgi apparatus and is part of a complex protein network at the Golgi membrane [15]–[18]. p115 facilitates the tethering of transport vesicles inbound from the endoplasmic reticulum to the cis-Golgi membrane. The myosin-shaped protein forms stable homodimers and comprises a long central coiled-coil region (p115CC), a large N-terminal globular head domain (p115GHR) and a C-terminal acidic region [19], [20]. p115 is recruited to membranes by the guanosine triphosphatase (GTPase) Rab1a in a nucleotide-dependent manner and is among the best characterized representatives of long coiled-coil tethering factors [21]–[27].
Recently, the crystal structures of the human (Figure 2) and bovine p115GHR were determined [13], [14]. Since human and bovine p115GHR are more than 99% identical in their amino acid sequence, it comes as no surprise that the structure of human p115 (Protein Data Bank accession code 2W3C) is very similar to that of the bovine p115 (Protein Data Bank accession code 3GRL), yielding a Z-score of 47.6 for an alignment of 549 residues with a root-mean-square deviation of 1.1 Å by the DaliLite program [28]. The high structural similarity, confirmed by superposition of the α-carbon traces of the human and bovine p115, suggests that both proteins should share an identical structural classification.
However, there are significant differences concerning the p115GHR ARM-fold nomenclature and classification adopted in these publications. An et al. [14] claim that the p115GHR solenoid is made up by a functionally specific TR motif. Striegl et al. [13], however, advance the view that this TR motif is actually a frame-shifted classical ARM repeat in which helix H1 of TR corresponds to H2 of ARM, H2 (TR) to H3 (ARM) and H3 (TR) to H1 (ARM). Accordingly, we argue that, on a sequence and structural level, p115GHR, indeed, belongs to the ARM-protein superfamily [13] (Figures 1, 2, 3).
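The frame-shift relation between the two nomenclatures can be made explicit with a small sketch. The dictionary below encodes only the helix correspondence stated above; the function and variable names are illustrative and are not taken from either publication.

```python
# Helix correspondence between the "tether repeat" (TR) frame of An et al. [14]
# and the classical ARM frame, as stated in the text.
TR_TO_ARM = {"H1": "H2", "H2": "H3", "H3": "H1"}

# Inverse map, derived from the forward map rather than typed by hand.
ARM_TO_TR = {arm: tr for tr, arm in TR_TO_ARM.items()}


def tr_helix_to_arm(tr_helix: str) -> str:
    """Return the ARM helix label that corresponds to a TR helix label."""
    return TR_TO_ARM[tr_helix]


if __name__ == "__main__":
    for tr in ("H1", "H2", "H3"):
        print(f"TR {tr} -> ARM {tr_helix_to_arm(tr)}")
```

The mapping is a cyclic permutation of the three helix labels. Read along the chain, this would mean that one TR unit spans H2 and H3 of one ARM repeat followed by H1 of the next, which is what "frame-shifted" refers to here.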
In fact, the crystal structures of both the human and the bovine p115GHR show that the protein consists of a multi-helical β-catenin-like ARM fold arranged in a regular right-handed superhelix. The published human p115GHR structure included residues Asp54 to Tyr629 of p115 resulting in the assignment of the N-terminal armadillo repeat observed in the structure as ARM1 [13]. The globular head domain of bovine p115 [14] completes the full-length ARM fold of p115GHR by an additional but incomplete (due to a disordered helix H1) ARM repeat at the N-terminus of the molecule. To facilitate a structural comparison, the ARM repeats in human p115GHR have been renumbered such that ARM1 of Striegl et al. [13] is now labeled ARM2, and the last repeat preceding the ARM-like USO element is ARM11 (Figure 2a).
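For readers comparing the two publications, the renumbering amounts to a constant offset of one. The helper below is a minimal sketch of that bookkeeping; it assumes that the earlier human numbering ran from ARM1 to ARM10, which is inferred from the offset and from ARM11 being the last repeat before the USO element, and the function name is hypothetical.

```python
OFFSET = 1           # ARM1 of Striegl et al. [13] becomes ARM2 in the unified numbering
FIRST_NEW_INDEX = 2  # ARM1 is the additional N-terminal repeat seen in bovine p115
LAST_NEW_INDEX = 11  # last repeat preceding the USO element


def renumber(old_index: int) -> int:
    """Map a repeat index from the earlier human numbering to the unified one."""
    new_index = old_index + OFFSET
    if not FIRST_NEW_INDEX <= new_index <= LAST_NEW_INDEX:
        raise ValueError(f"ARM{old_index} is outside the assumed range ARM1 to ARM10")
    return new_index


if __name__ == "__main__":
    for old in (1, 7, 10):
        print(f"ARM{old} (earlier numbering) -> ARM{renumber(old)} (unified)")
```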
The N-terminal armadillo helical domain comprising ARM1 to ARM7 of p115GHR (residues 1–342) is remarkably similar to members of different ARM-protein subfamilies. For example, an iterative sequence search of the database with this fragment using PSIBLAST [32] retrieves proteins containing ARM repeats at significant E-values (<0.005) as early as the second iteration. In contrast, the four C-terminal repeats of p115GHR (residues 343–629, starting from ARM8) are not easily discernible as ARM repeats at the sequence level. In fact, sequence analyses classify this region as a USO1 head domain (Figure 2), a domain that identifies a group of proteins described as general vesicular transport factors, transcytosis-associated proteins (TAP) or vesicle docking proteins [29]. A structure-based sequence alignment of p115GHR and β-catenin ARM repeats, however, clearly shows that the conserved hydrophobic residues located in this region align very well, with the exception of the C-terminal four helices (USO element; Figure 1, Tables S1, S2). Thus, the ARM8-ARM11 repeats within the USO1 head domain are indeed armadillo repeats.
The USO element folds back into the superhelical groove covering helices H3 of repeats ARM8-ARM11 [13] (Figures 2, 3b, 4). This possibly explains the described differences in sequence and structure between the N-terminal ARM domain and the C-terminal USO1 head domain of p115GHR. The interaction with the superhelical groove is mediated by hydrophobic interactions and a single salt bridge (Figure 3b, Table S3). In addition, the USO1 head domain displays large inter- and intra-repeat insertions (Figure 1, 2a). The ARM10 helix H1, for example, is connected to helix H2 by 15 residues, whereas the kink of these helices of ARM5 within β-catenin is mediated by a single glycine (Figure 1).
Despite these structural differences of the USO1 head domain, the superimposition of all p115GHR repeats on the one hand and the superimposition of repeats of p115GHR and β-catenin on the other hand reveals significant structural similarity and a common overall fold (Figure 2b, 3a). Thus, the repeats within the USO1 head domain are indeed ARM repeats with exception of the C-terminal USO element.
In summary, p115GHR contains 11 ARM repeats. The last four C-terminal ARM repeats of p115GHR and the USO element form the USO1 head domain that reveals some sequence and structural alterations compared to the N-terminal classical ARM domain. These differences go along with the function of p115 in vesicular transport and tethering.
The ARM Motifs of p115: Unique and Not Present in GM130 and Giantin
Analysis of the globular head domain of bovine p115 by An et al. [14] led to the assumption that the p115GHR repeats lack sequence conservation except for leucine-rich motifs, and, due to these characteristics, variable leucine-rich motifs for the helices H1, H2 and H3 were suggested [14]. Upon visual inspection, a pattern of leucine-rich residues separated by sequences of variable length, as found for p115GHR, was detected in other tether proteins that are involved in exocytic and endocytic trafficking [14], including the cis-Golgi golgins GM130 and giantin [reviewed in 16]. This sequence similarity was used for the characterization and classification of the TR motifs. However, iterative sequence searches with these proteins using PSIBLAST [32] did not support their similarity to p115 or to any protein with ARM repeats. For a more exhaustive analysis, we collected orthologs of the human GM130, giantin and p115 proteins and scanned them with ARD, which uses a neural network to detect ARM and other repeats forming alpha-rods [30]. Whereas four correct matches could be identified in the N-terminal part of most of the p115 homologs used, no such signal was obtained in human GM130, giantin (not shown), or their orthologs tested (Figure 5).
Additionally, we scanned ten golgin-related sequences (Golgin245, Golgin84, Gmap210, BicaudalD1, Iporin, Mical1, Rabenosyn5, Rabaptin5, EEA1, Rim3, Noc2) for alpha-rod repeats using the ARD server. None of the sequences was identified as containing such repeats: seven sequences received no hit at all, and three (Rabaptin5, EEA1, Rim3) received a single hit above 0.8, whereas at least three such hits are taken as evidence for repeats.
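The decision rule applied to the ARD output can be written down in a few lines. The sketch below encodes only the criterion stated above (at least three hits above 0.8); the function name and the example scores are hypothetical and do not reproduce actual ARD server output.

```python
SCORE_THRESHOLD = 0.8  # a hit is counted only if its score is above this value
MIN_HITS = 3           # at least three such hits are taken as evidence for repeats


def has_repeat_evidence(scores) -> bool:
    """Apply the hit-count criterion to a list of per-position ARD scores."""
    hits = sum(1 for score in scores if score > SCORE_THRESHOLD)
    return hits >= MIN_HITS


if __name__ == "__main__":
    # Hypothetical score lists, for illustration only.
    examples = {
        "no hit": [0.12, 0.30, 0.05],
        "single hit": [0.10, 0.86, 0.40],
        "several hits": [0.91, 0.88, 0.20, 0.95, 0.83],
    }
    for label, scores in examples.items():
        print(f"{label}: repeats supported = {has_repeat_evidence(scores)}")
```

Under this rule a sequence with one isolated high-scoring position, as reported for Rabaptin5, EEA1 and Rim3, is not classified as containing alpha-rod repeats.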
Discussion
Proteins within the different ARM subfamilies display a conserved architecture and provide a scaffold for the assembly of protein complexes with various functions. Generally, the identification of ARM repeats by sequence comparison is relatively simple. The C-terminal region of p115GHR, however, demonstrates how difficult it can be to classify a protein as an ARM-fold protein by sequence comparison alone. This may explain why a structural annotation of bovine p115GHR [14] invoked a new type of repeat (TR), which we find neither required nor helpful in classifying this protein structure.
Crystal structure analysis revealed a special ARM-fold architecture of the p115GHR C-terminal domain identified as the USO1 head domain, bearing large insertions and a unique USO element. This domain is unique among ARM-repeat proteins and defines proteins as vesicular transport factors. The unexpected ARM fold of the USO1 head domain of p115GHR differs from the classical ARM fold, but structure-based sequence alignments allow p115 to be classified unambiguously as a member of the ARM-protein superfamily.
In conclusion, we propose to define a fourth subfamily of ARM-like proteins. Thus, besides the classical catenins, the p120ctn-related catenins and the proteins involved in nuclear import, the new ARM subfamily is termed USO1 head domain-like and describes a group of proteins that are involved in vesicular transport and are conserved from yeast to human. The globular head region of p115 is therefore the first crystal structure of a member of the USO1 head domain-like ARM subfamily.
Supporting Information
Acknowledgments
We are grateful to Anja Schütz (Max-Delbrück-Centrum, Berlin) for critical reading of this manuscript.
Figure production: All pictures of protein structures were prepared using PyMOL [31]. The structure-based sequence alignment was prepared using the DaliLite program [28].
Footnotes
Competing Interests: The authors have declared that no competing interests exist.
Funding: This work was supported by the Deutsche Forschungsgemeinschaft (www.dfg.de) through SFB 740. M.A.A-N. acknowledges support from the Helmholtz foundation (www.helmholtz.de). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Riggleman B, Wieschaus E, Schedl P. Molecular analysis of the armadillo locus: uniformly distributed transcripts and a protein with novel internal repeats are associated with a Drosophila segment polarity gene. Genes Dev. 1989;3:96–113. doi: 10.1101/gad.3.1.96.
- 2. Hulsken J, Birchmeier W, Behrens J. E-cadherin and APC compete for the interaction with β-catenin and the cytoskeleton. J Cell Biol. 1994;127:2061–2069. doi: 10.1083/jcb.127.6.2061.
- 3. McCrea PD, Turck CW, Gumbiner B. A homolog of the armadillo protein in Drosophila (plakoglobin) associated with E-cadherin. Science. 1991;254:1359–1361. doi: 10.1126/science.1962194.
- 4. Moon RT, Bowerman B, Boutros M, Perrimon N. The promise and perils of Wnt signaling through β-catenin. Science. 2002;296:1644–1646. doi: 10.1126/science.1071549.
- 5. Peifer M, Polakis P. Wnt signaling in oncogenesis and embryogenesis - a look outside the nucleus. Science. 2000;287:1606–1609. doi: 10.1126/science.287.5458.1606.
- 6. Bienz M, Clevers H. Linking colorectal cancer to Wnt signaling. Cell. 2000;103:311–320. doi: 10.1016/s0092-8674(00)00122-7.
- 7. Kinzler KW, Vogelstein B. Lessons from hereditary colorectal cancer. Cell. 1996;87:159–170. doi: 10.1016/s0092-8674(00)81333-1.
- 8. Peifer M, Berg S, Reynolds AB. A repeating amino acid motif shared by proteins with diverse cellular roles. Cell. 1994;76:789–791. doi: 10.1016/0092-8674(94)90353-0.
- 9. Hatzfeld M, Nachtsheim C. Cloning and characterization of a new armadillo family member, p0071, associated with the junctional plaque: evidence for a subfamily of closely related proteins. J Cell Sci. 1996;109:2767–2778. doi: 10.1242/jcs.109.11.2767.
- 10. Hatzfeld M. The armadillo family of structural proteins. Int Rev Cytol. 1999;186:179–224. doi: 10.1016/s0074-7696(08)61054-2.
- 11. Huber AH, Nelson WJ, Weis WI. Three-dimensional structure of the armadillo repeat region of β-catenin. Cell. 1997;90:871–882. doi: 10.1016/s0092-8674(00)80352-9.
- 12. Andrade MA, Petosa C, O'Donoghue SI, Müller CW, Bork P. Comparison of ARM and HEAT protein repeats. J Mol Biol. 2001;309:1–18. doi: 10.1006/jmbi.2001.4624.
- 13. Striegl H, Roske Y, Kümmel D, Heinemann U. Unusual armadillo fold in the human general vesicular transport factor p115. PLoS ONE. 2009;4(2):e4656. doi: 10.1371/journal.pone.0004656.
- 14. An Y, Chen CY, Moyer B, Rotkiewicz P, Elsliger MA, et al. Structural and functional analysis of the globular head domain of p115 provides insight into membrane tethering. J Mol Biol. 2009;391:26–41. doi: 10.1016/j.jmb.2009.04.062.
- 15. Ramirez IB, Lowe M. Golgins and GRASPs: holding the Golgi together. Semin Cell Dev Biol. 2009;7:770–779. doi: 10.1016/j.semcdb.2009.03.011.
- 16. Short B, Haas A, Barr FA. Golgins and GTPases, giving identity and structure to the Golgi apparatus. Biochim Biophys Acta. 2005;1744:383–395. doi: 10.1016/j.bbamcr.2005.02.001.
- 17. Chan EKL, Fritzler MJ. Golgins: coiled-coil proteins associated with the Golgi complex. Electron J Biotechnol. 1998;1:1–10.
- 18. Burkhard P, Stetefeld J, Strelkov SV. Coiled coils: a highly versatile protein folding motif. Trends Cell Biol. 2001;11:82–88. doi: 10.1016/s0962-8924(00)01898-5.
- 19. Sapperstein SK, Walter DM, Grosvenor AR, Heuser JE, Waters MG. p115 is a general vesicular transport factor related to the yeast endoplasmic reticulum to Golgi transport factor Uso1p. Proc Natl Acad Sci USA. 1995;92:522–526. doi: 10.1073/pnas.92.2.522.
- 20. Yamakawa H, Seog DH, Yoda K, Yamasaki M, Wakabayashi T. Uso1 protein is a dimer with two globular heads and a long coiled-coil tail. J Struct Biol. 1996;116:356–365. doi: 10.1006/jsbi.1996.0053.
- 21. Allan BB, Moyer BD, Balch WE. Rab1 recruitment of p115 into a cis-SNARE complex: programming budding COPII vesicles for fusion. Science. 2000;289:444–448. doi: 10.1126/science.289.5478.444.
- 22. Beard M, Satoh A, Shorter J, Warren G. A cryptic Rab1-binding site in the p115 tethering protein. J Biol Chem. 2005;280:25840–25848. doi: 10.1074/jbc.M503925200.
- 23. Shorter J, Warren G. A role for the vesicle tethering protein, p115, in the post-mitotic stacking of reassembling Golgi cisternae in a cell-free system. J Cell Biol. 1999;146:57–70. doi: 10.1083/jcb.146.1.57.
- 24. Satoh A, Warren G. In situ cleavage of the acidic domain from the p115 tether inhibits exocytic transport. Traffic. 2008;9:1522–1529. doi: 10.1111/j.1600-0854.2008.00783.x.
- 25. Puthenveedu MA, Linstedt AD. Gene replacement reveals that p115/SNARE interactions are essential for Golgi biogenesis. Proc Natl Acad Sci USA. 2004;101:1253–1256. doi: 10.1073/pnas.0306373101.
- 26. Guo Y, Punj V, Sengupta D, Linstedt AD. Coat-tether interaction in Golgi organization. Mol Biol Cell. 2008;7:2830–2843. doi: 10.1091/mbc.E07-12-1236.
- 27. Sohda M, Misumi Y, Yoshimura S, Nakamura N, Fusano T, et al. The interaction of two tethering factors, p115 and COG complex, is required for Golgi integrity. Traffic. 2007;8:270–284. doi: 10.1111/j.1600-0854.2006.00530.x.
- 28. Holm L, Park J. DaliLite workbench for protein structure comparison. Bioinformatics. 2000;16(6):566–567. doi: 10.1093/bioinformatics/16.6.566.
- 29. Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, et al. The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. 2000;29:37–40. doi: 10.1093/nar/29.1.37.
- 30. Palidwor GA, Shcherbinin S, Huska MR, Rasko T, Stelzl U, et al. Detection of alpha-rod protein repeats using a neural network and application to huntingtin. PLoS Comput Biol. 2009;5(3):e1000304. doi: 10.1371/journal.pcbi.1000304.
- 31. DeLano WL. The PyMOL Molecular Graphics System. San Carlos, CA, USA: DeLano Scientific LLC; 2003.
- 32. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–3402. doi: 10.1093/nar/25.17.3389.
- 33. Huska MR, Buschmann H, Andrade-Navarro MA. BiasViz: visualization of amino acid biased regions in protein alignments. Bioinformatics. 2007;23(22):3093–3094. doi: 10.1093/bioinformatics/btm489.