Abstract
Collagen is modified by hydroxylation and glycosylation of hydroxylysine residues. This glycosylation is initiated by the β1,O galactosyltransferases GLT25D1 and GLT25D2. The structurally similar protein cerebral endothelial cell adhesion molecule CEECAM1 was previously reported to be inactive when assayed for collagen glycosyltransferase activity. To address the cause of the absent galactosyltransferase activity, we have generated several chimeric constructs between the active human GLT25D1 and inactive human CEECAM1 proteins. The assay of these chimeric constructs pointed to a short central region and a large C-terminal region of CEECAM1 leading to the loss of collagen galactosyltransferase activity. Examination of the three DXD motifs of the active GLT25D1 by site-directed mutagenesis confirmed the importance of the first (amino acids 166–168) and second motif (amino acids 461–463) for enzymatic activity, whereas the third one was dispensable. Since the second DXD motif is incomplete in CEECAM1, we have restored the motif by introducing the substitution S461D. This change did not restore the activity of the C-terminal region, thereby showing that additional amino acids were required in this C-terminal region to confer enzymatic activity. Finally, we have introduced the substitution Q471R-V472M-N473Q-P474V in the CEECAM1-C-terminal construct, which is found in most animal GLT25D1 and GLT25D2 isoforms but not in CEECAM1. This substitution was shown to partially restore collagen galactosyltransferase activity, underlining its importance for catalytic activity in the C-terminal domain. Because multiple mutations in different regions of CEECAM1 contribute to the lack of galactosyltransferase activity, we deduced that CEECAM1 is functionally different from the related GLT25D1 protein.
Introduction
Collagen undergoes several post-translational modifications before formation of a right-handed triple helix in the endoplasmic reticulum. These modifications include the hydroxylation of selected proline [1] and lysine [2] residues. Thereafter, a portion of hydroxylysine (Hyl) residues are further modified by the addition of carbohydrates, forming the disaccharide Glc(α1-2)Gal(β1-O)Hyl [3], [4]. The extent of hydroxylation and glycosylation depends on the type of collagen and their tissue distribution. The formation of hydroxyproline is essential for the thermal stability of the collagen triple helix [5]. Lysyl hydroxylation is important for the cross-linking of collagen fibrils and acts as substrate for glycosylation reactions. The importance of collagen post-translational modifications is reflected by the diseases associated to defective collagen modifications. Mutations in prolyl 3-hydroxylase and in the lysyl hydroxylase genes lead to various forms of skeletal dysplasia [6], [7],[8],[9].
Whereas the roles of proline and lysine hydroxyllation in collagen biology are well established, the functional relevance of collagen glycosylation is presently unclear. Recently, we have identified the two genes GLT25D1 and GLT25D2 as encoding collagen galactosyltransferase (ColGalT) enzymes [10]. RNA interference studies in Caenorhabditis elegans have shown that the inactivation of the lysyl hydroxylase gene let-268 yields a lethal phenotype related to abnormal collagen type-IV secretion [11], [12]. Similarly, the inactivation of the D2045.9 gene, the putative ortholog to human GLT25D1 and GLT25D2, is associated to growth defects and multiple morphologic abnormalities [13], [14].
The sequences of the GLT25D1 and GLT25D2 proteins were found to be strongly similar to the cerebral endothelial cell adhesion molecuzle (CEECAM1) protein [15]. However, in spite of the strong structural conservation, CEECAM1 was found to be inactive when assayed for collagen glycosyltransferase activity [10]. Although several types of collagen acceptors have been tested, it could not be excluded that CEECAM1 only recognizes specific substrates. Alternatively, the claimed localization of CEECAM1 at the surface of endothelial cells [15] suggests that the protein does not function as a glycosyltransferase like GLT25D1 and GLT25D2 proteins. The CEECAM1 protein could be enzymatically inactive as collagen glycosyltransferase because of point mutations introduced during evolution. The polymorphism of ABO blood group glycosyltransferases [16] or the inactive form of the α1,3 galactosyltransferase gene in primate genomes [17] are typical examples of such lost glycosyltransferase activities.
In the present work, we investigate the rationale for the lack of ColGalT activity in the CEECAM1 protein by domain swapping and single point mutation. By permutation with portions of the active GLT25D1 galactosyltransferase, the production of GLT25D1-CEECAM1 chimeric proteins enables the identification of the regions responsible for the loss of galactosyltransferase activity. In parallel, this approach allows narrowing down the regions of GLT25D1 that are essential for the catalytic activity.
Results
The CEECAM1 protein is structurally similar to the ColGalT enzymes GLT25D1 and GLT25D2. The comparison of the three polypeptides shows 55% sequence identity between human CEECAM1 and human GLT25D1 and 50% between human CEECAM1 and human GLT25D2 [10]. The three proteins also share the same putative conformation and domain organization as predicted by the PROMALS3D algorithm [18] (Fig. 1). To further compare the three proteins, we aligned GLT25D1, GLT25D2 and CEECAM1 with known protein structures using the pdb70 database and HHpred tool (http://toolkit.tuebingen.mpg.de/hhpred) [19]. This search pointed to a potential structural similarity of the three proteins with the chondroitin polymerase from Escherichia coli (PDB ID: 2z86). The region ranging from amino acids 24 to 466 of human GLT25D1 produced a hit with an E-value of 6.2E-09 and P-value of 2.7E-13. This bacterial chondroitin polymerase belongs to the CAZY GT2 family of glycosyltransferases and contains two functional domains adopting a glycosyltransferase GT-A fold [20]. Such a binary structure is supposedly adopted by the GLT25D1, GLT25D2 and CEECAM1 proteins as also noted by Liefhebber et al. [21]. Based on this prediction, the GLT25D1, GLT25D2 and CEECAM1 proteins can be parted into three structural domains: a N-terminal domain of about 230 amino acids, a central domain of about 110 amino acids and a C-terminal domain of about 280 amino acids (Fig. 1).
To identify the molecular basis for the lack of collagen glycosyltransferase activity in CEECAM1, we constructed chimeric proteins between GLT25D1 and CEECAM1 based on the three domains mentioned here above. The corresponding chimeras were named CEECAM1-Nter, CEECAM1-Mid and CEECAM1-Cter (Fig. 2). The chimeric proteins were Flag-tagged at their N-terminus and expressed in Sf9 cells as recombinant baculoviruses. The amount of recombinant proteins present in Sf9 cell lysates was monitored by Western blotting using an anti-Flag antibody and ColGalT activity was assayed towards denatured bovine collagen type I. The CEECAM1 and CEECAM1-Nter proteins were expressed as two glycoforms, whereas the other chimeric proteins expressed as a single band (Fig. 3). The CEECAM1-Nter construct yielded the same ColGalT activity as the GLT25D1 enzyme, showing that the N-terminal domain of CEECAM1 does not contribute to the absence of glycosyltransferase activity. By contrast, the constructs including either the central or the C-terminal domain of CEECAM1 did not show any ColGalT activity (Fig. 3).
To delineate more precisely the regions of CEECAM1 and GLT25D1 that are essential for enzymatic activity, we have produced four additional chimeric constructs for the central domain and five additional constructs for the C-terminal domain. From the four chimeric constructs containing portions of the CEECAM1 central domain, only the construct, in which the stretch between amino acids 232 and 282 of GLT25D1 was replaced by the corresponding stretch of CEECAM1 (named CEECAM1-MidC), was found to be enzymatically active, yet with decreased ColGalT activity to 32% of wildtype GLT25D1 levels (Fig. 3). The analysis of these four chimeric constructs delimitated a CEECAM1 stretch of 63 amino acids that caused the loss of the enzymatic activity when introduced into GLT25D1. The comparison of GLT25D1, GLT25D2 and CEECAM1 protein sequences from different animal genomes showed that this stretch corresponds to a strongly conserved sequence region (Fig. 4A). Within this stretch of 63 amino acids, only six conserved residues within GLT25D1 and GLT25D2 isoforms were found to be different in CEECAM1.
To study the impact of the C-terminal domain on ColGalT activity, five chimeric constructs encompassing parts of the CEECAM1 C-terminal domain in GLT25D1 (Fig. 2) were expressed in Sf9 cells and assayed for enzymatic activity. This analysis pointed to a large portion of the C-terminal domain as being involved in the catalysis of the ColGalT reaction. In fact, only the GLT25D1 stretch between amino acids 342 and 414 was replaceable for enzymatic activity, as shown by the partially active construct CEECAM1-CterC (Fig. 3). The examination of the C-terminal domain of GLT25D1, GLT25D2 and CEECAM1 from multiple animal species revealed 26 amino acid residues strongly conserved in GLT25D1 and GLT25D2 but not in CEECAM1. Of interest, two DXD motifs are included among these diverging amino acids (Fig. 4B).
The presence of DXD motifs is typical for glycosyltransferases utilizing nucleotide-activated sugars as donor substrates. Such DXD motifs are usually involved in the binding of the nucleotide diphosphate moiety through Mn2+ chelation [22]. The collagen galactosyltransferases GLT25D1 and GLT25D2 include three DXD motifs and one related EDD motif (Fig. 1). The EDD and one DXD motifs are also conserved in CEECAM1, whereas two C-terminally localized DXD motifs are missing in CEECAM1. This suggests that the C-terminal DXD motifs may be important for the activity of the GLT25D1, considering their absence in the enzymatically inactive CEECAM1 protein. To determine the importance of the DXD motifs on the activity of GLT25D1, we have produced mutant forms of GLT25D1 lacking each of these three motifs. The mutant GLT25D1 proteins mainly yielded two glycoforms (Fig. 5), although this pattern was not seen in all expressed samples. The substitution D166A-D168A led to an inactive protein, thus establishing the importance of the first DXD motif for the ColGalT activity. Note worthily, this DXD motif is also found in the N-terminal domain of CEECAM1. The second DXD motif of GLT25D1, found at D461-D463, was also essential to the ColGalT activity, since its mutation yielded an inactive GLT25D1 protein (Fig. 5). By contrast, the last DXD motif of GLT25D1 at D585-D587 was dispensable for the ColGalT activity (Fig. 5).
As noted in our domain swapping experiments, a portion of the CEECAM1 central domain abolished the enzymatic activity when replacing the GLT25D1 stretch encompassing amino acids 280 to 342 (Fig. 3). In an attempt to isolate amino acids in this central domain that are important for this enzyme activity, we replaced the residues P292 and D336 in GLT25D1 by the corresponding residues found in CEECAM1, thereby yielding the substitutions P292N and D336S (Fig. 4). These two residues were selected because they are strictly conserved in animal GLT25D1 and GL25D2 isoforms. In addition, P292 may be involved in making a critical twist in the polypeptide structure. The expression of the two P292N and D336S GLT25D1 mutants showed that these two substitutions only lightly decreased ColGalT activity, thereby speaking against a critical involvement of these amino acids in the inactivity of CEECAM1 (Fig. 5).
The comparison between the C-terminal domain of GLT25D1, GLT25D2 and CEECAM1 isoforms from various animal genomes pointed to 26 amino acids diverging between GLT25D1 and GLT25D2 on the one hand and CEECAM1 on the other hand (Fig. 4). As shown here above, the C-terminal domain includes an essential DXD motif that is not found in CEECAM1. We chose to introduce this DXD motif in CEECAM1 to determine if this change could restore the ColGalT activity in the inactive chimeric construct CEECAM1-CterB (Fig. 2). To this end, the S461D substitution was engineered and the resulting CEECAM1-CterB[S461D] construct assayed for enzymatic activity. This test showed that the S461D substitution was insufficient to restore ColGalT activity (Fig. 6). Another strictly conserved stretch found in GLT25D1 and GLT25D2 but absent in CEECAM1 is located just downstream of the D461–D463 DXD motif. This stretch begins at R471 and reads RMQV whereas it corresponds to a QVNP sequence in CEECAM1 (Fig. 4). We have added the substitution Q471R-V472M-N473Q-P474V in the CEECAM1-CterB[S461D] construct, yielding CEECAM1-CterB[S461D][RMQV]. This new mutant form showed a partially restored ColGalT activity (Fig. 6), thus demonstrating the importance of this stretch for the enzymatic activity.
Finally, we determined the apparent Km and Vmax parameters of the partially active chimeric constructs CEECAM1-MidC and CEECAM1-CterC to assess whether the decreased activity can be related to altered interaction with the substrates. This kinetic analysis showed that the low ColGalT activity of CEECAM1-MidC was due to decreased Vmax (Table 1), suggesting that the central domain is important to maintain a normal reaction rate. By contrast, the analysis of the CEECAM1-CterC showed normal Vmax values but increased Km values for both donor and acceptor substrates (Table 1). This finding supports the idea that the C-terminal domain is important for the binding to UDP-Gal and to collagen.
Table 1. Kinetic parameters for chimeric constructs CEECAM1-MidC and CEECAM1-CterC.
UDP-Gal | Collagen | |||
Enzyme | Km (µM) | Vmax (pmol min−1) | Km (µg µl−1) | Vmax (CPM min−1) |
GLT25D1 | 29.91±2.56 | 12.45±1.09 | 5.7±0.9 | 235.13±25.66 |
CEECAM1-MidC | 20.31±1.63 | 6.86±0.45 | 5.04±1.12 | 108.53±29.95 |
CEECAM1-CterC | 74.29±5.56 | 11.2±1.4 | 13.4±1.75 | 259.27±63.48 |
Values represent the average ± S.E.M. of three independent experiments.
Discussion
The present study demonstrates that the inactivity of CEECAM1 as a ColGalT is related to sequence variations in two distinct regions of the protein. Based on the analysis of chimeric constructs between the inactive CEECAM1 and the active GLT25D1 proteins, the study also enabled the identification of structural motifs essential to the ColGalT activity. The recombinant GLT25D1, CEECAM1 and chimeric proteins tested were expressed to similar levels in Sf9 insect cells and did not yield insoluble aggregates. The resulting uniform expression patterns enabled the functional comparison of all constructs described in the study.
The N-terminal domain of GLT25D1 and CEECAM1 were interchangeable without losing the galactosyltransferase activity, which did not mean that this domain was dispensable for the enzymatic activity. In fact, the DXD motif in this domain was shown to be essential to the activity of GLT25D1. Since the N-terminal domain of CEECAM1 could yield an active chimeric protein when combined with GLT25D1, the experiment showed that both proteins are structurally closely related and that such chimeric constructs are neither instable nor inactive per se. The N-terminal domain of GLT25D1, GLT25D2 and CEECAM1 also shared structural similarity with glycosyltransferases like the polypeptide N-acetylgalactosaminyltransferases-2, -10 (CAZY family GT27) and the α1,4 N-acetylhexosaminyltransferase EXTL2 (CAZY family GT64) [23], [24], [25], indicating that the sequence context around the first DXD motif is suitable for an interaction with Mn2+ and with the diphosphate moiety of the donor substrate.
Most of the central domain of CEECAM1 failed to yield any enzymatic activity when combined to GLT25D1. The kinetic analysis of the CEECAM1-MidC construct showed that this region was important for the rate of the galactosyltransferase reaction. Considering the predicted GT-A fold of the GLT25 family proteins, this finding suggests that the central domain is important to associate the two lobes of the active enzymes to encompass the donor and acceptor substrates.
The C-terminal domain of the active GLT25D1 enzyme includes two DXD motifs, which are missing in the corresponding domain of the inactive CEECAM1 protein. Our mutagenesis experiments showed that the first of these two DXD motifs is required for the ColGalT activity of GLT25D1, whereas the second one is dispensable. This fact suggests that the C-terminal domain is also involved in the recognition of acceptor substrates and that critical residues for this function are missing in the CEECAM1 sequence. The importance of the C-terminal domain in the recognition of the donor and acceptor substrates was confirmed by the kinetic analysis performed on the partially active chimeric construct CEECAM1-CterC.
Bacterial glycosyltransferases of the CAZY family GT25, such as Lex2B β1,4-glucosyltransferase and LpsB β1,2-galactosyltransferase [26], [27], share structural similarity with the C-terminal part of GLT25D1 between amino acids 344 and 622. These bacterial enzymes are involved in lipopolysaccharide biosynthesis and add hexoses in the oligosaccharide extension. In spite of this structural similarity as seen with the program PROMALS3D, the bacterial Lex2B and LpsB glycosyltransferases also present differences to GLT25D1 since they lack the first DXD motif and the RMQV sequence, which are essential for the ColGalT activity.
The CEECAM1 protein was first described as a cerebral adhesion protein [15], yet the CEECAM1 gene is widely transcribed in the nervous system and in several secretory tissues such as salivary glands, pancreas, liver and placenta [10]. The presence of a potential ER retention signal at the end of the CEECAM1 protein suggests that this protein primarily localizes to the ER, like the active collagen galactosyltransferases GLT25D1 and GLT25D2. Our present study showed that the lack of ColGalT activity of CEECAM1 is related to multiple changes in the protein sequence when compared to GLT25D1 and GLT25D2. However, we cannot exclude at this stage that CEECAM1 does function as a glycosyltransferase that recognizes substrates different from collagen. The presence of potential CEECAM1 orthologous genes in all mammalian genomes would argue against CEECAM1 as representing a pseudo-gene or being the consequence of a late gene duplication event specifically found in the human genome.
Taken together, our results provided insights on the functional organization of the GLT25D1 ColGalT by showing that two regions of the protein are essential for the enzymatic activity. Our work also showed that the inactivity of CEECAM1 as a ColGalT relies on multiple amino acid changes, thus suggesting different substrate specificity or different biological activity. The determination of the exact structure of mammalian GLT25 proteins or of homologous bacterial proteins by crystallization will clarify the structural basis of the CEECAM1 functions.
Materials and Methods
Materials
Collagen type I from bovine Achilles tendon, UDP-Gal and the anti-Flag M2 monoclonal antibody were from Sigma-Aldrich (Buchs, Switzerland). UDP-[14C]Gal was from GE Healthcare (Glattbrugg, Switzerland). Restriction enzymes and DNA modifying enzymes were from New England Biolabs (BioConcept, Allschwil, Switzerland).
Generation of chimeric proteins GLT25D1-CEECAM1
Full length cDNA fragments were subcloned into pFastBac1-Flag baculovirus transfer vector [28] to construct Flag-tagged GLT25D1 and Flag-tagged CEECAM1 expression plasmids. Chimeric GLT25D1-CEECAM1 constructs were created by replacing parts of the GLT25D1 cDNA sequence by the corresponding CEECAM1 sequence using restriction sites conserved in both genes (Fig. S1). For the CEECAM1-Nter chimeric protein, the restriction sites EcoRI at cDNA position 57 and NcoI (position 796) were used. To create the CEECAM1-Mid and CEECAM1-Cter chimeric proteins, GLT25D1 DNA was digested with PspXI and XbaI, thereby releasing a 1698 bp fragment that was subcloned into pBluescript-SKII (Agilent Technologies, Basel, Switzerland) to create pBluescript-GLT25D1-short. Then, the NcoI-Tth111I and Tth111I-XbaI fragments from GLT25D1 were replaced respectively by the corresponding CEECAM1 sequence using identical restriction sites, thus yielding the constructs pBluescript-CEECAM1-Mid-short and pBluescript-CEECAM-Cter-short. These two new constructs were digested by PspXI and XbaI to subclone the 1698 bp fragments into pFastBac-GLT25D1, yielding the constructs CEECAM1-Mid and CEECAM1-Cter.
For the chimeric proteins CEECAM1-MidA/B/C/D and CEECAM1-CterA/B/C/D/E, restriction sites were introduced by PCR to generate silent mutations without impact on the amino acid sequence. For chimeric proteins CEECAM1-MidA/B/C/D, PCR was performed using pBluescript-GLT25D1-short as template and forward and reverse primers listed in Table 2 and Figure S1. These PCR products were digested and cloned back into pBluescript-CEECAM1-Mid-short, previously linearized using the same enzymes (Table 2, Fig. S1). The pBluescript-based constructs were digested by PspXI and XbaI for cloning back into pFastBac-GLT25D1 as described above, yielding the final constructs CEECAM1-MidA/B/C/D.
Table 2. PCR primers used to generate chimeric GLT25D1-CEECAM1 proteins.
CEECAM1 chimeric construct | Plasmid template | Primer | Oligonucleotide sequence (5′-3′) | Restriction site |
MidA | pBluescript-GLT25D1-short | For | GCAGTTCCCATGGTGCACTC | NcoI |
Rev | GCAACACGTGCATCTGAACCTCTGCCTGCTTGC | PmlI | ||
MidB | pBluescript-GLT25D1-short | For | GCAGTTCCCATGGTGCACTC | NcoI |
Rev | GCAAACTAGTACCTCCAGCTGCACATGCATG | SpeI | ||
MidC | pBluescript-GLT25D1-short | For | GCAACACGTGTGCAACAAGGAGGAGTACG | PmlI |
Rev | AGCCAGGCAGCATCTGGATC | BamHI | ||
MidD | pBluescript-GLT25D1-short | For | GCAAACTAGTGAAGCACCCGCCCGCAGAGC | SpeI |
Rev | AGCCAGGCAGCATCTGGATC | BamHI | ||
CterA | pBluescript-GLT25D1-short | For | GCAATGGCCACGCAACCTGCATGCCTTC | MscI |
Rev | GCGGCCGCTCTAGAGGCCAC | XbaI | ||
CterB | pBluescript-CEECAM1-Cter-short | For | TGAGGCCGAGAGCTTCATGC | NsiI |
Rev | GCAAGGCGCCCGCCAGACGCAGGGCATAG | KasI | ||
CterC | pBluescript-CEECAM1-Cter-short | For | TGAGGCCGAGAGCTTCATGC | NsiI |
Rev | GCAAGCTCAGGAAGCAGCCCACCTCGCC | Bpu10I | ||
CterD | pBluescript-CEECAM1-Cter-short | For | GCAAGGCGCCCGCAAGCTGCTGGCCTCAC | KasI |
Rev | CGGCCGCTCTAGAGTAGTGGCCTG | XbaI | ||
CterE | pBluescript-CEECAM1-Cter-short | For | GCAACCTGAGCCATTACTCCATCTGGGAAGAGG | Bpu10I |
Rev | GCAAGCTCAGGAAGCAGCCCACCTCGCC | KasI |
Plasmids used for PCR amplification are indicated for each constructs with the restriction sites used for cloning. Sequences are shown for the forward and reverse PCR primers, respectively. Bases marked in bold represent positions changed to create new restriction sites. Underlined bases mark the restriction sites used for cloning and dotted underlined bases show the partial restriction sites used for cloning. The BamHI restriction site used in MidC and MidD constructs is present at the 3′-end of the PCR product and not in the PCR primer.
For the chimeric protein CEECAM1-CterA, PCR was performed on pBluescript-GLT25D1-short DNA using the primers given in Table 2. The PCR product was digested by MscI and XbaI (Fig. S2) and cloned back into pBluescript-CEECAM1-Cter-short opened at the same restriction sites. For chimeric proteins CEECAM-CterB/C/D/E, PCR was done on pBluescript-CEECAM1-Cter-short DNA using the corresponding forward and reverse primers listed in Table 2. After restriction digestion (Fig. S2), CEECAM1 fragments were cloned back into pBluescript-GLT25D1-short, previously digested by the same enzymes. Finally, all engineered pBluescript DNA were digested with PspXI and XbaI and introduced into pFastBac-GLT25D1 as described above, yielding the constructs CEECAM1-CterA/B/C/D/E. The integrity of the constructs generated by PCR was verified by DNA sequencing (Microsynth, Balgach, Switzerland).
Mutagenesis
Site directed mutagenesis of GLT25D1 and chimeric GLT25D1-CEECAM1 constructs was performed by QuickChange site-directed mutagenesis [29]. The corresponding primers and DNA templates are shown in Table 3. The introduction of the appropriate mutations was confirmed by DNA sequencing (Microsynth). The mutant constructs are named according to the amino acids substituted and their respective position on the protein sequence where the start methionine of GLT25D1 counts as position one.
Table 3. Sense strand oligonucleotides used for site-directed mutagenesis.
Plasmid template | Mutations | Oligonucleotide sequence (5′-3′) |
GLT25D1 | D166A-D168A | CATCCTGTTTGTAGCGGCCGCCAACCTGATCCTCAACCC |
GLT25D1 | P292N | GGAGTACGGATTCTTGAACGTTCCATTGCGCGCCCACAGC |
GLT25D1 | D336S | CCCACCAAGACACCGAGCAAGATGGGCTTCG |
GLT25D1 | D461A-D463A | GGAGGGCCTGGCCTGGGCCCTCATCTATGTGG |
GLT25D1 | D585A-D587A | CGTCAAGACCGCCTGGGCCCGCGCCAAGTCC |
CEECAM1-CterB | S461D | GGAGGCAGAGAAACTGGATTGGGACCTGATCTACC |
CEECAM1-CterB[S461D] | Q471R-V472M-N473Q-P474V | GATCTACCTCGGACGGAAGCGCATGCAGGTTGAGAAGGAGACGGCCGTGG |
Sequences show the sense strand oligonucleotides of each complementary pair of primers used for site-directed mutagenesis. Underlined bases mark the introduced mutations.
ColGalT assays
Recombinant baculoviruses were produced in Spodoptera frugiperda Sf9 cells as described previously [30]. Galactosyltransferase assays were performed as described by Schegg et al. [10]. Briefly, 2.106 infected Sf9 cells were lyzed in 50 mM TrisHCl pH 8, 150 mM NaCl, 1% Triton X100 and protease inhibitors (Complete, Roche, Rotkreuz, Switzerland) for 15 min on ice. Insoluble material was precipitated at 14,000×g for 20 min at 4°C. Assays were performed using 10 µl of postnuclear supernatant, 5 mg/ml of denatured collagen type I from bovine Achilles tendon, 60 µM UDP-Gal spiked with 50,000 cpm UDP-[14C]Gal, 10 mM MnCl2, 20 mM NaCl, 50 mM morpholinepropanesulfonic acid pH 7.4 and 1 mM DTT. Reactions were incubated for 3 h at 37°C then stopped by addition of 500 µl of 5%TCA/5% phosphotungstic acid. The precipitated assay products were applied on glass fiber filters (Whatman, Sigma-Aldrich), washed with 10 ml of 50% ethanol, dried for 30 min and counted in a scintillation β-counter (Packard).
Western Blotting
After determination of protein concentration, samples corresponding to 10 µg proteins were denatured at 95°C, loaded on 8% SDS-PAGE, and transferred to PVDF membranes (Millipore, Zug, Switzerland). Flag proteins were detected using an anti-Flag M2 monoclonal antibody (dilution 1∶2,000) (Sigma-Aldrich), then by a secondary antibody, anti-mouse peroxidase conjugate (dilution 1∶10,000) (Sigma-Aldrich), followed by detection using SuperSignal West Pico chemiluminescent substrate (Thermo Scientific, Lausanne, Switzerland).
Alignment and modeling of protein structure
Protein sequences corresponding to human GLT25D1 (Swiss-Prot:Q8NBJ5), human GLT25D2 (Swiss-Prot:Q81IYK4) and human CEECAM1 (Swiss-Prot:Q5T4B2) were used as input for the PROMALS3D multiple sequence and structure alignment tool (http://prodata.swmed.edu/ promals3d/promals3d.php) [18]. An identity threshold of 0.6 was chosen and other settings were set as default. Sequences were aligned according to predicted α-helices and β-strands. Protein homology detection of the three proteins was done using the HHpred program based on the pdb70 database (http://toolkit.tuebingen.mpg.de/hhpred) [19]. Multiple sequence alignment was done using ClustalW2 program (http://www.ebi.ac.uk/Tools /msa/clustalw2/) [31].
Supporting Information
Acknowledgments
We thank Nicolas Tricaud for reading critically the manuscript.
Footnotes
Competing Interests: The authors have declared that no competing interests exist.
Funding: This work was supported by the Swiss National Foundation (www.snf.ch) grant 310030-129633 to T. Hennet. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Mizuno K, Hayashi T, Bachinger HP. Hydroxylation-induced stabilization of the collagen triple helix. Further characterization of peptides with 4(R)-hydroxyproline in the Xaa position. J Biol Chem. 2003;278:32373–32379. doi: 10.1074/jbc.M304741200. [DOI] [PubMed] [Google Scholar]
- 2.Risteli M, Niemitalo O, Lankinen H, Juffer AH, Myllyla R. Characterization of collagenous peptides bound to lysyl hydroxylase isoforms. J Biol Chem. 2004;279:37535–37543. doi: 10.1074/jbc.M405638200. [DOI] [PubMed] [Google Scholar]
- 3.Spiro RG. Characterization and quantitative determination of the hydroxylysine-linked carbohydrate units of several collagens. J Biol Chem. 1969;244:602–612. [PubMed] [Google Scholar]
- 4.Spiro RG, Lucas F, Rudall KM. Glycosylation of hydroxylysine in collagens. Nat New Biol. 1971;231:54–55. doi: 10.1038/newbio231054a0. [DOI] [PubMed] [Google Scholar]
- 5.Berg RA, Prockop DJ. The thermal transition of a non-hydroxylated form of collagen. Evidence for a role for hydroxyproline in stabilizing the triple-helix of collagen. Biochem Biophys Res Commun. 1973;52:115–120. doi: 10.1016/0006-291x(73)90961-3. [DOI] [PubMed] [Google Scholar]
- 6.Hyland J, Ala-Kokko L, Royce P, Steinmann B, Kivirikko KI, et al. A homozygous stop codon in the lysyl hydroxylase gene in two siblings with Ehlers-Danlos syndrome type VI. Nat Genet. 1992;2:228–231. doi: 10.1038/ng1192-228. [DOI] [PubMed] [Google Scholar]
- 7.van der Slot AJ, Zuurmond AM, Bardoel AF, Wijmenga C, Pruijs HE, et al. Identification of PLOD2 as telopeptide lysyl hydroxylase, an important enzyme in fibrosis. J Biol Chem. 2003;278:40967–40972. doi: 10.1074/jbc.M307380200. [DOI] [PubMed] [Google Scholar]
- 8.Salo AM, Cox H, Farndon P, Moss C, Grindulis H, et al. A connective tissue disorder caused by mutations of the lysyl hydroxylase 3 gene. Am J Hum Genet. 2008;83:495–503. doi: 10.1016/j.ajhg.2008.09.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Cabral WA, Chang W, Barnes AM, Weis M, Scott MA, et al. Prolyl 3-hydroxylase 1 deficiency causes a recessive metabolic bone disorder resembling lethal/severe osteogenesis imperfecta. Nat Genet. 2007;39:359–365. doi: 10.1038/ng1968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Schegg B, Hülsmeier AJ, Rutschmann C, Maag C, Hennet T. Core glycosylation of collagen is initiated by two beta(1-O)galactosyltransferases. Mol Cell Biol. 2009;29:943–952. doi: 10.1128/MCB.02085-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Myllyharju J, Kivirikko KI. Collagens, modifying enzymes and their mutations in humans, flies and worms. Trends Genet. 2004;20:33–43. doi: 10.1016/j.tig.2003.11.004. [DOI] [PubMed] [Google Scholar]
- 12.Norman KR, Moerman DG. The let-268 locus of Caenorhabditis elegans encodes a procollagen lysyl hydroxylase that is essential for type IV collagen secretion. Dev Biol. 2000;227:690–705. doi: 10.1006/dbio.2000.9897. [DOI] [PubMed] [Google Scholar]
- 13.Simmer F, Moorman C, van der Linden AM, Kuijk E, van den Berghe PV, et al. Genome-wide RNAi of C. elegans using the hypersensitive rrf-3 strain reveals novel gene functions. PLoS Biol. 2003;1:77–84. doi: 10.1371/journal.pbio.0000012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Kamath RS, Fraser AG, Dong Y, Poulin G, Durbin R, et al. Systematic functional analysis of the Caenorhabditis elegans genome using RNAi. Nature. 2003;421:231–237. doi: 10.1038/nature01278. [DOI] [PubMed] [Google Scholar]
- 15.Starzyk RM, Rosenow C, Frye J, Leismann M, Rodzinski E, et al. Cerebral cell adhesion molecule: a novel leukocyte adhesion determinant on blood-brain barrier capillary endothelium. J Infect Dis. 2000;181:181–187. doi: 10.1086/315163. [DOI] [PubMed] [Google Scholar]
- 16.Yamamoto F, Clausen H, White T, Marken J, Hakomori S. Molecular genetic basis of the histo-blood group ABO system. Nature. 1990;345:229–233. doi: 10.1038/345229a0. [DOI] [PubMed] [Google Scholar]
- 17.Galili U, Shohet SB, Kobrin E, Stults CL, Macher BA. Man, apes, and old world monkeys differ from other mammals in the expression of α-galactosyl epitopes on nucleated cells. JBiolChem. 1988;263:17755–17762. [PubMed] [Google Scholar]
- 18.Pei J, Kim BH, Grishin NV. PROMALS3D: a tool for multiple protein sequence and structure alignments. Nucleic Acids Res. 2008;36:2295–2300. doi: 10.1093/nar/gkn072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Soding J. Protein homology detection by HMM-HMM comparison. Bioinformatics. 2005;21:951–960. doi: 10.1093/bioinformatics/bti125. [DOI] [PubMed] [Google Scholar]
- 20.Osawa T, Sugiura N, Shimada H, Hirooka R, Tsuji A, et al. Crystal structure of chondroitin polymerase from Escherichia coli K4. Biochem Biophys Res Commun. 2009;378:10–14. doi: 10.1016/j.bbrc.2008.08.121. [DOI] [PubMed] [Google Scholar]
- 21.Liefhebber JM, Punt S, Spaan WJ, van Leeuwen HC. The human collagen beta(1-O)galactosyltransferase, GLT25D1, is a soluble endoplasmic reticulum localized protein. BMC Cell Biol. 2010;11:33. doi: 10.1186/1471-2121-11-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Ramakrishnan B, Boeggeman E, Ramasamy V, Qasba PK. Structure and catalytic cycle of β-1,4-galactosyltransferase. Curr Opin Struct Biol. 2004;14:593–600. doi: 10.1016/j.sbi.2004.09.006. [DOI] [PubMed] [Google Scholar]
- 23.Pedersen LC, Dong J, Taniguchi F, Kitagawa H, Krahn JM, et al. Crystal structure of an alpha 1,4-N-acetylhexosaminyltransferase (EXTL2), a member of the exostosin gene family involved in heparan sulfate biosynthesis. J Biol Chem. 2003;278:14420–14428. doi: 10.1074/jbc.M210532200. [DOI] [PubMed] [Google Scholar]
- 24.Fritz TA, Raman J, Tabak LA. Dynamic association between the catalytic and lectin domains of human UDP-GalNAc:polypeptide alpha-N-acetylgalactosaminyltransferase-2. J Biol Chem. 2006;281:8613–8619. doi: 10.1074/jbc.M513590200. [DOI] [PubMed] [Google Scholar]
- 25.Kubota T, Shiba T, Sugioka S, Furukawa S, Sawaki H, et al. Structural basis of carbohydrate transfer activity by human UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferase (pp-GalNAc-T10). J Mol Biol. 2006;359:708–727. doi: 10.1016/j.jmb.2006.03.061. [DOI] [PubMed] [Google Scholar]
- 26.Deadman ME, Lundstrom SL, Schweda EK, Moxon ER, Hood DW. Specific amino acids of the glycosyltransferase LpsA direct the addition of glucose or galactose to the terminal inner core heptose of Haemophilus influenzae lipopolysaccharide via alternative linkages. J Biol Chem. 2006;281:29455–29467. doi: 10.1074/jbc.M604908200. [DOI] [PubMed] [Google Scholar]
- 27.Griffin R, Cox AD, Makepeace K, Richards JC, Moxon ER, et al. The role of lex2 in lipopolysaccharide biosynthesis in Haemophilus influenzae strains RM7004 and RM153. Microbiology. 2003;149:3165–3175. doi: 10.1099/mic.0.26387-0. [DOI] [PubMed] [Google Scholar]
- 28.Müller R, Altmann F, Zhou D, Hennet T. The Drosophila melanogaster brainiac protein is a glycolipid-specific β1,3 N-acetylglucosaminyltransferase. JBiolChem. 2002;277:32417–32420. doi: 10.1074/jbc.C200381200. [DOI] [PubMed] [Google Scholar]
- 29.Braman J, Papworth C, Greener A. Site-directed mutagenesis using double-stranded plasmid DNA templates. Methods Mol Biol. 1996;57:31–44. doi: 10.1385/0-89603-332-5:31. [DOI] [PubMed] [Google Scholar]
- 30.Hennet T, Dinter A, Kuhnert P, Mattu TS, Rudd PM, et al. Genomic cloning and expression of three murine UDP-galactose: beta-N-acetylglucosamine beta1,3-galactosyltransferase genes. J Biol Chem. 1998;273:58–65. doi: 10.1074/jbc.273.1.58. [DOI] [PubMed] [Google Scholar]
- 31.Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, et al. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res. 2003;31:3497–3500. doi: 10.1093/nar/gkg500. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.