Skip to main content
Viruses logoLink to Viruses
. 2021 Jan 10;13(1):87. doi: 10.3390/v13010087

Chlorovirus PBCV-1 Multidomain Protein A111/114R Has Three Glycosyltransferase Functions Involved in the Synthesis of Atypical N-Glycans

Eric Noel 1,2,, Anna Notaro 3,4,, Immacolata Speciale 3,5,, Garry A Duncan 1, Cristina De Castro 5,*, James L Van Etten 1,6,*
Editors: Jacques Le Pendu, Göran Larson
PMCID: PMC7826918  PMID: 33435207

Abstract

The structures of the four N-linked glycans from the prototype chlorovirus PBCV-1 major capsid protein do not resemble any other glycans in the three domains of life. All known chloroviruses and antigenic variants (or mutants) share a unique conserved central glycan core consisting of five sugars, except for antigenic mutant virus P1L6, which has four of the five sugars. A combination of genetic and structural analyses indicates that the protein coded by PBCV-1 gene a111/114r, conserved in all chloroviruses, is a glycosyltransferase with three putative domains of approximately 300 amino acids each. Here, in addition to in silico sequence analysis and protein modeling, we measured the hydrolytic activity of protein A111/114R. The results suggest that domain 1 is a galactosyltransferase, domain 2 is a xylosyltransferase and domain 3 is a fucosyltransferase. Thus, A111/114R is the protein likely responsible for the attachment of three of the five conserved residues of the core region of this complex glycan, and, if biochemically corroborated, it would be the second three-domain protein coded by PBCV-1 that is involved in glycan synthesis. Importantly, these findings provide additional support that the chloroviruses do not use the canonical host endoplasmic reticulum–Golgi glycosylation pathway to glycosylate their glycoproteins; instead, they perform glycosylation independent of cellular organelles using virus-encoded enzymes.

Keywords: glycosyltransferases, multidomain protein, N-glycan, chloroviruses, PBCV-1

1. Introduction

Glycosylation is an important post-translational modification that confers a diversity of structures and functions in both cell and virus biology. One of the most common forms of protein modification is N-linked glycosylation, in which a high mannose core is linked to the amide nitrogen of asparagine in the context of the conserved motif Asn–X–Ser/Thr. This attachment occurs early in protein synthesis, followed by a complex process of trimming and remodeling of the oligosaccharide during transit through the endoplasmic reticulum (ER) and Golgi [1] producing glycoproteins with varying oligosaccharide structures. Typically, viruses use host-encoded glycosyltransferases (GT) and glycosidases located in the ER and Golgi to add and remove N-linked sugar residues to/from virus glycoproteins either co-translationally or shortly after translation of the protein. The viral glycoproteins are then often transported to the host plasma-membrane where progeny viruses bud though the membrane, and so the viruses only become infectious as they leave the cell [2].

However, the large dsDNA chloroviruses in the family Phycodnaviridae are interesting because, unlike other viruses, they encode most, if not all, of the machinery to glycosylate their major capsid proteins (MCP) independent of the cellular organelles [3,4]. Furthermore, the process occurs in the cytoplasm, and infectious viruses are formed inside the cell prior to cell lysis. The prototype chlorovirus, Paramecium bursaria chlorella virus (PBCV-1), has a 330-kb genome that is predicted to encode as many as 400 proteins, many of which are unusual for a virus [5], including at least 17 related to various aspects of carbohydrate metabolism [6].

The PBCV-1 MCP (also referred to as Vp54) is coded by gene a430l, and the protein has four glycosylation sites [7]. The predominant oligosaccharide is a nonasaccharide (Figure 1) with several unique features, including the following: (i) its structure does not resemble any structure previously reported in the three domains of life [8]; (ii) the four glycoforms are generated by the non-stochiometric presence of two monosaccharides, L-arabinose (Araf) and D-mannose (Man); (iii) the most abundant glycoform consists of nine neutral monosaccharide residues organized in a highly-branched fashion; (iv) none of the N-linked glycans is attached to a typical Asn–X–(Thr/Ser) consensus site in the protein [9]; (v) the glycans are attached to the protein by a β-glucose (Glc) linkage, which is rare in nature and has only been reported in glycoproteins from a few organisms [10,11,12,13]; and (vi) the glycoform contains a dimethylated rhamnose (Rha) as the capping residue of the main chain, a hyper-branched fucose (Fuc) residue and two Rha residues with opposite configurations.

Figure 1.

Figure 1

Structure of the N-glycan attached to the chlorovirus PBCV-1 major capsid protein (Vp54). Monosaccharides (Man and Araf) connected by dashed lines are non-stoichiometric substituents. The larger black box encloses the conserved pentasaccharide core structure common to all chloroviruses analyzed to date. The inner red box indicates the tetrasaccharide structure of the N-glycan of the antigenic mutant chlorovirus PIL6 [6].

Interestingly, all the chloroviruses studied to date, including those with different host specificities, have the capsid protein N-glycosylated with other types of oligosaccharides; however, all chloroviruses share the same pentasaccharide core oligosaccharide [14,15] composed of an N-linked Glc, a hyperbranched Fuc, a distal and a proximal xylose (Xyl), and a galactose (Gal) (Figure 1). Additional monosaccharides decorate this core N-glycan, producing a molecular signature for each chlorovirus [16]. This oligosaccharide core was also found in all mutants (or antigenic variants) of PBCV-1 analyzed to date, except for PIL6 (Figure 1 [6]). Mutant PIL6 is a representative of the antigenic class D, characterized by a large genomic deletion spanning genes a014r through a078r [6,17]. Its N-glycan is a tetrasaccharide and is significantly truncated compared to that of wild-type PBCV-1. It contains all the units of the chloroviruses N-glycans present in the conserved core region except the distal Xyl attached to Fuc [16]. By analyzing the genomes of all chloroviruses and of the mutants sequenced to date, we noted that the a111/114r gene is the only annotated orthologous GT gene found outside of the region of the large deletion mutants that is present in all other chloroviruses. This finding suggests its likely involvement in the synthesis of the initial part of the unusual N-glycan shared by all of these viruses and prompted us to investigate its role in the assemblage of the conserved core oligosaccharide.

To predict the A111/114R protein structure and functions, we used a combination of genetic and structural analyses, together with hydrolytic activity assays and in silico evidence by sequence analysis and protein modeling. The combined results suggest that the A111/114R protein is a multi-domain/multi-functional GT likely involved in the attachment of three of the five monosaccharides in the conserved core region of the N-glycan (Figure 1). Specifically, this large protein of 860 amino acids is comprised of three putative domains (Figure S1; Figure 2), each with a specific role; the N-terminal domain (1–260 aa) is a galactosyltransferase (GalT), the central domain (261–559 aa) is a xylosyltransferase (XylT), and the C-terminal domain (560–860 aa) is a fucosyltransferase (FucT).

Figure 2.

Figure 2

Predicted GT domains of PBCV-1 encoded protein A111/114R. A111/114R domain analysis based on remote homology identified three putative GT domains labeled as D1, D2, and D3, located at the N-terminal (1 to 260 aa), central (261 to 559 aa), and C-terminal (560 to 860 aa) regions, respectively. Below individual domains are the corresponding three-dimensional protein models assigned by Phyre2 [18] based on alignments to known protein structures identified by their PDB entry: 1GA8 Chain A (D1), 2Z86 Chain D (D2), 2NZY Chain A (D3). Protein ribbon models are rendered using rainbow colors from N-terminus (blue) to C-terminus (red). The putative domain, predicted protein model, and sugar substrate are connected by the black dashes.

2. Materials and Methods

2.1. Protein Modeling

The prediction of the different domains of A111/114R was performed by HHpred [19] based on remote homology detection. Then, the 3D model of each domain was built by Phyre2 (Protein Homology/analogY Recognition Engine V 2.0) in a normal mode [18]. Each 3D model was based on an alignment generated by HMM–HMM matching. Final drawings and residue analysis were prepared with the molecular graphics system PyMol Version 1.2r3pre, Schrödinger, LLC.

2.2. Cloning and Expression

PBCV-1 a111/114r and domain variants were cloned from PCR-amplified viral DNA using oligonucleotide primers with restriction sites NotI–BamHI. PCR fragments of the expected size were digested and inserted into the restriction sites of the pMAL-c6T expression vector (New England Biolabs, Ipswich, MA, USA). This process produced a maltose-binding protein (MBP) tag at the N-terminus of the target protein. The resulting plasmid was transformed into E. coli strain One Shot TOP10 competent cells (Invitrogen) for maintenance. The E. coli cells containing positive cloned plasmids were selected with 100 μg/mL carbenicillin. The cloned structure of each vector was sequence verified. Plasmids were isolated with a QIAprep Spin Miniprep kit (Qiagen, Valencia, CA, USA) according to the manufacturer’s instructions and transformed into NEBExpress competent cells for expression. Viral genes were expressed by growing cells overnight at 37 °C in 10 mL of LB medium (10 g/L tryptone, 5 g/L yeast extract, 5 g/L sodium chloride) containing 100 μg/mL carbenicillin. Then, 5 mL of the over-night culture was sub-cultured into 200 mL LB medium containing 100 μg/mL carbenicillin. The batch culture was grown to an OD600 of 0.6 at 37 °C and then induced with 0.1 mM IPTG and incubated at 16 °C overnight. The cells were harvested by centrifugation at 3500× g, for 5 min at 4 °C, and resuspended in 35 mL of PBS with 2 mM phenylmethylsulfonyl fluoride (PMSF). After incubation on ice for 30 min, cells were disrupted by sonication for 3 min using a Tekmar sonic disruptor at 30% amplitude, in 5 s pulses. Samples were centrifuged at 10,000 rpm for 15 min to separate soluble and insoluble fractions.

2.3. Purification of Recombinant Enzymes

Amylose resin (New England Biolabs) was loaded onto a 5-mL self-packing column with a 45- to 90-μm-pore-size polyethylene filter (frit) (Life Science Products, Chestertown, MD, USA), and the resin was allowed to settle. The column was equilibrated with 5 column volumes of cold wash buffer (50 mM NaH2PO4, 150 mM NaCl, 1 mM DTT, 1 mM EDTA, pH 7.2). The soluble bacterial fraction was applied to the column and allowed to drain. The column was washed again with 5 column volumes of cold wash buffer. The recombinant proteins were eluted with the MBP moiety using elution buffer (wash buffer plus 10 mM maltose). The recombinant protein concentrations were determined by a NanoDrop spectrophotometer (NanoDrop Technologies, Wilmington, DE, USA). Eluted proteins were resolved by SDS-PAGE (7.5% acrylamide) with Coomassie brilliant blue staining.

2.4. UDP-GloTM and GDP-GloTM GT Assays

Detection of free uridine diphosphate (UDP) after hydrolysis of the sugar nucleotide was performed using the UDP-GloTM GT assay kit (Promega Corporation, Madison, WI, USA), which detects UDP after UDP-sugar hydrolysis or transfer by converting UDP to light (measured in Relative Luminescence Units) in a luciferase-type reaction. Detection of free guanosine diphosphate (GDP) from GDP-sugar hydrolysis was evaluated using the GDP-GloTM GT assay kit (Promega Corporation), which operates by the same principles as above. A standard curve using 0–25 μM of the respective nucleoside diphosphate (NDP) was performed, and the range of measurements was determined to be in the linear range of detection, where the luminescence detected is directly proportional to NDP concentration.

Enzymes were diluted with an optimized GT solution (0.1 M MOPS-NaOH pH 7, 10 mM Mn2+) and supplemented with 100 μM of the targeted nucleotide sugar(s). Each sugar–nucleotide hydrolysis reaction was incubated at 16 °C for 16 h. Following the manufacturer’s protocol, each reaction was combined in a ratio of 1:1 (25 μL: 25 μL) with the UDP-GloTM detection reagent in independent wells of a white, flat bottom 96-well assay plate and allowed to incubate at ambient temperature. After 1 h of incubation, luminescence was measured in triplicate, which is directly proportional to UDP or GDP concentration based on the standard curves. Luminescence measurements were performed with a VeritasTM microplate luminometer (Turner Biosystems, Sunnyvale, CA, USA) using a 96-well microplate with standard 128 × 86-mm geometry with an integration time of 1.0 s. Luminescence was measured by a low-noise photomultiplier detector through an empty filter position in the emission filter wheel.

3. Results and Discussion

3.1. In Silico Analysis of A111/114R Domain 1

PBCV-1 encoded protein A111/114R was annotated in NCBI (https://blast.ncbi.nlm.nih.gov/) as a hypothetical protein (NP_048459) of which only one region was predicted to be a GT (262 to 382 aa). PSI-BLAST analysis [20] revealed that A111\114R was conserved among all chloroviruses with 62–94% sequence identity, reinforcing the notion that it could be involved in the assembly of the conserved oligosaccharide core. Therefore, to elucidate the function of A111\114R, we analyzed the full-protein sequence using HHpred tool [19], which predicted three putative GT domains. N-terminal homology with several known GTs (>98% probability) was identified for residues 1 to 256; hence, we assigned residues 1 to 260 aa as domain 1 (A111/114R-D1). The other two domains are discussed in the following sections.

Protein structure prediction by Phyre2 analysis [18] identified six GT crystal structures (Table S1) to model three-dimensional structures of A111/114R-D1 based on 50–82% protein coverage (>90% confidence and 12–23% sequence identity). Models were ranked according to raw alignment score using the sequence and the secondary structure similarity, inserts, and deletions. Interestingly, the top ranked models were based on two XylTs (PDB:6BSV, 4WMA), one glucosyltransferase (GlcT) (PDB:1LL2), and three GalTs (PDB:1GA8, 5GVV, 6U4B), in agreement with our initial hypothesis, as both Xyl and Gal are part of the conserved region of the core glycan (Figure 1). The highest ranked model was the XylT XXT1 from Arabidopsis (PDB: 6BSV) based on 120 residues (19% identity with 96% confidence). However, the homologous region of A111/114R-D1 (9 to 168 aa) that aligned with XXT1 residues 167 to 287 had no homology with the residues in the active site of the XXT1 model, which utilized Lys-382, Asp-317, Asp-318, and Gln-319 to bind the nucleotide sugar for catalytic activity [21]. Similarly, important residues for enzyme activity in the XylT XXYLT1 (PDB: 4WMA) from Mus musculus [22] (His-262, Trp-265, and Gly-325) did not share homologous positions with A111/114R-D1. Based on this evidence, we deduced that A111/114R-D1 has no XylT activity, and for this reason we examined the other high top ranked models, all reporting well characterized GalTs (PDB: 1GA8, 5GVV, 6U4B), except for 1LL2 which is a GlcT involved in glycogen synthesis [23]. Although GlcT activity cannot be definitely ruled out, A111/114R-D1 lacks the equivalent 1LL2 residue, Tyr-194, involved in Glc addition.

In order to investigate possible GalT activity for A111/114R-D1, we chose, as reference protein, LgtC (1GA8), a GalT of Neisseria meningitidis, for which the residues involved in the sugar–nucleotide binding and in the catalysis were well characterized. LgtC is a retaining GT that transfers α-D-Gal from UDP-Gal to a terminal lactose. The structure of LgtC was solved in complex with Mn2+ and a non-cleavable analog of the donor sugar (UDP-Gal) in which the hydroxyl at the 2′ position of the Gal was substituted by a fluorine for stability purposes [23]. The alignment of the protein sequences of LgtC and A111/114R-D1 (Figure 3), along with the structural superimposition of the 3D model of A111/114R-D1 based on LgtC (Table S1) with 1GA8 (Chain A) in complex with Mn2+ and an analog of the donor sugar (Figure 4), clearly revealed that all the residues responsible for binding and catalysis were preserved. In detail, A111/114R-D1 shares four sugar binding residues with LgtC, namely Asp-85, Asn-149, Asp-193, and Gln-194, which correspond to Asp-103, Asn-153, Asp-188, and Gln-189 in LgtC (1GA8). Mutagenesis experiments have established that LgtC Gln-189, contained within the invariant D/EQD motif found in all members of GTs in family 8 (GT8), plays a crucial role in binding and probably in catalysis as well [24]. A catalytic mechanism leading to the retention of the anomeric carbon configuration for A111/114R-D1 is consistent with the presence in the oligosaccharide core of an α-D-Gal (Figure 1). It has also been proven that LgtC is a cation-dependent GT [23]. Indeed, LgtC exhibits the typical DXD motif (103DXD105), common in a wide range of GTs, both in prokaryotes and eukaryotes [25]. In addition, a crucial role has been attributed to Asp-103, as LgtC D103E and D103N mutants present a dramatic reduction in their activity compared to the wild-type [23].

Figure 3.

Figure 3

Amino acid sequence alignment of A111/114R domains and known GTs. Invariant and similar residues are highlighted in black and gray, respectively. Sequences from the following organisms were used (PDB code) for three individual domains (D1, D2, and D3), respectively: (a) N. meningitidis LgtC GalT (1GA8), (b) E. coli K4CP dual GalNAcT and GlcAT (2Z86), and (c) H. pylori FucT (2NZW). Homologous residues positioned less than 4 Å from the nucleotide sugar in the three-dimensional model analysis are marked with an asterisk (*). Known residues from annotated enzymes that interact with the nucleotide sugar donor and ion are marked with a dot () and underline (), respectively. Multiple alignment was performed by Phyre2 using structural information and homology extension. File output was compiled by BOXSHADE.

Figure 4.

Figure 4

Superpositions of A111/114R-D1, -D2, and -D3 with structural homologs. Individual domains of A111/114R (cyan) are shown independently as ribbon diagrams superimposed with known GTs (gray) bound to respective nucleotide sugars drawn as stick models (orange) and Mn2+ ion (magenta): N. meningitidis LgtC with UDP-fluorogalactose (PDB: 1GA8), E. coli K4CP with UDP-glucuronic acid (PDB: 2Z86), and H. pylori FucT with GDP-Fuc (PDB: 2NZY) are superposed with D1, D2, and D3, respectively (left to right). The corresponding active sites of D1, D2, and D3 are shown magnified below the complexed stereoviews with labeled residues proposed to be involved in sugar and ion coordination. Hydrogen bonds are represented as yellow dotted lines.

However, the 103DXD105 motif of LgtC is not preserved in position with A111/114R-D1, except for LgtC Asp-103, which corresponds to A111/114R-D1 Asp-85. Homologous to residue LgtC Asp-103, Asp-85 of A111/114-D1 is positioned in close proximity to a divalent cation and to the nucleotide sugar, as denoted with the characteristic ligand distance < 4 A° (Table 1). It is likely that Asp-85 is sufficient in providing one side-chain oxygen ligand in coordination with Mn2+, as evident from the structural superimposition (Figure 4). The evidence that A111/114R-D1 possesses all residues involved in binding and catalysis is further supported by the fact that other well-noted GalTs exhibit similar homologies. The GT GlyE from Streptococcus pneumoniae TIGR4 (PDB: 5GVV) binds UDP-Gal [26], and it shares the same conserved sugar binding residues as LgtC and A111/114R-D1 (Asp-103, Asn-142, Asp-177, and Gln-178) (Figure S2). Mutation of these key residues in GlyE completely abolished the hydrolytic activity [26]. Additionally, the bifunctional domain polymerase WbbM from Klebsiella pneumoniae (PDB: 6U4B) possesses a C-terminal galactopyranosyltransferase [27] with similar homologous residues, Asp-486 and Gln-487, resembling the known GT8 family enzyme signature likely to bind UDP-Gal.

Table 1.

Predicted H-bond distances between homologous A111/114R residues and nucleotide–sugar ligand.

A111/114R Residues H-Bond Distance of Nucleotide-Sugar (Å)
Domain 1 Tyr-83 <4
Asp-85 2.6, 3.0, 3.6
Asn-149 <4
Ala-150 <4
Asp-193 2.4, 2.5
Gln-194 <4
Domain 2 Pro-265 2.8, 3.3
Asp-294 <4
Asn-323 <4
Gly-328 <4
Asp-351 3.2, 3.3
Asp-353 <4
Asp-447 <4
Domain 3 Leu-617 <4
Val-684 <4
Arg-693 <4
Gly-711 <4
Asn-749 1.6, 3.1, 3.3
Tyr-755 1.9, 3.4
Ser-757 <4
Glu-758 2.2
Lys-759 2.5
Asp-762 <4

3.2. In Silico Analysis of A111/114R Domain 2

Three-dimensional renderings of the central domain of A111/114R (261 to 559 aa), referred to as A111/114R-D2, were assembled based on protein alignment and secondary structure similarities. Phyre2 analysis assigned protein model predictions based on as high as 97% coverage with 100% confidence (11–18% sequence identity) from multiple N-acetylgalactosaminyltransferases (GalNAcT) (Table S1). The top ranked model was based on the second domain of the chondroitin polymerase K4CP from E. coli (PDB: 2Z86), namely 197 residues (66% of the protein sequence) of A111/114R-D2 were modelled with 100% confidence. K4CP is a bi-functional enzyme organized into two GT-A domains (A1 and A2) that catalyzes elongation of the bacterial chondroitin chain [28]. K4CP A1 (1–417) and A2 (418–682), located respectively to the N- and C-terminal, are engaged in the transfer of N-acetylgalactosamine (GalNAc) and glucuronic acid (GlcA) residues alternatively from UDP-GalNAc and UDP-GlcA. The 3D model of A111/114R-D2 (261–474) is based on the second domain (A2) of K4CP (435 to 631 aa), which binds UDP-GlcA, thus excluding GalNAcT activity. The GlcA, absent in the oligosaccharidic core, has the same stereochemistry of Xyl and differs from this monosaccharide by a carboxyl function attached to carbon 5. This finding suggests that the A111/114R-D2 could be a XylT, and for that reason we used K4CP-A2 as a reference to assess the conservation of the residues implicated in binding and in catalysis. Sequence alignment of A111/114R-D2 with K4CP-A2 validated the conserved residues involved with GlcA binding and divalent cation coordination (Figure 3). In detail, A111/114R-D2 residues involved in the sugar-nucleotide binding are Pro-265, Asp-294, and Asn-323, which correspond to Pro-439, Asp-469, and Asn-496 in K4CP-A2 [28]. Superposition of K4CP Chain D with the predicted A111/114R-D2 structure (Figure 4) showcase these residues aligning in close proximity (<4 Å) to the nucleotide sugar (Table 1), strengthening their participation in sugar-binding and catalysis. K4CP, in analogy with other GT-A fold GTs, has a 519DSD521 motif coordinating the Mn2+. This DXD motif is preserved in A111/114R-D2 and corresponds to 351DDD353.

Together, the corresponding catalytic sites and DXD motif support orthology in enzyme activity. Of further note, two additional DXD motifs (Figure S1) are present downstream of A111/114R-D2 (422DRD424 and 439DPD441) that potentially could play an active role in substrate recognition or catalysis, but they do not exhibit homology with K4CP (Figure 3). Should A111/114R-D2 have functional XylT activity, it would be the first XylT sandwiched between two domains specific for different nucleotide sugars. The placement of the proximal Xyl in the core glycan structure, positioned closely to Gal and Fuc, is agreeable with the proposed domain organization of A111/114R.

3.3. In Silico Analysis of A111/114R Domain 3

The C-terminal domain (560 to 860 aa), referred to as A111/114R-D3, was clearly predicted as a putative α-1,3-FucT (Table S1) modelled from Helicobacter pylori (PDB: 2NZW) with 100% confidence and 15% sequence identity (86% coverage). The prediction is that Fuc is linked to the O-3 of a Glc, which, like the monosaccharide contributions of domains D1 and D2, is a component of the overall virus core glycan structure. FucT belongs to the GT B family, in which the protein contains N- and C-terminal domains binding to the acceptor and donor substrates, respectively. Normally, these GTs do not have a recognizable DXD motif responsible for the Mn2+/Mg2+ binding; however, A111/114R-D3 has a 642DLD644 signature (Figure S1) that must be evaluated for activity.

A sequence alignment of A111/114R-D3 and H. pylori FucT (hpFucT) revealed conserved residues involved in binding of the donor substrate, GDP-Fuc (Figure 3); Asn-240, Tyr-246, Glu-249, and Lys-250 correspond to Asn-749, Tyr-755, Glu-758, and Lys-759 in A111/114R-D3, respectively (Table 1). Classified as an inverting GT, the proposed catalytic mechanism of hpFucT incorporates Glu-95 as a general base in catalysis, while Lys-250 and Arg-195 in hpFucT share a key role in the neutralization of the negative charged phosphate groups from GDP-Fuc to facilitate the glycosidic bond cleavage [29]. Glu-249 acts to stabilize in part the positive charge developed in the transition state as well as to form two hydrogen bonds with both the ribose and the Fuc residues of GDP-Fuc. The Glu-95 residue of hpFucT is located in the N-terminal domain, presumably associated with the acceptor substrate, and it has no equivalent in A111/114R-D3. This finding could be related to alternative acceptors in the two FucT enzymes compared here, for example, hpFucT binding GlcNAc or A111/114R-D3 binding Glc.

Comparison of superimposed proteins disclosed that the second half of A111/114R-D3 traces the C-terminal domain of hpFucT Chain A (160 to 320 aa), illustrating similarities in secondary structures. In contrast, structure resemblance decreases towards the N-terminal regions, which could be a result of different acceptor substrates. The open structure of the N-terminal region of the A111/114R-D3 model could be a consequence of a larger acceptor.

To evaluate the sugar-binding and catalytic sites of A111/114R-D3, we superimposed the domain with hpFucT (Chain A) in complex with GDP-Fuc (PDB: 2NZY). This overlap confirmed that all the expected residues involved in the GDP-Fuc binding were preserved and in good orientation (Figure 4), in agreement with the sequence alignment data (Figure 3). A111/114R-D3 residues Asn-749, Tyr-755, Glu-758, Lys-759 appeared <4 Å from the nucleotide sugar in agreement with their equivalent residues in hpFucT (Figure 4). This observation suggests these A111/114R-D3 residues likely participate in GDP-Fuc binding and are critical for A111/114R-D3 activity. Specifically, Asn-749, Tyr-755, and Glu-758 likely stabilize the positive charge on the Fuc moiety like their equivalent residue in hpFucT. Positioned close in proximity to GDP-Fuc is A111/114R-D3 residue Arg-693 that corresponds to Arg-195 in hpFucT, an essential residue for enzyme activity. In fact, Ala mutants of Arg-195 or Lys-250 resulted in no detectable activity, supporting the idea that the two residues provide positive charges to interact with negatively charged GDP-Fuc [29]. The important role of Arg-693 is supported by its conservation amongst different chloroviruses.

3.4. In Vitro Evidence of A111/114R Hydrolytic Activity

In order to test A111/114R for GT activity, we used the bioluminescent UDP/GDP-Glo assays to detect free UDP/GDP released by GT-mediated hydrolysis of the nucleotide sugars. This allowed us to screen for nucleotide donor specificities without their target substrates (acceptor). A time course experiment with the full-length recombinant A111/114R protein in the presence of the core monosaccharides UDP-Gal, UDP-Xyl, GDP-Fuc, and Glc displayed evidence of UDP-hydrolysis (Figure 5). Glc was supplemented to simulate the Glc-Asn acceptor located on the nascent glycoprotein. The release of the UDP increased steadily over time, suggestive of GT activity by A111/114R-mediated hydrolysis.

Figure 5.

Figure 5

A111/114R-catalyzed hydrolysis of UDP-sugars. Time-course experiment with the full-length recombinant A111/114R protein (6 µg) in the optimized buffer consisting of 0.1 M MOPS-NaOH (pH 7.0), 10 mM MgCl2, and 100 µM each of UDP-Gal, UDP-Xyl, GDP-Fuc, and Glc for 16 h at 16 °C. The release of UDP was detected by the UDP-GloTM assay. Data are representative from three independent replicates, and error bars represent standard deviation.

We recombinantly expressed the full-length A111/114R protein (1 to 860 aa) and three variants with omitted regions (Figure 6a,b). A111/114R (1–397 aa) contains the complete domain 1 and approximately the first half of domain 2, A111/114R (266–860 aa) contains the complete domains 2 and 3, and A111/114R (391–860 aa) contains the second half of domain 2 and the complete domain 3. Constructs containing only individual domains were originally designed; however, their proteins did not exhibit activity. This may reflect inaccurate residue boundaries or an incorrect folding of the single domains in the absence of adjacent ones. Representative data from each assay are shown in Figure 6c,d and are represented as a ratio of the UDP or GDP measured from reactions containing the indicated GTs relative to the full-length protein-catalyzed hydrolysis, and negative controls in which no enzyme was added.

Figure 6.

Figure 6

Hydrolysis of UDP- and GDP-sugars by recombinant A111/114R and truncated constructs. (a) SDS-PAGE analysis of the expressed proteins: full length recombinant MBP-A111/114R (144 kDa), and truncated variants MBP-A111/114R (1–397) (87 kDa), MBP-A111/114R (266–860) (110 kDa), and MBP-A111/114R (391–860) (96 kDa) were eluted from an amylose column and resolved by SDS/PAGE with Coomassie blue staining. BenchMarkTM pre-stained protein ladder and recombinant proteins were separated on a 4–20% tris-glycine gel. (b) Cartoon renderings of the full-length A111/114R protein and truncated versions are color coordinated by three putative domains (D1, D2, D3), each corresponding to a different nucleotide sugar donor. White regions denote omitted sections of A111/114R. Shortened constructs are defined by their residues in the left column. (c and d) Representative data from hydrolysis assays shown as a ratio of the UDP (c) or GDP (d) measured from reactions containing the indicated GTs relative to the negative controls where no enzyme was added. The release of UDP and GDP was detected by the UDP-GloTM assay and GDP-GloTM assay, respectively. GDP-hydrolysis from GDP-Fuc was significantly elevated in the presence of UDP-Gal and UDP-Xyl with A111/114R (266–860) and A111/114R (391–860). Data are representative from three replicates, and error bars represent standard deviation.

Analysis of UDP/GDP-hydrolysis by the A111/114R constructs reported in Figure 6b were especially revealing in regard to A111/114R domain assignments. Indeed, starting with the A111/114R (1–397) construct, the UDP molecules were detected only when UDP-Gal was used (Figure 6c). Given A111/114R-D2 is half omitted in A111/114R (1–397), hydrolysis of UDP-Gal implies A111/114R-D1 is a GalT. In agreement, no UDP was produced from UDP-Gal containing reactions involving A111/114R constructs devoid of the first domain, A111/114R (266–860) or A111/114R (391–860). This finding is in agreement with the bioinformatic data.

Hydrolysis of UDP-Xyl by A111/114R (266–860) suggests either A111/114R-D2 or A111/114R-D3 has XylT activity. However, UDP-Xyl was not detected in reactions involving A111/114R (1–397) or A111/114R (391–860), both of which are devoid of a complete D2 (Figure 6b). This suggests that A111/114R-D2 (260–560) harbors the XylT activity. GDP-Fuc hydrolysis was detected in reactions involving A111/114R (266–860) and A111/114R (391–860) exclusively (Figure 6d). These results suggest A111/114R-D3 is a FucT. Notably, GDP-Fuc reactions supplemented with UDP-Gal and UDP-Xyl showed elevated levels of liberated GDP, especially in the presence of A111/114R (266–860). This could be a result of improved protein folding allowed by the extension of residues and co-presence of the nucleotide sugars.

Finally, to evaluate the residues of A111/114R involved in hydrolytic activity, we constructed Ala mutants by site-directed mutagenesis (SDM) (GenScript) to target amino acids from each domain predicted to be involved in nucleotide–sugar or metal–ion binding. Three mutants were expressed, each containing two Ala substitutions inside separate domains. A111/114R-D1, -D2, and -D3 SDM constructs contained Ala mutants of Asp-85 and Gln-194, Asp-351 and Asp-353, and Arg-693 and Lys-759, respectively. In the presence of UDP-Xyl, UDP-Gal, GDP-Fuc, and Glc, the SDM of D1, D2, and D3 resulted in a significant reduction in UDP-sugar hydrolysis, lowering the activity by 95%, 90%, and 80%, respectively (Figure S3). Likewise, in the presence of the same nucleotide–sugars, GDP-Fuc hydrolysis was reduced by 90%, 70%, and 96%, respectively. This dramatic reduction in detectable nucleotide–sugar hydrolysis supports the idea that these residues are critical for enzyme activity, and that A111/114R functions best when all three domains are active.

4. Conclusions

The giant chloroviruses continue to challenge our understanding of canonical metabolic pathways in host–virus interplay. The identification of the atypical N-glycan structure attached to the PBCV-1 Vp54 has led to the characterization of virus-encoded enzymes involved in glycosylation independent of host-derived ER and Golgi GTs. The N-glycan’s core pentasaccharide conserved among the chloroviruses is especially noteworthy and prompted us to investigate candidate virus-encoded GTs involved in the assemblage of part or all the conserved core oligosaccharide. Results in this study establish that PBCV-1 encoded protein A111/114R has three GT domains of approximately 300 aa each. Evidence from a combination of amino acid alignments, three dimensional renderings, and GT assays indicate that the N-terminal (1 to 260 aa), central (261 to 559 aa), and the C-terminal (560 to 860 aa) regions resemble a GalT, a XylT, and a FucT, respectively. The three-dimensional protein models built by Phyre2 are predictions and were used only for the purpose of identifying potential individual domains. As with all methods for protein modeling prediction, caution should be exercised when evaluating structural elements of new enzymes that lack homology to currently deposited structures in the Protein Data Bank archives. In fact, since the percent identity between the various domains and the structures on which they were modeled was low, it was not possible to model the various domains accurately. Eventually individual protein domains must be solved by biochemical and crystallographic methods to fully reveal their catalytic mechanism.

Preliminary evidence suggests that A111/114R-D2 has XylT activity and, if biochemically confirmed, presents a new structural class of XylTs based on its limited resemblance to known XylTs. Moreover, A111/114R would be the second three-domain protein encoded by PBCV-1 that is involved in glycan synthesis. Indeed, recent studies showed that PBCV-1 protein A064R (638 aa) has three functional domains; the first two are GTs (β-L-rhamnosyltransferase and α-L-rhamnosyltransferase, respectively) and the third is a methyltransferase that methylates O-2 of the terminal α-L-Rha residue [30].

It is known that many of the chlorovirus genes encode enzymes involved in various aspects of carbohydrate metabolism. However, it remains unclear how the virus-encoded proteins are involved in the synthesis and/or assembly of the Vp54 glycan. For example, are the sugars added to Vp54 sequentially or are they synthesized independently of Vp54, possibly on a lipid carrier, and then attached to the protein en bloc? A slight variation of these two possibilities is to synthesize a core glycan(s) independently of the protein and attach it to Vp54. Additional experiments will be required to address this issue.

Importantly, the results described herein provide support that the synthesis of the PBCV-1 core glycan structure, or at least part of it, is accomplished with a multidomain enzyme encoded by the virus itself. This finding is in line with the finding that another GT of PBCV-1, the protein A064R, is able to elongate the viral glycan with two units of Rha and to methylate the ultimate unit at O-2. Taking these finding together, the dogma that all viruses use host enzymes to glycosylate their proteins is further subverted.

Acknowledgments

We thank Fatima Maitham Al-Sammak for her technical support with experiments.

Supplementary Materials

The following are available online at https://www.mdpi.com/1999-4915/13/1/87/s1, Figure S1: Amino acid sequence of chlorovirus PBCV-1 encoded protein A111/114R, Figure S2: Amino acid sequence alignment of A111/114R-D1 and annotated GalTs, Figure S3: Hydrolysis of UDP- and GDP-sugars by recombinant A111/114R and SDM constructs, Table S1: Glycosyltransferase homologs of A111/114R domains obtained using Phyre2 analysis.

Author Contributions

Conceptualization, E.N., G.A.D., C.D.C., and J.L.V.E.; methodology, E.N., A.N., and I.S.; software, E.N., A.N., and I.S.; validation, E.N.; formal analysis, E.N.; investigation, E.N., A.N., and I.S.; data curation, E.N., A.N., and I.S.; writing—original draft preparation, E.N.; writing—review and editing, E.N., A.N., I.S., G.A.D., C.D.C., and J.L.V.E.; supervision, project administration, resources, funding acquisition, C.D.C. and J.L.V.E. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded in part by the National Science Foundation under grant no. 1736030 (JVE), the National Science Foundation Graduate Research Fellowship Program under grant no. 2505060195001 (EN), and the Mizutani Foundation under grant no. 180047.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article or supplementary material.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Bieberich E. Glycobiology of the Nervous System. Springer; Berlin/Heidelberg, Germany: 2014. Synthesis, Processing, and Function of N-Glycans in N-Glycoproteins; pp. 47–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Vigerust D.J., Shepherd V.L. Virus Glycosylation: Role in Virulence and Immune Interactions. Trends Microbiol. 2007;15:211–218. doi: 10.1016/j.tim.2007.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Van Etten J.L., Gurnon J.R., Yanai-Balser G.M., Dunigan D.D., Graves M.V. Chlorella Viruses Encode Most, If Not All, of the Machinery to Glycosylate Their Glycoproteins Independent of the Endoplasmic Reticulum and Golgi. Biochim. Biophys. Acta (BBA) Gen. Subj. 2010;1800:152–159. doi: 10.1016/j.bbagen.2009.07.024. [DOI] [PubMed] [Google Scholar]
  • 4.Van Etten J.L., Agarkova I., Dunigan D.D., Tonetti M., De Castro C., Duncan G.A. Chloroviruses Have a Sweet Tooth. Viruses. 2017;9:88. doi: 10.3390/v9040088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Van Etten J.L., Agarkova I.V., Dunigan D.D. Chloroviruses. Viruses. 2019;12:20. doi: 10.3390/v12010020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Speciale I., Duncan G.A., Unione L., Agarkova I.V., Garozzo D., Jimenez-Barbero J., Lin S., Lowary T.L., Molinaro A., Noel E., et al. The N-Glycan Structures of the Antigenic Variants of Chlorovirus PBCV-1 Major Capsid Protein Help to Identify the Virus-Encoded Glycosyltransferases. J. Biol. Chem. 2019;294:5688–5699. doi: 10.1074/jbc.RA118.007182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Nandhagopal N., Simpson A.A., Gurnon J.R., Yan X., Baker T.S., Graves M.V., Van Etten J.L., Rossmann M.G. The Structure and Evolution of the Major Capsid Protein of a Large, Lipid-Containing DNA Virus. Proc. Natl. Acad. Sci. USA. 2002;99:14758–14763. doi: 10.1073/pnas.232580699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.De Castro C., Klose T., Speciale I., Lanzetta R., Molinaro A., Van Etten J.L., Rossmann M.G. Structure of the Chlorovirus PBCV-1 Major Capsid Glycoprotein Determined by Combining Crystallographic and Carbohydrate Molecular Modeling Approaches. Proc. Natl. Acad. Sci. USA. 2018;115:E44–E52. doi: 10.1073/pnas.1613432115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.De Castro C., Molinaro A., Piacente F., Gurnon J.R., Sturiale L., Palmigiano A., Lanzetta R., Parrilli M., Garozzo D., Tonetti M.G., et al. Structure of N-Linked Oligosaccharides Attached to Chlorovirus PBCV-1 Major Capsid Protein Reveals Unusual Class of Complex N-Glycans. Proc. Natl. Acad. Sci. USA. 2013;110:13956–13960. doi: 10.1073/pnas.1313005110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wieland F., Heitzer R., Schaefer W. Asparaginylglucose: Novel Type of Carbohydrate Linkage. Proc. Natl. Acad. Sci. USA. 1983;80:5470–5474. doi: 10.1073/pnas.80.18.5470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Mengele R., Sumper M. Drastic Differences in Glycosylation of Related S-Layer Glycoproteins from Moderate and Extreme Halophiles. J. Biol. Chem. 1992;267:8182–8185. doi: 10.1016/S0021-9258(18)42424-6. [DOI] [PubMed] [Google Scholar]
  • 12.Schreiner R., Schnabel E., Wieland F. Novel N-Glycosylation in Eukaryotes: Laminin Contains the Linkage Unit Beta-Glucosylasparagine. J. Cell Biol. 1994;124:1071–1081. doi: 10.1083/jcb.124.6.1071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Gross J., Grass S., Davis A.E., Gilmore-Erdmann P., Townsend R.R., Geme J.W.S. The Haemophilus Influenzae HMW1 Adhesin Is a Glycoprotein with an Unusual N-Linked Carbohydrate Modification. J. Biol. Chem. 2008;283:26010–26015. doi: 10.1074/jbc.M801819200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Quispe C.F., Esmael A., Sonderman O., McQuinn M., Agarkova I., Battah M., Duncan G.A., Dunigan D.D., Smith T.P., De Castro C. Characterization of a New Chlorovirus Type with Permissive and Non-Permissive Features on Phylogenetically Related Algal Strains. Virology. 2017;500:103–113. doi: 10.1016/j.virol.2016.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Speciale I., Agarkova I., Duncan G.A., Van Etten J.L., De Castro C. Structure of the N-Glycans from the Chlorovirus Ne-Jv-1. Antonie Van Leeuwenhoek. 2017;110:1391–1399. doi: 10.1007/s10482-017-0861-3. [DOI] [PubMed] [Google Scholar]
  • 16.De Castro C., Speciale I., Duncan G., Dunigan D.D., Agarkova I., Lanzetta R., Sturiale L., Palmigiano A., Garozzo D., Molinaro A., et al. N-Linked Glycans of Chloroviruses Sharing a Core Architecture without Precedent. Angew. Chem. 2016;55:654–658. doi: 10.1002/anie.201509150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Landstein D., Burbank D.E., Nietfeldt J.W., Van Etten J.L. Large Deletions in Antigenic Variants of the Chlorella Virus PBCV-1. Virology. 1995;214:413–420. doi: 10.1006/viro.1995.0051. [DOI] [PubMed] [Google Scholar]
  • 18.Kelley L.A., Mezulis S., Yates C.M., Wass M.N., Sternberg M.J.E. The Phyre2 Web Portal for Protein Modeling, Prediction and Analysis. Nat. Protoc. 2015;10:845–858. doi: 10.1038/nprot.2015.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Söding J., Biegert A., Lupas A.N. The Hhpred Interactive Server for Protein Homology Detection and Structure Prediction. Nucl. Acids Res. 2005;33(Suppl. 2):W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Altschul S.F., Madden T.L., Schäffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J. Gapped Blast and Psi-Blast: A New Generation of Protein Database Search Programs. Nucl. Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Culbertson A.T., Ehrlich J.J., Choe J.-Y., Honzatko R.B., Zabotina O.A. Structure of Xyloglucan Xylosyltransferase 1 Reveals Simple Steric Rules That Define Biological Patterns of Xyloglucan Polymers. Proc. Natl. Acad. Sci. USA. 2018;115:6064–6069. doi: 10.1073/pnas.1801105115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Haltiwanger R.S., Yu H., Takeuchi M., LeBarron J., Kantharia J., London E., Bakker H., Li H., Takeuchi H. Regulation of Notch Signaling by O-Glucosylation: Notch-Modifying Xylosyltransferase-Substrate Complexes Support an Sni-Like Retaining Mechanism. FASEB. 2016;30(Suppl. 1):624.3. [Google Scholar]
  • 23.Gibbons B.J., Roach P.J., Hurley T.D. Crystal Structure of the Autocatalytic Initiator of Glycogen Biosynthesis, Glycogenin. J. Mol. Biol. 2002;319:463–477. doi: 10.1016/S0022-2836(02)00305-4. [DOI] [PubMed] [Google Scholar]
  • 24.Persson K., Ly H.D., Dieckelmann M., Wakarchuk W.W., Withers S.G., Strynadka N.C. Crystal Structure of the Retaining Galactosyltransferase Lgtc from Neisseria Meningitidis in Complex with Donor and Acceptor Sugar Analogs. Nat. Struct. Biol. 2001;8:166–175. doi: 10.1038/84168. [DOI] [PubMed] [Google Scholar]
  • 25.Campbell J.A., Davies G.J., Bulone V., Henrissat B. A Classification of Nucleotide-Diphospho-Sugar Glycosyltransferases Based on Amino Acid Sequence Similarities. Biochem. J. 1997;326:929–939. doi: 10.1042/bj3260929u. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Jiang Y.-L., Jin H., Yang H.-B., Zhao R.-L., Wang S., Chen Y., Zhou C.-Z. Defining the Enzymatic Pathway for Polymorphic O-Glycosylation of the Pneumococcal Serine-Rich Repeat Protein Psrp. J. Biol. Chem. 2017;292:6213–6224. doi: 10.1074/jbc.M116.770446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Clarke B.R., Ovchinnikova O.G., Sweeney R.P., Kamski-Hennekam E.R., Gitalis R., Mallette E., Kelly S.D., Lowary T.L., Kimber M.S., Whitfield C. A Bifunctional O-Antigen Polymerase Structure Reveals a New Glycosyltransferase Family. Nat. Chem. Biol. 2020;16:450–457. doi: 10.1038/s41589-020-0494-0. [DOI] [PubMed] [Google Scholar]
  • 28.Ninomiya T., Sugiura N., Tawada A., Sugimoto K., Watanabe H., Kimata K. Molecular Cloning and Characterization of Chondroitin Polymerase from Escherichia Coli Strain K4. J. Biol. Chem. 2002;277:21567–21575. doi: 10.1074/jbc.M201719200. [DOI] [PubMed] [Google Scholar]
  • 29.Sun H.-Y., Lin S.-W., Ko T.-P., Pan J.-F., Liu C.-L., Lin C.-N., Wang A.H.-J., Lin C.-H. Structure and Mechanism of Helicobacter Pylori Fucosyltransferase a Basis for Lipopolysaccharide Variation and Inhibitor Design. J. Biol. Chem. 2007;282:9973–9982. doi: 10.1074/jbc.M610285200. [DOI] [PubMed] [Google Scholar]
  • 30.Speciale I., Laugieri M.E., Noel E., Lin S., Lowary T.L., Molinaro A., Duncan G.A., Agarkova I.V., Garozzo D., Tonetti M.G. Chlorovirus PBCV-1 Protein A064R Has Three of the Transferase Activities Necessary to Synthesize Its Capsid Protein N-Linked Glycans. Proc. Natl. Acad. Sci. USA. 2020;117:28735–28742. doi: 10.1073/pnas.2016626117. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

Data is contained within the article or supplementary material.


Articles from Viruses are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES