Abstract
Tc toxin is an exotoxin composed of three subunits named TcA, TcB and TcC. Structural analysis revealed that TcA can form homopentamer that mediates the cellular recognition and delivery processes, thus contributing to the host tropism of Tc toxin. N-glycans and heparan sulfates have been shown to act as receptors for several Tc toxins. Here, we performed two independent genome-wide CRISPR-Cas9 screens, and have validated glycans and sulfated glycosaminoglycans (sGAGs) as Tc toxin receptors also for previously uncharacterized Tc toxins. We found that TcdA1 form Photorhabdus luminescens W14 (TcdA1W14) can recognize N-glycans via the RBD-D domain, corroborating previous findings. Knockout of N-glycan processing enzymes specifically blocks the intoxication of TcdA1W14-assembled Tc toxin. On the other hand, our results showed that sGAG biosynthesis pathway is involved in the cell surface binding of TcdA2TT01 (TcdA2 from P. luminescens TT01). Competition assays and biolayer interferometry demonstrated that the sulfation group in sGAGs is required for the binding of TcdA2TT01. Finally, based on the conserved domains of representative TcA proteins, we have identified 1,189 putative TcAs from 1,039 bacterial genomes. These TcAs are categorized into five subfamilies. Each subfamily shows a good correlation with both genetic organization of the TcA protein(s) and taxonomic origin of the genomes, suggesting these subfamilies may utilize different mechanisms for cellular recognition. Taken together, our results support the previously described two different binding modalities of Tc toxins, leading to unique host targeting properties. We also present the bioinformatics data and receptor screening strategies for TcA proteins, provide new insights into understanding host specificity and biomedical applications of Tc toxins.
Author summary
The Toxin complexes, also referred to as Tc toxins, are a family of A5BC exotoxins widely distributed among Gram-negative and positive bacteria. First identified in Entomopathogenic bacteria as key virulence factors to combat insect hosts, putative Tc toxin loci are also encoded by a range of human pathogens such as Salmonella and Yersinia. Previous studies indicated that several Tc toxins can target invertebrate and vertebrate cells via binding with N-glycans and heparan sulfates. Here our genome-wide CRISPR-Cas9 screens validated that different Tc toxins utilized distinct receptors for the adhesion to their targets, which is determined by TcA homopentamer. For example, TcdA1 from Photorhabdus luminescens W14 (TcdA1W14) relies on N-glycan binding to exert its toxic effects, while sulfate groups of sulfated glycosaminoglycans are critical for the cell targeting of other TcAs such as TcdA2TT01 (TcdA2 from P. luminescens TT01). Consistent with the previously described different binding modalities of Tc toxins, our results confirm that the receptor selectivity of TcAs contribute to the cellular tropism of Tc toxins. Furthermore we has also identified 1,189 TcA homologues and categorized them into five subfamilies. Each TcA subfamily shows a good correlation with the taxonomic origin of the genomes, suggesting these subfamilies are linked to diverse host tropisms via different binding modalities. Together, our findings provide mechanistic insights into understanding host specificity of distinct Tc toxins and the development of therapeutics for Tc toxin-related infections, as well as the adaptation of Tc-injectisomes as potential biotechnology tools and pest-control weapons.
Introduction
Bacterial pathogens deploy a range of toxins to combat the host immune system, and favor the microbial infection [1]. These toxins can manipulate host cell signaling pathways, induce cell death by damaging the cytoplasmic membrane or cytoskeleton, or modify host proteins such as Rho GTPase [2–4]. Well-characterized toxins include the anthrax toxin from Bacillus anthracis, Exotoxin A and Exotoxin B produced by Clostridium difficile, Staphylococcus aureus-derived staphylococcal enterotoxin, etc. [5–7].
The insecticidal Tc toxins, first identified in entomopathogenic bacteria, are hetero-multimeric exotoxins composed of TcA, TcB and TcC subunits [8]. High-resolution structures of example Tc toxins reveal that the TcA assembles as a bell-shaped pentamer which mediates target cell recognition and effector domain delivery, while TcB and TcC form a cocoon-like structure that catalyzes the autocleavage of the toxic TcC C-terminal domain [9–14]. The released TcC C-terminal domain, is translocated into the target cell cytoplasm by an injectosome-like mechanism, leading to the toxic effect [9,15–17]. For example, PTC3 from Photorhabdus luminescens W14, composed of TcdA1, TcdB2 and TccC3, induces actin polymerization in G. mellonella hemocytes, due to the adenosine diphosphate (ADP)-ribosyltransferase activity of the TccC3 C-terminal fraction [15]. Moreover, only 0.5 nM of PTC3 is sufficient to cause the intoxication of target cells, indicating the Tc toxin functions as a potent virulence factor to facilitate bacterial infections [9].
Tc toxin homologues can encode a variety of diverse putatively toxic domains, thus can be classified as a polymorphic toxin system (PTS). From publicly available bacterial genomes in GenBank, our previous study identified 1,608 Tc loci encoded in 1,421 bacterial genomes, which are widely distributed among Gram-negative and positive bacteria (http://www.mgc.ac.cn/dbTC/). Further analysis indicated that the Tc toxin TcC subunit harbors a hypervariable region (HVR) at its C-terminus, representing the toxic “bullet” of the system. Nearly 200 distinct HVR clusters have been characterized among the 2,528 TcC protein homologues. Moreover, in addition to the previously mentioned entomopathogenic bacteria, Tc toxins are widely encoded in many diverse pathogenic bacteria, such as Salmonella enterica and Yersinia pestis [11,18,19]. This indicates that Tc toxins represent a polymorphic toxin system that can target various eukaryotic hosts and facilitate diverse pathogenic mechanisms.
Though the high-resolution cryo-EM structure and phylogenetic distribution of Tc toxins have been studies, little is known about the cellular recognition specificities of Tc toxins. As host cell-surface glycans have been previously reported to serve as ligands for various toxins, two independent studies utilized comprehensive glycan microarray screens in attempts to identify potential receptor ligands that may contribute to the cell surface binding Tc toxins [20,21]. Piper et al. showed that YenTc, as well as the Chi1 and Chi2 subunits, can bind with a range of glycan structures that incorporate galactose, glucose, N-acetylgalactosamine and N-aceytylglucosamine motifs, suggesting that glycans play an important role in cellular recognition by YenTc [20]. Meanwhile, Roderer et al. focused on the screening of receptors of different TcAs from insect and human pathogens, demonstrating that Photorhabdus luminescens W14-derived TcdA1 can bind with glycan epitopes such as trisaccharide Lewis X, the tetrasaccharides Lewis Y and sialyl-Lewis X. They also reported that both Morganella morganii TcdA4 and Xenorhabdus nematophila XptA1 interact with negatively charged heparins [21]. Although both these studies characterized the glycans that interact with Tc toxins, it is still unclear which host factors/pathways are critical to allow intoxication by Tc toxins. To elucidate this issue, we performed the genome-wide CRISPR-Cas9 screens using Tc toxin PTC3-induced cytotoxicity, and determined two sets of host genes that are involved in the action of different Tc toxins. In addition, based on the Tc database we previously established, we have analyzed all available TcA homologues from both entomopathogenic and human-pathogenic bacteria, and classified them into 5 subfamilies, which show a good correlation with the taxonomic origin of the genomes. Taken together, our study provides an effective strategy for future screening of Tc toxin receptors, and that analysis of the TcA database offers the potential to decipher the host specificity and possible future biomedical applications of Tc toxins.
Results
N-linked Glycans are required for the cellular recognition by the TcdA1W14 pentamer
Though first identified as an insecticidal toxin, the activities of Tc toxins have been studied in a variety of mammalian cells [9,15,18]. For example, PTC3W14, a Tc toxin derived from entomopathogenic bacteria P. luminescens W14, can target both HeLa and HEK293T cells, and induce aberrant actin polymerization, suggesting that the receptor(s) of this Tc toxin are ubiquitously expressed in both invertebrate and vertebrate cells [9,15]. PTC3W14 is composed of TcdA1 (hereinafter referred to as TcdA1W14), TcdB2 and TccC3, corresponding to the TcA, TcB and TcC subunits respectively [15]. Considering the TcdA1W14 pentamer has been proposed as mediating toxin binding to cell surfaces, we carried out a genome-wide CRISPR–Cas9 mediated knock-down screening with the aim of identifying the host factor(s) involved in TcdA1W14 interaction.
PTC3W14, assembled from TcdA1W14 and P. luminescens W14-derived functionally active TcdB2-TccC3 fusion proteins, was recombinantly expressed in E. coli as described previously (S1A Fig) [15]. Both gel filtration chromatography and negative-stain electron microscopy showed that PTC3W14 was successfully prepared (S1A–S1C Fig). We first assessed the sensitivity of a series of mammalian cell lines to PTC3W14 treatment. Five mammalian cells including HEK293T, HeLa, Vero, L929 and MEF, were exposed to a titration of PTC3W14 for 24 h, and the percentage of surviving cells was assayed by CCK-8. As shown in Fig 1A, all the tested cell lines are sensitive to PTC3W14. HeLa cells were one of the most sensitive lines, with an IC50 of approximately 1 nM. Therefore, we elected to use HeLa cells to perform the CRISPR/Cas9 knock-out screens. HeLa cells stably expressing Cas9 (HeLa-Cas9) were generated and transduced with a lentiviral single guide RNA (sgRNA) library (GeCKO v.2), which targets 19,050 protein-coding genes and 1,864 microRNAs in the human genome [22]. Both HeLa-Cas9 cells and HeLa-Cas9 cells transduced with a non-targeting sgRNA (HeLa-Cas9-sgCon) showed similar sensitive to PTC3W14 when compared with wild-type HeLa cells (S1D Fig). We also noted that treatment with 5 nM PTC3W14 for 8 h is sufficient to induce robust actin polymerization and membrane blebbing in HeLa-Cas9-sgCon cells (Fig 1B). Following three rounds of selection with PTC3W14 (at 5, 10 and 20 nM, Fig 1C), the surviving transduced HeLa-Cas9 cells exhibited robust resistance to PTC3W14 intoxication, suggesting that key genes involved in the PTC3W14 mode of action had been successfully ablated.
The relevant sgRNA targeted genes were identified using next-generation sequencing (NGS) of the population of PTC3W14-resistant cells, and ranked according to the p value of MAGeCK analysis (y axis) (Fig 1D and S1 and S2 Tables). The majority of top-ranked genes fall into the N-linked Glycan (N-glycan) synthesis pathway, and include enzymes that convert oligo-mannose to galactose-containing complex N-linked glycans, such as Mannosyl-oligosaccharide 1,2-α-mannosidase 1B (MAN1A2), α-1,3-mannosyl-glycoprotein 2-β-N-acetylglucosaminyltransferase (MGAT1) and α-1,6-mannosyl-glycoprotein 2-β-N-acetylglucosaminyltransferase (MGAT2) [23]. To validate the results of the PTC3 screening, we next generated MGAT1-, MGAT2- and MAN1A2-KO HeLa cells, respectively (Fig 1E). All these KO cell lines showed increased resistance to PTC3W14 treatment compared with control cells (Figs 1F–1H and S1D). Furthermore, we also utilized kifunensine, as a potent and selective inhibitor of class I α-mannosidases, to pharmacologically block the synthesis of N-glycans without interfering with cell viability [24,25]. Consistently, pretreatment with kifunensine was also shown to significantly inhibit intoxication by PTC3W14 (Fig 1I and 1J). In addition, knockout of O-linked glycan (O-Glycan) genes such as core 1 glycoprotein-N-acetylgalactosamine 3-β-galactosyltransferase 1 (C1GalT1) did not show any effect on PTC3W14-induced membrane blebbing and cell death (S1E and S1F Fig). Taken together with the structural data showing that glycan interacts with TcdA1W14 [21], these data demonstrated that N-glycans are indeed required for the cell targeting of TcdA1W14 pentamer, mediating the cellular recognition process.
The receptor binding domain-D (RBD-D) of TcdA1W14 mediates the interaction with N-glycans
We next investigated whether N-glycans mediate the cell binding of other TcA homologues. Based on the sequence alignment, we found that TcdA1 from P. luminescens TT01 (hereinafter referred to as TcdA1TT01), shares significant similarity with TcdA1W14, with the exception of the receptor binding domain-D (RBD-D), a putative TcA receptor binding domain (also referred to as TcA_RBD, Pfam:PF18518, 1637–1772 aa of TcdA1TT01) (S2A Fig). We therefore purified the Tc toxin, PTC3TT01, which is assembled using the TcdA1TT01 and P. luminescens W14-derived TcdB2-TccC3 fusion proteins (S2B Fig). Both actin polymerization and membrane blebbing was again observed in PTC3TT01-treated HeLa-Cas9 cells, although it was less pronounced when compared with TcdA1W14-assembled PTC3W14 (Figs 2A, 2B and S2C). Intriguingly, neither knockout of glycan-related genes nor kifunensine treatment was able to block intoxication by PTC3TT01 (Figs 2C and 2D and S2D and S2E), suggesting that the cellular recognition of PTC3TT01 is independent of N-glycans. Considering that RBD-D is the major structural difference between TcdA1W14 and TcdA1TT01, these data indicated that the RBD-D of TcdA1W14 is directly responsible for N-glycans-dependent intoxication, consistent with the previously report showing Lewis antigens can bind to RBD-D [21].
To examine whether RBD-D is sufficient for N-glycans-dependent intoxication, we generated a mutated TcdA1TT01 by replacing the RBD-D with that of TcdA1W14, and vice versa (Figs 2E and S3A–S3D). This mutated TcdA1TT01 (TcdA1TT01-RBDW14) was able to trigger robust membrane blebbing in control HeLa cells comparable to that seen for the wild-type TcdA1W14 complex (Figs 2F and S3E). Importantly, it was also not able to intoxicate N-glycan-deficient cells (Figs 2G and S3F). In contrast, the mutated TcdA1W14 (TcdA1W14-RBDTT01), which encodes the RBD-D of TcdA1TT01, showed equal toxigenic effects in both control HeLa cells (sgCon) and N-glycan-deficient cells (Figs 2G and S3F). Taken together, these results confirm that the RBD-D of TcdA1W14 can specifically mediate the N-glycans-dependent intoxication.
The Trp1721 and Lys1738 of TcdA1W14 RBD-D are critical for N-glycan-dependent intoxication
Since the RBD-D of TcdA1W14 directly mediates binding to N-glycans [21], we screened other Tc toxin homologue sequences for those with the potential to bind N-glycans using TcA sequence alignment. We found that the RBD-D of TcdA2 from P. luminescens TT01 (TcdA2TT01) shares 88% identity with that of TcdA1W14 (S4A Fig). We therefore examined whether the RBD-D of TcdA2TT01 can also mediate N-glycan-dependent intoxication. To compare with the RBD-D of TcdA1W14, we generated a mutated TcdA1W14 by replacing the wild-type RBD-D with that of TcdA2TT01, referred to as TcdA1W14-RBDA2 (Fig 3A). As shown in Figs 3B and 3C and S4B and S4C, TcdA1W14-RBDA2 could also trigger membrane blebbing in control HeLa cells, but again not in N-glycan-deficient cells. However, the intoxication of TcdA1W14-RBDA2 was much weaker than that of TcdA1W14. These data suggest that the RBD-D of TcdA2TT01 does indeed possess the capacity for N-glycan binding, but that the toxic effect was not as strong as that of TcdA1W14, when in the sequence context of the rest of the TcdA1W14 protein. Alternatively, it is possible that the RBD-D of TcdA2TT01 sequence does not possess certain key amino acid residue(s) for mediating a strong interaction with N-glycans.
We next analyzed the RBD-D sequences of TcdA1W14 and TcdA2TT01, and found that most of the substitution differences did not lead to a change in the conserved amino acid properties, with the exception of two, Trp1721 and Lys1738 (Figs 3D and S4A). Both Trp1721 and Lys1738 of TcdA1W14 were substituted by Serine in TcdA2TT01. According to previous reports investigating the interaction between N-glycans and C-type lectins, the galactose residues of N-glycans can pack alongside the side chain of aromatic amino acids such as Tryptophan, while fucose residues tend to make contact with positive-charged amino acids including Arginine and Lysine [26]. Therefore, we hypothesized that Trp1721 and Lys1738 may be important for N-glycan-dependent intoxication. We therefore performed a mutagenesis study to evaluate the effect on intoxication of TcdA1W14 with W1721S and K1738S substitutions (S4D and S4E Fig). As shown in Fig 3E and 3F, both TcdA1W14-W1721S and TcdA1W14-K1738S substitutions reduce PTC3 intoxication effects. Moreover, the TcdA1W14-W1721S/K1738S double mutant showed much weaker effect on the membrane blebbing phenotype than that induced by the wild-type TcdA1W14, suggesting that both amino acid residues contribute to the binding of N-glycans. Taken together, these results confirm that the RBD-D of TcdA1W14 mediates the N-glycan-dependent intoxication, and identified two key amino acid residues that contribute to the interactions. This also suggests that the N-glycan binding mode of TcdA1W14 resembles that observed for C-type lectins.
Sulfated glycosaminoglycan (sGAG) mediates intoxication by a TcdA2TT01-based toxin
Intriguingly, although the RBD-D of TcdA2TT01 can only weakly bind to N-glycans, a TcdA2TT01 based complex (PTC3A2-TT01) showed a robust effect on cell death, which was only partially blocked in N-glycan-deficient cells (Fig 4A and 4B). Similarly, kifunensine did not significantly protect cells against PTC3A2-TT01 (Fig 4C and 4D). Thus, we suspected that other mechanisms may be involved in the cellular recognition of TcdA2TT01. To explore the alternative pathway(s) involved in TcdA2TT01 recognition, we performed further genome-wide CRISPR-Cas9 screens. A lentiviral sgRNA library was transduced into HeLa-Cas9 cells as described in Fig 1C. To exclude any interference from potential N-glycan-mediated intoxication, HeLa cells were pre-treated with kifunensine before exposure to PTC3A2-TT01. After three rounds of selection with PTC3A2-TT01 (5, 10 and 20 nM, S5A Fig), the resistant population of HeLa-Cas9 cells was again analyzed using next-generation sequencing (NGS).
Very few genes involved in N-glycan synthesis were identified, with the top-ranked genes seen to encode key regulators involved in sulfated glycosaminoglycan (sGAG) biosynthesis pathways (Fig 4E and S1 and S3 Tables). These genes include enzymes involved in sulfated glycosaminoglycan (sGAG) synthesis, such as β-1,3-galactosyltransferase 6 (B3GALT6) and β-1,3-glucuronyltransferase 3 (B3GAT3), proteins that facilitate the sulfation of GAGs, such as N-deacetylase and N-sulfotransferase 1 (NDST1), and solute carrier family 35 member B2 (SLC35B2), as well as glycosyltransferases that are specifically required for the elongation of the heparan sulfate chain, such as exostosin like glycosyltransferase 3 (EXTL3). To validate this result, three representative sGAG-related genes, B3GAT3, NDST1, and EXTL3, were knocked-down in order to block the sGAG synthesis (Fig 4F). After preincubating with kifunensine, these sGAG-deficient cell lines showed increased resistance to the intoxication by PTC3A2-TT01, compared to the control cells (Fig 4G–4I). Taken together, these data demonstrated that in addition to N-glycans, sulfated glycosaminoglycans also mediate intoxication by TcdA2TT01-related Tc toxins, and corroborate the previous finding by Roderer et al. showing that diverse Tc toxins interact with heparin/heparan sulfates [21].
The sulfate group of sulfated glycosaminoglycans is critical for mediating intoxication by TcdA2TT01-based toxin
Glycosaminoglycans such as heparan sulfate have previously been implicated in microbial invasion of host cells [27]. For example, heparan sulfate not only functions a receptor/co-receptor for H. pylori VacA cytotoxin, but can also be recognized by Clostridium difficile toxin A, via the interaction with the charged sulfation group [28,29]. Roderer et al. also reported that both Morganella morganii TcdA4 and Xenorhabdus nematophila XptA1 interact with heparin/heparan sulfates [21]. To determine the mechanism underlying the sGAGs-mediated PTC3A2-TT01 intoxication, we investigated whether it relies on the electrostatic interactions between the sulfate group of sGAGs and TcdA2TT01. Surfen (bis-2-methyl-4-amino-quinolyl-6-carbamide) is a small molecule that can neutralize the negative charge of sGAGs. Pretreatment of HeLa cells with surfen significantly reduced the toxicity of PTC3A2-TT01 (Fig 5A and 5B), indicating that negative charges on sGAGs may be critical for the cellular recognition by TcdA2TT01.
Since a variety of sGAGs have been reported to mediate the interaction with toxins, we utilized a series of GAGs to investigate the role of sulfate groups in PTC3A2-TT01 intoxication. These competition assays showed that GAGs which possess the sulfate group, such as heparan sulfate, heparin, chondroitin sulfate and dextran sulfate, were all able to competitively inhibit the toxigenic effect of PTC3A2-TT01 in HeLa cells pretreated with kifunensine, while non-sulfated GAGs such as hyaluronic acid did not block PTC3A2-TT01 and so induced membrane blebbing (Fig 5C and 5D). Furthermore, we examined the effect of two different linkage types of sulfate groups. As shown in Fig 5C and 5D, both N-acetyl-de-O-sulfated heparin and de-N-sulfated heparin showed a similar protection against PTC3A2-TT01 intoxication. Taken together, these data demonstrate that sGAG can mediate PTC3A2-TT01 intoxication, which is dependent on the charged sulfate group.
TcAs of different subfamilies may exhibit different target specificities
Recently, Roderer et al. also reported that TcAs of Morganella morganii and Xenorhabdus nematophila can strongly bind with heparins/heparan sulfates, compared with TcdA1W14 and TcaA-TcaB of Yersinia pseudotuberculosis [21]. Since TcdA2TT01 also exhibits sGAG-dependent cell binding, we next examined whether other P. luminescens-derived TcA proteins could also utilize sGAG to bind mammalian cells. To do so, we quantified the interaction between different TcAs and biotin-conjugated heparin by using biolayer interferometry (BLI). As shown in Fig 5E, the dissociation constants (KD) for TcdA2TT01-heparin is 2.77 nM (kon of 8.44 ± 0.1 × 104 M−1 s−1, and koff of 2.34 ± 0.03 × 10−4 s−1). Other than TcdA2TT01, P. luminescens TcA homologues such as TcdA2W14, TcdA4W14, and TcdA4TT01, also showed strong binding with heparin (Figs 5F and S5B and S5C). In contrast, the binding signals for TcdA1W14 and TcdA1TT01 were much weaker, indicating that the cellular recognition of these two TcAs may not rely on an interaction with heparin. Consistently, knockout of EXTL3 significantly interfered with the action of Tc toxins assembled using TcdA2W14, TcdA4W14, or TcdA4TT01, while only marginally effecting intoxication by those using TcdA1W14- and TcdA1TT01 subunits (Fig 5G and 5H and S2B and S5D). Taken together, these data suggest that the binding of TcAs with sGAG is a more representative cellular recognition mode for P. luminescens Tc toxins.
To further explore the potential binding modes of TcAs from other species, we performed a genome-wide analysis of TcA homologues across multiple bacterial species. Typically, TcAs from P. luminescens represent a single protein that harbors at least three conserved domains, named the N-terminal VRP1 domain (PF03538), the middle Neuraminidase domain (PF18413) and the C-terminal TcA_TcB_BD domain (PF18276) (S6 Fig). Nevertheless, there are functional examples of “split” TcA homologues, such as TccA1 and TccB1 of P. laumondii, and TcaA and TcaB in Y. pseudotuberculosis (S6 Fig). Based on this information, we have curated a collection of 1,189 putative TcA homologues from 1,039 bacterial genomes derived from our open access dbTC database (http://www.mgc.ac.cn/dbTC/). For this set, a valid “TcA” refers to a either a single protein or 2–3 consecutive proteins that together encode(s) all of the three aforementioned conserved domains in the appropriate order. Then, based on a concatenation of the three conserved domains, we constructed a maximum-likelihood phylogenetic tree of the 1,189 predicted TcA protein set (Fig 6).
The majority (98.8%) of identified TcAs fall into five distinct phylogenetic clades, that we designate types I to V, with only 14 outliners which do not fit this classification. Perhaps unsurprisingly, these TcA subfamilies show a good correlation with both genetic organization of the TcA protein(s) and taxonomic origin of the genomes. Type I TcA is the largest and the most variable group, containing genomes from several bacterial species, including Burkholderia, Photorhabdus, and Xenorhabdus. Most members of the type I subfamily are single large proteins that most closely resemble those reported in P. luminescens. Of note, though RBD-D, which mediates binding with N-glycans is specifically required for TcdA1W14, 83.4% (331/397) of the TcA proteins from type I encode the RBD-D (PF18518) (S7 Fig), suggesting that this domain may exhibit more functions other than N-glycans binding.
TcA homologues from all of the other subfamilies generally consist of two different open reading frames encoded in immediate proximity to one-another. Nevertheless, the genetic organization and the total molecular weight of TcAs are clearly different between the types II to V subfamilies (Fig 6). The type II TcAs are highly represented in Yersinia genomes, comprising separate ~90 kDa A-subunits and ~130 kDa B-subunits. Type III TcAs were found to be predominately distributed in genomes from the genus Pseudomonas, with subunit molecular weights of ~100 kDa and ~150 kDa, respectively. The type IV TcAs, which are prevalent in Salmonella, comprise of the largest A and B subunits, which are ~180 kDa and ~160 kDa, respectively. Though very little is known about the cellular recognition mechanisms of these TcAs, the differences in both A and B subunits suggests that the TcAs of these three subfamilies might exhibit distinct receptor specificities.
The Y. entomophaga MH96 TcA subunit proteins YenA and YenB, were categorized into the type V TcA subfamily. Previous reports demonstrated that these TcAs were decorated by two endochitinases (Chi1 and Chi2), which are critical in allowing the toxin complex to overcome the chitin-based barrier immunity seen in the guts of insects [20]. Intriguingly, although this type V subfamily consists of only 16 members (S7 Fig), most of them also harbor chitinase gene(s) immediate downstream of TcA genes, suggesting Tc toxins derived from this type V subfamily may represent potent insecticidal candidates for biological control of plant diseases. Taken together, these results supported the reliability of our classification of the TcA subfamilies, and further implied that each type of TcA may represent specific receptor-binding features given the distinct genetic organizations.
Discussion
Cellular recognition by TcA subunits is a prerequisite for the activity of Tc toxins, and appears to determine host cell specificity. To take an unbiased approach towards identifying mammalian cell targeting mechanisms of different TcAs, we utilized a genome-wide CRISPR library screening strategy. This approached relied on the ability of the toxin complex to kill target cells that continued to express the critical factors that mediate the toxin action, but not to intoxicate those in which these factors were ablated. The intoxication phenotype in HeLa cells is a seen as Tc toxin-induced membrane blebbing and cell death. In this study we have demonstrated two distinct pathways that are required for the action of certain Tc toxins, involving the synthesis of either N-glycans or sGAGs.
In addition, using a bioinformatic analysis, we have identified 1,189 TcA protein homologues from 1,039 bacterial genomes. According to similarities in gene structure and sequence characteristics, we have classified these TcAs into five distinct subfamilies, which we propose likely employ class and species-specific target cell recognition mechanisms. There are several well described examples of bacterial exotoxins that use cell surface glycans as receptors [30]. One of the best-characterized example is the AB5 cholera toxin, produced by Vibrio cholera, which consists of one A subunit and five B subunits [31]. The homopentamer formed by the B subunits binds to the Galβ1-3GalNAc moiety of the GM1 ganglioside, a cell surface glycan receptor, via the C-terminal repetitive domain (CRD) located on the base of the cholera toxin [32]. Moreover, Yamaji et al. previously performed a genome-wide screening for the receptor of the subtilase cytotoxin (SubAB), another AB5 toxin [33]. They reported that knockout of MGAT1 can block the toxic effect of SubAB, suggesting that N-glycans also serve as SubAB receptors. Similar to the AB5 toxins, the TcA pentamer that forms the receptor binding component for the Tc toxins, can also bind N-glycans and facilitate delivery of the TcC toxic domain into the target cell [20,21]. Of note, the molecular species of N-glycan could be vary between different hosts. For example, HepG2 has been reported to express more sialylated glycans than HeLa cells, while MCF-7 cells express less glycans [34]. Our data also showed that the difference between the tested mammalian cell lines in Tc toxin sensitivity, suggests that a further characterization of N-glycans species in toxin-related studies may provide more information about the host tropism of various pathogens.
TcdA1 can bind to BSA-Lewis X glycoconjugate via the RBD-D domain, with a dissociation constant (KD) of 5.8±0.7μM [21], suggesting that the RBD-D domain may contribute to the diverse pattern of host tropism. We therefore performed a sequence alignment of the RBD-D from all 332 RBD-D-containing TcAs (331 in type I superfamily and 1 in Type IV superfamily). Unfortunately, few similarities could be found within this domain (S8 Fig). This is in line with our results that showing that TcdA1TT01, a RBD-D-containing TcA, can function normally in both wild-type and N-glycan-deficient cells. In addition, Piper et al. demonstrated that YenTc, which does not contain a RBD-D domain, can also bind to some classes of glycans [20]. These data suggest that the N-glycan-binding role of RBD-D cannot be applied to all RBD-D-containing TcAs. We further generated an evolution-similarity matrix based on the sequences of 332 RBD-D-containing TcAs. As shown in S9 Fig, the RBD-D similarity showed a good correlation with the taxonomic origin of the genomes, which differed from species to species, suggesting that the RBD-D domains of different TcAs may interact with different host factors. Of note, a putative TcA from the human pathogenic strain Photorhabdus asymbiotica (TcdAPA, WP_012777086) harbors a RBD-D that shares 83% and 81% amino acid identities with those of TcdA1W14 and TcdA2TT01, respectively. Considering that the Tc toxin may contribute to the pathogenesis of P. asymbiotica, it is worth considering if blockade of the N-glycan-binding of TcdAPA could control P. asymbiotica infection. Taken together, these observations indicate that N-glycans can act as receptors for various toxins employing a pentameric recognition and delivery complex. It also suggests that multi-valent synthetic N-glycan ligand/analogues could be used as protective agents against Tc toxin mediated infections. Of note, other than RBD-D, several TcAs such as TcdA1W14, contain three other potential receptor-binding domains (RBD-A, RBD-B, and RBD-C) as well as a neuraminidase-like domain, which may also contribute to the cellular recognition of Tc toxins [11]. For example, the heparin binding site of Xn-XptA1 is located in a cleft formed by RBD-B, RBD-D and the neuraminidase-like domain [21]. Piper et al. also indicated that the neuraminidase-like domain of YenTcA harbors lectin activity, and may bind with cell surface glycans [20]. Thus, these domains may provide alternative mechanisms for the binding of diverse Tc toxins with host cells, although further analysis will be needed to screen the putative ligands that mediate the interactions.
Glycosaminoglycans (GAGs) are complex linear polysaccharides that are ubiquitously expressed at the cell surface, and exploited by a variety of bacterial toxins as essential host cell receptors required for intoxication [35,36]. For example, Heparan sulfate, one type of GAGs, has been reported as a critical host factor that mediates the delivery and translocation of Yersinia outer proteins (Yops) [37]. In addition, Chondroitin sulfate is suggested to play an important role in the toxicity of anthrax toxin, produced by Bacillus anthracis [38]. Both our findings (Fig 5F) and those of Roderer et al. demonstrate that a series of TcAs can bind with sulfated glycosaminoglycans (sGAG), such as heparin [21]. However, the sGAG-binding pocket is not conserved in the Morganella morganii TcdA4 (located in the α-helical domain of the shell) or in the Xenorhabdus nematophila XptA1 (which is located in a gap between the neuraminidase-like domain, RBD B and RBD D). Moreover, although Clostridium difficile toxin A also utilizes sGAGs as host cell receptors, the exact binding sites for sGAGs remains to be determined. Therefore, at this point we cannot reliably determine the potential sGAG-binding region to facilitate the further analysis. In addition, Roderer et al. reported that the binding signals of TcdA1W14 was weaker than the values of Morganella morganii TcdA4 for porcine intestinal mucosa-derived heparin [21], which is in line with our BLI results (Fig 5F). Together with the observation that EXTL3 knockout in HeLa cells showed marginal effects on the action of PTC3W14, it suggested that sGAGs may not be required for the mammalian cell targeting of TcdA1W14. Although the interaction between TcdA1W14 and natural heparin mixtures is weak, TcdA1W14 can indeed bind to several specific types of heparin oligosaccharides with relatively high affinities [21]. Whether these interactions is required for the insect cell recognition of TcdA1W14 needs to be investigated.
Intriguingly, a study by Tao et al., that also used a CRISPR known-down screening strategy, recently reported that sulfated glycosaminoglycans act as host cell receptors of the Clostridium difficile toxin A [28]. The top-ranked genes shown to contribute to the action of C. difficile toxin A are almost identical to those we enriched for in our TcdA2TT01 CRISPR screen. This indicates that the C. difficile toxin A and TcdA2TT01-related Tc homologues share a similar cellular recognition mechanism. Moreover, this study showed that blocking sGAG binding by GM-1111, a sulfated heparan sulfate analogue, can also reduce the toxicity of C. difficile toxin A in the colon [28]. Thus, blockade of GAG-binding by engineered GAG mimetics, sulfated compounds, as well as GAG-digesting enzymes, could represent promising antimicrobial strategies for the development of therapeutics for infectious diseases facilitated by toxins such as Tc toxin.
In addition, two golgi-linked genes TM9SF2 and TMEM165 were identified in both C. difficile toxin A and the TcdA2TT01 Tc toxin screening (Fig 4E), suggesting these two genes may be involved in the synthesis of sulfated glycosaminoglycans. TM9SF2 was previously found to regulate the localization of NDST1, a Golgi complex-localized enzyme that catalyzes N-sulfation of heparan sulfate, and is therefore critical for chikungunya virus infection [39]. Besides these screening experiments, three independent groups have also suggested that TM9SF2 plays a conserved role in Shiga toxin-induced cell death [40–42]. Tian et al. demonstrated that TM9SF2 not only regulates glycosylation in Golgi, but also modulates endosomal trafficking in general [42]. These observations suggest that TM9SF2 might be a potential target for treating various infections. Besides TM9SF2, TMEM165, a Mn2+ transporter, has also been identified as one critical host factor that mediates the intoxication of Shiga toxin and ricin [41,42]. Intriguingly, knockdown of TMEM165 in zebrafish showed a defect in chondroitin sulfate proteoglycan expression by alterations in either core protein expression or glycosaminoglycan chain composition [43]. Thus, TMEM165 inhibition may effect discrete aspects of GAG synthesis, although further analysis will be required to demonstrate the precise mechanism.
Although the Tc toxins were first identified almost two decades ago, current knowledge regarding Tc toxin biology has mainly come from structural investigations that focus on the complex assembly and mechanisms of delivery such as the TcC autocleavage and injection [9,10,13,14,16], while less attention has been paid to the host cell recognition by Tc toxins. Recently, two independent studies utilized comprehensive glycan microarray screens to identify potential receptor ligands for Tc toxins, and found that N-glycans and heparan sulfates can bind with several TcAs [20,21]. Genome-wide CRISPR/Cas9 screening has emerged as a powerful genome-editing technology for studying the intoxication of mammalian cells by toxins. Therefore, we decided to utilize this strategy to explore the host factors that are required for the actions of Tc toxin homologues. Of note, both our results and those of Roderer et al. demonstrated the interaction of TcdA1W14 with N-glycans. Since we employed a different, and unbiased, strategy to that study (CRISPR knock-down as opposed to glycan array screens), it is reassuring to see the two studies giving consistent findings, indicating that the genome-wide CRISPR/Cas9 screening would be a promising strategy to identify host factors for other Tc toxins derived from distinct subfamilies indicated in Figs 6 and S7.
It should be noted that certain other bacterial toxin host recognition systems have been shown to use binding to multiple receptors to facilitate efficient the cell entry and internalization. Other than glycans and glycosaminoglycans, several cell surface proteins such as CD31 and integrins have also been shown to contribute to the target specificity of distinct toxins [44,45]. Thus, it remains an open question as to whether or not Tc toxin binding also involves receptors other than glycans and glycosaminoglycans. Interestingly, our current study did not identify enrichments for any specific cell surface proteins. Nevertheless, while it is beyond the scope of the current study, we cannot rule out that further “deeper” screens might yet provide indications of additional receptors for Tc toxins.
Since the intoxication has been observed for several mammalian cell types, the Tc toxins have been proposed as a potential virulence factors that may contribute to diverse human infectious diseases. However, the exact roles of Tc toxin homologues in the majority of bacterial infections remains unclear. This is in part a consequence of a lack of systematic study of the distribution of Tc toxins in the different pathogenic bacteria. Here we have addressed this issue by performing a comprehensive analysis of TcA homologues in bacterial genomes available in public databases, and the construction and provision of an open access database. In total, a set of 1,189 TcA protein homologues were identified from 1,039 bacterial genomes which we classified into 5 distinct subfamilies. Notably, several TcAs derived from the human pathogenic bacteria Yersinia and Salmonella fell into two different subfamilies, suggesting these bacteria may utilize Tc toxins to facilitate their invasion and survival in host cells. Further detailed studies regarding these human pathogenic bacteria-derived Tc toxins will no doubt contribute to our understanding about this polymorphic toxin system.
Appropriate host cell recognition by toxins is often critical for successful infection strategies used by pathogenic bacteria. Using a CRISPR-Cas9-based sgRNA knock-down library, we have performed a genome-wide loss-of-function screening to identify the critical host cell enzymic processes required for the recognition and action of several Tc toxins. We have demonstrated that not only N-glycans but also glycosaminoglycans can serve as receptors for different TcA pentamers. Taken together with our bioinformatic study characterizing the diversity of Tc homologues among different bacterial species, we provide valuable information for elucidation of the pathogenic roles of Tc toxins. Future research focusing on the structural basis of the toxin-receptor interaction and the design of protective compounds could not only create effective therapies for patients with Tc toxin-facilitated diseases, but also expand our understanding of this versatile bacterial weapon. Finally, these findings will inform on selection strategies for potential biotechnology tools for new crop-protection agents and biomedical applications.
Materials and methods
Cell culture
HEK293T cell (CRL-11268, RRID:CVCL_1926), MEF cell (CRL-2991, RRID:CVCL_L690), Vero Cell (CCL-81, RRID:CVCL_0059), L929 cell (CRL-6364, RRID:CVCL_0462) were obtained from ATCC and cultured according to the manufacturer’s protocol. HeLa cell (CCL-2, RRID:CVCL_0030) was obtained from ATCC and cultured in Dulbecco’s modified Eagle’s medium (DMEM, Cat# 11965092, Thermo Fisher Scientific) supplemented with 10% FBS (Cat# FS101-02, TransGen Biotech), 100 U/mL Penicillin and 100 μg/ml Streptomycin (Cat# 15140148, Thermo Fisher Scientific) at 37°C under 5% CO2. HeLa cells stably expressing Cas9 (HeLa-Cas9) were generated by transducting Lenti-Cas9-Blast virus (#52962, Addgene, RRID:Addgene_52962), and selecting using 10 μg/mL Blasticidin S (Cat# A1113903, Thermo Fisher Scientific).
Plasmids
The coding sequences of TcdA1W14, TcdA2W14, TcdA4W14, TcdB2, TccC3 were cloned from the genome of Photorhabdus luminescens W14 (lab stock). TcdA1TT01, TcdA2TT01, and TcdA4TT01 were cloned from the genome of P. luminescens TT01 (lab stock). All TcAs was cloned into RSFDuet-1 (Novagen) with N-terminal 6×His tag. TcdB2 and TccC3 fusion protein was constructed incorporating a 5-aa linker region encoding Arg-Gly-Ser-Arg-Pro, and cloned into pETDeut-1 (Novagen) with N-terminal 6×His tag and C-terminal Flag tag. To engineer the mutants of indicated TcAs, site-directed mutagenesis or seamless cloning were performed according to the manufacturer’s protocol of pEASY-uni seamless cloning and assembly Kit (Cat#CU101-01, TransGen Biotech). All constructs were confirmed by DNA sequencing (Sangon Biotech, Shanghai).
Tc toxin purification
All Tc toxin components were prepared according to the previous protocol with some modifications [15]. His-tagged TcA proteins were expressed in E. coli BL21 (DE3) cells (Cat# CD601-02, TransGen Biotech). Expression was performed in 1L LB medium with 50 mM IPTG induction for 24 h at 20°C, with aeration by 250 rpm shaking. The cells were then collected and sonicated. Cell lysates were centrifugated and then applied to the Ni-NTA affinity column (Cat# 30250, Qiagen). The elution fraction was further purified using gel filtration (Superose 6 Increase 10/300 GL, GE Healthcare) using an AKTA avant25. The TcB-TcC expressing plasmids were transduced into BL21-CodonPlus (DE3)-RIPL cells (Cat# 230250, Agilent Technologies). A single clone was inoculated into 1L LB medium containing 25 μM IPTG, with aeration by 250 rpm shaking for 4 h at 28°C, followed by 48 h shanking at 20°C. The cells were harvested by centrifugation and lysed by ultrasonication. His-tagged TcB-TcC fusion protein was purified by anion exchange chromatography, followed by a Ni-NTA affinity column and final gel filtration as described above. All TcAs and TcB-TcC fusion proteins were concentrated to about 1 mg/ml, using Amicon filter devices (Millipore).
For the purification of the holotoxin complex formed by indicated TcA and TcdB3-TccC3 fusion proteins, TcA and TcdB2-TccC3 were mixed in a 2:1 mole ratio with a buffer containing 50 mM Tris pH 8.0, 100 mM NaCl, and 5% glycerol. The mixture was then applied to gel filtration using a Superose-6 10/300 GL column (Cat# 17-5172-01, GE Healthcare). The elution fractions between 13–14 ml were concentrated and finally used for cell intoxication assay.
Western blot
In brief, samples were subjected to SDS-PAGE and then transferred onto PVDF membranes. Membranes was incubated with indicated primary antibodies and corresponding secondary HRP antibodies, and examined by using enhanced chemoluminescent detection reagent (Thermo Fisher Scientific). Mouse molyclonal anti-FLAG M2 antibody (Cat# F3165, RRID: AB_259529) was purchased from Sigma. Rabbit polyclonal anti-MAN1A2 antibody (PA5-56974, RRID: AB_2643718) was purchased from Thermo Fisher Scientific. Rabbit polyclonal anti-NDST1 antibody (26203-1-AP) was purchased from Proteintech. Mouse molyclonal anti-EXTL3 antibody (sc-271986, RRID:AB_10709180) and anti-B3GAT3 antibody (sc-390475) were purchased from Santa cruz. Rabbit monoclonal anti-MGAT1 antibody (ab180578, RRID:AB_2800510) and anti-MGAT2 antibody (ab184965) were purchased from Abcam.
Negative stain electron microscopy
After gel filtration, 4 μl sample droplets were applied on freshly glow-discharged copper grids (G2400C, Agar scientific) covered by a thin, continuous carbon film. The samples were left for 20 s on the grid before blotting with filter paper, and stained with 0.04% uranyl acetate twice, air-dried for 20 s. All images were taken with a JEOL JEM-1400 electron microscope equipped with a LaB6 cathode operated at 120 kV. Digital electron micrographs were recorded with a 4k × 4k CMOS camera F416 (TVIPS) using minimal dose conditions.
Cell intoxication
HeLa cells were seeded into 24-well plate (1 × 105 per well), and grown adherently on sterile coverslips overnight in 500 μL culture medium. The indicated holotoxin was subsequently added. For blockade of the biosynthesis of N-linked glycans, HeLa cells were pretreated with 20 μM kifunensine for 12 h before Tc toxin stimulation. For F-actin staining, the cells were fixed with 4% paraformaldehyde and then permeabilized with PBS containing 0.1% Triton X-100 for 15 min. Coverslips were then incubated with Alexa Fluor 568 Phalloidin (Cat# A12380, Thermo Fisher Scientific) for 30 min and washed for 3 times with PBS. Coverslips were mounted with ProLong Gold Antifade Mountant with DAPI (Cat# P36941, Thermo Fisher Scientific). All the images were captured by EVOS FL Auto 2 Imaging System (Thermo Fisher Scientific).
Cell viability assay
HeLa cells were seeded in 96-well microplates and incubated until 60% confluence, then exposed to the indicated doses of Tc toxins. The cells were further incubated for 24 h in toxin-containing medium and then 10 μL CCK-8 solution and incubated for 2 h. The cell viability was measured by Varioskan flash spectral scanning multimode reader (Thermo Scientific) at wavelength of 450 nm. Eight hour post stimulation, bright field microscopy was performed with Leica DMi1 Microsystems equipped with an ICC50 W/E digital camera, and captured by using the Leica LAS EZ software (Leica).
CRISPR-Cas9 screen
Lentiviral sgRNA plasmid libraries were generated using the human GeCKO-V2 sgRNA library (Addgene, #1000000049) consists of 123,411 sgRNAs targeting 19,050 protein-coding genes (six sgRNAs per gene) and 1,864 microRNAs (four sgRNAs per microRNA). The titer of lentiviral sgRNA libraries was measured as colony-forming units (CFU) per mL. HeLa-Cas9 cells (2 × 107) were transduced with a multiplicity of infection of 0.2 to provide 100 × coverage of each gudie RNA. Two days post infection, cells infected by lentivirus were further selected by puromycin. For the first round of screening, the cell library was exposed to 5 nM PTC3 for 24 h. The survival cells were seeded and cultured without PTC3 until about 80% confluence. Cells were then subjected to the next two round of selection (10 and 20 nM PTC3). Genomic DNA from 2 × 107 of the HeLa cells (unselected control and selected with PTC3) was isolated (Blood & Cell Culture DNA Mini Kit, Cat#13323, Qiagen). Three biological replicates were performed for each selection. The genomic DNAs of three replicates were pooled together, and then amplified by PCR with primers lentiGP-1_F (AATGGAC TATCATATGCTTACCGTAACTTGAAAGTATTTCG) and lentiGP-3_R (ATGAATACTG CCATTTGTCTCAAGATCTAGTTACGC). Next-generation sequencing was performed by a commercial vendor (OBiO technology). Analysis of enriched guides was performed using MAGeCK analysis. The sgRNAs are ranked based on P-values calculated from the NB model. A modified robust ranking aggregation (RRA) algorithm named α-RRA is utilized to identify positively or negatively selected genes. If one gene has no effect on selection, the sgRNAs targeting this gene should be uniformly distributed across the ranked list of all the sgRNAs [46].
Model building
Structural modeling was performed using SwissModel. For both TcdA1 and TcdA2 of P. luminescens TT01, the structure of TcdA1 of P. luminescens W14 (PDB: 4O9Y) was used as a template. Figures were prepared using PyMOL.
Biolayer interferometry
Biolayer interferometry (BLI) assays were performed on the Octet Red Instrument (fortéBIO). Biotin conjugated heparin from porcine intestinal mucosa (5 μg/ml, B9806, Sigma) was immobilized onto streptavidin biosensors (fortéBIO) and balanced with HBS-EP buffer (10 mM HEPES, pH 7.4, 150 mM NaCl, 3.0 mM EDTA, and 0.005% Tween-20). The biosensors were then exposed to indicated doses of TcdA2TT01, or 50 nM of the indicated TcAs, followed by washing in HBS-EP buffer. BLI sensorgrams were measured with 300 s association and 600 s dissociation time, and corrected for background association on mock biosensors. The binding affinities between TcAs and heparin were monitored using the Blitz system (ForteBio). For calculating the dissociation constant (KD) of the TcdA2TT01-heparin binding, a global fit according to a 1:1 binding model was utilized.
Phylogenetic analysis of TcA proteins
A total of 2,315 TcA related genes available from the dbTC database (http://www.mgc.ac.cn/dbTC/) were used to identify known protein domains by batch CD-Search of the conserved domain database with default parameters [47]. Though TcA in P. luminescens W14 is a single protein that encodes at least three conserved domains, including the N-terminal VRP1 (PF03538), the middle neuraminidase (PF18413) and the C-terminal TcA_TcB_BD (PF18276), the split homologues in Yersinia were confirmed to be functional previously [17,20]. Therefore, extensive manual curation was conducted based on the criterion that a valid set of tcA gene(s) may encode a single protein or 2–3 consecutive proteins, but all of the three aforementioned domains should present in a logical order. The protein sequences of VRP1, Neuraminidase and TcA_TcB_BD domains of each valid TcA set were extracted by custom Perl script based on the CD-Search results, and then fed to the Clustal-Omega program to generate multiple sequence alignment (MSA) of each domain individually. The three MSA files were concatenated accordingly to form a single MSA before the FastTree program was employed to construct the maximum-likelihood phylogenetic tree under Whelan Goldman (WAG) models with gamma optimization [48]. The iTOL online server was used to annotate and visualize the phylogeny [49].
Statistical analysis
Unpaired Student’s t tests were performed using Excel software (Microsoft) or GraphPad Prism 6 (http://www.graphpad.com).
Supporting information
Acknowledgments
We thank Tsinghua University Branch of the China National Center for Protein Sciences Beijing for providing facility support in using negative stain electron microscopy.
Data Availability
All relevant data are within the manuscript and its Supporting Information files.
Funding Statement
This work was supported by the National Natural Science Foundation of China (81801980 to N.S., 31970635 to J.Y., 82072234 to G.Y.); Beijing Hospitals Authority Youth Programme (QML20190107 to N.S.); the CAMS Innovation Fund for Medical Sciences (2017-I2M-3-017 to J.Y.). N.R.W. was supported by BBSRC grant BB/T004150/1 and Warwick University Medical School. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Galan JE, Waksman G. Protein-Injection Machines in Bacteria. Cell. 2018;172(6):1306–18. Epub 2018/03/10. 10.1016/j.cell.2018.01.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Aktories K. Bacterial protein toxins that modify host regulatory GTPases. Nat Rev Microbiol. 2011;9(7):487–98. Epub 2011/06/17. 10.1038/nrmicro2592 . [DOI] [PubMed] [Google Scholar]
- 3.Zhang D, de Souza RF, Anantharaman V, Iyer LM, Aravind L. Polymorphic toxin systems: Comprehensive characterization of trafficking modes, processing, mechanisms of action, immunity and ecology using comparative genomics. Biol Direct. 2012;7:18 Epub 2012/06/27. 10.1186/1745-6150-7-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Jamet A, Nassif X. New Players in the Toxin Field: Polymorphic Toxin Systems in Bacteria. Mbio. 2015;6(3). WOS:000357867400015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Young JAT, Collier RJ. Anthrax toxin: Receptor binding, internalization, pore formation, and translocation. Annu Rev Biochem. 2007;76:243–65. 10.1146/annurev.biochem.75.103004.142728 WOS:000249336800012. [DOI] [PubMed] [Google Scholar]
- 6.Jank T, Giesemann T, Aktories K. Rho-glucosylating Clostridium difficile toxins A and B: new insights into structure and function. Glycobiology. 2007;17(4):15r–22r. 10.1093/glycob/cwm004 WOS:000244961800001. [DOI] [PubMed] [Google Scholar]
- 7.Pinchuk IV, Beswick EJ, Reyes VE. Staphylococcal Enterotoxins. Toxins. 2010;2(8):2177–97. 10.3390/toxins2082177 WOS:000208435400015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bowen D, Rocheleau TA, Blackburn M, Andreev O, Golubeva E, Bhartia R, et al. Insecticidal toxins from the bacterium Photorhabdus luminescens. Science. 1998;280(5372):2129–32. Epub 1998/06/26. 10.1126/science.280.5372.2129 . [DOI] [PubMed] [Google Scholar]
- 9.Gatsogiannis C, Merino F, Roderer D, Balchin D, Schubert E, Kuhlee A, et al. Tc toxin activation requires unfolding and refolding of a β-propeller. Nature. 2018;563(7730):209–13. 10.1038/s41586-018-0556-6 [DOI] [PubMed] [Google Scholar]
- 10.Roderer D, Hofnagel O, Benz R, Raunser S. Structure of a Tc holotoxin pore provides insights into the translocation mechanism. Proc Natl Acad Sci U S A. 2019;116(46):23083–90. Epub 2019/11/02. 10.1073/pnas.1909821116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Roderer D, Raunser S. Tc Toxin Complexes: Assembly, Membrane Permeation, and Protein Translocation. Annu Rev Microbiol. 2019;73:247–65. Epub 2019/05/30. 10.1146/annurev-micro-102215-095531 . [DOI] [PubMed] [Google Scholar]
- 12.Busby JN, Panjikar S, Landsberg MJ, Hurst MRH, Lott JS. The BC component of ABC toxins is an RHS-repeat-containing protein encapsulation device. Nature. 2013;501(7468):547–50. 10.1038/nature12465 [DOI] [PubMed] [Google Scholar]
- 13.Gatsogiannis C, Lang AE, Meusch D, Pfaumann V, Hofnagel O, Benz R, et al. A syringe-like injection mechanism in Photorhabdus luminescens toxins. Nature. 2013;495(7442):520–3. Epub 2013/03/22. 10.1038/nature11987 . [DOI] [PubMed] [Google Scholar]
- 14.Meusch D, Gatsogiannis C, Efremov RG, Lang AE, Hofnagel O, Vetter IR, et al. Mechanism of Tc toxin action revealed in molecular detail. Nature. 2014;508(7494):61–5. Epub 2014/02/28. 10.1038/nature13015 . [DOI] [PubMed] [Google Scholar]
- 15.Lang AE, Schmidt G, Schlosser A, Hey TD, Larrinua IM, Sheets JJ, et al. Photorhabdus luminescens toxins ADP-ribosylate actin and RhoA to force actin clustering. Science. 2010;327(5969):1139–42. Epub 2010/02/27. 10.1126/science.1184557 . [DOI] [PubMed] [Google Scholar]
- 16.Leidreiter F, Roderer D, Meusch D, Gatsogiannis C, Benz R, Raunser S. Common architecture of Tc toxins from human and insect pathogenic bacteria. Sci Adv. 2019;5(10):eaax6497 Epub 2019/10/31. 10.1126/sciadv.aax6497 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Landsberg MJ, Jones SA, Rothnagel R, Busby JN, Marshall SDG, Simpson RM, et al. 3D structure of the Yersinia entomophaga toxin complex and implications for insecticidal activity. Proceedings of the National Academy of Sciences. 2011;108(51):20544–9. 10.1073/pnas.1111155108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hares MC, Hinchliffe SJ, Strong PCR, Eleftherianos I, Dowling AJ, Ffrench-Constant RH, et al. The Yersinia pseudotuberculosis and Yersinia pestis toxin complex is active against cultured mammalian cells. Microbiology. 2008;154(Pt 11):3503–17. Epub 2008/10/30. 10.1099/mic.0.2008/018440-0 . [DOI] [PubMed] [Google Scholar]
- 19.Waterfield NR, Bowen DJ, Fetherston JD, Perry RD, ffrench-Constant RH. The tc genes of Photorhabdus: a growing family. Trends Microbiol. 2001;9(4):185–91. Epub 2001/04/05. 10.1016/s0966-842x(01)01978-3 . [DOI] [PubMed] [Google Scholar]
- 20.Piper SJ, Brillault L, Rothnagel R, Croll TI, Box JK, Chassagnon I, et al. Cryo-EM structures of the pore-forming A subunit from the Yersinia entomophaga ABC toxin. Nature Communications. 2019;10 ARTN 1952 10.1038/s41467-019-09890-8 WOS:000465838600004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Roderer D, Brocker F, Sitsel O, Kaplonek P, Leidreiter F, Seeberger PH, et al. Glycan-dependent cell adhesion mechanism of Tc toxins. Nature Communications. 2020;11(1). ARTN 2694 10.1038/s41467-020-16536-7 WOS:000542982400006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Sanjana NE, Shalem O, Zhang F. Improved vectors and genome-wide libraries for CRISPR screening. Nat Methods. 2014;11(8):783–4. Epub 2014/07/31. 10.1038/nmeth.3047 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Aebi M, Bernasconi R, Clerc S, Molinari M. N-glycan structures: recognition and processing in the ER. Trends Biochem Sci. 2010;35(2):74–82. Epub 2009/10/27. 10.1016/j.tibs.2009.10.001 . [DOI] [PubMed] [Google Scholar]
- 24.Vallee F, Karaveg K, Herscovics A, Moremen KW, Howell PL. Structural basis for catalysis and inhibition of N-glycan processing class I alpha 1,2-mannosidases. J Biol Chem. 2000;275(52):41287–98. Epub 2000/09/21. 10.1074/jbc.M006927200 . [DOI] [PubMed] [Google Scholar]
- 25.Waheed AA, Gitzen A, Swiderski M, Freed EO. High-Mannose But Not Complex-Type Glycosylation of Tetherin Is Required for Restriction of HIV-1 Release. Viruses. 2018;10(1). Epub 2018/01/06. 10.3390/v10010026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Taylor ME, Drickamer K. Structural insights into what glycan arrays tell us about how glycan-binding proteins interact with their ligands. Glycobiology. 2009;19(11):1155–62. 10.1093/glycob/cwp076 WOS:000270707700001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Jinno A, Park PW. Role of glycosaminoglycans in infectious disease. Methods Mol Biol. 2015;1229:567–85. Epub 2014/10/19. 10.1007/978-1-4939-1714-3_45 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Tao L, Tian SH, Zhang J, Liu ZM, Robinson-McCarthy L, Miyashita SI, et al. Sulfated glycosaminoglycans and low-density lipoprotein receptor contribute to Clostridium difficile toxin A entry into cells. Nat Microbiol. 2019;4(10):1760–9. 10.1038/s41564-019-0464-z WOS:000487286800021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Utt M, Danielsson B, Wadstrom T. Helicobacter pylori vacuolating cytotoxin binding to a putative cell surface receptor, heparan sulfate, studied by surface plasmon resonance. Fems Immunol Med Mic. 2001;30(2):109–13. 10.1111/j.1574-695X.2001.tb01557.x WOS:000167748900005. [DOI] [PubMed] [Google Scholar]
- 30.Bourdoulous S, Lemichez E. BACTERIAL PATHOGENESIS Decoding glycan recognition by bacterial toxins. Nat Microbiol. 2018;3(2):124–6. 10.1038/s41564-018-0107-9 WOS:000422987500003. [DOI] [PubMed] [Google Scholar]
- 31.Beddoe T, Paton AW, Le Nours J, Rossjohn J, Paton JC. Structure, biological functions and applications of the AB(5) toxins. Trends Biochem Sci. 2010;35(7):411–8. 10.1016/j.tibs.2010.02.003 WOS:000280051600007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Patry RT, Stahl M, Perez-Munoz ME, Nothaft H, Wenzel CQ, Sacher JC, et al. Bacterial AB(5) toxins inhibit the growth of gut bacteria by targeting ganglioside-like glycoconjugates. Nature Communications. 2019;10 ARTN 1390 10.1038/s41467-019-09362-z WOS:000462458900005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Yamaji T, Hanamatsu H, Sekizuka T, Kuroda M, Iwasaki N, Ohnishi M, et al. A CRISPR Screen Using Subtilase Cytotoxin Identifies SLC39A9 as a Glycan-Regulating Factor. Iscience. 2019;15:407–+. 10.1016/j.isci.2019.05.005 WOS:000470104600033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kawai T, Ota N, Imasato A, Shirasaki Y, Otsuka K, Tanaka Y. Profiling of N-linked glycans from 100 cells by capillary electrophoresis with large-volume dual preconcentration by isotachophoresis and stacking. J Chromatogr A. 2018;1565:138–44. Epub 2018/06/28. 10.1016/j.chroma.2018.06.034 . [DOI] [PubMed] [Google Scholar]
- 35.Bartlett AH, Park PW. Proteoglycans in host-pathogen interactions: molecular mechanisms and therapeutic implications. Expert Rev Mol Med. 2010;12 ARTN e5 10.1017/S1462399409001367 WOS:000278221600001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Aquino RS, Park PW. Glycosaminoglycans and infection. Front Biosci-Landmrk. 2016;21:1260–77. 10.2741/4455 WOS:000374585000012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Boyd AP, Sory MP, Iriarte M, Cornelis GR. Heparin interferes with translocation of Yop proteins into HeLa cells and binds to LcrG, a regulatory component of the Yersinia Yop apparatus. Mol Microbiol. 1998;27(2):425–36. 10.1046/j.1365-2958.1998.00691.x WOS:000071623800018. [DOI] [PubMed] [Google Scholar]
- 38.Ahn HC, Kim NY, Hur GH, Yang JM, Shin S. Role of chondroitin sulfate C in the action of anthrax toxin. Toxicology. 2012;297(1–3):10–6. 10.1016/j.tox.2012.03.008 WOS:000304744300002. [DOI] [PubMed] [Google Scholar]
- 39.Tanaka A, Tumkosit U, Nakamura S, Motooka D, Kishishita N, Priengprom T, et al. Genome-Wide Screening Uncovers the Significance of N-Sulfation of Heparan Sulfate as a Host Cell Factor for Chikungunya Virus Infection. J Virol. 2017;91(13). Epub 2017/04/14. 10.1128/JVI.00432-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Pacheco AR, Lazarus JE, Sit B, Schmieder S, Lencer WI, Blondel CJ, et al. CRISPR Screen Reveals that EHEC's T3SS and Shiga Toxin Rely on Shared Host Factors for Infection. Mbio. 2018;9(3). Epub 2018/06/21. 10.1128/mBio.01003-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Yamaji T, Sekizuka T, Tachida Y, Sakuma C, Morimoto K, Kuroda M, et al. A CRISPR Screen Identifies LAPTM4A and TM9SF Proteins as Glycolipid-Regulating Factors. Iscience. 2019;11:409–24. Epub 2019/01/21. 10.1016/j.isci.2018.12.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Tian S, Muneeruddin K, Choi MY, Tao L, Bhuiyan RH, Ohmi Y, et al. Genome-wide CRISPR screens for Shiga toxins and ricin reveal Golgi proteins critical for glycosylation. PLoS Biol. 2018;16(11):e2006951 Epub 2018/11/28. 10.1371/journal.pbio.2006951 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Bammens R, Mehta N, Race V, Foulquier F, Jaeken J, Tiemeyer M, et al. Abnormal cartilage development and altered N-glycosylation in Tmem165-deficient zebrafish mirrors the phenotypes associated with TMEM165-CDG. Glycobiology. 2015;25(6):669–82. Epub 2015/01/23. 10.1093/glycob/cwv009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Bruggisser J, Tarek B, Wyder M, Muller P, von Ballmoos C, Witz G, et al. CD31 (PECAM-1) Serves as the Endothelial Cell-Specific Receptor of Clostridium perfringens beta-Toxin. Cell Host & Microbe. 2020;28(1):69–+. 10.1016/j.chom.2020.05.003 WOS:000548935100011. [DOI] [PubMed] [Google Scholar]
- 45.Yahiro K, Morinaga N, Satoh M, Matsuura G, Tomonaga T, Nomura F, et al. Identification and characterization of receptors for vacuolating activity of subtilase cytotoxin. Mol Microbiol. 2006;62(2):480–90. 10.1111/j.1365-2958.2006.05379.x WOS:000240834000015. [DOI] [PubMed] [Google Scholar]
- 46.Li W, Xu H, Xiao TF, Cong L, Love MI, Zhang F, et al. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biol. 2014;15(12). ARTN 554 10.1186/s13059-014-0554-4 WOS:000346609500026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Lu S, Wang J, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, et al. CDD/SPARCLE: the conserved domain database in 2020. Nucleic Acids Res. 2020;48(D1):D265–D8. Epub 2019/11/30. 10.1093/nar/gkz991 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Price MN, Dehal PS, Arkin AP. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5(3):e9490 Epub 2010/03/13. 10.1371/journal.pone.0009490 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47(W1):W256–W9. Epub 2019/04/02. 10.1093/nar/gkz239 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the manuscript and its Supporting Information files.