Abstract
Type IV pili (Tfp) are functionally versatile filaments, widespread in prokaryotes, that belong to a large class of filamentous nanomachines known as type IV filaments (Tff). Although Tfp have been extensively studied in several Gram-negative pathogens where they function as key virulence factors, many aspects of their biology remain poorly understood. Here, we performed a global biochemical and structural analysis of Tfp in a recently emerged Gram-positive model, Streptococcus sanguinis. In particular, we focused on the five pilins and pilin-like proteins involved in Tfp biology in S. sanguinis. We found that the two major pilins, PilE1 and PilE2, (i) follow widely conserved principles for processing by the prepilin peptidase PilD and for assembly into filaments; (ii) display only one of the post-translational modifications frequently found in pilins, i.e. a methylated N terminus; (iii) are found in the same heteropolymeric filaments; and (iv) are not functionally equivalent. The 3D structure of PilE1, solved by NMR, revealed a classical pilin-fold with a highly unusual flexible C terminus. Intriguingly, PilE1 more closely resembles pseudopilins forming shorter Tff than bona fide Tfp-forming major pilins, underlining the evolutionary relatedness among different Tff. Finally, we show that S. sanguinis Tfp contain a low abundance of three additional proteins processed by PilD, the minor pilins PilA, PilB, and PilC. These findings provide the first global biochemical and structural picture of a Gram-positive Tfp and have fundamental implications for our understanding of a widespread class of filamentous nanomachines.
Keywords: Gram-positive bacteria, protein structure, type IV pili, virulence factor, molecular motor, pilin, Streptococcus sanguinis, twitching motility, type IV filaments
Introduction
Type IV pili (Tfp)4 are thin, long and flexible surface-exposed filaments, widespread in Bacteria and Archaea, which mediate a wide array of functions (1, 2). Tfp are polymers of primarily one major protein subunit, a type IV pilin with distinctive N-terminal sequence motif (class III signal peptide) and 3D structure (3), assembled by a conserved set of dedicated proteins (4). These defining features are shared by a large class of filamentous nanomachines called type IV filaments (Tff) (1). Tff are ubiquitous in prokaryotes because genes encoding type IV pilins and filament assembly proteins are found in virtually every bacterial and archaeal genome (1).
Much of our current understanding of Tff biology comes from studies of bacterial Tfp in a few Gram-negative human pathogens in which they function as key virulence factors (4). The following general picture has emerged. Prepilins are translocated by the general secretory pathway across the cytoplasmic membrane (5, 6), where they remain embedded via a universally conserved structural feature, i.e. a protruding hydrophobic N-terminal α-helix (α1N) (3). This leaves the hydrophylic leader peptide of the class III signal peptide in the cytoplasm, which is then cleaved by a membrane-bound aspartyl protease (7), the prepilin peptidase, generating a pool of mature pilins in the membrane ready for polymerization into filaments. Efficient prepilin processing by the prepilin peptidase, which does not require any other protein (8), depends on the last residue of the prepilin leader peptide (a conserved Gly) (9) and two conserved catalytic Asp residues in the prepilin peptidase (7, 10). How filaments are polymerized remains poorly understood, but it is clear that this process is mediated by a multiprotein machinery in the cytoplasmic membrane (11, 12), which transmits energy generated by a cytoplasmic hexameric assembly ATPase (13) to membrane-localized pilins. As a result, pilins are extruded from the membrane and polymerized into helical filaments via hydrophobic packing of their α1N helix within the filament core (14, 15). Finally, once Tfp reach the outer membrane, they are extruded onto the surface through a multimeric pore, the secretin (12, 16, 17). The above picture, although complex, is oversimplified because there are additional proteins that play key roles in Tfp biology, including several proteins with class III signal peptides, named minor pilins or pilin-like proteins, whose localization and exact role are often unclear. Moreover, Tfp are highly dynamic filaments, constantly extending and retracting. Retraction has been best characterized in a subclass known as Tfpa, where it results from filament depolymerization powered by the cytoplasmic hexameric ATPase PilT (18), which generates massive tensile forces (19, 20).
Until recently, Tfp have not been extensively studied in Gram-positive species although it has been recognized that this represents a promising new research avenue as these bacteria possess a simpler surface architecture (21). Although Gram-positive Tfp were first described in Clostridiales (22, 23), Streptococcus sanguinis has emerged as a model because it is genetically tractable (24, 25). A comprehensive genetic analysis of S. sanguinis Tfpa (24) has revealed that they: (i) are assembled by a similar machinery as in Gram-negative species but with fewer components, (ii) are retracted by a PilT-dependent mechanism, generating tensile forces very similar to those measured in Gram-negative species, and (iii) power intense twitching motility. The main peculiarity of S. sanguinis filaments is that they contain two major pilins, rather than one as normally seen (24). In the present study, we have focused on the pilins and pilin-like proteins involved in Tfp biology in S. sanguinis and have performed a global biochemical and structural analysis of its filaments.
Results
The pil locus in S. sanguinis 2908 encodes five pilin/pilin-like proteins
All known integral components of Tfp and/or Tff share an N-terminal sequence motif named class III signal peptide (1, 3). It consists of a leader peptide, composed predominantly of hydrophilic amino acids (aa) ending with a conserved Gly−1, followed by a stretch of 21 predominantly hydrophobic aa (except for a negatively charged Glu5) forming the protruding α1N helix that is the main assembly interface for subunits within filaments (1, 3). Processing by the prepilin peptidase PilD occurs after Gly−1. In most Tfp and/or Tff, there are multiple pilin and/or pilin-like proteins (1, 3). Bioinformatic analysis of the proteins encoded by the pil locus in S. sanguinis, which contains all the genes involved in Tfp biology (24), predicts five pilins and/or pilin-like proteins (PilA, PilB, PilC, PilE1, and PilE2) (Fig. S1A). Four proteins (PilB, PilC, PilE1, and PilE2) display a canonical N-terminal IPR012902 motif (26), which is part of the class III signal peptide (Fig. 1A). Visual inspection of the sequences of the remaining proteins suggests that PilA has a degenerate class III signal peptide (Fig. 1B), which is not identified by the bioinformatic tools available, including PilFind, which is dedicated to the identification of type IV pilins (27).
PilA, PilB, and PilC, all of which are essential for piliation (24), exhibit sequence features distinct from PilE1 and PilE2, the major pilin subunits of S. sanguinis Tfp (24). All three proteins have much shorter leader peptides than PilE1/PilE2 (Fig. 1B). In addition, although mature PilA has a size (17 kDa) similar to previously studied pilins and pilin-like proteins (3), its putative class III signal peptide is unique for several reasons. The first residue of the predicted mature protein is Ser1, there is an unusual Pro4, and the highly conserved Glu5 is missing. On the other hand, whereas PilB and PilC have canonical class III signal peptides (Fig. 1B), both are much larger than classical pilin-like proteins, with mature sizes of 50 and 52.7 kDa, respectively (Fig. 1A). This is explained by the presence of bulky domains at the C terminus of PilB and PilC. PilB contains a von Willebrand factor type A motif (IPR002035) (26), whereas PilC contains a concanavalin A-like lectin/glucanase structural domain (SSF49899) (28) (Fig. 1A).
Taken together, these findings suggest that PilA, PilB, and PilC are pilin-like proteins, which might be cleaved by PilD and polymerized into S. sanguinis Tfp, alongside the major pilins PilE1 and PilE2.
S. sanguinis Tfp are heteropolymers of N-terminally methylated PilE1 and PilE2
As shown previously, purified S. sanguinis Tfp consist predominantly of comparable amounts of two proteins, the major pilins PilE1 and PilE2 (24), which share extensive sequence identity (Fig. S2). Tfp, sheared by vortexing, were purified by removing cells/cellular debris by centrifugation before pelleting filaments by ultracentrifugation (24). As assessed by Coomassie staining after SDS-PAGE (Fig. 2A), these pilus preparations contain contaminants that originate from cells/cellular debris. To determine the precise composition of S. sanguinis Tfp, we improved the purity of our pilus preparations. To remove more debris, we performed one additional centrifugation step and passed the sheared filaments through a 0.22-μm syringe filter before ultracentrifugation. As assessed by SDS-PAGE/Coomassie (Fig. 2A), filaments prepared using this enhanced purification procedure were significantly purer, with no visible contaminants. Intriguingly, the morphology of the purified filaments also changed. Although they were previously overwhelmingly thick (12 nm) and wavy (24), most filaments purified using this enhanced procedure display a classical Tfp morphology (1, 4) as assessed by transmission EM (TEM) (Fig. 2B). Indeed, they are thin (∼6 nm wide), long (several μm), and flexible, but they do not form large bundles.
In other piliated species, major pilins frequently undergo post-translational modifications (PTM) (3). Their N-terminal residue upon processing is usually methylated and other PTM sometimes include the addition of a variety of glycans and phosphoforms. Methylation is catalyzed by PilD, which is a bifunctional enzyme in many species (8, 9). A bioinformatic analysis shows that S. sanguinis PilD contains the IPR010627 motif catalyzing N-methylation (Fig. S1B), suggesting that the N-terminal Phe1 residue in PilE1 and PilE2 is likely to be methylated. Because other PTM cannot be inferred bioinformatically, we used top-down and bottom-up mass spectrometry (MS) to map the PTM of the two major pilins of S. sanguinis 2908. Top-down MS analysis of purified filaments showed the presence of two major proteoforms with a ratio of ∼4:3 (PilE1:PilE2) (Fig. 2C), with deconvoluted singly charged monoisotopic masses at m/z 14,745.63 and 14,087.46 Da. These masses are consistent with the predicted theoretical masses of mature PilE1 (14,731.60 Da [M + H]+) and PilE2 (14,073.40 Da [M + H]+) with the addition of a single N-terminal methyl group (14.01 Da). Bottom-up LC-MS/MS analysis of the two bands excised separately and digested in-gel identified PilE1 and PilE2 with nearly complete sequence coverage (85 and 86%, respectively). The only peptide that was not detected, despite employing several proteolytic enzymes, was the N terminus (1FTLVELIVVIIIIAIIAAVAI21) (Fig. S2). These MS results strongly suggest that S. sanguinis pili consist mainly of a 4:3 ratio of N terminally methylated PilE1 and PilE2 subunits.
Next, we sought to answer the question whether S. sanguinis Tfp are heteropolymers of PilE1 and PilE2 or whether two homopolymers co-exist. Recently, using a markerless gene editing strategy (25), we showed that pilE1 could be engineered in situ to encode a protein with a C-terminal His6 tag, without affecting piliation or Tfp functionality. We therefore used this property to design an affinity co-purification procedure to answer the above question (Fig. 3A). In brief, the intention was to (i) engineer pilE1 and pilE2 mutants encoding C terminally His6-tagged proteins, (ii) shear the filaments, (iii) affinity-purify (pulldown) sheared filaments containing His6-tagged subunits, and (iv) assess whether the untagged pilin co-purifies, suggesting that the filaments are heteropolymers, or does not co-purify, suggesting that two distinct homopolymers co-exist (Fig. 3A). We therefore engineered four different variants by either fusing a His6 tag to the C terminus of full-length PilE1 and PilE2 (PilE16His-long and PilE26His-long) or by replacing the last seven aa in these pilins by the tag (PilE16His-short and PilE26His-short). We first confirmed by SDS-PAGE/Coomassie analysis of purified pilus preparations that the four variants were piliated (Fig. 3B), although at various levels. Purified pili contained both tagged and untagged pilins as assessed by immunoblotting using antibodies specific for PilE1 and PilE2 (24), or an anti-His6 tag commercial antibody (Fig. 3B). When affinity-purified sheared filaments were analyzed by immunoblotting using anti-PilE1, anti-PilE2, or anti-His6 antibodies, we found that in each of the four mutants the untagged pilin co-purifies with the tagged pilin (Fig. 3C). Importantly, wildtype (WT) untagged filaments cannot be affinity-purified using this procedure, indicating that the pulldown is a His6 tag-specific process (Fig. 3C). Considering that S. sanguinis Tfp show little if any bundling, these findings strongly suggest that they are heteropolymers of PilE1 and PilE2. Taken together, these findings show that S. sanguinis Tfp are heteropolymers composed of a 4:3 ratio of PilE1 and PilE2, both harboring a single PTM, i.e. a methylated N terminus.
Major pilin processing and/or assembly into filaments in Gram-positive Tfp follow widely conserved principles
Mutagenesis studies of major pilins in several Tfp and/or Tff have identified residues in class III signal peptides key for processing by PilD and/or assembly into filaments (9, 29–31). A common rule has emerged concerning two highly conserved residues Gly−1 and Glu5 (1). Gly−1 is crucial for processing by PilD, whereas Glu5 is dispensable for processing but critical for filament assembly (14, 15). Because the importance of these residues has not been assessed for Gram-positive Tfp, we tested it in S. sanguinis. Because PilE1 and PilE2 have identical N termini (Fig. 1B and Fig. S2), we focused our efforts on PilE1 and used our gene editing strategy (25) to construct markerless mutant strains expressing PilE1 variants in which the Gly−1 and Glu5 residues would be mutated. Next, we tested whether these mutants proteins were processed by PilD and assembled into filaments. Pilin processing was assessed by immunoblotting in whole-cell protein extracts using the anti-PilE1 antibody (24), i.e. processed PilE1 is 14.7 kDa, whereas the unprocessed protein is 16.9 kDa. Assembly into Tfp was assessed by immunoblotting on purified filaments. Originally, we replaced Gly−1 by an Ala (PilE1G−1A), which had no effect on processing (Fig. 4A) or assembly into filaments (Fig. 4B). A similar finding was reported in P. aeruginosa, and was attributed to the small size of Ala because substitutions with bulkier residues abolished pilin processing (9). Accordingly, when Gly−1 was replaced by a Ser (PilE1G−1S), PilE1 could not be processed (Fig. 4A) or polymerized into filaments, which consisted only of PilE2 (Fig. 4B). The marked decrease in PilE2 in pilus preparation suggests a dominant-negative effect of unprocessed PilE1G−1S on PilE2 filament formation. As for the Glu5 residue, when it was replaced by an Ala (PilE1E5A), PilE1 processing was unaffected (Fig. 4A) but the protein could not be polymerized into filaments, which again consisted only of PilE2 (Fig. 4B). As above, there was a marked decrease in PilE2 in pilus preparation suggesting a dominant-negative effect of PilE1E5A on PilE2 filament formation. These findings confirm that the widely conserved principles defining how major pilins are processed and assembled into filaments (1) apply to Gram-positive Tfp as well. This further highlights the suitability of S. sanguinis as a Gram-positive model to study Tfp biology.
3D structure of PilE1 reveals a pilin-fold with an uncommon highly flexible C terminus
Next, to improve our structural understanding of S. sanguinis Tfp, we solved the 3D structure of its major pilins. To facilitate purification, we expressed in Escherichia coli the soluble portions of PilE1 (112 aa) and PilE2 (105 aa) fused to an N-terminal His6 tag (Fig. S2). This truncated the first 27 residues of the mature proteins, which form the protruding hydrophobic N-terminal α-helix in type IV pilins (1, 3). This procedure allowed us to purify well-folded and soluble proteins using a combination of affinity and gel-filtration chromatography. Because PilE2 shares 78% sequence identity with PilE1 (Fig. S2), we decided to focus on the longest PilE1 and determine its structure by NMR. We isotopically labeled His6-PilE1 with 13C and 15N for backbone and side chain NMR resonance assignment (Table S1) and obtained a high-resolution structure in solution. This structure (Fig. 5A) revealed that PilE1 adopts the classical type IV pilin-fold (1, 3). It exhibits a long N-terminal α-helix packed against a β-meander consisting of three anti-parallel β-strands, flanked by distinctive “edges” (Fig. 5A), which usually differ between pilins (1, 3). Although the loop connecting α1 and β1 is unexceptional except perhaps for its length (50 aa), the C terminus of the protein (after β3) is striking. Unlike in other pilins in which this region is stabilized by being “stapled” to the last β-strand either by a disulfide bond, a network of hydrogen bonds, or a calcium-binding site (1, 3), the 10-aa long C terminus in PilE1 is unstructured and highly flexible. Although the structures within the NMR ensemble superpose well up to the last β strand (β3), their C termini exhibit different conformations and orientations (Fig. 5B). Intriguingly, when compared with structures in the Protein Data Bank, PilE1 was found to be most similar to pseudopilins, which form short Tff, rather than to bona fide Tfp subunits. As can be seen in Fig. 5C, the structures of PilE1 and PulG, the major pseudopilin in Klebsiella oxytoca type II secretion system (T2SS) (32), show extended similarity.
Because of their high sequence identity (Fig. S2), we used our PilE1 structure as a template to produce a homology model of PilE2 (Fig. 6A). As expected, PilE2 was found to be virtually identical to PilE1 except for its shorter α1β1 loop (Fig. 6B), which is explained by the fact that eight residues at the C terminus of this loop in PilE1 are absent in PilE2 (Fig. S2). Finally, after producing full-length models of PilE1 and PilE2 using the full-length gonococcal pilin (33, 34) as a template, we were able to model packing of these pilins within recently determined structures of Tfpa (14, 15) (Fig. 6C). This revealed that PilE1 and PilE2 fit readily into these Tfp, which have a similar morphology to S. sanguinis filaments. This finding supports the notion that polymerization of pilins into filaments in Gram-positive species also occurs via hydrophobic packing of their α1N helix within the filament core.
Together, these findings show that S. sanguinis Tfp obey structural principles common to this class of filaments with some intriguing peculiarities. Although S. sanguinis major pilins adopt a canonical type IV pilin-fold, their C terminus appears to be highly flexible, which is uncommon. Moreover, whereas S. sanguinis major pilins fit readily within available Tfp structures, they are more similar structurally to pseudopilins than to bona fide Tfp-forming pilins.
PilA, PilB, and PilC are minor pilin components of S. sanguinis Tfp
In all Tff systems, there are in addition to the major pilins several proteins with class III signal peptides, whose role and localization are often unclear (3). To start with experimental characterization of PilA, PilB, and PilC in S. sanguinis, we purified each protein and generated antisera. Immunoblotting using whole-cell protein extracts confirmed that the three proteins are expressed because they could be detected in WT, but not in the corresponding deletion mutants (Fig. 7A). We then tested whether PilA, PilB, and PilC were processed by PilD, which for PilA was uncertain considering its degenerate class III signal peptide (Fig. 1B). Processing by PilD, which removes the leader peptide, is expected to generate mature proteins of 17, 50.5, and 52.8 kDa for PilA, PilB, and PilC, respectively, shorter than their 18-, 51.5-, and 53.9-kDa precursors. Immunoblots confirmed that all three proteins are cleaved by PilD as indicated by the detection of proteins with masses consistent with unprocessed precursors in a ΔpilD mutant (Fig. 7A). Next, we took advantage of our ability to prepare highly pure Tfp, to determine whether PilA, PilB, and PilC could be detected in pilus preparations by immunoblotting. As can be seen in Fig. 2A, although only PilE1 and PilE2 could be detected by SDS-PAGE/Coomassie analysis, PilA, PilB, and PilC are readily detected by immunoblotting in these preparations (Fig. 7B). Co-purification of PilA, PilB, and PilC with the filaments depended upon their processing by PilD because these proteins were not detected in sheared fractions prepared from a nonpiliated ΔpilD mutant (Fig. 7B). In conclusion, findings that PilA, PilB, and PilC are cleaved by PilD and co-purify with Tfp suggest that these three proteins are minor (low abundance) pilin components of S. sanguinis Tfp, likely assembled into filaments in a similar fashion to the major subunits PilE1 and PilE2.
Homopolymeric filaments composed only of PilE1 or PilE2 are able to promote motility, but at different speeds
As previously reported, whereas a double ΔpilE1ΔpilE2 mutant is nonpiliated, single ΔpilE1 and ΔpilE2 mutants produce WT-like filaments consisting of the remaining pilin (24). We therefore wondered whether these homopolymeric filaments are functional and, if so, whether the structural differences between the two pilins (see Fig. 6B) would have a functional impact. We compared the ability of the homopolymeric filaments produced by single ΔpilE1 and ΔpilE2 mutants to mediate twitching motility. We first found that these mutants still exhibited spreading zones around bacteria grown on agar plates (Fig. 8A). This confirms that Tfp consisting exclusively of PilE1 or PilE2 are functional. Motility was next assessed quantitatively at a cellular level by tracking under the microscope the movement of small chains of cells (Fig. 8B). As previously reported for the WT (24), both mutants showed “train-like” directional motion mainly parallel to the long axis of bacterial chains. Short duration movies illustrating the movement of ΔpilE1 and ΔpilE2 are included as Supplemental Information (Movies S1 and S2). Measurement of instantaneous velocities revealed that, whereas the WT moved at 694 ± 4 nm s−1 (mean ± S.E., n = 22,957) consistent with previous measurements (24), the mutants moved at 462 ± 2 nm s−1 (n = 37,001) for ΔpilE1, and 735 ± 3 nm s−1 (n = 22,231) for ΔpilE2 (Fig. 8B). These statistically significant differences suggest that the two pilins are not functionally equivalent with respect to twitching motility.
Discussion
Their ubiquity in prokaryotes (1) makes Tff an important research topic. A better understanding of the molecular mechanisms governing Tff biology might have practical implications for human health and nanotechnology. Perhaps one of the reasons for our limited understanding of Tff biology is that, historically, these filamentous nanomachines have been studied in just a few Gram-negative bacterial species, all belonging to the same phylum (Proteobacteria) (4). The study of Tff in phylogenetically distant species has the potential to move the field forward, which has recently sparked studies in Archaea (2) and in distant phyla of Bacteria (21). One of the most promising new Tfp models that has emerged is the Gram-positive opportunistic pathogen S. sanguinis (24, 25). A recent systematic genetic analysis of Tfp biology in this species, the first to be realized in a non-Proteobacterium, showed that S. sanguinis uses a simpler machinery (with fewer components) to assemble canonical Tfpa that generates high tensile forces and power twitching motility (24). In this report, we performed in depth biochemical and structural analysis of S. sanguinis filaments, which led to the notable findings discussed below.
The first important achievement in this study is the establishment of one of the most complete biochemical pictures of a Tfp. Critically, this confirms the observed trend (24) that S. sanguinis Tfpa are simpler filaments, with fewer components. Indeed, in Gram-negative Tfpa models, in addition to the major pilin there are 7–8 proteins possessing class III signal peptides (4). For example, in Neisseria meningitidis, there are four conserved pilin-like proteins required for piliation (PilH, PilI, PilJ, and PilK) whose role (priming filament assembly) (35) and localization (at the tip of the pili or distributed throughout the filaments) (36, 37) remain uncertain, and three species-specific minor pilins (ComP, PilV, and PilX) that are dispensable for piliation but modulate Tfp-associated functions (38–40). In contrast, in S. sanguinis, besides PilE1 and PilE2 there are only three Pil proteins possessing class III signal peptides (PilA, PilB, and PilC). As shown here, these proteins are efficiently recognized and post-translationally modified by PilD, which processes their N-terminal leader peptides and (most likely) methylates the first residue of the resulting mature proteins, as demonstrated for PilE1 and PilE2. Processing follows widely conserved principles (1). No other PTM, which frequently decorate major pilins in Gram-negative Tfp are found on S. sanguinis PilE1 and PilE2. Importantly, the use of highly pure pilus preparations clearly showed that these five proteins are Tfp subunits. PilE1 and PilE2 are the two major pilins, whereas PilA, PilB, and PilC are three minor pilins. Although the arrangement of the major pilins in the filaments (geometric or stochastic) remains to be determined, this study strongly suggests that S. sanguinis Tfp are heteropolymeric structures containing comparable amounts of PilE1 and PilE2, a property not previously reported for Tff. However, the reason for this peculiarity remains unclear because the homopolymeric Tfp assembled by ΔpilE1 and ΔpilE2 mutants power efficient twitching motility. A possible explanation might be that PilE1 and PilE2 are important for optimal stability of S. sanguinis Tfp as ΔpilE1 and ΔpilE2 mutants produce less filaments than WT. On the other hand, what could be the role of the minor subunits of S. sanguinis Tfp (PilA, PilB, and PilC)? Although they are required for piliation (24), they are unlikely to prime filament assembly because they are unrelated to the four conserved pilin-like proteins carrying out this process in Gram-negative Tfp (35). Rather, PilA, PilB, and PilC are likely to contribute to filament stability and to modulate Tfp-associated functions. This notion is supported by the unusual presence of large C-terminal domains in PilB and PilC. Interestingly, both the von Willebrand factor type A motif (found in PilB) and the concanavalin A-like lectin/glucanase structural domain (found in PilC) are often involved in binding protein or carbohydrate ligands, suggesting that PilB and PilC are involved in host adhesion in S. sanguinis, a property frequently associated with Tfp in other species (4).
The structural information generated on S. sanguinis Tfp is the second notable achievement in this study. Our new pilus purification strategy shows that the morphological features of S. sanguinis filaments are canonical of Tfpa (4). It is possible that the thick and irregular filaments purified previously (24) were damaged during ultracentrifugation by the presence of cells/cellular debris. Our high-resolution NMR structure of the globular domain of PilE1 confirms that Gram-positive Tfp major subunits adopt the classical type IV pilin-fold (1, 3). The 3D structure of PilE2 is likely to be virtually identical, because of its ∼80% sequence identity to PilE1. Full-length PilE1/PilE2 are therefore expected to adopt the canonical “lollipop” structure (33, 34), because the missing hydrophobic α1N portion can reliably be modeled as a protruding α-helix. Their α1-helix is likely to adopt a gentle S-shaped curve like in the gonococcal major pilin (33, 34), because the helix-breaking Pro22 is conserved. Importantly, PilE1/PilE2 fit well in recent cryo-electron microscopy (cryo-EM) reconstructions of several Gram-negative Tfpa (14, 15), which despite different helical parameters display similar packing of the α1N helix within the core of the filaments. Because of the conservation of the helix-breaking Pro22, it is likely that the α1N helix will be partially melted between Ala14 and Pro22 as the pilins in the above reconstructions (14, 15), which is thought to provide flexibility and elasticity to the filaments (41). In addition, this would allow the formation of a salt bridge between Glu5 and the methylated Phe1 (both conserved in PilE1 and PilE2) of the neighboring pilin. Minor subunits PilB and PilC, which have canonical class III signal peptides similar to PilE1/PilE2 are likely to assemble within filaments in a similar fashion. It is unclear, however, whether the α1N helix would be partially melted, because the helix-breaking Pro22 is absent in these proteins. Moreover, it remains to be seen how the bulky C-terminal domains in PilB and PilC, which appear to have been “grafted” by evolution onto a pilin moiety, would be exposed on the surface of the filaments. As for PilA, its unique class III signal peptide, makes it difficult to predict how its α1N helix will be packed in the filament core, especially because of the unusual Pro4. Critically, our high-resolution structure of PilE1 also challenges two common assumptions in the field (1, 3). First, in contrast to all previously available pilin structures, the C terminus in PilE1 is unstructured and highly flexible. This explains why it is a permissive insertion site and why it even can be deleted and replaced by a His6 tag (25), without interfering with the ability of PilE1/PilE2 to be polymerized into filaments. Therefore, the common rule that the C terminus of major pilins must be stabilized to preserve pilin integrity and its ability to be polymerized (3), is not always true. It cannot be excluded at this point that the major pilins of S. sanguinis are an exception as the major pilin of another Tfp-expressing Gram-positive, Clostridium difficile, apparently needs to stabilize its C terminus (42). Second, a Tfp-forming major pilin, PilE1, is most similar to pseudopilins that form short filaments in alternative Tff, such as PulG from K. oxytoca T2SS (32). This indicates that a major pilin 3D structure cannot be used to predict whether the subunit will form pseudopili or bona fide Tfp, which perhaps blurs the lines between different Tff and suggests an even closer evolutionary relationship between these filamentous nanomachines.
In conclusion, by providing an unprecedented global view of a Gram-positive Tfp, this study further cements S. sanguinis as a model species that is fast closing the gap with historic Gram-negative Tfp models. Together with our recent reports (24, 25) and S. sanguinis exquisite genetic tractability, these findings pave the way for future investigations, which will undoubtedly contribute to improve our understanding of a fascinating filamentous nanomachine almost universal in prokaryotes.
Experimental procedures
Strains and growth conditions
Strains and plasmids that were used in this study are listed in Table S2. For cloning, we used E. coli DH5α. E. coli BL21(DE3) was used for protein purification. E. coli strains were grown in liquid or solid lysogeny broth (LB) (Difco) containing, when required, 100 μg/ml of spectinomycin or 50 μg/ml of kanamycin (both from Sigma). The WT S. sanguinis 2908 strain and deletion mutants were described previously (24, 25). S. sanguinis strains were grown on plates containing Todd Hewitt (TH) broth (Difco) and 1% agar (Difco), incubated at 37 °C in anaerobic jars (Oxoid) under anaerobic conditions generated using Anaerogen sachets (Oxoid). Liquid cultures were grown statically under aerobic conditions in THT (i.e. TH broth containing 0.05% Tween 80 (Merck)) to limit bacterial clumping. When required, 500 μg/ml of kanamycin (Km) (Sigma) was used for selection. For counterselection, we used 15 mm p-Cl-Phe (Sigma) (25).
Chemically competent E. coli cells were prepared as described (43). DNA manipulations were done using standard molecular biology techniques (44). All PCR were done using high-fidelity DNA polymerases from Agilent (see Table S3 for a list of primers used in this study). S. sanguinis genomic DNA was prepared from overnight liquid cultures using the kit XIT Genomic DNA from Gram-positive bacteria (G-Biosciences). Strain 2908, which is naturally competent, was transformed as described elsewhere (24, 25).
Unmarked S. sanguinis mutants in pilE1 and pilE2 in situ used in this study were constructed using a recently described two-step, cloning-independent, gene editing strategy (25). In brief, in the first step, the target gene was cleanly replaced in the WT, by allelic exchange, with a promoterless pheS*aphA-3 double cassette, which confers sensitivity to p-Cl-Phe and resistance to kanamycin. To do this, a splicing PCR (sPCR) product fusing the upstream (amplified with primers F1 and R1) and downstream (amplified with F2 and R2) regions flanking the target gene to pheS*aphA-3 (amplified with pheS-F and aph-R) was directly transformed into the WT, and allelic exchange mutants were selected on Km-containing plates. Allelic exchange was confirmed for a couple of transformants by PCR. In the second step, the pheS*aphA-3 double cassette was cleanly replaced in this primary mutant, by allelic exchange, with an unmarked mutant allele of the target gene (see below). To do this, an sPCR product fusing the mutant allele to its upstream and downstream flanking regions was directly transformed into the primary mutant and allelic exchange mutants were selected on p-Cl-Phe-containing plates. Markerless allelic exchange mutants, which are KmS, were identified by re-streaking p-Cl-PheR colonies on TH plates with and without Km. The pilE1 and pilE2 mutant alleles encoding proteins with a C-terminal His6 tag were engineered by sPCR. We made two constructs for each gene: a long construct in which we fused the His6 tag to the C terminus of the full-length protein (sPCR with F1/R3 and F3/R2), and a short construct in which we replaced the last seven aa by the tag (sPCR with F1/R4 and F4/R2). To construct the missense mutants in pilE1, we used as a template a pCR8/GW/TOPO plasmid in which the WT gene (amplified with F and R) was cloned (Table S2) and the QuikChange site-directed mutagenesis kit (Agilent) (with complementary primers #1 and #2). Then, the sPCR product for transformation in the primary mutant was produced by fusing the mutant allele (amplified with F and R) to flanking regions upstream (amplified with F1 and R5) and downstream (amplified with F5 and R2).
SDS-PAGE, antisera, and immunoblotting
S. sanguinis whole-cell protein extracts were prepared using a FastPrep-24 homogenizer (MP Biomedicals) and quantified as described elsewhere (24). Separation of the proteins by SDS-PAGE, subsequent blotting to Amersham Biosciences Hybond ECL membrane (GE Healthcare), and blocking were carried out using standard molecular biology techniques (44). To detect PilE1 and PilE2, we used previously described primary rabbit antipeptide antibodies (24). Antisera against PilA, PilB, and PilC were produced for this study by Eurogentec, by immunizing rabbits with purified recombinant proteins (see below). These proteins were then used to affinity purify the antibodies. Primary antibodies were used at between 1/2,000 and 1/5,000 dilutions, whereas the secondary antibody, an ECL horseradish peroxidase-linked anti-rabbit antibody (GE Healthcare) was used at 1/10,000 dilution. Amersham Biosciences ECL Prime (GE Healthcare) was used to reveal the blots. To detect His-tagged proteins, we used a horseradish peroxidase-linked anti-His6 antibody (Sigma) at 1/10,000 dilution.
Tfp purification and visualization
S. sanguinis 2908 Tfp were purified as described elsewhere with minor modifications (24). Liquid cultures (10 ml), grown overnight in THT, were used the next day to re-inoculate 90 ml of THT and grown statically until the OD600 reached 1–1.5, at which point OD were normalized, if needed. Bacteria were pelleted at 4 °C by centrifugation for 10 min at 6,000 × g and pellets were re-suspended in 2 ml of pilus buffer (20 mm Tris, pH 7.5, 50 mm NaCl). This suspension was vortexed for 2 min at full speed to shear Tfp. Bacteria were then pelleted as above, and supernatant containing the pili was transferred to a new tube. This centrifugation step was repeated, before the supernatant was passed through a 0.22-μm pore size syringe filter (Millipore) to remove residual cells and cellular debris. Pili were then pelleted by ultracentrifugation as described (24), resuspended in pilus buffer, separated by SDS-PAGE, and gels were stained using Bio-Safe Coomassie (Bio-Rad). Purified filaments were visualized by TEM after negative staining as described elsewhere (24).
Pulldown purification of His6-tagged filaments was done as follows. Fifty μl of Dynabeads His tag isolation and pulldown beads (Life Technologies) were aliquoted in an Eppendorf tube, and pelleted by placing the tube on a magnet for 2 min. After decanting the buffer, beads were washed three times using 500 μl of pilus buffer, mixed with 900 μl of sheared filaments (prepared as for the purification above), and incubated for 10 min on a roller at room temperature. Then, the tube was placed on the magnetic holder for 2 min and the supernatant discarded. Beads were subsequently washed three times with 400 μl of pilus buffer, and finally incubated for 10 min with 100 μl of elution buffer (pilus buffer with 500 mm imidazole) on a roller at room temperature. Then, the tube was placed on the magnetic holder for 2 min, the supernatant was collected and analyzed by SDS-PAGE/Coomassie and/or immunoblotting.
Proteomics and mass spectrometric analysis of purified Tfp
For the bottom-up MS analysis of purified Tfp, we carefully excised PilE1 and PilE2 protein bands from Coomassie-stained gels and generated enzymatically derived peptides employing four separate enzymes. In brief, as previously described (45), gel pieces were destained and digested overnight with 1 μg of trypsin or Lys-C in 50 mm Na2HCO3, pH 7.8, at 37 °C, 1 μg of chymotrypsin in 100 mm Tris, 10 mm CaCl2·2H2O, pH 7.8, at 25 °C, or 0.3 μg of Asp-N in 50 mm Tris, 2.5 mm ZnSO4·7H2O, pH 8.0, at 25 °C. Generated peptides, which were extracted as previously described (45), were vacuum-concentrated, dissolved in loading buffer (2% acetonitrile, 1% TFA), and desalted using ZIP-TIP tips as instructed by the manufacturer (Millipore). The peptides were eluted with 80% acetonitrile, 1% TFA and vacuum-concentrated. Dried peptide samples were dissolved in 10 μl of 2% acetonitrile, 1% formic acid before analysis by reverse phase LC-MS/MS. Redissolved samples (2–5 μl) were injected into a Dionex Ultimate 3000 nano-UHPLC system (Sunnyvale) coupled online to a QExactive mass spectrometer (Thermo Fisher Scientific) equipped with a nano-electrospray ion source. LC separation was achieved with an Acclaim PepMap 100 column (C18, 3 μm beads, 100 Å, 75-μm inner diameter, 50 cm) and a LC-packing trap column (C18, 0.3-mm inner diameter, 300 Å). The flow rate (15 μl/min) was provided by the capillary pump. A flow rate of 300 nl/min was employed by the nano pump, establishing a solvent gradient of solvent B from 3 to 5% in 5 min and from 5 to 55% in 60 min. Solvent A was 0.1% formic acid, 2% acetonitrile, whereas solvent B was 0.1% formic acid, 90% acetonitrile. The mass spectrometer was operated in data-dependent mode to automatically switch between MS and MS/MS acquisition. Survey full scan MS spectra (from m/z 200 to 2,000) were acquired with the resolution r = 70,000 at m/z 200, with an automated gain control (AGC) target of 106, and ion accumulation time set at 100 ms. The seven most intense ions, depending on signal intensity (intensity threshold 5.6 × 103) were considered for fragmentation using higher-energy collisional induced dissociation (HCD) at r = 17,500 and normalized collision energy (NCE) = 30. Maximum ion accumulation time for MS/MS spectra was set at 180 ms. Dynamic exclusion of selected ions for MS/MS were set at 30 s. The isolation window (without offset) was set at m/z 2. The lock mass option was enabled in MS mode for internal recalibration during the analysis.
To perform a complete top-down MS analysis of the PilE1 and PilE2 proteoforms present in purified S. sanguinis Tfp, three separate methods were employed. For the first method, sample preparation and top-down ESI-MS on a LTQ Orbitrap were performed as previously described (46), and intact protein mass spectra were acquired with a resolution of 100,000 at m/z 400. For the second method, after initial preliminary testing of a gradient of solvent B from 3 to 55% in 10 min, and from 55 to 85% in 12–35 min, the data were acquired on an LTQ Orbitrap operated in positive ionization mode in the data-dependent mode, to automatically switch between MS and MS/MS acquisition. Survey full scan MS spectra (from m/z 200 to 2,000) were acquired with a resolution r = 100,000 at m/z 400, with AGC target of 106 and ion accumulation time set at 100 ms. The two most intense ions, depending on signal intensity (intensity threshold 5.6 × 103) were considered for fragmentation using HCD at r = 17,500 and NCE = 30. The lock mass option was enabled in MS mode for internal recalibration during the analysis. For the third method, a direct injection nano-ESI top-down procedure was employed. Briefly, the Dionex Ultimate 3000 nano-UHPLC system coupled to the LTQ Orbitrap was reconfigured so that after sample loading onto the loop, valve switching allowed the nanopump to inject directly the sample from the loop into the LTQ Orbitrap MS using an isocratic gradient of 10% methanol, 10% formic acid. Data acquisition was done manually from the LTQ Tune Plus (version 2.5.5 sp2) using r = 100,000 and NCE = 400.
Data processing and analysis was done as follows. Bottom-up MS data were analyzed using MaxQuant (version 1.5.2) and the Andromeda search engine against an in-house generated S. sanguinis whole proteome database, and a database containing common contaminants. Trypsin, chymotrypsin, Lys-C, Asp-N, and no enzyme (no restriction) were selected as enzymes, allowing two missed cleavage sites. We applied a tolerance of 10 ppm for the precursor ion in the first search, 5 ppm in the second, and 0.05 Da for the MS/MS fragments. In addition to methionine oxidation, protein N-terminal methylation was allowed, in a separate search, as a variable modification. The minimum peptide length was set at 4 aa, and the maximum peptide mass at 5.5 kDa. False discovery rate was set at <0.01. Deconvolution of the PilE1 and PilE2 mass envelopes from top-down analysis was done as previously described (47). Deconvoluted protein masses are reported as monopronated [M + H+]. Theoretical masses of PilE1 and PilE2 were determined from available sequences.
Twitching motility assays
Twitching motility was assessed macroscopically on agar plates as described elsewhere (24). Briefly, bacteria were re-streaked as straight lines on freshly poured TH plates containing 1% Eiken agar (Eiken Chemicals), which were incubated up to several days under anaerobic conditions in a jar, in the presence of water to ensure high humidity. Motility was analyzed microscopically as described elsewhere (24). In brief, bacteria resuspended in THT were added into an open experimental chamber with a glass bottom and grown for 2 h at 37 °C in the presence of 5% CO2. The chamber was then transferred to an upright Ti Eclipse microscope (Nikon) with an environment cabinet maintaining the same growth conditions, and movies of the motion of small bacterial chains were obtained and analyzed in ImageJ, as described (24). Cell speed was measured from collected trajectories using Matlab.
Protein purification
To produce pure PilA, PilB, and PilC proteins for generating antibodies, we cloned the corresponding genes in pET-28b (Novagen) (Table S2). The forward primer was designed to fuse a noncleavable N-terminal His6 tag to the soluble portion of these proteins, i.e. excluding the leader peptide and the predicted hydrophobic α1N helix. For pilA, we amplified the gene from the 2908 genome, whereas for pilB and pilC, we used synthetic genes (GeneArt), codon-optimized for expression in E. coli. Recombinant proteins were purified using a combination of affinity and gel-filtration chromatographies as follows. An overnight liquid culture, in selective LB, from a single colony of E. coli BL21 (DE3) transformed with the above expression plasmids, was back-diluted (1:500) the next day in 1 liter of the same medium and grown to an OD600 of 0.4–0.6 on an orbital shaker. The temperature was then set to 16 °C, the culture allowed to cool for 30 min, before protein expression was induced overnight by adding 0.1 mm isopropyl 1-thio-β-d-galactopyranoside (Merck Chemicals). The next day, cells were harvested by centrifugation at 8,000 × g for 20 min and subjected to one freeze/thaw cycle in binding buffer A (50 mm HEPES, pH 7.4, 200 mm NaCl, 10 mm imidazole, 1× SIGMAFAST EDTA-free protease inhibitor mixture (Sigma)). Cells were disrupted by repeated cycles of sonication, i.e. pulses of 5 s on and 5 s off during 3–5 min, until the cell suspension was visibly less viscous. The cell lysate was then centrifuged for 30 min at 17,000 × g to remove cell debris. The clarified lysate was then mixed with 2 ml of nickel-nitrilotriacetic acid-agarose resin (Qiagen), pre-washed in binding buffer A, and incubated for 2 h at 4 °C with gentle agitation. This chromatography mixture was then filtered through a Poly-Prep gravity-flow column (Bio-Rad) and washed several times with binding buffer A, before the protein was eluted with elution buffer A (50 mm HEPES, pH 7.4, 200 mm NaCl, 500 mm imidazole, 1× SIGMAFAST EDTA-free protease inhibitor mixture). The affinity-purified proteins were further purified, and simultaneously buffer-exchanged into 50 mm HEPES, pH 7.4, 200 mm NaCl, by gel-filtration chromatography on an Akta Purifier using a Superdex 75 10/300 GL column (GE Healthcare).
For structural characterization of PilE1, we cloned, as above, the portion of pilE1 encoding the soluble portion of this protein in pET-28b to produce a protein with a noncleavable N-terminal His6 tag (Table S2). An overnight pre-culture in LB was back-diluted 1:50 into 10 ml of selective M9 minimal medium, supplemented with a mixture of vitamins and trace elements. This was grown to saturation overnight at 30 °C in an orbital shaker, then back-diluted 1:500 into 1 liter of the same medium containing d-[13C]glucose and [15N]NH4Cl for isotopic labeling. Cells were grown in an orbital shaker at 30 °C until the OD600 reached 0.8, then 0.4 mm isopropyl 1-thio-β-d-galactopyranoside was added to induce protein production overnight at 30 °C. As above, cells were then harvested and disrupted in binding buffer B (50 mm Tris-HCl, pH 8.5, 200 mm NaCl, 10 mm imidazole, 1× SIGMAFAST EDTA-free protease inhibitor mixture). The protein was first purified by affinity chromatography and eluted in 50 mm Tris-HCl, pH 8.5, 200 mm NaCl, 200 mm imidazole, 1× SIGMAFAST EDTA-free protease inhibitor mixture. It was then further purified and buffer-exchanged into 50 mm Na2HPO4/NaH2PO4, pH 6, 200 mm NaCl by gel-filtration chromatography using a Superdex 75 10/300 GL column.
NMR structure determination of PilE1
Structure determination of PilE1 was done by NMR, essentially as described (48). In brief, isotopically labeled purified His6-PilE1 was concentrated to ∼750 μm in NMR buffer (50 mm Na2HPO4/NaH2PO4, pH 6, 50 mm NaCl, 10% D2O). A full set of triple resonance NMR spectra was recorded on a Bruker Avance III 800 MHz spectrometer equipped with triple resonance cryoprobes at 295 K, and processed with NMRPipe (49). Backbone assignments were completed using a combination of HBHA, HNCACB, HNCO, HN(CA)CO, and CBCA(CO)NH experiments using NMRView (One Moon Scientific) (50). Side chain resonance assignments were obtained from a combination of CC(CO)NH, HC(C)H-TOCSY, and (H)CCH-TOCSY experiments using in-house software developed within NMRView (51). Distance restraints were obtained from 3D 1H,1H,15N-NOESY and 1H,1H,13C-NOESY spectra and used for structure calculations in ARIA 2.3 (52), along with dihedral angle restraints obtained from chemical shift values calculated using the TALOS+ server (53). For each round of six calculations, 100 structures were calculated over eight iterations. In the final iteration, the 10 lowest energy structures were submitted to a water refinement stage to form the final structural ensemble.
Bioinformatics
All the sequences were from the genome of S. sanguinis 2908 (24). Protein alignments were done using the Clustal Omega server at EMBL-EBI, with default parameters. Pretty-printing and shading of alignment files was done using the BOXSHADE server at ExPASy. Prediction of functional domains was done by scanning the protein sequence either against InterPro protein signatures (26), or against the SUPERFAMILY database of structural protein domains (28). Both analyses were done using default parameters. MODELLER was used for modeling protein 3D structures (54). In brief, a homology model for full-length PilE1 was produced using MODELLER in multiple template mode, with the N-terminal 25 residues of PilE from the gonococcal Tfp (15) and a representative of the PilE1 NMR ensemble determined in this study. This PilE1 model was then used as a template to produce a PilE2 model using MODELLER in single template mode. These two molecules were then superimposed onto a single chain each of the gonococcal filament cryo-EM reconstruction (15), using COOT (55), and using the SSM superpose function. Sections of the polypeptide were then refined into the electron densities using the real space refine function in COOT to adjust fitting of PilE1/PilE2 monomers into the helical filament. Multiple copies of these models were then used to produce the final representation. Structural homologs of PilE1 were identified by scanning the Protein Data Bank using the Dali server (56).
Author contributions
J.-L. B., I. G., J. H. A., N. B., and V. P. data curation; J.-L. B., I. G., J. H. A., M. K., N. B., S. M., and V. P. formal analysis; J.-L. B., I. G., J. H. A., I. S., E. H., A. M. J. H., V. J. G., C. R., and V. P. investigation; J.-L. B., I. G., J. H. A., M. K., N. B., S. M., and V. P. methodology; J.-L. B., I. G., J. H. A., E. H., A. M. J. H., V. J. G., C. R., M. K., N. B., S. M., and V. P. writing-review and editing; M. K., N. B., S. M., and V. P. resources; M. K., N. B., S. M., and V. P. project administration; N. B. and V. P. supervision; S. M. and V. P. funding acquisition; V. P. conceptualization; V. P. writing-original draft.
Supplementary Material
Acknowledgments
We are grateful to Sophie Helaine (Imperial College London), Chiara Recchi (Imperial College London), and Romé Voulhoux (CNRS Marseille) for critical reading of this manuscript. We express our gratitude to Christian Köhler (University of Oslo) for valuable input and assistance in re-configuring the LC.
This work was supported by Medical Research Council Grants MR/L008408/1 and MR/P022197/1 (to V. P.). The authors declare that they have no conflicts of interest with the contents of this article.
This article contains Movies S1–S2, Tables S1–S3, and Figs. S1–S2
The atomic coordinates and structure factors (code 6I2O) have been deposited in the Protein Data Bank (http://wwpdb.org/).
The NMR solution structure has been deposited in the Biological Magnetic Resonance Data Bank under accession code 34325.
- Tfp
- type IV pili
- Tff
- type IV filament(s)
- PTM
- post-translational modification(s)
- T2SS
- type II secretion system
- TH
- Todd Hewitt
- LB
- lysogeny broth
- TEM
- transmission electron microscopy
- cryo-EM
- cryo-electron microscopy
- sPCR
- splicing PCR
- AGC
- automated gain control
- HCD
- higher-energy collisional induced dissociation
- NCE
- normalized collision energy
- aa
- amino acid(s).
References
- 1. Berry J. L., and Pelicic V. (2015) Exceptionally widespread nano-machines composed of type IV pilins: the prokaryotic Swiss Army knives. FEMS Microbiol. Rev. 39, 134–154 10.1093/femsre/fuu001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Chaudhury P., Quax T. E. F., and Albers S. V. (2018) Versatile cell surface structures of archaea. Mol. Microbiol. 107, 298–311 10.1111/mmi.13889 [DOI] [PubMed] [Google Scholar]
- 3. Giltner C. L., Nguyen Y., and Burrows L. L. (2012) Type IV pilin proteins: versatile molecular modules. Microbiol. Mol. Biol. Rev. 76, 740–772 10.1128/MMBR.00035-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Pelicic V. (2008) Type IV pili: e pluribus unum? Mol. Microbiol. 68, 827–837 10.1111/j.1365-2958.2008.06197.x [DOI] [PubMed] [Google Scholar]
- 5. Francetic O., Buddelmeijer N., Lewenza S., Kumamoto C. A., and Pugsley A. P. (2007) Signal recognition particle-dependent inner membrane targeting of the PulG pseudopilin component of a type II secretion system. J. Bacteriol. 189, 1783–1793 10.1128/JB.01230-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Arts J., van Boxtel R., Filloux A., Tommassen J., and Koster M. (2007) Export of the pseudopilin XcpT of the Pseudomonas aeruginosa type II secretion system via the signal recognition particle-Sec pathway. J. Bacteriol. 189, 2069–2076 10.1128/JB.01236-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. LaPointe C. F., and Taylor R. K. (2000) The type 4 prepilin peptidases comprise a novel family of aspartic acid proteases. J. Biol. Chem. 275, 1502–1510 10.1074/jbc.275.2.1502 [DOI] [PubMed] [Google Scholar]
- 8. Aly K. A., Beebe E. T., Chan C. H., Goren M. A., Sepúlveda C., Makino S., Fox B. G., and Forest K. T. (2013) Cell-free production of integral membrane aspartic acid proteases reveals zinc-dependent methyltransferase activity of the Pseudomonas aeruginosa prepilin peptidase PilD. Microbiologyopen 2, 94–104 10.1002/mbo3.51 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Strom M. S., and Lory S. (1991) Amino acid substitutions in pilin of Pseudomonas aeruginosa: effect on leader peptide cleavage, amino-terminal methylation, and pilus assembly. J. Biol. Chem. 266, 1656–1664 [PubMed] [Google Scholar]
- 10. Hu J., Xue Y., Lee S., and Ha Y. (2011) The crystal structure of GXGD membrane protease FlaK. Nature 475, 528–531 10.1038/nature10218 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Goosens V. J., Busch A., Georgiadou M., Castagnini M., Forest K. T., Waksman G., and Pelicic V. (2017) Reconstitution of a minimal machinery capable of assembling periplasmic type IV pili. Proc. Natl. Acad. Sci. U.S.A. 114, E4978–E4986 10.1073/pnas.1618539114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Chang Y. W., Rettberg L. A., Treuner-Lange A., Iwasa J., Søgaard-Andersen L., and Jensen G. J. (2016) Architecture of the type IVa pilus machine. Science 351, aad2001 10.1126/science.aad2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Mancl J. M., Black W. P., Robinson H., Yang Z., and Schubot F. D. (2016) Crystal structure of a type IV pilus assembly ATPase: insights into the molecular mechanism of PilB from Thermus thermophilus. Structure 24, 1886–1897 [DOI] [PubMed] [Google Scholar]
- 14. Kolappan S., Coureuil M., Yu X., Nassif X., Egelman E. H., and Craig L. (2016) Structure of the Neisseria meningitidis type IV pilus. Nat. Commun. 7, 13015 10.1038/ncomms13015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Wang F., Coureuil M., Osinski T., Orlova A., Altindal T., Gesbert G., Nassif X., Egelman E. H., and Craig L. (2017) Cryoelectron microscopy reconstructions of the Pseudomonas aeruginosa and Neisseria gonorrhoeae type IV pili at sub-nanometer resolution. Structure 25, 1423–1435.e4 10.1016/j.str.2017.07.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Koo J., Lamers R. P., Rubinstein J. L., Burrows L. L., and Howell P. L. (2016) Structure of the Pseudomonas aeruginosa type IVa pilus secretin at 7.4 Å. Structure 24, 1778–1787 10.1016/j.str.2016.08.007 [DOI] [PubMed] [Google Scholar]
- 17. Gold V. A., Salzer R., Averhoff B., and Kuhlbrandt W. (2015) Structure of a type IV pilus machinery in the open and closed state. eLife 4, e07380 10.7554/eLife.07380 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Satyshur K. A., Worzalla G. A., Meyer L. S., Heiniger E. K., Aukema K. G., Misic A. M., and Forest K. T. (2007) Crystal structures of the pilus retraction motor PilT suggest large domain movements and subunit cooperation drive motility. Structure 15, 363–376 10.1016/j.str.2007.01.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Maier B., Koomey M., and Sheetz M. P. (2004) A force-dependent switch reverses type IV pilus retraction. Proc. Natl. Acad. Sci. U.S.A. 101, 10961–10966 10.1073/pnas.0402305101 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Clausen M., Jakovljevic V., Søgaard-Andersen L., and Maier B. (2009) High-force generation is a conserved property of type IV pilus systems. J. Bacteriol. 191, 4633–4638 10.1128/JB.00396-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Melville S., and Craig L. (2013) Type IV pili in Gram-positive bacteria. Microbiol. Mol. Biol. Rev. 77, 323–341 10.1128/MMBR.00063-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Rakotoarivonina H., Jubelin G., Hebraud M., Gaillard-Martinie B., Forano E., and Mosoni P. (2002) Adhesion to cellulose of the Gram-positive bacterium Ruminococcus albus involves type IV pili. Microbiology 148, 1871–1880 10.1099/00221287-148-6-1871 [DOI] [PubMed] [Google Scholar]
- 23. Varga J. J., Nguyen V., O'Brien D. K, Rodgers K., Walker R. A., and Melville S. B. (2006) Type IV pili-dependent gliding motility in the Gram-positive pathogen Clostridium perfringens and other Clostridia. Mol. Microbiol. 62, 680–694 10.1111/j.1365-2958.2006.05414.x [DOI] [PubMed] [Google Scholar]
- 24. Gurung I., Spielman I., Davies M. R., Lala R., Gaustad P., Biais N., and Pelicic V. (2016) Functional analysis of an unusual type IV pilus in the Gram-positive Streptococcus sanguinis. Mol. Microbiol. 99, 380–392 10.1111/mmi.13237 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Gurung I., Berry J. L., Hall A. M. J., and Pelicic V. (2017) Cloning-independent markerless gene editing in Streptococcus sanguinis: novel insights in type IV pilus biology. Nucleic Acids Res. 45, e40 10.1093/nar/gkw1177 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Jones P., Binns D., Chang H. Y., Fraser M., Li W., McAnulla C., McWilliam H., Maslen J., Mitchell A., Nuka G., Pesseat S., Quinn A. F., Sangrador-Vegas A., Scheremetjew M., Yong S. Y., Lopez R., and Hunter S. (2014) InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 10.1093/bioinformatics/btu031 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Imam S., Chen Z., Roos D. S., and Pohlschröder M. (2011) Identification of surprisingly diverse type IV pili, across a broad range of Gram-positive bacteria. PLoS ONE 6, e28919 10.1371/journal.pone.0028919 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Wilson D., Pethica R., Zhou Y., Talbot C., Vogel C., Madera M., Chothia C., and Gough J. (2009) SUPERFAMILY-sophisticated comparative genomics, data mining, visualization and phylogeny. Nucleic Acids Res. 37, D380–386 10.1093/nar/gkn762 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Pugsley A. P. (1993) Processing and methylation of PuIG, a pilin-like component of the general secretory pathway of Klebsiella oxytoca. Mol. Microbiol. 9, 295–308 10.1111/j.1365-2958.1993.tb01691.x [DOI] [PubMed] [Google Scholar]
- 30. Horiuchi T., and Komano T. (1998) Mutational analysis of plasmid R64 thin pilus prepilin: the entire prepilin sequence is required for processing by type IV prepilin peptidase. J. Bacteriol. 180, 4613–4620 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Thomas N. A., Chao E. D., and Jarrell K. F. (2001) Identification of amino acids in the leader peptide of Methanococcus voltae preflagellin that are important in posttranslational processing. Arch. Microbiol. 175, 263–269 10.1007/s002030100254 [DOI] [PubMed] [Google Scholar]
- 32. López-Castilla A., Thomassin J. L., Bardiaux B., Zheng W., Nivaskumar M., Yu X., Nilges M., Egelman E. H., Izadi-Pruneyre N., and Francetic O. (2017) Structure of the calcium-dependent type 2 secretion pseudopilus. Nat. Microbiol. 2, 1686–1695 10.1038/s41564-017-0041-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Parge H. E., Forest K. T., Hickey M. J., Christensen D. A., Getzoff E. D., and Tainer J. A. (1995) Structure of the fibre-forming protein pilin at 2.6 Å resolution. Nature 378, 32–38 10.1038/378032a0 [DOI] [PubMed] [Google Scholar]
- 34. Craig L., Taylor R. K., Pique M. E., Adair B. D., Arvai A. S., Singh M., Lloyd S. J., Shin D. S., Getzoff E. D., Yeager M., Forest K. T., and Tainer J. A. (2003) Type IV pilin structure and assembly: X-ray and EM analyses of Vibrio cholerae toxin-coregulated pilus and Pseudomonas aeruginosa PAK pilin. Mol. Cell 11, 1139–1150 10.1016/S1097-2765(03)00170-9 [DOI] [PubMed] [Google Scholar]
- 35. Cisneros D. A., Pehau-Arnaudet G., and Francetic O. (2012) Heterologous assembly of type IV pili by a type II secretion system reveals the role of minor pilins in assembly initiation. Mol. Microbiol. 86, 805–818 10.1111/mmi.12033 [DOI] [PubMed] [Google Scholar]
- 36. Giltner C. L., Habash M., and Burrows L. L. (2010) Pseudomonas aeruginosa minor pilins are incorporated into type IV pili. J. Mol. Biol. 398, 444–461 10.1016/j.jmb.2010.03.028 [DOI] [PubMed] [Google Scholar]
- 37. Korotkov K. V., and Hol W. G. (2008) Structure of the GspK-GspI-GspJ complex from the enterotoxigenic Escherichia coli type 2 secretion system. Nat. Struct. Mol. Biol. 15, 462–468 10.1038/nsmb.1426 [DOI] [PubMed] [Google Scholar]
- 38. Helaine S., Dyer D. H., Nassif X., Pelicic V., and Forest K. T. (2007) 3D structure/function analysis of PilX reveals how minor pilins can modulate the virulence properties of type IV pili. Proc. Natl. Acad. Sci. U.S.A. 104, 15888–15893 10.1073/pnas.0707581104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Cehovin A., Simpson P. J., McDowell M. A., Brown D. R., Noschese R., Pallett M., Brady J., Baldwin G. S., Lea S. M., Matthews S. J., and Pelicic V. (2013) Specific DNA recognition mediated by a type IV pilin. Proc. Natl. Acad. Sci. U.S.A. 110, 3065–3070 10.1073/pnas.1218832110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Bernard S. C., Simpson N., Join-Lambert O., Federici C., Laran-Chich M. P., Maïssa N., Bouzinba-Ségard H., Morand P. C., Chretien F., Taouji S., Chevet E., Janel S., Lafont F., Coureuil M., Segura A., et al. (2014) Pathogenic Neisseria meningitidis utilizes CD147 for vascular colonization. Nat. Med. 20, 725–731 10.1038/nm.3563 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Biais N., Higashi D. L., Brujic J., So M., and Sheetz M. P. (2010) Force-dependent polymorphism in type IV pili reveals hidden epitopes. Proc. Natl. Acad. Sci. U.S.A. 107, 11358–11363 10.1073/pnas.0911328107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Piepenbrink K. H., Maldarelli G. A., Martinez de la Pena C. F., Dingle T. C., Mulvey G. L., Lee A., von Rosenvinge E., Armstrong G. D., Donnenberg M. S., and Sundberg E. J. (2015) Structural and evolutionary analyses show unique stabilization strategies in the type IV pili of Clostridium difficile. Structure 23, 385–396 10.1016/j.str.2014.11.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Inoue H., Nojima H., and Okayama H. (1990) High efficiency transformation of Escherichia coli with plasmids. Gene 96, 23–28 10.1016/0378-1119(90)90336-P [DOI] [PubMed] [Google Scholar]
- 44. Sambrook J., and Russell D. W. (2001) Molecular cloning: a laboratory manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY [Google Scholar]
- 45. Anonsen J. H., Vik Å., Egge-Jacobsen W., and Koomey M. (2012) An extended spectrum of target proteins and modification sites in the general O-linked protein glycosylation system in Neisseria gonorrhoeae. J. Proteome Res. 11, 5781–5793 10.1021/pr300584x [DOI] [PubMed] [Google Scholar]
- 46. Anonsen J. H., Børud B., Vik Å., Viburiene R., and Koomey M. (2017) Structural and genetic analyses of glycan O-acetylation in a bacterial protein glycosylation system: evidence for differential effects on glycan chain length. Glycobiology 27, 888–899 10.1093/glycob/cwx032 [DOI] [PubMed] [Google Scholar]
- 47. Anonsen J. H., Vik Å., Børud B., Viburiene R., Aas F. E., Kidd S. W., Aspholm M., and Koomey M. (2016) Characterization of a unique tetrasaccharide and distinct glycoproteome in the O-linked protein glycosylation system of Neisseria elongata subsp. glycolytica. J. Bacteriol. 198, 256–267 10.1128/JB.00620-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Berry J. L., Xu Y., Ward P. N., Lea S. M., Matthews S. J., and Pelicic V. (2016) A comparative structure/function analysis of two type IV pilin DNA receptors defines a novel mode of DNA binding. Structure 24, 926–934 10.1016/j.str.2016.04.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Delaglio F., Grzesiek S., Vuister G. W., Zhu G., Pfeifer J., and Bax A. (1995) NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6, 277–293 [DOI] [PubMed] [Google Scholar]
- 50. Johnson B. A., and Blevins R. A. (1994) NMR view: a computer program for the visualization and analysis of NMR data. J. Biomol. NMR 4, 603–614 10.1007/BF00404272 [DOI] [PubMed] [Google Scholar]
- 51. Marchant J., Sawmynaden K., Saouros S., Simpson P., and Matthews S. (2008) Complete resonance assignment of the first and second apple domains of MIC4 from Toxoplasma gondii, using a new NMRView-based assignment aid. Biomol. NMR Assign. 2, 119–121 10.1007/s12104-008-9100-1 [DOI] [PubMed] [Google Scholar]
- 52. Rieping W., Habeck M., Bardiaux B., Bernard A., Malliavin T. E., and Nilges M. (2007) ARIA2: automated NOE assignment and data integration in NMR structure calculation. Bioinformatics 23, 381–382 10.1093/bioinformatics/btl589 [DOI] [PubMed] [Google Scholar]
- 53. Shen Y., Delaglio F., Cornilescu G., and Bax A. (2009) TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts. J. Biomol. NMR 44, 213–223 10.1007/s10858-009-9333-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Webb B., and Sali A. (2014) Comparative protein structure modeling using MODELLER. Curr. Protoc. Bioinformatics 47, 5.6.1–5.6.32 [DOI] [PubMed] [Google Scholar]
- 55. Emsley P., Lohkamp B., Scott W. G., and Cowtan K. (2010) Features and development of Coot. Acta Crystallogr. Sect. D Biol. Crystallogr. 66, 486–501 10.1107/S0907444910007493 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Holm L., and Laakso L. M. (2016) Dali server update. Nucleic Acids Res. 44, W351–355 10.1093/nar/gkw357 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.