Abstract
Klebsiella is widely distributed in nature and recognized for its dual role as a human commensal and an opportunistic pathogen capable of causing severe infections. Among its key virulence mechanisms, the Type VI Secretion System (T6SS) plays a critical role in microbial competition, interspecies interactions, and environmental adaptation. In this study, we investigated a representative set of high-quality Klebsiella genomes and identified two major T6SS loci (T6SSkleb1 and T6SSkleb2), which exhibit distinct structural features and are predominantly found within the Klebsiella pneumoniae species complex (KpSC). Comparative genomics further revealed the diverse distribution of effector and immunity proteins and their correlation with T6SS loci. Based on this distributional bias, we developed a novel computational approach to identify protein families significantly associated with T6SS. These T6SS-associated gene clusters were implicated in mediating adhesion to host cell surfaces during urinary tract infections, as well as in metabolism and immune defense. Moreover, we identified three putative orphan effectors harboring DUF3258, DUF3751, and Sel1 domains. Overexpression of these proteins in Escherichia coli resulted in cytotoxic effects, supporting their potential as T6SS effectors. These findings establish a comprehensive framework for T6SS analysis, aiming to expand and enrich our understanding of the diversity, evolutionary trajectory, and functional implications of T6SS in Klebsiella.
Introduction
The genus Klebsiella, a group of Gram-negative, oxidase-negative, rod-shaped bacteria, that are normally found as normal flora in the nose, mouth, and gastrointestinal tract; however, they can also act as opportunistic human pathogens. Klebsiella organisms can cause various disease states, notably pneumonia, urinary tract infections, sepsis, meningitis, diarrhea, peritonitis and soft tissue infections [1]. Among species of the genus Klebsiella, K. pneumoniae is the most prevalent. A significant proportion of infections caused by K. pneumoniae can be attributed to two primary pathotypes: multidrug-resistant (MDR) and hypervirulent (hv) clones [2]. Also, K. pneumoniae has demonstrated the ability to acquire mobile elements and genetic rearrangements that transfer antimicrobial resistance and/or virulence traits, leading to simultaneously exhibit hypervirulence and anti-microbial resistance, termed MDR-hv K. pneumoniae [3]. The increasing growth of serious infections due to hypervirulence and ineffective treatment caused by antibiotic resistance rendered MDR-hv K. pneumoniae a real superbug that poses a serious threat to public health [4]. Apart from K. pneumoniae, Klebsiella oxytoca is also a major source of community-acquired pneumonia and nosocomial infections and the second most common cause of gram-negative bacteremia and urinary tract infections in Asia [5].
One of the critical factors contributing to the pathogenicity of Klebsiella species is the Type VI Secretion System (T6SS), a complex nanomachine used by many Gram-negative bacteria to deliver effector proteins directly into target cells. Initially identified in Vibrio cholerae [6], the T6SS is composed of several components that form a phage-like structure anchored to the bacterial membrane and usually consists of 13 core components designated TssA-M. The core structure of T6SS resembles an inverted bacteriophage tail, containing membrane complex, baseplate, and tail tube/sheath complex [7]. The membrane complex is composed of three proteins, TssJ, TssL, and TssM are together involved in anchoring the system to the bacterial inner membrane and facilitating the assembly of the rest of the structure. Similar to the baseplates of contractile phages, the T6SS baseplate comprises a central hub surrounded by at least six proteins, which is essential for the assembly of the TssD tube and sheath and triggering sheath contraction. TssB and TssC constitute the contractile sheath of T6SS, which encases the inner TssD tube during assembly [7,8]. The T6SS is predominantly associated with antagonistic activities, mediating both inter- and intra-bacterial competition [9–11]. Its effector proteins exert antimicrobial activity by targeting peptidoglycan, bacterial nucleic acids, and disruption of cell membranes [12]. Recent studies have broadened the original understanding of T6SS effector proteins, demonstrating their roles not only in bacterial competition, but also as potent weapons against fungi and eukaryotic cells. For example, the first fungal-specific T6SS effector proteins, Tfe1 and Tfe2 from Serratia marcescens, showing antifungal activity against Saccharomyces cerevisiae and Candida glabrata [13,14]. Immunity proteins are typically encoded adjunct to effector proteins mediating antimicrobial activity to antagonize self-intoxication [15,16].
Effector protein identification in T6SS has been achieved through both experimental and computational approaches. Experimental approaches, such as comparative proteomics [11,17,18], transposon mutagenesis [19] have successfully identified effectors with antimicrobial and anti-eukaryotic activities in several bacterial species. However, these methods are labor-intensive, often require specific conditions to induce T6SS activity, and may fail to detect lowly expressed effectors under particular environmental or physiological contexts. Computerized approaches, such as analysis of genes within the T6SS cluster and auxiliary modules [20–22], as well as genes encoding T6SS marker domains (e.g., MIX and FIX) [22,23], demonstrate the potential to use such tools for discovering new toxins. Nevertheless, current approaches-whether experimental or computational-tend to focus on effectors encoded within T6SS loci or immediately downstream of vgrG, potentially overlooking orphan genes located outside these regions. In this context, Fridman et al. introduced a methodology relying on a comparative genomics that enabled the discovery of a widely distributed, membrane-disrupting effector family in Vibrio parahaemolyticus [24]. Building on this strategy, and to complement both traditional experimental methods and existing computational pipelines, we have developed an integrative framework combining comparative genomics, bioinformatic screening, and experimental validation. This approach enables the systematic identification of orphan T6SS effectors in Klebsiella species and provides a novel model for uncovering previously unrecognized components of the T6SS arsenal.
Current studies on the genus Klebsiella have focused predominantly on K. pneumoniae, while often overlooking other species within the genus. Moreover, research on K. pneumoniae has frequently been limited to a small number of clinical strains or has lacked comprehensive genomic analyses. Although T6SS was known to contribute to bacterial competition, cell invasion and colonization in K. pneumoniae [9], significant gaps remain in our understanding of the distribution and diversity of this system in Klebsiella, necessitating further investigation. This study addresses these gaps by comprehensively analyzing the genomes of all species within the genus Klebsiella, characterizing the genomic distribution and diversity of T6SS loci, employing statistical and computational approaches to predict effector proteins and identify putative candidates, and performing experimental validation to confirm the antagonistic activity and functional roles of key effectors. This work enhances our understanding of the genomic diversity and distribution of T6SS in Klebsiella, shedding light on its role in bacterial competition and species-specific adaptations.
Results
Identification and characterization of T6SS loci and phylogeny analysis in Klebsiella spp.
The high-quality genome that passed the quality control (S1 Table) were compared against the protein database obtained from SecReT6 [25] using Proteinortho v 6.3.0 [26] to identify T6SS loci based on the T6SS core components and other regionally intrinsic proteins. The distribution of T6SS of total 4,434 genomes from the NCBI databases was evaluated and two major types of T6SS loci in Klebsiella genus have been identified, here, namely, T6SSkleb1 and T6SSkleb2, respectively. To assess the similarity between homologous genes, a BLASTn alignment was performed, and the results suggested that these two T6SS loci are not derived from a single ancestral locus through duplication event (Fig 1A).
Fig 1. Distribution and characterization of T6SS loci across the Klebsiella genus.
(A) Genomic organization of two representative Klebsiella genomes displaying T6SS gene clusters. Core T6SS structural proteins are shown in colored arrows, and non-core proteins in gray. Percent identity between homologous proteins was calculated using BLASTp (default NCBI parameters) and is indicated above connecting dashed lines. Gray arrows indicate non-core regional proteins. Percent identity between homologous proteins was calculated using BLASTp with default NCBI parameters. (B) Percentage of genomes harboring complete, partial, or no T6SSkleb1 or T6SSkleb2 gene clusters. The Venn diagram shows the number of genomes containing one or both complete gene clusters. (C) Maximum-likelihood phylogenetic tree built from 11 Klebsiella species using 10,000 bootstrap replicates, with E. coli as the outgroup. The two columns on the right indicate the proportion of genomes with complete T6SSkleb1 or T6SSkleb2 gene clusters for each species. An unrooted phylogenetic tree of the core protein TssB was also constructed, showing nine distinct groups, with 139 representative TssB sequences from Klebsiella genus. All TssB sequences within Klebsiella belong to the i2 type, and branches have been collapsed for clarity.
Notably, loci T6SSkleb2 does not contain any tssE, which participates in the composition of baseplate, and T6SSkleb1 lacks several core components, including tssB, tssC, tssD and tssH. We observed that genes encoding tssD (11,544/14,160) and vgrG (4,366/11,195) were situated outside the T6SS locus among the T6SSkleb1,2 +genomes, and putative effectors are commonly encoded in the vicinity of these “orphan” genes which have also been observed in other studies [27,28]. Furthermore, the vgrG genes in both of the loci are located at a relative variable region where various effectors and immunity proteins were located, as also reported by previous studies [27,29].
Based on the classification of gene clusters as complete or incomplete, the majority of these strains contained complete T6SS loci encoding conserved genes, which accounted for 3,574/4,434 for T6SSkleb1 and 2,962/4,434 for T6SSkleb2, respectively. Additionally, 2,616 strains contained two complete gene loci within their genomes. In contrast, a total of 860 and 1,472 genomes which have incomplete T6SSkleb1 and T6SSkleb2 (Fig 1B).
In addition to the core components of the T6SS gene clusters, TagL has been identified in the formation of both of the loci. TagL acts as a component of the basal membrane complex to help anchor to the cell wall. It is worth noting that TagL Peptidoglycan-binding domain (PGBD) stabilizes the transmembrane complex of the T6SS by binding to peptidoglycan, thereby helping to withstand the forces generated during sheath contraction [30].
A phylogenetic tree was constructed to investigate the distribution of T6SS loci in relation to species phylogeny. The strains were mainly grouped into two phylogroups, and with T6SSkleb1 is notably more abundant in K. pneumoniae monophyletic group, which corresponds to species within the KpSC, including K. pneumoniae, Klebsiella quasipneumoniae, Klebsiella variicola, K. quasipneumoniae subsp. similipneumoniae, K. variicola subsp. tropica, Klebsiella quasivariicola, and Klebsiella africana. In contrast, species within the Klebsiella huaxiensis clade exhibited a lower number of T6SS loci. The prevalence of T6SSkleb1 and T6SSkleb2 in these species was markedly reduced (Fig 1C), with respective frequencies ranging from 0% to 17% and 0% to 25% of each species.
Moreover, based on the schemes proposed by Barret et al. [31,32] and Russell et al. [33] from SecReT6 [25], T6SS could be classified into four types according to the tssB genes, namely, T6SSi, T6SSii, T6SSiii and T6SSiv, with six subtypes (i1, i2, i3, i4a, i4b and i5) have been identified within T6SSi system. In our study, a total of 2,962 complete genomes containing tssB were obtained, of which 139 representative sequences after clustering were used for further classification. The type i2 has been identified in all strains with complete T6SSkleb2 cluster in Klebsiella spp. except for Klebsiella pasteurii and K. huaxiensis (Fig 1C), which do not possess any T6SSkleb2, enriching the knowledge of TssB classification for Klebsiella.
Comprehensive analysis of T6SS effectors and immunity proteins across Klebsiella genus reveals correlation with complete T6SS loci
To gain deeper insights into the T6SS effectors and immunity proteins, we performed a comprehensive homologous search across 4,434 genomes with experimentally validated T6SS effectors (T6SEs) and immunity proteins (T6SIs). A total of 50 effector protein groups and 28 immunity protein groups have been identified across genus (Figs 2A and S1A) with notable heterogeneous in their distribution. Additionally, we identified a total of 8 distinct categories along with corresponding structural domains of effector proteins including peptidase effector (Tpe), DNase and RNase effector (Tde), lipase effector (Tle), amidase effector (Tae), membrane-disrupting effector (Tme), metal iron acquisition effector (Tie), glycoside hydrolase effector (Tge) and unclassified secretion system effector (Tse) (Table 1). Seven effector and four immunity proteins were found across all Klebsiella species (Figs 2B and S1B). Notably, 10 effectors (EFF01499, EFF00186, EFF00053, EFF00045, EFF01811, EFF01522, EFF01461, EFF76629, EFF01873, EFF01523) were exclusively found in K. pneumoniae. The immunity proteins displayed a more homogeneous and less species-specific distribution compared to the effectors.
Fig 2. Distribution of T6SS effector proteins across Klebsiella species.
(A) Heatmap showing the proportions of 50 effector proteins in the genomes of various Klebsiella species. (B) Concentric rings represent the distribution of effector proteins (eff) across different species. The innermost ring represents 7 effector proteins found across all species. (C) Comparison of unique effector proteins between genomes with T6SSleb1,2+ and genomes without any T6SS loci (T6SS-), illustrated by a Venn diagram. (D) Fold-change analysis (log2-transformed) of genomic counts for effector proteins between T6SSkleb1,2+ and T6SS- genomes, with a fitted trend line. Effector proteins are labeled numerically. (E) Comparison of the mean value of effector proteins per each genome between T6SSkleb1,2+ (n = 2,616) and T6SS- (n = 190). Error bars represent the standard deviation, and statistical significance was assessed using a one-sided Welch’s t-test (****p < 0.0001).
Table 1. Identified effectors in Klebsiella spp.
| Category | Effectors | Identified domains |
|---|---|---|
| Tse | EFF01806, EFF00142, EFF01453, EFF01478, EFF01504, EFF01505, EFF01802 | Mn_catalase; CSD; RNAse_A_bac; rve, rve_2, HTH_Tnp_1, HTH_21; |
| Tpe | EFF01888, EFF01461, EFF01469 | Not detected |
| Tde | EFF01467, EFF01484, EFF01485, EFF01499, EFF01520, EFF01522, EFF01523, EFF01530, EFF01811, EFF01873, EFF01894, EFF01900, EFF18627 | Phage_GPD, Pyocin_S, DUF2345, T6SS_Vgr, Phage_base_V; BON; tRNA-synt_1e, DALR_2; PAAR_motif, DUF6531, RHS, RHS_repeat, RHS_repeat, RHS_repeat, RHS_repeat; T6SS_HCP; Pyocin_S, T6SS_HCP; Usp; Pyocin_S, PAAR_motif; Imm52; Tox-REase-5; AHH, RHS; |
| Tle | EFF00145, EFF01279, EFF01348, EFF01465, EFF00019, EFF01348, EFF01899 | DUF2235, DUF2235; PLDc_2; PGAP1; PLDc, PLDc; |
| Tae | EFF00045, EFF01502, EFF01510, EFF01870, EFF76642 | Tae4; Amidase_2; Amidase, Amidase; |
| Tme | EFF01834, EFF01836 | Not detected |
| Tie | EFF01889 | SBP_bac_11 |
| Tge | EFF76634 | Peptidase_M23, LysM; |
Tse, unclassified secretion system effector; Tpe, T6SS peptidase effector; Tde, T6SS DNase/RNase effector; Tle, T6SS lipase effector; Tae, T6SS amidase effector; Tme, T6SS muramidase effector; Tie, T6SS metal iron acquisition effector; Tge, T6SS glycoside hydrolase effector.
We hypothesis that the genomes with a complete T6SS locus exhibit a higher abundance of T6SS-associated genes than the genomes without any. To test the hypothesis, we conducted a comparative genomics analysis between the strains with genomes with either both or without any T6SS loci (T6SSkleb1,2 + /2,616genomes, T6SS-/190genomes), with detailed definitions provided in the Methods section. For effectors, 40 types were identified in the T6SSkleb1,2 + genomes, whereas only 29 were found in the T6SS- genomes (Fig 2C). The distribution of T6SS effectors showed a skew distribution with a higher number of effectors in the T6SSkleb1,2 + group compared with the T6SS- group (Fig 2D). Similarly, the Welch’s t-test revealed that the mean value of effectors in T6SSkleb1,2 + group is significantly higher than that in T6SS- group (p < 0.0001) (Fig 2E).
Meanwhile, comparisons of immunity proteins between the two groups yielded results consistent with effectors that the immunity proteins are more prevalent in the T6SSkleb1,2 + group (S2 Fig). These results indicated that T6SS effectors and immunity proteins are widely distributed across the genus with various distribution, and the statistical significance supports the hypothesis that the presence of a complete T6SS locus correlates with a greater abundance of T6SS-related genes.
Categorization of T6SS effectors and immunity proteins downstream of vgrG in Klebsiella species
In fact, the C-terminal tail of the VgrG protein plays a critical role in determining the specificity of effector recognition and delivery [34]. Notably, orphan vgrG and tssD genes—those located outside the main T6SS cluster—are often found in the proximity of genes encoding putative effector proteins [28]. Previous studies have also shown that effectors and cognate immunity proteins are often encoded downstream of vgrGs [35]. For these reasons, vgrG often serves as an essential resource for effector identification. Therefore, to systematically identify effectors associated with vgrGs, explore their relationships with immunity proteins, and characterize domains associated with these genes, we analyzed up to three downstream genes in all T6SSkleb1,2+ and T6SS- genomes. As a first step, we compared the number of vgrGs between T6SSkleb1,2+ and T6SS- genomes. As expected, the number of vgrG genes is significantly lower in genomes without any loci compared with those containing one or both of the loci (p < 0.001) (Fig 3A). Each T6SSkleb1,2 + genome contains 2.51 vgrG genes while the T6SS- genomes possess 0.70 vgrGs on average. Moreover, 13,736 proteins downstream of vgrGs were identified, among which 5,305 have been experimentally validated as homologs in the SecReT6 [25] database.
Fig 3. Characterization of structural domains in genes downstream of vgrGs.
(A) Boxplot comparing the number of vgrG genes across four genome groups: T6SSkleb1,2 + , T6SSkleb1 + , T6SSkleb2 + , and T6SS- genomes. ****p < 0.0001 (Mann-Whitney U Test). Black circles represent outliers. Median lines are shown in a uniform color to improve visibility when overlapping with box borders. (B) Structural annotation of proteins downstream of vgrGs. The network diagram shows connections between domains in downstream proteins. Circle size represents the relative abundance of each domain, and line thickness indicates the frequency of specific domain combinations. (C) Pie chart summarizing the classification of structural domains of effector proteins based on experimentally validated data from the SecReT6 database. Domains with less than 1% relative abundance were grouped under “Others”. (D) Network analysis of three downstream genes annotated as identified effector and immunity proteins. Genes that are adjacent on the same genome are linked. The size of the circle represents the abundance of certain domain, and thickness of the connected line indicates frequency of particular connection. (E) Schematic representation of gene clusters for genes encoding effectors and immunity proteins. Gray arrows indicate genes encoding other regional proteins other than T6SS effectors or immunity proteins.
Analysis of the genes downstream of vgrGs revealed several highly frequent functional domains associated with both effector and immunity proteins (Fig 3B), along with functional domains characteristic of experimentally validated effector proteins (Fig 3C). Among these, DUF3304 is the most abundant, appearing 3,681 times across all analyzed proteins. Other frequently observed domains included Pyocin_S (1,465 occurrences), LysM (991 occurrences), and Peptidase_M23 (809 occurrences). DUF3304 domain was found in several known bacterial immunity proteins [36–38] (S2 Table), suggesting their potential role in effector-immunity interactions while the exact function of the domain remains to be uncharacterized. The consistent presence of Pyocin_S, LysM, and Peptidase_M23 highlights their widespread use in bacterial competition strategies. Additionally, SLT transglycosylase (801 occurrences), primarily involved in cell wall degradation, was also commonly identified.
Network analysis of the structural domains identified recurring combinations in effector proteins, reflecting their modular architecture. Here, we operationally define a “modular architecture” as a recurrent pattern of connected protein domains, based on their co-occurrence in individual proteins. For example, effector proteins frequently included RHS domains in combination with DUF6531, GH-E, AHH, Tox-GHH, Tox-ART-HYD-1 forming large multi-domain toxins with diverse functionalities. Similarly, LysM, Peptidase_M23, and SLT were often co-localized within degradative effectors, underscoring their role in targeting bacterial cell walls within the genus. Phage-derived domains, such as Phage_base_V and Phage_GPD, were also prevalent among the identified effectors. These domains reflect the evolutionary relationship between T6SS and bacteriophage tail-like structures, as previous studies have proposed that T6SS machinery shares a common evolutionary origin with phage contractile systems [8,39].
The relationship between effectors and immunity proteins demonstrated notable complexity and flexibility. A single effector can associate with multiple immunity proteins, and conversely, a single immunity protein can cognate with different effectors (Fig 3D and 3E). For instance, IMU17302, whose cognate effector was identified as EFF01888 (Tpe) in Aeromonas dhakensis strain SSU [40], was also found adjacent to multiple effectors, including EFF01826 (Tpe), EFF01467 (Tde), and EFF01461 (Tpe). Meanwhile, EFF01826 was shown to cognate with IMU00349, highlighting the dynamic interplay between effectors and immunity proteins.
Identification of T6SS-associated orthologous groups
To establish a baseline for the association between T6SS loci and T6SS-associated genes, we first compared genomes harboring both T6SSkleb1 and T6SSkleb2 loci (T6SSkleb1,2+) with genomes lacking both loci (T6SS-). This comparison revealed a clear enrichment of known T6SS-associated proteins in T6SS+ genomes (Fig 2E). Building on this, we conducted two independent locus-specific analyses: T6SSkleb1 + vs T6SSkleb1- (regardless of T6SSkleb2 status), and T6SSkleb2 + vs T6SSkleb2- (regardless of T6SSkleb1 status). These comparisons aimed to identify orthologous groups that are specifically associated with each individual locus. For completeness, we also performed alternative comparisons between T6SS kleb1,2 + vs T6SSkleb1+,2− or T6SSkleb2+,1−, which may reflect cumulative or combinatorial effects. These additional comparisons are provided in S3 Fig and S3 Table. A total of 89,182 and 84,859 protein families were clustered for the genomes containing T6SSkleb1 + /T6SSkleb1- and T6SSkleb2 + /T6SSkleb2- and the orthologous groups that are significantly associated with T6SS were identified (see Methods for the details). Overall, 608 orthologous groups are significantly associated with T6SSkleb1 and 377 are significantly associated with T6SSkleb2. The protein families were further annotated with SecReT6 [25,41] database, Bastion6 [42] and eggNOG-mapper v2 [43] as listed in S2 Table. A network analysis was performed to study the interaction of those protein groups which revealed that the orthologous groups are well connected in the genomes.
Based on the results from various annotations, known T6SS-associated proteins, core-components, potential effectors, and unknown functional gene clusters were identified among the T6SS-associated protein families. Known proteins associated with the T6SS include components of the incomplete locus itself, as well as related effector and immunity proteins. Furthermore, several potential novel T6SS effectors have also surfaced.
Cluster9525 and Cluster986 (DUF2345 domain containing protein), which contain the highest number of proteins as shown as central node in the interaction network in T6SSkleb2 and T6SSkleb1 respectively (Figs 4A and S4), likely play important roles in the biological functions associated with T6SS. Notably, DUF2345 domain can be encoded either as an independent gene or as part of other proteins, being fused to the C-terminus of VgrG or the N-terminus of toxin, and it has been experimentally demonstrated that proteins containing DUF2345 assist the interaction between VgrG and corresponding effectors, which is critical in the secretion process of T6SS effectors [35]. However, the protein family in which Cluster9525 lacks definitive structural annotation, necessitating further research to confirm its potential role in T6SS.
Fig 4. Network analysis of protein families enriched in T6SSkleb1/2-associated genomes.
(A) The network, constructed using Cytoscape, includes 377 protein families associated with T6SSkleb2. Node size is proportional to the number of proteins in each family, with colors transitioning from dark to light based on abundance. Edge colors transition from yellow to green to purple based on interaction frequency. Node labels are prioritized by preferred names, followed by eggNOG-mapper annotations and cluster IDs. Representative loci of (B) T6SSkleb2 and (C) T6SSkleb1-associated protein families. Gene clusters are depicted with consistent color hues for proteins located within the same locus or sharing related biological functions. Node size represents the number of proteins within a family, and colors transition from dark to light based on abundance. Line thickness reflects the frequency of protein-protein interactions.
Some putative EI pairs also caught our attention. In T6SSkleb2, Cluster_51573 and Cluster_10480 show high linking frequencies (n = 780), with Cluster_10480 containing DUF3289 and Cluster_51573 containing DUF943 which was predicted as T6SE and non-T6SE by Bastion6 [42] respectively among the identified protein families (Fig 4B and S2 Table). Coincidentally, a recent study that used Alphafold-Multimer-based regression model have predicted the same DUF3289-containing protein as a T6SS effector. This study demonstrated that bacterial cells expressing the effector candidate T6EC1 (DUF3289 domain containing protein) in their peripheral plasma changed to a rounded morphology, indicative of cytotoxicity. In contrast, cells co-expressing T6EC1 and its cognate immunity protein T6IC1 (containing a DUF943 domain) maintained their characteristic rod shape [44]. This overlapping discovery further strengthens the reliability of our predictions.
In addition to the putative orphan effector proteins, several gene clusters consisting of genes located adjacent to one another within the same genome have also been identified (Figs 4B, 4C, and S5 and S4 Table). Not surprisingly, core-components of the T6SS locus were unequivocally identified in both T6SSkleb2 and T6SSkleb1(Fig 4B and 4C). Furthermore, we discovered a widely enriched gene cluster that facilitates bacterial adhesion to host cell surfaces in both T6SSkleb1 and T6SSkleb2 among the protein families enriched in T6SS+ genomes. This system, consisting of PapC_C, PapC_N, and Usher-containing proteins, constitutes the bacterial “Usher-Chaperone” system, which plays a critical role in mediating adhesion to host cell surfaces, particularly during the early stages of urinary tract infections [45]. Synergistically, Fimbrial containing proteins contribute to bacterial virulence by mediating surface adhesion, promoting biofilm formation, and enabling self-aggregation-critical processes that are critical for establishing interactions with host environments [46].
Additionally, among the protein families that are associated with T6SSkleb1 are several gene clusters associated with basal metabolism and immune defense (Fig 4C). For example, proteins with PTS_EIIC, PTS_IIB, and PTS_IIA participate in sugar metabolism and transport [47], while those containing Aminotran_1_2 and Cys_Met_Meta_PP domains are involved in the metabolism of amino acids and sulfur-containing compounds [48], and proteins containing HTH_1 and LysR_substrate domains regulate the gene transcription. Lastly, proteins containing CRISPR_Cse1, CRISPR_Cse2, and related domains form a gene locus significantly associated with T6SS, suggesting a coordinated role in bacterial defense and environmental adaptation [49,50]. In fact, recent studies have revealed potential interactions between the T6SS and CRISPR-Cas systems at both regulatory and genomic levels. In Aliivibrio wodanis, quorum sensing (QS) regulators, particularly LitR, coordinately control the expression of both T6SS and a type I-F CRISPR-Cas system, with T6SS1/2 and CRISPR genes being upregulated at high cell density and low temperature [51]. Meanwhile, in Vibrio cholerae, a complete CRISPR-Cas system and a T6SS gene cluster are co-localized on the excisable pathogenicity island VPI-6, which contains conserved recombination modules, suggesting a potential evolutionary linkage and cooperative defense strategy [52]. These findings support the view that T6SS and CRISPR-Cas may function in coordination to respond to environmental challenges and genetic threats. We have also observed a statistically significant difference in Fumarase-containing proteins in T6SSkleb2 + vs T6SSkleb2-, although its biological correlation to T6SS integrity remains unclear.
Proteins harboring DUF3258, DUF3751, and Sel1 domains were toxic to E. coli
Among the T6SS-associated protein families, we applied a tiered selection strategy based on bioinformatic filtering, experimental feasibility, and scientific relevance to refine our candidate pool. We excluded families previously validated as T6SS-related, those with known annotated functions, and those predicted as “no” by Bastion6 (S2 Table). To ensure experimental tractability, we prioritized candidates encoded in our clinical K. pneumoniae isolates, which facilitated downstream assays. This filtering yielded a smaller set of high-confidence candidates with distinct features warranting further investigation. The orthologous group Cluster_28032 includes DUF3258 (pfam11646) and the phage_integrase domain, which is essential for the integration and excision of bacteriophage DNA into and from a host genome during bacterial infection. Cluster_4724 contains DUF3751 (pfam12571) and the phage_fiber_2 domain, which is responsible for phage attachment to host cells. Sel1, found in Cluster_8841, is associated with two Sel1-like proteins, RS09150 and RS09155 in Salmonella pullorum, which have been shown to promote the expression of T3SS genes and contribute to bacterial infection in chickens [53]. Importantly, two candidates contain DUF3258 or DUF3751 domains, which are entirely novel and uncharacterized, while the Sel1-like domain, previously linked to T3SS, is identified here for the first time as a potential T6SS effector. Taken together, these three groups demonstrate significant in silico association with T6SS and potential cytotoxic activity, prompting subsequent experimental investigation to explore their functional roles.
Consequently, we cloned the genes KP117_01723, KP186_04383, KP122_03665 whose protein containing the DUF3258, DUF3751, and Sel1 domain respectively into an inducible expression vector and transformed this construct into the E. coli strain DH5α. Following induction with IPTG, we observed a marked reduction in the growth of the E. coli strains expressing these proteins compared to the control carrying the empty vector, with the extent of growth inhibition varying among the different proteins (Fig 5A). Western blot analysis using a mouse anti-His monoclonal antibody confirmed these proteins were successfully expressed in E. coli upon IPTG induction. No protein was detected in the empty vector control (ptac), confirming specificity of expression. (Fig 5B). These findings provide evidence that the proteins identified via our bioinformatics approach are likely candidates for T6SS substrates.
Fig 5. Expression of proteins harboring DUF3258, DUF3751, and Sel1 domain in E. coli and their homologs distribution across taxa from different domains of life.
(A) Drop assay showing a marked reduction in the growth of the E. coli DH5α expressing ptac empty vector (control), KP117_01723, KP186_04383, and KP122_03665. Exponential-phase cultures of E. coli expressing different proteins were adjusted for OD600 and serial diluted onto LB media supplemented with and without 0.2mM IPTG. The cells were prepared with serial 5-fold dilutions from left to right. (B) Western blot assay using an anti-His monoclonal antibody (IB: His) demonstrating the expression of KP117_01723, KP186_04383, KP122_03665 in E. coli DH5α with 0.2mM IPTG. Control with the ptac promoter only, with and without IPTG induction. (C) Unrooted phylogenetic tree was constructed by IQ-TREE with 10,000 bootstrap replicates of the protein containing DUF3258 domain at the genus level, in which the highlighted colors presented represent the three most distributed genera. Genera that accounted for less than 5 were combined into one group “Others”. (D) Unrooted phylogenetic tree was constructed by IQ-TREE with 10,000 bootstrap replicates of the proteins containing DUF3751 domain at the genus level. Escherichia spp. accounts for 90% of all genera, are predominantly represented. (E) Maximum-likelihood tree with 10,000 bootstrap replicates was constructed for protein containing Sel1 domain at the family level, with tip labels indicating strains after three iterations of PSI-blast (with an expect value threshold of 5 × 10−3 in each iteration).
To further clarify the distribution of these proteins containing above structural domains in nature, multiple sequence alignment of their homologs, identified in PSI–BLAST. The phylogenetic analysis at the genus level of KP117_01723 and KP186_04383 indicates that they were predominantly distributed in prokaryotes (Fig 5C and 5D). Specifically, KP117_01723 is mainly found in the Enterobacter (181/500) and Klebsiella (113/500), while KP186_04383 is primarily associated with the genus Escherichia (457/500). All genomes encoding KP117_01723 possessed complete T6SS locus, and 50% of these genomes contain at least one tssD gene. In contrast, KP186_04383-encoding genomes is exclusively harbored T6SSkleb1 (tssD gene was missing) and 94.3% of the genomes at least harbored one vgrG gene, together with tssD, which are hallmark T6SS-secreted tail tube components. Analysis of the phylogenetic distribution of KP122_03665 revealed a more diverse family (Fig 5E). The bootstrap values for the major branches containing these three proteins are all above 80, supporting the robustness of the inferred phylogenetic relationships. Genomes containing KP122_03665 at the family level were prevalent in Vibrionaceae and Enterobacteriaceae. Remarkably, 99% genomes encoding KP122_03665 also harbored T6SS locus (containing at least 8 out of 13 T6SS core components that were shown to specifically predict T6SS), and 52.1% of the genomes harbored at least one tssD gene. Moreover, Eukaryotes like Mytilidae and Gallionellaceae have also been found, which is not surprising that previous study showed the eukaryotic Sel1 proteins are involved in the ER-associated protein degradation [41]. These results suggest KP117_01723, KP186_04383 and KP122_03665 proteins are predominantly associated with T6SSs. Further support for this conclusion was revealed by the experimental results described above.
Discussion
In this study, we conducted a comprehensive analysis of the T6SS including core structural components, effectors and immunity proteins in Klebsiella genus. We investigated the genes families that are significantly enriched in the presence of T6SS, and validate novel putative effectors in Klebsiella spp.. We found that majority of Klebsiella strains possess a complete T6SS apparatus, organized into two distinct loci, designated as T6SSkleb1 and T6SSkleb2. Similar locus architectures have previously been identified in specific strains, such as K. pneumoniae CH1157 [54] and HS11286 [55]. Notably, T6SSkleb1 differs from T6SSkleb2 in that it does not encode essential T6SS core components such as TssB, TssC, TssD, and TssH. The low identity between orthologous proteins at these loci suggests that they were not derived by a duplication event.
The two T6SS loci, T6SSkleb1 and T6SSkleb2, differ not only in their genetic architecture and core components but in their phylogenetic distribution across Klebsiella species. As shown in Fig 1C, these loci are differentially enriched within two major lineages: the KpSC and the K. oxytoca species complex (KoSC). Both loci are broadly conserved in KpSC—particularly in K. pneumoniae and K. africana—which are commonly isolated from the gut, respiratory tract, and bloodstream, and frequently encountered in both clinical and environmental contexts. The high prevalence of T6SSs in these species suggests a reliance on T6SS-mediated antagonism to compete effectively in densely populated and competitive niches, consistent with their pathogenic potential and ecological adaptability. In contrast, KoSC species—including K. oxytoca, K. pasteurii, Klebsiella grimontii, and Klebsiella michiganensis—typically retain only T6SSkleb2 or lack both loci altogether. These bacteria often occupy more stable or specialized environments and tend to function as commensals, facing reduced interbacterial competition or relying on alternative survival strategies. Collectively, the divergent distribution of T6SSkleb1 and T6SSkleb2 underscores that T6SS-mediated antagonism is not universally essential across the genus but is shaped by ecological pressures, microbial community complexity, and host associations. In highly competitive environments, the conservation of both loci suggests that they may play complementary and essential roles in interbacterial interactions. Nevertheless, T6SSkleb2 may also contribute to community stabilization or interactions beyond direct antagonism.
In particular, the function of T6SSkleb2 has been identified as essential for long-term intestinal colonization, suggesting that this system mediates antagonistic bacterial interactions that can compromise microbiota stability and resilience [54]. Actually, microbial competition mediated by the T6SS of K. pneumoniae is more complex, Daniel Storey et al identified strain K. pneumoniae Kp52145 possess three T6SS loci: locus I (T6SSkleb2 in our study) contains vgrG1 gene, which plays an essential role in Klebsiella-mediated killing. In contract, two additional incomplete gene clusters exhibit T6SS-dependent killing activity that varies according to environmental cues [10]. Despite these findings, research on the T6SS in Klebsiella, and more specifically on T6SSkleb1, remains rather limited. T6SSkleb1 typically encodes key T6SS-related proteins, it often lacks certain core genes, suggesting a role in specialized functions. Our research reveals the presence of numerous orphan T6SS-related genes outside the major loci in Klebsiella genomes. This observation further indicates that the functionality of T6SSkleb1 may be interdependent with T6SSkleb2 or other loci. Moreover, the presence of multiple vgrG and tssD paralogs in many genomes suggests redundancy or specialization in effector delivery, as seen by other bacteria like Vibrio fischeri [56], or Pseudomonas aeruginosa [57]. Future work needs to combine transcriptomics and functional studies, to test whether the Klebsiella paralogs are differentially expressed, or functionally interchangeable. Additionally, we identified rare T6SS variants that account for less than 2% of total genomic content. These variants likely represent strain-specific adaptations.
Thus far, only two T6SS effectors, Tle1KP and VgrG4, have been characterized in the Klebsiella genus [10,58], emphasizing the significant gap in our understanding of T6SS in this bacterium. This limited knowledge highlights the necessity of comprehensive studies to identify additional T6SS effector and immunity proteins, as well as associated protein families. To address this, we performed a homologous protein search method and positional information to identify 50 total effector proteins and 28 immunity proteins within the genus, which expanded the knowledge of T6SS systems in Klebsiella.
We found that there are seven conserved effectors and four immunity proteins common to all Klebsiella species. Such conserved proteins may be fundamental to bacterial fitness, perhaps as key players in interbacterial competition or resilience to environmental stressors. Yet, the exact protein mechanisms involved in these processes are still to be elucidated. Besides these conserved proteins, we identified numerous effectors and immunity proteins unique to an individual Klebsiella species or strain. For instance, we observed ten effectors that are uniquely found in K. pneumoniae, which might be due to its distinct adaptations to environmental pressure. For example, Hcp-ET3 (EFF01522), as described in a previous study, was also isolated from E. coli strain PE086 and had antibacterial activity on recipient E. coli [27]. This enrichment of unique effectors aligns with K. pneumoniae’s broader ecological adaptability, high virulence, and antibiotic resistance. However, the roles of these proteins in Klebsiella remain speculative and require further investigation.
Furthermore, we discovered complex relationships among T6SS effector and immunity proteins in our analysis. We found examples in which one immunity protein neutralizes several effector proteins and examples in which one effector protein is targeted by several immunity proteins. This plasticity may be due to adaptation to interbacterial competition polymicrobial environments. This phenomenon is aligned with results reported in Serratia proteamaculans, where a T6SS effector, Tre1, targets bacterial essential protein FtsZ through ADP-ribosylation [59], thus blocking cellular division. The immunity protein Tri1 protects against this toxicity via two mechanisms of action, the occlusion of the active site and the enzymatic removal of ADP-ribose modifications. Notably, the former mechanism provides broad protection against non-cognate toxins, highlighting the plasticity and flexibility of effector-immunity networks in interbacterial competition. Similarly, the Bacteroides genus has been reported to possess bacterial acquired interbacterial defense (AID) systems that confer the capacity to neutralize a wide repertoire of T6SS effectors from other species. These systems are composed of arrays of orphan immunity genes that have broad-spectrum mechanisms of action that confer resistance to diverse toxins. For example, Bacteroides fragilis employs recombinase-associated AID (rAID-1) clusters that enable resistance against its own effectors along with microbial competitors. This underscores the role of interbacterial antagonism in shaping diverse and highly adaptable effector-immunity networks [60].
Experimental approaches to identify T6SS-associated proteins include comparative proteomics [11,15,17,18,61] and transposon mutagenesis [19], whereas computational methods include primarily the search for genes located in proximity to T6SS clusters [20–22,62–64] or genes encoding marker domains [22,23]. While these methods have proven effective, they are very limited in their application, as they do not detect orphan effectors or auxiliary proteins that sit outside of T6SS loci or lack conserved domains. Comparative genomics have been previously used to identify a large family of membrane-disrupting T6SS effectors (Tme), demonstrating the effectiveness of this method [24]. In addition to identifying effectors, our study utilizes comparative genomics to reveal a wide range of T6SS-associated protein families, including regulatory, structural, and auxiliary genes. The broadened perspective allows us to gain important insights on the genomic and functional landscape of T6SS, and its role in bacterial competition and adaptation.
By employing our bioinformatics approach, we successfully uncovered and experimentally validated three T6SS-associated proteins, underscoring the robustness of our method at elucidating novel effectors. We specifically validated proteins harboring DUF3258, DUF3751, and Sel1 domains and found them strongly associated with the presence of T6SS. These proteins displayed cytotoxicity in growth inhibition assays in E. coli DH5α, thus confirming their activity as potential T6SS effectors. Phylogenetic analyses of these proteins containing above domains also revealed their predominated distribution in Klebsiella and Escherichia. These results demonstrate the power of our integrative approach in identifying and characterizing both canonical and orphan T6SS effectors and expanding our knowledge of their ecological and functional roles.
Beyond identifying putative orphan effector proteins, our analysis identified T6SS-associated gene clusters sharing links to bacterial physiology and adaptation. This includes the “Usher-Chaperone” system and its fimbrial components essential for biofilm formation and tropicalization at the host interface. We also identified regulatory and metabolic genes, for example, those involved in sugar transport or gene transcription regulation, that likely impacted T6SS expression and activity. The additional presence of defense-related loci with CRISPR-associated domains signify the implication of T6SS in the coordination of immunity and environmental response mechanisms in bacteria. This study demonstrates that T6SS represents an integrated system with both fundamental influences on bacterial homeostasis and ecological interactions, which forms the basis for investigating new therapeutic approaches directed toward its wider functional network.
In summary, our work presents a comprehensive survey of T6SS loci, effector proteins and their associated gene clusters in the genus Klebsiella. T6SS is the key player in bacterial competition, immunity and environmental adaptation and thus our investigation emphasizes the multifaceted and diverse nature of this secretion system. Moreover, the identification of gene clusters related to adhesion, metabolism, and defense systems provides insights into the wider physiological and ecological functions of T6SS. These findings provide a genomic framework for understanding the structural and functional diversity of T6SSs across the Klebsiella genus, and offer a foundation for future studies on their ecological roles and potential contributions to bacterial competitiveness or host interactions.
Materials and methods
Data acquisition and quality assessment
The comprehensive assemblies from Klebsiella genus available in the NCBI Genome database (as of November.1th, 2023) were downloaded. We first performed a rarefaction analysis to evaluate the diversity of T6SS-associated proteins. The rarefaction curve demonstrated that the number of unique T6SS-associated proteins, including effectors, immunity proteins and regulators rapidly approached saturation as the number of genomes increased (S6 Fig). This result indicated that the inclusion of additional genomes would contribute minimal novel T6SS-associated genes. Based on this observation, we implemented a stratified sampling strategy to select a representative subset of genomes for downstream analyses.
For K. pneumoniae, sequence typing (ST) was performed using Multilocus Sequence Typing (MLST) via Kleborate v2.3.2 [65] to classify the assemblies, followed by random selection of 10% of genomes from each sequence type for downstream analyses. For other species within the genus, 10% of the genomes were randomly selected. The quality and reliability of the genome were assessed by CheckM v 1.2.2 [66], acquiring a high-quality genome (>99%) and low contamination (<1%).
Identification of T6SS loci in Klebsiella spp.
All assemblies that passed the quality check were subjected genome annotation with Prokka v 1.14.6 [67]. Experimentally validated T6SEs and T6SIs and structural proteins archived in SecReT6 [25] were used as the databased for orthologous searching against the annotated Klebsiella proteins with Proteinortho v 6.3.0 [26]. T6SS loci were identified based on the presence of the core components (TssA-M) alongside regional-specific proteins documented in the database. To ensure accurate delineation of T6SS gene loci, a stringent criterion was required: a maximum allowance of five non-T6SS-core genes between two core genes, ensuring their co-localization within the same gene locus on the same contig. Occurrences of identical arrangement patterns exceeding 2% of the total number of genomes were classified as a distinct structural category. Informed by prior study [54] and our own data-derived observations, loci with eight or more core components in T6SSkleb1 and eleven or more in T6SSkleb2 are defined as T6SSkleb1+ and T6SSkleb2 + respectively, and loci with less than five core components both loci are T6SS-. Also, those with fewer than five in only one are designated as T6SSkleb1- or T6SSkleb2-. Those not meeting any of these criteria are considered partial. Genomes carrying both T6SSkleb1+ and T6SSkleb2 + loci are designated as T6SSkleb1,2 + . Structural characteristics of loci were visualized using the ggplot2 v3.5.1 (https://ggplot2.tidyverse.org) package in R studio (version 4.3.3). The proportion of genomes in locus was visualized using the matplotlib [68] package in Python.
Genus-wide phylogenetic tree construction and T6SS locus typing
A random high-quality representative genome with completeness>99%, contamination<1% and fewer than 50 contigs was chosen for each species to construct the genus-level phylogeny. The phylogenetic tree was built using core gene alignments generated from PhyloPhlAn v 3.1.68 [69] and inferred with IQ-TREE 2 v 2.2.6 [70] based on VT + F + R3 model with the lowest Bayesian Information Criterion (BIC) score. To enhance the robustness of the results, 10,000 bootstrap replicates were conducted. E. coli assembly ASM584v2 was used as the outgroup. The phylogenetic tree and gene locus distribution were visualized with iTol [71]. Loci from genomes with a complete T6SS were further typed based on tssB genes, which were clustered using CD-HIT v 4.8.1 [72] with 100% similarity and 100% coverage (-c 1 -s 1). The tssB sequences were aligned by MAFFT v 7.520 [73], followed by the removal of poorly aligned regions using trimAl v1.4.rev15.build [74] and retained aligned sequences were constructed to phylogenetic tree by FastTree v2.1 [75], and visualized with iTol [71].
Comparative analysis of T6SS effector and immunity proteins distribution across Klebsiella genomes
High-quality genomics data and experimentally validated effector and immunity proteins from the SecRet6 [25] database were homologated to quantify the types and numbers of T6SS effector and immunity proteins in 11 species and Klebsiella sp. within the genus to compare their inter- and intraspecific differences. Differences in protein variety and genomic counts were further examined between genomes containing complete T6SS gene clusters (T6SSkleb1,2+) and those lacking any complete clusters (T6SS-). The data was first normalized by the total number of genomes in each group (T6SSkleb1,2+ and T6SS-) to account for the genomic counts of homologous effector and immunity proteins. Then, the log2 of the ratio of the normalized counts between two groups was calculated to assess the differences in protein distribution. Statistical significance was subsequently computed based on these normalized values. The heatmap, histogram and statistical analyses were visualized with the pheatmap v1.0.12 (https://CRAN.R-project.org/package=pheatmap) and ggplot2 v3.5.1 (https://ggplot2.tidyverse.org) packages in R studio (version 4.3.3).
Identification and annotation of VgrG-associated effector-immunity pairs
Previous studies have indicated that VgrG-associated effectors, co-effectors, and immunity proteins were typically encoded downstream of the vgrG genes in the adjacent genomic regions. To develop a comprehensive catalog of effectors/immunity proteins, vgrG genes were identified using HMMER 3.4 [76] with a vgrG sequence as the seed in hmmsearch. Up to three genes downstream vgrGs within the same contig in genomes with T6SSkleb1,2+ and T6SS- were selected and subsequently annotated with Pfam-scan against the full PFam A database [77]. Additionally, a homologous search was performed using Proteinortho v 6.3.0 [26] to compare these genes with experimentally validated effectors and immunity proteins documented in the SecRet6 [25] database. The corresponding protein structural domains of all these genes, paired effector and immunity proteins, were visualized using Cytoscape v 3.10.2 [78].
Identification and functional annotation of T6SS-associated protein families
Our results indicate that known T6SS-associated proteins, including effectors and immunity proteins are distributed differently between T6SS+ and T6SS- genomes. To investigate protein families potentially linked to T6SS functionality, we focused on those enriched in T6SS+ genomes. The protein sequences from genomes were grouped based on the presence or absence of T6SS at the T6SSkleb1 and T6SSkleb2 loci, and clustering was performed separately for two group (i.e., T6SSkleb1+ and T6SSkleb1-, T6SSkleb2+ and T6SSkleb2-) using CD-HIT v4.8.1 [72] (-c 0.7, -s 0.7). Homologous protein families across different species were analyzed, and the abundance of each protein family in T6SS+ and T6SS- genomes was normalized to the total number of genomes in each group. Statistical analysis of the normalized data was performed using the Chi-square test and protein families significantly associated with different T6SS loci were identified based on specific cutoff: Log2 P/A ≥ 2 & Adjusted p-value < 0.05. The homologous groups were then subjected to annotation with SecReT6 [25] database, Bastion6 [42] and eggNOG-mapper v2 [43], respectively, in order to identify protein families with uncharacterized T6SS-associated functions. The structural domains of these T6SS-associated protein families are further visualized using Cytoscape v 3.10.2 [78] and protein sequences containing DUF3258 (Protein family cluster 28032), DUF3751 (Protein family cluster 4724) and Sel1 (Protein family cluster 8841) domains were further validated in vitro. Protein sequences containing DUF3258, DUF3751 and Sel1 in clinical strains were identified, three iterations of Position-Specific Iterated (PSI–BLAST) were performed against the reference protein sequences (a maximum of 500 hits with an expect value threshold of 5 × 10−3 were used in each iteration). 500 homologous protein sequences downloaded were aligned by MAFFT v7.520 [73] and gaps were removed from each alignment using trimAl v1.4.rev15.build [74], the maximum-likelihood tree was constructed using IQ-TREE2 v2.2.6 [70] based on the trimmed alignment and visualization in iTol [71].
Collection and analysis of clinical isolates for T6SS-associated protein validation
To validate our predictions of T6SS-associated protein families, we analyzed clinical isolates of K. pneumoniae to identify and confirm the presence of key orthologous genes associated with T6SS functionality. Clinical isolates from patients infected by K. pneumoniae were collected in the respiratory department of the First Hospital of Jilin University. Genomic DNA was extracted from isolates using TIANGEN Bacterial Genomic DNA Extraction Kit (Beijing, China). Genomic DNA sample was fragmented by Covaris LE220R-plus (Covaris, USA) to a size of 350 bp for Illumina sequencing and further PCR amplification. PCR products were purified by AMPure XPsystem (Beckman Coulter, Beverly, USA). Subsequently, the library quality was assessed using the Agilent 5400 system (AATI) and quantified by real-time PCR at a concentration of 1.5 nM. Whole-genome sequencing was performed on Illumina platforms at Novogene Bioinformatics Technology Co., Ltd. (Beijing, China) using the PE150 strategy. To ensure data quality, raw sequencing data was processed including de-adapter, removal of low-quality nucleotides and unrecognizable nucleotide (N) and Fastp v 0.23.1 [79] was used to perform basic statistics on the quality of the raw reads. Specific clinical bacterial isolates KP117, KP186, and KP122 were subsequently selected for experimental validation.
Construction of recombinant plasmids and inducible expression in clinical isolates
The gene of interest, derived from clinical bacterial isolates, was amplified via polymerase chain reaction (PCR). The PCR products were then cleaved with restriction enzymes and ligated into the modified pET28a-tac plasmid, which confers kanamycin resistance. This plasmid is a derivative of the pET28a vector, wherein the T7 promoter has been substituted with the tac promoter to facilitate inducible expression in E. coli DH5α cells. The vector retains the His-tag sequence for subsequent protein purification and detection. The digestion-ligation reaction was performed overnight at 16°C using T4 DNA ligase. Subsequently, the ligation mixture was introduced into E. coli DH5α competent cells via heat shock transformation. Following kanamycin resistance-based selection, individual transformants were cultured in Lysogeny Broth (LB) medium, and the plasmids were isolated, confirmed by restriction digestion, and sequenced for accuracy.
A single colony of E. coli DH5α harboring the recombinant plasmid was selected from a fresh LB agar plate and used to inoculate LB medium supplemented with 30 μg/mL kanamycin. The next day, the cultures were diluted 1:100 into 5 mL of fresh LB medium and incubated with shaking at 220 rpm for 2–3 hours at 37°C until the optical density at 600 nm (OD600) reached 0.6–0.8. One milliliter of each culture was then transferred into three separate 1.5 mL microcentrifuge tubes and centrifuged at 4500 rpm for 2 minutes. The supernatant was discarded, and the bacterial pellets were resuspended in sterile water to adjust the OD600 to 1.0. A series of 5-fold dilutions was prepared, and these dilutions were spotted onto LB agar plates containing 30 μg/mL kanamycin, with and without the addition of 0.2 mM Isopropyl β-D-1-thiogalactopyranoside (IPTG). The plates were incubated at 37°C for 18 hours. Antibacterial activity against E. coli DH5α was evaluated through image analysis. The bacterial strains and plasmids employed in this study are described in detail in S5 Table.
Western blot
Bacterial cultures, both induced and uninduced with IPTG, were harvested and subjected to centrifugation at 12,000 rpm for 2 minutes. The expression of target proteins was analyzed by western blot as described before [80,81]. Briefly, the resulting pellets were resuspended in 50 µl of 1 × SDS-PAGE loading buffer and subsequently boiled in a metal block heater at 100°C for 10 minutes to denature the proteins. The protein samples were then separated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) using a 10% polyacrylamide gel under a constant voltage of 90V. Following electrophoresis, the proteins were transferred onto a polyvinylidene fluoride (PVDF) membrane at a constant current of 200 mA.
The PVDF membrane was then blocked with 5% non-fat dry milk for 1 hour to prevent non-specific binding. Subsequently, the membrane was incubated with a mouse anti-His monoclonal antibody (proteintech, Cat NO.:66005, 1:3000) overnight at 4°C. After thorough washing to remove unbound antibody, the membrane was probed with horseradish peroxidase (HRP)-conjugated goat anti-mouse secondary antibody (ABclonal, Cat.NO: A5003, 1:3000) for 1 hour at room temperature. Protein detection was carried out using an enhanced chemiluminescence (ECL) kit, and the protein bands were visualized using a gel documentation system (Tanon5200) to evaluate the expression levels.
Statistical analysis and data representation
To compare the abundance of T6SS-associated proteins, an F-test was first conducted to assess the homogeneity of variances. When the variances were equal, a Student’s t-test was used; otherwise, Welch’s t-test was applied. For comparisons involving more than two groups, the Mann–Whitney U test was performed, and Bonferroni correction was applied to adjust for multiple comparisons. A p-value of less than 0.05 was considered as significant. Normality was tested with the Kolmogorov-Smirnov test. All analyses were performed using R studio 4.3.3 and Python.
Supporting information
(A) Heatmap displaying the proportions of 28 immunity proteins in the genomes of various Klebsiella species. (B) Concentric rings represent the distribution of immunity proteins (imm) across different species. The innermost ring represents 4 immunity proteins found across all species.
(TIF)
(A) Comparison of unique immunity proteins between genomes with T6SSkleb1,2+ and genomes without any T6SS loci (T6SS-), illustrated by a Venn diagram. (B) Fold-change analysis (log2-transformed) of genomic counts for immunity proteins between T6SSkleb1,2+ and T6SS- genomes, with a fitted trend line. Immunity proteins are labeled numerically. (C) Histogram of log2-transformed fold changes for all identified immunity proteins, normalized by genome counts (T6SSkleb1,2 + vs. T6SS-) among the two genome groups. The dotted line represents the mean fold-change value. (D) Comparison of the mean value of immunity proteins per genome between T6SSkleb1,2+ (n = 2,616) and T6SS- (n = 190). Error bars represent the standard deviation, and statistical significance was assessed using a one-sided Welch’s t-test (****p < 0.0001).
(TIF)
(A) Comparison of the mean number of effector proteins between genomes carrying both T6SSkleb1+ and T6SSkleb2 + loci and those carrying only the T6SSkleb1 + locus. Error bars represent standard deviations. Statistical significance was assessed using a two-tailed Student’s t-test (****p < 0.0001). (B) Comparison of the mean number of effector proteins between genomes carrying both T6SSkleb1+ and T6SSkleb2 + loci and those carrying only the T6SSkleb2 + locus. Error bars represent standard deviations. Statistical significance was assessed using a two-tailed Welch’s t-test (**p < 0.01). (C) Venn diagram showing the number of protein families significantly associated with the T6SSkleb1 and T6SSkleb2 locus, respectively. (D) Venn diagram displaying the total number of predicted and previously characterized effector proteins associated with each locus.
(TIF)
The network, constructed using Cytoscape, includes 608 protein families associated with T6SSkleb1. Node size is proportional to the number of proteins in each family, with colors transitioning from dark to light based on abundance. Edge colors transition from yellow to green to purple based on interaction frequency. Node labels are prioritized by preferred names, followed by eggNOG-mapper annotations and cluster IDs.
(TIF)
Each gene cluster is colored to match the corresponding cluster shown in Fig 4B and 4C. Gray arrows represent additional proteins found in the surrounding genomic regions.
(TIF)
Each curve corresponds to a distinct T6SS protein group: effector proteins (purple), immunity proteins (orange), regulators (green) and merged (gray).
(TIF)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(DOCX)
Acknowledgments
We extend our sincere thanks to the colleagues from Bioinformatics Platform and Infectious Diseases and Pathogen Biology Center at The First Hospital of Jilin University for their valuable discussions and suggestions, which have significantly enhanced the quality of this manuscript.
Data Availability
The genome assembly summary and the phylogenetic tree files used in this study are available at Zenodo (https://doi.org/10.5281/zenodo.17044063).
Funding Statement
This work was supported by the Jilin Provincial Science and Technology Development Project (YDZJ202401276ZYTS to Q.G.), the Bethune Project of Jilin University (2024B20 to L.S.), and the Science and Technology Development Project of Changchun City (23YQ10 to L.S.). No authors received salary support from these funders.The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Podschun R, Ullmann U. Klebsiella spp. as nosocomial pathogens: epidemiology, taxonomy, typing methods, and pathogenicity factors. Clin Microbiol Rev. 1998;11(4):589–603. doi: 10.1128/CMR.11.4.589 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Chen L, Kreiswirth BN. Convergence of carbapenem-resistance and hypervirulence in Klebsiella pneumoniae. Lancet Infect Dis. 2018;18(1):2–3. doi: 10.1016/S1473-3099(17)30517-0 [DOI] [PubMed] [Google Scholar]
- 3.Bialek-Davenet S, Criscuolo A, Ailloud F, Passet V, Jones L, Delannoy-Vieillard A-S, et al. Genomic definition of hypervirulent and multidrug-resistant Klebsiella pneumoniae clonal groups. Emerg Infect Dis. 2014;20(11):1812–20. doi: 10.3201/eid2011.140206 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Marr CM, Russo TA. Hypervirulent Klebsiella pneumoniae: a new public health threat. Expert Rev Anti Infect Ther. 2019;17(2):71–3. doi: 10.1080/14787210.2019.1555470 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wu S-W, Liu K-S, Lin C-K, Hung T-W, Tsai H-C, Chang H-R, et al. Community-acquired urinary tract infection in kidney transplantation: risk factors for bacteremia and recurrent infection. J Formos Med Assoc. 2013;112(3):138–43. doi: 10.1016/j.jfma.2012.01.010 [DOI] [PubMed] [Google Scholar]
- 6.Cianfanelli FR, Monlezun L, Coulthurst SJ. Aim, Load, Fire: The Type VI Secretion System, a Bacterial Nanoweapon. Trends Microbiol. 2016;24(1):51–62. doi: 10.1016/j.tim.2015.10.005 [DOI] [PubMed] [Google Scholar]
- 7.Wang J, Brodmann M, Basler M. Assembly and Subcellular Localization of Bacterial Type VI Secretion Systems. Annu Rev Microbiol. 2019;73:621–38. doi: 10.1146/annurev-micro-020518-115420 [DOI] [PubMed] [Google Scholar]
- 8.Ho BT, Dong TG, Mekalanos JJ. A view to a kill: the bacterial type VI secretion system. Cell Host Microbe. 2014;15(1):9–21. doi: 10.1016/j.chom.2013.11.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hsieh P-F, Lu Y-R, Lin T-L, Lai L-Y, Wang J-T. Klebsiella pneumoniae Type VI Secretion System Contributes to Bacterial Competition, Cell Invasion, Type-1 Fimbriae Expression, and In Vivo Colonization. J Infect Dis. 2019;219(4):637–47. doi: 10.1093/infdis/jiy534 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Storey D, McNally A, Åstrand M, Sa-Pessoa Graca Santos J, Rodriguez-Escudero I, Elmore B, et al. Klebsiella pneumoniae type VI secretion system-mediated microbial competition is PhoPQ controlled and reactive oxygen species dependent. PLoS Pathog. 2020;16(3):e1007969. doi: 10.1371/journal.ppat.1007969 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ray A, Schwartz N, de Souza Santos M, Zhang J, Orth K, Salomon D. Type VI secretion system MIX-effectors carry both antibacterial and anti-eukaryotic activities. EMBO Rep. 2017;18(11):1978–90. doi: 10.15252/embr.201744226 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Russell AB, Peterson SB, Mougous JD. Type VI secretion system effectors: poisons with a purpose. Nat Rev Microbiol. 2014;12(2):137–48. doi: 10.1038/nrmicro3185 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Trunk K, Coulthurst SJ, Quinn J. A New Front in Microbial Warfare-Delivery of Antifungal Effectors by the Type VI Secretion System. J Fungi (Basel). 2019;5(2):50. doi: 10.3390/jof5020050 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Monjarás Feria J, Valvano MA. An Overview of Anti-Eukaryotic T6SS Effectors. Front Cell Infect Microbiol. 2020;10:584751. doi: 10.3389/fcimb.2020.584751 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Russell AB, Singh P, Brittnacher M, Bui NK, Hood RD, Carl MA, et al. A widespread bacterial type VI secretion effector superfamily identified using a heuristic approach. Cell Host Microbe. 2012;11(5):538–49. doi: 10.1016/j.chom.2012.04.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hood RD, Singh P, Hsu F, Güvener T, Carl MA, Trinidad RRS, et al. A type VI secretion system of Pseudomonas aeruginosa targets a toxin to bacteria. Cell Host Microbe. 2010;7(1):25–37. doi: 10.1016/j.chom.2009.12.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Salomon D, Kinch LN, Trudgian DC, Guo X, Klimko JA, Grishin NV, et al. Marker for type VI secretion system effectors. Proc Natl Acad Sci U S A. 2014;111(25):9271–6. doi: 10.1073/pnas.1406110111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Altindis E, Dong T, Catalano C, Mekalanos J. Secretome analysis of Vibrio cholerae type VI secretion system reveals a new effector-immunity pair. mBio. 2015;6(2):e00075. doi: 10.1128/mBio.00075-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Dong TG, Ho BT, Yoder-Himes DR, Mekalanos JJ. Identification of T6SS-dependent effector and immunity proteins by Tn-seq in Vibrio cholerae. Proc Natl Acad Sci U S A. 2013;110(7):2623–8. doi: 10.1073/pnas.1222783110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hachani A, Allsopp LP, Oduko Y, Filloux A. The VgrG proteins are “à la carte” delivery systems for bacterial type VI effectors. J Biol Chem. 2014;289(25):17872–84. doi: 10.1074/jbc.M114.563429 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Unterweger D, Miyata ST, Bachmann V, Brooks TM, Mullins T, Kostiuk B, et al. The Vibrio cholerae type VI secretion system employs diverse effector modules for intraspecific competition. Nat Commun. 2014;5:3549. doi: 10.1038/ncomms4549 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Jana B, Fridman CM, Bosis E, Salomon D. A modular effector with a DNase domain and a marker for T6SS substrates. Nat Commun. 2019;10(1):3595. doi: 10.1038/s41467-019-11546-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dar Y, Salomon D, Bosis E. The Antibacterial and Anti-Eukaryotic Type VI Secretion System MIX-Effector Repertoire in Vibrionaceae. Mar Drugs. 2018;16(11):433. doi: 10.3390/md16110433 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Fridman CM, Keppel K, Gerlic M, Bosis E, Salomon D. A comparative genomics methodology reveals a widespread family of membrane-disrupting T6SS effectors. Nat Commun. 2020;11(1):1085. doi: 10.1038/s41467-020-14951-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Zhang J, Guan J, Wang M, Li G, Djordjevic M, Tai C, et al. SecReT6 update: a comprehensive resource of bacterial Type VI Secretion Systems. Sci China Life Sci. 2023;66(3):626–34. doi: 10.1007/s11427-022-2172-x [DOI] [PubMed] [Google Scholar]
- 26.Lechner M, Findeiss S, Steiner L, Marz M, Stadler PF, Prohaska SJ. Proteinortho: detection of (co-)orthologs in large-scale analysis. BMC Bioinformatics. 2011;12:124. doi: 10.1186/1471-2105-12-124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ma J, Pan Z, Huang J, Sun M, Lu C, Yao H. The Hcp proteins fused with diverse extended-toxin domains represent a novel pattern of antibacterial effectors in type VI secretion systems. Virulence. 2017;8(7):1189–202. doi: 10.1080/21505594.2017.1279374 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.De Maayer P, Venter SN, Kamber T, Duffy B, Coutinho TA, Smits THM. Comparative genomics of the Type VI secretion systems of Pantoea and Erwinia species reveals the presence of putative effector islands that may be translocated by the VgrG and Hcp proteins. BMC Genomics. 2011;12:576. doi: 10.1186/1471-2164-12-576 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Huang Y, Du P, Zhao M, Liu W, Du Y, Diao B, et al. Functional Characterization and Conditional Regulation of the Type VI Secretion System in Vibrio fluvialis. Front Microbiol. 2017;8:528. doi: 10.3389/fmicb.2017.00528 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Nguyen VS, Spinelli S, Cascales É, Roussel A, Cambillau C, Leone P. Anchoring the T6SS to the cell wall: Crystal structure of the peptidoglycan binding domain of the TagL accessory protein. PLoS One. 2021;16(7):e0254232. doi: 10.1371/journal.pone.0254232 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Barret M, Egan F, Fargier E, Morrissey JP, O’Gara F. Genomic analysis of the type VI secretion systems in Pseudomonas spp.: novel clusters and putative effectors uncovered. Microbiology (Reading). 2011;157(Pt 6):1726–39. doi: 10.1099/mic.0.048645-0 [DOI] [PubMed] [Google Scholar]
- 32.Barret M, Egan F, O’Gara F. Distribution and diversity of bacterial secretion systems across metagenomic datasets. Environ Microbiol Rep. 2013;5(1):117–26. doi: 10.1111/j.1758-2229.2012.00394.x [DOI] [PubMed] [Google Scholar]
- 33.Russell AB, Wexler AG, Harding BN, Whitney JC, Bohn AJ, Goo YA, et al. A type VI secretion-related pathway in Bacteroidetes mediates interbacterial antagonism. Cell Host Microbe. 2014;16(2):227–36. doi: 10.1016/j.chom.2014.07.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Liang X, Pei T-T, Li H, Zheng H-Y, Luo H, Cui Y, et al. VgrG-dependent effectors and chaperones modulate the assembly of the type VI secretion system. PLoS Pathog. 2021;17(12):e1010116. doi: 10.1371/journal.ppat.1010116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Wang C, Chen M, Shao Y, Jiang M, Li Q, Chen L, et al. Genome wide analysis revealed conserved domains involved in the effector discrimination of bacterial type VI secretion system. Commun Biol. 2023;6(1):1195. doi: 10.1038/s42003-023-05580-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Garin T, Brin C, Préveaux A, Brault A, Briand M, Simonin M, et al. The type VI secretion system of Stenotrophomonas rhizophila CFBP13503 limits the transmission of Xanthomonas campestris pv. campestris 8004 from radish seeds to seedlings. Mol Plant Pathol. 2024;25(1):e13412. doi: 10.1111/mpp.13412 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Crisan CV, Chande AT, Williams K, Raghuram V, Rishishwar L, Steinbach G, et al. Analysis of Vibrio cholerae genomes identifies new type VI secretion system gene clusters. Genome Biol. 2019;20(1):163. doi: 10.1186/s13059-019-1765-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Perault AI, Chandler CE, Rasko DA, Ernst RK, Wolfgang MC, Cotter PA. Host Adaptation Predisposes Pseudomonas aeruginosa to Type VI Secretion System-Mediated Predation by the Burkholderia cepacia Complex. Cell Host Microbe. 2020;28(4):534-547.e3. doi: 10.1016/j.chom.2020.06.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Records AR. The type VI secretion system: a multipurpose delivery system with a phage-like machinery. Mol Plant Microbe Interact. 2011;24(7):751–7. doi: 10.1094/MPMI-11-10-0262 [DOI] [PubMed] [Google Scholar]
- 40.Liang X, Pei T-T, Wang Z-H, Xiong W, Wu L-L, Xu P, et al. Characterization of Lysozyme-Like Effector TseP Reveals the Dependence of Type VI Secretion System (T6SS) Secretion on Effectors in Aeromonas dhakensis Strain SSU. Appl Environ Microbiol. 2021;87(12):e0043521. doi: 10.1128/AEM.00435-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Mittl PRE, Schneider-Brachert W. Sel1-like repeat proteins in signal transduction. Cell Signal. 2007;19(1):20–31. doi: 10.1016/j.cellsig.2006.05.034 [DOI] [PubMed] [Google Scholar]
- 42.Wang J, Yang B, Leier A, Marquez-Lago TT, Hayashida M, Rocker A, et al. Bastion6: a bioinformatics approach for accurate prediction of type VI secreted effectors. Bioinformatics. 2018;34(15):2546–55. doi: 10.1093/bioinformatics/bty155 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol. 2021;38(12):5825–9. doi: 10.1093/molbev/msab293 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Geller AM, Shalom M, Zlotkin D, Blum N, Levy A. Identification of type VI secretion system effector-immunity pairs using structural bioinformatics. Mol Syst Biol. 2024;20(6):702–18. doi: 10.1038/s44320-024-00035-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ford B, Rêgo AT, Ragan TJ, Pinkner J, Dodson K, Driscoll PC, et al. Structural homology between the C-terminal domain of the PapC usher and its plug. J Bacteriol. 2010;192(7):1824–31. doi: 10.1128/JB.01677-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Abraham SN, Jonsson AB, Normark S. Fimbriae-mediated host-pathogen cross-talk. Curr Opin Microbiol. 1998;1(1):75–81. doi: 10.1016/s1369-5274(98)80145-8 [DOI] [PubMed] [Google Scholar]
- 47.Postma PW, Lengeler JW, Jacobson GR. Phosphoenolpyruvate:carbohydrate phosphotransferase systems of bacteria. Microbiol Rev. 1993;57(3):543–94. doi: 10.1128/mr.57.3.543-594.1993 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Sung MH, Tanizawa K, Tanaka H, Kuramitsu S, Kagamiyama H, Hirotsu K, et al. Thermostable aspartate aminotransferase from a thermophilic Bacillus species. Gene cloning, sequence determination, and preliminary x-ray characterization. J Biol Chem. 1991;266(4):2567–72. doi: 10.1016/s0021-9258(18)52282-1 [DOI] [PubMed] [Google Scholar]
- 49.Makarova KS, Haft DH, Barrangou R, Brouns SJJ, Charpentier E, Horvath P, et al. Evolution and classification of the CRISPR-Cas systems. Nat Rev Microbiol. 2011;9(6):467–77. doi: 10.1038/nrmicro2577 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Garside EL, Schellenberg MJ, Gesner EM, Bonanno JB, Sauder JM, Burley SK, et al. Cas5d processes pre-crRNA and is a member of a larger family of CRISPR RNA endonucleases. RNA. 2012;18(11):2020–8. doi: 10.1261/rna.033100.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Maharajan AD, Hjerde E, Hansen H, Willassen NP. Quorum Sensing Controls the CRISPR and Type VI Secretion Systems in Aliivibrio wodanis 06/09/139. Front Vet Sci. 2022;9:799414. doi: 10.3389/fvets.2022.799414 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Carpenter MR, Kalburge SS, Borowski JD, Peters MC, Colwell RR, Boyd EF. CRISPR-Cas and Contact-Dependent Secretion Systems Present on Excisable Pathogenicity Islands with Conserved Recombination Modules. J Bacteriol. 2017;199(10):e00842-16. doi: 10.1128/JB.00842-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Zhang Y, Xian H, Jiang X, Yuan Y, Ji R, Jiao X, et al. Identification of two Sel1-like proteins in SPI-19 of Salmonella enterica serovar Pullorum that can mediate bacterial infection through T3SS. Microbiol Res. 2022;262:127085. doi: 10.1016/j.micres.2022.127085 [DOI] [PubMed] [Google Scholar]
- 54.Merciecca T, Bornes S, Nakusi L, Theil S, Rendueles O, Forestier C, et al. Role of Klebsiella pneumoniae Type VI secretion system (T6SS) in long-term gastrointestinal colonization. Sci Rep. 2022;12(1):16968. doi: 10.1038/s41598-022-21396-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Barbosa VAA, Lery LMS. Insights into Klebsiella pneumoniae type VI secretion system transcriptional regulation. BMC Genomics. 2019;20(1):506. doi: 10.1186/s12864-019-5885-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Guckes KR, Cecere AG, Wasilko NP, Williams AL, Bultman KM, Mandel MJ, et al. Incompatibility of Vibrio fischeri Strains during Symbiosis Establishment Depends on Two Functionally Redundant hcp Genes. J Bacteriol. 2019;201(19):e00221-19. doi: 10.1128/JB.00221-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Hachani A, Lossi NS, Hamilton A, Jones C, Bleves S, Albesa-Jové D, et al. Type VI secretion system in Pseudomonas aeruginosa: secretion and multimerization of VgrG proteins. J Biol Chem. 2011;286(14):12317–27. doi: 10.1074/jbc.M110.193045 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Liu L, Ye M, Li X, Li J, Deng Z, Yao Y-F, et al. Identification and Characterization of an Antibacterial Type VI Secretion System in the Carbapenem-Resistant Strain Klebsiella pneumoniae HS11286. Front Cell Infect Microbiol. 2017;7:442. doi: 10.3389/fcimb.2017.00442 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Ting S-Y, Bosch DE, Mangiameli SM, Radey MC, Huang S, Park Y-J, et al. Bifunctional Immunity Proteins Protect Bacteria against FtsZ-Targeting ADP-Ribosylating Toxins. Cell. 2018;175(5):1380-1392.e14. doi: 10.1016/j.cell.2018.09.037 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ross BD, Verster AJ, Radey MC, Schmidtke DT, Pope CE, Hoffman LR, et al. Human gut bacteria contain acquired interbacterial defence systems. Nature. 2019;575(7781):224–8. doi: 10.1038/s41586-019-1708-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Salomon D, Klimko JA, Trudgian DC, Kinch LN, Grishin NV, Mirzaei H, et al. Type VI Secretion System Toxins Horizontally Shared between Marine Bacteria. PLoS Pathog. 2015;11(8):e1005128. doi: 10.1371/journal.ppat.1005128 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Ma L-S, Hachani A, Lin J-S, Filloux A, Lai E-M. Agrobacterium tumefaciens deploys a superfamily of type VI secretion DNase effectors as weapons for interbacterial competition in planta. Cell Host Microbe. 2014;16(1):94–104. doi: 10.1016/j.chom.2014.06.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Chatzidaki-Livanis M, Geva-Zatorsky N, Comstock LE. Bacteroides fragilis type VI secretion systems use novel effector and immunity proteins to antagonize human gut Bacteroidales species. Proc Natl Acad Sci U S A. 2016;113(13):3627–32. doi: 10.1073/pnas.1522510113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Berni B, Soscia C, Djermoun S, Ize B, Bleves S. A Type VI Secretion System Trans-Kingdom Effector Is Required for the Delivery of a Novel Antibacterial Toxin in Pseudomonas aeruginosa. Front Microbiol. 2019;10:1218. doi: 10.3389/fmicb.2019.01218 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Lam MMC, Wick RR, Watts SC, Cerdeira LT, Wyres KL, Holt KE. A genomic surveillance framework and genotyping tool for Klebsiella pneumoniae and its related species complex. Nat Commun. 2021;12(1):4188. doi: 10.1038/s41467-021-24448-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25(7):1043–55. doi: 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9. doi: 10.1093/bioinformatics/btu153 [DOI] [PubMed] [Google Scholar]
- 68.Okura Y. [Fundamentals] 4. Visualization in Python Programming: How to Use Matplotlib and seaborn. Nihon Hoshasen Gijutsu Gakkai Zasshi. 2023;79(7):723–31. doi: 10.6009/jjrt.2023-2228 [DOI] [PubMed] [Google Scholar]
- 69.Asnicar F, Thomas AM, Beghini F, Mengoni C, Manara S, Manghi P, et al. Precise phylogenetic analysis of microbial isolates and genomes from metagenomes using PhyloPhlAn 3.0. Nat Commun. 2020;11(1):2500. doi: 10.1038/s41467-020-16366-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol. 2020;37(5):1530–4. doi: 10.1093/molbev/msaa015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Letunic I, Bork P. Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics. 2007;23(1):127–8. doi: 10.1093/bioinformatics/btl529 [DOI] [PubMed] [Google Scholar]
- 72.Li W, Fu L, Niu B, Wu S, Wooley J. Ultrafast clustering algorithms for metagenomic sequence analysis. Brief Bioinform. 2012;13(6):656–68. doi: 10.1093/bib/bbs035 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80. doi: 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25(15):1972–3. doi: 10.1093/bioinformatics/btp348 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Price MN, Dehal PS, Arkin AP. FastTree 2--approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5(3):e9490. doi: 10.1371/journal.pone.0009490 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Finn RD, Clements J, Eddy SR. HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 2011;39(Web Server issue):W29-37. doi: 10.1093/nar/gkr367 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, et al. Pfam: the protein families database. Nucleic Acids Res. 2014;42(Database issue):D222-30. doi: 10.1093/nar/gkt1223 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504. doi: 10.1101/gr.1239303 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34(17):i884–90. doi: 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Song L, Xie Y, Li C, Wang L, He C, Zhang Y, et al. The Legionella Effector SdjA Is a Bifunctional Enzyme That Distinctly Regulates Phosphoribosyl Ubiquitination. mBio. 2021;12(5):e0231621. doi: 10.1128/mBio.02316-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Song L, Luo J, Wang H, Huang D, Tan Y, Liu Y, et al. Legionella pneumophila regulates host cell motility by targeting Phldb2 with a 14-3-3ζ-dependent protease effector. Elife. 2022;11:e73220. doi: 10.7554/eLife.73220 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
(A) Heatmap displaying the proportions of 28 immunity proteins in the genomes of various Klebsiella species. (B) Concentric rings represent the distribution of immunity proteins (imm) across different species. The innermost ring represents 4 immunity proteins found across all species.
(TIF)
(A) Comparison of unique immunity proteins between genomes with T6SSkleb1,2+ and genomes without any T6SS loci (T6SS-), illustrated by a Venn diagram. (B) Fold-change analysis (log2-transformed) of genomic counts for immunity proteins between T6SSkleb1,2+ and T6SS- genomes, with a fitted trend line. Immunity proteins are labeled numerically. (C) Histogram of log2-transformed fold changes for all identified immunity proteins, normalized by genome counts (T6SSkleb1,2 + vs. T6SS-) among the two genome groups. The dotted line represents the mean fold-change value. (D) Comparison of the mean value of immunity proteins per genome between T6SSkleb1,2+ (n = 2,616) and T6SS- (n = 190). Error bars represent the standard deviation, and statistical significance was assessed using a one-sided Welch’s t-test (****p < 0.0001).
(TIF)
(A) Comparison of the mean number of effector proteins between genomes carrying both T6SSkleb1+ and T6SSkleb2 + loci and those carrying only the T6SSkleb1 + locus. Error bars represent standard deviations. Statistical significance was assessed using a two-tailed Student’s t-test (****p < 0.0001). (B) Comparison of the mean number of effector proteins between genomes carrying both T6SSkleb1+ and T6SSkleb2 + loci and those carrying only the T6SSkleb2 + locus. Error bars represent standard deviations. Statistical significance was assessed using a two-tailed Welch’s t-test (**p < 0.01). (C) Venn diagram showing the number of protein families significantly associated with the T6SSkleb1 and T6SSkleb2 locus, respectively. (D) Venn diagram displaying the total number of predicted and previously characterized effector proteins associated with each locus.
(TIF)
The network, constructed using Cytoscape, includes 608 protein families associated with T6SSkleb1. Node size is proportional to the number of proteins in each family, with colors transitioning from dark to light based on abundance. Edge colors transition from yellow to green to purple based on interaction frequency. Node labels are prioritized by preferred names, followed by eggNOG-mapper annotations and cluster IDs.
(TIF)
Each gene cluster is colored to match the corresponding cluster shown in Fig 4B and 4C. Gray arrows represent additional proteins found in the surrounding genomic regions.
(TIF)
Each curve corresponds to a distinct T6SS protein group: effector proteins (purple), immunity proteins (orange), regulators (green) and merged (gray).
(TIF)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(DOCX)
Data Availability Statement
The genome assembly summary and the phylogenetic tree files used in this study are available at Zenodo (https://doi.org/10.5281/zenodo.17044063).





