Skip to main content
Nature Communications logoLink to Nature Communications
. 2024 May 29;15:4555. doi: 10.1038/s41467-024-48352-8

The evolution of antibiotic resistance islands occurs within the framework of plasmid lineages

Yiqing Wang 1, Tal Dagan 1,
PMCID: PMC11137137  PMID: 38811529

Abstract

Bacterial pathogens carrying multidrug resistance (MDR) plasmids are a major threat to human health. The acquisition of antibiotic resistance genes (ARGs) in plasmids is often facilitated by mobile genetic elements that copy or translocate ARGs between DNA molecules. The agglomeration of mobile elements in plasmids generates resistance islands comprising multiple ARGs. However, whether the emergence of resistance islands is restricted to specific MDR plasmid lineages remains understudied. Here we show that the agglomeration of ARGs in resistance islands is biased towards specific large plasmid lineages. Analyzing 6784 plasmids in 2441 Escherichia, Salmonella, and Klebsiella isolates, we quantify that 84% of the ARGs in MDR plasmids are found in resistance islands. We furthermore observe rapid evolution of ARG combinations in resistance islands. Most regions identified as resistance islands are shared among closely related plasmids but rarely among distantly related plasmids. Our results suggest the presence of barriers for the dissemination of ARGs between plasmid lineages, which are related to plasmid genetic properties, host range and the plasmid evolutionary history. The agglomeration of ARGs in plasmids is attributed to the workings of mobile genetic elements that operate within the framework of existing plasmid lineages.

Subject terms: Molecular evolution, Phylogenetics, Bacterial genes


In their study, Wang and Dagan show that 84% of antibiotic resistance genes in multidrug resistance plasmids in Escherichia, Salmonella, and Klebsiella reside within resistance islands that evolve within the framework of plasmid lineages.

Introduction

Plasmids are autonomously replicating genetic elements that reside in prokaryotic organisms. Their ability to persist in the host population often depends on auxiliary functions encoded in their genome that are beneficial for their host, for example, resistance to antibiotics. The ability of plasmids to transfer horizontally between bacterial cells renders them a common driver of antibiotic resistance gene transfer1. Indeed, the emergence of antibiotic resistant bacteria in clinical environments is often linked to the acquisition of plasmids encoding antibiotic resistance genes (ARGs; e.g. refs. 24). Plasmids that encode multiple ARGs, termed multiple drug resistance (MDR) plasmids, provide their host with a combination of resistances to multiple antibiotics5. Pathogens carrying MDR plasmids are one of the major threats to human health. For example, an epidemiological study estimated that Escherichia coli strains were responsible for most human mortality attributed to antibiotic resistance in 2019, followed by Klebsiella pneumonia strains6.

The evolution of antibiotic resistance gene content in MDR bacteria is often facilitated by diverse mobile genetic elements (MGEs) that can mediate gene transfer between DNA molecules via homologous recombination, site-specific recombination, or transposition7. These include insertion sequences (ISs) and transposons8, as well as integrons9,10, all of which can be associated with adjacent or cargo genes that encode for antibiotic resistance (note that this MGE definition excludes plasmids). Examples are IS26, which is a frequent component in pseudo-compound transposons11, which mobilize diverse ß-lactamase variants, and the sulfonamide resistance gene sul1 that is associated with class 1 integrons (reviewed in ref. 7). The transposition and recombination of genetic elements in bacterial genomes generate diverse ARG combinations12 that typically cluster in specific genomic loci termed antimicrobial resistance island (REIs)13 or multi-resistance regions (MRRs)7. For example, IS26-mediated translocatable units (TUs) have been shown to insert most efficiently adjacent to an existing IS26 copy to form a pseudo-compound transposon11,14,15. The assembly of resistance islands in plasmid genomes has been studied in detail for specific plasmid types and bacterial taxa (e.g., refs. 1618), nonetheless, general quantifications of MGE contribution to the evolution of antibiotic resistance gene content in MDR plasmids are lacking.

Plasmids encoding MDR genes are typically large and their genome size is positively associated with the number of ARGs in the plasmid genome19. Small plasmids typically carry only a few ARGs (e.g., ColE1-like plasmids20). Notably, plasmid size and plasmid mobility are associated since conjugative plasmids are typically large21. A recent study suggested that ISs associated with ARGs are frequently found in conjugative plasmids2224. Disentangling the association of plasmid genome size and plasmid mobility with a high number of ARGs requires us to consider the plasmid evolutionary history. Indeed, recent studies in the field of plasmid biology suggested a classification scheme of plasmids into taxonomic units (PTUs25,26). These are groups of putatively closely related plasmids, that are inferred based on genome sequence similarity and specific plasmid backbone genes27. Similar to organisms in the same species, plasmids in the same taxonomic unit are expected to have similar plasmid properties and a common evolutionary origin. Integrating the plasmid taxonomic unit in MDR plasmid research enables us to go beyond the plasmid most general properties (i.e., size and mobility) while accounting for the evolutionary history of plasmid lineages.

Here we study the evolution of antibiotic resistance gene content in different lineages of MDR plasmids, with a focus on three genera: Klebsiella, Escherichia and Salmonella (KES). Plasmids in KES strains share gene content, suggesting low barriers for plasmid transfer among hosts in the three genera as well as common plasmid origins25. To examine ARGs in resistance islands, we analyze colinear syntenic regions of cooccurring antibiotic resistance genes and mobile genetic elements. Furthermore, we compare the syntenic regions identified as resistance islands among plasmids in different plasmid taxonomic units.

Results

The majority of resistance genes in KES plasmids are clustered in compact resistance islands

For the purpose of our study, we clustered 11,995,860 protein-coding genes in chromosomes and plasmids of 2441 KES isolates into 32,623 clusters of homologous gene families (Supplementary Data 1). Using the comprehensive antibiotic resistance database (CARD)28, we identified 114,457 homologs of 397 ARGs in 138 gene families (Fig. 1A). Members of those 138 families not identified by CARD as ARGs were excluded from further analysis. Most of the excluded genes (97%) correspond to chromosomal gene variants not classified as ARGs in CARD. Note that ARGs in our data are clustered into homologs by global protein sequence similarity, hence specific epidemiologically relevant gene variants may be clustered into the same gene family (see Supplementary Data 2). The majority (2013; 78%) of the 2591 ARG-coding plasmids in our set comprise multiple ARGs. Significant cooccurrence of ARG families was tested for all possible paired ARG combinations. About 30% (ca. 260) of the tested ARG pairs were found to significantly cooccur in plasmids, with little variation among the genera (Supplementary Data 3). These ARG pairs are termed here: coARGs. Cooccurring ARGs that are associated with the same MGE are expected to have a similar genomic neighborhood (i.e., similar neighboring genes). As the next step in the analysis workflow, coARGs were retained only if they were found within the same collinear syntenic block (CSB) (Fig. 1A; CSBs are clusters of genes whose order is conserved in multiple genomes29). The majority of coARGs cooccurred in at least one plasmid CSB (e.g., 85% coARG combinations in Escherichia) (Supplementary Data 3). The CSBs that contained coARGs were further filtered based on their comparison to sequences of previously recognized transposable elements3032. Only CSBs that match the known transposable elements were retained, which was the majority of CSBs comprising coARGs (e.g., 99.6% in Escherichia; Supplementary Data 3; Fig. 1A). The length of the retained CSBs had a median of 8 genes, with the majority (65%) of CSB instances comprising ≤10 genes (Supplementary Fig. S1). The CSBs identified here do not necessarily correspond to intact MGEs or complete resistance islands, which may be found in plasmids in partial forms, e.g., due to genome rearrangements and MGE degradation. The majority of the CSBs we identify here are better described as pieces of genetic elements12, specifically here, ‘pieces of resistance islands.’ That being said, some of the CSBs we identified are much longer than MGEs and correspond to conserved MGE genomic neighborhoods (see Supplementary Note SN1). Notably, the CSBs we identified are found in the majority of MDR plasmids (1866; 93%). Taken together, most cooccurring ARGs in KES plasmids are organized in CSBs bearing similarity to transposable elements.

Fig. 1. Antibiotic resistance genes cooccur in plasmids in dense resistance islands.

Fig. 1

A An illustration of our workflow for detecting resistance islands in plasmid genomes. Arrow boxes correspond to genes. (1) Protein-coding genes in bacterial genomes were clustered into gene families. ARGs and SSR genes were identified. (2) Significant cooccurrence tests for ARG pairs were conducted per replicon type in each genus. (3) Collinear syntenic blocks (CSBs) including coARGs were identified across genomes and (4) further compared to previously recognized transposable elements by sequence similarity (see Methods for additional clarifications). (B) Cumulative Distribution Function (CDF) plots show that the distance between ARGs is shorter than the distance between a control group of randomly sampled genes in plasmids (left), but not in chromosomes (right). Intergenic distances were calculated by counting the number of protein-coding genes between two, and up to ten, adjacent ARGs. Source data are provided as a Source Data file.

To examine which genetic elements are involved in the generation of resistance islands, we identified all the gene families encoding either transposases or site-specific recombinases (collectively referred here as SSRs for simplicity) in our dataset. The majority of SSR gene families encode DDE transposases (257 families) and tyrosine recombinases (161 families). The large number of families in these functional classes is most likely related to frequent gene duplications. Since our clustering approach is aimed to classify mainly orthologs (as in ref. 33, but considering multiple replicons), gene families having frequent (non-identical) duplicates on the same replicon are typically split by our clustering procedure into multiple clusters that correspond to different transposase or recombinase variants (e.g., the paralogs IS26 and IS26-v134,35). In addition, we characterized families encoding DEDD transposases (31 families), HUH endonucleases (25 families), serine recombinases (34 families), as well as three putative Cas1 endonucleases families (Supplementary Data 4). Most of the SSR families (107 out of 116) are characterized by a biased distribution towards plasmids compared to chromosomes (Supplementary Data 4). A total of 76 SSR gene families cooccur with 56 ARG families in plasmids within the CSBs (i.e., resistance island pieces). The most frequent SSR gene families include four DDE transposases (IS26, IS26-v1, IS6100, Tn3 transposase), two serine recombinases (Tn3 resolvase), a tyrosine recombinase (class 1 integron integrase), a HUH endonuclease (IS91 transposase) and a DEDD transposase (IS110 transposase). Together, these gene families comprise 66% of the SSR genes in antibiotic resistance islands as identified in plasmids using our analysis workflow.

Several of the transposases and site-specific recombinases are expected to coincide in genomes as components of functional transposable elements. In our data, components of Tn3 elements, including the two gene families of Tn3 resolvase and the family of Tn3 transposases, significantly cooccur in plasmids (P < 0.001, using Fisher test), which serves as an internal validation for our approach. Different SSR genes may also cooccur due to causal interference between different transposable elements. For example, IS4321/IS5075 (IS110 family transposases) are known to target 38 bp terminal inverted repeats of Tn21 subgroup of Tn3 family36. Indeed, in our data IS110 family transposases significantly cooccur with Tn3 family transposase in plasmids (P < 0.001, using Fisher test), hence their distribution in plasmids is not independent. Similarly, families encoding transposases of IS110 and Tn3 cooccur in CSBs (i.e., pieces of resistance islands) significantly more than expected by chance alone (P < 0.001, using Fisher test). We note that while integrons are often found in transposons, our approach cannot infer the exact contribution of nested integrons to resistance islands. In our data, tyrosine recombinases (9 gene families) correspond to 16% of the total SSR genes in the plasmid resistance islands and are present in 1116 (55%) MDR plasmids within resistance islands. Taken together, the resistance islands identified using our workflow comprise diverse combinations of SSRs and antibiotic resistance genes.

To further study the distribution of ARGs in plasmid genomes, we examined the distance between consecutive ARGs by counting the number of protein-coding genes between two, and up to ten ARGs. If the ARGs are randomly distributed in the genome, the distance distribution should fit to a negative binomial distribution. Our results show that the distance between consecutive multiple ARGs deviates significantly from the random expectation (using Goodness-of-fit test, distance distributions were tested separately with α = 0.05). Furthermore, the distance between multiple ARGs was significantly shorter than the distance of the control group of randomly sampled genes (Wilcoxon rank sum test, distance distributions were tested separately, α = 0.05). Notably, the median distance between two adjacent ARGs in plasmids is zero, which means that over half of the ARGs have an ARG as neighbor gene on the same strand. ARGs are thus densely clustered in plasmid genomes. Further evaluation of the ARG loci in plasmids showed that 84% of ARGs in MDR plasmids are found in the CSBs we identified, i.e., in compact resistance islands.

ARG and SSR gene combinations in resistance islands are diverse and change little over time

To gain an insight into the composition of ARGs and SSR genes in resistance islands we constructed a network of gene cooccurrence in resistance islands (i.e., the CSBs identified in our workflow). Nodes in the network correspond to either ARG or the main SSR gene families. The gene nodes are connected by edges if they coincide in at least one resistance island with the edge weight corresponding to the number of plasmids where the cooccurrence was observed. The resulting network has a highly condensed structure, that is, the genes (nodes) are highly connected among each other within the plasmid resistance islands (Fig. 2A). Highly connected hub-genes appear at the center of the network: these correspond to two transposase genes IS26 and IS26-v1 and resistance genes sul1 and qacEΔ1. The central location in the network indicates that these genes occur in combinations with most SSR genes and ARGs in plasmid resistance islands.

Fig. 2. Temporal network of ARG-SSR cooccurrence in plasmid resistance islands.

Fig. 2

A Network of ARG and SSR gene families that coincide in resistance islands. The nodes correspond to gene families and the edges to cooccurrence events. ARG nodes are plotted in a bigger size. B Temporal assembly of nodes in the network according to the strain isolation date. The number of plasmids used in the network analysis is shown in a purple line. C Temporal assembly of edges connecting to ARG nodes in the network (D) Phylogenetic tree of IS26. The tree scale bar is in amino acid substitutions per site. Stacked bars at the tree leaves show the number of each variant in the gene family, where two dominant transposases, IS26 (yellow) and IS26-v1 (blue), had 6266 and 1385 members, respectively. The stacked bars at tree tips are colored according to the predicted plasmid mobility class (see legend in Fig. 3). The histogram shows the isolation dates of plasmids carrying the two dominant transposases. Low frequency variants in the data (colored in gray) have not been verified hence we cannot rule out that they stem from sequencing errors. Temporal accumulation of (E) edges and (F) edge weights in the network (dots represent the weight medians, whiskers represent the weight ranges). Plasmids from 36 isolates sampled before the year 2000 were binned with the year 2000 isolates. Source data are provided as a Source Data file.

To investigate temporal dynamics in the cooccurrence network we labeled each edge with the earliest report of the genes’ cooccurrence in a resistance island. Estimating the age of resistance islands from sequence divergence is challenging due to the high sequence conservation of both ARGs and SSR genes24,37. Here we use the isolation date of the respective strain as a proxy for the time of presence of specific ARG-SSR combinations. Note that in our dataset, time is positively associated with the amount of data used in the analysis (i.e., number of plasmids publicly available). Between the years 2000 and 2010 there is an increase of 24 plasmids per year, on average. The rate of data accumulation increases markedly between 2010-2020 to 145 plasmids per year, on average. Combining the temporal data with the network components shows that the network size increases with time and the increase in the number of plasmids at a homogeneous rate (Fig. 2B). The number of new ARG nodes added to the network per year is increasing slowly and steadily over time. At the same time, we note that about half of the ARG nodes (32; 57%) are already present in the network with the early sampled strains. Between 2011 and 2022, seven ARGs are added to the network (Fig. 2C). These include resistance genes to more recently introduced antibiotics. Examples are tet(X4) that supplies resistance to tigecycline (first reported in 201938), mef(B) (first reported in 200939) and mcr−1.1 that was first reported in 20163, but may have been present in livestock associated E. coli already since the 1980s40. We note that mcr−1.1 in E. coli plasmids was so far reported to be mobilized as a single gene by ISApl141,42 (hence, it is not expected to be found as a coARG). Nonetheless, this ARG is connected in our network with other ARGs due to a CSB that is conserved in four plasmids and includes multiple mobile genetic elements (Supplementary Fig. S2).

The number of SSR nodes has a similar trend, with a slight increase after 2010 (i.e., with the stark accumulation in sequenced plasmids; Fig. 2B), where it surpasses the increase in the number of ARGs. The increase in the number of SSRs may be related to diversification of the SSR gene families, and the increase in plasmid data is useful for revealing SSR gene variants. For example, IS26 and IS26-v134 transposases comprise 6,266 and 1,385 identical protein sequences in our data, respectively. The two transposases are often present in the same replicon. The protein sequence of IS26-v1 transposase has a single G184N replacement, which was reported to increase the cointegration efficiency of IS2635. The two dominant IS26 transposases correspond to 92% of the gene family members and the remaining variants have few substitutions. The high frequency of identical IS26 homologs may correspond to either a very recently emerged variant or a highly conserved variant in the population. To gain further insight into the antiquity of the two main variants, we examined the isolation date of the isolates encoding those variants. The distribution along the isolation timeline as reflected in our data shows that IS26 is first observed in an isolate from 1954 and IS26-v1 in an isolate from 1990. The higher frequency of ARG combinations with IS26 in comparison to combinations with IS26-v1 (Fig. 2A) is in agreement with earlier observations of IS26 in the three genera KES. The positive trend of increasing frequency in time for both IS26 and IS26-v1 suggests that both ISs are highly mobile. Other common SSR gene families in our data (e.g., Tn3, IS91, and IS110) are characterized by a similar pattern of diversification and rapid increase in the frequency of evolved variants (Supplementary Fig. S3).

The accumulation of edges in the network supply hints for temporal aspects in the evolution of gene combinations in resistance islands. The number of edges is initially increasing with time and the number of sampled plasmids (Fig. 2E). From ca. 2014 onwards, the cumulative number of edges plateaus, suggesting that the repertoire of gene combinations in the network is close to saturation, despite a large number of plasmids added to the data. The higher frequency of ARG-SSR edges overall may be attributed to the larger number of possible ARG-SSR combinations in the data. The distribution of edge weight in the network shows that the increase in data from 2010 onwards, contributed mainly to an increase in the median edge weight (Fig. 2F), which corresponds to increasing frequencies of the observed gene cooccurrences in plasmid resistance islands. The distribution of edge weights indicates that the cooccurrence of ARG pairs has a higher abundance in the network compared to ARG-SSR and SSR-SSR pairs (Fig. 2F). Taken together, the network composition suggests that the repertoire of ARG combinations in plasmid resistance islands is diverse but changes only little over time. Based on these results we predict that additional sequencing may reveal additional instances of existing combinations but rarely uncover new cooccurring gene combinations (with the exception of newly emerging resistance genes and SSR gene variants).

Scarce occurrence of resistance islands in small plasmid types

Which plasmid lineages are typical vehicles of multiple drug-resistance genes? Previous large-scale genomic studies in the literature mainly focus on the plasmid size and mobility class (e.g. refs. 19,22,43), thus neglecting the plasmid evolutionary history. Here we make use of a recently suggested classification scheme of plasmids into taxonomic units25 and marker genes for the distinction between KES small and large plasmids (see Supplementary Note SN2). The small plasmid types are commonly smaller than 19Kb, they are either non-mobilizable (but see ref. 44) or mobilizable and mostly correspond to ColE1-like plasmids. ARGs are rarely found in the small plasmid types; of the total 1961 small plasmid genomes, only 7% harbor ARGs, with 72% of the small plasmids encoding a single ARG. Correspondingly, resistance islands are both small and rare in the small plasmid types (Fig. 3).

Fig. 3. A comparison of resistance islands among PTUs.

Fig. 3

Stacked bar plots from left to right show plasmid properties in the corresponding PTU. Size: plasmid size categories (small plasmids here are <19Kb). Host: genus classification of the plasmid host. Mob: predicted plasmid mobility class. REI: plasmids encoding resistance islands (as identified here). The number of plasmids in each PTU is reported right of the stacked bar plots. Boxplots show the number of ARGs and SSR genes within the resistance island pieces (thick center lines represent the medians, box limits represent the upper and lower quantiles, whiskers represent within 1.5x interquartile range, points represent the outliers). PTUs belonging to the small-size Col plasmid group exhibit a limited occurrence of resistance islands. PTUs that include plasmids with a Rop protein are written in blue, while those including plasmids with a RepB protein are written in red. Source data are provided as a Source Data file.

To further investigate the evolution of small resistance plasmids we examined the phylogenetic tree of Rop (plasmid primer RNA-binding protein; RefSeq accession: WP_003978814.1) and compared the presence of cooccurring ARGs and SSR genes among closely related plasmids. The Rop sequence is short and highly conserved, hence our ability to reconstruct a reliable phylogeny using all 903 members of that gene family is limited. Consequently, we split the Rop phylogenetic analysis into two plasmid groups: plasmids not carrying resistance genes that reside mostly in Escherichia and Salmonella (Supplementary Fig. S4), and plasmids residing mostly in Klebsiella (Fig. 4). The Klebsiella Rop phylogeny shows a clear split into two clades that correspond to Col440I (partially belongs to PTU-E71) and ColRNAI (PTU-E4) plasmids. The ColRNAI clade includes large plasmids from 21 isolates that correspond to cointegrates (of plasmid fusions; Supplementary Data 1) and small resistance plasmids in 27 isolates; in 24 of the isolates, the small plasmids encode one aac(6’)-Ib in the same Tn3-based transposon previously reported as Tn1331Δ:IS26 (Fig. 4)45,46. Comparing the plasmid sequences reveals that the plasmid backbones of this sub-clade are almost identical to its sister sub-clade, the only difference between plasmids in the two sub-clades in the presence of Tn1331Δ:IS26 (Supplementary Fig. S5). The branching pattern of the clustered ColRNAI plasmids suggests that a single ancestral transposon insertion event in the ancestor of these plasmids is the most parsimonious scenario for the presence of Tn1331Δ:IS26 in those plasmids. The ARG-carrying ColRNAI plasmids are reported in isolates from human sampled between 1974 and 2019, suggesting that this plasmid has a stable persistence in Klebsiella. The Klebsiella ColRNAI clade further includes three plasmids reported in Escherichia isolates, which likely correspond to plasmid transfer from Klebsiella to Escherichia. The larger plasmids in the ColRNAI clade are putatively mobilizable or conjugative; these plasmids are untypically large for Rop-encoding plasmids (see also Supplementary Data 1) and likely originated from plasmid fusion events. Taken together, resistance islands comprising multiple drug resistance genes are rare in small ColE1-like plasmids with only a few exceptions.

Fig. 4. Phylogenetic tree of 469 Rop sequences in 443 plasmids.

Fig. 4

Rings of the phylogenetic tree show (from inner to the outer ring): the presence of ARGs, host genus, and PTU. The majority of plasmids reside in Klebsiella isolates. The Rop phylogeny constitutes two main Rop clades that correspond to non-mobilizable ColRNAI (PTU-E4) plasmids and mobilizable Col440I (PTU-E71) plasmids. A subclade of ColRNAI plasmids harboring ARGs is marked in purple, in that clade, small plasmids carrying Tn1331Δ:IS26 are marked by a white square within the ARG ring. Note that while the isolates in our data are not identical, the plasmid variants presented here may correspond to the same plasmid origin, hence the frequency of specific plasmid types reflects the number of isolates where they have been observed. Branches with bootstrap support ≤70 are colored in orange. The tree scale bar is in amino acid substitutions per site. Plasmids lacking a PTU assignment could not be classified using COPLA27. Source data are provided in supplementary file.

ARG cooccurrence in resistance islands is restricted to specific large conjugative plasmid taxonomic units

Most of the plasmid ARG homologs (98%) in our data are in large plasmids. Nonetheless, only 38% of the large KES plasmids harbor cooccurring ARGs and SSR genes, hence a large plasmid size is not necessarily associated with the presence of resistance islands. Examples are plasmids classified in PTUs E5, E13, E15, E16, and E21, all of which are putatively non-mobilizable (Fig. 3). For example, only three non-mobilizable E15 plasmids harbor resistance islands that comprise multiple ARGs to ß-lactams and aminoglycosides; the plasmids were reported in Klebsiella isolates from different clinical studies (e.g. NZ_CP045677.147). Indeed, the number of ARGs in MDR plasmids is associated with plasmid size19; however, we note that the proportion of ARGs in such large plasmids is rather low, with a mean of 5.5 ± 4.7% (±SD). Most of the ARG cooccurrence in resistance islands were observed in conjugative F-plasmids (IncF) that reside in Escherichia (PTU-FE) and Klebsiella (PTU-FK) hosts but not in Salmonella (PTU-FS) hosts. Notwithstanding, several classes of large conjugative plasmids, including PTUs E41, I2, X1, and X4 are rarely associated with resistance islands (Fig. 3). Taken together, ARG cooccurrence in resistance islands is common in specific taxonomic units of large conjugative plasmids.

To study the evolution of large MDR plasmid types we compared the patterns of cooccurring ARGs and SSR genes among closely related plasmids. For that purpose, we inferred the phylogeny of RepB (plasmid replication initiator protein; RefSeq accession: WP_004182030.1), which is the most commonly shared gene family among large conjugative plasmid types. RepB-encoding plasmids are the major carriers of ARGs, where 44% (1045/2392) of the RepB-encoding plasmids carry 68% (10,490/15,534) of the plasmid ARG homologs. The phylogeny reveals several RepB clades that correspond to recognized PTUs (and plasmid incompatibility groups) (Fig. 5A).

Fig. 5. Resistance islands in large plasmids.

Fig. 5

A Phylogenetic tree of 2529 RepB sequences in 2354 plasmids. Rings of the phylogenetic tree show (from inner to the outer ring): host genus, predicted plasmid mobility class, PTU and the presence of resistance islands. Branches with bootstrap support ≤70 are colored in orange. The tree scale bar is in amino acid substitutions per site. B Multidimensional scaling of shared gene content among plasmids. Plasmids are shown in different colors for each PTU. C Multidimensional scaling of shared CSBs among plasmids. Plasmids are shown in the same colors for each PTU. Plasmids lacking a PTU assignment could not be classified using COPLA27. Source data are provided in Supplementary file.

The comparison of related PTUs that differ in their mobility may supply hints for the association of resistance islands with specific plasmid characteristics. The RepB phylogeny indicates an evolutionary relatedness between PTUs FE, E5, and E41, which has been previously suggested based on plasmid sequence similarity48 (note that 17 out of 42 E41 in our set are not included in the RepB phylogeny since they do not encode a RepB). Plasmids in the three PTUs commonly reside in Escherichia hosts, with rare occurrence in Salmonella or Klebsiella. PTU-FE plasmids are mostly conjugative48 and abundantly encode resistance islands. In contrast, plasmids in the related PTUs E5 and E41 rarely encode ARGs, where PTU-E5 are non-mobilizable plasmids and PTU-E41 plasmids are mostly conjugative. While most of the PTU-FE plasmids are conjugative (n = 462), this group includes also rare mobilizable plasmids (n = 22) and non-mobilizable plasmids (n = 17). The presence of resistance islands among PTU-FE plasmids is significantly different among plasmids depending on the plasmid mobility class (P = 0.03, using Fisher test), where the proportion of plasmids encoding resistance islands is lower among the conjugative plasmids (57%) compared to mobilizable (77%) and non-mobilizable (76%) plasmids. Furthermore, there is no significant difference in the number of ARGs among the three mobility classes in PTU-FE plasmids encoding resistance islands (P = 0.43, using the Kruskal–Wallis test). The comparison among the related PTUs FE, E5, and E41 indicates that the presence of resistance islands in plasmids is PTU-specific and furthermore that conjugative plasmids are not necessarily enriched for resistance islands.

The RepB phylogeny further suggests an evolutionary relatedness between PTU-FK and PTU-E21 plasmids that commonly reside in Klebsiella hosts. Plasmids in PTU-FK are mostly conjugative and abundantly encode resistance islands, while PTU-E21 are mostly non-mobilizable and rarely encode ARGs (Fig. 3; Fig. 5A). The RepB phylogeny supports the classification of plasmids in PTU-FS as closely related plasmids (Fig. 5A). Plasmids in that group reside in Salmonella hosts and include both conjugative and non-mobilizable plasmids that rarely encode antibiotic resistance genes. There is no significant difference in the occurrence of resistance islands between conjugative and non-mobilizable plasmids in that group (P = 0.66, using the Fisher test). The comparison of plasmids in these PTUs shows that differences in the frequency of resistance islands depending on the plasmid mobility class are PTU-specific.

Large conjugative plasmids in PTUs HI1A, HI1B, and HI2 have been suggested to have shared plasmid sequence similarity25, and the RepB phylogeny indicates that PTU-HI1A and PTU-HI2 are more closely related (Fig. 5A). This inference aligns well with the difference in their host range as reflected in the isolate genomes; HI1B plasmids reside in Klebsiella, while HI1A and HI2 plasmids reside in Escherichia and Salmonella (Figs. 3 and 5A). The split between HI1B and the other two PTUs may thus be linked to plasmid divergence during the evolution of host-specificity. Further comparison among the plasmids depending on their host genera shows that there is no significant difference in the presence of resistance islands between HI2 plasmids depending on their Escherichia or Salmonella host (P = 0.24, using Fisher test), while the proportion of HI1A plasmids encoding resistance islands is larger in plasmids that reside in Salmonella compared to those that reside in Escherichia (P < 0.01, using Fisher test). Hence the presence of resistance islands in specific PTUs may depend additionally on the plasmid host range.

The association of resistance islands with specific plasmid types (PTUs), and in some cases also the plasmid host range, is in agreement with previous studies showing that closely related plasmids have a shared resistance island content (e.g. ref. 49). To examine this aspect of MDR plasmid evolution we focus on the five prominent MDR plasmid PTUs among the RepB plasmids. The pattern of shared plasmid genes is in accordance with the plasmid PTU classification, and also the expected patterns of their relatedness as inferred by the RepB phylogeny (Fig. 5B). That is, the plasmids are generally clustered by PTU with an overlap between the clusters of HI1A and HI2 plasmids. Plasmids in PTU-FE form a tight cluster that supports their common evolutionary history. In contrary, the PTU-FK plasmids form a rather sparse cloud, indicating a high diversity of plasmids in this PTU, and likely a combination of highly diversified plasmid sub-groups. The proportion of shared genes between the plasmids is weakly positively associated with the proportion of shared CSBs in PTUs HI1A and HI1B (rHI1A = 0.67, rHI1B = 0.68, P < 0.01, using spearman correlation), but less so for the other examined PTUs (rHI2 = 0.2, rFE = 0.21, rFK = 0.33, P < 0.01, using spearman correlation). Examination of shared CSBs among plasmids in the five PTUs shows that most of the plasmids form a dense cluster, indicating common CSB content across the five PTUs (Fig. 5C). Plasmids in PTUs FE and FK form two additional clusters, indicating that they commonly share CSBs with each other, but not with the other plasmids in the main cluster (Fig. 5C). The pattern of shared CSBs among the five PTUs suggests that resistance islands have a narrow distribution in specific PTUs.

The CSBs that are shared among plasmids in the five PTUs are rather short, with a median CSB length of four genes. Example is a CSB that was inferred in 313 plasmids in the five PTUs and comprises two ARG families: aminoglycoside nucleotidyltransferase (ANT(3”)) and MFS efflux pump (QacE) (see CSB_F_1 in Fig. 6); both of these families in our dataset include multiple epidemiological variants of the ARGs. This short CSB is included in multiple CSBs that correspond to previously recognized MGEs. This includes a CSB similar to a class 1 integron previously reported to be associated with sul350 (CSB_HI2_3 in Fig. 6) that has a narrow distribution in PTU-HI2 plasmids and a CSB similar to a class 1 integron previously reported in Tn645051 (CSB_FK_3 in Fig. 6) that has a narrow distribution in PTU-FK plasmids. Another overlapping CSB includes an ARG cassette previously reported in class 1 integron52 (CSB_FE_3 in Fig. 6) and has a narrow distribution in PTU-FE plasmids. Several CSBs reported in Fig. 6 are shared among plasmids classified into different PTUs. An example is the CSB_HI2_2 (Fig. 6) that we detected in one plasmid classified as PTU-FK. The plasmid pLH94-8 was reported in a K. pneumonia isolate sampled from feces of a healthy human53. The pLH94-8 plasmid locus of this CSB is almost identical to several other E. coli plasmids harboring the same CSB (e.g., pSLK172-154). Another example is CSB_FK_3 (Fig. 6) which we detected in a plasmid classified as PTU-HI1A. The plasmid p24362-1 was reported in a S. enterica strain isolated from swine55. The locus corresponding to the CSB is identical to other PTU-FK plasmids (e.g., pDA33144-22056). The demonstrative example presented here thus suggests that CSBs in our dataset are largely PTU specific.

Fig. 6. The distribution of CSBs matching mobile genetic elements is biased towards specific PTUs.

Fig. 6

A demonstrative example of overlapping CSBs and their frequency in plasmids classified into five closely related PTUs. Protein-coding genes in the CSBs are shown as arrows. ARGs in the same gene family have the same color. Genes with no name correspond to hypothetical proteins. orf_1 corresponds to GrpB family protein. orf_2 corresponds to SDR family oxidoreductase. The adjacent table shows the number of plasmids in each PTU where an instance of the CSB was found. Cells showing the dominant PTU are shaded. The CSB diversity (Div.) was calculated using the Gini-Simpson index. Details of the presented CSBs are listed in Supplementary Data 6. *IS440 is annotated as a pseudogene in the genome versions analyzed here (in more recent versions this locus is annotated as a gene). Source data are provided in the full CSB dataset.

To test if PTU specificity is a general trend in our dataset, we examined the diversity of plasmids harboring the same CSB using the Gini-Simpson (GS) index57. This index measures the probability that two randomly selected plasmids harboring the same CSB are classified into different PTUs. Note that the CSBs appearing to have a narrow PTU distribution in the demonstrative example are characterized by a low GS index (Fig. 6). Our result revealed that the median GS index for all CSBs and all PTUs is 0. Furthermore, the majority of CSBs (59%) in our set have a GS index ≤0.1. The PTU diversity of CSBs is significantly negatively correlated with the CSB length (rs = −0.66; P < 0.001), with longer CSBs having a lower PTU diversity (Supplementary Fig. S6). Taking together, our demonstrative example shows that CSBs matching to previously documented MGEs have a rather narrow distribution in specific PTUs, and furthermore the CBSs we analyzed here are characterized by a low diversity of PTUs. Both lines of evidence indicate that resistance islands are mostly shared among closely related plasmids having similar plasmid properties.

Discussion

Microbial organisms encounter diverse challenges in their habitat including predation, infection, as well as extreme or toxic environmental conditions. The response to such stressors is diverse and range from plastic phenotypes to defensive multicellular structures (e.g. ref. 58). Furthermore, adaptive response in prokaryotes can be mediated by the acquisition of new functions via gene transfer, i.e., in response to phage infection59 or to antibiotics52. In both of these cases, the acquired genes are typically organized in dense genomic islands60,61. Our results supply a quantitative assessment, according to which, the majority of antibiotic resistance genes in plasmids are located adjacent to mobile genetic elements in resistance islands. The remaining ARGs in MDR plasmids may still be located in resistance islands, however, if these are rarely found in our dataset, the coARG instance would be excluded from the final results. We note that filtering the coARGs for CSBs appearing in ≥4 strains lead to exclusion of 10-15% of the plasmid coARGs (Supplementary Data 3). In agreement with previous studies, our results implicate the interaction between mobile genetic elements and plasmids as the major driving force in the evolution of ARG content in MDR plasmids. Resistance islands are hotspots for the integration of ARG-carrying MGEs due to several reasons, including functional dependencies between specific SSR families (e.g., Tn402 and Tn21 family62) and possibly also deleterious fitness effect of MGE integration outside resistance islands on the plasmid functions (e.g., in essential plasmid loci63). Plasmids carrying mobile genetic elements are therefore, in themselves, a hotspot for the acquisition of multiple drug resistance genes (e.g., as shown for an IncP-1 plasmid49).

The temporal analysis of resistance islands shows that ARG combinations tend to be rapidly saturated. In the last decade, the number of available plasmid sequences in genomic databases increased considerably, yet the frequency of ARG combinations was growing slower (Fig. 2B). More than half of the ARGs are already found in resistance islands in the earliest sampled isolates. The temporal integration of ARG nodes in the network largely fits earlier reports on the emergence of specific resistance genes, providing validation for our approach. An example is the high connectivity of sul1 that supplies resistance to sulfonamides introduced in the 1930’s64. Additionally, recently reported resistance genes appear in the network periphery. One example is mcr-1.1 that supplies resistance to polymyxin and colistin, and was reported in 2016 in Escherichia isolated from pig feces3,40 (Fig. 2C). Another example is tet(X4) that was reported in 2019; the tet(X4) gene supplies resistance to tigecycline, a last resort antibiotic that was introduced in 2005. The gene was observed in plasmids of Escherichia isolates sampled already during 2016-2018 from a pig farm as well as human patients38. The newly added ARG nodes increase in connectivity rather fast, indicating that the agglomeration of ARGs in resistance islands is a rapid process.

Previous studies on the evolution of MDR plasmids typically compare plasmid ARG content in the context of plasmid size and mobility class, without much distinction of the plasmid origins or properties related to the host range. A recent classification of plasmids into taxonomic units supplies a framework for comparative plasmid studies25 while considering their distinct characteristics and origins. The results we present here add support for the evolutionary relatedness of plasmids in the same PTU and furthermore expand on the evolution of ARG content in specific plasmid lineages. We further show that the distribution of resistance islands has a narrow PTU range, and their differential presence among plasmids within the same PTU may furthermore depend on the plasmid mobility class or the host range. Quantifying the diversity of plasmids harboring the same CSBs further revealed a low PTU diversity of CSBs. However, an interpretation of that result in the context of plasmid evolution is challenging, since our approach does not allow for an accurate annotation of resistance island boundaries. Long CSBs in our dataset correspond to conserved neighborhoods of MGE agglomerates, that are unlikely to emerge independently in distantly related plasmids. In contrast, short CSBs in our dataset may correspond either to genuine transposable elements or degraded pieces of resistance islands. Notably, while we are unable to distinguish between these two possibilities, our analysis shows that the plasmid diversity is generally low for all CSB lengths. The pattern of shared resistance islands can be thus explained either by common origins of the islands in ancestral plasmids and evolution by plasmid divergence (i.e., vertical plasmid inheritance), or alternatively, barriers for horizontal transfer of ARGs between plasmids, which are associated with the specific characteristics of plasmids in the same PTUs.

Barriers for the integration of ARG-carrying MGEs into plasmids may be related to the plasmid genetic properties (e.g., mode of replication and mobility), ecology (i.e., host range) and evolutionary history. For example, small plasmid types are almost devoid of ARGs; in our dataset, 62% of the plasmids in small plasmid PTUs are documented in a host where a large plasmid encoding resistance island is found as well. Hence the rarity of coARG containing CSBs in small plasmid types cannot be well explained by plasmid ecology (or host range) only. The narrow distribution of plasmid size among the ColE1-like plasmids in KES hosts (see also20 and Supplementary Note SN2) may indicate that ColE1-like replicons cannot well support a plasmid size inflation, hence the barrier for ARG acquisition is linked to the plasmid replicon type. For example, it has been suggested that the replication mode of IncQ plasmids, which are typically small similarly to ColE1-like plasmids, may become unstable if the plasmid size surpasses ca. 20 Kb65. In contrast, barriers for MGE-mediated integration of ARGs in the large PTU-FS plasmids (where only 5 out of 114 plasmids had resistance islands; Fig. 3), are unlikely to be related to plasmid size. An examination of their hosts showed that only 21 (18%) PTU-FS plasmids reside with other plasmids that encode a resistance island in the same host, while the remaining FS plasmids are either a sole plasmid in their host (54%) or reside with other plasmids that do not encode resistance islands. The FS taxonomic unit includes homologs of plasmid pSLT, a virulence plasmid in Salmonella66 where ARGs are rather rare (e.g.67). The interaction of PTU-FS plasmids with ARG-encoding MGEs is likely rare due to the plasmid host range. Taken together, our study reveals that the interaction between plasmids and MGEs is a major driving force in the evolution of MDR plasmids, where the interaction and its frequency largely depend on the plasmid taxonomic units, that is, the plasmid characteristics and its evolutionary history.

Methods

Data collection

Complete genomes of isolates in the genera Escherichia, Salmonella and Klebsiella where plasmids are reported were downloaded from the NCBI RefSeq database (version 01/2021; the same sequenced genomes dataset was used in our recent publication37). Isolate metadata was downloaded from BioSample database. The metadata of samples lacking host information was examined manually in detail. A total of 23 strains sequenced within the framework of laboratory experiments were excluded. The surveyed genomes include 1114 strains comprising 3098 plasmids from Escherichia, 755 strains comprising 2693 plasmids from Klebsiella, and 572 strains comprising 993 plasmids from Salmonella (Supplementary Data 1). The dataset does not include redundant genomes of the same strain. Similar and even identical plasmids were retained if they were identified in different strains (see Supplementary Data 1).

Identification of ARGs

Homologs of antibiotic resistance genes were identified based on the comprehensive antibiotic resistance database28 (CARD version 3.1.0) using Resistance Gene Identifier (version 5.1.1, with parameter --clean). To reduce the number of mis-assembled or mis-annotated ARGs, we excluded protein-coding genes whose sequence had less than 70% identical amino acids to the CARD reference ARG, or their length was less than 90% or more than 130% of the CARD reference ARG (as in ref.37). Of the total 3044 genes in CARD, 416 (14%) ARGs had 114,464 homologs in the examined isolates.

Construction of homologous gene families

Gene families were inferred based on sequence similarity as previously described in our study of Escherichia strains19. Briefly, reciprocal best hits (RBHs) of protein sequences between all pairs of 9225 replicons were identified using MMseqs268 (v.13.45111, with module easy-rbh applying a threshold of E-value ≤ 1 × 10−10). RBHs were further compared by global alignment of the protein sequences using parasail-python69 (v. 1.2.4, with the Needleman-Wunsch algorithm). Sequence pairs with ≥30% identical amino acids in the global alignment were clustered into gene families using a high-performance parallel implementation of the Markov clustering algorithm70 (HipMCL with parameter --abc -I 2.0). A total of 114,457 ARGs were identified in 138 gene families; the CARD classification was sometime only partially in agreement with the homologous gene family classification (114,457/203,591). Most of the contradictions between CARD and our clusters were chromosomal gene (97%) with only 3% corresponding to plasmid genes. Aiming for a conservative analysis, members not identified as ARGs by CARD were excluded.

Transposases and site-specific recombinases identification and biased distribution test

All gene families constructed in this study were scanned using InterProScan71 (version 5.59-91.0) against InterPro’s member databases72. HMM profiles of transposases and site-specific recombinases were obtained from previous studies73,74 to search SSRs using hmmsearch (HMMER version 3.3.2)75. Three chromosomal gene families were not included in the further analysis: a serine recombinase (flagellar phase variation DNA invertase Hin) and tyrosine recombinases XerC and XerD. The biased distribution of gene families towards plasmids or chromosomes was tested by comparing their frequency in both replicon types with the frequency of remaining family members on both replicon types using Fisher’s exact test (as in ref. 37), implemented with the fisher.test function in R (version 4.0.3, two-sided α = 0.05 and correction for multiple comparisons with false discovery rate (FDR76)).

Gene cooccurrence test

Significant cooccurrence of ARG pairs from different gene families was tested using Fisher’s exact test with fisher.test function in R (version 4.0.3, two-sided, α = 0.05) and correction for multiple comparisons with false discovery rate (FDR) 76. The test was performed for plasmids in different genera separately (as in ref. 37). For each test, the sample size n was equal to the number of ARG-coding plasmids. The significantly cooccurring ARGs are supplied in Supplementary Data 5.

Inference of collinear syntenic blocks

Collinear syntenic blocks (i.e., conserved gene order) were identified using CSBFinder-S29 allowing no gene insertion and maximal length of 200 protein-coding sequences (version 0.6.3 with parameter -q 4 -ins 0 -lmax 200 -c -cs -alg MATCH_POINTS). CBSs that were found identical in at least four genomes were retained for further analysis.

Detection of CSBs matching to mobile genetic elements

Sequences of intact transposable elements were collected from three mobile genetic element (MGE) databases: The Transposon Registry30 (version 10/2021), TnCentral31 (version 10/2021), and ISfinder32 database (version 10/2020). Partial sequences and complete plasmid sequences were excluded. We searched for the known MGEs in all 2441 complete genomes using Blast+77 (version: 2.9.0+ parameter: E-value ≤ 1 × 10−10). The blast hits were further compared with the CSBs genomic loci (i.e., the CSB coordinates). CSBs that overlapped with identified MGEs with at least two complete protein-coding genes were retained for further analysis.

ARG island analysis

The distance between ARGs was counted as the number of protein-coding genes on the same strand from one ARG to the adjacent n-th ARG (n = 2–10). The stochasticity of ARG distance was tested using chi square goodness-of-fit test with negative binomial distribution as the null hypothesis. The distances between randomly sampled genes was used as a control, where a similar number of gene families was sampled. Each sampled gene family has a similar family size to the corresponding ARG family. The distance between members of the randomly sampled gene families was counted in the same way as for the ARGs, the sampling was repeated 100 times. The distance distribution of ARGs and the control group were compared using Wilcoxon rank-sum test for each step-size separately (n = 2–10).

Network visualization

The network visualization was performed with Cytoscape78 (version 3.10.1).

Plasmid typing

We identified the incompatibility (Inc) groups of Rop- and RepB-coding plasmids using PlasmidFinder (version 2.1.6 with parameters: -l 0.6 -t 0.9 -mp blastn). Plasmids that remained of unknow Inc groups were further examined using pMLST79 (version 2.0.3, with parameter -mp blastn). Plasmid classification into predicted mobility classes (non-mobile, mobile, or conjugative) was done on the basis of the plasmid gene content using MOB-suite80 (version 3.0.3). Plasmids were assigned into PTUs using COPLA27 (version 1.0).

Phylogenetic analysis

Protein sequences were aligned using MAFFT81 (version v7.475). Owing to the large number of identical protein sequences among transposases, the sequences were filtered into non-redundant amino-acid sequences prior to the alignment. Maximum likelihood trees were reconstructed using IQ-TREE82 (version 2.1.2) with Le and Gascuel (LG83) substitution model and the LG4X model as additional alternative (with parameter -mset LG -madd LG4X -bb 1000). The resulting trees were rooted using the midpoint criterion and visualized using iTOL84 (version 6.5.8).

Diversity index

The plasmid diversity of CSBs was calculated according to the Gini-Simpson index57 with vegan package85 (v2.6-4) in R. The number of plasmids in each PTU was counted while excluding duplicated CSB instances in the same plasmid.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Supplementary information

Peer Review File (4MB, pdf)
41467_2024_48352_MOESM3_ESM.pdf (125.1KB, pdf)

Description of Additional Supplementary Files

Supplementary Data S1 (802.2KB, xlsx)
Supplementary Data S2 (32.7KB, xlsx)
Supplementary Data S3 (10.4KB, xlsx)
Supplementary Data S4 (47.5KB, xlsx)
Supplementary Data S5 (74.9KB, xlsx)
Supplementary Data S6 (74.7KB, xlsx)
Supplementary Data S7 (22.3KB, xlsx)
Reporting Summary (1.4MB, pdf)

Source data

Source data (15.5KB, xlsx)

Acknowledgements

The authors thank Hinrich Schulenburg, Giddy Landan, Nils Hülter, Devani Picazo, Dustin Hanke, Ishan Bhatt for fruitful discussion and critical comments on the manuscript. We thank Liam Shaw for constructive comments and critical review of the manuscript earlier versions. We are grateful for financial support from the German Science Foundation (RTG 2501 TransEvo), the China Scholarship Council (CSC scholarship to Y.W.), and the European Research Commission (pMolEvol, grant number 101043835). This research was supported in part through high-performance computing resources available at the Kiel University Computing Centre.

Author contributions

Y.W. and T.D. conceived the study. Y.W. designed and performed the data analysis and visualizations. Y.W. and T.D. interpreted the results and wrote the manuscript.

Peer review

Peer review information

Nature Communications thanks Liam Shaw, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Data availability

Source data are provided with this paper. The genome data is publicly available (https://ftp.ncbi.nlm.nih.gov/genomes/refseq/). Antibiotic resistance genes in comprehensive antibiotic resistance database (CARD v3.1.0; https://card.mcmaster.ca/); intact transposable elements were collected from The Transposon Registry (v10/2021; https://transposon.lstmed.ac.uk/tn-registry), TnCentral (v10/2021; https://tncentral.ncc.unesp.br/), and ISfinder database (v10/2020; https://isfinder.biotoul.fr/). The full CSB dataset and gene family information used in this study is available in Figshare86. Phylogenetic data of five transposase gene families, Rop, and RepB are also supplied86Source data are provided with this paper.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

The online version contains supplementary material available at 10.1038/s41467-024-48352-8.

References

  • 1.Castañeda-Barba S, Top EM, Stalder T. Plasmids, a molecular cornerstone of antimicrobial resistance in the One Health era. Nat. Rev. Microbiol. 2024;22:18–32. doi: 10.1038/s41579-023-00926-x. [DOI] [PubMed] [Google Scholar]
  • 2.Kumarasamy KK, et al. Emergence of a new antibiotic resistance mechanism in India, Pakistan, and the UK: a molecular, biological, and epidemiological study. Lancet Infect. Dis. 2010;10:597–602. doi: 10.1016/S1473-3099(10)70143-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Liu Y-Y, et al. Emergence of plasmid-mediated colistin resistance mechanism MCR-1 in animals and human beings in China: a microbiological and molecular biological study. Lancet Infect. Dis. 2016;16:161–168. doi: 10.1016/S1473-3099(15)00424-7. [DOI] [PubMed] [Google Scholar]
  • 4.León-Sampedro R, et al. Pervasive transmission of a carbapenem resistance plasmid in the gut microbiota of hospitalized patients. Nat. Microbiol. 2021;6:606–616. doi: 10.1038/s41564-021-00879-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Rozwandowicz M, et al. Plasmids carrying antimicrobial resistance genes in Enterobacteriaceae. J. Antimicrob. Chemother. 2018;73:1121–1137. doi: 10.1093/jac/dkx488. [DOI] [PubMed] [Google Scholar]
  • 6.Murray CJL, et al. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet. 2022;399:629–655. doi: 10.1016/S0140-6736(21)02724-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Partridge SR. Analysis of antibiotic resistance regions in Gram-negative bacteria. FEMS Microbiol. Rev. 2011;35:820–855. doi: 10.1111/j.1574-6976.2011.00277.x. [DOI] [PubMed] [Google Scholar]
  • 8.Cohen SN. Transposable genetic elements and plasmid evolution. Nature. 1976;263:731–734. doi: 10.1038/263731a0. [DOI] [PubMed] [Google Scholar]
  • 9.Stokes HW, Hall RM. A novel family of potentially mobile DNA elements encoding site-specific gene-integration functions: integrons. Mol. Microbiol. 1989;3:1669–1683. doi: 10.1111/j.1365-2958.1989.tb00153.x. [DOI] [PubMed] [Google Scholar]
  • 10.Escudero, J. A., Loot, C., Nivina, A. & Mazel, D. The Integron: adaptation on demand. Microbiol. Spectr.3, MDNA3-0019-2014 (2015). [DOI] [PubMed]
  • 11.Harmer CJ, Pong CH, Hall RM. Structures bounded by directly-oriented members of the IS26 family are pseudo-compound transposons. Plasmid. 2020;111:102530. doi: 10.1016/j.plasmid.2020.102530. [DOI] [PubMed] [Google Scholar]
  • 12.Baquero F. From pieces to patterns: evolutionary engineering in bacterial pathogens. Nat. Rev. Microbiol. 2004;2:510–518. doi: 10.1038/nrmicro909. [DOI] [PubMed] [Google Scholar]
  • 13.Miriagou V, Carattoli A, Fanning S. Antimicrobial resistance islands: resistance gene clusters in Salmonella chromosome and plasmids. Microbes Infect. 2006;8:1923–1930. doi: 10.1016/j.micinf.2005.12.027. [DOI] [PubMed] [Google Scholar]
  • 14.Harmer CJ, Moran RA, Hall RM. Movement of IS26-associated antibiotic resistance genes occurs via a translocatable unit that includes a single IS26 and preferentially inserts adjacent to another IS26. mBio. 2014;5:e01801–e01814. doi: 10.1128/mBio.01801-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Harmer CJ, Hall RM. IS26 cannot move alone. J. Antimicrob. Chemother. 2021;76:1428–1432. doi: 10.1093/jac/dkab055. [DOI] [PubMed] [Google Scholar]
  • 16.Ambrose SJ, Harmer CJ, Hall RM. Evolution and typing of IncC plasmids contributing to antibiotic resistance in Gram-negative bacteria. Plasmid. 2018;99:40–55. doi: 10.1016/j.plasmid.2018.08.001. [DOI] [PubMed] [Google Scholar]
  • 17.Reid CJ, Chowdhury PR, Djordjevic SP. Tn6026 and Tn6029 are found in complex resistance regions mobilised by diverse plasmids and chromosomal islands in multiple antibiotic resistant Enterobacteriaceae. Plasmid. 2015;80:127–137. doi: 10.1016/j.plasmid.2015.04.005. [DOI] [PubMed] [Google Scholar]
  • 18.Rafei R, et al. Analysis of pCl107 a large plasmid carried by an ST25 Acinetobacter baumannii strain reveals a complex evolutionary history and links to multiple antibiotic resistance and metabolic pathways. FEMS Microbes. 2022;3:1–13. doi: 10.1093/femsmc/xtac027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wein T, Wang Y, Hülter NF, Hammerschmidt K, Dagan T. Antibiotics interfere with the evolution of plasmid stability. Curr. Biol. 2020;30:3841–3847.e4. doi: 10.1016/j.cub.2020.07.019. [DOI] [PubMed] [Google Scholar]
  • 20.Ares-Arroyo M, Rocha EPC, Gonzalez-Zorn B. Evolution of ColE1-like plasmids across γ-Proteobacteria: From bacteriocin production to antimicrobial resistance. PLOS Genet. 2021;17:e1009919. doi: 10.1371/journal.pgen.1009919. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Smillie C, Garcillan-Barcia MP, Francia MV, Rocha EPC, de la Cruz F. Mobility of plasmids. Microbiol. Mol. Biol. Rev. 2010;74:434–452. doi: 10.1128/MMBR.00020-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Che Y, et al. Conjugative plasmids interact with insertion sequences to shape the horizontal transfer of antimicrobial resistance genes. Proc. Natl Acad. Sci. USA. 2021;118:e2008731118. doi: 10.1073/pnas.2008731118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Shaw LP. Null models for gene enrichment in plasmids. Proc. Natl Acad. Sci. USA. 2021;118:e2103665118. doi: 10.1073/pnas.2103665118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Partridge SR, et al. Classifying mobile genetic elements and their interactions from sequence data: the importance of existing biological knowledge. Proc. Natl Acad. Sci. USA. 2021;118:e2104685118. doi: 10.1073/pnas.2104685118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Redondo-Salvo S, et al. Pathways for horizontal gene transfer in bacteria revealed by a global map of their plasmids. Nat. Commun. 2020;11:3602. doi: 10.1038/s41467-020-17278-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Garcillán-Barcia MP, Redondo-Salvo S, de la Cruz F. Plasmid classifications. Plasmid. 2023;126:102684. doi: 10.1016/j.plasmid.2023.102684. [DOI] [PubMed] [Google Scholar]
  • 27.Redondo-Salvo S, et al. COPLA, a taxonomic classifier of plasmids. BMC Bioinform. 2021;22:390. doi: 10.1186/s12859-021-04299-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Alcock BP, et al. CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res. 2019;10:226–229. doi: 10.1093/nar/gkz935. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Svetlitsky D, Dagan T, Ziv-Ukelson M. Discovery of multi-operon colinear syntenic blocks in microbial genomes. Bioinformatics. 2020;36:i21–i29. doi: 10.1093/bioinformatics/btaa503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Tansirichaiya S, Rahman Md A, Roberts AP. The transposon registry. Mob. DNA. 2019;10:40. doi: 10.1186/s13100-019-0182-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Ross K, et al. TnCentral: a prokaryotic transposable element database and web portal for transposon analysis. mBio. 2021;12:e02060-21. doi: 10.1128/mBio.02060-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Siguier P, Perochon J, Lestrade L, Mahillon J, Chandler M. ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res. 2006;34:D32–D36. doi: 10.1093/nar/gkj014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Tatusov RL, Koonin EV, Lipman DJ. A genomic perspective on protein families. Science. 1997;278:631–637. doi: 10.1126/science.278.5338.631. [DOI] [PubMed] [Google Scholar]
  • 34.Harmer CJ, Pong CH, Hall RM. A brief guide to correct annotation of IS26 and variants. J. Antimicrob. Chemother. 2021;76:2213–2215. doi: 10.1093/jac/dkab139. [DOI] [PubMed] [Google Scholar]
  • 35.Pong CH, Harmer CJ, Ataide SF, Hall RM. An IS26 variant with enhanced activity. FEMS Microbiol. Lett. 2019;366:fnz031. doi: 10.1093/femsle/fnz031. [DOI] [PubMed] [Google Scholar]
  • 36.Partridge SR, Hall RM. The IS1111 family members IS4321 and IS5075 have subterminal inverted repeats and target the terminal inverted repeats of Tn21 family transposons. J. Bacteriol. 2003;185:6371–6384. doi: 10.1128/JB.185.21.6371-6384.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wang Y, Batra A, Schulenburg H, Dagan T. Gene sharing among plasmids and chromosomes reveals barriers for antibiotic resistance gene transfer. Philos. Trans. R. Soc. B: Biol. Sci. 2022;377:20200467. doi: 10.1098/rstb.2020.0467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.He T, et al. Emergence of plasmid-mediated high-level tigecycline resistance genes in animals and humans. Nat. Microbiol. 2019;4:1450–1456. doi: 10.1038/s41564-019-0445-2. [DOI] [PubMed] [Google Scholar]
  • 39.Liu J, Keelan P, Bennett PM, Enne VI. Characterization of a novel macrolide efflux gene, mef(B), found linked to sul3 in porcine Escherichia coli. J. Antimicrob. Chemother. 2009;63:423–426. doi: 10.1093/jac/dkn523. [DOI] [PubMed] [Google Scholar]
  • 40.Shen Z, Wang Y, Shen Y, Shen J, Wu C. Early emergence of mcr-1 in Escherichia coli from food-producing animals. Lancet Infect. Dis. 2016;16:293. doi: 10.1016/S1473-3099(16)00061-X. [DOI] [PubMed] [Google Scholar]
  • 41.Sun J, et al. Complete nucleotide sequence of an IncI2 plasmid coharboring blaCTX-M-55 and mcr−1. Antimicrob. Agents Chemother. 2016;60:5014–5017. doi: 10.1128/AAC.00774-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Zurfluh K, Klumpp J, Nüesch-Inderbinen M, Stephan R. Full-length nucleotide sequences of mcr−1-harboring plasmids isolated from extended-spectrum-β-lactamase-producing Escherichia coli isolates of different origins. Antimicrob. Agents Chemother. 2016;60:5589–5591. doi: 10.1128/AAC.00935-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Acman M, et al. Role of mobile genetic elements in the global dissemination of the carbapenem resistance gene blaNDM. Nat. Commun. 2022;13:1131. doi: 10.1038/s41467-022-28819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Ares-Arroyo M, Coluzzi C, Rocha EPC. Origins of transfer establish networks of functional dependencies for plasmid transfer by conjugation. Nucleic Acids Res. 2023;51:3001–3016. doi: 10.1093/nar/gkac1079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Ramirez MS, Iriarte A, Reyes-Lamothe R, Sherratt DJ, Tolmasky ME. Small Klebsiella pneumoniae plasmids: neglected contributors to antibiotic resistance. Front. Microbiol. 2019;10:2182. doi: 10.3389/fmicb.2019.02182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Tolmasky ME, Crosa JH. Tn1331, a novel multiresistance transposon encoding resistance to amikacin and ampicillin in Klebsiella pneumoniae. Antimicrob. Agents Chemother. 1987;31:1955–1960. doi: 10.1128/AAC.31.12.1955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Chen Y, et al. Emergence of carbapenem-resistant Klebsiella pneumoniae harbouring blaOXA-48-like genes in China. J. Med. Microbiol. 2021;70:001306. doi: 10.1099/jmm.0.001306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Coluzzi C, Garcillán-Barcia MP, de la Cruz F, Rocha EPC. Evolution of plasmid mobility: origin and fate of conjugative and nonconjugative plasmids. Mol. Biol. Evol. 2022;39:msac115. doi: 10.1093/molbev/msac115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Sota M, et al. Region-specific insertion of transposons in combination with selection for high plasmid transferability and stability accounts for the structural similarity of IncP-1 plasmids. J. Bacteriol. 2007;189:3091–3098. doi: 10.1128/JB.01906-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Antunes P, Machado J, Peixe L. Dissemination of sul3-containing elements linked to class 1 integrons with an unusual 3′ conserved sequence region among Salmonella isolates. Antimicrobial Agents Chemother. 2007;51:1545–1548. doi: 10.1128/AAC.01275-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Chen Y-P, et al. Tn6450, a novel multidrug resistance transposon characterized in a Proteus mirabilis isolate from chicken in China. Antimicrob. Agents Chemother. 2018;62:e02192–17. doi: 10.1128/AAC.02192-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Partridge SR, Kwong SM, Firth N, Jensen SO. Mobile genetic elements associated with antimicrobial resistance. Clin. Microbiol. Rev. 2018;31:e00088–17. doi: 10.1128/CMR.00088-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Hadjadj L, Baron SA, Olaitan AO, Morand S, Rolain J-M. Co-occurrence of variants of mcr-3 and mcr-8 genes in a Klebsiella pneumoniae isolate from Laos. Front. Microbiol. 2019;10:2720. doi: 10.3389/fmicb.2019.02720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Bai L, et al. A novel disrupted mcr-1 gene and a lysogenized phage P1-like sequence detected from a large conjugative plasmid, cultured from a human atypical enteropathogenic Escherichia coli (aEPEC) recovered in China. J. Antimicrob. Chemother. 2017;72:1531–1533. doi: 10.1093/jac/dkw564. [DOI] [PubMed] [Google Scholar]
  • 55.Zhao S, et al. Comparative genomic analysis of 450 strains of Salmonella enterica isolated from diseased animals. Genes. 2020;11:1025. doi: 10.3390/genes11091025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Nicoloff H, Hjort K, Levin BR, Andersson DI. The high prevalence of antibiotic heteroresistance in pathogenic bacteria is mainly caused by gene amplification. Nat. Microbiol. 2019;4:504–514. doi: 10.1038/s41564-018-0342-0. [DOI] [PubMed] [Google Scholar]
  • 57.Jost L. Entropy and diversity. Oikos. 2006;113:363–375. doi: 10.1111/j.2006.0030-1299.14714.x. [DOI] [Google Scholar]
  • 58.Smith WPJ, Wucher BR, Nadell CD, Foster KR. Bacterial defences: mechanisms, evolution and antimicrobial resistance. Nat. Rev. Microbiol. 2023;21:519–534. doi: 10.1038/s41579-023-00877-3. [DOI] [PubMed] [Google Scholar]
  • 59.Koonin EV, Makarova KS, Wolf YI, Krupovic M. Evolutionary entanglement of mobile genetic elements and host defence systems: guns for hire. Nat. Rev. Genet. 2020;21:119–131. doi: 10.1038/s41576-019-0172-9. [DOI] [PubMed] [Google Scholar]
  • 60.Rocha EPC, Bikard D. Microbial defenses against mobile genetic elements and viruses: who defends whom from what? PLOS Biol. 2022;20:e3001514. doi: 10.1371/journal.pbio.3001514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Shaw LP, Neher RA. Visualizing and quantifying structural diversity around mobile resistance genes. Microb. Genomics. 2023;9:001168. doi: 10.1099/mgen.0.001168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Minakhina S, Kholodii G, Mindlin S, Yurieva O, Nikiforov V. Tn5053 family transposons are res site hunters sensing plasmidal res sites occupied by cognate resolvases. Mol. Microbiol. 1999;33:1059–1068. doi: 10.1046/j.1365-2958.1999.01548.x. [DOI] [PubMed] [Google Scholar]
  • 63.Porse A, Schønning K, Munck C, Sommer MOA. Survival and evolution of a large multidrug resistance plasmid in new clinical bacterial hosts. Mol. Biol. Evol. 2016;33:2860–2873. doi: 10.1093/molbev/msw163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Clatworthy AE, Pierson E, Hung DT. Targeting virulence: a new paradigm for antimicrobial therapy. Nat. Chem. Biol. 2007;3:541–548. doi: 10.1038/nchembio.2007.24. [DOI] [PubMed] [Google Scholar]
  • 65.Loftie-Eaton W, Rawlings DE. Diversity, biology and evolution of IncQ-family plasmids. Plasmid. 2012;67:15–34. doi: 10.1016/j.plasmid.2011.10.001. [DOI] [PubMed] [Google Scholar]
  • 66.Rotger R, Casadesús J. The virulence plasmids of Salmonella. Int. Microbiol. 1999;2:177–184. [PubMed] [Google Scholar]
  • 67.Kingsley RA, et al. Epidemic multiple drug resistant Salmonella Typhimurium causing invasive disease in sub-Saharan Africa have a distinct genotype. Genome Res. 2009;19:2279–2287. doi: 10.1101/gr.091017.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Steinegger M, Söding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat. Biotechnol. 2017;35:1026–1028. doi: 10.1038/nbt.3988. [DOI] [PubMed] [Google Scholar]
  • 69.Daily J. Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments. BMC Bioinform. 2016;17:81. doi: 10.1186/s12859-016-0930-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Azad A, Pavlopoulos GA, Ouzounis CA, Kyrpides NC, Buluç A. HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks. Nucleic Acids Res. 2018;46:e33. doi: 10.1093/nar/gkx1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Jones P, et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014;30:1236–1240. doi: 10.1093/bioinformatics/btu031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Blum M, et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 2021;49:D344–D354. doi: 10.1093/nar/gkaa977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Khedkar S, et al. Landscape of mobile genetic elements and their antibiotic resistance cargo in prokaryotic genomes. Nucleic Acids Res. 2022;50:3155–3168. doi: 10.1093/nar/gkac163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Smyshlyaev G, Bateman A, Barabas O. Sequence analysis of tyrosine recombinases allows annotation of mobile genetic elements in prokaryotic genomes. Mol. Sys Biol. 2021;17:e9880. doi: 10.15252/msb.20209880. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Mistry J, Finn RD, Eddy SR, Bateman A, Punta M. Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 2013;41:e121. doi: 10.1093/nar/gkt263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B: Stat. Methodol. 1995;57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x. [DOI] [Google Scholar]
  • 77.Camacho C, et al. BLAST+: architecture and applications. BMC Bioinform. 2009;10:421–429. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Shannon P. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Carattoli, A. & Hasman, H. In Horizontal Gene Transfer: Methods and Protocols (ed. de la Cruz, F.) 285–294 (Springer US, 2020). 10.1007/978-1-4939-9877-7_20.
  • 80.Robertson J, Nash JHE. MOB-suite: software tools for clustering, reconstruction and typing of plasmids from draft assemblies. Micro. Genom. 2018;4:e000206. doi: 10.1099/mgen.0.000206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Minh BQ, et al. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 2020;37:1530–1534. doi: 10.1093/molbev/msaa015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Le SQ, Gascuel O. An improved general amino acid replacement matrix. Mol. Biol. Evol. 2008;25:1307–1320. doi: 10.1093/molbev/msn067. [DOI] [PubMed] [Google Scholar]
  • 84.Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47:W256–W259. doi: 10.1093/nar/gkz239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Oksanen, J. & et al. vegan: Community Ecology Package. (2022).
  • 86.Wang, Y. & Dagan, T. The evolution of antibiotic resistance islands occurs within the framework of plasmid lineages. 10.6084/m9.figshare.25605450.v3 (2024). [DOI] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Peer Review File (4MB, pdf)
41467_2024_48352_MOESM3_ESM.pdf (125.1KB, pdf)

Description of Additional Supplementary Files

Supplementary Data S1 (802.2KB, xlsx)
Supplementary Data S2 (32.7KB, xlsx)
Supplementary Data S3 (10.4KB, xlsx)
Supplementary Data S4 (47.5KB, xlsx)
Supplementary Data S5 (74.9KB, xlsx)
Supplementary Data S6 (74.7KB, xlsx)
Supplementary Data S7 (22.3KB, xlsx)
Reporting Summary (1.4MB, pdf)
Source data (15.5KB, xlsx)

Data Availability Statement

Source data are provided with this paper. The genome data is publicly available (https://ftp.ncbi.nlm.nih.gov/genomes/refseq/). Antibiotic resistance genes in comprehensive antibiotic resistance database (CARD v3.1.0; https://card.mcmaster.ca/); intact transposable elements were collected from The Transposon Registry (v10/2021; https://transposon.lstmed.ac.uk/tn-registry), TnCentral (v10/2021; https://tncentral.ncc.unesp.br/), and ISfinder database (v10/2020; https://isfinder.biotoul.fr/). The full CSB dataset and gene family information used in this study is available in Figshare86. Phylogenetic data of five transposase gene families, Rop, and RepB are also supplied86Source data are provided with this paper.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES