Skip to main content
Microbiology Spectrum logoLink to Microbiology Spectrum
. 2024 Jun 4;12(7):e04108-23. doi: 10.1128/spectrum.04108-23

Utilizing co-abundances of antimicrobial resistance genes to identify potential co-selection in the resistome

Hannah-Marie Martiny 1,, Patrick Munk 1, Christian Brinch 1, Frank M Aarestrup 1, M Luz Calle 2, Thomas N Petersen 1
Editor: Monica Adriana Garcia-Solache3
PMCID: PMC11218503  PMID: 38832899

ABSTRACT

The rapid spread of antimicrobial resistance (AMR) is a threat to global health, and the nature of co-occurring antimicrobial resistance genes (ARGs) may cause collateral AMR effects once antimicrobial agents are used. Therefore, it is essential to identify which pairs of ARGs co-occur. Given the wealth of next-generation sequencing data available in public repositories, we have investigated the correlation between ARG abundances in a collection of 214,095 metagenomic data sets. Using more than 6.76∙108 read fragments aligned to acquired ARGs to infer pairwise correlation coefficients, we found that more ARGs correlated with each other in human and animal sampling origins than in soil and water environments. Furthermore, we argued that the correlations could serve as risk profiles of resistance co-occurring to critically important antimicrobials (CIAs). Using these profiles, we found evidence of several ARGs conferring resistance for CIAs being co-abundant, such as tetracycline ARGs correlating with most other forms of resistance. In conclusion, this study highlights the important ARG players indirectly involved in shaping the resistomes of various environments that can serve as monitoring targets in AMR surveillance programs.

IMPORTANCE

Understanding the collateral effects happening in a resistome can reveal previously unknown links between antimicrobial resistance genes (ARGs). Through the analysis of pairwise ARG abundances in 214K metagenomic samples, we observed that the co-abundance is highly dependent on the environmental context and argue that these correlations can be used to show the risk of co-selection occurring in different settings.

KEYWORDS: metagenomics, correlation, network analysis, compositional data analysis, co-abundances, antimicrobial resistance, microbiome

INTRODUCTION

Antimicrobial resistance (AMR) has become a global health threat, with ramifications not only for modern medicine but also for agriculture and environmental health (13). It is widely acknowledged that the misuse of antimicrobials has accelerated the dissemination and prevalence of antimicrobial resistance genes (ARGs) (4). Most attempts to reduce the burden of AMR have focused on reducing the use of single classes of antimicrobial agents considered of critical importance (5). Despite considerable efforts in various settings, such as livestock, these regulations have not significantly reduced the spread of ARGs (6, 7). It has been shown that after banning the use of specific antimicrobials, resistance to that antibiotic will still be prevalent in the environment (8). Indirect co-occurrence of ARGs has also been shown to happen under selective antibiotic pressure (7, 911). We have also recently observed that changes in the selective pressure of even a single antimicrobial agent influence several ARGs in pig metagenomes (11). However, there is also evidence that adding mobile ARGs to a microbiome increased its stability, whereas if the ARGs were encoded in the chromosome, the stability decreased (12). The origin of the interplay between the microbiome and resistome, however, remains largely unresolved, as there are many ecological, functional, and evolutionary properties unaccounted for. It is unclear whether the co-selection of ARGs happens due to genetic linkage or inter-species interactions.

Studies that focused on studying the link between antimicrobial usages (AMUs) and the prevalence of AMR have typically only focused on the effects of using one antimicrobial agent and the impact on developing resistance to that agent, ignoring the overall effect on the abundance of ARGs in the environment. This effect on the resistome must be understood better to prevent these collateral damage effects from happening (13, 14). Co-occurrence of microbes has been evaluated in soil (15, 16) and marine environments (17, 18), whereas ARG co-occurrences have been studied in sewage sludge (19), freshwater (20), marine (21), swine (8), and cattle (22). However, most of these studies have only been evaluated on a smaller scale and do not always use the same methods. There is currently a large collection of next-generation sequencing data sets from metagenomic samples available in public repositories, providing an excellent resource to quantify the prevalence of ARGs by analyzing the read abundances (2326).

In this study, we have analyzed the co-abundance of sequencing reads aligned to acquired ARGs to assess how resistance to one specific antimicrobial agent is linked to the abundance of another class of antimicrobial. With a collection of 214,095 metagenomic data sets from public repositories (27), we examined the correlation between pairwise ARG read abundances with a compositional approach (28, 29) using SparCC (Sparse Correlations for Compositional data) (30). Our results demonstrated that many ARG pairs interact but are highly specific to the environment. We believe that these interactions provide another way to study how ARGs are being co-selected independently of the microbial context, and the findings can be used to design targeted interventions to limit the spread of AMR.

MATERIALS AND METHODS

Data collection and pre-processing

We have previously described in detail the process of downloading and analyzing 214,095 metagenomic samples (25, 27), but in brief, we downloaded raw sequencing reads corresponding to 442 Tbp from metagenomic samples deposited in the European Nucleotide Archive (31) that were uploaded between 1 January 2010 and 1 January 2020 and had at least 100,000 reads and were shotgun sequenced. The raw sequencing reads were quality-checked with FASTQC v.0.11.15 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and trimmed with BBduk2 36.49 (32). The trimmed reads were then globally aligned using the k-mer alignment tool (KMA) (33) 1.2.21 against two reference sequence databases: ResFinder (34) (downloaded 25 January 2020) and Silva (35) (version 38, downloaded 16 January 2020). ResFinder is a database of 3,085 acquired ARGs, whereas Silva is a 16S/18S rRNA database of 2,225,272 sequences.

Homology-based clustering of ResFinder sequences

With the possibility of detecting up to C(3,085,2)=3,085!(2!(3,0852)!)=4,757,070 pairwise interactions between the 3,085 ARGs of ResFinder, we decided to cluster similar reference sequences together to reduce the number of possible combinations. Using USEARCH (v11.0.667), we did a homology-based clustering that grouped ARG sequences into 90% identity groups, producing a total of 716 clusters (36, 37). For each gene member of an ARG cluster, the read counts for the gene were first adjusted by the gene sequence length, and then all length-adjusted counts were summed together (38). These new ARG cluster counts are used in the correlation analysis described in the next section and are referred to as simply ARGs in the remainder of the manuscript.

Calculating relative abundances

For a category label, we calculated the relative abundance of fragment counts assigned to different genes or classes as:

Relative abundance(x)=κnini,

where x is the label, ni is the count of read fragments assigned to gene i, and κ=100 is a scaling constant (38).

Inferring pairwise correlations with SparCC

The SparCC algorithm was used to obtain correlations using pairs of log-ratio transformed ARG read counts to infer linear Pearson correlations (30). SparCC obtains linear Pearson correlations and P values through an iterative approach that adjusts for spurious correlations and lowers the false discovery rate. The ARG–ARG correlations were inferred as the average over 50 iterations, and one-sided pseudo-P values were obtained through a bootstrapping procedure of 100 rounds. In each bootstrapping round, the input count matrix was shuffled, and correlations were averaged over 10 iterations to infer one-sided P values to test whether the correlations for the observed data were statistically significant. Correlation 0.6 with P 0.01 were selected for further analysis. We implemented SparCC to run on GPUs on the Danish National Supercomputer for Life Sciences (https://www.computerome.dk).

We ran SparCC on the entire data set of the 214K metagenomic samples and subsets of samples grouped by sampling host and environment, where at least 800 samples existed. Due to inconsistent labeling of the sampling sources, we made new source groups, as shown in Table 1. We only consider genes for SparCC analysis that are present in at least 10 samples with a minimum read fragment count of 50. In total, 11 different correlation matrices were made, that is, one for each of the source groups listed in Table 1. The correlation networks were visualized in R 4.1.0 (39) with packages igraph (40), qgraph (41), and ggraph (42) using the Fruchterman–Reingold layout algorithm (43).

TABLE 1.

Grouped labels for hosts and environmentsa

Source group Sampling source label(s) Number of samples Number of samples with ARGs
Air Air metagenome 914 870
Chicken Gallus gallus (1,219), chicken gut metagenome (4) 1,223 1,215
Cow Bos taurus (872), cow dung metagenome (14) 886 824
Dog Canis lupus familiaris 3,439 3,182
Freshwater Freshwater metagenome 4,494 585
Human Homo sapiens 95,003 57,239
Marine Marine metagenome 30,002 5,444
Mouse Mus musculus (1,462), mouse metagenome (50), mouse gut metagenome (2,435) 3,947 3,909
Pig Sus scrofa (673), Sus scrofa domesticus (355), pig metagenome (2,129), pig gut metagenome (72) 3,229 3,461
Soil Soil metagenome 6,533 2,822
All 214,095 119,206
a

The parenthesis after each sampling source label denotes the number of samples assigned to that label if the group consisted of multiple labels.

Network comparisons

The topology of the different networks is described using different metrics: the number of nodes (N) and edges (E), the global clustering coefficient, network density, edge density, and the number of components. The global clustering coefficient, or the graph transitivity, measures the density of node triplets in the network (44). The network density is calculated as 2E/NN-1 as given in Parente et al. (2018) (45). Edge density is the number of edges over the number of possible edges (46). The average correlation between ARGs of two antimicrobial classes was calculated using Fisher’s z-transformation on the correlation values, averaging the z-scores and converting it back to a correlation score with the inverse Fisher transformation (47).

RESULTS

This study investigated the correlation of pairwise ARG abundances across a set of 214,095 metagenomic sequencing data sets. This collection represents a highly diverse set of publicly available metagenomic samples collected between 2000 and 2020 from locations all over the world, where the sequencing data sets consist of short-read reads produced on Illumina platforms (Fig. S1). We observed that not all samples contained sequence read fragments aligned to ARGs, so we only included those that did in the correlation analyses (n = 119,206; Table 1). Looking at the various sampling sources, read fragments that aligned to ARGs’ conferring resistance to tetracycline and beta-lactams were very common in host-associated sources, whereas phenicol resistance was more frequently observed in environmental sources (freshwater, marine, and soil) (Fig. 1). catA1 was the most dominant gene in the environmental samples, especially in marine samples, whereas various tet genes had high relative abundances in livestock or human samples: tet(W) in chicken and pig samples, and tet(Q) in cow, human, and pig metagenomes. blaTEM-52B accounted for more than 30% of the read fragments assigned to ARGs in air metagenomes (Fig. S2).

Fig 1.

Fig 1

Relative abundance of read fragments aligned to each resistance class (row) for each sampling source (column). The relative abundance is the number of read fragments aligned to the group out of all fragments aligned to ARGs. For each resistance class, the parenthesis shows the number of ARGs belonging to that category.

Balancing the sparsity and network complexity

Initially, we inferred correlations with SparCC on all samples that had at least one read fragment aligned to an ARG. We observed that most ARGs were found to correlate with each other in all environments, even in cases where there was only one read fragment per reference (Table S1). Across all samples, we observed template hits ranging from 1 to 252 with a median of 20 and fragment counts spanning 1 to 4.5⋅107 , with a median of 880 fragment counts. To balance the sparsity and network complexity, we introduced a filtering step. This step determined whether an ARG should be included or excluded based on a minimum number of fragment counts and a minimum number of samples supporting hits to that ARG template. Among the 716 ARGs considered, we noticed that without any filters, SparCC indicated correlations even with very low read fragment counts. Therefore, we required that for an ARG to be included, it had to have a minimum fragment count of 50 across at least 10 samples (Table S1).

Analysis of large-scale metagenomic correlation networks

Based on our filter settings, we constructed a global network using the correlation coefficients for the entire collection of metagenomes, with each node representing an ARG and each edge representing a pairwise ARG connection (correlation 0.6, P §amp;lt; 0.01; Fig. S3). The global network, nicknamed “all,” contained 225 ARGs connected through 2,344 correlation edges (Fig. 2A). As this all network was hard to interpret due to the many highly interconnected ARGs, we also inferred pairwise ARG correlations in specific sampling groups (Table 1). The genes that were part of these networks were found to correlate with varying degrees of strength (Fig. 2A and B). For example, the human network contained many correlation coefficients, but most were less than 0.8 (Fig. 2C). Another example is the marine network, where only a few ARGs were found to correlate but with values above 0.9 (Fig. 2B and C). Despite the networks reflecting the composition of the various environments, we still observed overlaps between which ARGs were found to correlate. One example is that all the correlations inferred from the pig metagenomes also existed in the human metagenomic network (Fig. 2D).

Fig 2.

Fig 2

The resistome networks consisting of correlations (edges) between pairs of ARGs (nodes). (a) Each correlation network is visualized, where each ARG node is colored by resistance class, and edges are colored by the correlation coefficient value. (b) Metrics of the correlation networks reveal how interconnected and complex the networks are. "No." stands for "number of". (c) Distribution of included correlation coefficients in each network. Figure S3 shows the distribution of all inferred correlations and their P values. (d) A heatmap showing how similar the content of the two networks is. In the upper half of the heatmap, the number of overlapping ARG nodes is shown. An overlap means that a specific ARG has a correlation in both networks, ignoring what it is co-abundant with. In the lower half of the heatmap, the number of overlapping correlation edges is shown. An overlapping edge is defined as whether the correlation coefficient is reported in both networks, regardless of the value of the coefficient.

There was a limited number of correlations for ARGs encoding resistance to fluoroquinolones, steroid antibiotics (fusidic acid), colistin, and rifampicin. On the other hand, beta-lactam, tetracycline, and aminoglycoside ARGs had many correlations with each other and with other classes (Fig. S4 and S6). Despite resistance to some antimicrobial classes being the most abundant, the ARGs did not always correlate to many others. For example, tetracycline ARGs were the most abundant in dog metagenomes, but no correlations were inferred for these ARGs. Similarly, in the human samples where aminoglycoside and beta-lactam ARGs were less abundant than tetracycline ARGs, aminoglycoside and beta-lactam resistance genes had a higher number of correlation coefficients were reported (Fig. 1; Fig. S4).

On the level of ARG abundance, we observed that just because an ARG was highly abundant in a sampling group, it did not automatically mean that it correlated to many other ARGs. The highly abundant catA1 gene in marine (97.4% of all ARG reads), freshwater (55.1%), and soil samples (84.7%) (Fig. S2) did only correlate with one or two other genes in the water environmental networks and none in the soil network. On the other hand, catA1 did seem to be correlated with 15 other genes in the pig correlation network despite not being highly abundant in that group of samples (Fig. S5a). mef(A)_1 accounted for 15.9% of the reads aligned to ARGs in cow samples and 6.63% in pigs (Fig. S2) and was also strongly correlating with other genes, which mainly conferred resistance to aminoglycosides, (fluoro)quinolones, and tetracyclines (Fig. S5b). tet(L)_4 only accounted for 4.01% of the read fragments aligned to ARGs in metagenomes of chicken origins (Fig. S2) but was shown to correlate in its abundance with the abundance of 8 other ARGs, for example, with a correlation of 0.77 with lnu(A)_1 (Fig. S5c).

The hidden signals between ARG profile and the potential risk of co-selection in different environmental contexts

Using the correlations between ARGs in the various environments (Fig. 2), we calculated the average correlation between ARGs of different antimicrobial classes (Fig. S6). These average correlations can then serve as profiles to assess the risk of indirectly selecting ARGs that confer resistance to different ARGs through co- and cross-resistance. These risk profiles can then be used to judge the strength of interactions upon using one antimicrobial in each setting. Upon constructing these profiles, we observed that the strength and the number of correlations highly depend on the antimicrobial classes and the environmental context (Fig. S6). We hypothesize that if an important antimicrobial (IA) or highly important antimicrobial (HIA) is used, first, the resistance to the antimicrobial class will likely flourish, and, secondly, through co- and cross-resistance, so will ARGs conferring resistance to other classes, including those that are CIA.

Figure 3 shows two risk profiles for ARG correlations for glycopeptide and macrolide, two highly CIAs. Correlations between ARGs of glycopeptide resistance to other resistance classes were much rarer than those connected with macrolide ARGs (Fig. S6), and those correlations that were observed were relatively low (correlation < 0.8; Fig. 3). Different vancomycin resistance cassettes were responsible in different environments, namely, VanHAX, VanC2, and VanX_bc in human samples and VanHDX and VanC1XY in pig samples (Fig. S7a). VanHAX only has one correlation, whereas the remaining five correlate with many different ARGs. On the contrary, ARGs conferring resistance to macrolide were much more interactive with other classes of resistance genes in all networks and had strong correlations with specific classes (correlation > 0.9; Fig. 3; Fig. S6). The macrolide ARGs co-abundant with other ARGs were many but separated into distinct network clusters (Fig. S7b). For example, mef(A) and msr(D) were usually found together in different environments, both in small and large clusters.

Fig 3.

Fig 3

Correlation profiles for ARGs conferring resistance to the CIA classes glycopeptides (left) and macrolides (right). Each column shows the average correlation from, for example, macrolide ARGs to ARGs for other antimicrobial classes. The circle is colored by the average correlation, where a white circle indicates no statistically significant correlations of ARGs observed between the two antimicrobial classes.

While Fig. 3 highlights how CIA ARGs interact, it is just as important to investigate what ARGs of less CIAs correlate with. As seen in Fig. 4, ARGs for pleuromutilin resistance (IA) and for tetracycline resistance (HIA) were found to interact with many other classes, including those that are critically important. For example, pleuromutilin ARGs are few but well connected (Fig. 4; Fig. S6), as seen with the connections with lsa(E) and cfr(C) (Fig. S8a). Tetracycline ARGs correlated with the abundance of multiple ARGs, such as those conferring resistance to lincosamides, macrolides, and phenicols (Fig. 4). While there were many ARGs for tetracycline resistance, they often correlated to the same ARGs (Fig. S8b).

Fig 4.

Fig 4

Correlation profiles for ARGs conferring resistance to important pleuromutilins (left) and highly important tetracyclines (right). Each column shows the average correlation from, for example, tetracycline ARGs to ARGs for other antimicrobial classes. The circle is colored by the average correlation, where a white circle indicates no statistically significant correlations of ARGs observed between the two antimicrobial classes.

DISCUSSION

Considering the complexity of microbiomes, studying how microbial composition shapes the distribution of ARGs is a challenging task but one that could shed light on how ARGs indirectly select one another. However, with the high-throughput sequencing technologies and many metagenomic data sets available in public repositories, it is now much more feasible to extract the patterns of how ARGs co-occur without knowing their microbial origin. Using our recently published collection of 214K metagenomic datasets (27), we have inferred correlations of pairwise ARG abundances to profile which types of resistance influence the shape of resistome and which genes are the key players. To the best of our knowledge, this is the first study to relate ARG abundances on such a large and broad scale.

Our correlation networks revealed that not all ARGs are connected, as the pairwise ARG interactions were largely shaped by the composition of the environmental resistome. We observed cases where a highly abundant ARG had no correlations and the opposite where a sparse ARG had many correlations, highlighted by our effort to apply appropriate filters to ensure robust correlations (Table S1). However, by filtering on the individual abundance of ARGs in each resistome, there is a risk of removing important links between ARGs, especially in environments with very low abundant ARGs. We continued with settings that ensured that sparse ARGs in complex environments were removed but that ARGs in overall sparse environments could still be included, such as for the air resistome. More sensitive filtering could be applied to further improve the robustness of the inferred ARGs for each resistome.

The network showing correlations for all metagenomic samples was the most complex, which we speculate is due to both the wide variety of sampling sources and the overrepresentation of human metagenomes (Table 1). We found the differences in the animal and environmental networks much more interesting, as they reflect the dynamics of ARG abundances in more localized settings (Fig. 2). On the level of which antimicrobial class an ARG confers resistance to, we could also observe that some resistance classes had more and stronger correlations than others (Fig. S6). We found several cases of strongly correlated ARG pairs that have also been reported in other studies. These cases were often due to the ARGs being present in the same microbial genome. For example, the gene cassette vanHAX has been found together with msr(C) and aac(6′) in genomes of human isolates (48) and mef(A) linked with tet(O) (49) and mdf(A) with blaTEM, aph(6), sul2, and tet(A) (50) (Fig. S7).

As highlighted in Fig. 3 and 4, we argue that the correlations can serve as a way to profile the collateral effects occurring in a resistome when the abundance of different ARG mechanisms changes. Antimicrobials have been classified differently to reflect their importance to human medicine, of which glycopeptides and macrolides are CIA with the highest priorities. Glycopeptide and macrolide resistance have previously been linked genetically (7, 9) in pigs, where we also can report the presence of correlations between ARGs of glycopeptide and macrolide resistances not only in pigs but also in human and mouse environments (Fig. 3). Pleuromutilin and tetracycline antimicrobials are classified as being less critically important (51), but SparCC reported many correlations of ARGs for these two classes in various networks (Fig. 4), which suggests that there are still risks associated with the use of these two. Tetracycline resistance has been found to occur together with resistance to macrolides (5254), aminoglycosides (52, 55), folate pathway antagonists (56, 57), lincosamide (11), and beta-lactams (55), to name a few studies. This high connectivity of tetracycline ARGs seems in line with our results, as this specific group of ARGs was connected to almost all classes of antimicrobials in most of our networks (Fig. 4; Fig. S8b). The variety of connections to antimicrobials of less importance to human health should be more in focus, as our results show that there are risks of critical AMRs emerging from the enrichment of less essential resistance genes.

Following this line of thought of focusing on ARGs that give resistance to less CIAs, a recent study by Tarek and Garner (2022) (58) proposed to create a monitoring framework based on isolated components, or clusters, in correlation networks. They constructed a correlation network encompassing ARG abundances in samples from wastewater treatment plants and argued that representative ARG members from each cluster in their network should be monitoring targets. Our networks suggest that limiting to only a few ARGs would fail to capture the complete picture in some environments, such as human microbiomes (Fig. 2). Instead, we propose to limit the set of monitoring targets to only what happens to ARG abundances during exposure to one type of antimicrobials (Fig. 3 and 4; Fig. S7 and S8). If a monitoring system is implemented, it would need to be updated regularly to show the changes in AMU and ARG co-occurrences since the correlations we have inferred in this study only reflect the current and past abundances.

In order to use correlation networks for surveillance of AMR, more work is needed to confirm that the observed interactions do indeed exist in nature (59). We have defined interactions as being indirect since SparCC does not indicate which way the interaction occurred. One way to investigate this could be by investigating physical linkages or using other analytical methods. For example, to determine whether vanHAX influences msr(C) or the other way around (Fig. S7a), the SPIEC-EASI (60) method could be used to infer such directional dependences. A directional correlation could be included in a risk profile.

Our choice of only studying the co-abundances of ARGs does not capture the causality and directionality of the interactions. We have not disentangled whether the interactions are due to co-selection or cross-selection and neither investigating genetic linkages, co-exposure, and the presence of other genetic elements such as mobile genetic elements. Understanding some of these factors could be achieved by expanding our analysis with bacterial read counts and counts of mobile genetic elements. We have speculated that the resulting correlations could reflect the community’s response to external stressors, such as different usages of antimicrobials. While there has been work on linking the ARG prevalence with AMUs in various settings (7, 9, 11), incorporating AMU values, other non-antimicrobial stressors (61), and microbial signatures (12) in our analysis would likely help us better understand the microbiomes’ response to selective pressures.

By utilizing the wealth of information on ARG abundances available in a collection of 214K metagenomic data sets, we have studied the co-abundance of ARGs to discover how these interactions shape the prevalence of resistances in different environments. The inferred correlation networks provide insights into how two resistance types indirectly and species independently select for each other in different habitats. Our results further highlight that there are instances of genes of one type of resistance often co-occurring with many other types of resistance and that the environmental context plays an important role, revealing them as important targets in surveillance programs to limit their impact on global health.

Supplementary Material

Reviewer comments
reviewer-comments.pdf (124.3KB, pdf)

Contributor Information

Hannah-Marie Martiny, Email: hanmar@food.dtu.dk.

Monica Adriana Garcia-Solache, Brown University, Providence, Rhode Island, USA.

DATA AVAILABILITY

The matrix of raw ARG read counts is available at https://doi.org/10.5281/zenodo.6519843 . The code used to run the analysis and create figures and the SparCC output files are available at https://github.com/hmmartiny/global_resistome_correlations. Classifications of antimicrobial importance were retrieved from the 6th revision of critically important antimicrobials (CIAs) for human medicine from https://www.who.int/publications/i/item/9789241515528, accessed 10 October 2022.

SUPPLEMENTAL MATERIAL

The following material is available online at https://doi.org/10.1128/spectrum.04108-23.

Supplementary material. spectrum.04108-23-s0001.pdf.

Table S1; Fig. S1 to S8.

DOI: 10.1128/spectrum.04108-23.SuF1
OPEN PEER REVIEW. reviewer-comments.pdf.

An accounting of the reviewer comments and feedback.

reviewer-comments.pdf (124.3KB, pdf)
DOI: 10.1128/spectrum.04108-23.SuF2

ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

  • 1. O’Neill J. 2016. The review on antimicrobial resistance, p 1–80. In Tackling drug-resistant infections globally: final report and recommendations [Google Scholar]
  • 2. Murray CJL, Ikuta KS, Sharara F, Swetschinski L, Robles Aguilar G, Gray A, Han C, Bisignano C, Rao P, Wool E, et al. 2022. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet 399:629–655. doi: 10.1016/S0140-6736(21)02724-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Zhang Z, Zhang Q, Wang T, Xu N, Lu T, Hong W, Penuelas J, Gillings M, Wang M, Gao W, Qian H. 2022. Assessment of global health risk of antibiotic resistance genes. Nat Commun 13:1553. doi: 10.1038/s41467-022-29283-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Holmes AH, Moore LSP, Sundsfjord A, Steinbakk M, Regmi S, Karkey A, Guerin PJ, Piddock LJV. 2016. Understanding the mechanisms and drivers of antimicrobial resistance. Lancet 387:176–187. doi: 10.1016/S0140-6736(15)00473-0 [DOI] [PubMed] [Google Scholar]
  • 5. Collignon P, Powers JH, Chiller TM, Aidara-Kane A, Aarestrup FM. 2009. World health organization ranking of antimicrobials according to their importance in human medicine: a critical step for developing risk management strategies for the use of antimicrobials in food production animals. Clin Infect Dis 49:132–141. doi: 10.1086/599374 [DOI] [PubMed] [Google Scholar]
  • 6. Jensen HH, Hayes DJ. 2014. Impact of Denmark’s ban on antimicrobials for growth promotion. Curr Opin Microbiol 19:30–36. doi: 10.1016/j.mib.2014.05.020 [DOI] [PubMed] [Google Scholar]
  • 7. Aarestrup FM, Seyfarth AM, Emborg HD, Pedersen K, Hendriksen RS, Bager F. 2001. Effect of abolishment of the use of antimicrobial agents for growth promotion on occurrence of antimicrobial resistance in fecal enterococci from food animals in Denmark. Antimicrob Agents Chemother 45:2054–2059. doi: 10.1128/AAC.45.7.2054-2059.2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Sun J, Li L, Liu B, Xia J, Liao X, Liu Y. 2014. Development of aminoglycoside and β-lactamase resistance among intestinal microbiota of swine treated with lincomycin, chlortetracycline, and amoxicillin. Front Microbiol 5:580. doi: 10.3389/fmicb.2014.00580 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Aarestrup FM. 2000. Characterization of glycopeptide-resistant Enterococcus faecium (GRE) from broilers and pigs in Denmark: genetic evidence that persistence of GRE in pig herds is associated with coselection by resistance to macrolides. J Clin Microbiol 38:2774–2777. doi: 10.1128/JCM.38.7.2774-2777.2000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Looft T, Johnson TA, Allen HK, Bayles DO, Alt DP, Stedtfeld RD, Sul WJ, Stedtfeld TM, Chai B, Cole JR, Hashsham SA, Tiedje JM, Stanton TB. 2012. In-feed antibiotic effects on the swine intestinal microbiome. Proc Natl Acad Sci U S A 109:1691–1696. doi: 10.1073/pnas.1120238109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Andersen VD, Aarestrup FM, Munk P, Jensen MS, de Knegt LV, Bortolaia V, Knudsen BE, Lukjancenko O, Birkegård AC, Vigre H. 2020. Predicting effects of changed antimicrobial usage on the abundance of antimicrobial resistance genes in finisher’ gut microbiomes. Prev Vet Med 174:104853. doi: 10.1016/j.prevetmed.2019.104853 [DOI] [PubMed] [Google Scholar]
  • 12. Coyte KZ, Stevenson C, Knight CG, Harrison E, Hall JPJ, Brockhurst MA. 2022. Horizontal gene transfer and ecological interactions jointly control microbiome stability. PLoS Biol 20:e3001847. doi: 10.1371/journal.pbio.3001847 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Peterson LR. 2005. Squeezing the antibiotic balloon: the impact of antimicrobial classes on emerging resistance. Clin Microbiol Infect 11 Suppl 5:4–16. doi: 10.1111/j.1469-0691.2005.01238.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Paterson DL. 2004. ‘Collateral damage’ from cephalosporin or quinolone antibiotic therapy. Clin Infect Dis 38:S341–S345. doi: 10.1086/382690 [DOI] [PubMed] [Google Scholar]
  • 15. Barberán A, Bates ST, Casamayor EO, Fierer N. 2012. Using network analysis to explore co-occurrence patterns in soil microbial communities. ISME J 6:343–351. doi: 10.1038/ismej.2011.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Yuan MM, Guo X, Wu L, Zhang Y, Xiao N, Ning D, Shi Z, Zhou X, Wu L, Yang Y, Tiedje JM, Zhou J. 2021. Climate warming enhances microbial network complexity and stability. Nat Clim Chang 11:343–348. doi: 10.1038/s41558-021-00989-9 [DOI] [Google Scholar]
  • 17. Zhang Z, Li H, Shen W, Du X, Li S, Wei Z, Zhang Z, Feng K, Deng Y. 2022. The large-scale spatial patterns of ecological networks between phytoplankton and zooplankton in coastal marine ecosystems. Sci Total Environ 827:154285. doi: 10.1016/j.scitotenv.2022.154285 [DOI] [PubMed] [Google Scholar]
  • 18. Li LG, Xia Y, Zhang T. 2017. Co-occurrence of antibiotic and metal resistance genes revealed in complete genome collection. ISME J 11:651–662. doi: 10.1038/ismej.2016.155 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Ju F, Li B, Ma L, Wang Y, Huang D, Zhang T. 2016. Antibiotic resistance genes and human bacterial pathogens: co-occurrence, removal, and enrichment in municipal sewage sludge digesters. Water Res 91:1–10. doi: 10.1016/j.watres.2015.11.071 [DOI] [PubMed] [Google Scholar]
  • 20. Pan X, Lin L, Zhang W, Dong L, Yang Y. 2020. Metagenome sequencing to unveil the resistome in a deep subtropical lake on the Yunnan-Guizhou Plateau, China. Environ Pollut 263:114470. doi: 10.1016/j.envpol.2020.114470 [DOI] [PubMed] [Google Scholar]
  • 21. Wu J, Su Y, Deng Y, Guo Z, Mao C, Liu G, Xu L, Cheng C, Bei L, Feng J. 2019. Prevalence and distribution of antibiotic resistance in marine fish farming areas in Hainan, China. Sci Total Environ 653:605–611. doi: 10.1016/j.scitotenv.2018.10.251 [DOI] [PubMed] [Google Scholar]
  • 22. Liu J, Taft DH, Maldonado-Gomez MX, Johnson D, Treiber ML, Lemay DG, DePeters EJ, Mills DA. 2019. The fecal resistome of dairy cattle is associated with diet during nursing. Nat Commun 10:4406. doi: 10.1038/s41467-019-12111-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Munk P, Knudsen BE, Lukjancenko O, Duarte ASR, Van Gompel L, Luiken REC, Smit LAM, Schmitt H, Garcia AD, Hansen RB, Petersen TN, Bossers A, Ruppé E, EFFORT Group, Lund O, Hald T, Pamp SJ, Vigre H, Heederik D, Wagenaar JA, Mevius D, Aarestrup FM. 2018. Abundance and diversity of the faecal resistome in slaughter pigs and broilers in nine European countries. Nat Microbiol 3:898–908. doi: 10.1038/s41564-018-0192-9 [DOI] [PubMed] [Google Scholar]
  • 24. Hendriksen RS, Munk P, Njage P, van Bunnik B, McNally L, Lukjancenko O, Röder T, Nieuwenhuijse D, Pedersen SK, Kjeldgaard J, et al. 2019. Global monitoring of antimicrobial resistance based on metagenomics analyses of urban sewage. Nat Commun 10:1124. doi: 10.1038/s41467-019-08853-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Martiny H-M, Munk P, Brinch C, Szarvas J, Aarestrup F, Petersen TN. 2022. Global distribution of mcr gene variants in 214,095 metagenomic samples. mSystems. doi: 10.21203/rs.3.rs-640935/v2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Karkman A, Do TT, Walsh F, Virta MPJ. 2018. Antibiotic-resistance genes in waste water. Trends Microbiol 26:220–228. doi: 10.1016/j.tim.2017.09.005 [DOI] [PubMed] [Google Scholar]
  • 27. Martiny H-M, Munk P, Brinch C, Aarestrup FM, Petersen TN. 2022. A curated data resource of 214K metagenomes for characterization of the global antimicrobial resistome. PLoS Biol 20:e3001792. doi: 10.1371/journal.pbio.3001792 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Aitchison J. 1982. The statistical analysis of compositional data. J R Stat Soc Ser B 44:139–160. doi: 10.1111/j.2517-6161.1982.tb01195.x [DOI] [Google Scholar]
  • 29. Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. 2017. Microbiome datasets are compositional: and this is not optional. Front Microbiol 8:2224. doi: 10.3389/fmicb.2017.02224 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Friedman J, Alm EJ. 2012. Inferring correlation networks from genomic survey data. PLoS Comput Biol 8:e1002687. doi: 10.1371/journal.pcbi.1002687 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Leinonen R, Akhtar R, Birney E, Bower L, Cerdeno-Tárraga A, Cheng Y, Cleland I, Faruque N, Goodgame N, Gibson R, Hoad G, Jang M, Pakseresht N, Plaister S, Radhakrishnan R, Reddy K, Sobhany S, Ten Hoopen P, Vaughan R, Zalunin V, Cochrane G. 2011. The European nucleotide archive. Nucleic Acids Res 39:D28–D31. doi: 10.1093/nar/gkq967 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Bushnell Brian. 2014. BBMap: A Fast, Accurate, Splice-Aware Aligner.
  • 33. Clausen PTLC, Aarestrup FM, Lund O. 2018. Rapid and precise alignment of raw reads against redundant databases with KMA. BMC Bioinformatics 19:307. doi: 10.1186/s12859-018-2336-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Zankari E, Hasman H, Cosentino S, Vestergaard M, Rasmussen S, Lund O, Aarestrup FM, Larsen MV. 2012. Identification of acquired antimicrobial resistance genes. J Antimicrob Chemother 67:2640–2644. doi: 10.1093/jac/dks261 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO. 2013. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Res 41:D590–D596. doi: 10.1093/nar/gks1219 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Edgar RC, Bateman A. 2010. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26:2460–2461. doi: 10.1093/bioinformatics/btq461 [DOI] [PubMed] [Google Scholar]
  • 37. Munk P, Brinch C, Møller FD, Petersen TN, Hendriksen RS, Seyfarth AM, Kjeldgaard JS, Svendsen CA, van Bunnik B, Berglund F, Global Sewage Surveillance Consortium, Larsson DGJ, Koopmans M, Woolhouse M, Aarestrup FM. 2022. Genomic analysis of sewage from 101 countries reveals global landscape of antimicrobial resistance. Nat Commun 13:7251. doi: 10.1038/s41467-022-34312-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Martiny H-M, Munk P, Brinch C, Szarvas J, Aarestrup FM, Petersen TN. 2022. Global distribution of mcr gene variants in 214K metagenomic samples. mSystems 7:e0010522. doi: 10.1128/msystems.00105-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Team, R. C . 2021. R: a language and environment for statistical computing.
  • 40. Csardi G, Nepusz T. 2006. The igraph software package for complex network research. Inter J Complex Sy 1695 [Google Scholar]
  • 41. Epskamp S, Cramer AOJ, Waldorp LJ, Schmittmann VD, Borsboom D. 2012. qgraph: network visualizations of relationships in psychometric data. J Stat Soft 48:1–18. doi: 10.18637/jss.v048.i04 [DOI] [Google Scholar]
  • 42. Pedersen TL. 2021. ggraph: an implementation of grammar of graphics for graphs and networks
  • 43. Fruchterman TMJ, Reingold EM. 1991. Graph drawing by force-directed placement. Softw Pract Exp 21:1129–1164. doi: 10.1002/spe.4380211102 [DOI] [Google Scholar]
  • 44. Barrat A, Barthélemy M, Pastor-Satorras R, Vespignani A. 2004. The architecture of complex weighted networks. Proc Natl Acad Sci U S A 101:3747–3752. doi: 10.1073/pnas.0400087101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Parente E, Zotta T, Faust K, De Filippis F, Ercolini D. 2018. Structure of association networks in food bacterial communities. Food Microbiol 73:49–60. doi: 10.1016/j.fm.2017.12.010 [DOI] [PubMed] [Google Scholar]
  • 46. Wasserman S, Faust K. 1994. Social network analysis. In Social network analysis: methods and applications. Cambridge University Press. [Google Scholar]
  • 47. Silver NC, Dunlap WP. 1987. Averaging correlation coefficients: should Fisher’s Z transformation be used? J Appl Psychol 72:146–148. doi: 10.1037/0021-9010.72.1.146 [DOI] [Google Scholar]
  • 48. Freitas AR, Tedim AP, Almeida-Santos AC, Duarte B, Elghaieb H, Abbassi MS, Hassen A, Novais C, Peixe L. 2022. High-resolution genotyping unveils identical ampicillin-resistant Enterococcus faecium strains in different sources and countries: a one health approach. Microorganisms 10:632. doi: 10.3390/microorganisms10030632 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Giovanetti E, Brenciani A, Lupidi R, Roberts MC, Varaldo PE. 2003. Presence of the tet(O) gene in erythromycin- and tetracycline-resistant strains of Streptococcus pyogenes and linkage with either the mef(A) or the erm(A) gene. Antimicrob Agents Chemother 47:2844–2849. doi: 10.1128/AAC.47.9.2844-2849.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Trongjit S, Chuanchuen R. 2021. Whole genome sequencing and characteristics of Escherichia coli with coexistence of ESBL and mcr genes from pigs. PLoS One 16:e0260011. doi: 10.1371/journal.pone.0260011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. WHO . 2018. Critically important antimicrobials for human medicine, 6th revision. [Google Scholar]
  • 52. Arredondo A, Blanc V, Mor C, Nart J, León R. 2020. Tetracycline and multidrug resistance in the oral microbiota: differences between healthy subjects and patients with periodontitis in Spain. J Oral Microbiol 13:1847431. doi: 10.1080/20002297.2020.1847431 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Nielsen HUK, Hammerum AM, Ekelund K, Bang D, Pallesen LV, Frimodt-Møller N. 2004. Tetracycline and macrolide co-resistance in Streptococcus pyogenes: co-selection as a reason for increase in macrolide-resistant S. pyogenes? Microb Drug Resist 10:231–238. doi: 10.1089/mdr.2004.10.231 [DOI] [PubMed] [Google Scholar]
  • 54. Ardic N, Ozyurt M, Sareyyupoglu B, Haznedaroglu T. 2005. Investigation of erythromycin and tetracycline resistance genes in methicillin-resistant staphylococci. Int J Antimicrob Agents 26:213–218. doi: 10.1016/j.ijantimicag.2005.06.013 [DOI] [PubMed] [Google Scholar]
  • 55. Stepanauskas R, Glenn TC, Jagoe CH, Tuckfield RC, Lindell AH, King CJ, McArthur JV. 2006. Coselection for microbial resistance to metals and antibiotics in freshwater microcosms. Environ Microbiol 8:1510–1514. doi: 10.1111/j.1462-2920.2006.01091.x [DOI] [PubMed] [Google Scholar]
  • 56. Gao P, Munir M, Xagoraraki I. 2012. Correlation of tetracycline and sulfonamide antibiotics with corresponding resistance genes and resistant bacteria in a conventional municipal wastewater treatment plant. Sci Total Environ 421–422:173–183. doi: 10.1016/j.scitotenv.2012.01.061 [DOI] [PubMed] [Google Scholar]
  • 57. Zhang MQ, Yuan L, Li ZH, Zhang HC, Sheng GP. 2019. Tetracycline exposure shifted microbial communities and enriched antibiotic resistance genes in the aerobic granular sludge. Environ Int 130:104902. doi: 10.1016/j.envint.2019.06.012 [DOI] [PubMed] [Google Scholar]
  • 58. Tarek MH, Garner E. 2023. A proposed framework for the identification of indicator genes for monitoring antibiotic resistance in wastewater: insights from metagenomic sequencing. Sci Total Environ 854:158698. doi: 10.1016/j.scitotenv.2022.158698 [DOI] [PubMed] [Google Scholar]
  • 59. Blanchet FG, Cazelles K, Gravel D. 2020. Co-occurrence is not evidence of ecological interactions. Ecol Lett 23:1050–1063. doi: 10.1111/ele.13525 [DOI] [PubMed] [Google Scholar]
  • 60. Kurtz ZD, Müller CL, Miraldi ER, Littman DR, Blaser MJ, Bonneau RA. 2015. Sparse and compositionally robust inference of microbial ecological networks. PLOS Comput Biol 11:e1004226. doi: 10.1371/journal.pcbi.1004226 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Nagata N, Nishijima S, Miyoshi-Akiyama T, Kojima Y, Kimura M, Aoki R, Ohsugi M, Ueki K, Miki K, Iwata E, Hayakawa K, Ohmagari N, Oka S, Mizokami M, Itoi T, Kawai T, Uemura N, Hattori M. 2022. Population-level metagenomics uncovers distinct effects of multiple medications on the human gut microbiome. Gastroenterology 163:1038–1052. doi: 10.1053/j.gastro.2022.06.070 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reviewer comments
reviewer-comments.pdf (124.3KB, pdf)
Supplementary material. spectrum.04108-23-s0001.pdf.

Table S1; Fig. S1 to S8.

DOI: 10.1128/spectrum.04108-23.SuF1
OPEN PEER REVIEW. reviewer-comments.pdf.

An accounting of the reviewer comments and feedback.

reviewer-comments.pdf (124.3KB, pdf)
DOI: 10.1128/spectrum.04108-23.SuF2

Data Availability Statement

The matrix of raw ARG read counts is available at https://doi.org/10.5281/zenodo.6519843 . The code used to run the analysis and create figures and the SparCC output files are available at https://github.com/hmmartiny/global_resistome_correlations. Classifications of antimicrobial importance were retrieved from the 6th revision of critically important antimicrobials (CIAs) for human medicine from https://www.who.int/publications/i/item/9789241515528, accessed 10 October 2022.


Articles from Microbiology Spectrum are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES