Skip to main content
PLOS One logoLink to PLOS One
. 2020 Feb 12;15(2):e0228877. doi: 10.1371/journal.pone.0228877

Phylogenetic analysis reveals an ancient gene duplication as the origin of the MdtABC efflux pump

Kamil Górecki 1, Megan M McEvoy 1,2,3,*
Editor: William M Shafer4
PMCID: PMC7015380  PMID: 32050009

Abstract

The efflux pumps from the Resistance-Nodulation-Division family, RND, are main contributors to intrinsic antibiotic resistance in Gram-negative bacteria. Among this family, the MdtABC pump is unusual by having two inner membrane components. The two components, MdtB and MdtC are homologs, therefore it is evident that the two components arose by gene duplication. In this paper, we describe the results obtained from a phylogenetic analysis of the MdtBC pumps in the context of other RNDs. We show that the individual inner membrane components (MdtB and MdtC) are conserved throughout the Proteobacterial species and that their existence is a result of a single gene duplication. We argue that this gene duplication was an ancient event which occurred before the split of Proteobacteria into Alpha-, Beta- and Gamma- classes. Moreover, we find that the MdtABC pumps and the MexMN pump from Pseudomonas aeruginosa share a close common ancestor, suggesting the MexMN pump arose by another gene duplication event of the original Mdt ancestor. Taken together, these results shed light on the evolution of the RND efflux pumps and demonstrate the ancient origin of the Mdt pumps and suggest that the core bacterial efflux pump repertoires have been generally stable throughout the course of evolution.

Introduction

The resistance-nodulation-division efflux pumps (RNDs) comprise a large family of proteins, widely distributed among bacterial species [1,2]. Their main function is to extrude superfluous or harmful substances, such as metabolites, antibiotics, toxins, or metal ions. Some RNDs are also believed to be involved in export of siderophores and quorum sensing molecules [3,4], and there may be additional functions to be discovered, given the fact that the vast majority of RND pumps have not been characterized [5]. In general, the RNDs are divided into two groups depending on the substrates they transport: hydrophobic and amphiphilic efflux (HAE) and heavy metal efflux (HME).

Virtually all bacteria contain multiple RND assemblies with often at least partially overlapping functions. For instance, Escherichia coli contains six RNDs in its genome (five HAEs, transporting a broad range of substrates, and one HME, transporting Cu and Ag) [6], while the opportunistic pathogen Pseudomonas aeruginosa can contain up to 13 different RND systems, depending on the strain [7]. This abundance remains a puzzle. While in E. coli deletion of all RNDs results in drastic changes in the phenotype and seriously decreased ability to grow, deletion of one or two RND systems does not seem to have a strong effect (with the exception of the HME Cu-transporting Cus system, which is required for Cu-resistance) [6]. These results suggest functional overlap between the RND systems, and the pumps may be expressed depending on circumstances like exponential/stationary phase or aerobic/anaerobic conditions.

Classically, the efflux system is formed as a tripartite assembly [8]. Most RND systems share the same architecture, with an RND homotrimer in the inner membrane bound to six protomers of a membrane fusion protein (MFP) in the periplasm, which in turn connect the assembly with a trimer of outer membrane proteins (OMP). However, there are exceptions. In the MdtABC (multidrug transport) system from E. coli the inner membrane part is formed by a heterotrimer of MdtB2C1 stoichiometry [9]. A similar system called MuxABC was described in P. aeruginosa, with the MuxA, MuxB and MuxC proteins being homologous to MdtA, MdtB and MdtC, respectively (40, 65 and 61% sequence identity, and 78, 91 and 88% sequence similarity between the corresponding proteins) [10,11]. Other homologous systems were found and characterized in Salmonella enterica Serovar Typhimurium [12], Serratia marcescens [13], Erwinia amylovora [14], Pseudomonas putida [15], and Photorhabdus luminescens [16].

There are conflicting reports in the literature in regard to functional flexibility between the two subunits. Kim et al. reported that deletion of MdtC, but not MdtB, completely abolished the function of the Mdt system [9], while Da Wang and Fierke showed the opposite to be true [17]. It is possible there is a partial functional overlap between the two proteins, yet both subunits are needed for full function. Interestingly, the Mdt system has been shown to be able to facilitate both heavy metal and hydrophobic and amphiphilic efflux, and the heterogeneity of the inner membrane components may be a source of this promiscuity [17].

The evolutionary history of the two-RND subunit systems such as MdtBC remains unknown. While it may be hypothesized they arose originally through a gene duplication of the progenitor Mdt gene, both due to their high sequence similarity (e.g. 50% between MdtB and MdtC, compared to 25–30% between MdtB and other RNDs in E. coli) and their adjacent positions in genomes, it is not known if this gene duplication happens commonly in bacterial genomes or if it is rather an older phenomenon. The two-RND subunit systems from E. coli and P. aeruginosa are quite similar to each other, with higher homology between MdtB and MuxB, and between MdtC and MuxC, than between the proteins from the same organisms. This observation suggests that the original RND gene duplication might indeed be an infrequent older phenomenon, and not a widespread feature happening frequently in bacterial genomes.

Within the highly diverse Proteobacteria, Epsilonproteobacteria separated earliest from the rest, in an event placed at around 2.8 bln years ago by Battistuzzi and Hedges [25]. Subsequently, Deltaproteobacteria split from the rest of the lineage around 2.6 bln years ago, and Alphaproteobacteria around 2.4 bln years ago. The split between the two last groups, Beta- and Gammaproteobacteria, happened the latest, around 1.6 bln years ago. We set out to investigate how the phylogenetics of the Mdt proteins compares to the evolution of the phylum, in order to shed light on the evolutionary history of the Mdt systems. We thus performed a number of phylogenetic analyses and present the results in this paper.

Materials and methods

Phylogenetic analyses

In order to place the Mdt proteins in the context of other RNDs, the RND sequences from the work of Godoy et al. were used [5]. Out of over 2000 sequences there, 1106 were identified in UniProt (a full list is provided in the S1 File). These sequences were aligned with MAFFT using the default settings [18]. The alignment was then used to construct a phylogenetic tree based on all non-gapped positions and using neighborhood joining. The heterogeneity among sites was estimated by the MAFFT algorithm and the bootstrap values were calculated from 100 replicates.

The sequences that clustered together with E. coli MdtB and MdtC (and E. coli AcrB as an outgroup) were aligned with MAFFT using G-INS-i, an iterative refinement method, and a phylogenetic tree was constructed using neighborhood joining (NJ) of all of gap-free sites (JTT substitution model, the heterogeneity among sites was estimated by MAFFT, and bootstrap of 100 was used) [18]. The tree was then rooted on AcrB.

Sequence similarity network and genomic neighborhood diagrams

The sequence similarity network (SSN) was generated with the Enzyme Similarity Tool (ESI-EST) and visualized with Cytoscape [2024], with an alignment score of 200. The genomic neighborhoods of the genes in Fig 2 were investigated with the Gene Neighborhood Tool (ESI-GNT) [2023], and visualized together with the phylogenetic trees in iTOL [19].

Fig 2. A phylogenetic tree of the Mdt-like proteins.

Fig 2

The bootstrap values are represented by a branch color as in Fig 1 (the branches with bootstrap support lower than 50 were not collapsed, in order to show the genomic neighborhood of these genes). The taxonomy of each organism is represented with shading of the labels. To the right of the protein and organism names the genomic context is presented. The actual protein at the leaf tip is represented with a filled symbol: a dark red star for “progenitor-like” RNDs, a red star for “true” MexN-like proteins, a blue star for MdtB-like proteins and an orange star for MdtC-like proteins). The open symbols provide the genomic context for the RNDs. For example, in the second row, Thiobacillus denitrificans, the lack of symbols under “MdtABC” means there are no proteins from this group present in this organism. Further to the right under “MexMN”, an open green square means there is an OMP present, followed by an MFP (an open purple triangle), and an RND (a closed star). Lack of symbols under “OMP” and “MFS” means there are no further proteins in this set of genes.

The sequences for membrane fusion proteins and outer membrane proteins were identified with the help of the ESI-GNT, and the further analysis was done in the same way as for RND proteins, using E. coli AcrA and TolC as outgroups, respectively. The phylogenetic trees were visualized with iTOL.

Results & discussion

RNDs form a number of distinct clusters

The comparison of over 1000 sequences of RND proteins, previously identified by Godoy et al. [5], was performed in order to divide them into functional groups, and thus clarify their possible evolutionary origins. In particular, we were interested in how the Mdt system is placed in relation to the better characterized efflux pumps like Acr, Mex (HAE) or Cus (HME). Since constructing reliable sequence alignments of large proteins containing both transmembrane helices and large periplasmic domains can be difficult, we also generated a sequence similarity network (SSN), to visualize direct relationships between the sequences [2024].

There was a high similarity between the results obtained with the traditional phylogenetic analysis and the SSN. As seen in Fig 1, most proteins formed several large branches and clusters, with a smaller number remaining separated. The largest cluster (cluster 1) encompassed most of the characterized RNDs (all HAEs from E. coli). The less studied RNDs from P. aeruginosa clustered as MexI/W (cluster 2) and TriC/MexK (cluster 5). As expected, the HME proteins clustered together, with a further subdivision into mono- and di- valent transporting RNDs (cluster 3).

Fig 1. Analysis of >1000 RND sequences.

Fig 1

Left, an unrooted phylogenetic tree, with bootstrap values represented as branch colors. Leaves are labelled with colors corresponding to their clusters (black represents proteins not belonging to the colored clusters, or singletons). Right, sequence similarity network. Clusters containing more than 10 sequences are colored and given a number. P. aeruginosa and E. coli proteins are marked with thick circles, black and white, respectively. On both panels P. aeruginosa and E. coli proteins are additionally marked with their abbreviations (E. coli proteins underlined).

The Mdt proteins formed a distinct cluster (cluster 4), with one of the longest branches from the middle in the phylogenetic tree. The MdtB-like and MdtC-like proteins split early in the phylogenetic tree. The MdtB-like proteins, which are always directly adjacent to their respective MFPs, clustered together into one branch. The MdtC-like protein, which are never directly adjacent to their respective MFPs (i.e. there is always an MdtB-like protein in between), also clustered together into one branch. The fact that the gene organization has been preserved corroborates the notion that this heteromeric RND system is a result of an ancient gene duplication. This relationship seems to be very old, since Alpha-, Beta- and Gammaproteobacterial MdtBs and MdtCs form separate clusters, so that would put this duplication event to be older than the split between the major groups of Proteobacteria (over 2 billion years ago [25]).

Surprisingly, the branch/cluster containing Mdt-like proteins also included other RNDs, notably the MexN from P. aeruginosa and its homologues from other Pseudomonodales, as well as a number of other proteins. To investigate if this was an artefact caused by aligning a large number of sequences, we performed a new multiple sequence alignment with these 126 sequences, with E. coli AcrB as an outgroup. The results are shown in Fig 2, together with their genomic neighborhoods.

A closer look into the Mdt cluster reveals the evolutionary history of the subfamily

As observed in the analysis of all RNDs (Fig 1), the MexN-like proteins clustered together with MdtB- and MdtC-like proteins (Fig 2). However, an interesting observation was that the MexN-like proteins were divided into two distinct groups. The first group was formed by MexN-like proteins from strains that also contained an MdtBC system. These MexN-like proteins clustered together with MdtB-like proteins, suggesting their common evolutionary origin (i.e. these MexN-like proteins and MdtB-like proteins are descendant from one of the originally duplicated genes). The second group was formed by MexN-like proteins from strains that did not contain an MdtBC system, and these MexN-like proteins separated from the rest of the tree before the split between MdtB- and MdtC-like proteins. Because this second group of RNDs split earliest from the rest, it is likely that they are directly descendant from the progenitor single RND, and no gene duplication occurred during their evolution. Since this subset of MexN-like proteins never underwent the gene duplication event, we subsequently named them “progenitor-like” RNDs in order to distinguish them from the “true” MexN-like proteins, with “true” meaning here “clustering together with P. aeruginosa MexN and therefore having the same evolutionary history”.

The “progenitor-like” RND group contained all Deltaproteobacterial sequences represented in our analysis, as well as the only sequence from a non-Proteobacterium, Gloeobacter violaceus, a Cyanobacterium (Fig 2). The fact that these “progenitor-like” RNDs did not cluster together with known MdtB- and MdtC-like proteins suggests they are direct descendants of the ancient common ancestor of the whole Mdt cluster, the progenitor gene. We also performed searches for MdtB- and MdtC-like proteins (i.e. having sequence similarity at least 40%) in Deltaproteobacteria and found only three hits, suggesting the MdtBC-like systems are virtually absent in these two groups. A number of Alpha- and Betaproteobacterial orders contained the “progenitor-like” RNDs, but no Gammaproteobacteria did. Interestingly, all the Alpha- and Betaproteobacterial representants can fix nitrogen and/or reduce nitrate, suggesting a common habitat [26]. The fact that all the older bacterial lineages appeared in this group suggests that the original gene duplication that produced MdtB- and MdtC-like proteins occurred in the common ancestor of the Alpha-, Beta- and Gammaproteobacteria, around the end of the Archean Eon [25], and the sporadic occurrence of a “progenitor-like” RND in Alpha- and Betaproteobacteria is more likely a result of a horizontal gene transfer.

The rest of the RNDs formed two groups, with all the MdtB-like proteins in one and all the MdtC-like proteins in the other. Noticeably, the “true” MexN-like proteins clustered together with the MdtB-like proteins. This observation suggested the MexN separation happened after the original gene duplication that formed MdtB and MdtC from the progenitor RND gene. In general, the branching of both MdtB and MdtC groups was similar: Alphaproteobacteria separated earliest (with the exception of Gluconobacter oxydans and Zymomonas mobilis, see below), and then Beta- and Gammaproteobacteria. Surprisingly, the Gammaproteobacterial order Xanthomonodales separated together with Alphaproteobacteria (both in the MdtB- and the MdtC-like groups, with moderate to low bootstrap support, however). In Alphaproteobacteria, homologs of MdtBC/MexN were numerously found only in orders Rhizobiales and Rhodospirillales, and sporadically in a few other orders. In Betaproteobacteria, homologs of MdtBC/MexN were widespread and found in all major orders, and in Gammaproteobacteria homologs of MdtBC/MexN were found in most orders. In all three major Proteobacterial families there were examples of closely related species and strains where one contained MdtBC, MexN or both, and the other with no MdtBC/MexN homologs. In many organisms it was also suspected the process of losing the RND pumps was ongoing. For instance, in Shigella flexneri, Serratia marcescens, Pseudomonas syringae pv tomato and Magnetospirillum magneticum an MdtB was missing; in Salmonella paratyphi A an MFP was missing; and in Burkholderia mallei the whole MdtABC operon was absent (see S1 File for details).

The genomic neighborhoods provided additional insights into the evolutionary history of the Mdt systems. Among the MdtBC systems, many contained OMP components, and the architecture was conserved in the main groups: in Alphaproteobacteria the OMP preceded the MFP, and in Beta- and Gammaproteobacteria it followed the MdtC protein. It is likely that the OMP components were acquired after the original gene duplication and this acquisition happened separately, once in Alphaproteobacteria, and once in a common ancestor to Beta- and Gammaproteobacteria, and in many cases it was subsequently lost (see S1 File for details). Moreover, all Enterobacterales possessed an additional inner membrane protein from the Major Facilitator Superfamily (MFS), called MdtD in E. coli, an iron and citrate exporter [27], and no outer membrane proteins. The order Enterobacterales is an example of how the outer membrane channel function had converged on just one protein (e.g. TolC in E. coli), and the redundant outer membrane components of RND systems are removed from the genomes (with the exception of specialized functions, e.g. E. coli CusC as an outer membrane component for the Cu-exporting Cus system). The outer membrane proteins were also missing in the order Xanthomonodales and sporadically in other organisms. Notably, the Burkholderia MexMNs also contained an MFS, not related to other MFSs observed here.

Horizontal gene transfers

The exception to the observation that organisms containing a “progenitor-like” RND did not contain an MdtBC system occurred in Cupriavidus pinatubonensis (Betaproteobacteria, order Burkholderiales), and one of two strains of Rhodopseudomonas palustris, namely strain HaA2 (Alphaproteobacteria, order Rhizobiales). These two organisms possessed both an MdtBC-like system, similar to other Proteobacteria in their respective groups, and a “progenitor-like” RND, likely a result of a horizontal gene transfer. The C. pinatubonensis RND showed close similarity to an RND from Nitrosospira multiformis, a distantly related Betaproteobacterium (order Nitrosomonadales), and their “progenitor-like” RNDs grouped together with other “progenitor-old” RNDs. The R. palustris HaA2 strain possibly lost the original MexN-like system and incorporated a “progenitor-like” RND, judging from its genomic contexts (see S1 File). The other R. palustris strain, ATCC BAA-871, did not contain a “progenitor-like” RND system, and its other MdtBC- and MexN-like proteins behaved as its relatives in other Alphaproteobacteria.

A number of sequences originally clustering with other Mdts in Fig 1 did not align well and in consequence showed poor or unresolved phylogeny with low bootstrap values regardless of the methods used and were therefore removed from the analysis prior to the results shown in Fig 2. These sequences are described in the S1 File.

Proteins from two Alphaproteobacteria, Gluconobacter oxydans and Zymomonas mobilis, did not cluster together with other Alphaproteobacterial Mdts, but were found closest to respective proteins from the order Burkholderia. While the long branches observed for all four proteins as well as moderate bootstrap values might render this clustering less reliable, it is possible those two organisms had lost their original Mdts and acquired new ones via horizontal gene transfer. Moreover, G. oxydans possesses a third protein with high sequence similarity to its own MdtC (not shown in Fig 2, see S1 File). It is likely a result of a discrete gene duplication, particularly since this third gene does not possess an MFP.

As mentioned above, the order Xanthomonadales clustered somewhat reliably with Alphaproteobacteria, both in MdtB- and MdtC-like groups. They did not possess a third RND, either a “true” MexN-like protein or a “progenitor-like” RND. Since they separated the earliest from other Gammaproteobacteria, it is possible their ancestors lost both their original MdtABC and MexMN systems, and subsequently incorporated an MdtABC from an Alphaproteobacterium [28].

Reconstructing the MdtABC/MexMN evolution

The results described here, together with analysis of corresponding MFPs (see S1 File) made it possible to propose an evolutionary scenario for the appearance of MdtBC and MexN pumps (Fig 3). The original RND progenitor gene underwent a duplication in the common ancestor to Alpha-, Beta- and Gammaproteobacteria, while remaining single in other bacterial groups (as “progenitor-like” RNDs). The MFP and the adjacent RND were duplicated, forming the “true” MexMN system. In the next step an OMP was acquired, and was inserted before the MFP in Alphaproteobacteria, or after the MdtC in the common ancestor to the Beta- and Gammaproteobacteria. From these points many organisms lost the MexMN system. In Alphaproteobacteria only two orders represented in Figs 1 and 2 retained the original genes. Many Betaproteobacteria retained the OMPs (occurring always after the MdtCs) but lost the duplicated MexMN, with the exception of the Burkholderia genus, which lost the OMPs, but retained the MexMN and also gained an MFS next to it. In Gammaproteobacteria the configuration was generally kept intact, with the exception of Enterobacterales, which lost the MexMN and incorporated an MFS into its MdtABC operon.

Fig 3. The proposed evolutionary scenario.

Fig 3

The evolution within the Alpha-, Beta- and Gamma- proteobacteria groups is shown, as deduced from the phylogenetic tree in Fig 2 and a timeline of evolution of Proteobacteria (Battistuzzi, Feijao and Hedges 2004). The cladograms lengths and timepoints of evolutionary events are not to scale. As an example of horizontal gene transfer, Cupriavidus pinatubonensis is also shown.

Supporting information

S1 File. Supporting information containing the list of used sequences, as well as detailed discussion, is available.

(DOCX)

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1.Nikaido H. RND transporters in the living world. Res Microbiol. 2018;169: 363–371. 10.1016/j.resmic.2018.03.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Alvarez-Ortega C, Olivares J, Martínez JL. RND multidrug efflux pumps: what are they good for? Front Microbiol. 2013;4 10.3389/fmicb.2013.00007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Horiyama T, Nishino K. AcrB, AcrD, and MdtABC Multidrug Efflux Systems Are Involved in Enterobactin Export in Escherichia coli. PLoS ONE. 2014;9: e108642 10.1371/journal.pone.0108642 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Dou Y, Song F, Guo F, Zhou Z, Zhu C, Xiang J, et al. Acinetobacter baumannii quorum-sensing signalling molecule induces the expression of drug-resistance genes. Mol Med Rep. 2017;15: 4061–4068. 10.3892/mmr.2017.6528 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Godoy P, Molina-Henares AJ, La Torre De J, Duque E, Ramos JL. Characterization of the RND family of multidrug efflux pumps: in silico to in vivo confirmation of four functionally distinct subgroups. Microb Biotechnol. 2010;3: 691–700. 10.1111/j.1751-7915.2010.00189.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Anes J, McCusker MP, Fanning S, Martins M. The ins and outs of RND efflux pumps in Escherichia coli. Front Microbiol. 2015;6 10.3389/fmicb.2015.00587 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Poole K. Pseudomonas aeruginosa: Resistance to the Max. Front Microbiol. 2011;2 10.3389/fmicb.2011.00065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Du D, Wang Z, James NR, Voss JE, Klimont E, Ohene-Agyei T, et al. Structure of the AcrAB–TolC multidrug efflux pump. Nature. 2014;509: 512–515. 10.1038/nature13205 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Kim H-S, Nagore D, Nikaido H. Multidrug efflux pump MdtBC of Escherichia coli is active only as a B2C heterotrimer. J Bacteriol. 2010;192: 1377–1386. 10.1128/JB.01448-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Mima T, Kohira N, Li Y, Sekiya H, Ogawa W, Kuroda T, et al. Gene cloning and characteristics of the RND-type multidrug efflux pump MuxABC-OpmB possessing two RND components in Pseudomonas aeruginosa. Microbiology (Reading, Engl) 2009;155: 3509–3517. 10.1099/mic.0.031260-0 [DOI] [PubMed] [Google Scholar]
  • 11.Yang L, Chen L, Shen L, Surette M, Duan K. Inactivation of MuxABC-OpmB Transporter System in Pseudomonas aeruginosa Leads to Increased Ampicillin and Carbenicillin Resistance and Decreased Virulence. J Microbiol. 2011;49: 107–114. 10.1007/s12275-011-0186-2 [DOI] [PubMed] [Google Scholar]
  • 12.Nishino K, Nikaido E, Yamaguchi A. Regulation of Multidrug Efflux Systems Involved in Multidrug and Metal Resistance of Salmonella enterica Serovar Typhimurium. J Bacteriol. 2007;189: 9066–9075. 10.1128/JB.01045-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Begic S, Worobec EA. Characterization of the Serratia marcescens SdeCDE multidrug efflux pump studied via gene knockout mutagenesis. Can J Microbiol. 2008;54: 411–416. 10.1139/w08-019 [DOI] [PubMed] [Google Scholar]
  • 14.Pletzer D, Weingart H. Characterization and regulation of the resistance-nodulation-cell division-type multidrug efflux pumps MdtABC and MdtUVW from the fire blight pathogen Erwinia amylovora. BMC Microbiol. 2014;14: 185 10.1186/1471-2180-14-185 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Henríquez T, Stein NV, Jung H. PvdRT-OpmQ and MdtABC-OpmB efflux systems are involved in pyoverdine secretion in Pseudomonas putida KT2440. Environ Microbiol Rep. 2018;33: 6391 10.1111/1758-2229.12708 [DOI] [PubMed] [Google Scholar]
  • 16.Khattar ZA, Lanois A, Hadchity L, Gaudriault S, Givaudan A. Spatiotemporal expression of the putative MdtABC efflux pump of Phototorhabdus luminescens occurs in a protease-dependent manner during insect infection. PLoS ONE 2019;14: e0212077 10.1371/journal.pone.0212077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wang Da, Fierke CA. The BaeSR regulon is involved in defense against zinc toxicity in E. coli. Metallomics. The Royal Society of Chemistry 2013;5: 372–383. 10.1039/C3MT20217H [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20: 1160–1166. 10.1093/bib/bbx108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucl Acids Res. 2019;47: W256–W259. 10.1093/nar/gkz239 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Gerlt JA, Bouvier JT, Davidson DB, Imker HJ, Sadkhin B, Slater DR, et al. Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST): A web tool for generating protein sequence similarity networks. BBA—Proteins Proteom. 2015;1854: 1019–1037. 10.1016/j.bbapap.2015.04.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Gerlt JA. Genomic Enzymology: Web Tools for Leveraging Protein Family Sequence–Function Space and Genome Context to Discover Novel Functions. Biochemistry. 2017;56: 4293–4308. 10.1021/acs.biochem.7b00614 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Zallot R, Oberg NO, Gerlt JA. “Democratized” genomic enzymology web tools for functional assignment. Curr Opin Chem Biol. 2018;47: 77–85. 10.1016/j.cbpa.2018.09.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Zallot R, Oberg N, Gerlt JA. The EFI Web Resource for Genomic Enzymology Tools: Leveraging Protein, Genome, and Metagenome Databases to Discover Novel Enzymes and Metabolic Pathways. Biochemistry. 2019;58: 4169–4182. 10.1021/acs.biochem.9b00735 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11): 2498–2504. 10.1101/gr.1239303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Battistuzzi FU, Hedges SB. A Major Clade of Prokaryotes with Ancient Adaptations to Life on Land. Mol Biol Evol. 2009;26: 335–343. 10.1093/molbev/msn247 [DOI] [PubMed] [Google Scholar]
  • 26.Salinero KK, Keller K, Feil WS, Feil H, Trong S, Di Bartolo G, et al. Metabolic analysis of the soil microbe Dechloromonas aromatica str. RCB: indications of a surprisingly complex life-style and cryptic anaerobic pathways for aromatic degradation. BMC Genomics. 2009;10: 1–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Frawley ER, Crouch M-LV, Bingham-Ramos LK, Robbins HF, Wang W, Wright GD, et al. Iron and citrate export by a major facilitator superfamily pump regulates metabolism and stress resistance in Salmonella Typhimurium. Proc Natl Acad Sci USA. 2013;110: 12054–12059. 10.1073/pnas.1218274110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Williams KP, Gillespie JJ, Sobral BWS, Nordberg EK, Snyder EE, Shallom JM, et al. Phylogeny of Gammaproteobacteria. J Bacteriol. 2010;192: 2305–2314. 10.1128/JB.01480-09 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

William M Shafer

13 Jan 2020

PONE-D-19-34793

Phylogenetic analysis reveals an ancient gene duplication as the origin of the MdtABC efflux pump

PLOS ONE

Dear Dr. Gorecki,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

As you see from the two critiques, both reviewers were highly enthusiastic about your paper, which I share.  Their concerns are minor and editorial in nature.

We would appreciate receiving your revised manuscript by February 13th, 2020. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

William M Shafer, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This manuscript by Gorecki and McEvoy examines the evolutionary history of RND efflux pumps. The authors examine the Mdt efflux system which uniquely consists fo two RND subunits (MdtBC). Through genomic analyses, the work makes a solid case that this two protein arrangement arose through an ancient gene duplication event. Moreover, the authors reveal an evolutionary relationship between MdtBC and the MexN RND subunit. The authors reconstruct a likely history of how these RND subunits evolved via gene duplication and subsequent acquisition of additional membrane fusion proteins and outer membrane proteins. A pleasing outcome of these analyses is the observation that loss of MdtB occurs in preference to loss of MdtC; so, it is likely that MdtC can at least partially perform MdtB function while MtdB may not have the ability to function in stead of MdtC. This findings will inform current thinking on the relative importance of MdtBC subunits.

I only have minor text changes to suggest:

-The difference between “old” and “true” MexN could be better explained. Symbols in Fig 2 also need better explanation - what exactly is the difference between filled and opened symbols?

-Line 237 “Fig. s 1” typo

-Line 238: “post OMPs” is not a very clear description of (I assume) genomic organization.

Reviewer #2: This paper presents an interesting hypothesis for the origin of the two component resistance nodulation division pump, MdtABC. The work is based upon a freely available dataset published in 2010 containing over 2000 RND gene sequences, providing good representation of RND gene sequences found across populations. In general, more explanation is needed throughout the manuscript to clarify the meaning of the text and justify the conclusions drawn. With additional explanation in certain parts, we feel this paper will add to our understanding of the origin and conservation of two component RND pumps, across Gram-negative bacteria.

The grammar and punctuation should be reviewed throughout (e.g. line 144).

The authors state with confidence that MdtABC arose from a gene duplication. In line 65, the authors state that it is “obvious” that MdtBC arose through gene duplication. While we agree this is highly likely, we aren’t sure it is fair to say it obvious without providing any justification or evidence for this. This section should be reworded and justification for this assertion added.

Line 70 The authors refer to the gene duplication event being a “rare older phenomenon”. While we think we understand what the authors are implying we feel this suggests that such gene duplication events would/can no longer occur. Perhaps it would be better to say it is an infrequent occurrence.

The methods section is rather brief and more information is required throughout to improve clarity. For example on line 80, is the algorithm estimated by MAFFT or are the authors referring to something else? Should a reference be added for the Gene Neighbourhood tool? Also in the methods section, the mex genes are mentioned but have not previously been introduced. These should be added to the introduction.

Line 122 – “Indeed, all proteins adjacent to their respective MFPs cluster into one branch, and all the other ones form another branch.” It isn’t clear what it meant by this statement and more detail/explanation is needed to understand the conclusions being drawn from the phylogenetic tree.

The use of “old” to describe ancestral genes is confusing, and slightly misleading. The authors state the origin of a subset of the MexN genes is “likely” older than the gene duplication event so they named them “old”RNDs. We think these should be renamed and that further justification for these assumptions given. We don’t necessarily disagree but it is important that the conclusions drawn from the data are clear and justified.

It is not clear what a ‘typical MdtABC’ pump means? If a particular consensus sequence has been used then it should be stated or referenced.

To help readers understand the relevance of the gene variation between proteobacteria strains, perhaps it would be useful to add some information in the introduction about the evolution of the phylum.

Minor comments

Inconsistent abbreviations are used for proteins mentioned throughout (in lines 13, 16 52, 81, 143, 167, 168). MdtB is frequently referred as so, however, MdtC commonly just referred to as “C” but would be clearer if spelt out as MdtC.

MdtB and MdtC referred to as “Mdt pumps”, yet MexMN that form the same sort of complex referred to as one unit (line 19). Perhaps this could be edited to provide consistency.

Abbreviation of “sequence similarity network” is not consistent. First introduced as SSN on line 87, then described by full name on 90, then mentioned again as “sequence similarity network (SSN)” on 101.

Typo on line 188 - Beta+Gammaproteobacteria?

Both figures 1 and 2 are very blurry the text in figure 2 is very small when printed.

Sentence from 143 to 147 could be clarified. I think if the sentence is split into two, it would be easier to follow and understand the differences between MdtABC positive and negative strains.

On line 152, are the authors referring to figure 2? The text is slightly ambiguous and could be clarified.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Jessica Blair, Hannah Pugh

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Feb 12;15(2):e0228877. doi: 10.1371/journal.pone.0228877.r002

Author response to Decision Letter 0


22 Jan 2020

(as in the Response to reviewers document)

We thank the reviewers for their careful consideration and suggestions to improve the manuscript. We’ve incorporated their suggested changes as described below. We feel that with these changes the manuscript is much improved and hope that it is now considered suitable for publication in PLoS ONE.

Reviewer #1: This manuscript by Gorecki and McEvoy examines the evolutionary history of RND efflux pumps. The authors examine the Mdt efflux system which uniquely consists fo two RND subunits (MdtBC). Through genomic analyses, the work makes a solid case that this two protein arrangement arose through an ancient gene duplication event. Moreover, the authors reveal an evolutionary relationship between MdtBC and the MexN RND subunit. The authors reconstruct a likely history of how these RND subunits evolved via gene duplication and subsequent acquisition of additional membrane fusion proteins and outer membrane proteins. A pleasing outcome of these analyses is the observation that loss of MdtB occurs in preference to loss of MdtC; so, it is likely that MdtC can at least partially perform MdtB function while MtdB may not have the ability to function in stead of MdtC. This findings will inform current thinking on the relative importance of MdtBC subunits.

I only have minor text changes to suggest:

-The difference between “old” and “true” MexN could be better explained.

In accordance to similar comments from Reviewer 2, we modified the text to bring more clarity to the discussed difference and stress the difference between the two sets of MexN-like proteins (see below).

- Symbols in Fig 2 also need better explanation - what exactly is the difference between filled and opened symbols?

We expanded the figure legend to provide more detailed description of the symbols, and also included an example of how to read the figure.

-Line 237 “Fig. s 1” typo

The typo in line 237 was corrected.

-Line 238: “post OMPs” is not a very clear description of (I assume) genomic organization.

We corrected the text in line 238 to clearly reflect that we mean the genomic organization.

Reviewer #2: This paper presents an interesting hypothesis for the origin of the two component resistance nodulation division pump, MdtABC. The work is based upon a freely available dataset published in 2010 containing over 2000 RND gene sequences, providing good representation of RND gene sequences found across populations. In general, more explanation is needed throughout the manuscript to clarify the meaning of the text and justify the conclusions drawn. With additional explanation in certain parts, we feel this paper will add to our understanding of the origin and conservation of two component RND pumps, across Gram-negative bacteria.

The grammar and punctuation should be reviewed throughout (e.g. line 144).

We reread the manuscript and corrected the mistakes.

The authors state with confidence that MdtABC arose from a gene duplication. In line 65, the authors state that it is “obvious” that MdtBC arose through gene duplication. While we agree this is highly likely, we aren’t sure it is fair to say it obvious without providing any justification or evidence for this. This section should be reworded and justification for this assertion added.

We modified the section to reflect the still hypothetical nature of our notion at that point of the manuscript. We also slightly modified the paragraph regarding the results shown in Fig. 1: the fact that all MdtBs and MdtCs clustered together corroborates the notion that they arose from a single gene via gene duplication.

Line 70 The authors refer to the gene duplication event being a “rare older phenomenon”. While we think we understand what the authors are implying we feel this suggests that such gene duplication events would/can no longer occur. Perhaps it would be better to say it is an infrequent occurrence.

We agree, and we changed the wording accordingly from “rare” to “infrequent” in line 70.

The methods section is rather brief and more information is required throughout to improve clarity. For example on line 80, is the algorithm estimated by MAFFT or are the authors referring to something else? Should a reference be added for the Gene Neighborhood tool? Also in the methods section, the mex genes are mentioned but have not previously been introduced. These should be added to the introduction.

We edited the methods section to provide more information: the algorithm was by MAFFT; The Gene Neighborhood Tool is part of the toolset provided by the authors of Enzyme Similarity Tool, we therefore repeated the references to make this clear. And we also clarified which genes neighborhoods were investigated by removing the “mex” acronym in the methods section.

Line 122 – “Indeed, all proteins adjacent to their respective MFPs cluster into one branch, and all the other ones form another branch.” It isn’t clear what it meant by this statement and more detail/explanation is needed to understand the conclusions being drawn from the phylogenetic tree.

We modified this section to better explain the line of thinking. We described how the genomic organization (MdtB being directly adjacent to an MFP) was in agreement with sequence similarities and the conclusion drawn from that fact being the gene duplication happened once and the order of the genes was preserved.

The use of “old” to describe ancestral genes is confusing, and slightly misleading. The authors state the origin of a subset of the MexN genes is “likely” older than the gene duplication event so they named them “old”RNDs. We think these should be renamed and that further justification for these assumptions given. We don’t necessarily disagree but it is important that the conclusions drawn from the data are clear and justified.

We agree with the Reviewer 2 that the word “old” can be misleading, and also took into consideration a similar comment from Reviewer 1. In this paper we aim to distinguish between single subunit RNDs that have a history of gene duplication (like P. aeruginosa MexN, through a gene duplication) and single RNDs that are directly descendant from the original progenitor Mdt gene (and not through any gene duplication). We decided to keep the word “true” for describing the RNDs that are like the P. aeruginosa MexN, in the meaning of having the same evolutionary history (i.e. first via the gene duplication of the progenitor RND into two component RND, and then subsequent MFP+RND1 gene duplication). In order to avoid using a common word “old”, we chose to name the second group “progenitor-like” RNDs, to underline their direct resemblance to the progenitor RND Mdt gene and lack of any gene duplication history. We modified the text throughout accordingly and included more detailed explanation both in the Introduction and Results sections.

It is not clear what a ‘typical MdtABC’ pump means? If a particular consensus sequence has been used then it should be stated or referenced.

By ‘typical’ we meant a pump consisting of two RND subunits. We modified the text in line 154 to reflect that by stating the 40% sequence similarity cut-off.

To help readers understand the relevance of the gene variation between proteobacteria strains, perhaps it would be useful to add some information in the introduction about the evolution of the phylum.

We expanded the last paragraph of the introduction according to the suggestion and described how the Proteobacteria are believed to have evolved.

Minor comments

Inconsistent abbreviations are used for proteins mentioned throughout (in lines 13, 16 52, 81, 143, 167, 168). MdtB is frequently referred as so, however, MdtC commonly just referred to as “C” but would be clearer if spelt out as MdtC.

We spelled out MdtC throughout the text to avoid confusion.

MdtB and MdtC referred to as “Mdt pumps”, yet MexMN that form the same sort of complex referred to as one unit (line 19). Perhaps this could be edited to provide consistency.

We changed the wording to “MdtBC” pumps where we mean “a system containing two RND subunits”, in order to distinguish it from a possible misunderstanding of “Mdt pumps” as “MdtBC + MexN” systems.

Abbreviation of “sequence similarity network” is not consistent. First introduced as SSN on line 87, then described by full name on 90, then mentioned again as “sequence similarity network (SSN)” on 101.

We corrected the inconsistency by changing the wording in lines 87, 90 and 101.

Typo on line 188 - Beta+Gammaproteobacteria?

This is not a typo, we meant here a common ancestor to both Beta- and Gammaproteobacteria (therefore a plus sign). We modified the text to make it clear what we mean by spelling it out as “a common ancestor to both Beta- and Gammaproteobacteria” in line 188.

Both figures 1 and 2 are very blurry the text in figure 2 is very small when printed.

We believe the low quality of the figures is due to PLOS One’s PDF rendering. Upon clickling on “Click here to download Figure X” in the top right corner of a respective figure page, a high-resolution figure can be downloaded. The text size in Fig. 2 can unfortunately not be increased without splitting the figure into two pages. We prefer to avoid that, since the main message of Fig 2. is to show the clustering of proteins of similar genomic organization and comparison to the taxonomy of the organisms. The figures were also uploaded to PACE and adjusted accordingly.

Sentence from 143 to 147 could be clarified. I think if the sentence is split into two, it would be easier to follow and understand the differences between MdtABC positive and negative strains.

We expanded this section to more clearly describe the difference between the various groups of proteins, also to meet the similar request of Reviewer 1.

On line 152, are the authors referring to figure 2? The text is slightly ambiguous and could be clarified.

Yes, we are referring to Figure 2. We modified the text to improve clarity by adding a reference.

Attachment

Submitted filename: Response-to-reviewers.docx

Decision Letter 1

William M Shafer

27 Jan 2020

Phylogenetic analysis reveals an ancient gene duplication as the origin of the MdtABC efflux pump

PONE-D-19-34793R1

Dear Dr. Gorecki,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

William M Shafer, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

William M Shafer

31 Jan 2020

PONE-D-19-34793R1

Phylogenetic analysis reveals an ancient gene duplication as the origin of the MdtABC efflux pump

Dear Dr. Górecki:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor William M Shafer

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. Supporting information containing the list of used sequences, as well as detailed discussion, is available.

    (DOCX)

    Attachment

    Submitted filename: Response-to-reviewers.docx

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES