Skip to main content
eLife logoLink to eLife
. 2023 Jan 25;12:e75860. doi: 10.7554/eLife.75860

GWAS and functional studies suggest a role for altered DNA repair in the evolution of drug resistance in Mycobacterium tuberculosis

Saba Naz 1,2,3,, Kumar Paritosh 4,, Priyadarshini Sanyal 1, Sidra Khan 1, Yogendra Singh 3, Umesh Varshney 5, Vinay Kumar Nandicoori 1,2,
Editors: Digby F Warner6, Bavesh D Kana7
PMCID: PMC9876569  PMID: 36695572

Abstract

The emergence of drug resistance in Mycobacterium tuberculosis (Mtb) is alarming and demands in-depth knowledge for timely diagnosis. We performed genome-wide association analysis using 2237 clinical strains of Mtb to identify novel genetic factors that evoke drug resistance. In addition to the known direct targets, we identified for the first time, a strong association between mutations in DNA repair genes and the multidrug-resistant phenotype. To evaluate the impact of variants identified in the clinical samples in the evolution of drug resistance, we utilized knockouts and complemented strains in Mycobacterium smegmatis and Mtb. Results show that variant mutations compromised the functions of MutY and UvrB. MutY variant showed enhanced survival compared with wild-type (Rv) when the Mtb strains were subjected to multiple rounds of ex vivo antibiotic stress. In an in vivo guinea pig infection model, the MutY variant outcompeted the wild-type strain. We show that novel variant mutations in the DNA repair genes collectively compromise their functions and contribute to better survival under antibiotic/host stress conditions.

Research organism: Other

Introduction

The acquisition of drug resistance in Mycobacterium tuberculosis (Mtb) has evoked a perilous situation worldwide (WHO, 2020). Resistance to isoniazid and rifampicin, the first-line drugs, results in multidrug resistant-TB (MDR-TB). The pathogen is defined as extensively drug resistant (XDR) when it becomes resistant to first-line TB drugs, any fluoroquinolones, and at least one additional Group A drug (moxifloxacin, levofloxacin, linezolid, and bedaquiline) (WHO, 2021). Prolonged treatment duration, high drug toxicity, and the expensive drug regimen pose a challenge for treating the MDR and XDR-TB. Moreover, the inadequate treatment of drug-resistant TB leads to the augmentation of resistance to other anti-TB drugs, increasing the probability of transmission of these strains in the population (Alexander and De, 2007; Bastos et al., 2014).

Seven major lineages of Mtb are present across the globe, out of which four lineages: Lineage 1-Indo Oceanic (EAI); Lineage 2- Beijing; Lineage 3- Central Asian (CAS); and Lineage 4- Euro-American (Gagneux et al., 2006) are prevalent in humans. Clinical strains belonging to lineage 2 are more prone to developing drug resistance than lineage 4 strains (Ford et al., 2013). Acquisition of drug resistance in Mtb is majorly attributed to the chromosomal mutations that either modify the antibiotic’s direct target or increase the expression of efflux pumps that helps in decreasing the effective concentration of the drug inside the cell. The expression of drug modifying/degrading enzymes also contributes to the acquisition of drug resistance (Gygli et al., 2017). Despite well-known mechanisms of drug resistance, it is difficult to predict the resistance based on direct target mutations alone, implying the presence of hitherto unknown mechanisms that impart drug resistance. In addition, the diagnosis based on mutations in a particular known target region increases the bias that may have multiple repercussions, such as misdiagnosis and eventual spread of drug resistance (CRyPTIC Consortium and the 100,000 Genomes Project et al., 2018; The CRyPTIC Consortium, 2022b; The CRyPTIC Consortium, 2022a). The current knowledge of the mechanisms and biological triggers involved in the evolution of MDR or XDR-TB is inadequate. This knowledge is crucial for developing new drug targets and improved diagnosis. Multiple efforts have been made to determine the mechanisms for the emergence of MDR and XDR-TB. Genome-wide association studies (GWAS) identified different genes that abet the emergence of drug resistance (Farhat et al., 2013; Hicks et al., 2018; Zhang et al., 2013; Safi et al., 2019). However, only a few genes, such as ponA1, prpR, ald, glpK, and the mutation in the thyA-Rv2765 thyX-hsdS.1 loci, are validated (Farhat et al., 2013; Hicks et al., 2018; Zhang et al., 2013; Safi et al., 2019).

In a quest to identify genetic triggers that aid in the evolution of antibiotic resistance in Mtb, we performed GWAS using global data set of 2237 clinical strains that consist of antibiotic susceptible, MDR, poly-drug resistant (resistant to more than one first-line anti-TB drug other than both isoniazid and rifampicin), pre-XDR, and XDR. Interestingly, we have identified mutations in the multiple DNA repair genes of Mtb associated with the MDR phenotype. Functional validation of the identified mutations in DNA repair enzymes revealed that perturbations in the DNA repair mechanisms result in the enhanced survival of strains in the presence of antibiotics ex vivo and in vivo.

Results

GWAS unveils mutations in the DNA repair genes

To identify the genetic determinants contributing to the development of antibiotic resistance in Mtb, we performed genome-wide association analysis using the whole-genome sequences of clinical strains from nine published studies. After the quality filtering of raw reads, the dataset had 2773 clinical strains from 9 different countries belonging to all 4 lineages (Figure 1a, Figure 1—figure supplement 1; Hicks et al., 2018; Zhang et al., 2013; Casali et al., 2014; Blouin et al., 2012; Shanmugam et al., 2019; Guerra-Assunção et al., 2015; Clark et al., 2013; Bryant et al., 2013; Walker et al., 2013). The dataset chiefly represented Lineage 2 and 4 isolates that are predominant across the globe (Figure 1b). Based on the computational predictions and the phenotypes provided by the previous studies, strains were categorized as susceptible, mono-drug resistant, MDR, Poly-DR, and pre-XDR (Supplementary file 1; Supplementary file 2Manson et al., 2017). We identified ~160,000 single nucleotide polymorphisms (SNPs) and indels after mapping the short reads on the reference Rv genome. A phylogenetic tree constructed using the SNPs shows the proper clustering of lineages (Figure 1b). The total number of SNPs observed for all the strains was comparable, suggesting the absence of genetic drift (Figure 1c).

Figure 1. Genome-wide association study unveils mutations in the DNA repair genes.

(a) Geographical distribution of 2773 clinical strains of Mycobacterium tuberculosis (Mtb). The donut plot represents the proportion of susceptible and drug-resistant (DR) strains in each lineage. DR includes mono-DR, poly-DR, multidrug resistant (MDR), and pre-extensively drug resistant (XDR). A detailed breakup of distribution is given in Supplementary file 1. (b) Phylogenetic tree constructed using 1,60,000 single nucleotide polymorphisms (SNPs) using Mycobacterium canetti as an outgroup. (c) Dot-plot showing the number of SNPs identified in each strain. Different colored dots indicate the drug resistance phenotype of strain.

Figure 1.

Figure 1—figure supplement 1. Country-wide distribution of clinical strains.

Figure 1—figure supplement 1.

Each donut plot represents proportion of clinical strains used for the genome-wide association study. Susceptible refers to the antibiotic sensitive strains. DR refers to ‘drug resistance’ to first- and second-line antibiotics.

We hypothesized that the probability of finding the genetic determinants contributing to drug resistance would be higher in the strains resistant to more than two antibiotics. Thus, we performed GWAS using 1815 drug-susceptible and 422 drug-resistant strains (Figure 1—figure supplement 1, Figure 2—figure supplements 14 & Supplementary file 2). We employed a genome association and prediction integrated tool (GAPIT) software, with stringent false discovery rate (FDR) adjusted p-value (Gao et al., 2016; Zegeye et al., 2014). After setting the adjusted p-value cut-off at 10–5, we identified 188 mutations, including 24 intergenic regions correlated with multidrug resistance (Supplementary file 3; Supplementary file 4; Supplementary file 5; Supplementary file 6). The effect of identified SNPs on the development of MDR/XDR reveals positive or negative contributions (Figure 2a). We have identified known first- and second-line drug resistance target genes (Figure 2b & Table 1). Although we identified multiple mutations in rpoB, only p.Leu452Pro and p.Val496Met were above the cut-off. Notably, mutations in the rrs, katG (p.Ser315Thr), embB (p.Gly406Ser), pncA (p.His71Arg), gyrA (p.Ala90Val), and recently reported genetic determinants such as folC (p.Ser150Gly), and pks were part of the 164 genes, validating our approach (Figure 2b & Table 1, Supplementary file 4; Supplementary file 6). Also, we identified the known compensatory mutations in the fabG1 upstream region, eis-Rv2417c, and oxyR-ahpC loci (Coll et al., 2018; Supplementary file 6). Importantly, the above mutations were absent in the mono-DR and drug-susceptible strains (Figure 2—figure supplement 6).

Figure 2. Drug-resistant strains carry mutations in the DNA repair genes.

(a) Volcano plot represents the effect of identified single nucleotide polymorphisms (SNPs) on the development of multidrug resistant/extensively drug resistant TB (MDR/XDR-TB). The positive effect (green dots) shows that the identified SNPs would aid in MDR/XDR development. The negative effect (red dots) shows that the SNPs would restrain the development of MDR/XDR. (b) Manhattan plot representing the association between the genes and drug resistance phenotype. A total of 188 genes that include intergenic regions were identified above the 10–5 cut-off value through association studies. Blue dots represent mutation in the lipid metabolism, membrane proteins, intermediary metabolism genes, and others. Green dots represent mutation in the direct targets for the first- and second-line antibiotics. Red dots represent mutations associated with the DNA repair genes. A detailed list of associated genes is provided in Supplementary file 3; Supplementary file 4. (c–e) Pie chart represents the total (c), non-synonymous (d), and synonymous (e) SNPs identified in the genes that belong to different categories. (f) Bar plot represents the percentage of synonymous mutations in the genes that resulted in abundant/moderate codon usage to the rare codon compared to H37Rv.

Figure 2—source data 1. Mutations identified in genes that belong to different categories.

Figure 2.

Figure 2—figure supplement 1. Genome-wide association study analysis.

Figure 2—figure supplement 1.

Bar graph representing the (a) heterozygosity of strains, (b) heterozygosity of markers, and (c) frequency and accumulative frequency of marker density.
Figure 2—figure supplement 2. Linkage disequilibrium, minor allele frequency and quantile-quantile plot in the genome-wide association analysis.

Figure 2—figure supplement 2.

(a) Linkage disequilibrium decay over distance. (b) Minor allele frequency (MAF). (c) Quantile-quantile – plot of p-values.
Figure 2—figure supplement 3. Density of markers in the genome-wide association analysis.

Figure 2—figure supplement 3.

(a–d) Density of markers.
Figure 2—figure supplement 4. Plots showing optimum compression and Type-I error.

Figure 2—figure supplement 4.

(a) The profile for the optimum compression. (b) Type-I error plot.
Figure 2—figure supplement 5. Genome-wide association study-based hypothesis.

Figure 2—figure supplement 5.

In the natural process of evolution, host-imposed stress and antibiotic treatment result in the evolution of wild-type bacteria to multidrug resistant/extensively drug resistant (MDR/XDR). However, in bacteria with compromised DNA repair pathways, the evolution to MDR/XDR-TB is accelerated.
Figure 2—figure supplement 6. Distribution of mutations in the DNA repair genes.

Figure 2—figure supplement 6.

(a–d) Distribution plot showing single nucleotide polymorphisms (SNPs) in mutY, recF, uvrA, and uvrB identified in drug-resistant strains. Wild type and the alternative alleles are shown. (e–h) Distribution plot showing SNPs in mutY, recF, uvrA, and uvrB identified in drug-resistant strains. Wild type and the alternative alleles are shown. (i) Table representing the distribution of drug resistant–associated mutations. (j&k) Bar graph representing direct target mutations in the strains harboring mutY mutation (e) or mutations in the other identified DNA repair genes (f). (l) Box and Whisker plot represent mutation spectrum of clinical strains harboring mutY variant and closely related drug susceptible strains. The analysis showed a trend toward higher C→T, A→G, and C→A mutations, but we could not perform statistical analysis, as the strains harboring mutY variant were limited.

Table 1. Mutations identified in the direct targets of antibiotics.

Antibiotic Gene Mutations identified
Rifampicin rpoB Leu452Pro, Val496Meth
Isoniazid katG Ser315Thr
Ethambutol embB Gly406Ser
Ofloxacin gyrA Ala90Val, Ser91Pro
Kanamycin rrs 7 independent mutations
Pyrazinamide pncA His71Arg
Ethionamide ethA Met95Arg, Pro160(frame-shift)
Streptomycin gidB Leu35 (frame-shift)
Cycloserine ald Thr427Pro

Among the 188 genes, 45% of the mutations resulted in non-synonymous changes, whereas 33% resulted in synonymous changes, 14% in the upstream regions of the genes, and 8% in the stop/frameshift mutations (Figure 2c–e & -- Supplementary file 3; Supplementary file 4; Supplementary file 5; Supplementary file 6). While the non-synonymous and the stop/frameshift mutations most likely affect the functions of the proteins, the intergenic region mutations may impact gene expression. We identified mutations in genes involved in lipid metabolism, intermediary metabolism and respiration, membrane transporters, cell wall and cell processes, membrane-associated proteins, and others (https://mycobrowser.epfl.ch/) (Figure 2c–e). Synonymous changes may alter the mRNA stability or stall the translation process by changing an abundant codon to a rare codon (Brandis and Hughes, 2016; Kristofich et al., 2018; Plotkin and Kudla, 2011). Analysis of the synonymous mutations for the codon bias revealed that in >50% of events, codons were converted from moderate/abundant to rare codons (compare 75% in MDR/XDR with 25% in H37Rv) (Figure 2f, Supplementary file 7).

In addition to the mutations described above, we identified novel mutations in base excision repair (BER), nucleotide excision repair (NER), and homologous recombination (HR) pathway genes, mutY, uvrA, uvrB, and recF that are associated with the MDR and XDR-TB (Figure 2b & Table 2). Mutations in the DNA repair pathway genes could contribute to the selection and evolution of antibiotic resistance (Table 2; Figure 2—figure supplement 5). Analysis showed that mutations in the DNA repair genes are distributed specifically in MDR, PDR, and preXDR/XDR strains (Figure 2—figure supplement 6a–d). Furthermore, these strains also harbored mutations in the direct targets of the antibiotics (Figure 2—figure supplement 6i–k). Collectively, in addition to mutations in the direct targets, we identified novel uncharacterized variants, including mutations in the DNA repair genes.

Table 2. Mutations in DNA repair genes associated with drug resistance phenotype.

Gene Amino acid change Wild type Mutated False discovery rate-adjusted p-value
mutY Arg262Gln G A 3.83E-09
uvrB Ala524Val C T 2.15E-07
uvrA Gln135Lys C A 3.83E-09
RecF Gly269Gly G T 2.15E-07

The mutations in DNA repair genes result in their deficient function

DNA repair pathways, including NER, HR, and BER, guard the genomic integrity (Cole et al., 1998; Singh, 2017). The Mtb MutY is a 302 amino acid (aa) long adenine DNA glycosylase encoded by rv3589. We identified Arg262Gln mutation at the C-terminal region of the MutY. Oxidative damage to DNA results in the formation of 7,8-dihydro 8-oxoguanine (8-oxoG). If left unrepaired by MutM (fpg), it results in 8-oxoG:A (mostly) or 8-oxoG:G base pairing. MutY removes A or G paired against 8-oxoG, allowing MutM to correct the mistake. The absence of repair leads to G:C to T:A or C:G mutations in the genome (Figure 3a; Kurthkoti et al., 2010). The analysis of the mutation spectrum in the drug-resistant clinical strains harboring mutY-R262Q mutation and closely related drug-susceptible strains showed a bias toward C→A, A→G, and C→T mutations (Figure 2—figure supplement 6l). To decipher the biological role of the identified variant, we cloned Mtb mutY and performed site-directed mutagenesis to generate the mutant allele. The wild type and mutant mutY genes were subcloned into an integrative Mtb shuttle vector. Constructs were electroporated into Mycobacterium smegmatis mutY mutant strain (msmΔmutY) to generate msmΔmutY::mutY and msmΔmutY::mutY-R262Q strains. We performed mutation frequency analysis to evaluate the impact of Arg262Gln mutation on its DNA repair function. In accordance with the published data, deletion resulted in a 4.32-fold increase in the mutation frequency (Figure 3b; Kurthkoti et al., 2010). While complementation with wild-type mutY rescued the phenotype, complementation with mutY-R262Q failed to do so (Figure 3b and c).

Figure 3. Variants identified in DNA repair genes abrogate their function.

(a) A schematic representation of the base excision repair pathway that operates in mycobacteria. Oxidative damage can result in the conversion of G to 8-oxo-G. If MutM (Fpg) does not repair 8-oxo-G before replication, often an A is inserted against 8-oxo-G during replication. Under these conditions, MutM must avoid repair of 8-oxo-G until MutY removes the erroneously incorporated A. The predominant target of MutY is 8-oxo-G:A pair where it removes A; thus, the action of MutY provides another opportunity to incorporate C (the correct base) against 8-oxo-G. Now the DNA becomes a target for MutM again, leading to the removal of 8-oxo-G and allowing incorporation of G. (b) Mutation frequency was calculated using msm, msmΔmutY, msmΔmutY::mutY, msmΔmutY::mutY-R262Q. (c) Fold increase in the mutation frequency with respect to wild-type msm. (d) A schematic representation of the nucleotide excision repair pathway showing the recognition and initiation of repair by UvrA-UvrB and UvrC. (e) Mutation frequency of msm, msmΔuvrB, msmΔuvrB::uvrB, and msmΔuvrB::uvrB-A524V. (f) Fold increase in the mutation frequency with respect to wild-type msm. Two biologically independent experiment sets were performed. Each biological experiment was performed in a biological sextet. Data represent one set of experiments. Statistical analysis (two-way ANOVA) was performed using Graph pad prism software. ***p<0.0001, **p<0.001, and *p<0.01. (g, h & i) Mutation rate was calculated for different strains in the presence of isoniazid (g), rifampicin (h), or ciprofloxacin (i). (j) Table showing the fold increase in the mutation rate in comparison with wild-type Rv. The experiment was performed using six independent colonies. Data represent mean and standard deviation. Statistical analysis (two-way ANOVA) was performed using Graph pad prism software. ***p<0.0001, **p<0.001, and *p<0.01.

Figure 3—source data 1. Mutation rate analysis in the presence of different drugs.

Figure 3.

Figure 3—figure supplement 1. Genome-wide association study of lineage 4 strains identified mutations in the DNA repair genes.

Figure 3—figure supplement 1.

(a) Manhattan plot showing identifying genes that belong to DNA repair and direct target of antibiotics. (b) Schematic depicting the generation of gene replacement mutant of mutY. The hygromycinr cassette disrupted the native allele. (c) PCR using F1-R1 (gene-specific primers) amplified 900 bp in Rv and ~1.6 kb in RvΔmutY. PCR using F2-R2 (hygromycinr cassette forward primer and reverse primer beyond the 3' flank) resulted from amplification in the RvΔmutY but not in Rv. (d) Immunoblot analysis to confirm the expression of complementation constructs RvΔmutY::mutY and RvΔmutY::mutY-R262Q. The upper panel probed with α-FLAG antibody, and the lower panel probed with α-GroEl-1 antibody as a control.
Figure 3—figure supplement 1—source data 1. Confirmation of gene repalcement mutant and complementation strains.
Figure 3—figure supplement 2. Mutation frequency analysis.

Figure 3—figure supplement 2.

(a) Mutation frequency was calculated for different strains in the presence of rifampicin (a) or isoniazid (b). (c) Table showing the fold increase in the mutation frequency in comparison with wild-type Rv. Two biologically independent experiment sets were performed, and each experiment was performed in sextet. Data represent mean and standard deviation. Statistical analysis (two-way ANOVA) was performed using Graph pad prism software. ***p<0.0001, **p<0.001, and *p<0.01.
Figure 3—figure supplement 2—source data 1. Analysis of Mutation frequency.

Next, we investigated the role of mutation in the NER pathway gene UvrB. UvrA, UvrB, and UvrC recognize and initiate the NER pathway upon DNA damage. UvrB, a 698 aa long DNA helicase encoded by rv1633, plays a pivotal role in the NER pathway by interacting with the UvrA and UvrC (Figure 3d; Kurthkoti et al., 2008). UvrB harbors N and C-terminal helicase domain, interaction domain, YAD/RRR motif, and UVR domain. Identified UvrB variant, Ala524Val, mapped to the C-terminal helicase like domain (Theis et al., 2000). To evaluate the functional significance of the mutation of uvrB, Mtb uvrB and uvrB-A524V genes were cloned into an integrative vector. The absence of uvrB led to higher mutation frequency, which could be rescued upon complementation with the wild type but not with the variant (Figure 3e and f).

Subsequently, we sought to extend our investigations in Mtb. Toward this, we generated the gene replacement mutant of mutY in laboratory strain Mtb H37Rv (Rv), wherein the mutY at native loci was disrupted with a hygromycin resistance cassette. Replacement at the native loci was confirmed by performing multiple PCRs (Figure 3—figure supplement 1b–c). Complementation constructs harboring mutY or mutY-R262Q were electroporated in the RvΔmutY to generate RvΔmutY::mutY and RvΔmutY::mutY-R262Q. Western blot analysis showed comparable expression of the MutY or MutY-R262Q (Figure 3—figure supplement 1d). We determined the mutation rates in the presence of isoniazid, rifampicin, and ciprofloxacin (Figure 3g–j). The fold increase in the mutation rates relative to Rv for RvΔmutY, RvΔmutY:mutY, and RvΔmutY::mutY-R262Q were 2.90, 0.76, and 3.0 in the presence of isoniazid; 5.62, 1.13, and 5.10 in the presence of rifampicin; and 9.14, 1.57, and 8.71 in the presence of ciprofloxacin, respectively (Figure 3j). Also, we have determined the mutation frequencies in the presence of isoniazid and rifampicin (Figure 3—figure supplement 2). Results are in line with the mutation rate experiments presented in Figure 3. Together these data suggest that variants of mutY and uvrB compromise their function.

The variant of mutY resists antibiotic killing

The killing kinetics in the presence and absence of isoniazid, rifampicin, ciprofloxacin, and ethambutol was performed to evaluate the effect of different drugs on the survival of RvΔmutY or RvΔmutY::mutY-R262Q (Figure 4a). In the absence of antibiotics, the growth kinetics of Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q were similar (Figure 4b). In the presence of isoniazid, ~2 log-fold decreases in bacterial survival was observed on day 3 in Rv and RvΔmutY::mutY; however, in RvΔmutY and RvΔmutY::mutY-R262Q, the difference was limited to ~1.5 log-fold (Figure 4c). A similar trend was apparent on days 6 and 9, suggesting an ~fivefold increase in the survival of RvΔmutY and RvΔmutY::mutY-R262Q compared with Rv and RvΔmutY::mutY (Figure 4c). Interestingly, in the presence of ethambutol, we did not observe any significant difference (Figure 4d). In the presence of rifampicin and ciprofloxacin, we observed an ~10-fold increase in the survival of RvΔmutY and RvΔmutY::mutY-R262Q compared with Rv and RvΔmutY::mutY (Figure 4e–f). Thus, results suggest that the absence of mutY or the presence of mutY variant aids in subverting the antibiotic stress.

Figure 4. Killing kinetics in the presence of antibiotics show better survival of RvΔmutY and RvΔmutY::mutY R262Q.

Figure 4.

(a) Schematic representation of killing kinetics. (b) Growth kinetics in the absence of drugs. (c–f) Growth kinetics in the presence of isoniazid, rifampicin, ciprofloxacin, and ethambutol. Two biologically independent sets of experiments were performed. Each biological experiment was performed in biological triplicates. Data represent one set of experiments. Statistical analysis (two-way ANOVA) was performed using Graph pad prism software. ***p<0.0001, **p<0.001, and *p<0.01.

Figure 4—source data 1. Killing kinetics in the absence and presence of different antibiotics.

The variant of mutY confers survival advantage ex vivo

Next, we evaluated the survival of Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q in the peritoneal macrophages. We did not observe any differences in the survival of RvΔmutY or RvΔmutY::mutY-R262Q compared with Rv or RvΔmutY::mutY (Figure 5—figure supplement 1a–b). We speculated that the evolution of a strain to become antibiotic-resistant requires the continued presence of antibiotic and host-directed stress. Therefore, we infected peritoneal macrophages with Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q in the absence or presence of the antibiotics. The bacteria recovered after 120 hr post-infection (p.i.) were cultured in vitro for 5 days and used for the next round of infection. The process was repeated for three rounds, and CFUs were enumerated at 4 and 96 hr p.i. during the fourth (final) round of infection (Figure 5a). CFUs obtained at 4 hr p.i. showed equal load. RvΔmutY and RvΔmutY::mutY-R262Q exhibited better survival in the absence of antibiotics than Rv and RvΔmutY::mutY (Figure 5b–e). There was no additional advantage compared with untreated in the presence of isoniazid (Figure 5f). However, we observed a log-fold advantage for RvΔmutY and RvΔmutY::mutY-R262Q compared with Rv or RvΔmutY::mutY in the presence of rifampicin or ciprofloxacin (Figure 5f).

Figure 5. Mutations in the DNA repair genes provide a survival advantage in the presence of antibiotics.

(a) A schematic is representing the ex vivo infection experiment in the presence and absence of different antibiotics. (b–e) Survival of the strains in the peritoneal macrophages at 4 and 96 hr post-infection (p.i.) without and with antibiotics (isoniazid or rifampicin, or ciprofloxacin). (f) Percent survival with respect to 4 hr p.i. was determined for each strain without and with antibiotics (isoniazid or rifampicin or ciprofloxacin). Two biologically independent sets of experiments were performed. Each biological experiment was performed in biological triplicates. Data represent one set of experiments. Statistical analysis (two-way ANOVA) was performed using Graph pad prism software. ***p<0.0001, **p<0.001, and *p<0.01.

Figure 5—source data 1. Survival of different strains in the absence and presence of antibiotics ex vivo.

Figure 5.

Figure 5—figure supplement 1. Ex vivo survival of strains.

Figure 5—figure supplement 1.

(a) Survival of strains with respect to 4 hr post-infection (p.i.) in peritoneal macrophages. (b) CFU enumeration of strains at 4 and 96 hr p.i. (c) Percent survival of each strain in the competition experiment. (d) CFU enumeration of strains at 4 and 96 hr p.i. in competition experiment. (e) Competition experiment after passaging of strains ex vivo. In the absence of antibiotics, the CFU enumeration was performed at 4 and 96 hr p.i. CFU enumeration in the presence of (f) isoniazid, (g) rifampicin, and (h) ciprofloxacin at 4 and 96 hr p.i. Two biologically independent experiments and each experiment was performed in the biological triplicates. Data represents one set of the biological experiments.
Figure 5—figure supplement 1—source data 1. Survival of strains before and after passage in the peritoneal macrophages.

Acquisition of direct target mutations ex vivo in the presence of drugs

We sought to determine if the improved survival of mutY mutant and mutY variant in the above experiment (Figure 5) is due to the acquisition of mutations in the direct target of antibiotics. To identify the mutations, we performed Whole Genome Sequencing (WGS). Genomic DNA extracted from 10 independent colonies (grown in vitro) was mixed in equal proportion prior to library preparation. Only those SNPs present in >20% of the reads were retained for the analysis. Analysis of Rv sequences grown in vitro suggested that the laboratory strain accumulated 100 SNPs compared with the reference strain (data not shown). The sequence of the Rv laboratory strain was used as the reference for the subsequent analysis. WGS data for RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q strains grown in vitro did not show the presence of any mutations in the antibiotic target genes. In a similar vein, 10 independent colonies, each from the 7H11-OADC plates, after the final round of ex vivo infection in the presence or absence of antibiotics, were selected for WGS. Data indicated that in the absence of antibiotics, no direct target mutations were identified in the ex vivo passaged strains (Figure 6a & e). However, in the presence of isoniazid, we found mutations in the katG (Ser315Thr or Ser315Ileu) in the Rv, RvΔmutY but not in RvΔmutY::mutY and RvΔmutY::mutY-R262Q (Figure 6b & e). These findings are in congruence with the ex vivo evolution CFU analysis, wherein we did not observe a significant increase in the survival of RvΔmutY and RvΔmutY::mutY-R262Q in the presence of isoniazid (Figure 5). In the presence of ciprofloxacin and rifampicin, direct target mutations were identified in the gyrA and rpoB (Figure 6c–e). Asp94Glu/Asp94Gly mutations were identified in gyrA, and, His445Tyr/Ser450Leu mutations were identified in rpoB of RvΔmutY and RvΔmutY::mutY-R262Q, respectively. No direct target mutations were identified in the Rv and RvΔmutY::mutY, suggesting that the perturbed DNA repair aids in acquiring the drug resistance-conferring mutations in Mtb (Figure 6c–e & Supplementary file 8; Paritosh, 2022; https://github.com/kumar-paritosh/analysis_of_Mtb_genome).

Figure 6. WGS reveals the acquisition of direct target mutations in the ex vivo passaged strains.

Figure 6.

(a–d) Circos plot showing the WGS analysis of the strains passaged ex vivo in the absence (a) and in the presence of isoniazid (b), rifampicin (c), and ciprofloxacin (d). The outermost circle represents the reference genome labeled with the known direct target mutations. Circles (from outside to inside) represent Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q genome. (e) Heat map represents the single nucleotide polymorphisms identified in the Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q after ex vivo passage.

Competition experiment ex vivo reveals MutY variant confers a survival advantage

Results above suggested that the mutY variant impacts survival advantage when subjected to antibiotic selection, likely due to its ability to accumulate mutations. We reasoned that, if this is indeed the case, the mutY variant may outcompete the wild-type Rv when both the strains are present together. To test this hypothesis, we infected peritoneal macrophages with a combination of Rv + RvΔmutY or Rv + RvΔmutY::mutY or Rv + RvΔmutY::mutY-R262Q. At 96 hr p.i. host cells were lysed, and Mtb CFUs were enumerated on plain 7H11 or kanamycin (kan) (Rv) or hygromycin (hyg) (RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q) to evaluate the survival rates (Figure 5—figure supplement 1c–d). The percent survival was calculated as (CFUs obtained on kan or hyg/CFUs on kan + hyg) × 100. It is apparent from the data that the survival rates of competing strains were comparable, suggesting that mutY deletion or complementation with variant did not confer a significant advantage (Figure 5—figure supplement 1c). These results indicate that in the absence of prior antibiotic selection, deletion or the presence of mutY variant does not confer an advantage (Figure 5—figure supplement 1c–d).

To test these conclusions, we performed ex vivo co-infection experiment with the strains that were subjected to three rounds of prior selection (Figure 5). Peritoneal macrophages were infected with a combination of Rv + RvΔmutY or Rv + RvΔmutY::mutY or Rv + RvΔmutY::mutY-R262Q (Figure 7a). At 24 hr p.i., cells were either treated or not treated with an antibiotic for the subsequent 72 hr, and total CFUs were enumerated as described above to evaluate the survival rates (Figure 5—figure supplement 1e–f). As expected, there was no difference in the CFUs either at 4 hr p.i. (Figure 7b–e) or 24 hr p.i. (data not shown). At 96 hr p.i., RvΔmutY and RvΔmutY::mutY-R262Q strains showed a distinct advantage over Rv both in the absence or presence of antibiotics. Importantly, RvΔmutY::mutY did not show any advantage over Rv under any conditions. These results suggest that subjecting deletion or variant strains to antibiotic stress in the host helps in evolution of the strains that can outcompete the wild-type strain.

Figure 7. Variant of mutY outcompetes Rv in competition experiment.

Figure 7.

(a) Schematic representing the competition experiment performed in peritoneal macrophages. Strains obtained after three rounds of infection in the peritoneal macrophages were used to perform a competition experiment (Figure 4a). (b–e) Percent survival of Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY R262Q in the absence and presence of antibiotics. Two biologically independent experiments, with each experiment performed in technical triplicates. Data represent one of the two biological experiments. Data represent mean and standard deviation. Statistical analysis (two-way ANOVA) was performed using Graph pad prism software. ***p<0.0001, **p<0.001, and *p<0.01.

Figure 7—source data 1. Competition experiment in the presence and absence of different drugs after passage in the peritoneal macrophages.

MutY variant exhibits enhanced survival in vivo

Upon entering the host macrophages, Mtb encounters multiple forms of stress that impede its growth. To survive and grow in such a hostile environment, Mtb employs various defense mechanisms (Chai et al., 2020). The treatment regimen with anti-TB drugs imposes a supplementary layer of stress on the pathogen. An auxiliary mechanism the pathogen uses is to accumulate mutations in its genome that improve its ability to combat antibiotic and host-induced stresses. We asked if the variant mutations identified in DNA repair genes provide one such auxiliary mechanism. To test this hypothesis, we performed guinea pig infection experiments using Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q (Figure 8a). CFUs were enumerated after 1 and 56 days post-infection. CFUs obtained on day 1 showed the deposition was equal for wild-type, mutant, and the complemented strains. Gross pathology and histopathology analysis of infected lungs showed the presence of well-formed granulomas (Figure 8b–c). Significantly, 56 hr p.i., RvΔmutY, RvΔmutY::mutY-R262Q strains showed ~fivefold superior survival than Rv RvΔmutY::mutY, suggesting that the variant mutant identified indeed confers advantage (Figure 8d).

Figure 8. Perturbation of DNA repair results in enhanced survival in vivo.

(a) A schematic representation of the guinea pig infection experiment. (b) Gross histopathology of lungs and spleen of infected guinea pigs. (c) Hematoxylin and eosin staining of infected lung tissue showing the well-formed granuloma. Magnification ×10. (d) Guinea pigs were challenged with Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q via the aerosol route. CFUs were enumerated at day 1 post-infection to determine the lungs’ initial load (n=3). The CFUs at 1 day post-infection (p.i.) are represented for the whole lung. At 56 days p.i., lungs and spleen were isolated to determine survival (CFU / ml, n=7). (e) Outline showing the competition experiment performed in guinea pigs. (f) Total CFU enumeration of Rv + RvΔmutY, Rv + RvΔmutY::mutY, and Rv + RvΔmutY::mutY-R262Q in the lungs and spleen of guinea pigs at 1 day (whole lung, n=3) and 56 days p.i. (CFU/ml, n=7). (g) CFU enumeration of Rv + RvΔmutY, Rv + RvΔmutY::mutY, and Rv + RvΔmutY::mutY-R262Q on kanamycin and hygromycin containing plates. Statistical analysis was performed using two-way ANOVA. Graphpad prism was employed for performing statistical analysis. ***p<0.0001, **p<0.001, and *p<0.01. (h) Survival of each strain at indicated time points in the mixed infection. (i) Percent survival at 1 and 56 days in the competition experiment. Statistical analysis was performed using unpaired t-test. ***p<0.0001, **p<0.001, and *p<0.01.

Figure 8—source data 1. Gross histopathology of the infected lungs and spleen isolated from guinea pig.
Figure 8—source data 2. Haematoxylin and eosin staining.
Figure 8—source data 3. Survival of different strains in vivo.

Figure 8.

Figure 8—figure supplement 1. Gross histopathology of infected lungs and spleen isolated at 56days post-infection.

Figure 8—figure supplement 1.

Figure 8—figure supplement 1—source data 1. Gross histopathology of lungs and spleen isolated from guinea pigs after competition experiment.
Figure 8—figure supplement 2. WGS analysis of strains isolated from guinea pig lungs.

Figure 8—figure supplement 2.

(a) Circos plot shows the WGS analysis of the strains isolated from guinea pig lungs. The outermost circle represents the reference genome labeled with the known direct target mutations. Circles (from outside to inside) represent Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q genome. (b) Heat map representing the single nucleotide polymorphisms identified in the Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q after ex vivo passage.

Next, we determined the survival ability of RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q, when competed against wild-type Rv strain (Figure 8e). The lung and spleen homogenates were plated on 7H11 to determine total CFUs (Figure 8f, Figure 8—figure supplement 1). The total CFUs were found to be comparable in both lungs and spleen across all combinations (Figure 8f). Lung and spleen homogenates were plated on either kan (Rv) or hyg (RvΔmutY, RvΔmutY::mutY or RvΔmutY::mutY-R262Q) containing plates to determine survival (Figure 8g). As with independent infections, RvΔmutY, or RvΔmutY::mutY-R262Q, showed fivefold elevated CFUs compared with Rv (Figure 8g). When plotted as percent survival, the data showed that mutY deletion or complementation with the variant confers a survival advantage to the pathogen (Figure 8h). To determine the plausible cause of enhanced survival in vivo, we performed WGS of the strain isolated from guinea pig lungs. Analysis revealed that the specific genes such as cobQ1, smc, espI, and valS were mutated only in RvΔmutY and RvΔmutY::mutY-R262Q but not in Rv and RvΔmutY::mutY. Besides, tcrA and gatA were mutated only in RvΔmutY, whereas rv0746 was mutated exclusively in the RvΔmutY::mutY (Figure 8—figure supplement 2a–b). However, we did not observe any direct antibiotic resistance target mutations; this may be because guinea pigs were not subjected to antibiotic treatment. However, the total number of SNPs observed in all four strains was comparable. Thus the precise mechanistic role of MutY in Mtb pathogenesis needs further investigation. Collectively, results show that variants identified in DNA repair genes abrogate their function and contribute to a better survival in different stress conditions.

Discussion

The WGS of clinical strains provides vital information about many aspects such as, the acquisition and transmission of drug resistance, evolution of compensatory mutations, and evolution of the drug resistance in patients (Cohen et al., 2019). A recent WGS analysis of 10,219 diverse Mtb isolates successfully predicted mutations associated with pyrazinamide resistance (CRyPTIC Consortium and the 100,000 Genomes Project et al., 2018). Large-scale GWAS, employed initially for analyzing human genome data, is an invaluable tool to delineate the mutations that confer antibiotic resistance in bacteria (Power et al., 2017). For example, using the PhyC test on 116 clinical strains of Mtb, ponA1 was identified as one of the targets of independent mutations (Farhat et al., 2013). Analysis of 161 Mtb genomes identified polymorphisms in the intergenic regions that confer resistance to p-aminosalicylic acid (PAS) through overexpression of thyA and thyX (Zhang et al., 2013). GWAS of 498 sequences revealed that mutation in alanine dehydrogenase correlated with the resistance to a second-line drug D-cycloserine (Desjardins et al., 2016). Evaluation of 549 strains led to the identification of prpR, which in an ex vivo infection model confers conditional drug tolerance through regulation of propionate metabolism (Hicks et al., 2018). Largest GWAS involving 6465 clinical isolates uncovered novel resistance-associated mutations in ethA and thyX promoter, associated with ethionamide and PAS resistance, respectively (Coll et al., 2018).

We performed a gene-based GWAS analysis on a large dataset of susceptible and MDR/XDR clinical strains (Figure 1). We identified mutations in the known direct targets of both first- and second-line antibiotics and a few recently reported genetic variants using association analysis (Tables 1 and 2, Table 3). We also identified frameshift and non-synonymous mutations in rv2333c and rv1250, encoding for the transporters known to be differentially expressed in MDR patients (Table 3; Umar et al., 2019). Besides, the analysis captured mutations in genes involved in cell metabolism (Figure 2). This finding supports a recent study in Escherichia coli, where drug treatment led to the acquisition of mutations in the metabolic genes that impart drug resistance (Lopatkin et al., 2021). We believe that these may be compensatory mutations that placate the fitness cost associated with antibiotic resistance, and hence, might be the consequence of antibiotic resistance.

Table 3. Transporters associated with drug resistance phenotype.

Gene False discovery rate-adjusted p-value Variant
rv1258c 1.60E-08 Non-syn
mmpL2 9.51E-05 Non-syn
rv0987 1.60E-08 Non-syn
rv1250 3.44E-07 Non-syn
kdpC 2.15E-07 Non-syn
rv0928 2.15E-07 Non-syn
rv2333c 1.32E-20 Frameshift
mmpL13a 1.25E-06 Syn
kdpB 2.15E-07 Syn

Most importantly, for the first time, we identified a significant association between mutations in three of the DNA repair pathway genes with drug resistance. This study is in line with the studies on the other pathogens such as Pseudomonas aeruginosa, Helicobacter pylori, Neisseria meningitides, and Salmonella typhimurium, where mutations or deletions in the DNA repair genes were identified in the clinical isolates (Chopra et al., 2003). Moreover, the deletion of ung and udgB (BER pathway genes), independently or together, provides a survival advantage to the bacteria (Naz et al., 2021). It is known that lineage 2 clinical strains have polymorphisms in the DNA repair and replication genes (Mestre et al., 2011; Ebrahimi-Rad et al., 2003). However, we identified mutations in DNA repair genes in lineage 4, suggesting that the phenomenon is not confined to lineage 2 (Figure 3—figure supplement 1a).

The recently published CRYPTIC consortium associates 12,289 Mtb strains with the resistance phenotype to 13 antibiotics does not contain a mutY-Arg262Gln mutation in their data set. This may be due to the different strains used in the study (CRyPTIC Consortium and the 100,000 Genomes Project et al., 2018). Mutation spectrum analysis of the clinical strains harboring the mutY variant and closely related strains showed a trend toward higher C→T, A→G, and C→A mutations, but we could not perform statistical analysis, as the strains harboring the mutY variant were limited. We investigated if the mutations in DNA repair genes are the cause or the consequence of the antibiotic resistance. We functionally validated the identified mutations of two different pathway genes, mutY and uvrB, in Msm using gene replacement mutants. Mutation frequency analysis suggests that GWAS identified Arg262Gln and Ala524Val mutations in mutY and uvrB, respectively, abrogated their functions (Figure 3). The data agrees with the previously published study, wherein WGS analysis of antibiotic susceptible strain isolated from a patient showed a non-synonymous mutation in UvrB (A582V). Notably, the variant strain evolved into XDR-TB over 3.5 years after the first- and second-line drug treatment (Eldholm et al., 2014). Therefore, we propose that mutations identified in DNA repair genes contribute to the evolution of antibiotic resistance (Figure 2—figure supplement 5). Besides, the killing kinetics in the presence of different anti-TB drugs show that RvΔmutY and RvΔmutY::mutY-R262Q exhibit better survival (Figure 4).

Poor adherence of the patients to the antibiotic regimen is the leading cause of the emergence of drug resistance. The continual treatment with antibiotics provides sufficient time to evolve strains into MDR or XDR. We emulated the condition by repeatedly exposing strains to antibiotics in the ex vivo model, followed by growing them in vitro without antibiotics. This led to the improved survival of RvΔmutY and RvΔmutY::mutY-R262Q (Figure 5). WGS of the strains after ex vivo passage shows that the compromised DNA repair helps in acquiring mutations in the direct targets of the antibiotics (Figure 6). We then determined the survival of ex vivo evolved strains in the mixed infection scenario. A competition experiment using the evolved strains showed that RvΔmutY and RvΔmutY::mutY-R262Q could outcompete Rv (Figure 7). Host stress drives the selection of the bacterial population that acquires the ability to withstand adverse conditions. We evaluated the survival of the different strains in guinea pigs. Results suggest that RvΔmutY and RvΔmutY::mutY-R262Q exhibit improved survival. Similarly, we observed that RvΔmutY and RvΔmutY::mutY-R262Q could successfully outcompete Rv in the guinea pig infection model (Figure 8).

Using the GWAS approach and functional validation of the clinical mutations identified in the BER and NER pathways, we established a novel link between the compromised DNA repair and the evolution of antibiotic resistance (Figure 9).

Figure 9. Model.

Figure 9.

Model depicts the analysis and subsequent validation. Genome-wide association study (GWAS) revealed mutation in three DNA repair pathway genes in multidrug resistant/extensively drug resistant (MDR/XDR) strains. Based on GWAS, we proposed that mutations in DNA repair genes are associated with the evolution of antibiotic resistance in Mycobacterium tuberculosis (Mtb). Functional validation was performed using the gene replacement mutants of base excision repair (BER) and nucleotide excision repair (NER) pathway genes in Mycobacterium smegmatis and Mtb. In vitro, ex vivo, and in vivo experiments show that compromised DNA repair pathway leads to the enhanced survival of bacteria.

Collectively, the data presented here suggest that the loss of function of DNA repair genes helps acquire drug resistance in the presence of anti-TB drugs. We propose that the evolution of MDR or XDR-TB is likely a consequence of the loss of function of DNA repair genes. The presence of mutations in DNA repair genes can be an early-stage diagnostic marker for the evolution of the strain into MDR/XDR-TB. Molecular diagnosis of DNA repair gene mutations at the onset of infection should help design better therapies to impede the evolution of these strains into MDR or XDR. These findings indicate that bacteria having compromised DNA repair can contribute to the accumulation of mutations, providing an advantage to the bacilli when subjected to antibiotic treatment.

Materials and methods

Sequence retrieval and variant calling

Accession numbers for 2773 clinical strains of Mtb were obtained from 10 previous studies (Supplementary file 1), representing all 4 lineages from 9 countries. Sequence data were retrieved from EBA (https://www.ebi.ac.uk/) and NCBI databases (https://www.ncbi.nlm.nih.gov/) and quality filtered using Trimmometic software (Bolger et al., 2014). Adapter and E. coli sequence contaminations were removed, the sequences were analyzed at a sliding window of 4 bp, and those with an average phread value of 15 were clipped. Parameters for the trimmometic-based QC were set as (Leading:3, slidingwindow:4:15 trailing:3 minlen:60). Reads with a filtered length of <60 bp were removed. The filtered SE/PE reads were mapped on the H37Rv reference genome (Accession number –ASM19595) using the Burrows Wheeler Alignment (BWA) mem algorithm (Li and Durbin, 2009). GATK pipeline was used for sorting, PCR duplicate removal, and realignment of sequences (Van der Auwera et al., 2013). Variants (SNP and small InDels) were predicted in a batch mode with Platypus script (Rimmer et al., 2014). SNPs with read depth <5 or mapping quality of <20 were marked as missing. SNPs with missing calls in >40% of the accessions were removed from the analysis. Finally, ~160,000 SNPs from 2773 accessions with a variant dataset with an MAF of >1% were selected for the final analysis. The.vcf file was converted to.hmp file with Tassel (Bradbury et al., 2007). A phylogenetic tree was generated using ~160,000 SNPs using SNPhylo (Lee et al., 2014).

Association analysis

Genome-wide association analysis was carried out with 1815 susceptible strains and 422 MDR/XDR strains (Supplementary file 2). A GAPIT based on a compressed mix linear model was performed under the R environment (Lipka et al., 2012). VanRaden algorithm was employed for the calculation of the Kinship matrix. The kinship matrix assessed the relatedness among the strains included in the association panel. Principal components in GAPIT were used to classify the population structure. An association mapping analysis was carried out by combining the population structure analysis and the relative kinship matrix. The association mapping analysis obtained p-values, R2, and marker effect values. The FDR adjusted p-values in the GAPIT software are highly stringent as it corrects the effects of each marker based on the population structure (Q) as well as kinship (K) values and often lead to overcorrection (Gao et al., 2016; Zegeye et al., 2014). We selected a corrected p-value of 10–5 as the threshold for selecting associated genes. The associated SNPs were annotated using the snpEff v4.11 (Cingolani et al., 2012). A snpeff database was generated using the H37Rv (ASM19595v2, was used as a reference for mapping). The corresponding .gff file and the SNPs were annotated based on their position on the genome. Dot-plot, Manhattan’s plot, and volcano plot were generated using R scripts.

Generation of gene replacement mutant in Mtb

The upstream (5’ flank) and downstream region (3’ flank) of the mutY were amplified using the Rv genomic DNA. Flanks were digested with an appropriate restriction enzyme and ligated with oriE + lambda cos and hygromycin resistance cassette (Jain et al., 2014). The allelic exchange substrate was digested to generate linearized substrate and electroporated in the recombineering proficient Rv strain harboring pNit-ET plasmid (van Kessel and Hatfull, 2007). Colonies were screened post 3 weeks electroporation for gene replacement mutant.

Generation of complementation constructs and western blot analysis

Wild type allele of mutY or uvrB was PCR amplified using Rv genomic DNA. The PCR product and the vector pSTL-giles were digested with NdeI and HindIII to generate pSTL-giles-mutY or pSTL-giles-uvrB. Subsequently, the PCR product was ligated with the vector pSTL-giles. SapI-based cloning was employed for the generation of pSTL-giles-mutY-R262Q or pSTL-giles-uvrB-A524V. Constructs, pSTL-giles-mutY and pSTL-giles-mutY-R262Q, were electroporated in the msmΔmutY or RvΔmutY to generate msmΔmutY::mutY and msmΔmutY::mutY-R262Q or RvΔmutY::mutY and RvΔmutY::mutY-R262Q. Constructs, pSTL-giles-uvrB or pSTL-giles-uvrB-A524V, were electroporated in msmΔuvrB to generate msmΔuvrB::uvrB or msmΔuvrB::uvrB- A524V (oligonucleotides used in the study are given in the Supplementary file 9). 50 ml cultures of Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q were inoculated in 7H9-ADC medium at A600 ~0.1 and grown till A600 ~0.8. Cells were pelleted in 50-ml falcon tubes at 4000 rpm for 10 min and resuspended in the lysis buffer containing protease inhibitors. Cells were transferred in the bead beating tubes containing zirconium beads. Bead beating was performed for eight cycles. The cell lysate was centrifuged twice at 13,000 rpm for 45 min at 4°C. Protein was estimated using the Bradford assay reagent. 50 μg of Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q was loaded on two independent 10% SDS-PAGE and transferred to nitrocellulose membrane. 5% BSA prepared in 1XPBST20 was used for blocking the membrane for 2 hr. Membranes were incubated overnight at 4°C with α-FLAG (1:5000), andα-GroEL-1 (1:10000), respectively. Membranes were washed using 1XPBST20 (thrice) and incubated with anti-rabbit secondary antibody DARPO (1:10000) for 1 hr at room temperature (RT; 25°C). Membranes were washed thrice with 1XPBST20, and a blot was developed using a chemiluminescence reagent.

Analysis of mutation frequency and rate

Antibiotic sensitive cultures of msm, msmΔmutY, msmΔmutY::mutY, msmΔmutY::mutY-R262Q, msmΔuvrB, msmΔuvrB::uvrB, and msmΔuvrB::uvrB-A524V were grown in 7H9-ADC medium A600 ~0.6, and 50,000 cells/ml were inoculated in fresh 10-ml medium. Cultures were grown for 6 days in a 37°C incubator at 200 pm. On the seventh day, appropriate dilutions were plated on 7H11-OADC plain plates to determine the load, and 1 ml was plated on rifampicin (50 μg/ml). Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY R262Q strains were inoculated at A600~0.1. Strains were grown to A600 ~0.6 and plated on 7H11-plain or rifampicin (2 μg/ml) or isoniazid (5 μg/ml) plates to calculate the mutation frequency. Six antibiotic-sensitive colonies were grown in a 7H9-ADC medium up to A600 ~0.8, and 50,000 cells/ml were inoculated in a fresh 10-ml 7H9-ADC medium in the presence of 15% sterile Rv culture filtrate. On the 15th day, appropriate dilutions were plated on 7H11-OADC plain plates to determine the load, and 1 ml was plated on rifampicin (2 μg/ml), isoniazid (5 μg/ml), and ciprofloxacin (1.5 μg/ml). The mutation rate was determined as reported previously (Boshoff et al., 2003; David, 1970). Two biologically independent experiment sets were performed to determine mutation frequency. Each experiment was performed in the biological triplicate (mutation frequency) or six triplicate (mutation rate). Data represent one of the two biological sets of experiments. Data represent mean and standard deviation. Statistical analysis (two-way ANOVA) was performed using Graphpad prism software. ***p<0.0001, **p<0.001, and *p<0.01.

Killing kinetics in the presence of anti-TB drugs

Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q were grown in 7H9-ADC medium up to A600 ~0.8. Cultures were inoculated in the fresh medium at the A600 ~0.6 in 10-ml 7H9-ADC medium. Different antibiotics- rifampicin (2 μg/ml), isoniazid (5 μg/ml), and ciprofloxacin (1.5 μg/ml) were added, and the CFUs were enumerated at day 0, 3, 6, and 9 post on 7H11-OADC plates. Two biologically independent sets of experiments were performed. Each experiment was performed in the biological triplicate. Data represent one of the two biological sets of experiments. Data represent mean and standard deviation. Statistical analysis (two-way ANOVA) was performed using Graphpad prism software. ***p<0.0001, **p<0.001, and *p<0.01.

Survival of strains ex vivo

Balb/c mice were injected with thioglycollate, and 72 hr post-injection, peritoneal macrophages were isolated. One million cells were seeded in each well of a six-well plate. Cells were infected with Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY R262Q independently at an multiplicity of infection (MOI) of 1:5. After 24 hr p.i. cells were treated with rifampicin (1 μg/ml), isoniazid (1 μg/ml), and ciprofloxacin (2.5 μg/ml). Cells were lysed at 120 hr p.i. using 0.05% sodium dodecyl sulfate (SDS), and the bacteria was extracted. Bacteria were washed thrice with PBS to ensure the removal of SDS. Extracted bacteria were cultured in a 7H9-ADC medium without antibiotics for 5–7 days up to A600~0.4. These cultures were used for the next round of infection. The whole process was repeated three times. During the fourth round of infection, CFUs were enumerated at 4 hr p.i. and 96 hr p.i. to determine the survival of different strains. Percent survival was calculated by normalized CFU obtained at 96 hr with respect to the respective CFUs obtained at 4 hr. Two biologically independent experiment sets were performed. Each experiment was performed in biological triplicate. A representative experiment is shown in Figure 5. Similarly, the strains obtained after three rounds of infection from the above experiment were used for the competition experiment. Rv + RvΔmutY, Rv + RvΔmutY::mutY, and Rv + RvΔmutY::mutY-R262Q were mixed in a 1:1 ratio. 24 hr p.i, the cells were either not treated or treated with the antibiotics (isoniazid-1μg/ml, rifampicin- 1 μg/ml, and ciprofloxacin- 2.5 μg/ml) for subsequent 72 hr. CFUs were enumerated at 4 hr and 96 hr p.i. on 7H11-plain (all strains), 7H11-Kan (Rv), and 7H11-Hyg (RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q) plates. Percent survival was calculated as (CFUs on Kan or Hyg plates/[CFUs on Kan +CFUs on Hyg)]×100. Two biologically independent sets of experiments were performed. Each experiment was performed in the biological triplicate. Data represent one of the two biological sets of experiments. Statistical analysis (two-way ANOVA) was performed using Graph pad prism software. ***p<0.0001, **p<0.001, and *p<0.01.

WGS of strains under different conditions

Independent colonies of Rv (n=10), RvΔmutY (n=10), RvΔmutY::mutY (n=10), and RvΔmutY::mutY-R262Q (n=10) were randomly selected and grown in vitro without any selection for genomic DNA isolation. Rv (n=10), RvΔmutY (n=10), RvΔmutY::mutY (n=10), and RvΔmutY::mutY-R262Q (n=10) were obtained after ex vivo passage in the absence and in the presence of isoniazid, rifampicin, and ciprofloxacin were grown independently for genomic DNA isolation. Similarly, Rv (n=10), RvΔmutY (n=10), RvΔmutY::mutY (n=10), and RvΔmutY::mutY-R262Q (n=10) isolated from guinea pig lungs were grown for the genomic DNA isolation. During the library preparation, genomic DNA isolated from an independent colony of Rv was mixed in equal amounts. The library of the mixed sample was prepared, and the library was sent for WGS after a quality check. A similar procedure was used to prepare each strain’s libraries under different conditions. Variant calling was performed as described in the section-sequence retrieval and variant calling. SNPs less than or equal to 20% cut-off were discarded. A final matrix containing identified SNPs in the genes, mutation percentage, and blosum score was generated. Heat maps and circos plots were generated using custom python scripts.

Guinea pig infection

Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q cultures were grown up to A600 ~0.8. For preparing single-cell suspension, Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q cultures were pelleted at 4000 rpm at RT. After suspending cells in saline, cells were passed through a 26½ gauge needle to obtain a single-cell suspension. 1×108 cells were taken in the 15 ml saline for infection. Female outbred Hartley guinea pigs were challenged using a Madison chamber calibrated to deliver ~100 bacilli/lung through the aersolic route. For determining the deposition of bacteria in the lungs of guinea pigs (n=3), CFUs were enumerated at 1 day post-infection on 7H11-plain plates for each strain. Survival of each strain was determined at 56 days post-infection in the lungs and spleen (n=7). Rv + RvΔmutY, Rv + RvΔmutY::mutY, and Rv + RvΔmutY::mutY R262Q were mixed in a 1:1 ratio, and the guinea pigs (n=10 per strain) were challenged as described above. CFUs were enumerated at 1 and 56 days post-infection on 7H11-plain plates or those containing kanamycin or hygromycin-containing plates. Data represent the SEM. Statistical analysis (two-way ANOVA) was performed using Graph pad prism software. ***p<0.0001, **p<0.001, and *p<0.01. Percent survival was calculated as (CFUs on Kan or Hyg plates/[CFUs on Kan + CFUs on Hyg])×100. Data represents Standard deviation and mean. Statistical analysis (Unpaired t-test) was performed using Graph pad prism software. ***p<0.0001, **p<0.001, and *p<0.01. All guinea pig infection experiments were performed at the same time. Guinea pigs were not treated with any antibiotics before or after infection.

Acknowledgements

This work was funded by the Department of Biotechnology, Government of India (BT/PR13522/COE/34/27/2015) and the J.C Bose fellowship (JCB/2019/000015). SN is a Senior Project Associate in the J.C Bose fellowship (JCB/2019/000015). We thank the Tuberculosis Aerosol Challenge Facility at ICGEB and staff for their help in performing animal infection experiments. We are thankful to the bio-containment facility (BSL3) at NII. Vector-pYUB1471 is a kind gift from Prof. William R Jacobs’s laboratory.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Vinay Kumar Nandicoori, Email: vinaykn@nii.ac.in.

Digby F Warner, University of Cape Town, South Africa.

Bavesh D Kana, University of the Witwatersrand, South Africa.

Funding Information

This paper was supported by the following grants:

  • Department of Biotechnology, Ministry of Science and Technology, India BT/PR13522/COE/34/27/2015 to Vinay Kumar Nandicoori.

  • Department of Science and Technology, Ministry of Science and Technology, India JCB/2019/000015 to Vinay Kumar Nandicoori.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Validation, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Conceptualization, Data curation, Software, Formal analysis.

Investigation.

Investigation.

Supervision.

Resources, Supervision.

Conceptualization, Resources, Supervision, Funding acquisition, Writing - original draft, Project administration, Writing - review and editing.

Ethics

Animal experiments protocol was approved by the Animal Ethics Committee of the National Institute of Immunology, New Delhi, India. The approval (IAEC#409/16) is as per the guidelines issued by the Committee for the Purpose of Control and Supervision of Experiments on Animals (CPCSEA), Government of India.

Additional files

Supplementary file 1. Total number of clinical strains used in this study.

The table contains the total number of clinical strains obtained from different studies.

elife-75860-supp1.xlsx (83.5KB, xlsx)
Supplementary file 2. Clinical strains used for the Genome-wide association analysis.

The table contains the clinical strains which are used for performing genome-wide association study analysis.

elife-75860-supp2.xlsx (68KB, xlsx)
Supplementary file 3. Synonymous change identified in the association analysis.

The table contains synonymous changes identified in the multidrug-resistant/extensively drug-resistant strains.

elife-75860-supp3.xlsx (17.6KB, xlsx)
Supplementary file 4. Non-synonymous change identified in the association analysis.

The table contains non-synonymous changes identified in the multidrug-resistant/extensively drug-resistant strains.

elife-75860-supp4.xlsx (21.4KB, xlsx)
Supplementary file 5. Upstream gene variants identified in the association analysis.

The table contains non-upstream gene variants identified in the multidrug-resistant/extensively drug-resistant strains.

elife-75860-supp5.xlsx (12.2KB, xlsx)
Supplementary file 6. Stop codon or frameshift mutations identified in the association analysis.

The table contains Stop codon or frameshift mutations identified in the multidrug-resistant/extensively drug-resistant strains.

elife-75860-supp6.xlsx (11.7KB, xlsx)
Supplementary file 7. Codon usage of the multidrug-resistant/extensively drug-resistant (MDR/XDR) strains.

The table contains codon usage in the MDR/XDR and Rv strain.

elife-75860-supp7.xlsx (12.2KB, xlsx)
Supplementary file 8. Mutations identified in Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q strains under different conditions.

The table consists of percentage of mutation and blosum score of genes that harbor s under different conditions.

elife-75860-supp8.xlsx (29.6KB, xlsx)
Supplementary file 9. Oligonucleotide used in the study.

The table consists of oligonucleotide used in the study.

elife-75860-supp9.xlsx (10.2KB, xlsx)
Transparent reporting form

Data availability

All data generated or analysed during this study are included in the manuscript and supporting file. Sequencing data have been deposited at NCBI under Bioproject PRJNA885615. Code availability at GitHub: https://github.com/kumar-paritosh/analysis_of_Mtb_genome; (copy archived at swh:1:rev:cf2547e50b00c57dee9b60bca899ed50e617106f).

The following dataset was generated:

Nandicoori VK. 2022. Whole genome sequencing of Mycobacterium tuberculosis strains under different conditions. NCBI BioProject. PRJNA885615

The following previously published datasets were used:

Hicks ND, Yang J, Zhang X, Zhao B, Grad YH, Liu L. 2018. Clinically prevalent mutations in Mycobacterium tuberculosis alter propionate metabolism and mediate multidrug tolerance. NCBI BioProject. PRJNA268900

Zhang H, Li D, Zhao L, Fleming J, Lin N, Wang T. 2013. Genome sequencing of 161 Mycobacterium tuberculosis isolates from China identifies genes and intergenic regions associated with drug resistance. NCBI Sequence Read Archive. SRA065095

Casali N, Nikolayevskyy V, Balabanova Y, Harris SR, Ignatyeva O, Kontsevaya I. 2014. Evolution and transmission of drug-resistant tuberculosis in a Russian population. ERP000192. PRJEB2138

Blouin Y, Hauck Y, Soler C, Fabre M, Vong R, Dehan C. 2012. Significance of the identification in the Horn of Africa of an exceptionally deep branching Mycobacterium tuberculosis clade. ERP001885. PRJEB3334

Shanmugam S, Kumar N, Nair D, Natrajan M, Tripathy SP, Peacock SJ. 2018. Genome Sequencing of Polydrug-, Multidrug-, and Extensively Drug-Resistant Mycobacterium tuberculosis Strains from South India. NCBI BioProject. 492975

Guerra-Assuncao JA, Houben RM, Crampin AC, Mzembe T, Mallard K, Coll F. 2015. Recurrence due to relapse or reinfection with Mycobacterium tuberculosis: a whole-genome sequencing approach in a large, population-based cohort with a high HIV infection prevalence and active follow-up. ERP001072. PRJEB2794

Clark TG, Mallard K, Coll F, Preston M, Assefa S, Harris D. 2013. Elucidating emergence and transmission of multidrug-resistant tuberculosis in treatment experienced patients by whole genome sequencing. EBI. PRJEB2424

Bryant JM, Harris SR, Parkhill J, Dawson R, Diacon AH, van Helden P. 2013. Whole-genome sequencing to establish relapse or re-infection with Mycobacterium tuberculosis: a retrospective observational study. DDBJ. ERA020628

Walker TM, Cl IP, Harrell RH, Evans JT, Kapatai G, Dedicoat MJ. 2013. Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study. ERP000276. PRJEB2221

References

  1. Alexander PE, De P. The emergence of extensively drug-resistant tuberculosis (TB): TB/HIV coinfection, multidrug-resistant TB and the resulting public health threat from extensively drug-resistant TB, globally and in Canada. The Canadian Journal of Infectious Diseases & Medical Microbiology = Journal Canadien Des Maladies Infectieuses et de La Microbiologie Medicale. 2007;18:289–291. doi: 10.1155/2007/986794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bastos ML, Hussain H, Weyer K, Garcia-Garcia L, Leimane V, Leung CC, Narita M, Penã JM, Ponce-de-Leon A, Seung KJ, Shean K, Sifuentes-Osornio J, Van der Walt M, Van der Werf TS, Yew WW, Menzies D, Collaborative Group for Meta-analysis of Individual Patient Data in MDR-TB Treatment outcomes of patients with multidrug-resistant and extensively drug-resistant tuberculosis according to drug susceptibility testing to first- and second-line drugs: an individual patient data meta-analysis. Clinical Infectious Diseases. 2014;59:1364–1374. doi: 10.1093/cid/ciu619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Blouin Y, Hauck Y, Soler C, Fabre M, Vong R, Dehan C, Cazajous G, Massoure P-L, Kraemer P, Jenkins A, Garnotel E, Pourcel C, Vergnaud G. Significance of the identification in the horn of Africa of an exceptionally deep branching Mycobacterium tuberculosis clade. PLOS ONE. 2012;7:e52841. doi: 10.1371/journal.pone.0052841. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Boshoff HIM, Reed MB, Barry CE, Mizrahi V. DnaE2 polymerase contributes to in vivo survival and the emergence of drug resistance in mycobacterium tuberculosis. Cell. 2003;113:183–193. doi: 10.1016/s0092-8674(03)00270-8. [DOI] [PubMed] [Google Scholar]
  6. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. Tassel: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23:2633–2635. doi: 10.1093/bioinformatics/btm308. [DOI] [PubMed] [Google Scholar]
  7. Brandis G, Hughes D. The selective advantage of synonymous codon usage bias in Salmonella. PLOS Genetics. 2016;12:e1005926. doi: 10.1371/journal.pgen.1005926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bryant JM, Harris SR, Parkhill J, Dawson R, Diacon AH, van Helden P, Pym A, Mahayiddin AA, Chuchottaworn C, Sanne IM, Louw C, Boeree MJ, Hoelscher M, McHugh TD, Bateson ALC, Hunt RD, Mwaigwisya S, Wright L, Gillespie SH, Bentley SD. Whole-Genome sequencing to establish relapse or re-infection with Mycobacterium tuberculosis: a retrospective observational study. The Lancet. Respiratory Medicine. 2013;1:786–792. doi: 10.1016/S2213-2600(13)70231-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Casali N, Nikolayevskyy V, Balabanova Y, Harris SR, Ignatyeva O, Kontsevaya I, Corander J, Bryant J, Parkhill J, Nejentsev S, Horstmann RD, Brown T, Drobniewski F. Evolution and transmission of drug-resistant tuberculosis in a Russian population. Nature Genetics. 2014;46:279–286. doi: 10.1038/ng.2878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chai Q, Lu Z, Liu CH. Host defense mechanisms against Mycobacterium tuberculosis. Cellular and Molecular Life Sciences. 2020;77:1859–1878. doi: 10.1007/s00018-019-03353-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chopra I, O’Neill AJ, Miller K. The role of mutators in the emergence of antibiotic-resistant bacteria. Drug Resistance Updates. 2003;6:137–145. doi: 10.1016/s1368-7646(03)00041-4. [DOI] [PubMed] [Google Scholar]
  12. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. A program for annotating and predicting the effects of single nucleotide polymorphisms, snpeff. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Clark TG, Mallard K, Coll F, Preston M, Assefa S, Harris D, Ogwang S, Mumbowa F, Kirenga B, O’Sullivan DM, Okwera A, Eisenach KD, Joloba M, Bentley SD, Ellner JJ, Parkhill J, Jones-López EC, McNerney R. Elucidating emergence and transmission of multidrug-resistant tuberculosis in treatment experienced patients by whole genome sequencing. PLOS ONE. 2013;8:e83012. doi: 10.1371/journal.pone.0083012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cohen KA, Manson AL, Desjardins CA, Abeel T, Earl AM. Deciphering drug resistance in Mycobacterium tuberculosis using whole-genome sequencing: progress, promise, and challenges. Genome Medicine. 2019;11:45. doi: 10.1186/s13073-019-0660-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, Gordon SV, Eiglmeier K, Gas S, Barry CE, Tekaia F, Badcock K, Basham D, Brown D, Chillingworth T, Connor R, Davies R, Devlin K, Feltwell T, Gentles S, Hamlin N, Holroyd S, Hornsby T, Jagels K, Krogh A, McLean J, Moule S, Murphy L, Oliver K, Osborne J, Quail MA, Rajandream MA, Rogers J, Rutter S, Seeger K, Skelton J, Squares R, Squares S, Sulston JE, Taylor K, Whitehead S, Barrell BG. Deciphering the biology of mycobacterium tuberculosis from the complete genome sequence. Nature. 1998;393:537–544. doi: 10.1038/31159. [DOI] [PubMed] [Google Scholar]
  16. Coll F, Phelan J, Hill-Cawthorne GA, Nair MB, Mallard K, Ali S, Abdallah AM, Alghamdi S, Alsomali M, Ahmed AO, Portelli S, Oppong Y, Alves A, Bessa TB, Campino S, Caws M, Chatterjee A, Crampin AC, Dheda K, Furnham N, Glynn JR, Grandjean L, Minh Ha D, Hasan R, Hasan Z, Hibberd ML, Joloba M, Jones-López EC, Matsumoto T, Miranda A, Moore DJ, Mocillo N, Panaiotov S, Parkhill J, Penha C, Perdigão J, Portugal I, Rchiad Z, Robledo J, Sheen P, Shesha NT, Sirgel FA, Sola C, Oliveira Sousa E, Streicher EM, Helden PV, Viveiros M, Warren RM, McNerney R, Pain A, Clark TG. Genome-Wide analysis of multi- and extensively drug-resistant Mycobacterium tuberculosis. Nature Genetics. 2018;50:307–316. doi: 10.1038/s41588-017-0029-0. [DOI] [PubMed] [Google Scholar]
  17. CRyPTIC Consortium and the 100,000 Genomes Project. Allix-Béguec C, Arandjelovic I, Bi L, Beckert P, Bonnet M, Bradley P, Cabibbe AM, Cancino-Muñoz I, Caulfield MJ, Chaiprasert A, Cirillo DM, Clifton DA, Comas I, Crook DW, De Filippo MR, de Neeling H, Diel R, Drobniewski FA, Faksri K, Farhat MR, Fleming J, Fowler P, Fowler TA, Gao Q, Gardy J, Gascoyne-Binzi D, Gibertoni-Cruz A-L, Gil-Brusola A, Golubchik T, Gonzalo X, Grandjean L, He G, Guthrie JL, Hoosdally S, Hunt M, Iqbal Z, Ismail N, Johnston J, Khanzada FM, Khor CC, Kohl TA, Kong C, Lipworth S, Liu Q, Maphalala G, Martinez E, Mathys V, Merker M, Miotto P, Mistry N, Moore DAJ, Murray M, Niemann S, Omar SV, Ong RT-H, Peto TEA, Posey JE, Prammananan T, Pym A, Rodrigues C, Rodrigues M, Rodwell T, Rossolini GM, Sánchez Padilla E, Schito M, Shen X, Shendure J, Sintchenko V, Sloutsky A, Smith EG, Snyder M, Soetaert K, Starks AM, Supply P, Suriyapol P, Tahseen S, Tang P, Teo Y-Y, Thuong TNT, Thwaites G, Tortoli E, van Soolingen D, Walker AS, Walker TM, Wilcox M, Wilson DJ, Wyllie D, Yang Y, Zhang H, Zhao Y, Zhu B. Prediction of susceptibility to first-line tuberculosis drugs by DNA sequencing. The New England Journal of Medicine. 2018;379:1403–1415. doi: 10.1056/NEJMoa1800474. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. David HL. Probability distribution of drug-resistant mutants in unselected populations of Mycobacterium tuberculosis. Applied Microbiology. 1970;20:810–814. doi: 10.1128/am.20.5.810-814.1970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Desjardins CA, Cohen KA, Munsamy V, Abeel T, Maharaj K, Walker BJ, Shea TP, Almeida DV, Manson AL, Salazar A, Padayatchi N, O’Donnell MR, Mlisana KP, Wortman J, Birren BW, Grosset J, Earl AM, Pym AS. Genomic and functional analyses of Mycobacterium tuberculosis strains implicate ALD in D-cycloserine resistance. Nature Genetics. 2016;48:544–551. doi: 10.1038/ng.3548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Ebrahimi-Rad M, Bifani P, Martin C, Kremer K, Samper S, Rauzier J, Kreiswirth B, Blazquez J, Jouan M, van Soolingen D, Gicquel B. Mutations in putative mutator genes of Mycobacterium tuberculosis strains of the W-Beijing family. Emerging Infectious Diseases. 2003;9:838–845. doi: 10.3201/eid0907.020803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Eldholm V, Norheim G, von der Lippe B, Kinander W, Dahle UR, Caugant DA, Mannsåker T, Mengshoel AT, Dyrhol-Riise AM, Balloux F. Evolution of extensively drug-resistant mycobacterium tuberculosis from a susceptible ancestor in a single patient. Genome Biology. 2014;15:490. doi: 10.1186/s13059-014-0490-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Farhat MR, Shapiro BJ, Kieser KJ, Sultana R, Jacobson KR, Victor TC, Warren RM, Streicher EM, Calver A, Sloutsky A, Kaur D, Posey JE, Plikaytis B, Oggioni MR, Gardy JL, Johnston JC, Rodrigues M, Tang PKC, Kato-Maeda M, Borowsky ML, Muddukrishna B, Kreiswirth BN, Kurepina N, Galagan J, Gagneux S, Birren B, Rubin EJ, Lander ES, Sabeti PC, Murray M. Genomic analysis identifies targets of convergent positive selection in drug-resistant Mycobacterium tuberculosis. Nature Genetics. 2013;45:1183–1189. doi: 10.1038/ng.2747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ford CB, Shah RR, Maeda MK, Gagneux S, Murray MB, Cohen T, Johnston JC, Gardy J, Lipsitch M, Fortune SM. Mycobacterium tuberculosis mutation rate estimates from different lineages predict substantial differences in the emergence of drug-resistant tuberculosis. Nature Genetics. 2013;45:784–790. doi: 10.1038/ng.2656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Gagneux S, DeRiemer K, Van T, Kato-Maeda M, de Jong BC, Narayanan S, Nicol M, Niemann S, Kremer K, Gutierrez MC, Hilty M, Hopewell PC, Small PM. Variable host-pathogen compatibility in mycobacterium tuberculosis. PNAS. 2006;103:2869–2873. doi: 10.1073/pnas.0511240103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Gao L, Turner MK, Chao S, Kolmer J, Anderson JA. Genome wide association study of seedling and adult plant leaf rust resistance in elite spring wheat breeding lines. PLOS ONE. 2016;11:e0148671. doi: 10.1371/journal.pone.0148671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Guerra-Assunção JA, Houben RMGJ, Crampin AC, Mzembe T, Mallard K, Coll F, Khan P, Banda L, Chiwaya A, Pereira RPA, McNerney R, Harris D, Parkhill J, Clark TG, Glynn JR. Recurrence due to relapse or reinfection with Mycobacterium tuberculosis: a whole-genome sequencing approach in a large, population-based cohort with a high HIV infection prevalence and active follow-up. The Journal of Infectious Diseases. 2015;211:1154–1163. doi: 10.1093/infdis/jiu574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Gygli SM, Borrell S, Trauner A, Gagneux S. Antimicrobial resistance in Mycobacterium tuberculosis: mechanistic and evolutionary perspectives. FEMS Microbiology Reviews. 2017;41:354–373. doi: 10.1093/femsre/fux011. [DOI] [PubMed] [Google Scholar]
  28. Hicks ND, Yang J, Zhang X, Zhao B, Grad YH, Liu L, Ou X, Chang Z, Xia H, Zhou Y, Wang S, Dong J, Sun L, Zhu Y, Zhao Y, Jin Q, Fortune SM. Clinically prevalent mutations in Mycobacterium tuberculosis alter propionate metabolism and mediate multidrug tolerance. Nature Microbiology. 2018;3:1032–1042. doi: 10.1038/s41564-018-0218-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Jain P, Hsu T, Arai M, Biermann K, Thaler DS, Nguyen A, González PA, Tufariello JM, Kriakov J, Chen B, Larsen MH, Jacobs WR. Specialized transduction designed for precise high-throughput unmarked deletions in mycobacterium tuberculosis. MBio. 2014;5:e01214. doi: 10.1128/mBio.01245-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Kristofich J, Morgenthaler AB, Kinney WR, Ebmeier CC, Snyder DJ, Old WM, Cooper VS, Copley SD. Synonymous mutations make dramatic contributions to fitness when growth is limited by a weak-link enzyme. PLOS Genetics. 2018;14:e1007615. doi: 10.1371/journal.pgen.1007615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kurthkoti K, Kumar P, Jain R, Varshney U. Important role of the nucleotide excision repair pathway in mycobacterium smegmatis in conferring protection against commonly encountered DNA-damaging agents. Microbiology. 2008;154:2776–2785. doi: 10.1099/mic.0.2008/019638-0. [DOI] [PubMed] [Google Scholar]
  32. Kurthkoti K, Srinath T, Kumar P, Malshetty VS, Sang PB, Jain R, Manjunath R, Varshney U. A distinct physiological role of muty in mutation prevention in mycobacteria. Microbiology. 2010;156:88–93. doi: 10.1099/mic.0.033621-0. [DOI] [PubMed] [Google Scholar]
  33. Lee TH, Guo H, Wang X, Kim C, Paterson AH. SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data. BMC Genomics. 2014;15:162. doi: 10.1186/1471-2164-15-162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Li H, Durbin R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Lipka AE, Tian F, Wang Q, Peiffer J, Li M, Bradbury PJ, Gore MA, Buckler ES, Zhang Z. GAPIT: genome association and prediction integrated tool. Bioinformatics. 2012;28:2397–2399. doi: 10.1093/bioinformatics/bts444. [DOI] [PubMed] [Google Scholar]
  36. Lopatkin AJ, Bening SC, Manson AL, Stokes JM, Kohanski MA, Badran AH, Earl AM, Cheney NJ, Yang JH, Collins JJ. Clinically relevant mutations in core metabolic genes confer antibiotic resistance. Science. 2021;371:eaba0862. doi: 10.1126/science.aba0862. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Manson AL, Cohen KA, Abeel T, Desjardins CA, Armstrong DT, Barry CE, Brand J, Chapman SB, Cho SN, Gabrielian A, Gomez J, Jodals AM, Joloba M, Jureen P, Lee JS, Malinga L, Maiga M, Nordenberg D, Noroc E, Romancenco E, Salazar A, Ssengooba W, Velayati AA, Winglee K, Zalutskaya A, Via LE, Cassell GH, Dorman SE, Ellner J, Farnia P, Galagan JE, Rosenthal A, Crudu V, Homorodean D, Hsueh PR, Narayanan S, Pym AS, Skrahina A, Swaminathan S, Van der Walt M, Alland D, Bishai WR, Cohen T, Hoffner S, Birren BW, Earl AM, TBResist Global Genome Consortium Genomic analysis of globally diverse mycobacterium tuberculosis strains provides insights into the emergence and spread of multidrug resistance. Nature Genetics. 2017;49:395–402. doi: 10.1038/ng.3767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Mestre O, Luo T, Dos Vultos T, Kremer K, Murray A, Namouchi A, Jackson C, Rauzier J, Bifani P, Warren R, Rasolofo V, Mei J, Gao Q, Gicquel B. Phylogeny of Mycobacterium tuberculosis Beijing strains constructed from polymorphisms in genes involved in DNA replication, recombination and repair. PLOS ONE. 2011;6:e16020. doi: 10.1371/journal.pone.0016020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Naz S, Dabral S, Nagarajan SN, Arora D, Singh LV, Kumar P, Singh Y, Kumar D, Varshney U, Nandicoori VK. Compromised base excision repair pathway in Mycobacterium tuberculosis imparts superior adaptability in the host. PLOS Pathogens. 2021;17:e1009452. doi: 10.1371/journal.ppat.1009452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Paritosh K. Analysis_of_Mtb_Genome. cf4527eGithub. 2022 https://github.com/kumar-paritosh/analysis_of_Mtb_genome
  41. Plotkin JB, Kudla G. Synonymous but not the same: the causes and consequences of codon bias. Nature Reviews. Genetics. 2011;12:32–42. doi: 10.1038/nrg2899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Power RA, Parkhill J, de Oliveira T. Microbial genome-wide association studies: lessons from human GWAS. Nature Reviews. Genetics. 2017;18:41–50. doi: 10.1038/nrg.2016.132. [DOI] [PubMed] [Google Scholar]
  43. Rimmer A, Phan H, Mathieson I, Iqbal Z, Twigg SRF, WGS500 Consortium. Wilkie AOM, McVean G, Lunter G. Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications. Nature Genetics. 2014;46:912–918. doi: 10.1038/ng.3036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Safi H, Gopal P, Lingaraju S, Ma S, Levine C, Dartois V, Yee M, Li L, Blanc L, Ho Liang HP, Husain S, Hoque M, Soteropoulos P, Rustad T, Sherman DR, Dick T, Alland D. Phase variation in mycobacterium tuberculosis glpk produces transiently heritable drug tolerance. PNAS. 2019;116:19665–19674. doi: 10.1073/pnas.1907631116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Shanmugam S, Kumar N, Nair D, Natrajan M, Tripathy SP, Peacock SJ, Swaminathan S, Ranganathan UD. Genome sequencing of polydrug-, multidrug-, and extensively drug-resistant mycobacterium tuberculosis strains from south india. Microbiology Resource Announcements. 2019;8:12. doi: 10.1128/MRA.01388-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Singh A. Guardians of the mycobacterial genome: a review on DNA repair systems in mycobacterium tuberculosis. Microbiology. 2017;163:1740–1758. doi: 10.1099/mic.0.000578. [DOI] [PubMed] [Google Scholar]
  47. The CRyPTIC Consortium Genome-wide association studies of global mycobacterium tuberculosis resistance to 13 antimicrobials in 10,228 genomes identify new resistance mechanisms. PLOS Biology. 2022a;20:e3001755. doi: 10.1371/journal.pbio.3001755. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. The CRyPTIC Consortium A data compendium associating the genomes of 12,289 mycobacterium tuberculosis isolates with quantitative resistance phenotypes to 13 antibiotics. PLOS Biology. 2022b;20:e3001721. doi: 10.1371/journal.pbio.3001721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Theis K, Skorvaga M, Machius M, Nakagawa N, Van Houten B, Kisker C. The nucleotide excision repair protein UvrB, a helicase-like enzyme with a catch. Mutation Research. 2000;460:277–300. doi: 10.1016/s0921-8777(00)00032-x. [DOI] [PubMed] [Google Scholar]
  50. Umar F, Hatta M, Husain DR, Natzir R, Dwiyanti R, Junita AR, Primaguna MR. The effect of anti-tuberculosis drugs therapy on mrna efflux pump gene expression of rv1250 in mycobacterium tuberculosis collected from tuberculosis patients. New Microbes and New Infections. 2019;32:100609. doi: 10.1016/j.nmni.2019.100609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, Jordan T, Shakir K, Roazen D, Thibault J, Banks E, Garimella KV, Altshuler D, Gabriel S, DePristo MA. From fastq data to high confidence variant calls: the genome analysis toolkit best practices pipeline. Current Protocols in Bioinformatics. 2013;43:11. doi: 10.1002/0471250953.bi1110s43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. van Kessel JC, Hatfull GF. Recombineering in Mycobacterium tuberculosis. Nature Methods. 2007;4:147–152. doi: 10.1038/nmeth996. [DOI] [PubMed] [Google Scholar]
  53. Walker TM, Ip CLC, Harrell RH, Evans JT, Kapatai G, Dedicoat MJ, Eyre DW, Wilson DJ, Hawkey PM, Crook DW, Parkhill J, Harris D, Walker AS, Bowden R, Monk P, Smith EG, Peto TEA. Whole-Genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study. The Lancet. Infectious Diseases. 2013;13:137–146. doi: 10.1016/S1473-3099(12)70277-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. WHO . Global Tuberculosis Report. World Health Organization; 2020. [Google Scholar]
  55. WHO . WHO announces updated definitions of extensively drug-resistant tuberculosis. World Health Organization; 2021. [Google Scholar]
  56. Zegeye H, Rasheed A, Makdis F, Badebo A, Ogbonnaya FC. Genome-Wide association mapping for seedling and adult plant resistance to stripe rust in synthetic hexaploid wheat. PLOS ONE. 2014;9:e105593. doi: 10.1371/journal.pone.0105593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Zhang H, Li D, Zhao L, Fleming J, Lin N, Wang T, Liu Z, Li C, Galwey N, Deng J, Zhou Y, Zhu Y, Gao Y, Wang T, Wang S, Huang Y, Wang M, Zhong Q, Zhou L, Chen T, Zhou J, Yang R, Zhu G, Hang H, Zhang J, Li F, Wan K, Wang J, Zhang X-E, Bi L. Genome sequencing of 161 Mycobacterium tuberculosis isolates from China identifies genes and intergenic regions associated with drug resistance. Nature Genetics. 2013;45:1255–1260. doi: 10.1038/ng.2735. [DOI] [PubMed] [Google Scholar]

Editor's evaluation

Digby F Warner 1

This paper provides important evidence implicating polymorphisms in the mycobacterial adenine DNA glycosylase, MutY, in the emergence of antibiotic resistance in Mycobacterium tuberculosis. While the precise mechanism underlying this phenotype requires further investigation, the inference from genome-wide association analyses of sequenced clinical isolates, supported by laboratory experiments and animal infection models, is convincing. This work adds a new locus of interest to the list of polymorphisms associated with tuberculosis drug resistance, and is likely to be relevant to the mycobacterial research field.

Decision letter

Editor: Digby F Warner1

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "GWAS and functional studies implicate a role for altered DNA repair in the evolution of drug resistance in Mycobacterium tuberculosis" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Gisela Storz as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this summary to help you prepare a revised submission.

Essential revisions:

The reviewers were in general agreement that the work presents interesting and potentially important results; however, all reviewers felt that, in its current form, the conclusions are inadequately supported by the data. The following are therefore considered essential to address the major concerns:

1) The GWAS analysis provides the foundation of the paper, yet all three reviewers expressed concerns about the presentation of – and, therefore, inferences derived from – this part of the study. More, and clearer, information is required about the strains included, their potential phylogenetic relatedness, and the distributions of the identified resistance-associated mutations.

2) A key claim of this work is that the in vivo fitness advantage of the MutY mutant strain is due to hypermutagenesis and selection of a fitter strain, however the evidence supporting this conclusion is lacking.

3) Related to point (2) above: the clinical M. tuberculosis strains carrying the identified mutY and uvrB mutations should contain additional polymorphisms as a consequence of the loss/impairment of core DNA repair function(s), however no evidence is presented in support of (or refuting) this assumption.

4) There was some uncertainty among the reviewers whether the phenotypes reported are genuine drug-resistance, or rather a form of (heritable) tolerance. This uncertainty, which might be ascribed to the use throughout the manuscript of mutation frequency assessments, not mutation rates, must be resolved. For example, through the use of fluctuation assays; by determining resistance levels quantitatively (i.e., in MIC assessments); and/or by establishing the breadth of antimycobacterial agents (cidal and static drugs with different targets/mechanisms of action) to which reduced susceptibility is conferred by the identified DNA repair polymorphisms.

Reviewer #1 (Recommendations for the authors):

This manuscript presents intriguing data suggesting a role for defective DNA repair pathways in the development of drug resistance under treatment. However, there are a few key issues/questions which must be addressed.

(i) The GWAS analysis is confusing and/or unclear as described:

a. The authors report (L85-7) the use of known drug resistance phenotypes (where available) or inferences of drug-resistance from genotypic data to enhance the potential to identify other mutations that might be implicated in enabling the DR mutations, yet their list of known DR mutations (Table 1) seems predominantly to comprise rare or unusual mutations, not those commonly associated with clinical DR-TB.

b. In the same lines (85-7), the authors state that strains were separated into 5 drug susceptibility categories, the last of which was "pre-XDR", yet later sentences (e.g., L95, 104, 128, etc.) refer to "MDR/XDR" strains. Are these XDR or pre-XDR?

c. The distributions of the identified resistance-associated mutations across the different lineages need to be explained more clearly; this is necessary to strengthen claims of specific selection of the mutations under antibiotic treatment.

(ii) A central claim of the paper is that enhanced fitness, as a consequence of elevated mutagenesis, contributes to the prevalence of the uvrB and mutY mutations among the DR Mtb strains. While plausible, this is not explored in the manuscript: instead, the experimental work is limited to assessments of competitive survival in various models, with/without antibiotic selection, or to mutant frequency analyses. There is no direct evidence of increased mutational load in the mutY and uvrB strains. To strengthen the credibility of this central claim, WGS data of "successful"/DR mutants would be very useful.

(iii) Continuing from the above, it appears that these mutations confer heritable tolerance, rather than resistance. This does not undermine the value of this work, but it does suggest additional lines of experimental inquiry, for example:

a. mutation rate (not frequency) assessments;

b. use of higher drug concentrations (e.g., rifampicin at 200 ug/ml in M. smegmatis);

c. MIC assays to indicate precisely (quantitatively) the impact of the mutations on drug susceptibility;

d. expansion of the drug panel to include compounds with different mechanisms of action, cidal as well as static; that is, how broadly tolerant are these strains as a consequence of the individual mutations?

(iv) Continuing from the above, increased mutagenesis is generally deleterious, especially where enabled by the loss of core DNA repair functions; it would be useful to know, therefore, whether the mutY and uvrB mutations – both genes encoding proteins involved in excision repair pathways – incur any fitness costs. This could be assessed via DNA damage tolerance/survival assays. Also, are both mutations tolerated in a single strain (is there any evidence of this in the GWAS data?)? That is, do dual mutY uvrB alleles enhance drug survival, or reduce it, compared to the single mutants alone?

(v) Why is the wild-type parental strain (Rv) selectable on kanamycin-containing media? Presumably this is because it contains the complementation vector?

Reviewer #3 (Recommendations for the authors):

Text improvements:

– It is curious that this GWAS analysis did not identify some of the most common acquired drug-resistance conferring mutations. For example, as the authors note, this analysis identifies rpoB p.Leu452Pro (note that in the main text this is mislabeled as p.Leu452Val) and p.Val496Met, but misses the far more common Ser450L and Asp435Val alleles. A discussion as to why these (and other) known drug resistance mutations did not meet the cutoffs would be welcomed.

– Lines 49-50: "Although Mtb has a lower mutation rate…" Lower than what?

– Lines 56-59: "Despite well-known mechanisms of drug resistance, in 10-40% of the clinical isolates of Mtb, drug resistance cannot be determined by the mutations in the direct targets of antibiotics, implying the presence of hitherto unknown mechanisms that foster the development of resistance in Mtb (8)." It would be worth updating this statement based on the most recent work from the CRYPTIC consortium, for example.

– Lines 89-91: "The total number of SNPs observed for susceptible, mono-DR, MDR, or pre-XDR strains were comparable, suggesting no genetic drift during the evolution of antibiotic resistance (Figure 1b)." If the premise of the work is that drug-resistant TB strains are prone to mild hypermutator phenotypes, wouldn't one expect an elevated number of SNPs in DR strains? Probably need to compare within linaeg to nearest neighbors, sine diff lineages divereged from reference Rv at diff times

– Lines 128-130: "This result is in accord with the studies published in other bacteria, where a synonymous mutation impacts mRNA stability (26-29)." Since the authors do not measure mRNA stability in this manuscript, this statement is inaccurate.

– The reviewer would appreciate if gene common names were included as an individual column in Supplemental Tables (e.g. Table S3,S4, etc.) to facilitate evaluation of the data.

[Editors’ note: further revisions were suggested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "GWAS and functional studies suggest a role for altered DNA repair in the evolution of drug resistance in Mycobacterium tuberculosis" for further consideration by eLife. Your revised article has been evaluated by Bavesh Kana (Senior Editor) and a Reviewing Editor.

The editors and reviewers agree that the revised manuscript has been significantly improved by the incorporation of substantial new evidence; the authors are congratulated on the extent of the revisions, and the volume of new experimental and analytical data added. There is, however, one substantive issue remaining that must be addressed:

All three reviewers feel that the evidence is strong in support of the central claim that the identified mycobacterial MutY polymorphism confers increased frequency of resistance to antibiotics in vitro and impacts survival in vivo in models of M. tuberculosis infection. These are valuable insights. What is less certain, though, is whether this phenotype is due to hypermutagenesis, as the authors propose. In the absence of definitive evidence in support of this possibility, the authors are advised to temper any claims about mechanism; rather, the language should be softened to convey the conclusion that a polymorphism found in some clinical strains impairs MutY function, but the consequences of this for M. tuberculosis pathogenesis and/or emergence of drug resistance require further experimentation.

Reviewer #1 (Recommendations for the authors):

In my review of the original manuscript, I raised three concerns, namely that (i) the GWAS analysis was confusing; (ii) albeit tantalising, the proposed mechanism of enhanced/accelerated evolution of drug-resistance under antibiotic treatment was inadequately supported by the data presented; and (iii) it was difficult to determine whether the phenotypes reported resulted from genuine drug resistance or drug tolerance.

By incorporating significant new experimental and analytical data (for which the authors are congratulated), the revised version submitted here for review has substantially addressed these concerns.

My sole nagging doubt relates to the proposal that deficient MutY function results in hypermutagenesis, in turn allowing for selection of fitter (drug resistant) mutants under antibiotic treatment. The evidence presented in support of this conclusion remains inconclusive, despite the authors' contentions. For example, the analysis of C-T/C-G mutations (in Figure 2—figure supplement 6i) does not provide compelling evidence of defective 8-oxoG repair (it is notable, too, that these data are presented without any assessment of statistical significance). This is not a fatal doubt, however it does suggest that the authors should temper any claims about underlying mechanism – better to report the observed association of mutY mutations with propensity for drug resistance in the absence of overly speculative theories around mechanism.

Reviewer #2 (Recommendations for the authors):

The authors have submitted a revised manuscript in response to the prior reviews. In that review, the points raised were:

1) Provide additional data that the enhanced survival of the Mtb mutY KO in macrophages and guinea pigs is linked to hyper-mutability and consequent mutations that enhance fitness.

2) More statistical rigor in the number of replicates in the mutation frequency analysis.

3) Provide more data on strain relatedness.

The authors have responded positively to these critiques and have supplied more data.

1) The strain relatedness has been clarified.

2) The manuscript now includes mutation rate measurements which show enhanced mutation rates in the mutY KO.

3) On the question of whether the enhanced survival of the MutY KO can be attributed to enhanced fitness through mutagenesis, the authors have provided new data of whole genome sequencing of isolated colonies that arose from passage through macrophages or guinea pigs, in the former case in the presence or absence of antibiotics. I thank the authors for undertaking this extensive experimentation in response to the critique. In some cases, the data does seem to show a correlation between inactivation of mutY and evolution of antimicrobial resistance. This seems clearest for the gyrA mutation in the presence of cipro treatment (see Figure 6E) which is present in a high proportion of treated mutY null or R262Q strains, but not in wild type or complemented. This provides some support for the hypothesis that the MutY polymorphism can enhance the evolution of antimicrobial resistance, although one cannot derive true frequencies from this data.

The data for mutation accumulation in the absence of antibiotics is less clear and can't be as clearly linked to a phenotypic advantage in vivo. So, I think the genome sequencing adds to the story about evolution of drug resistance, but I don't think it supports the idea that the enhanced in vivo growth is due to adaptive evolution. I think this latter point should be softened as other explanations are possible, including a toxic effect of MutY DNA damage in the presence of oxo-G in vivo, as noted in the prior review.

Despite these caveats, I do think the paper discovers a MutY polymorphism and shows it is functionally important for mutagenesis, and in vivo survival, which is an important finding linking a clinical strain polymorphism to a mechanism of drug resistance and pathogenesis.

Reviewer #3 (Recommendations for the authors):

This reviewer thanks the authors for their revised submission. I have only one suggestion related to "Essential Revision" point 3 from the first review of this manuscript.

"(3) Related to point (2) above: the clinical M. tuberculosis strains carrying the identified mutY and uvrB mutations should contain additional polymorphisms as a consequence of the loss/impairment of core DNA repair function(s), however no evidence is presented in support of (or refuting) this assumption."

The primary conclusion of this paper is that loss-of-function mutations in the DNA repair genes uvrB and mutY contribute to the evolution of drug resistance in Mtb by elevating the mutation rate (Figure 2 —figure supplement 5). In support of this conclusion, the authors present data in M. smegmatis (uvrB) or Mtb (mutY) that two identified mutants (uvrB-A524V and mutY-R262Q) have elevated mutation rates in laboratory strains.

Where the authors work would benefit greatly is convincingly demonstrating this association- mutations in uvrB and mutY and elevated mutation rates- using publicly available whole-genome sequencing data from clinical Mtb strains. If the authors hypothesis is true, then it is reasonable to expect that clinical Mtb strains harboring loss-of-function mutations in uvrB and mutY would have more polymorphisms relative to strains WT for uvrB and mutY. The authors present a preliminary analysis to this effect and do not see this association (response to reviewers document). Figure 2 —figure supplement 6I supposedly shows this association, but this reviewer could not find a detailed explanation how this analysis was done, nor is it clear that the authors statement "showed increased C->T or C->G mutations" is true, particularly for C->G. The authors mention that lack of mutY should lead to elevated C->G or C->A mutations, so all SNP types should be included in this analysis to see if any hypermutator phenotype is specific to the SNP effects expected for mutY.

If the authors can analyze publicly available Mtb genome sequencing data and convincingly demonstrate an association between uvrB and mutY mutations and elevated polymorphism burden- or provide a convincing explanation as to why this association is not expected or seen- this would strongly support the primary conclusion of this manuscript.

eLife. 2023 Jan 25;12:e75860. doi: 10.7554/eLife.75860.sa2

Author response


Essential revisions:

The reviewers were in general agreement that the work presents interesting and potentially important results; however, all reviewers felt that, in its current form, the conclusions are inadequately supported by the data. The following are therefore considered essential to address the major concerns:

1) The GWAS analysis provides the foundation of the paper, yet all three reviewers expressed concerns about the presentation of – and, therefore, inferences derived from – this part of the study. More, and clearer, information is required about the strains included, their potential phylogenetic relatedness, and the distributions of the identified resistance-associated mutations.

We thank all the reviewers for their insightful comments. In the revised manuscript, we have performed the phylogenetic analysis of the strains used. A phylogenetic tree was generated using Mycobacterium canetti as an outgroup (Figure 1b). The phylogeny analysis suggests the clustering of the strains in lineage 1, 2, 3, and 4. Lineages 2, 3 and 4 are clustering together, and lineage 1 is monophyletic, as reported previously. The genome sequence data of 2773 clinical strains were downloaded from NCBI. These strains were also part of the GWAS analysis performed by Coll et al. (https://pubmed.ncbi.nlm.nih.gov/29358649/) and Manson et al. (https://pubmed.ncbi.nlm.nih.gov/28092681/). The phenotype of the strains used for the association analysis was reported in the previous studies. We have not performed other predictions. The supplementary table provides the lineage origin of each strain used in the study (Supplementary File 1 and 2). The distributions of resistance-associated mutations in different strains is shown (Figure 2—figure supplement 6a-h). As suggested, we have performed an analysis wherein we looked for the direct target mutations that harbor mutations in the DNA repair genes (Figure 2—figure supplement 6i-k). We have extensively worked on the presentation of the data to make it more discernible.

2) A key claim of this work is that the in vivo fitness advantage of the MutY mutant strain is due to hypermutagenesis and selection of a fitter strain, however the evidence supporting this conclusion is lacking.

To ascertain if the better survival of the RvDmutY, or RvDmutY::mutY-R262Q, is indeed due to the acquisition of mutations in the direct target of antibiotics, we performed WGS of the strain from the ex vivo evolution experiment (Figure 5). Genomic DNA extracted from ten independent colonies (grown in vitro), was mixed in equal proportion prior to library preparation. For the analysis, only those SNPs that were present in >20% of reads were retained. Analysis of Rv sequences grown in vitro suggested that the laboratory strain has accumulated 100 SNPs compared with the reference strain. The sequence of Rv laboratory strain was used as the reference strain for the subsequent analysis. WGS data for RvDmutY, RvDmutY::mutY, and RvDmutY::mutY-R262Q strains grown in vitro did not show the presence of the mutation in the antibiotic target genes. In a similar vein, ten independent colonies each from the 7H11-OADC plates after the final round of ex vivo selection in the presence or absence of antibiotics were selected for WGS. Data indicated that in the absence of antibiotics, no direct target mutations were identified in the ex vivo passaged strains (Figure 6a and e). In the presence of isoniazid, we found mutations in the katG (Ser315Thr or Ser315Ileu) in the Rv, RvDmutY but not in RvDmutY::mutY and RvDmutY::mutY-R262Q (Figure 6b and e). These findings are in congruence with the ex vivo evolution CFU analysis, wherein we did not observe a significant increase in the survival of RvDmutY and RvDmutY::mutY-R262Q in the presence of isoniazid (Figure 5). In the presence of ciprofloxacin and rifampicin, direct target mutations were identified in the gyrA and rpoB (Figure 6c-e). Asp94Glu/Asp94Gly mutations were identified in gyrA, and, His445Tyr/Ser450Leu mutations were identified in rpoB of RvDmutY and RvDmutY::mutY-R262Q, respectively. No direct target mutations were identified in the Rv and RvDmutY::mutY, suggesting that the perturbed DNA repair aids in acquiring the drug resistance-conferring mutations in Mtb (Figure 6c-e and Supplementary File 8).

To determine if the better survival of the RvDmutY, or RvDmutY::mutY-R262Q, in the guinea pig infection experiment (Figure 8) is due to the accumulation of mutations in the host, we performed WGS of the strain isolated from guinea pig lungs. Analysis revealed specific genes such as cobQ1, smc, espI, and valS were mutated only in RvDmutY and RvDmutY::mutYR262Q but not in Rv and RvDmutY::mutY. Besides, tcrA and gatA were mutated only in RvDmutY, whereas rv0746 were mutated exclusively in the RvDmutY::mutY (Figure 8—figure supplement 2). However, we did not observe any direct target mutations; this may be because guinea pigs were not subjected to antibiotic treatment. Data suggests that the continued longterm selection pressure is necessary for bacilli to acquire mutations.

3) Related to point (2) above: the clinical M. tuberculosis strains carrying the identified mutY and uvrB mutations should contain additional polymorphisms as a consequence of the loss/impairment of core DNA repair function(s), however no evidence is presented in support of (or refuting) this assumption.

We analyzed the genome of the clinical strains that possess DNA repair gene mutations to determine the additional polymorphisms. The number of SNPs in the strains harboring DNA repair mutation and the drug-susceptible strains appears to be similar. The marginal difference, if any were not statistically significant.

Author response image 1.

Author response image 1.

4) There was some uncertainty among the reviewers whether the phenotypes reported are genuine drug-resistance, or rather a form of (heritable) tolerance. This uncertainty, which might be ascribed to the use throughout the manuscript of mutation frequency assessments, not mutation rates, must be resolved. For example, through the use of fluctuation assays; by determining resistance levels quantitatively (i.e., in MIC assessments); and/or by establishing the breadth of antimycobacterial agents (cidal and static drugs with different targets/mechanisms of action) to which reduced susceptibility is conferred by the identified DNA repair polymorphisms.

As suggested, we determined the mutation rate in the presence of isoniazid, rifampicin, and ciprofloxacin (Figure 3g-j). The fold increase in the mutation rate relative to Rv for RvDmutY, RvDmutY:mutY, and RvDmutY::mutY-R262Q was 2.90, 0.76, and 3.0 in the presence of isoniazid and 5.62, 1.13, and 5.10 or 9.14, 1.57, and 8.71 in the presence of rifampicin and ciprofloxacin respectively.

In addition, we determined the effect of different drugs on the survival of RvDmutY or RvDmutY::mutY-R262Q by performing killing kinetics in the presence and absence of isoniazid, rifampicin, ciprofloxacin, and ethambutol (Figure 4a). In the absence of antibiotics, the growth kinetics of Rv, RvDmutY, RvDmutY:mutY, and RvDmutY::mutY-R262Q were similar (Figure 4b). In the presence of isoniazid, ~2 log-fold decreases in bacterial survival was observed on day 3 in Rv and RvDmutY:mutY; however, in RvDmutY and RvDmutY::mutY-R262Q, the difference was limited to ~1.5 log-fold (Figure 4c). A similar trend was apparent on days 6 and 9, suggesting a ~5-fold increase in the survival of RvDmutY and RvDmutY::mutY-R262Q compared with Rv and RvDmutY:mutY (Figure 4c). Interestingly, in the presence of ethambutol, we did not observe any significant difference (Figure 4d). In the presence of rifampicin and ciprofloxacin, we observed a ~10-fold increase in the survival of RvDmutY and RvDmutY::mutY-R262Q compared with Rv and RvDmutY:mutY (Figure 4e-f). Thus results suggest that the absence of mutY or the presence of mutY variant aids in subverting the antibiotic stress.

Reviewer #1 (Recommendations for the authors):

This manuscript presents intriguing data suggesting a role for defective DNA repair pathways in the development of drug resistance under treatment. However, there are a few key issues/questions which must be addressed.

(i) The GWAS analysis is confusing and/or unclear as described:

a. The authors report (L85-7) the use of known drug resistance phenotypes (where available) or inferences of drug-resistance from genotypic data to enhance the potential to identify other mutations that might be implicated in enabling the DR mutations, yet their list of known DR mutations (Table 1) seems predominantly to comprise rare or unusual mutations, not those commonly associated with clinical DR-TB.

We have modified the manuscript extensively to make it more discernible. In the revised manuscript, we have performed the phylogenetic analysis of the strains used. A phylogenetic tree was generated using Mycobacterium canetti as an outgroup (Figure 1b). The phylogeny analysis suggests the clustering of the strains in lineage 1, 2,3, and 4. Lineages 2,3 and 4 are clustering together, and lineage 1 is monophyletic, as reported previously. The genome sequence data of 2773 clinical strains were downloaded from NCBI. These strains were also part of the GWAS analysis performed by Coll et al. (https://pubmed.ncbi.nlm.nih.gov/29358649/) and Manson et al.

(https://pubmed.ncbi.nlm.nih.gov/28092681/). The phenotype of the strains used for the association analysis was reported in the previous studies. We have not performed other predictions. The supplementary table provides the lineage origin of each strain used in the study (Supplementary File 1 and 2). The distributions of resistance-associated mutations in different strains is shown (Figure 2—figure supplement 6 a-h). As suggested, we have performed an analysis wherein we looked for the direct target mutations that harbor mutations in the DNA repair genes (Figure 2—figure supplement 6 i-k).

We identified mostly the rare mutations due to the following reasons;

1. We looked for the mutations that were present only in the multidrug resistant strains as compared to the susceptible strains for association mapping. This strategy exclusively gave most variants associated with multidrug resistant phenotype.

2. We have used Mixed Linear Model (MLM) for association analysis. MLM removes all the population-specific SNPs based on PCA and kinship corrections. The false discovery rate (FDR) adjusted p-values in the GAPIT software are stringent as it corrects the effects of each marker based on the population structure (Q) as well as kinship (K) values. Therefore the probability of identifying the false-positive SNP is very low. We combined it with the Bonferroni corrections to identify markers associated with the drug resistant phenotype.

b. In the same lines (85-7), the authors state that strains were separated into 5 drug susceptibility categories, the last of which was "pre-XDR", yet later sentences (e.g., L95, 104, 128, etc.) refer to "MDR/XDR" strains. Are these XDR or pre-XDR?

Strains used for association analysis were drug susceptible, MDR, Poly-DR, pre-XDR, and XDR. MDR/XDR refers to all the categories put together.

c. The distributions of the identified resistance-associated mutations across the different lineages need to be explained more clearly; this is necessary to strengthen claims of specific selection of the mutations under antibiotic treatment.

As suggested, we have reported the lineage of identified resistant-associated mutation (Figure 2—figure supplement 6i).

(ii) A central claim of the paper is that enhanced fitness, as a consequence of elevated mutagenesis, contributes to the prevalence of the uvrB and mutY mutations among the DR Mtb strains. While plausible, this is not explored in the manuscript: instead, the experimental work is limited to assessments of competitive survival in various models, with/without antibiotic selection, or to mutant frequency analyses. There is no direct evidence of increased mutational load in the mutY and uvrB strains. To strengthen the credibility of this central claim, WGS data of "successful"/DR mutants would be very useful.

To ascertain if the better survival of the RvDmutY, or RvDmutY::mutY-R262Q, is indeed due to the acquisition of mutations in the direct target of antibiotics, we performed WGS of the strain from the ex vivo evolution experiment (Figure 5). Genomic DNA extracted from ten independent colonies (grown in vitro) was mixed in equal proportion prior to library preparation. Only those SNPs present in >20% of reads were retained for the analysis. Analysis of Rv sequences grown in vitro suggested that the laboratory strain has accumulated 100 SNPs compared with the reference strain. The sequence of Rv laboratory strain was used as the reference strain for the subsequent analysis. WGS data for RvDmutY, RvDmutY::mutY, and RvDmutY::mutY-R262Q strains grown in vitro did not show the presence of a mutation in the antibiotic target genes. In a similar vein, ten independent colonies, each from the 7H11-OADC plates, after the final round of ex vivo selection in the presence or absence of antibiotics, were selected for WGS. Data indicated that in the absence of antibiotics, no direct target mutations were identified in the ex vivo passaged strains (Figure 6a and e). In the presence of isoniazid, we found mutations in the katG (Ser315Thr or Ser315Ileu) in the Rv, RvDmutY but not in RvDmutY:mutY and RvDmutY::mutY-R262Q (Figure 6b and e). These findings are in congruence with the ex vivo evolution CFU analysis, wherein we did not observe a significant increase in the survival of RvDmutY and RvDmutY::mutY-R262Q in the presence of isoniazid (Figure 5). In the presence of ciprofloxacin and rifampicin, direct target mutations were identified in the gyrA and rpoB (Figure 6c-e).

Asp94Glu/Asp94Gly mutations were identified in gyrA, and, His445Tyr/Ser450Leu mutations were identified in rpoB of RvDmutY and RvDmutY::mutY-R262Q, respectively. No direct target mutations were identified in the Rv and RvDmutY::mutY, suggesting that the perturbed DNA repair aids in acquiring the drug resistance-conferring mutations in Mtb (Figure 6c-e and Supplementary File 8).

To determine if the better survival of the RvDmutY, or RvDmutY::mutY-R262Q, in the guinea pig infection experiment (Figure 8) is due to the accumulation of mutations in the host, we performed WGS of the strain isolated from guinea pig lungs. Analysis revealed specific genes such as cobQ1, smc, espI, and valS were mutated only in RvDmutY and RvDmutY::mutYR262Q but not in Rv and RvDmutY::mutY. Besides, tcrA and gatA were mutated only in RvDmutY, whereas rv0746 were mutated exclusively in the RvDmutY:mutY (Figure 8—figure supplement 2). However, we did not observe any direct target mutations; this may be because guinea pigs were not subjected to antibiotic treatment. Data suggests that the continued longterm selection pressure is necessary for bacilli to acquire mutations.

(iii) Continuing from the above, it appears that these mutations confer heritable tolerance, rather than resistance. This does not undermine the value of this work, but it does suggest additional lines of experimental inquiry, for example:

a. mutation rate (not frequency) assessments;

In the revised manuscript, we have performed the mutation rate analysis in the presence of different drugs (Figure 3g-j).

b. use of higher drug concentrations (e.g., rifampicin at 200 ug/ml in M. smegmatis);

We thank the reviewer for the comment. We initially used 100 µg/ml conc. of rifampicin. However, we did not get any colonies on the plates. Thus we chose 50 µg/ml rifampicin concentration in Msm for performing experiments.

c. MIC assays to indicate precisely (quantitatively) the impact of the mutations on drug susceptibility;

We thank the reviewer for the comment. In the revised manuscript, we have performed killing kinetics in the absence and presence of rifampicin, isoniazid, ethambutol and ciprofloxacin.

d. expansion of the drug panel to include compounds with different mechanisms of action, cidal as well as static; that is, how broadly tolerant are these strains as a consequence of the individual mutations?

To evaluate the effect of different drugs on the survival of RvDmutY or RvDmutY::mutYR262Q, we performed killing kinetics in the presence and absence of isoniazid, rifampicin, ciprofloxacin, and ethambutol (Figure 4a). In the absence of antibiotics, the growth kinetics of Rv, RvDmutY, RvDmutY::mutY, and RvDmutY::mutY-R262Q were similar (Figure 4b). In the presence of isoniazid, ~2 log-fold decreases in bacterial survival was observed on day 3 in Rv and RvDmutY:mutY; however, in RvDmutY and RvDmutY::mutY-R262Q, the difference was limited to ~1.5 log-fold (Figure 4c). A similar trend was apparent on days 6 and 9, suggesting a ~5-fold increase in the survival of RvDmutY and RvDmutY::mutY-R262Q compared with Rv and RvDmutY::mutY (Figure 4c). Interestingly, in the presence of ethambutol, we did not observe any significant difference (Figure 4d). In the presence of rifampicin and ciprofloxacin, we observed a ~10-fold increase in the survival of RvDmutY and RvDmutY::mutY-R262Q compared with Rv and RvDmutY:mutY (Figure 4e-f). Thus results suggest that the absence of mutY or the presence of mutY variant aids in subverting the antibiotic stress.

(iv) Continuing from the above, increased mutagenesis is generally deleterious, especially where enabled by the loss of core DNA repair functions; it would be useful to know, therefore, whether the mutY and uvrB mutations – both genes encoding proteins involved in excision repair pathways – incur any fitness costs. This could be assessed via DNA damage tolerance/survival assays. Also, are both mutations tolerated in a single strain (is there any evidence of this in the GWAS data?)? That is, do dual mutY uvrB alleles enhance drug survival, or reduce it, compared to the single mutants alone?

We thank the reviewer for the insightful comment. We have evaluated the survival of Rv, RvDmutY, RvDmutY::mutY, and RvDmutY::mutY-R262Q the presence of oxidative and nitrosative stress. We did not observe any significant decrease in the survival of the mutants compared with the wild type (data not shown).

Moreover, we have not observed both mutations on a single strain in our GWAS data. While it is possible that the presence of both mutations may be beneficial for the organism, we have not made any attempts to generate double mutant.

(v) Why is the wild-type parental strain (Rv) selectable on kanamycin-containing media? Presumably this is because it contains the complementation vector?

The wild-type parental strain (Rv) has a kanamycin resistance-containing complementation vector.

Reviewer #3 (Recommendations for the authors):

Text improvements:

– It is curious that this GWAS analysis did not identify some of the most common acquired drug-resistance conferring mutations. For example, as the authors note, this analysis identifies rpoB p.Leu452Pro (note that in the main text this is mislabeled as p.Leu452Val) and p.Val496Met, but misses the far more common Ser450L and Asp435Val alleles. A discussion as to why these (and other) known drug resistance mutations did not meet the cutoffs would be welcomed.

– Lines 49-50: "Although Mtb has a lower mutation rate…" Lower than what?

We have removed the sentence from the revised manuscript.

– Lines 56-59: "Despite well-known mechanisms of drug resistance, in 10-40% of the clinical isolates of Mtb, drug resistance cannot be determined by the mutations in the direct targets of antibiotics, implying the presence of hitherto unknown mechanisms that foster the development of resistance in Mtb (8)." It would be worth updating this statement based on the most recent work from the CRYPTIC consortium, for example.

As suggested, we have incorporated the changes in the revised manuscript.

– Lines 89-91: "The total number of SNPs observed for susceptible, mono-DR, MDR, or pre-XDR strains were comparable, suggesting no genetic drift during the evolution of antibiotic resistance (Figure 1b)." If the premise of the work is that drug-resistant TB strains are prone to mild hypermutator phenotypes, wouldn't one expect an elevated number of SNPs in DR strains? Probably need to compare within linaeg to nearest neighbors, sine diff lineages divereged from reference Rv at diff times

We analyzed the genome of the clinical strains that possess DNA repair gene mutations to determine the additional polymorphisms. The number of SNPs in the strains harboring DNA repair mutation and the drug-susceptible strains appears to be higher. We have also looked for the CàT and CàG mutations in the same strains. CàT mutations are higher in the strains harboring mutY variant compared with the susceptible strains (Figure 2—figure supplement 6l). However, we could not perform statistical analysis as the number of strains harbor mutY variant is limited to 8. Thus data suggest that empirically mutY phenotype sensitive the strains harboring mutY variant show higher SNPs elsewhere and CàT mutations. However, we are not stating these conclusions very strongly in the manuscript as the data is not statistically significant.

Author response image 2.

Author response image 2.

– Lines 128-130: "This result is in accord with the studies published in other bacteria, where a synonymous mutation impacts mRNA stability (26-29)." Since the authors do not measure mRNA stability in this manuscript, this statement is inaccurate.

As suggested, we have removed the line from the revised manuscript.

– The reviewer would appreciate if gene common names were included as an individual column in Supplemental Tables (e.g. Table S3,S4, etc.) to facilitate evaluation of the data.

As suggested, we have incorporated the changes in the revised manuscript. These tables are labelled as Supplementary File 3 and 4.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

The editors and reviewers agree that the revised manuscript has been significantly improved by the incorporation of substantial new evidence; the authors are congratulated on the extent of the revisions, and the volume of new experimental and analytical data added. There is, however, one substantive issue remaining that must be addressed:

All three reviewers feel that the evidence is strong in support of the central claim that the identified mycobacterial MutY polymorphism confers increased frequency of resistance to antibiotics in vitro and impacts survival in vivo in models of M. tuberculosis infection. These are valuable insights. What is less certain, though, is whether this phenotype is due to hypermutagenesis, as the authors propose. In the absence of definitive evidence in support of this possibility, the authors are advised to temper any claims about mechanism; rather, the language should be softened to convey the conclusion that a polymorphism found in some clinical strains impairs MutY function, but the consequences of this for M. tuberculosis pathogenesis and/or emergence of drug resistance require further experimentation.

As suggested, we have worked on the language and temper claims about the mechanism. The changes are incorporated and shown with the revised manuscript's track change.

Reviewer #1 (Recommendations for the authors):

In my review of the original manuscript, I raised three major concerns, namely that (i) the GWAS analysis was confusing; (ii) albeit tantalising, the proposed mechanism of enhanced/accelerated evolution of drug-resistance under antibiotic treatment was inadequately supported by the data presented; and (iii) it was difficult to determine whether the phenotypes reported resulted from genuine drug resistance or drug tolerance.

By incorporating significant new experimental and analytical data (for which the authors are congratulated), the revised version submitted here for review has substantially addressed these concerns.

My sole nagging doubt relates to the proposal that deficient MutY function results in hypermutagenesis, in turn allowing for selection of fitter (drug resistant) mutants under antibiotic treatment. The evidence presented in support of this conclusion remains inconclusive, despite the authors' contentions. For example, the analysis of C-T/C-G mutations (in Figure 2—figure supplement 6i) does not provide compelling evidence of defective 8-oxoG repair (it is notable, too, that these data are presented without any assessment of statistical significance). This is not a fatal doubt, however it does suggest that the authors should temper any claims about underlying mechanism – better to report the observed association of mutY mutations with propensity for drug resistance in the absence of overly speculative theories around mechanism.

As suggested, we have worked on the language and temper claims about mechanism. The changes are incorporated and shown with the track change (highlighted in blue) in the revised manuscript.

Reviewer #2 (Recommendations for the authors):

The authors have submitted a revised manuscript in response to the prior reviews. In that review, the major points raised were:

1) Provide additional data that the enhanced survival of the Mtb mutY KO in macrophages and guinea pigs is linked to hyper-mutability and consequent mutations that enhance fitness.

2) More statistical rigor in the number of replicates in the mutation frequency analysis.

3) Provide more data on strain relatedness.

The authors have responded positively to these critiques and have supplied more data.

1) The strain relatedness has been clarified.

2) The manuscript now includes mutation rate measurements which show enhanced mutation rates in the mutY KO.

3) On the question of whether the enhanced survival of the MutY KO can be attributed to enhanced fitness through mutagenesis, the authors have provided new data of whole genome sequencing of isolated colonies that arose from passage through macrophages or guinea pigs, in the former case in the presence or absence of antibiotics. I thank the authors for undertaking this extensive experimentation in response to the critique. In some cases, the data does seem to show a correlation between inactivation of mutY and evolution of antimicrobial resistance. This seems clearest for the gyrA mutation in the presence of cipro treatment (see Figure 6E) which is present in a high proportion of treated mutY null or R262Q strains, but not in wild type or complemented. This provides some support for the hypothesis that the MutY polymorphism can enhance the evolution of antimicrobial resistance, although one cannot derive true frequencies from this data.

The data for mutation accumulation in the absence of antibiotics is less clear and can't be as clearly linked to a phenotypic advantage in vivo. So, I think the genome sequencing adds to the story about evolution of drug resistance, but I don't think it supports the idea that the enhanced in vivo growth is due to adaptive evolution. I think this latter point should be softened as other explanations are possible, including a toxic effect of MutY DNA damage in the presence of oxo-G in vivo, as noted in the prior review.

Despite these caveats, I do think the paper discovers a MutY polymorphism and shows it is functionally important for mutagenesis, and in vivo survival, which is an important finding linking a clinical strain polymorphism to a mechanism of drug resistance and pathogenesis.

As suggested we have softened the explanation regarding the enhanced in vivo growth of different strains (line 309-312).

Reviewer #3 (Recommendations for the authors):

This reviewer thanks the authors for their revised submission. I have only one major suggestion related to "Essential Revision" point 3 from the first review of this manuscript.

"(3) Related to point (2) above: the clinical M. tuberculosis strains carrying the identified mutY and uvrB mutations should contain additional polymorphisms as a consequence of the loss/impairment of core DNA repair function(s), however no evidence is presented in support of (or refuting) this assumption."

The primary conclusion of this paper is that loss-of-function mutations in the DNA repair genes uvrB and mutY contribute to the evolution of drug resistance in Mtb by elevating the mutation rate (Figure 2 —figure supplement 5). In support of this conclusion, the authors present data in M. smegmatis (uvrB) or Mtb (mutY) that two identified mutants (uvrB-A524V and mutY-R262Q) have elevated mutation rates in laboratory strains.

Where the authors work would benefit greatly is convincingly demonstrating this association- mutations in uvrB and mutY and elevated mutation rates- using publicly available whole-genome sequencing data from clinical Mtb strains. If the authors hypothesis is true, then it is reasonable to expect that clinical Mtb strains harboring loss-of-function mutations in uvrB and mutY would have more polymorphisms relative to strains WT for uvrB and mutY. The authors present a preliminary analysis to this effect and do not see this association (response to reviewers document). Figure 2 —figure supplement 6I supposedly shows this association, but this reviewer could not find a detailed explanation how this analysis was done, nor is it clear that the authors statement "showed increased C->T or C->G mutations" is true, particularly for C->G. The authors mention that lack of mutY should lead to elevated C->G or C->A mutations, so all SNP types should be included in this analysis to see if any hypermutator phenotype is specific to the SNP effects expected for mutY.

If the authors can analyze publicly available Mtb genome sequencing data and convincingly demonstrate an association between uvrB and mutY mutations and elevated polymorphism burden- or provide a convincing explanation as to why this association is not expected or seen- this would strongly support the primary conclusion of this manuscript.

The analysis presented in response to the reviewers' document and Figure 2—figure supplement 6I was performed using available Mtb genome sequencing data of clinical strains. To analyze the mutation spectrum, we used the same clinical strains which are used in the present study. As suggested, we have included all the SNP types in the analysis. We have identified a higher trend towards CA, AG, and CT mutations in the strains harboring mutY polymorphism compared with closely related drug-susceptible strains. However, we could not perform the statistical analysis because the number of strains harboring mutY mutation was limited. We have added this analysis in Figure 2—figure supplement 6l and mentioned it in the figure legend and discussion. Analysis of other publicly available databases for uvrB or mutY mutations and possible implications on elevated polymorphism would be attempted subsequently.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Nandicoori VK. 2022. Whole genome sequencing of Mycobacterium tuberculosis strains under different conditions. NCBI BioProject. PRJNA885615
    2. Hicks ND, Yang J, Zhang X, Zhao B, Grad YH, Liu L. 2018. Clinically prevalent mutations in Mycobacterium tuberculosis alter propionate metabolism and mediate multidrug tolerance. NCBI BioProject. PRJNA268900 [DOI] [PMC free article] [PubMed]
    3. Zhang H, Li D, Zhao L, Fleming J, Lin N, Wang T. 2013. Genome sequencing of 161 Mycobacterium tuberculosis isolates from China identifies genes and intergenic regions associated with drug resistance. NCBI Sequence Read Archive. SRA065095 [DOI] [PubMed]
    4. Casali N, Nikolayevskyy V, Balabanova Y, Harris SR, Ignatyeva O, Kontsevaya I. 2014. Evolution and transmission of drug-resistant tuberculosis in a Russian population. ERP000192. PRJEB2138 [DOI] [PMC free article] [PubMed]
    5. Blouin Y, Hauck Y, Soler C, Fabre M, Vong R, Dehan C. 2012. Significance of the identification in the Horn of Africa of an exceptionally deep branching Mycobacterium tuberculosis clade. ERP001885. PRJEB3334 [DOI] [PMC free article] [PubMed]
    6. Shanmugam S, Kumar N, Nair D, Natrajan M, Tripathy SP, Peacock SJ. 2018. Genome Sequencing of Polydrug-, Multidrug-, and Extensively Drug-Resistant Mycobacterium tuberculosis Strains from South India. NCBI BioProject. 492975 [DOI] [PMC free article] [PubMed]
    7. Guerra-Assuncao JA, Houben RM, Crampin AC, Mzembe T, Mallard K, Coll F. 2015. Recurrence due to relapse or reinfection with Mycobacterium tuberculosis: a whole-genome sequencing approach in a large, population-based cohort with a high HIV infection prevalence and active follow-up. ERP001072. PRJEB2794 [DOI] [PMC free article] [PubMed]
    8. Clark TG, Mallard K, Coll F, Preston M, Assefa S, Harris D. 2013. Elucidating emergence and transmission of multidrug-resistant tuberculosis in treatment experienced patients by whole genome sequencing. EBI. PRJEB2424 [DOI] [PMC free article] [PubMed]
    9. Bryant JM, Harris SR, Parkhill J, Dawson R, Diacon AH, van Helden P. 2013. Whole-genome sequencing to establish relapse or re-infection with Mycobacterium tuberculosis: a retrospective observational study. DDBJ. ERA020628 [DOI] [PMC free article] [PubMed]
    10. Walker TM, Cl IP, Harrell RH, Evans JT, Kapatai G, Dedicoat MJ. 2013. Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study. ERP000276. PRJEB2221 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Figure 2—source data 1. Mutations identified in genes that belong to different categories.
    Figure 3—source data 1. Mutation rate analysis in the presence of different drugs.
    Figure 3—figure supplement 1—source data 1. Confirmation of gene repalcement mutant and complementation strains.
    Figure 3—figure supplement 2—source data 1. Analysis of Mutation frequency.
    Figure 4—source data 1. Killing kinetics in the absence and presence of different antibiotics.
    Figure 5—source data 1. Survival of different strains in the absence and presence of antibiotics ex vivo.
    Figure 5—figure supplement 1—source data 1. Survival of strains before and after passage in the peritoneal macrophages.
    Figure 7—source data 1. Competition experiment in the presence and absence of different drugs after passage in the peritoneal macrophages.
    Figure 8—source data 1. Gross histopathology of the infected lungs and spleen isolated from guinea pig.
    Figure 8—source data 2. Haematoxylin and eosin staining.
    Figure 8—source data 3. Survival of different strains in vivo.
    Figure 8—figure supplement 1—source data 1. Gross histopathology of lungs and spleen isolated from guinea pigs after competition experiment.
    Supplementary file 1. Total number of clinical strains used in this study.

    The table contains the total number of clinical strains obtained from different studies.

    elife-75860-supp1.xlsx (83.5KB, xlsx)
    Supplementary file 2. Clinical strains used for the Genome-wide association analysis.

    The table contains the clinical strains which are used for performing genome-wide association study analysis.

    elife-75860-supp2.xlsx (68KB, xlsx)
    Supplementary file 3. Synonymous change identified in the association analysis.

    The table contains synonymous changes identified in the multidrug-resistant/extensively drug-resistant strains.

    elife-75860-supp3.xlsx (17.6KB, xlsx)
    Supplementary file 4. Non-synonymous change identified in the association analysis.

    The table contains non-synonymous changes identified in the multidrug-resistant/extensively drug-resistant strains.

    elife-75860-supp4.xlsx (21.4KB, xlsx)
    Supplementary file 5. Upstream gene variants identified in the association analysis.

    The table contains non-upstream gene variants identified in the multidrug-resistant/extensively drug-resistant strains.

    elife-75860-supp5.xlsx (12.2KB, xlsx)
    Supplementary file 6. Stop codon or frameshift mutations identified in the association analysis.

    The table contains Stop codon or frameshift mutations identified in the multidrug-resistant/extensively drug-resistant strains.

    elife-75860-supp6.xlsx (11.7KB, xlsx)
    Supplementary file 7. Codon usage of the multidrug-resistant/extensively drug-resistant (MDR/XDR) strains.

    The table contains codon usage in the MDR/XDR and Rv strain.

    elife-75860-supp7.xlsx (12.2KB, xlsx)
    Supplementary file 8. Mutations identified in Rv, RvΔmutY, RvΔmutY::mutY, and RvΔmutY::mutY-R262Q strains under different conditions.

    The table consists of percentage of mutation and blosum score of genes that harbor s under different conditions.

    elife-75860-supp8.xlsx (29.6KB, xlsx)
    Supplementary file 9. Oligonucleotide used in the study.

    The table consists of oligonucleotide used in the study.

    elife-75860-supp9.xlsx (10.2KB, xlsx)
    Transparent reporting form

    Data Availability Statement

    All data generated or analysed during this study are included in the manuscript and supporting file. Sequencing data have been deposited at NCBI under Bioproject PRJNA885615. Code availability at GitHub: https://github.com/kumar-paritosh/analysis_of_Mtb_genome; (copy archived at swh:1:rev:cf2547e50b00c57dee9b60bca899ed50e617106f).

    The following dataset was generated:

    Nandicoori VK. 2022. Whole genome sequencing of Mycobacterium tuberculosis strains under different conditions. NCBI BioProject. PRJNA885615

    The following previously published datasets were used:

    Hicks ND, Yang J, Zhang X, Zhao B, Grad YH, Liu L. 2018. Clinically prevalent mutations in Mycobacterium tuberculosis alter propionate metabolism and mediate multidrug tolerance. NCBI BioProject. PRJNA268900

    Zhang H, Li D, Zhao L, Fleming J, Lin N, Wang T. 2013. Genome sequencing of 161 Mycobacterium tuberculosis isolates from China identifies genes and intergenic regions associated with drug resistance. NCBI Sequence Read Archive. SRA065095

    Casali N, Nikolayevskyy V, Balabanova Y, Harris SR, Ignatyeva O, Kontsevaya I. 2014. Evolution and transmission of drug-resistant tuberculosis in a Russian population. ERP000192. PRJEB2138

    Blouin Y, Hauck Y, Soler C, Fabre M, Vong R, Dehan C. 2012. Significance of the identification in the Horn of Africa of an exceptionally deep branching Mycobacterium tuberculosis clade. ERP001885. PRJEB3334

    Shanmugam S, Kumar N, Nair D, Natrajan M, Tripathy SP, Peacock SJ. 2018. Genome Sequencing of Polydrug-, Multidrug-, and Extensively Drug-Resistant Mycobacterium tuberculosis Strains from South India. NCBI BioProject. 492975

    Guerra-Assuncao JA, Houben RM, Crampin AC, Mzembe T, Mallard K, Coll F. 2015. Recurrence due to relapse or reinfection with Mycobacterium tuberculosis: a whole-genome sequencing approach in a large, population-based cohort with a high HIV infection prevalence and active follow-up. ERP001072. PRJEB2794

    Clark TG, Mallard K, Coll F, Preston M, Assefa S, Harris D. 2013. Elucidating emergence and transmission of multidrug-resistant tuberculosis in treatment experienced patients by whole genome sequencing. EBI. PRJEB2424

    Bryant JM, Harris SR, Parkhill J, Dawson R, Diacon AH, van Helden P. 2013. Whole-genome sequencing to establish relapse or re-infection with Mycobacterium tuberculosis: a retrospective observational study. DDBJ. ERA020628

    Walker TM, Cl IP, Harrell RH, Evans JT, Kapatai G, Dedicoat MJ. 2013. Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study. ERP000276. PRJEB2221


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES