Skip to main content
PLOS One logoLink to PLOS One
. 2021 Oct 7;16(10):e0258402. doi: 10.1371/journal.pone.0258402

Phylogenomic analysis and Mycobacterium tuberculosis antibiotic resistance prediction by whole-genome sequencing from clinical isolates of Caldas, Colombia

Lusayda Sánchez-Corrales 1, Olga Lucía Tovar-Aguirre 2, Narmer Fernando Galeano-Vanegas 3,4, Paula Alejandra Castaño Jiménez 2, Ruth Arali Martínez-Vega 5, Carlos Ernesto Maldonado-Londoño 6, Johan Sebastián Hernández-Botero 7,8, Fernando Siller-López 2,9,*
Editor: Seyed Ehtesham Hasnain10
PMCID: PMC8496870  PMID: 34618869

Abstract

Mycobacterium tuberculosis (M. tuberculosis) was the pathogen responsible for the highest number of deaths from infectious diseases in the world, before the arrival of the COVID-19 pandemic. Whole genome sequencing (WGS) has contributed to the understanding of genetic diversity, the mechanisms involved in drug resistance and the transmission dynamics of this pathogen. The object of this study is to use WGS for the epidemiological and molecular characterization of M. tuberculosis clinical strains from Chinchiná, Caldas, a small town in Colombia with a high incidence of TB. Sputum samples were obtained during the first semester of 2020 from six patients and cultured in solid Löwenstein-Jensen medium. DNA extraction was obtained from positive culture samples and WGS was performed with the Illumina HiSeq 2500 platform for subsequent bioinformatic analysis. M. tuberculosis isolates were typified as Euro-American lineage 4 with a predominance of the Harlem and LAM sublineages. All samples were proven sensitive to antituberculosis drugs by genomic analysis, although no phenotype antimicrobial tests were performed on the samples, unreported mutations were identified that could require further analysis. The present study provides preliminary data for the construction of a genomic database line and the follow-up of lineages in this region.

Introduction

M. tuberculosis (MTB) is considered one of the most lethal pathogens, with a toll of 1 billion people suffering tuberculosis (TB)-related deaths in the last two centuries. The World Health Organization (WHO) in their annual report of 2019 depicted an estimate of 10 million new cases and 1.7 million deaths per year [1], making MTB the deadliest single infectious agent before the arrival of SARS-CoV-2. As in many developing countries, Colombia has presented a steady increase in tuberculosis incidence in the last six years from 23 to 27.6 cases per 100,000 habitants in 2019. In Caldas, a department located in the center of the country, TB is considered endemic causing 34 new cases and 2.31 deaths per 100,000 habitants at its peak in 2017, making this malady a primary public health concern in the region due to the progressive aging of the population, emerging immunosuppressive treatments, sociodemographic and comorbidities conditions. Chinchiná (47,929 habitants in 2012), a small town in the center of Caldas, is key to the region’s coffee harvesting processes which lead to the arrival of hundreds of coffee pickers from all over the country at harvesting season. Endemic social issues like poverty and homelessness make a perfect combination for TB to thrive in Chinchiná resulting in one of the highest TB incidence rates in the country (71.5 cases per 100,000 habitants) [2]. WGS offers an opportunity for the molecular epidemiology surveillance for TB in the region. The development of local capacities bioinformatic analysis and the reduction on overall genome sequencing costs make whole genome sequencing (WGS) an increasingly accessible alternative for molecular epidemiologic studies [3]. With the genetic information obtained by WGS compared to other analysis, drug resistance profiles can be predicted for most of the current antibiotic treatment, enabling a wide monitoring of drug resistance.

The aim of this study is to explore the potential use of WGS in the context of routine surveillance, describing the epidemiological and molecular characteristics of circulating M. tuberculosis strains from Chinchiná, Caldas using minimal sequencing and computational resources.

Materials and methods

Population and sample

Patients suspected of pulmonary tuberculosis due to respiratory symptoms screened by routine surveillance in the first semester of 2020 were considered for the study. Demographic information of the patients was obtained from clinical records and epidemiological data from the National surveillance system (SIVIGILA) of Colombia. Seventeen cases of TB were notified from the health authorities of Chinchiná, Caldas to the SIVIGILA system. Six subjects were not considered to participate in the study since no sample was collected due to loss of follow up by the emerging coronavirus pandemic quarantine. Sputum samples were collected from 11 patients at the Hospital San Marcos in Chinchiná, Caldas, and cultured in Lowenstein-Jensen media for the microbiological and genomic analysis of M. tuberculosis. At the beginning of the COVID-19 pandemic, between March to June 2020, there were no growth-dependent or molecular susceptibility tests as part of the standard of care, therefore, no phenotypic or genotypic antimicrobial resistance data was available for the analysis. Positive cultures with colonies suspected of MTB were send to the Laboratory of Microbiology of the Universidad Católica de Manizales, Caldas for DNA extraction.

Seven samples passed the DNA quality control for the sequencing platform, whereas one sample was withdrawn because it was not taxonomically assigned to MTB genome after the bioinformatic analyzes; finally, six samples were considered for the study (Fig 1).

Fig 1. Flow diagram for patient samples.

Fig 1

DNA extraction

Genomic DNA was extracted from colonies using the Bacteria DNA preparation kit following the manufacturer’s protocol (Jena Bioscience, Germany). The quality and quantity of isolated DNA was measured using a Nanodrop 2000 spectrophotomer (ThermoScientific, Waltham, MA) and a Quantus fluorometer (Promega, Madison, WI). DNA integrity was assessed by visualization in 1% agarose gel electrophoresis. DNA samples that fulfill the quality standards in terms of integrity, purity and quantity were sent to whole genome sequencing at NovoGene (Beijing, China).

Whole genome sequencing

Sequencing was performed on PCR-free libraries on the Illumina Hiseq 2500 platform (Illumina Inc., San Diego, CA), producing 150 bp paired end reads. Quality control data of the DNA samples pre- and post-sequencing are presented on S1 and S2 Tables of the S1 File.

Bioinformatic pipeline

The Illumina reads were analyzed in a standard PC (Intel Core i7-7700, 16 Gb of RAM, 4 TB HDD) with Linux subsystem for Windows (Ubuntu 20.04.1 LTS). An in-house bioinformatics workflow was installed using Miniconda (Conda 4.10.3 Python 3.9.5) with the core programs of the unified analysis variant pipeline of the Related Sequencing TB data platform (ReSeqTB/UVP) to identify and assign lineage to the MTB isolates sequence data [4]. (S3 Table in S1 File). Briefly, reads were classified by Kraken version 2 to detect possible contamination or the presence of other mycobacteria; [5] additionally, reads were checked, validated, and trimmed with FastQC version 0.11.9 [6], FastQVAlidator (http://genome.sph.umich.edu/wiki/FastQValidator), and Trimmomatic [7], respectively. Processed reads were mapped using BWA-mem [8] using H37Rv (NCBI ID: NC_000962.3) as reference genome. Alignment quality control was assesed using QuallyMap [9] and Alfred was used to evaluate the depth of coverage in all loci of interest associated with resistance lineage assignment [10]. Genomes with > 75X of depth and > 95% reference mapped with all ReSeqTB/UVP loci of interest were considered for downstream analysis. Variant calling using GATK [11] with hard-filter settings was used to eliminate artefactual false-positive variants. [12] Finally, NGSEP version 4.0 [13] and SnpEff 5.0 [14] was used to perform functional annotation of single nucleotide variants (SNVs) and insertions or deletions (INDELs). By parsing the annotated VCF it was possible to identify variants with low, medium, and high confidence (LR+>10; p<0,05) of association with antituberculous resistance and, also, identify variants not yet described in the ReSeqTB database (V. 2019-I) [15]. The pipeline for WGS TB resistance analysis is available on demand on protocols.io (https://www.protocols.io/private/A0CEADBAFBBB11EB878B0A58A9FEAC02). Links to references of the software used for the bioinformatic pipeline are summarized on S4 Table of the S1 File.

De novo assembly and annotation

The final high-quality reads were assembled using Megahit version 1.2.9 [16]. (S5 Table in S1 File). The assembly was carried out with a kmer range from 29 to 141 at intervals of 20 and 800 bp as minimum contig size. Kaiju version 1.7.3 [17,18] was used to perform contig taxonomic assignment. Quast version 4.3 [19] was used to generate metrics and evaluate the quality of the assemblies using M. tuberculosis strain HR37v (NC_000962) as reference genome. The completeness of the assembly process was verified with Busco version 4.1.4 [20]. The structural and functional annotation was performed with Prokka version 1.12–14 [21].

Lineage assignment and phylogeny

M. tuberculosis complex (MTBC) lineages/sublineages and in silico spoligotyping were determined using TB-Profiler version 2.8.13 [22]. The variants on genes rpoB, inhA, embB, embC, gyrA, gyrB, katG, panD, pncA, recA, 16S, pssC, ahpC, cycA, clpC, aftA, atpE, mas, eccC5, kasA, gpsI, mas, and rpsA, were used as a second approach to classify the isolates on specific lineages and sub-lineages as was proposed by Coll et al. [23] Whole genome dinucleotide variants that passed the quality filters, including the phylogenetically informative SNPs for MTBC typing, and excluding regions of erroneous mapping [24,25], were used to perform clustering by phylogenetic analysis using the software Mega X 10.1. Initially, a maximum likelihood (ML) analysis was run to find the best nucleotide substitution model for the dataset, the phylogenetic analysis was done by the ML method and general time reversible model with a bootstrapping test of 1000 replications, a non-rooted phylogenetic tree to show sample’s clustering was obtained [26].

Analysis of variants associated with first- and second-line drug resistance

After variant calling and annotation, variants on each resistance gene (according to ReSeqTB/UVP) with a high confidence of association with antimicrobial resistance (LR+ > 10; p <0,05) were parsed from the variant call format (VCF) files (for a complete list of genes tested per drug, see Supplementary file, S6 Table in S1 File).

Genes related to resistance to first-line drug therapy [rifampicin (R), isoniazid (H), pyrazinamide (Z), and ethambutol (E)] and second-line drugs [fluoroquinolone (FQ), linezolid (Lnz), bedaquilina (BDQ), delamanid (DLM)] were considered for the analysis. Variants of uncertain effect in genes related to resistance to therapy, and not yet listed in the ReSeqTB database (V. 2019-I) were also evaluated. Finally, all variants (SNV, and INDELS) in drug targets were tested with PROVEAN algorithm to predict deleterious effects in proteins [27].

Ethics statement

The local San Marcos hospital sent all positive MTB cultures to DNA extraction after finished all diagnostic procedures during patient care. Clinical data of patients from surveillance records were collected and anonymized after patient care. All samples were deemed of retrospective nature by the clinical and academic ethics committee (San Marcos Hospital–2019–2907, and Universidad Católica de Manizales–DIP-1260-001-4184).

Results

Sociodemographic and clinical characteristics

All patients were of low socioeconomic status with a median age of 32 years. HIV comorbidity was present in two TB patients. Bacillary load was greater than two crosses in the first smear microscopy in four patients, however, no relation of this issue was observed with the clinical outcome (Table 1). A complete sociodemographic, epidemiological, and clinical data is listed on S7 Table in S1 File.

Table 1. Sociodemographic and clinical characteristics of patients with tuberculosis.

Patient ID TB02 TB03 TB04 TB05 TB06 TB09
Sociodemographic
Age 24 24 25 66 51 39
Sex M M M F M M
Occupation Informal jobs Agricultural Agricultural Informal jobs Unemployed Agricultural
High-risk population No Rural disperse Rural disperse No Homeless Rural disperse
Clinical
Comorbidities HIV Low BMI Low BMI HIV Low BMI COPD Low BMI None
Previous TB treatment No Yes No Yes Yes No
Hospitalization Yes No Yes Yes No Yes
Malnutrition Yes No Yes Yes No Yes
Weight loss Yes No Yes Yes No Yes
Hemoptisis Yes No Yes No Yes Yes
Fever No Yes No Yes Yes No
Bacillary load +++ +++ + ++ ++ +

M: Male, F: Female, HIV: Human immunodeficiency virus, COPD: Chronic obstructive pulmonary disease, BMI: Body Mass Index.

Lineage assignment and phylogeny

The analysis showed that all isolates belong to the Euro-American lineage (Lineage 4), divided in LAM and Harlem sublineages (Fig 2).

Fig 2. Clustering by phylogenetic analysis and lineage assignment of six M. tuberculosis isolates form Chinchiná (Caldas, Colombia).

Fig 2

In total, 1753 dinucleotide variants of high quality were found in the genome. After filtering regions of erroneous mapping, a final data set of 1386 SNVs was obtained. The un-rooted tree shows two main clades, with a branch support of 100% according to the bootstrap test.

The phylogenetic tree shows a reliable clustering. An appreciable genetic distance is not observed since there are few variations in the selected phylogeny genes.

Clade one contained samples TB02 (4.3.4.1), TB03 (4.3.4.2) and TB09 (4.3.3) and assigned to the spoligotype LAM and the closest relationship to the reference H37Rv genome. The second clade groups samples TB04, TB05, and TB06, assigned to the spoligotype Haarlem sublineage 4.1.2.1 (Fig 2). Of note, according to phylogeny analysis, the two closest isolates were TB05 and TB06 both from patients with previous antituberculous treatment.

Phylogeny inferred by maximum likelihood method using the general time reversible model, bootstrap test of 1000 replications. Values on nodes corresponds to the bootstrap value in percentage.

WGS prediction of drug resistance

According to the genomic analysis all isolates are sensitive to all the first-line drugs. Several canonical genes related to first-line treatment showed different substitutions previously reported in the ReSeqTB database with 57.8% synonymous mutations and 42.1% non-synonymous. All substitutions showed no association with clinically significant drug resistance (all positive likelihood ratio (LR)<10). A mutation in ponA1 (1893_1894insTCG) showed LR+ >10, nevertheless, was non-significant (p = 0,3818). Of 54 mutations across all samples reported in ReSeqTB for first-line drugs, only three mutations (5,5%) were classified as deleterious by the PROVEAN score (S8 Table in S1 File).

Regarding second-line treatments, several mutations previously described in ReSeqTB for FQN were found. Mutations in two genes, gyrA (with both Glu21Gln and Gly668Asp SNV in all but one isolate) and eccC5 (with one non-synonymous substitution in one sample). The rest of the mutational spectrum for quinolones are available in S3 Table in S1 File. Of note, one gyrA mutation (739G>A, Gly247Ser) with the prediction of deleterious effect (PROVEAN score of -5,942) was present in one sample. However, none of the variants related to FQN genes showed impact on drug resistance (LR+ <10, p value>0,05). Of those drugs recently added to MDR treatment, loci related to LNZ resistance (genes rrl, rplC) showed no SNV or INDEL. Another cornerstone of MDR-TB treatment, the repurposed drug BDQ, showed one synonymous substitution atpeE (24C>T) in one sample. Genes related to new drugs DMN and protamanid showed a single substitution in fbiC (Ala132Val) in one isolate, and substitution Lys270Met in fgd1 was found in four samples (S8 Table in S1 File). Finally, ten variants were predicted as deleterious in proteins associated with antimicrobial resistance genes. Of note, a Ala132Val mutation in fbiC gene related to DMN and a Thr202Ala mutation in thyA gene related to PAS deem deleterious PROVEAN scores with a predicted effect on the target protein which were no described in the ReSeqTB database (Table 2).

Table 2. Mutations no reported in ReseqTB in loci related to antituberculosis therapy detected by WGS analysis from isolates of Chinchiná.

Gene Genome position Mutation type Nucleotide change Aminoacid change PROVEAN score Isolates
PYRAZINAMIDE (P)
mas 3282700 missense variant 16A>G Thr6Ala neutral (-1,765) TB04
BEDAQUILINE (BDQ)
atpE 1461068 synonymous variant 24C>T Gly8Gly neutral (0) TB03
DELAMANID (DMN)
fgd1 491591 missense variant 809A>T Lys270Met neutral (-2,158) TB04 TB05 TB06
fbiC 1303325 missense variant 395C>T Ala132Val deleterious (-3,392) TB03
PAS
thyA 3073868 missense variant 604A>G Thr202Ala deleterious (-4,577) TB03 TB09

NOTE: PROVEAN prediction scores < -2,5 are considered deleterious.

In genomes from two HIV patient isolates (TB02, TB04), the number of missense variants (541 and 533) was within range (485–546) comparing to the rest of the dataset. Both samples had 15 SNVs in genes related to first-line treatment and 10 to 17 SNVs for the second-line treatment loci, respectively (S8 Table in S1 File).

Discussion

The present study is the first to provide information on the phylogenetic characteristics and the identification of the predominant strains in Chinchiná, a municipality with a high incidence of TB; this study alsodescribes for the first time the use of WGS in the surveillance of TB drug susceptibility in Colombia. By developing a strategy of outsourcing library creation and sequencing, plus using standard computer capabilities, our team showed that it is possible to add WGS results to the surveillance data without significant changes in the laboratory setup or considerable investments in computing power. Furthermore, to our knowledge, it provides the first high-quality WGS dataset of M. tuberculosis from Colombia in compliance with new standards in terms of depth, coverage, tools, and quality controls during the bioinformatic pipeline (Q scores > 30, depth > 125x, MAP-Q > 58, all UVP-Loci of Interest Covered) [4,25,28]. The dataset of our study will allow the inclusion of MTB WGS from Colombia for comparative analyses using current pipelines deployed for surveillance [4,20,25,28].

Although there was a limitation on the surveillance activities due to the COVID-19 pandemic and no availability of phenotypic or molecular tests to assess drug susceptibility, the incidence of TB cases in Chinchiná for the first semester of 2020 (N = 17) was similar to reports from previous years: 2019 (N = 20), 2018 (N = 22), and 2017 (N = 19). The sample was also representative on demographic and clinical characteristics of patients, predominantly male, uneducated, with low income, and no formal jobs following local and regional patterns of mobility and social risk factors [2,29]. Furthermore, all isolates were susceptible to antimicrobial drugs according to the WGS analysis, which agrees with local epidemiology (TB-DR prevalence of 1–2%) [2,29].

The Euro-American lineage found in this study presents a variable distribution in Colombia. Similar results have been found in other regions of the world. Reports from Argentina, Brazil, México and Peru have established that the predominant strains have evolved from this lineage [30]. This lineage has been described as more transmittable than others [31]. Half of the cases analyzed corresponded to the LAM sublineage, the most frequent and globally extended of the Euro-American lineages. The other half, to the Haarlem sublineage, probably consequence of human migration and their commercial activities [32]. The European origin and the influence of European settlers in America and Colombia are confirmed. Similar results have been described by other authors in regions of the country such as Cali [33] Bogotá [34] and Medellín [35]. Contrary to these reports, in Buenaventura, a region of Valle del Cauca, the Beijing lineage, clearly associated with resistant cases has been identified [33]. Most recent reports on TB-MDR Colombian isolates using spoligotyping showed unique TB-MDR strains with no support of active circulation of MDR clones and no relation to specific lineage (Beijing vs. LAM) [36]. Moreover, there is an extensive circulation (53%) of orphan lineages between the indigenous population [36]. Both studies emphasize the importance of using WGS as the primary method for phylogenetic analysis due to high resolution and discriminatory power and potentially characterize new lineages and outbreaks [37,38].

All mutations regarding first-line drugs showed no association with resistance when using a clinical relational database for interpreting the significance of variants (10.1038/s41598-018-33731-1), even though 5,5% are predicted to show deleterious effects in the proteins using the PROVEAN score [39].

The Asp103Asp mutation is a silent mutation in the rpoB gene reported by other authors in mono-resistant isolates [40]. The Gly594Glu mutation has been considered compensatory in isolates resistant to rifampicin, although it is not found in the resistance determining region (codon 432 to 438).

Mutations in mshA (Ans111Ser) were detected in three isolates, although it has been reported as a variant related to the Haarlem lineage and responsible for conferring resistance to Ethionamide [41,42] there is no significant association with drug resistance in the ReSeqTB database [4].

Regarding BDQ resistance by atpeE mutations, one isolate showed a non-repoted synonymous mutation (24C>T) outside codon positions 53 and 72, the key genomic region with most substitutions related to an increase in MIC for BDQ [43]. DMN, and more recently pretomanid for TB-XDR treatment, use similar pathways and genes for activation. fgd1 encodes a key enzyme for the recycle of the target F420 of DMN; in our study, we found three isolates with Lys270Met substitutions in patients not previously exposed to the drug, this substitution was phylogenetically restricted to the Harleem lineage which is regarded as neutral [43]. fbiC encodes a protein that participates in F420 biosynthesis and harbors key mutations found in resistant MTB isolates. The mutation Ala132Val we detected in one isolate was not described in the systematic review on DMN resistance by Kadura et al., [43] however, it showed a deleterious effect on the protein according to the PROVEAN score (-3,392); thus, the presence of this mutation in TB isolates not previously exposed to the drug requires future surveillance as not seems to be lineage-restricted [44].

No differences in SNV and other variants were observed when comparing patients living with HIV within our sample. Although with did not find statistical significance due to the small sample, our results are broadly consistent with the literature. Previous reports found that HIV co-infection did not affect the mutation rate and was not associated with faster de novo development of drug resistance [45,46].

As for the patients with previous treatment, our results showed full genetic susceptibility to current antimicrobial drugs and successful clinical outcomes after the new HRZE treatment. Although there was no previous microbial culture in our samples, the WGS results in the context of adequate therapeutic adherence and no cavitation with fibrosis in radiologic studies (associated with poor tissue penetration of drugs) [47] seem to indicate that a re-infection with a drug sensitive strain by re-exposure to a known focus would be the most plausible explanation for the re-treatment cases.

There were some limitations in our study, since due to the COVID pandemic, the researchers did not have access to phenotypic or molecular tests that would have made it possible to compare data with our genomic findings; and, due to international shipping for WGS and the corresponding response times, the delivery of the results added approximately four weeks to analyze them. Currently, we are improving those aspects for future WGS analyzes of MTB surveillance projects.

Conclusions

Recent studies in top medical journals have shown the enormous potential of WGS in the surveillance of tuberculosis but at the cost of high investments in laboratory infrastructure, sequence capacity, and processing power. However, our results indicate that it is possible to develop genomics surveillance applications at the regional level without considerable investments in sequencing platforms, laboratory upgrades, or computational infrastructure. A decentralized strategy of drug susceptibility analysis with WGS of first and second-line anti-TB drugs could accelerate the adoption of these methodologies in developing regions. The future of this approach depends on a collaboration of multiple institutions, both public and private, to overcome the limitations in turnaround time, number of samples, large phylogenetic analysis, computational requirements, and rigorous validation processes required to include WGS data in the surveillance arsenal against TB.

Supporting information

S1 File

(XLSX)

Acknowledgments

We thank the health professionals of the Hospital San Marcos of Chinchiná who provided insight and expertise that greatly assisted the research, and the epidemiological support obtained from the Dirección Territorial de Salud de Caldas.

Data Availability

The data sets analyzed in this study, with relevant epidemiological metadata, are available at the NCBI database as the Biosamples (SAMN20847272, SAMN20847273, SAMN20847274, SAMN20847275, SAMN20847276, SAMN20847277), covered on the Bioproject PRJNA755956.

Funding Statement

Funding of our study were obtained from the Universidad Católica de Manizales (Grant 033-2019), Universidad de Manizales (Grant RES-088-16-11-2018) and Universidad de Santander (Postgraduate research fellowship to Lusayda Sánchez-Corrales). Authors who received salary from the Universidad Católica de Manizales were Olga L Tovar-Aguirre, Narmer F Galeano-Vanegas and Fernando Siller-López. Ruth A Martínez-Vega received her salary from the Universidad de Santander, and Johan S Hernández-Botero received his salary from the Universidad de Manizales. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.World Health Organization. Global tuberculosis report. Geneva2019.
  • 2.Dirección Territorial de Salud de Caldas. Análisis de Situación de Salud con el Modelo de los Determinantes Sociales de Salud del departamento de Caldas, 2019. Grupo de Vigilancia en Salud Pública 2019. p. 1–98. [Google Scholar]
  • 3.Acquah R, Furin J. Universal regimens or universal access to drug susceptibility testing for tuberculosis? Lancet Infect Dis. 2019;19(3):224–5. Epub 2019/02/13. doi: 10.1016/S1473-3099(18)30742-4 . [DOI] [PubMed] [Google Scholar]
  • 4.Ezewudo M, Borens A, Chiner-Oms A, Miotto P, Chindelevitch L, Starks AM, et al. Integrating standardized whole genome sequence analysis with a global Mycobacterium tuberculosis antibiotic resistance knowledgebase. Sci Rep. 2018;8(1):15382. Epub 2018/10/20. doi: 10.1038/s41598-018-33731-1 ; PubMed Central PMCID: PMC6194142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. bioRxiv: Genome Biology; 2019. p. 1–13. doi: 10.1186/s13059-019-1891-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Andrews S. FastQC: A Quality Control Tool for High Throughput Sequence Data. 2010. [Google Scholar]
  • 7.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. Epub 2014/04/04. doi: 10.1093/bioinformatics/btu170 ; PubMed Central PMCID: PMC4103590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60. Epub 2009/05/20. doi: 10.1093/bioinformatics/btp324 ; PubMed Central PMCID: PMC2705234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Okonechnikov K, Conesa A, Garcia-Alcalde F. Qualimap 2: advanced multi-sample quality control for high-throughput sequencing data. Bioinformatics. 2016;32(2):292–4. Epub 2015/10/03. doi: 10.1093/bioinformatics/btv566 ; PubMed Central PMCID: PMC4708105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Rausch T, Hsi-Yang Fritz M, Korbel JO, Benes V. Alfred: interactive multi-sample BAM alignment statistics, feature counting and feature annotation for long- and short-read sequencing. Bioinformatics. 2019;35(14):2489–91. Epub 2018/12/07. doi: 10.1093/bioinformatics/bty1007 ; PubMed Central PMCID: PMC6612896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–303. Epub 2010/07/21. doi: 10.1101/gr.107524.110 ; PubMed Central PMCID: PMC2928508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Chowell G, Hyman JM. Mathematical and statistical modeling for emerging and re-emerging infectious diseases. Mathematical and Statistical Modeling for Emerging and Re-emerging Infectious Diseases 2016. p. 1–356. [Google Scholar]
  • 13.Tello D, Gil J, Loaiza CD, Riascos JJ, Cardozo N, Duitama J. NGSEP3: accurate variant calling across species and sequencing protocols. Bioinformatics. 2019;35(22):4716–23. Epub 2019/05/18. doi: 10.1093/bioinformatics/btz275 ; PubMed Central PMCID: PMC6853766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012;6(2):80–92. Epub 2012/06/26. doi: 10.4161/fly.19695 ; PubMed Central PMCID: PMC3679285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9. Epub 2014/03/20. doi: 10.1093/bioinformatics/btu153 . [DOI] [PubMed] [Google Scholar]
  • 16.Stucki D, Brites D, Jeljeli L, Coscolla M, Liu Q, Trauner A, et al. Mycobacterium tuberculosis lineage 4 comprises globally distributed and geographically restricted sublineages. Nat Genet. 2016;48(12):1535–43. Epub 2016/11/01. doi: 10.1038/ng.3704 ; PubMed Central PMCID: PMC5238942. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Menzel P, Ng KL, Krogh A. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat Commun. 2016;7:11257. Epub 2016/04/14. doi: 10.1038/ncomms11257 ; PubMed Central PMCID: PMC4833860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, et al. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature. 1998;393(6685):537–44. Epub 1998/06/20. doi: 10.1038/31159 . [DOI] [PubMed] [Google Scholar]
  • 19.Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29(8):1072–5. Epub 2013/02/21. doi: 10.1093/bioinformatics/btt086 ; PubMed Central PMCID: PMC3624806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Iketleng T, Lessells R, Dlamini MT, Mogashoa T, Mupfumi L, Moyo S, et al. Mycobacterium tuberculosis Next-Generation Whole Genome Sequencing: Opportunities and Challenges. Tuberc Res Treat. 2018;2018:1298542. Epub 2019/01/12. doi: 10.1155/2018/1298542 ; PubMed Central PMCID: PMC6304523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Getahun H, Matteelli A, Chaisson RE, Raviglione M. Latent Mycobacterium tuberculosis infection. N Engl J Med. 2015;372(22):2127–35. Epub 2015/05/29. doi: 10.1056/NEJMra1405427 . [DOI] [PubMed] [Google Scholar]
  • 22.Phelan JE, O’Sullivan DM, Machado D, Ramos J, Oppong YEA, Campino S, et al. Integrating informatics tools and portable sequencing technology for rapid detection of resistance to anti-tuberculous drugs. Genome Med. 2019;11(1):41. Epub 2019/06/27. doi: 10.1186/s13073-019-0650-x ; PubMed Central PMCID: PMC6591855. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Coll F, McNerney R, Guerra-Assuncao JA, Glynn JR, Perdigao J, Viveiros M, et al. A robust SNP barcode for typing Mycobacterium tuberculosis complex strains. Nat Commun. 2014;5:4812. Epub 2014/09/02. doi: 10.1038/ncomms5812 ; PubMed Central PMCID: PMC4166679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Roetzer A, Diel R, Kohl TA, Ruckert C, Nubel U, Blom J, et al. Whole genome sequencing versus traditional genotyping for investigation of a Mycobacterium tuberculosis outbreak: a longitudinal molecular epidemiological study. PLoS Med. 2013;10(2):e1001387. Epub 2013/02/21. doi: 10.1371/journal.pmed.1001387 ; PubMed Central PMCID: PMC3570532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Meehan CJ, Goig GA, Kohl TA, Verboven L, Dippenaar A, Ezewudo M, et al. Whole genome sequencing of Mycobacterium tuberculosis: current standards and open issues. Nat Rev Microbiol. 2019;17(9):533–45. Epub 2019/06/19. doi: 10.1038/s41579-019-0214-5 . [DOI] [PubMed] [Google Scholar]
  • 26.Stecher G, Tamura K, Kumar S. Molecular Evolutionary Genetics Analysis (MEGA) for macOS. Mol Biol Evol. 2020;37(4):1237–9. Epub 2020/01/07. doi: 10.1093/molbev/msz312 ; PubMed Central PMCID: PMC7086165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Choi Y, Chan AP. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31(16):2745–7. Epub 2015/04/09. doi: 10.1093/bioinformatics/btv195 ; PubMed Central PMCID: PMC4528627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Consortium CR, the GP, Allix-Beguec C, Arandjelovic I, Bi L, Beckert P, et al. Prediction of Susceptibility to First-Line Tuberculosis Drugs by DNA Sequencing. N Engl J Med. 2018;379(15):1403–15. Epub 2018/10/04. doi: 10.1056/NEJMoa1800474 ; PubMed Central PMCID: PMC6121966. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Duenes M, Cardona D. [Factors related to treatment adherence in patients with tuberculosis in Pereira, Colombia, 2012–2013]. Biomedica. 2016;36(3):423–31. Epub 2016/11/22. doi: 10.7705/biomedica.v36i3.2904 . [DOI] [PubMed] [Google Scholar]
  • 30.Woodman M, Haeusler IL, Grandjean L. Tuberculosis Genetic Epidemiology: A Latin American Perspective. Genes (Basel). 2019;10(1). Epub 2019/01/19. doi: 10.3390/genes10010053 ; PubMed Central PMCID: PMC6356704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Coscolla M, Gagneux S. Consequences of genomic diversity in Mycobacterium tuberculosis. Semin Immunol. 2014;26(6):431–44. Epub 2014/12/03. doi: 10.1016/j.smim.2014.09.012 ; PubMed Central PMCID: PMC4314449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Mokrousov I, Vyazovaya A, Iwamoto T, Skiba Y, Pole I, Zhdanova S, et al. Latin-American-Mediterranean lineage of Mycobacterium tuberculosis: Human traces across pathogen’s phylogeography. Mol Phylogenet Evol. 2016;99:133–43. Epub 2016/03/24. doi: 10.1016/j.ympev.2016.03.020 . [DOI] [PubMed] [Google Scholar]
  • 33.Garcia DF, Astudillo M. MIRU-VNTR genotyping of Mycobacterium tuberculosis in a population of patients in Cali, Colombia, 2013–2015. Biomedica. 2019;39(s1):71–85. Epub 2019/09/19. doi: 10.7705/biomedica.v39i2.3924 . [DOI] [PubMed] [Google Scholar]
  • 34.Cerezo-Cortes MI, Rodriguez-Castillo JG, Hernandez-Pando R, Murcia MI. Circulation of M. tuberculosis Beijing genotype in Latin America and the Caribbean. Pathog Glob Health. 2019;113(8):336–51. Epub 2020/01/07. doi: 10.1080/20477724.2019.1710066 ; PubMed Central PMCID: PMC7006823. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Realpe T, Correa N, Rozo JC, Ferro BE, Gomez V, Zapata E, et al. Population structure among mycobacterium tuberculosis isolates from pulmonary tuberculosis patients in Colombia. PLoS One. 2014;9(4):e93848. Epub 2014/04/22. doi: 10.1371/journal.pone.0093848 ; PubMed Central PMCID: PMC3991582. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Rodriguez-Castillo JG, Llerena C, Argoty-Chamorro L, Guerra J, Couvin D, Rastogi N, et al. Population structure of multidrug-resistant Mycobacterium tuberculosis clinical isolates in Colombia. Tuberculosis (Edinb). 2020;125:102011. Epub 2020/11/03. doi: 10.1016/j.tube.2020.102011 . [DOI] [PubMed] [Google Scholar]
  • 37.Homolka S, Projahn M, Feuerriegel S, Ubben T, Diel R, Nubel U, et al. High resolution discrimination of clinical Mycobacterium tuberculosis complex strains based on single nucleotide polymorphisms. PLoS One. 2012;7(7):e39855. Epub 2012/07/07. doi: 10.1371/journal.pone.0039855 ; PubMed Central PMCID: PMC3388094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Walker TM, Ip CL, Harrell RH, Evans JT, Kapatai G, Dedicoat MJ, et al. Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study. Lancet Infect Dis. 2013;13(2):137–46. Epub 2012/11/20. doi: 10.1016/S1473-3099(12)70277-3 ; PubMed Central PMCID: PMC3556524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS One. 2012;7(10):e46688. Epub 2012/10/12. doi: 10.1371/journal.pone.0046688 ; PubMed Central PMCID: PMC3466303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Jagielski T, Bakula Z, Brzostek A, Minias A, Stachowiak R, Kalita J, et al. Characterization of Mutations Conferring Resistance to Rifampin in Mycobacterium tuberculosis Clinical Strains. Antimicrob Agents Chemother. 2018;62(10). Epub 2018/08/01. doi: 10.1128/AAC.01093-18 ; PubMed Central PMCID: PMC6153850. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.de Souza VC, Antunes D, Santos LH, Goliatt PVZC, Caffarena ER, Guimarães ACR, et al. Insights into the Mechanism of Ethionamide Resistance in Mycobacterium tuberculosis through an in silico Structural Evaluation of EthA and Mutants Identified in Clinical Isolates. Catalysts. 2020;10(5):543. [Google Scholar]
  • 42.Rueda J, Realpe T, Mejia GI, Zapata E, Rozo JC, Ferro BE, et al. Genotypic Analysis of Genes Associated with Independent Resistance and Cross-Resistance to Isoniazid and Ethionamide in Mycobacterium tuberculosis Clinical Isolates. Antimicrob Agents Chemother. 2015;59(12):7805–10. Epub 2015/09/16. doi: 10.1128/AAC.01028-15 ; PubMed Central PMCID: PMC4649173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Kadura S, King N, Nakhoul M, Zhu H, Theron G, Koser CU, et al. Systematic review of mutations associated with resistance to the new and repurposed Mycobacterium tuberculosis drugs bedaquiline, clofazimine, linezolid, delamanid and pretomanid. J Antimicrob Chemother. 2020;75(8):2031–43. Epub 2020/05/04. doi: 10.1093/jac/dkaa136 ; PubMed Central PMCID: PMC7825472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Rifat D, Li SY, Ioerger T, Shah K, Lanoix JP, Lee J, et al. Mutations in fbiD (Rv2983) as a Novel Determinant of Resistance to Pretomanid and Delamanid in Mycobacterium tuberculosis. Antimicrob Agents Chemother. 2020;65(1). Epub 2020/10/21. doi: 10.1128/AAC.01948-20 ; PubMed Central PMCID: PMC7927868. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Eldholm V, Rieux A, Monteserin J, Lopez JM, Palmero D, Lopez B, et al. Impact of HIV co-infection on the evolution and transmission of multidrug-resistant tuberculosis. Elife. 2016;5. Epub 2016/08/10. doi: 10.7554/eLife.16644 ; PubMed Central PMCID: PMC4978521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Koch AS, Brites D, Stucki D, Evans JC, Seldon R, Heekes A, et al. The Influence of HIV on the Evolution of Mycobacterium tuberculosis. Mol Biol Evol. 2017;34(7):1654–68. Epub 2017/04/04. doi: 10.1093/molbev/msx107 ; PubMed Central PMCID: PMC5455964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Marrone MT, Venkataramanan V, Goodman M, Hill AC, Jereb JA, Mase SR. Surgical interventions for drug-resistant tuberculosis: a systematic review and meta-analysis. Int J Tuberc Lung Dis. 2013;17(1):6–16. Epub 2012/12/13. doi: 10.5588/ijtld.12.0198 . [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Seyed Ehtesham Hasnain

7 Jul 2021

PONE-D-21-17481

Phylogenomic analysis and Mycobacterium tuberculosis antibiotic resistance prediction by whole-genome sequencing from clinical isolates of Caldas, Colombia.

PLOS ONE

Dear Dr. Siller-Lopez, 

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Aug 21 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Seyed Ehtesham Hasnain

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified (a) whether consent was informed and (b) what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

If you are reporting a retrospective study of medical records or archived samples, please ensure that you have discussed whether all data were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data from their medical records used in research, please include this information.

Once you have amended this/these statement(s) in the Methods section of the manuscript, please add the same text to the “Ethics Statement” field of the submission form (via “Edit Submission”).

For additional information about PLOS ONE ethical requirements for human subjects research, please refer to " ext-link-type="uri" xlink:type="simple">http://journals.plos.org/plosone/s/submission-guidelines#loc-human-subjects-research."

3. Thank you for stating the following in the Acknowledgments Section of your manuscript:

"This research was supported by Universidad Católica de Manizales, Universidad de Manizales and Universidad de Santander. Epidemiological support was obtained from the Dirección Territorial de Salud de Caldas."

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

"The funders had no role in study design, data collection and analysis, decision to

publish, or preparation of the manuscript."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

4. We note that Figure 3 in your submission contain satellite images which may be copyrighted. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (a) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (b) remove the figures from your submission:

a) You may seek permission from the original copyright holder of Figure(s) [#] to publish the content specifically under the CC BY 4.0 license.

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

b) If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

The following resources for replacing copyrighted map figures may be helpful:

USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/

The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/

Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.html

NASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/

Landsat: http://landsat.visibleearth.nasa.gov/

USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/#

Natural Earth (public domain): http://www.naturalearthdata.com/

5. Your ethics statement should only appear in the Methods section of your manuscript. If your ethics statement is written in any section besides the Methods, please move it to the Methods section and delete it from any other section. Please ensure that your ethics statement is included in your manuscript, as the ethics statement entered into the online submission form will not be published alongside your manuscript.

Additional Editor Comments (if provided):

Major Revision

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors have carried out work entitled “Phylogenomic analysis and Mycobacterium tuberculosis antibiotic resistance prediction by whole-genome sequencing from clinical isolates of Caldas, Colombia.” The study has explored the variants present in MTB clinical samples from

Chinchiná, Caldas, a small town in Colombia with a high incidence of TB.

However, there is some gap in the experimental work to reach some significant outcome which authors need to address.

Reviewer #2: Comments:

Present study has been conducted by Sánchez-Corrales et al. and trying to represent the molecular characterization as well as genomic surveillance of MTB strains from Chinchiná, Caldas. WGS would be the best way to represent genomic surveillance. However, if we talking about WGS in routine process, it would be difficult for middle and low income countries. Following are some suggestions which can make the manuscript stronger and beneficial for the readers:

Introduction:

• Write Mycobacterium tuberculosis (M. tuberculosis). (Line no. 45)

• Required reference (agent before the arrival of SARS-CoV-2….) (Line no. 49)

• Modify the data from Global TB report 2020.

Method:

• Ethical statement should write in Method section.

Results:

• It is advised to take permission from source (Google) for Geographical territorial picture for any international publication. This may be a copyright issue for future (Figure 3).

• While analyzing the sequence data, it is advised to include or discuss about the resistance patterns for newer drugs [beqaquiline and delamanid] as well as repurposed drugs [clofazimine].

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: PONE-D-21-17481_Comments.doc

PLoS One. 2021 Oct 7;16(10):e0258402. doi: 10.1371/journal.pone.0258402.r002

Author response to Decision Letter 0


10 Sep 2021

Journal Requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming.

Thank you, we have corrected the style of our manuscript according to the PLOS ONE requirements.

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified (a) whether consent was informed and (b) what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed).

… If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

We have now specified the ethics statement in the Methods and online submission information. According to the type of study (a retrospective study of medical records) we did not require an informed consent. Data was totally anonymized and discussed and approved by two ethical committees as it is now stated at the ethical statement section.

… If you are reporting a retrospective study of medical records or archived samples, please ensure that you have discussed whether all data were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data from their medical records used in research, please include this information.

N.A.

… Once you have amended this/these statement(s) in the Methods section of the manuscript, please add the same text to the “Ethics Statement” field of the submission form (via “Edit Submission”).

Done

3. … We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement.

We amendment the Acknowledgment section and remove the funding-related text from the manuscript. The funding statement continues to be as presented: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

4. We note that Figure 3 in your submission contain satellite images which may be copyrighted. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), … We require you to either (a) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (b) remove the figures from your submission:

Thank you for your indication. We deleted Figure 3 after reviewing that no major contribution to the manuscript was obtained from that figure.

5. Your ethics statement should only appear in the Methods section of your manuscript. If your ethics statement is written in any section besides the Methods, please move it to the Methods section and delete it from any other section. Please ensure that your ethics statement is included in your manuscript, as the ethics statement entered into the online submission form will not be published alongside your manuscript.

Thank you for your observation. We now presented the ethics statement only in the Methods section of our manuscript.

Detailed responses to reviewers:

Reviewer # 1

Major comments:

1. For the prediction of drug resistance causing variants against bedaquiline and delamanid drugs which are recently FDA approved drugs for the treatment of TB, the authors can consider recently published articles related to these drugs. They can report whether variants related to these drugs are present in the analyzed clinical samples.

Thank you for this direction. Text has been revised to incorporate the reviewer comment. We modified Table 2 and S8 Table to include information on the nucleotide sequence of the TB isolates of our study related to variants associated to bedaquiline and delamanid resistance. Additionally, it was included this sentence in results section; “Another cornerstone of MDR-TB treatment, the repurposed drug BDQ, showed one synonymous substitution atpeE (24CT) in one sample. Genes related to new drugs DMN and protamanid showed a single substitution in fbiC (Ala132Val) in one isolate, and substitution Lys270Met in fgd1 was found in four samples (S8 Table)”; and “Finally, ten variants were predicted as deleterious in proteins associated with antimicrobial resistance genes. Of note, a Ala132Val mutation in fbiC gene related to DMN and a Thr202Ala mutation in thyA gene related to PAS deem deleterious PROVEAN scores with a predicted effect on the target protein which were no described in the ReSeqTB database”. Later we discussed in the next section about findings from line 293 to 305.

2. The authors have reported that their samples of HIV+TB (Co-morbidity) and TB patients. However, they have analyzed the samples in one group only. It will add significanse value to the study if they compare the variants of TB patients with patients having comorbdity condition.

It was added to the text, to include some about it in lines 231 to 234, discussed at lines 308 to 312

3. Authors are giving synonymous variants. They should explain why and how does it affect the drug resistance development on MTB?

Reviewer is right In silico we have not evidence about it. However, we consider it is important that data remains in order to see that there is no evidence trying to correlate the trait with the variant. We know that it would sound obvious but we tried to avoid commitments with any results in this case because changes due to this kind of mutations are more related to transcription regulations, accordingly to Pevsner in an eukaryotic model. For that reason, we include the sentence “the researchers did not have access to phenotypic or molecular tests that would have made it possible to compare data with our genomic findings” 320-321.

4. Authors have mentioned, “Likewise, for further analysis it would be considered to carry out protein modeling to determine if changes in the mutated amino acids could generate resistance.” However, the effect of variants on protein function can be analysed by SIFT (whether an amino acid substitution affects protein function), PROVEAN (an amino acid substitution or indel has an impact on the biological function of a protein) or PolyPhen. Authors can add this step to their bioinformatic analysis.

Thank you so much for your recommendation. We run Provean and give us some interesting results discussed in the document’s body.

Minor comments:

1. Abstract

5. “All samples were proven sensitive to antituberculosis drugs by phenotype and genomic analysis, although unreported mutations were identified.” Authors have mentioned phenotype, however they have not carried out Drug susceptibility testing, therefore this sentence should be modified, Line nos. 36-37; Page No. 2.

Thank you for the observation. Although we did not perform the drug susceptibility tests by ourselves, we did receive the drug susceptibility results from the Colombian National Health Institute where a mandatory replicate sample is sent to report the clinical case. We have now included that statement on the corresponding Materials and methods section.

2. Introduction

6. Should be read as, “1.7 million” for “1,7 million”, Line no. 47; Page No. 3.

7. Should be read as, “100,000 habitants at” for “100.000 habitants at” Line no. 52; Page No. 3.

8. “Also, outbreaks can be identified with higher resolution, allowing optimized control measures not only by revealing the genetic makeup and evolution of strains, but by showing epidemiological evidence of contagion from patients unwilling or unable to give information for the study of the case, as its routinely the case for high-risk social groups like homeless, immigrants, and itinerant agricultural workers.” Sentence is too long. Rewrite it. Line no. 66; Page No. 3: Line nos. 67-70; Page No. 4.

9. Should be read as, “visualization in” for “visualization by”, Line no. 96; Page No. 5.

10. 1. The authors have repeated this section from “The readings were reviewed [4], validated (FastQVAlidator v1.0.5 2017) and trimmed (Trimmomatic v0.36) and the processed reads were mapped using BWA v0.7.17 [19] with the M. tuberculosis reference genome H37Rv (NC_000962.3). Quality control of the alignment was performed using QuallyMap v2.2.1 [6] for coverage statistics and Phred score to assess depth and coverage at all loci of interest associated with resistance. Only genomes 75X deep and 95% reference mapped to all UVP v2.7.0 loci of covered interest were considered for further analysis. Variant calling was accomplished with the use of GATK v4.2.0.0 [20] with filter adjustments to eliminate artificial false positive variants. Finally, NGSEP v4.0 was used to score both SNV and INDEL. [8] When analyzing the annotated VCF file, it was possible to identify variants with low, medium, and high confidence (LR +10; p 0.05)” from De novo assembly and annotation section. The authors can write, “The variants obtained after variant calling were considered for the prediction of drug reistance” Line nos. 144-154; Page No. 7.

11. Asn927Asn is not non-synonymous mutation. Please check it, Line nos. 223; Page No. 11.

12. In Table 5. Variants related to drug resistance are listed. Out of 19, 42% variants are synonymous variants which should be excluded from the study as they will have no effect on the functional aspect of the gene.

13. Should be read as, “these samples were” for “this sample was”, Line no. 257; Page No. 15.

14. “Other authors evaluated the variation that the protein present in these mutations could have and found that the effect of the mutation on it was neutral.” Rewrite this sentence. Line nos. 296-298; Page No. 16.

15. “The Asp103Asp mutation is a silent mutation in the rpoB gene reported by other authors in mono-resistant isolates”, Do this mutation affect the physiology of MTB? Line nos. 299-300; Page No. 17.

16. Should be read as, “in such a way this activity” for “in such a way that this activity” Line no. 323; Page No. 18.

17. Should be read as, “that allow to identify local” for “that allow identifying local” Line no. 333; Page No. 18.

18. 1. Should be read as, “use of WGS for the characterization” for “use of the WGS the characterization” Line no. 337; Page No. 18.

19. Figure. 1: In the flow chart, in last box please check spelling of “analyses”.

20. “Table 2. Clinical characteristics of pulmonary tuberculosis patients from Chinchiná, Colombia.” is not present in the manuscript, Line no. 191; Page No. 9.

We thank the reviewer for highlighting all these points. We have amended the manuscript considering the observations. Since the restructuring of the manuscript was substantial, we have incorporated the reviewer suggestions as part of the major changes of the manuscript.

Attachment

Submitted filename: Response to reviewers PONE-D-21-17481 v.f.pdf

Decision Letter 1

Seyed Ehtesham Hasnain

27 Sep 2021

Phylogenomic analysis and Mycobacterium tuberculosis antibiotic resistance prediction by whole-genome sequencing from clinical isolates of Caldas, Colombia.

PONE-D-21-17481R1

Dear Dr. Siller-Lopez,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Seyed Ehtesham Hasnain

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

I have gone through the revised manuscript and also the Author's response to the comments of the reviewers. The important issue of synonyms variants and drug resistance has been addressed. They have also added results of PROVEAN analysis. In my view, Authors have comprehensively revised the manuscript addressing all the comments of the reviewers. All typo and grammatical errors have been taken care off. All other explanations provided by the Authors to the queries of the reviewers are quite satisfactory.

I recommend this manuscript for publication.

Reviewers' comments:

Acceptance letter

Seyed Ehtesham Hasnain

29 Sep 2021

PONE-D-21-17481R1

Phylogenomic analysis and Mycobacterium tuberculosis antibiotic resistance prediction by whole-genome sequencing from clinical isolates of Caldas, Colombia

Dear Dr. Siller-López:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Prof. Seyed Ehtesham Hasnain

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File

    (XLSX)

    Attachment

    Submitted filename: PONE-D-21-17481_Comments.doc

    Attachment

    Submitted filename: Response to reviewers PONE-D-21-17481 v.f.pdf

    Data Availability Statement

    The data sets analyzed in this study, with relevant epidemiological metadata, are available at the NCBI database as the Biosamples (SAMN20847272, SAMN20847273, SAMN20847274, SAMN20847275, SAMN20847276, SAMN20847277), covered on the Bioproject PRJNA755956.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES