Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2018 Apr;20(4):228–235. doi: 10.1016/j.micinf.2017.12.011

Discovery of a unique Mycobacterium tuberculosis protein through proteomic analysis of urine from patients with active tuberculosis

Nira Pollock a,4, Rakesh Dhiman b,4,1, Nada Daifalla b,2, Maha Farhat c, Antonio Campos-Neto b,∗,3
PMCID: PMC5946903  PMID: 29306028

Abstract

Identification of pathogen-specific biomarkers present in patients' serum or urine samples can be a useful diagnostic approach. In efforts to discover Mycobacterium tuberculosis (Mtb) biomarkers we identified by mass spectroscopy a unique 21-mer Mtb peptide sequence (VVLGLTVPGGVELLPGVALPR) present in the urines of TB patients from Zimbabwe. This peptide has 100% sequence homology with the protein TBCG_03312 from the C strain of Mtb (a clinical isolate identified in New York, NY, USA) and 95% sequence homology with Mtb oxidoreductase (MRGA423_21210) from the clinical isolate MTB423 (identified in Kerala, India). Alignment of the genes coding for these proteins show an insertion point mutation relative to Rv3368c of the reference H37Rv strain, which generated a unique C-terminus with no sequence homology with any other described protein. Phylogenetic analysis utilizing public sequence data shows that the insertion mutation is apparently a rare event. However, sera from TB patients from distinct geographical areas of the world (Peru, Vietnam, and South Africa) contain antibodies that recognize a purified recombinant C-terminus of the protein, thus suggesting a wider distribution of isolates that produce this protein.

Keywords: Mycobacterium tuberculosis, C strain, Antigenuria

1. Introduction

Urinary excretion of antigens from pathogens that cause systemic diseases, also known as “antigenuria,” has been known for decades to occur [1], [2], [3], [4], [5]. Both polysaccharides and proteins from pathogens have been detected in the urine of diseased patients [6], [7], [8], [9], [10], and in most cases, the antigenuria is not reported to be associated with renal dysfunction. Though the pathophysiology of pathogen-specific antigenuria is not completely understood, antigenuria has been used as an important premise for the development of antigen detection assays for the diagnosis of active infectious diseases. Examples of such tests are those used for the diagnosis of systemic infectious diseases caused by Legionella pneumophilla [11], [12], [13], Streptococcus pneumoniae [14], [15], [16], M. tuberculosis [17], [18], [19], [20], Leishmania donovani complex [21], [22], [23], [24], [25], [26], Histoplasma capsulatum [27], [28], [29], and several others.

Previously, we have discovered and validated protein biomarkers of M. tuberculosis (Mtb) and L. donovani in urine of patients with active tuberculosis (TB) and visceral leishmaniasis (VL), respectively [10], [25], [26], [30], [31], [20]. In these patients, the presence of these proteins in urine correlated well with clinical manifestations of active disease.

Here, we describe the discovery and characterization of a particularly unique Mtb protein we have found in urine of patients with active TB. To date, the nucleotide sequence for this protein has been reported in only two clinical Mtb isolates. The first isolate was a widely-disseminated drug-susceptible Mtb strain that caused disease in New York City between 1991 and 1994 [32], [33]. This Mtb isolate was named the “C strain” (accession CH482375.1). More recently, the gene was also found in a clinical isolate recovered in culture from TB patients from Kerala, India [34]. This Mtb isolate was named “MTB423” (accession CP003234). The relationships between these two isolates, as well as their worldwide distribution, have not previously been investigated.

Here we report the characterization of this novel protein and explore immunoreactivity to the molecule in a panel of sera from patients from multiple countries who do and do not have active TB. The C-terminal half of the amino acid sequence of this protein has no known sequence match with any other microorganism. We propose that this portion of the molecule may be a useful tool for assessment of the worldwide clinical prevalence of these strains of Mtb—and, should prevalence warrant, for the development of novel TB diagnostic tests.

2. Materials and methods

2.1. Human samples

Two urine samples and a total of 60 serum samples from patients with pulmonary TB were evaluated in this study [all kindly provided by Foundation for Innovative New Diagnostics (FIND, Geneva, Switzerland)]. These samples were collected from patients diagnosed with TB based on a clinical course consistent with the disease and confirmatory laboratory findings (growth of Mtb from sputum culture). The patients providing urine were from Zimbabwe, and patients providing serum were from Peru (n = 20), Vietnam (n = 20), and South Africa (n = 20); in each of the three sets of sera, 10 of the patients had AFB smear-negative sputum, and 10 had smear-positive sputum. In addition, normal human serum (NHS) samples were obtained either from two commercial sources (ThermoFisher Scientific, Grand Island, NY and Sigma–Aldrich, St. Louis, MO) or under verbal informed consent via a sample collection protocol approved by the Forsyth Institute (n = 3). Three commercial NHS were pooled samples derived from whole blood obtained from more than 100 healthy donors per pool (ages 18–65) in the United States and processed within 24 h.

2.2. Mass spectroscopy analysis

Individual urine samples (15 ml) from patients with TB were concentrated using Centricon P3 (3 kDa cutoff filters) to ∼200–300 μl. Urine samples were then submitted to SDS-PAGE followed by Coomassie staining. Bands were excised from the gel and submitted for mass spectroscopy analysis at the Taplin Mass Spectrometry Facility, Harvard Medical School, Boston, MA. Excised gel bands were cut into approximately 1–2 mm wide pieces. Gel pieces were then subjected to a modified in-gel trypsin digestion procedure [35]. Gel pieces were washed and dehydrated with acetonitrile for 10 min followed by removal of acetonitrile. Pieces were then completely dried in a speed-vac. Rehydration of the gel pieces was with 50 mM ammonium bicarbonate solution containing 12.5 ng/μl modified sequencing-grade trypsin (Promega, Madison, WI) at 4 °C. After 45 min, the excess trypsin solution was removed and replaced with 50 mM ammonium bicarbonate solution to just cover the gel pieces. Samples were then placed in a 37 °C incubator overnight. Peptides were later extracted by removing the ammonium bicarbonate solution, followed by addition of a solution containing 50% acetonitrile and 1% formic acid. The extracts were then dried in a speed-vac (∼1 h). The samples were then stored at 4 °C until analysis. Samples were reconstituted in 5–10 μl of HPLC solvent A (2.5% acetonitrile, 0.1% formic acid). A nano-scale reverse-phase HPLC capillary column was created by packing 5 μm C18 spherical silica beads into a fused silica capillary (125 μm inner diameter x ∼20 cm length) with a flame-drawn tip [36]. After equilibrating the column each sample was loaded via a Famos auto sampler (LC Packings, San Francisco CA) onto the column. A gradient was formed and peptides were eluted with increasing concentrations of solvent B (97.5% acetonitrile, 0.1% formic acid). Eluted peptides were subjected to electrospray ionization and then entered into an LTQ Velos ion-trap mass spectrometer (ThermoFisher, San Jose, CA). Peptides were then fragmented to produce a tandem mass spectrum of specific fragment ions for each peptide. Peptide sequences (and hence protein identity) were determined by matching protein databases with the acquired fragmentation pattern by the software program, Sequest (ThermoFisher, San Jose, CA) [37].

2.3. Cloning of TBCG_03312 C-terminus gene, protein expression and purification of recombinant protein

The DNA sequence coding for the C-terminal half (aa 98–206) of the discovered Mtb protein (TBCG_03312 from the Mtb C strain), was optimized for expression in Escherichia coli. The gene was synthesized by Blue Heron (Bothell, WA). To allow sub-cloning, restriction enzyme sequences Nde I and Bam HI were included at 5′ and 3′ endings, respectively, of the optimized DNA fragment. The synthetic gene was digested with the restriction enzymes and sub-cloned into a pET-14b expression vector, which was similarly digested for directional cloning. Protein in the pET-14b expression vector generates a six-residue histidine tag at the N-terminus of the molecule, which facilitates purification by affinity on QIAexpress® Ni-NTA agarose matrix (Qiagen, Valencia, CA) as described [38].

2.4. Identifying the prevalence of the TBCG_03312 protein in public Mtb sequence data

We downloaded shotgun sequence files from 5310 Mtb isolates from the NCBI sequence read archive described by Manson et al. [39]. Each pair of fastq sequence files was processed in the following bioinformatics pipeline to generate a list of variant calls: (a) the fastq format was confirmed using fastQValidator v 0.1.1 (https://genome.sph.umich.edu/wiki/FastQValidator); (b) Prinseq v 0.20.4 was used to trim reads at a quality threshold less than 20 [40]; (c) kraken v 0.10.5 was used to confirm that >90% of reads match Mtb complex taxonomic classification [41]; (d) trimmed reads were aligned to the H37Rv reference genome with bwa mem v 0.7.11 [42]; (e) samtools v 1.5 was used to calculate coverage and sequences with <95% H37Rv coverage at 10x or more were discarded [43]; (f) duplicate reads were removed using Picard v 2.0.1 (https://github.com/broadinstitute/picard); and (g) variant calling was performed after duplicate removal using Pilon [44]. We then identified all variants in this 5310 isolate set that occurred between the H37Rv coordinates 3,780,335 and 3,780,979 corresponding to the possible oxidoreductase gene Rv3368c. In the seven genomes in which we identified an insertion, we downloaded the assembly draft protein.gff file from NCBI and preformed protein-blast using the TBCG_03312 peptide. To query public finished genomes for the TBCG_03312 protein and its fragments, we used the NCBI protein and nucleotide BLAST functions.

2.5. Phylogenetic classification of public Mtb genomes

We used MUMmer [45] v 3.0 to compare the H37Rv, C strain and MTB423 finished genomes that were downloaded in fasta format from the NCBI genome database. We identified any variants overlapping with a reference set of variants from 78 phylogenetically typed genomes described by Sekizuka et al. [46] and used the concatenated set of variants as a multiple sequence alignment to build a neighbor joining phylogeny. The query strain lineage was identified as the same as the lineage of the closest reference genome on the phylogeny. For the 5310 shotgun sequences, we applied the Coll et al. [47] 67 SNP barcode to classify lineage using variants with a Pilon filter designation of PASS.

2.6. ELISA

ELISA was performed using standard protocols. Briefly, maxisorp surface ELISA plates (Nalge Nunc International) were coated with 50 μl of 200-ng antigen in carbonate-bicarbonate buffer/well and incubated at 4 °C overnight. Wells were aspirated and then blocked with PBS–1% bovine serum albumin at 250 μl/well at room temperature for 2 h. The blocking reagent was aspirated, and the plates were washed 5 times with PBS 0.1% Tween 20 plus 0.01% benzalkonium chloride. One hundred μl of human serum diluted in PBS at 1/100 was added per well. Samples were incubated at room temperature for 60 min. The plates were washed and 50 μl of protein A-horseradish peroxidase conjugate at a 1:20,000 dilution in PBS was added per well and incubated for 30 min at room temperature. The conjugate was aspirated, and the plates were washed. One hundred microliters of tetramethyl benzidine substrate (Kirkegaard & Perry Laboratories)/well was added and incubated for 15 min at room temperature, and the reaction was stopped with 100 μl of 1 N H2SO4/well. The plates were then read at 450 nm using an ELISA reader (ELX 808; Bio-TEK Instruments, Inc.). The cutoff for the assays was the mean of the 6 normal human serum samples plus three standard deviations of the mean.

3. Results

3.1. Identification of a unique Mtb protein in the urine of patients with active pulmonary TB

Urine was collected from two patients from Zimbabwe with active, culture-confirmed pulmonary TB. Neither patient had any clinical signs, symptoms, or laboratory findings compatible with renal or urinary tract abnormalities. These criteria were important to rule out renal TB in these patients and therefore to support the proposed lung (but not kidney) origin of the Mtb antigens present in the patients' urine. Neither patient was on anti-tuberculosis therapy at the time of urine collection. Individual urine samples were analyzed by mass spectrometry (Methods). This analysis generated more than 500 peptide sequences. As expected, most sequences had identical sequence homologies with human proteins. However, a protein band (MW 26–37 kDa) eluted from each of the two SDS-PAGE gels (one gel for each patient's urine sample) contained one non-human 21-mer peptide sequence (VVLGLTVPGGVELLPGVALPR) with XCorr value >4.0 (Table 1). This peptide had 100% sequence homology with the deduced sequence of the protein TBCG_03312 from the C strain of Mtb and 95% sequence homology with Mtb oxireductase from the clinical isolate MTB423. Fig. 1A shows the full-length amino acid sequence of these two proteins and highlights that the discovered peptide lies in the C-terminal half of the molecules. As specificity controls, we confirmed that MS spectra for this peptide did not match any predicted E. coli tryptic peptides (E. coli being a common urinary commensal), and analysis of MS data from similarly processed urine specimens from patients with VL (and without TB) did not yield this peptide (not shown).

Table 1.

Summary of the mass spectroscopy data for an Mtb peptide derived from a parent protein (TBCG_03312) found in urine of two TB patients from Zimbabwe.

Patient numerical identification # of peptides from parent protein found in urine Peptide sequence found in urine Peptide Rank Charge Ions XCorr ΔCn Peptide position in parent protein
1 1 VVLGLTVPGGVELLPGVALPR 3 29/80 4.023 0.478 132–152
2 1 VVLGLTVPGGVELLPGVALPR 3 30/80 4.338 0.430 132–152

Fig. 1.

Fig. 1

Amino acid sequence of TBCG_03312 and Mtb oxireductase proteins and position of the peptide VVLGLTVPGGVELLPGVALPR (red and underlined) discovered in urine of two Zimbabwean patients with TB (A). In (B) are the top 10 NCBI BLAST hits obtained for the peptide. Note that only the two first hits have significant E values.

3.2. The TBCG_03312 C-terminus peptide is unique

The detailed BLAST analysis of the peptide revealed that its sequence matches only those of the TBCG_03312 from the Mtb C strain and oxidoreductase from the clinical isolate MTB423 with E-values that were extremely high and significant, i.e., 3e-11 and 7e-10 respectively (Fig. 1B). In contrast, the next possible match of the peptide sequence was with a hypothetical protein AQI70_27480 (Streptomyces curacoi); the E-value for this match was 0.06, and thus of no or very low significance. Other less significant ranking matches are also depicted in Fig. 1B.

The BLAST analysis also revealed that the amino acid sequence of the N-terminal half (aa 1–97) of the donor proteins TBCG_03312 and oxidoreductase (from MTB423) is ubiquitously distributed among the genus Mycobacterium (Fig. 2A) and has 100% match with Mtb nitroreductase (Rv3368c protein). In contrast the C-terminus (aa 98–206) of the TBCG_03312 molecule, which contains the peptide VVLGLTVPGGVELLPGVALPR, is unique to the donor proteins TBCG_03312 and to oxidoreductase (from MTB423). Fig. 2B illustrates that the amino acid sequence of TBCG_03312 protein is very closely related to that of Mtb oxidoreductase, with an extremely high E value (3e-45) [the E value for TBCG_03312 itself is higher (2e-57) as expected because this sequence was used for the BLAST analysis]. Even with the BLAST analysis set for Max Target Sequences of 50, only one additional sequence was detected (isoleucine--tRNA ligase of Brevibacterium sp); however, the E value (5.3) for this alignment is not significant.

Fig. 2.

Fig. 2

NCBI BLAST analysis of full length and C-terminus of TBCG_03312 protein an Mtb oxireductase sequences. In (A) red lines are TBCG_03312 (from Mtb C strain) and Mtb oxidoreductase (from MTB423 strain). Purple lines represent Mtb nitroreductase from 48 different strains of Mtb as well as other Mycobacterium species. In (B) BLAST analysis was set for Max Target Sequences of 50 and confirm that only one significant matching hit was found (oxireductase from RGTB423). Note that the E value for isoleucine-tRNA ligase from Brevibacterium sp is not significant. (C) represents a schematic illustration of the genomic distance separating the strains H37Rv, C, and MTB423. Distance in number of single nucleotide substitutions as obtained from a MUMmer comparison of the genomes (see Methods).

Phylogenetic analysis (Fig. 2C) confirmed that the Mtb C strain belongs to lineage 4, specifically to sublineage 4.1.1, and is separated by 2820 single nucleotide substitutions from the Lineage 4 H37Rv reference genome. Although the C strain and MTB 423 proteome are both predicted to contain the VVLGLTVPGGVELLPGVALPR peptide they were distantly related on a genomic scale, with MTB423 belonging to lineage 1.2.2, and separated from the C-strain by 4897 single nucleotide substitutions. Finally, the gene alignment for TBCG_03312 (C strain) and MRGA423_21210 (MTB423) with Rv3368c (H37Rv) genes shows an insertion point mutation in both genes relative to Rv3368c. Thymine at position 289 in TBCG_03312, and adenine at position 371 in MRGA423_21210 resulted in a frameshift encoding the unique TBCG_03312 C-terminus sequence (Fig. 3).

Fig. 3.

Fig. 3

Genomic alignment between TBCG_03312 and Rv3386c loci. The site of the insertion point mutation in TBCG_03312 that resulted in a frame shift is highlighted in red.

In examining 5310 public Mtb shotgun sequences we found an overall low level of variation in the Rv3368c H37Rv region. We found 13 different single nucleotide substitutions in 142 isolates out of the 5310 examined. The most common was a cytosine to thymine substitution at H37Rv coordinate 3780542; it occurred in 97 isolates, all of which belonged to lineage 4.4.1. There was no convergent evolution noted of these substitutions, with each restricted to a specific Mtb sublineage (Table 2). We also found 7 insertions putative large sequence polymorphisms all occurring at or before Rv3368c coordinate 289 (genomic coordinate≥3780623). Three of these events were an IS6110 insertion at coordinate 3780917 and occurred in 3 isolates from lineage 4.4.1.1 that were isolated from South African patients. Examination of the NCBI assembly of these genomes including the draft annotation showed no high confidence match for the TBCG_03312 C-terminus.

Table 2.

Genetic variation in the Rv3368c gene found in 5310 public shot gun sequence [39].

H37Rv Coordinate Change Lineage Number of isolates from lineage with variant Total number of isolates from lineage studied
3780738 G > A 4.8 1 100
3780715 T > C 5 1 2
3780885 A > G 7 1 1
3780652 G > A 1.1.1 1 1
3780734 G > A 1.1.3 6 29
3780782 G > A 1.2.2 23 62
3780506 C > A 2.2.1 4 907
3780962 G > A 2.2.2 2 104
3780836 G > C 4.1 1 51
3780776 G > A 4.1.2 1 41
3780727 G > A 4.2.1 3 143
3780594 C > T 4.3.3 1 326
3780542 C > T 4.4.1 97 105
3780917 IS6110 insertion 4.4.1.1 3 100
3780647 Putative duplication 2.2.1 1 907
3780739 Putative duplication 2.2.1 1 907
3780966 Putative duplication 2.2.1 1 907
3780971 Putative duplication 2.2.1 1 907

Of the 5310 examined isolates, only 83 were classified as belonging to lineage 4.1.1, and 62 to lineage 1.2.2, i.e. predicted to be phylogenetically close to C-strain and MTB423 respectively. There were no nucleotide variants found in the 83 lineage 4.1.1 genomes, and only a single nucleotide substitution 3780782 G>A was found in 23/62 lineage 1.2.2 isolates.

3.3. Gene cloning and protein expression/purification of TBCG_03312 C-terminus

A codon-optimized synthetic gene coding for the C-terminal sequence of TBCG_03312 (aa 98–206) was obtained and sub-cloned into pET-14b expression vector. The gene was induced in E. coli host cells and recombinant protein was purified using a Ni-NTA agarose resin. Purity was assessed by SDS-PAGE with Coomassie blue staining. As illustrated in a single band of the expected MW (15 kDa) was obtained, indicating a high degree of purity of the recombinant TBCG_03312 C-terminus (Fig. 4).

Fig. 4.

Fig. 4

Recognition of recombinant TBCG_03312 C-terminus by sera from TB patients from Peru, Vietnam, and South Africa. Sera were tested by conventional ELISA using the purified C-terminus of TBCG_03312 [inset; Lane 1, IPTG-induced E. coli culture; Lane 2, flow through of IPTG-induced E. coli culture; Lane 3, wash; Lane 4, purified TBCG_03312 C-terminus protein (arrow)]. Dotted red line in the graph represents the mean of the OD obtained for six normal human serum (NHS) samples (Methods) plus 3 SD of the mean. S-pos, smear positive; C-pos, culture positive.

3.4. Recognition of TBCG_03312 C-terminus by sera of TB patients and controls

To evaluate the possible recognition of TBCG_03312 C-terminus by antibodies from patients with TB, sera were obtained from patients with active TB living in three different geographical areas where the disease is endemic. Control sera were either commercial healthy human serum pools each derived from whole blood from more than 100 US healthy donors (n = 3) or sera obtained from individual healthy donors from Brazil (n = 3). TB patients were from Peru (n = 20); Vietnam (n = 20); and South Africa (n = 20). Recognition of TBCG_03312 C-terminus was evaluated by conventional ELISA. The results (Fig. 4) show that none of the healthy control sera reacted with TBCG_03312 C-terminus at the tested serum dilution (1/100). In contrast, the TBCG_03312 C-terminus was recognized by approximately 35%, 60%, and 75% of the sera from patients with TB from Peru, Vietnam, and South Africa, respectively (no obvious difference in serum reactivity was observed between the smear-positive and smear-negative TB patients). These results suggest both that the TBCG_03312 C-terminus is immunogenic during infection and that Mtb strains carrying this unique peptide (like strain C and/or MTB423) might have a more worldwide distribution than previously thought.

4. Discussion

In search of specific molecular markers for TB diagnostics development we serendipitously found an Mtb polypeptide sequence that seems to be uniquely associated with defined clinical isolate of this pathogen. The peptide sequence was found in the urines of two TB patients from Zimbabwe. The BLAST analysis of the peptide sequence matched the protein sequence (TBCG_03312) of a distinct clinical isolate of Mtb that, based on limited data in the literature, would seem to be rare. This strain, a drug susceptible Mtb called “C strain,” was described as the etiological agent of a large proportion of new TB cases occurring in New York City between 1991 and 1994 [32], [33]. Our BLAST analysis also identified a closely-related peptide (95% homology) within a more recently-reported clinical isolate of Mtb (MTB423) from Kerala, South India [34]. The discovered peptide initially seemed to be unique to these two strains, showing no sequence homology with any other Mtb strain or with any other member of the Mycobacterium genus by BLAST analysis. However, our observations from serologic testing suggest that Mtb strains harboring the gene coding for TBCG_03312 may be widely distributed; in addition to discovery of the peptide in the urine of TB patients from Zimbabwe, sera from TB patients from countries representing 3 different regions of the world (Peru, Vietnam, and South Africa) contain antibodies that recognize the C-terminus of this unique molecule. We recognize that one limitation of our study is that we were unable to test sera from currently healthy patients (with and without latent TB) from Zimbabwe/Peru/Vietnam/South Africa; it would be helpful to know whether individuals from these areas with a history of either active or latent TB would also have reactive sera, indicating that they too had been exposed to isolates producing this peptide. While we do not know the country of origin, TB history, or BCG history of the donors contributing to the pooled commercial sera we used as negative controls, we did observe that sera from 3 healthy individuals from Brazil (all BCG-vaccinated) did not react with the TBCG_03312 molecule. Given the apparently wide geographic distribution and high frequency of TB patients with serologic reactivity to the C-terminus of TBCG_03312, suggesting that Mtb isolates producing this unique molecule are actually widely distributed, this molecule might also be an attractive candidate for development of a urine antigen detection assay [20].

At this point two epidemiologically and phylogenetically distinct clinical isolates of Mtb appear to have the genetic code for the unique TBCG_03312 C-terminus (Mtb C strain and MTB423); our serological findings cannot pinpoint the actual Mtb strain that caused TB in the patients evaluated in our study. However, given that the unique sequence of the TBCG_03312 C-terminus was generated by an insertion mutation in the Rv3386c gene, and the rarity of these insertion mutations in other examined public sequence data, it is possible that the strains that infected the patients who provided the sera we evaluated are related to either C strain or MTB423.

Although our search through public sequence data revealed a low level of variation in the Rv3368c gene and notably a lack of single base insertions in lineages phylogenetically close to the C-strain and to MTB423, the results are limited by the lack of finished or complete sequence data available for Mtb lineages 1 and 4.1.1 in the public domain. It is worth noting that whole genome sequencing efforts in TB have to-date not been systematic or designed to be accurately representative of the burden of different TB lineages, and biased towards cases from developed countries and with drug resistance [48].

Despite using a bioinformatics pipeline that incorporates read assembly and increases the sensitivity for predicting insertions, deletions and sequence polymorphisms, it is possible that our pipeline was conservative and that we missed relevant variation that could explain the ELISA results. Further, in our analysis of the few genomes we found with insertions we relied on a draft automated protein annotation that may not be reliable. Therefore, further studies of finished genomes from lineage 4.1.1 and lineage 1 are still needed to definitively interpret these information in conjunction with serological data.

Moreover, proteome analysis of Mtb C and/or MTB423 strains, will be required to assess and confirm the potential utility of diagnostics development based on the TBCG_03312 C-terminus peptide. These evaluations will include: first, the molecular detection (e.g., by mass spectroscopy or RT-PRC) of the unique C-terminus polypeptide in cultures of Mtb C strain, MTB423 and/or other isolates; second, production of specific antibodies to the polypeptide and assemble of a sensitive capture ELISA followed by a clinical investigation to determine if such an antigen detection assay could help the diagnosis of TB.

Conflict of interest

None of the authors has any financial conflict of interest.

Acknowledgments

The authors wish to thank Ross Tomaino, Harvard Medical School, Taplin Mass Spectrometry Facility for his superb help with the performance of Mass spectroscopy analysis and results interpretation. We would also like to thank the Foundation for Innovative Diagnostics (FIND) for kindly providing the sera from tuberculosis patients and Dr. Tobias Broger (FIND) for constructive comments on the analysis of the serological results. Financial support: This work was supported in part by the National Institutes of Health (R01 AI076425 grant) and the Gates Foundation (Grand Challenges in Global Health Initiative grant) to AC-N.

References

  • 1.Freilij H.L., Corral R.S., Katzin A.M., Grinstein S. Antigenuria in infants with acute and congenital Chagas' disease. J Clin Microbiol. 1987;25:133–137. doi: 10.1128/jcm.25.1.133-137.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Spinola S.M., Sheaffer C.I., Gilligan P.H. Antigenuria after Haemophilus influenzae type b polysaccharide vaccination. J Pediatr. 1986;108:247–249. doi: 10.1016/s0022-3476(86)80992-1. [DOI] [PubMed] [Google Scholar]
  • 3.Tang P.W., de S.D., Toma S. Detection of legionella antigenuria by reverse passive agglutination. J Clin Microbiol. 1982;15:998–1000. doi: 10.1128/jcm.15.6.998-1000.1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Luby J.P., Murphy F.K., Gilliam J.N., Kang C.Y., Frank R. Antigenuria in St. Louis encephalitis. Am J Trop Med Hyg. 1980;29:265–268. doi: 10.4269/ajtmh.1980.29.265. [DOI] [PubMed] [Google Scholar]
  • 5.Kaldor J., Asznowicz R., Dwyer B. Haemophilus influenzae type b antigenuria in children. J Clin Pathol. 1979;32:538–541. doi: 10.1136/jcp.32.6.538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Manabe Y.C., Nonyane B.A., Nakiyingi L., Mbabazi O., Lubega G., Shah M. Point-of-care lateral flow assays for tuberculosis and cryptococcal antigenuria predict death in HIV infected adults in Uganda. PLoS One. 2014;9 doi: 10.1371/journal.pone.0101459. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Dufresne S.F., Datta K., Li X., Dadachova E., Staab J.F., Patterson T.F. Detection of urinary excreted fungal galactomannan-like antigens for diagnosis of invasive aspergillosis. PLoS One. 2012;7 doi: 10.1371/journal.pone.0042736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sarkari B., Chance M., Hommel M. Antigenuria in visceral leishmaniasis: detection and partial characterisation of a carbohydrate antigen. Acta Trop. 2002;82:339–348. doi: 10.1016/s0001-706x(02)00043-8. [DOI] [PubMed] [Google Scholar]
  • 9.Militao D.N., Camargo L.M., Katzin A.M. Detection of antigens in the urine of patients with acute Plasmodium vivax malaria. Exp Parasitol. 1993;76:115–120. doi: 10.1006/expr.1993.1013. [DOI] [PubMed] [Google Scholar]
  • 10.Abeijon C., Kashino S.S., Silva F.O., Costa D.L., Fujiwara R.T., Costa C.H. Identification and diagnostic utility of Leishmania infantum proteins found in urine samples from patients with visceral leishmaniasis. Clin Vaccine Immunol. 2012;19:935–943. doi: 10.1128/CVI.00125-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Couturier M.R., Graf E.H., Griffin A.T. Urine antigen tests for the diagnosis of respiratory infections: legionellosis, histoplasmosis, pneumococcal pneumonia. Clin Lab Med. 2014;34:219–236. doi: 10.1016/j.cll.2014.02.002. [DOI] [PubMed] [Google Scholar]
  • 12.van D.D. Diagnostic challenges and opportunities in older adults with infectious diseases. Clin Infect Dis. 2012;54:973–978. doi: 10.1093/cid/cir927. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Den Boer J.W., Yzerman E.P. Diagnosis of Legionella infection in Legionnaires' disease. Eur J Clin Microbiol Infect Dis. 2004;23:871–878. doi: 10.1007/s10096-004-1248-8. [DOI] [PubMed] [Google Scholar]
  • 14.Sinclair A., Xie X., Teltscher M., Dendukuri N. Systematic review and meta-analysis of a urine-based pneumococcal antigen test for diagnosis of community-acquired pneumonia caused by Streptococcus pneumoniae. J Clin Microbiol. 2013;51:2303–2310. doi: 10.1128/JCM.00137-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Anjay M.A., Anoop P. Diagnostic utility of rapid immunochromatographic urine antigen testing in suspected pneumococcal infections. Arch Dis Child. 2008;93:628–631. doi: 10.1136/adc.2008.138958. [DOI] [PubMed] [Google Scholar]
  • 16.Klugman K.P., Madhi S.A., Albrich W.C. Novel approaches to the identification of Streptococcus pneumoniae as the cause of community-acquired pneumonia. Clin Infect Dis. 2008;47(Suppl. 3):S202–S206. doi: 10.1086/591405. [DOI] [PubMed] [Google Scholar]
  • 17.Gupta-Wright A., Peters J.A., Flach C., Lawn S.D. Detection of lipoarabinomannan (LAM) in urine is an independent predictor of mortality risk in patients receiving treatment for HIV-associated tuberculosis in sub-Saharan Africa: a systematic review and meta-analysis. BMC Med. 2016;14:53. doi: 10.1186/s12916-016-0603-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Lawn S.D., Gupta-Wright A. Detection of lipoarabinomannan (LAM) in urine is indicative of disseminated TB with renal involvement in patients living with HIV and advanced immunodeficiency: evidence and implications. Trans R Soc Trop Med Hyg. 2016;110:180–185. doi: 10.1093/trstmh/trw008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lawn S.D. Point-of-care detection of lipoarabinomannan (LAM) in urine for diagnosis of HIV-associated tuberculosis: a state of the art review. BMC Infect Dis. 2012;12:103. doi: 10.1186/1471-2334-12-103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Pollock N.R., Macovei L., Kanunfre K., Dhiman R., Restrepo B.I., Zarate I. Validation of Mycobacterium tuberculosis Rv1681 protein as a diagnostic marker of active pulmonary tuberculosis. J Clin Microbiol. 2013;51:1367–1373. doi: 10.1128/JCM.03192-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ben-Abid M., Galai Y., Habboul Z., Ben-Abdelaziz R., Ben-Sghaier I., Aoun K. Diagnosis of Mediterranean visceral leishmaniasis by detection of leishmania-related antigen in urine and oral fluid samples. Acta Trop. 2017;167:71–72. doi: 10.1016/j.actatropica.2016.12.026. [DOI] [PubMed] [Google Scholar]
  • 22.Ejazi S.A., Bhattacharya P., Bakhteyar M.A., Mumtaz A.A., Pandey K., Das V.N. Noninvasive diagnosis of visceral leishmaniasis: development and evaluation of two urine-based immunoassays for detection of Leishmania donovani Infection in India. PLoS Negl Trop Dis. 2016;10 doi: 10.1371/journal.pntd.0005035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Vallur A.C., Tutterrow Y.L., Mohamath R., Pattabhi S., Hailu A., Abdoun A.O. Development and comparative evaluation of two antigen detection tests for visceral leishmaniasis. BMC Infect Dis. 2015;15:384. doi: 10.1186/s12879-015-1125-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hatam G.R., Ghatee M.A., Hossini S.M., Sarkari B. Improvement of the newly developed latex agglutination test (Katex) for diagnosis of visceral lieshmaniasis. J Clin Lab Anal. 2009;23:202–205. doi: 10.1002/jcla.20312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Abeijon C., Campos-Neto A. Potential non-invasive urine-based antigen (protein) detection assay to diagnose active visceral leishmaniasis. PLoS Negl Trop Dis. 2013;7 doi: 10.1371/journal.pntd.0002161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Abeijon C., Singh O.P., Chakravarty J., Sundar S., Campos-Neto A. Novel antigen detection assay to monitor therapeutic efficacy of visceral leishmaniasis. Am J Trop Med Hyg. 2016;184:3170–3175. doi: 10.4269/ajtmh.16-0291. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Fandino-Devia E., Rodriguez-Echeverri C., Cardona-Arias J., Gonzalez A. Antigen detection in the diagnosis of histoplasmosis: a meta-analysis of diagnostic performance. Mycopathologia. 2016;181:197–205. doi: 10.1007/s11046-015-9965-3. [DOI] [PubMed] [Google Scholar]
  • 28.Theel E.S., Harring J.A., Dababneh A.S., Rollins L.O., Bestrom J.E., Jespersen D.J. Reevaluation of commercial reagents for detection of Histoplasma capsulatum antigen in urine. J Clin Microbiol. 2015;53:1198–1203. doi: 10.1128/JCM.03175-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Caceres D.H., Scheel C.M., Tobon A.M., Ahlquist C.A., Restrepo A., Brandt M.E. Validation of an enzyme-linked immunosorbent assay that detects Histoplasma capsulatum antigenuria in Colombian patients with AIDS for diagnosis and follow-up during therapy. Clin Vaccine Immunol. 2014;21:1364–1368. doi: 10.1128/CVI.00101-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kashino S.S., Pollock N., Napolitano D.R., Rodrigues V., Jr., Campos-Neto A. Identification and characterization of Mycobacterium tuberculosis antigens in urine of patients with active pulmonary tuberculosis: an innovative and alternative approach of antigen discovery of useful microbial molecules. Clin Exp Immunol. 2008;153:56–62. doi: 10.1111/j.1365-2249.2008.03672.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Napolitano D.R., Pollock N., Kashino S.S., Rodrigues V., Jr., Campos-Neto A. Identification of Mycobacterium tuberculosis ornithine carboamyltransferase in urine as a possible molecular marker of active pulmonary tuberculosis. Clin Vaccine Immunol. 2008;15:638–643. doi: 10.1128/CVI.00010-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Friedman C.R., Stoeckle M.Y., Kreiswirth B.N., Johnson W.D., Jr., Manoach S.M., Berger J. Transmission of multidrug-resistant tuberculosis in a large urban setting. Am J Respir Crit Care Med. 1995;152:355–359. doi: 10.1164/ajrccm.152.1.7599845. [DOI] [PubMed] [Google Scholar]
  • 33.Friedman C.R., Quinn G.C., Kreiswirth B.N., Perlman D.C., Salomon N., Schluger N. Widespread dissemination of a drug-susceptible strain of Mycobacterium tuberculosis. J Infect Dis. 1997;176:478–484. doi: 10.1086/514067. [DOI] [PubMed] [Google Scholar]
  • 34.Madhavilatha G.K., Joseph B.V., Paul L.K., Kumar R.A., Hariharan R., Mundayoor S. Whole-genome sequences of two clinical isolates of Mycobacterium tuberculosis from Kerala, South India. J Bacteriol. 2012;194:4430. doi: 10.1128/JB.00453-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Shevchenko A., Wilm M., Vorm O., Mann M. Mass spectrometric sequencing of proteins silver-stained polyacrylamide gels. Anal Chem. 1996;68:850–858. doi: 10.1021/ac950914h. [DOI] [PubMed] [Google Scholar]
  • 36.Peng J., Gygi S.P. Proteomics: the move to mixtures. J Mass Spectrom. 2001;36:1083–1091. doi: 10.1002/jms.229. [DOI] [PubMed] [Google Scholar]
  • 37.Eng J.K., McCormack A.L., Yates J.R. An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J Am Soc Mass Spectrom. 1994;5:976–989. doi: 10.1016/1044-0305(94)80016-2. [DOI] [PubMed] [Google Scholar]
  • 38.Mukherjee S., Kashino S.S., Zhang Y., Daifalla N., Rodrigues-Junior V., Reed S.G. Cloning of the gene encoding a protective Mycobacterium tuberculosis secreted protein detected in vivo during the initial phases of the infectious process. J Immunol. 2005;175:5298–5305. doi: 10.4049/jimmunol.175.8.5298. [DOI] [PubMed] [Google Scholar]
  • 39.Manson A.L., Cohen K.A., Abeel T., Desjardins C.A., Armstrong D.T., Barry C.E., III Genomic analysis of globally diverse Mycobacterium tuberculosis strains provides insights into the emergence and spread of multidrug resistance. Nat Genet. 2017;49:395–402. doi: 10.1038/ng.3767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Schmieder R., Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27:863–864. doi: 10.1093/bioinformatics/btr026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Wood D.E., Salzberg S.L. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014;15 doi: 10.1186/gb-2014-15-3-r46. R46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Li H., Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Li H. Improving SNP discovery by base alignment quality. Bioinformatics. 2011;27:1157–1158. doi: 10.1093/bioinformatics/btr076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Walker B.J., Abeel T., Shea T., Priest M., Abouelliel A., Sakthikumar S. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9 doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kurtz S., Phillippy A., Delcher A.L., Smoot M., Shumway M., Antonescu C. Versatile and open software for comparing large genomes. Genome Biol. 2004;5 doi: 10.1186/gb-2004-5-2-r12. R12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Sekizuka T., Yamashita A., Murase Y., Iwamoto T., Mitarai S., Kato S. TGS-TB: total genotyping solution for Mycobacterium tuberculosis using short-read whole-genome sequencing. PLoS One. 2015;10 doi: 10.1371/journal.pone.0142951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Coll F., McNerney R., Guerra-Assuncao J.A., Glynn J.R., Perdigao J., Viveiros M. A robust SNP barcode for typing Mycobacterium tuberculosis complex strains. Nat Commun. 2014;5:4812. doi: 10.1038/ncomms5812. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Farhat M.R., Sultana R., Iartchouk O., Bozeman S., Galagan J., Sisk P. Genetic determinants of drug resistance in Mycobacterium tuberculosis and their diagnostic value. Am J Respir Crit Care Med. 2016;194:621–630. doi: 10.1164/rccm.201510-2091OC. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES