Abstract
Bacterial culture of M. tuberculosis (MTB), the causative agent of tuberculosis (TB), from clinical specimens is the gold standard for laboratory diagnosis of TB, but is slow and culture-negative TB cases are common. Alternative immune-based and molecular approaches have been developed, but cannot discriminate between active TB (ATB) and latent TB (LTBI). Here, to identify biomarkers that can discriminate between ATB and LTBI/healthy individuals (HC), we profiled 116 serum samples (HC, LTBI and ATB) using a protein microarray containing 257 MTB secreted proteins, identifying 23 antibodies against MTB antigens that were present at significantly higher levels in patients with ATB than in those with LTBI and HC (Fold change > 1.2; p < 0.05). A 4-protein biomarker panel (Rv0934, Rv3881c, Rv1860 and Rv1827), optimized using SAM and ROC analysis, had a sensitivity of 67.3% and specificity of 91.2% for distinguishing ATB from LTBI, and 71.2% sensitivity and 96.3% specificity for distinguishing ATB from HC. Validation of the four candidate biomarkers in ELISA assays using 440 serum samples gave consistent results. The promising sensitivity and specificity of this biomarker panel suggest it merits further investigation for its potential as a diagnostic for discriminating between latent and active TB.
Subject terms: Biomarkers, Diseases
Introduction
Tuberculosis (TB), caused by Mycobacterium tuberculosis (MTB), is the leading cause of death from infectious diseases1. An estimated one-quarter of the world’s population is latently infected with MTB (latent tuberculosis infection (LTBI)), and 1.41 million people died from TB in 20192. Approximately 5–10% of those infected with MTB will develop active tuberculosis (ATB) during their lifetime. Pulmonary infection accounts for 75% of active TB disease3 and is also the main factor underlying high tuberculosis mortality, especially in populations, such as individuals living with HIV-AIDS, with lowered immunity4. The identification of MTB-infected individuals and the appropriate treatment of those who develop ATB are undoubtedly crucial for effective TB control5.
Current clinical diagnosis of TB still relies on a traditional approach that includes acid-fast bacillus (AFB) smears, nucleic acid amplification (NAA) (e.g. Xpert MTB/RIF or Xpert MTB/RIF Ultra), and culture of M. tuberculosis from sputum and other respiratory specimens, in addition to evaluation of clinical symptoms6. However, sputum smears have very low sensitivity (10–20%), and MTB culture lacks sensitivity and requires 2–8 weeks to obtain results7. Although the speed and sensitivity of Xpert MTB/RIF is high, this technique cannot replace AFB sputum smears and culture methods due to its low sensitivity in detecting AFB-negative individuals, its high operation costs, and inconsistency in results between laboratories8,9. Prolonging diagnosis delays the commencement of appropriate treatment for TB; in the case of sputum smear-negative tuberculosis, delayed diagnosis is known to lead to irreversible lung injury10, one of the causes of TB’s high mortality. In vivo tuberculin skin tests (TST) and in vitro interferon gamma release assays (IGRAs) are considered auxiliary methods for diagnosing TB11; however, the TST cannot distinguish between healthy individuals vaccinated with the BCG and those with active TB disease. IGRA testing must be conducted under standard laboratory conditions by trained personnel12 and its reproducibility is disputed13–15. In addition, neither test can distinguish between ATB and LTBI16,17. There is therefore an urgent need for a rapid, simple and more accurate method for diagnosing active TB disease.
Serological screening of disease-related serum biomarkers is a convenient, quick and non-invasive method for disease diagnosis. The sensitivity and specificity of biomarkers for the disease in question is an important indicator of whether they can be used in clinical practice, determining the reliability of test results18. Serum biomarkers for TB, such as the 38kD, 16kD, ESAT-6, MPT63, 19kD, MPT64, MPT32, Rv1009, MTB48, Mtb81, MTC28, Ag85B, and KatG proteins have been identified19, but while most show high sensitivity in ATB patients20, their sensitivity and specificity is insufficient for discriminating between active TB and LTBI21. The screening of new TB serum biomarkers that can discriminate between active TB and LTBI is thus of great importance for improving diagnostic accuracy.
Here, we set out to identify serum biomarkers that can be used to diagnose active pulmonary TB. Reasoning that secreted proteins are likely to be a useful source of biomarkers, we used an MTB protein microarray containing 257 M. tuberculosis recombinant secreted proteins. In the biomarker discovery phase of the study, we screened 116 samples (52 ATB, 37 LTBI, and 27 HC) with the microarray, identifying 23 antibodies that showed a differential pattern between the ATB and LTBI and HC groups. We then identified a panel of 4 proteins with high sensitivity and specificity that could distinguish ATB from LTBI, and verified the performance of these four proteins using ELISA assays and 440 serum samples. This panel merits further investigation for its potential in diagnosing active pulmonary tuberculosis.
Materials and methods
Study cohort
Serum samples from individuals with active tuberculosis (ATB, 205) or latent tuberculosis (LTBI, 123) and healthy donors (HC, 112) included in this study were collected at Hebei Chest Hospital between May 2018 and Jan 2019. Diagnosis of active tuberculosis was based on standard criteria, including clinical symptoms, chest radiograph abnormalities, AFB sputum smears, bacterial culture and IGRA (X.DOT-TB, TB Healthcare, Foshan, China) results. The active TB group included individuals who displayed TB-specific clinical symptoms, had abnormal chest radiography consistent with active TB, and were sputum AFB positive and/or bacterial culture positive. Samples from patients in the ATB group were collected before treatment. The LTBI group included individuals who did not display clinical symptoms, but were IGRA positive and showed no signs of active TB in chest X-rays. The healthy control (HC) group included individuals who did not display clinical symptoms, and were IGRA negative and showed no signs of TB in chest X-rays. Individuals who tested positive for human immunodeficiency virus (HIV), or were taking immunosuppressive or immunopotentiator agents were excluded from the study. This study was approved by the Ethics committee of Hebei Chest Hospital (Hebei Province, China), in accordance with the Declaration of Helsinki (No. 2020076). Informed consent was obtained from all subjects or their legal guardians if the subjects were under 18.
Serum samples
Peripheral blood samples (5 mL) were collected in vacutainer tubes. Sera were obtained by centrifugation at 1509×g for 10 min, and were then aliquoted into sterile polypropylene microtubes and stored at − 80 °C until required.
Profiling on MTB secreted protein microarrays
The MTB proteome microarrays used in this study were constructed by BC-BIO (Guangzhou, China). Microarrays comprised 257 recombinant M. tuberculosis H37Rv secreted proteins. All proteins had GST tags and were expressed and purified using a S. cerevisiae expression system. Microarrays were incubated with GST antibodies to assess their quality, human IgG and IgM being used as positive controls, and bovine serum protein (BSA) as a negative control.
Microarrays were blocked in blocking buffer (1× PBS, 3% BSA, 0.1% Tween 20 [pH 7.4]) for 3 h at room temperature with shaking. After blocking, 200 μL serum samples (1:50 dilution in PBST-B (1× PBS, 1% BSA, 0.1% Tween 20 [pH 7.4])) were overlaid onto protein microarrays and incubated at 4 °C overnight. After washing three times in PBS containing 0.1% Tween 20 detergent (PBST), arrays were probed with goat anti-human IgG conjugated with Cy3, and goat anti-human IgM conjugated with Cy5 (Jackson Laboratory, PA, USA) at a 1:1000 dilution and incubated in a dark room for 1 h at room temperature. After washing three times in PBST and twice in ddH2O, arrays were dried in a SlideWash (Capital Bio) and then scanned in a GenePix 4200A slider scanner (Molecular Devices).
Protein array data preprocessing
The median values of the foreground (Fij) and background (Bij) intensities of a given protein spot (i,j) on the protein array were extracted from quantified sample array image data exported from GenePix Pro 6.0. The ratio of the mean signal (protein spot signal intensity (Rij): Fij/Bij) over the mean background signal was determined for each protein spot. This preprocessing method thus normalized all features in the array to their background signals. As each protein was printed on the array in duplicate, Rij was averaged for each protein as Rp.
Data analysis
SAM (Significance analysis of microarrays, R software (v3.6.1))22 was used to determine proteins to which ATB, LTBI and HC samples showed a statistically significant immunogenic response. Stringent criteria were set; significantly upregulated proteins were only called if the ratio of signal intensity between the two groups being compared was > 1.2 at p ≤ 0.05. Significantly downregulated proteins were called if the ratio of the signal intensity values was < 0.83. The heatmap for the differential protein set was drawn using the ‘pheatmap’ package (v1.0.12)23 in the R statistical language (v3.6.1)22. Biomarker candidates were the differential proteins with the highest discriminant ability (Discriminant ability = Sensitivity + Specificity − 100%)24 and were selected for each comparison (ATB versus LTBI and ATB versus HC). Candidate biomarkers were differential proteins (fold change > 2 and p-value < 0.05) that satisfied the following criteria: (1) they showed at least 90% specificity and (2) had the highest discriminant ability (> 10%).
Functional analysis
GO (Gene Ontology) classification of differential proteins according to molecular function and biological process was based on their functional annotations using the R package clusterProfiler (v3.4.4)25. GO ‘Biological Process’ (BP)26 and ‘molecular function’ (MF) enrichment analyses were also performed for the two groups of differential proteins using clusterProfiler. The p-value threshold for significance was selected as < 0.05. Protein–protein interactions (PPI) between differential proteins were analysed using the STRING database (v10.0)27, and networks were visualized with Cytoscape (v3.4.0)28.
ELISA assays
MTB antigens (BCBIO, Guangzhou, China) were diluted to 1 μg/mL with a coating solution (sodium carbonate 1.59 g/L, sodium bicarbonate 2.93 g/L), added to 96-well plates, and allowed to stand overnight at 4 °C for coating. Plates were washed three times with PBST, then blocked with blocking buffer (1× PBS, 3% BSA, 0.1% Tween 20 [pH 7.4]) at room temperature for 3 h. 100 μL serum samples (diluted 1:100 in PBST-B) were added to antigen-coated wells and incubated at room temperature for 30 min. After washing the plates five times with PBST, anti-human IgG antibody (CWBiotech, Beijing, China), diluted 1:10,000 in PBST-B, was added, and incubated at room temperature for 30 min. After washing five more times with PBST, plates were developed using TMB substrate (BD, NJ, USA) in a dark room for 10 min at 37 °C; reactions being stopped using 2 mol/L sulfuric acid. The optical density of the wells was determined at 450 nm/620 nm using an automated microplate reader (Perlong, Beijing, China).
Statements on study approvals
We confirm that all methods used in this study were carried out in accordance with relevant guidelines and regulations.
Results
Overall study design
We employed a two-phase strategy29,30 to identify novel biomarkers for ATB (Fig. 1). In the discovery phase, we profiled serum samples from 52 ATB patients, 37 individuals with LTBI and 27 healthy volunteers on a protein microarray containing 257 M. tuberculosis secreted proteins. After identifying differential antigens, i.e. proteins present at different levels in samples from ATB patients and those from individuals with LTBI or healthy individuals, promising candidate biomarkers were selected based on their sensitivity and specificity in the differential diagnosis of ATB. A panel of four biomarkers that gave optimal sensitivity and specificity was then selected using SAM (significance analysis of microarrays). In the subsequent validation phase of the study, an additional 324 serum samples were evaluated in a separate ELISA experiment along with the 116 samples used in the discovery phase (205 ATB, 123 LTBI, 112 HC; Table 1; Fig. 1).
Table 1.
Active tuberculosis | Latent tuberculosis | Healthy donors | |
---|---|---|---|
Discovery phase | |||
Number | 52 | 37 | 27 |
Average age (range) | 40.3 (15–76) | 36.1 (17–84) | 39 (18–91) |
Gender (M/F) | 34/19 | 21/16 | 11/16 |
Smear-positive (%) | 28 (53.8%) | – | – |
Culture-positive (%) | 37 (71.1%) | – | – |
IGRA positive | – | 37 | 0 |
Validation phase | |||
Number | 205 | 123 | 112 |
Average age (range) | 41.9 (15–90) | 39.5 (17–84) | 33.5 (18–91) |
Gender (M/F) | 111/94 | 70/53 | 42/70 |
Smear-positive (%) | 102 (49.7%) | – | – |
Culture-positive (%) | 150 (73.2%) | – | – |
IGRA positive | – | 123 | 0 |
MTB secreted protein microarrays
In our search for serum biomarkers associated with active TB, we focused on secreted proteins, reasoning that they mediate important functions through their interactions with host cells such as macrophages and are thus likely important in virulence and pathogenesis31,32. We thus used a protein microarray containing 257 secreted proteins from the M. tuberculosis reference strain H37Rv32,33 to profile serum samples from people with differing TB status. We first tested the quality and stability of the microarrays by profiling one sample from the ATB, LTBI, and HC groups (selected randomly) on secreted protein microarrays. Microarray quality was validated by comparing the signal intensity of the two replicate points for each protein on each array (Supplementary Fig. S1). The average Pearson correlation coefficient (R2) between replicate proteins was 0.99 (Supplementary Fig. S1, top right), indicating that the secreted protein microarrays were homogeneous and reproducible.
Differential antibodies identified using the secreted protein microarray
In the discovery phase, 52 ATB, 37 LTBI, and 27 HC serum samples were profiled on MTB secreted protein microarrays, antibodies against MTB secreted proteins on the microarray being identified by developing the arrays with Cy3-labeled anti-human IgG antibodies. Signal intensities were normalized against the median value of the microarray, and comparisons were made between the groups to identify differential antibodies, thresholds being set at p ≤ 0.05 and fold change > 1.2. Using these criteria, 37, 53, and 1 differential proteins were identified in the ATB-vs-HC, ATB-vs-LTBI, and LTBI-vs-HC comparisons respectively (Fig. 2a–c).
Cluster analysis of the differential proteins showed that the ATB group was well separated from the HC and LTBI groups (Supplementary Fig. S2). The lists of differential proteins identified in the ATB-vs-HC and ATB-vs-LTBI comparisons had 23 proteins in common. These 23 proteins are likely a source of biomarkers that can discriminate the ATB group from the HC and LTBI groups (Fig. 2d,e).
Gene Ontology and protein–protein interaction network analysis
Data enrichment analysis was performed using clusterProfiler (R package) and GO annotation, and differential antigens were considered to be significantly enriched in a particular GO term when p < 0.05. Differential proteins from the ATB-vs-HC comparison were enriched in the ‘molecular function’, ‘ferroxidase activity’, ‘ferric iron binding’, and ‘oxidoreductase activity’ terms, while those from the ATB-vs-LTBI comparison were enriched in the ‘protein binding’ term (Fig. 3a green bar chart).
Biological process enrichment analysis of differential proteins identified in the ATB-vs-LTBI comparison indicated that processes related to interspecific interactions showed significant enrichment (Fig. 3a). In the ATB-vs-HC comparison, differential proteins were significantly enriched in biological processes related to the chelation and transport of metal ions (p < 0.05), the three most enriched biological processes being pathogenesis, iron ion transport, and iron assimilation (Fig. 3a).
We used STRING analysis of protein–protein interaction networks to determine which differential proteins from the ATB-vs-LTBI and ATB-vs-HC comparisons were important in the functional network. Key proteins included Rv1827, Rv1860, Rv3413, Rv0934, Rv3803c, and Rv3881c (ATB-vs-LTBI comparison; Fig. 3b left panel), and Rv3881c, Rv0064, Rv1827, Rv1860, and Rv0934 (ATB-vs-HC comparison; Fig. 3b right panel). Although the biological process and molecular function enrichment results for the ATB-vs-HC and ATB-vs-LTBI comparisons were different, there was some overlap in key proteins, Rv3881c, Rv1860, Rv0934, and Rv1827 being common to both lists.
ROC analysis of biomarkers and derivation of a combined panel for ATB diagnosis
To identify potential serum biomarkers for active pulmonary TB diagnosis, we plotted receiver operating characteristic (ROC) curves for differential proteins and calculated the areas under the curve (AUCs). Sensitivity and specificity values were calculated for each protein. Selecting the following criteria (a) specificity > 90%; (b) discriminant ability (sensitivity plus specificity − 100) > 10%; (b) fold change > 2; and (c) statistical significance p < 0.05), resulted in identification of 8 candidate proteins from the anti-IgG profiles in the ATB-vs-LTBI comparison and 4 candidate proteins from the anti-IgG profiles in the ATB-vs-HC comparison, respectively (Fig. 4a,b, Supplementary Table S1, S2). The signal intensities of the four proteins common to both the ATB-vs-LTBI and ATB-vs-HC comparisons, Rv0934, Rv3881c, Rv1860 and Rv1827, were all significantly different (Fig. 4c). The diagnostic performance of each candidate biomarker was assessed using areas under the receiver operating characteristic (ROC) curves (AUCs). The AUC values of the four proteins ranged from 0.56 to 0.788 (Table 2). Next the maximum discriminant ability values for each protein were calculated with a requirement of a minimum specificity of 90% (see “Materials and methods” section). The optimal cutoff values of the signal intensity for each protein were then determined with the corresponding sensitivity and specificity values (Table 2).
Table 2.
Proteina | ATB-vs-LTBI | ATB-vs-HC | ||||
---|---|---|---|---|---|---|
AUC | Specificity (%) | Sensitivity (%) | AUC | Specificity (%) | Sensitivity (%) | |
Rv0934* | 0.735 | 97.297 | 46.154 | 0.788 | 100.000 | 51.923 |
Rv1827* | 0.560 | 97.297 | 17.308 | 0.563 | 100.000 | 13.462 |
Rv1860* | 0.655 | 100.000 | 25.000 | 0.637 | 100.000 | 26.923 |
Rv3881c* | 0.634 | 97.297 | 23.077 | 0.611 | 96.296 | 30.769 |
Rv2236c | 0.708 | 97.297 | 28.163 | 0.710 | 77.778 | 63.462 |
Rv3248c | 0.680 | 94.595 | 25.000 | 0.491 | 59.259 | 53.846 |
Rv3413c | 0.629 | 97.297 | 26.923 | 0.570 | 88.889 | 32.692 |
Rv3803c | 0.674 | 94.595 | 38.462 | 0.670 | 48.148 | 82.692 |
aProteins marked with an asterisk (*) were selected as candidate biomarkers in both comparisons.
Of note, proteins Rv0934, Rv3881c, Rv1860 and Rv1827 had sensitivities > 46.2% and specificities > 97.3%, making them very good candidates for diagnostic biomarkers.
We noted that the sensitivity values of each candidate protein ranged from 13.46 to 46.15%, so attempted to identify a combined biomarker panel that would offer better performance. The performance of all possible combinations (14 combinations) of between two and four proteins was evaluated using a previously reported approach34. Firstly a binary scoring system was used to convert the actual signal intensity of each protein to either 1 or 0, with 1 represented a signal intensity greater than the optimal cutoff value and 0 represented a signal intensity lower than the optimal cutoff value. Secondly the performance of every possible combination in the discovery cohort was evaluated. The total binary score of a given combination of n proteins was assigned to each serum sample as a summary score. If the summary score of a sample was greater than k (1 ≤ k ≤ n), the sample was determined as positive. For each protein combination, the sensitivity and specificity at the best discriminant ability were recorded. Finally, the optimal protein combination and its k value with the best discriminant ability at a minimum specificity of 90% were identified. The best panel combination, comprised of Rv0934, Rv3881c, Rv1860 and Rv1827, achieved 67.3% sensitivity at 91.2% specificity with a k value of 1 for discriminating ATB from LTBI, and 71.2% sensitivity at 96.3% specificity for discriminating ATB from HC (Fig. 4d). In other words, a serum sample would be scored positive when at least one (i.e. k = 1) of the four proteins showed a signal intensity greater than the corresponding optimal cutoff value. These promising high sensitivity and specificity values suggest that this panel of four proteins is worth investigating further and applying in the development of a diagnostic panel for ATB.
Validation of the ATB biomarker panel using ELISA assays
To develop a clinically useful application of the biomarkers identified here, we verified the biomarkers using an enzyme-linked immunosorbent assay (ELISA) platform. Rv0934, Rv3881c, Rv1860, and Rv1827 were purified from yeast and coated onto the individual wells of ELISA plates. Results obtained were consistent with those obtained in the discovery phase using microarrays; all four proteins showed significantly higher signals in the ATB group than in the HC or LTBI groups (Fig. 5a, Supplementary Table S3). When specificity was set at > 90%, the four proteins had sensitivities ranging from 13.17 to 59.02%, and ROC analysis of ELISA data gave AUC values for the four proteins of between 0.76 and 0.86.
We next evaluated the performance of the biomarker panel against the ELISA data. The ELISA data were converted to a binary scoring system in a manner similar to that in the discovery phase, and the same criteria (specificity > 90%) were used. 71.22% and 66.83% of ATB samples in the ATB groups were correctly scored as positive in the ATB-vs-LTBI and ATB-vs-HC comparisons, respectively (Fig. 5b), while only 8.04% of samples in the LTBI group and 8.12% of samples in the HC group were scored as false positives. This biomarker panel thus showed a sensitivity of 71.22% at a specificity of 91.87% for ATB diagnosis based on ELISA tests.
Discussion
The lack of a rapid diagnostic test for ATB hinders the timely diagnosis and treatment of this devastating infectious disease, impacting patient management and ultimately resulting in further disease transmission. Here, by performing a systematic search for secreted MTB antigens that can distinguish ATB from LTBI using an MTB secreted protein microarray and serum samples, we identified a four antigen panel (Rv0934, Rv1827, Rv1860 and Rv3881c) that could distinguish ATB from LTBI/healthy individuals with a sensitivity of 67% and specificity of 92%. Validation of this protein panel by ELISA gave similar results. This ATB diagnostic panel deserves further evaluation with larger cohorts of ATB and LTBI samples from different locations to determine its broad applicability.
The considerable efforts invested in developing tests that can distinguish latent and active TB have yet to yield a test with sufficient discriminatory power for clinical application. The approach taken in most studies has been to evaluate the discriminatory ability of known MTB antigens or combinations of antigens, rather than to take a systematic approach to antigen discovery. MTB antigens identified so far that can discriminate individuals infected with MTB from those without infection (but not from those with active TB) include ESAT-6 (Rv3875), CFP-10 (Rv3874), Ag85B (Rv1886c), LAM, Mtb81, MTC28, and KatG35,36. Antigens evaluated for their ability to differentiate latent from active TB have been reviewed in detail elsewhere37, and include latency-associated antigens from the Dormancy of survival Regulon (DosR, including Rv0081, Rv1733, Rv1737, Rv2029c, and Rv2031), and the Region of Difference (Rv2659c, Rv2660c), Resuscitation promoting factors (Rpfs, Rv0867c, Rv2389c), reactivation associated antigens (Rv1131 and Rv3862c) and the Ag85 complex (Ag85A, Ag85B and Ag85C). While many studies have been performed, to date, a panel of biomarkers with adequate sensitivity and specificity for discriminating LTBI and active TB has been elusive.
Here, we systematically searched for secreted proteins with high discriminatory ability, reasoning that these key components of the MTB weaponry that play important roles in pathogenesis, secreted at different stages of infection, likely lead to variation in the host immune response31,32 and are thus a reservoir of potential biomarkers for discriminating different stages of disease. MTB can remain dormant within its host for a long time, interactions between secreted bacterial proteins and host cells mediating its ability to evade the host immune system and survive within macrophages.
Development and application of high-throughput technologies for screening serum biomarkers of various diseases has shown great promise for facilitating disease diagnosis38. Protein microarray technology can be used to detect changes in large numbers of proteins within a sample simultaneously, and can detect changes in the modification status of proteins, making it especially suitable for screening disease diagnostic markers and drug targets. Here, we used a commercial MTB secreted protein microarray containing 257 secreted proteins from the H37Rv reference strain purified using a yeast expression system32,33. Proteins expressed using a yeast system have posttranslational modifications39 and are suitable for screening functional antigens40. Our use here of a microarray using yeast-expressed proteins rather than proteins purified using the E. coli system may have facilitated our discovery of previously unreported antigens associated with active TB disease.
Our study design has several strengths. First, we employed a commercial TB secreted protein microarray with a high coverage of TB secreted proteins to improve the likelihood of finding potential biomarkers. Second, our study included samples from 328 patients with ATB or LTBI and 112 samples from healthy volunteers. Third, we used samples from both LTBI and healthy individuals as negative controls to improve discrimination of ATB. Finally, we used ELISA as an independent platform to validate the performance of the newly discovered biomarker panel.
Individual antigens rarely perform well enough to be useful as single biomarkers for a disease, but a panel of several markers can significantly improve performance. Here, individual antigens ranged in sensitivity from 17 to 46%, but when four promising antigens were combined into a panel, the sensitivity of the panel was 62% at a specificity of 92%. Consistent levels of antigen-specific IgG antibodies were detected in ELISA experiments to validate these four antigens, all showing significant differences between healthy individuals, those who had LTBI, and those who had active TB, suggesting that these antigens may elicit strong humoral immune responses.
To the best of our knowledge this is the first time that these biomarker candidates have been reported in a marker panel for discriminating LTBI from active TB. Interestingly, previous reports on each of these biomarker candidates link them with different aspects of virulence or pathogenesis. Rv0934 (Periplasmic phosphate-binding lipoprotein, PstS1) is a 38-kDa antigen commonly used in tests for diagnosing TB infection41. On its own, however, Rv0934 does not have sufficient sensitivity to discriminate active from latent TB. It belongs to the family of phosphate receptors for bacterial ABC-type lipoprotein transporters and is involved in active binding-protein-mediated import of inorganic phosphate across the membrane42. Rv1827 (GarA) is a conserved essential protein containing an FHA domain in its C-terminus. It is phosphorylated by Serine/threonine protein kinase G (PknG), an important virulence factor, and participates in the regulation of the tricarboxylic acid cycle43. Rv1860 (Alanine and proline rich secreted protein Apa) is a 45-kDa secreted glycoprotein predicted to be involved in cell wall and cell processes. It is a major immunodominant antigen that is considered to have potential as a vaccine against tuberculosis, and can elicit proliferation of both CD4+ and CD8+ T cells and IFN-γ secretion in healthy people with LTBI44. Changes in the mannosylation pattern of this protein affect its ability to stimulate T-lymphocyte responses. Virulence factor Rv3881c (Secreted ESX-1 substrate protein B, EspB) is a conserved alanine and glycine rich protein that is a member of the PE-PPE family45. It is reported to inhibit autophagosome formation in murine macrophages via downregulating the expression of IFN-γ receptor 146. Further mechanistic research on the involvement of these candidate biomarkers in the pathogenesis of active TB may provide interesting insights into the pathogenesis of active TB.
Our study also has some limitations. By focusing on secreted proteins, we may have missed other cell wall, or indeed intracellular proteins that provoke an important immune response during active disease. The number of samples tested in our study (440), particularly in the discovery phase (116) was limited. While the panel identified here showed high discriminatory power in this study cohort, one limitation is that all participants recruited were Chinese, raising the small possibility that there could be an ethnic bias. A larger cohort of samples, preferably drawn from more than one geographical region will be necessary to further evaluate the potential of this panel as a diagnostic test. We also did not discriminate between AFB positive and AFB negative active TB cases, nor did we consider extrapulmonary TB cases. All patients included were adults, so we are unable to predict if this panel would be efficacious for the diagnosis of childhood TB. In addition, pulmonary TB patients included in our study were sputum AFB positive and/or bacterial culture positive cases. The specificity of our marker panel for distinguishing ATB from HC in such cases was high (96.3%), however, its sensitivity (71.2%) falls short of that specified in the WHO’s optimal Target Product Profile (TPP) for non-sputum-based TB diagnosis tests (98%)47. Nonetheless, like other biomarker studies published that similarly did not reach these standards9,48,49, our study has useful reference value for those engaged in biomarker discovery. Further work on optimizing this panel will be necessary.
Of note, our four-antigen panel was unable to discriminate between individuals with latent TB and uninfected individuals (control group). Given the availability of many commercial platforms for IGRA-based detection of MTB infection based on ESAT-6/CFP-10-induced stimulation of IFN-γ release, the most likely application of the biomarker panel developed here would be as an ELISA test that is combined with one of the commercial ESAT-6/CFP-10 IGRA assays. Further research is also needed to identify biomarkers associated with other stages of MTB infection (e.g. recovery, re-activation), and prognostic markers that can predict treatment outcome.
The availability of rapid tests that can accurately detect active TB using the biomarker panel developed here should not only facilitate timely treatment of patients with TB, but should also greatly facilitate the implementation of proactive TB case-finding programs and ultimately reduce the spread of TB within the community, making a significant contribution to achieving the WHO’s End TB goals for eliminating TB by 2035.
Supplementary Information
Acknowledgements
We would like to gratefully acknowledge BC-BIO Biotechnology Co., Ltd. for technical help in performing microarray experiments and establishing the ELISA platform.
Author contributions
Conceived and designed the study: S.W. Performed the experiments: Z.L. Analyzed the data: Z.L. and J.H. Contributed reagents/material/analytical tools: P.L. Wrote the paper: Z.L. and D.C. Revised the paper: S.W. and H.D. Managed the project: S.W. Obtained funding: S.W. Read and approved the final version of the manuscript: all authors.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-021-93893-3.
References
- 1.Pai M, et al. Tuberculosis. Nat. Rev. Dis. Primers. 2016;2:16076. doi: 10.1038/nrdp.2016.76. [DOI] [PubMed] [Google Scholar]
- 2.WHO . WHO Global Tuberculosis Report 2019. World Health Organisation; 2020. [Google Scholar]
- 3.Denkinger CM, et al. Xpert MTB/RIF assay for the diagnosis of extrapulmonary tuberculosis: A systematic review and meta-analysis. Eur. Respir. J. 2014;44:435–446. doi: 10.1183/09031936.00007814. [DOI] [PubMed] [Google Scholar]
- 4.Jenkins HE, et al. Mortality in children diagnosed with tuberculosis: A systematic review and meta-analysis. Lancet Infect. Dis. 2017;17:285–295. doi: 10.1016/S1473-3099(16)30474-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lewinsohn DM, et al. Official American Thoracic Society/Infectious Diseases Society of America/Centers for Disease Control and Prevention Clinical Practice Guidelines: Diagnosis of tuberculosis in adults and children. Clin. Infect. Dis. 2017;64:111–115. doi: 10.1093/cid/ciw778. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Qiu X, et al. Accumulate evidence for IP-10 in diagnosing pulmonary tuberculosis. BMC Infect. Dis. 2019;19:924. doi: 10.1186/s12879-019-4466-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kim JH, Kim MJ, Ham SY. Clinical characteristics and chest computed tomography findings of smear-positive and smear-negative pulmonary tuberculosis in hospitalized adult patients. Int. J. Tuberc. Lung. Dis. 2019;98:e16921. doi: 10.1097/MD.0000000000016921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.WHO Global tuberculosis control: Key findings from the December 2009 WHO report. Wkly. Epidemiol. Rec. 2010;85:69–80. [PubMed] [Google Scholar]
- 9.Shi SD, Hsueh PR, Yang PC, Chou CC. Use of DosR dormancy antigens from Mycobacterium tuberculosis for serodiagnosis of active and latent tuberculosis. ACS Infect. Dis. 2019;6:272–280. doi: 10.1021/acsinfecdis.9b00329. [DOI] [PubMed] [Google Scholar]
- 10.Richard L, Armand VD, Ivan B, Mark FG. Laboratory Diganosis of Tuberculosis by Sputum Microscopy: The Handbook, Global Edition. SA Pathology; 2013. [Google Scholar]
- 11.Zhao YF, et al. IP-10 and RANTES as biomarkers for pulmonary tuberculosis diagnosis and monitoring. Tuberculosis (Edinb.) 2018;111:45–53. doi: 10.1016/j.tube.2018.05.004. [DOI] [PubMed] [Google Scholar]
- 12.Mahomed H, et al. The tuberculin skin test versus QuantiFERON TB Gold in predicting tuberculosis disease in an adolescent cohort study in South Africa. PLoS ONE. 2011;6:e17984. doi: 10.1371/journal.pone.0017984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Bellete B, et al. Evaluation of a whole-blood interferon-gamma release assay for the detection of Mycobacterium tuberculosis infection in 2 study populations. Clin. Infect. Dis. 2002;34:1449–1456. doi: 10.1086/340397. [DOI] [PubMed] [Google Scholar]
- 14.Shen F, Han C, Wang MS, Yu H. Poor agreement between repeated T-SPOT.TB in a short time period in a high TB burden country. Infect. Dis. (Lond.) 2018;50:771–774. doi: 10.1080/23744235.2018.1455007. [DOI] [PubMed] [Google Scholar]
- 15.Oh CE, Ortiz-Brizuela E, Bastos ML, Menzies D. Comparing the diagnostic performance of QFT-Plus to other tests of latent tuberculosis infection: A systematic review and meta-analysis. Clin. Infect. Dis. 2020 doi: 10.1093/cid/ciaa1822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Suzukawa M, et al. Combined analysis of IFN-γ, IL-2, IL-5, IL-10, IL-1RA and MCP-1 in QFT supernatant is useful for distinguishing active tuberculosis from latent infection. PLoS ONE. 2016;11:e0152483. doi: 10.1371/journal.pone.0152483. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Andersen P, Munk ME, Pollock JM, Doherty TM. Specific immune-based diagnosis of tuberculosis. Lancet. 2000;356:1099–1104. doi: 10.1016/S0140-6736(00)02742-2. [DOI] [PubMed] [Google Scholar]
- 18.Burbelo PD, et al. Serological diagnosis of pulmonary Mycobacterium tuberculosis infection by LIPS using a multiple antigen mixture. BMC Microbiol. 2015;15:205. doi: 10.1186/s12866-015-0545-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Abebe F, Holm-Hansen C, Wiker HG, Bjune G. Progress in serodiagnosis of Mycobacterium tuberculosis infection. Scand. J. Immunol. 2007;66:176–191. doi: 10.1111/j.1365-3083.2007.01978.x. [DOI] [PubMed] [Google Scholar]
- 20.Cao SH, et al. Screening of serum biomarkers for distinguishing between latent and active tuberculosis using proteome microarray. Biomed. Environ. Sci. 2018;31:515–526. doi: 10.3967/bes2018.069. [DOI] [PubMed] [Google Scholar]
- 21.Ren N, et al. Identification of new diagnostic biomarkers for Mycobacterium tuberculosis and the potential application in the serodiagnosis of human tuberculosis. Microb. Biotechnol. 2018;11:893–904. doi: 10.1111/1751-7915.13291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proc. Natl. Acad. Sci. U.S.A. 2001;98:5116–5121. doi: 10.1073/pnas.091062498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kolde R. Pheatmap: Pretty Heatmaps, R Package Version 1.0.12 (2019). https://CRAN.R-project.org/package=pheatmap. Accessed 20 March 2020.
- 24.Riegelman RK. Studying a Study & Testing a Test: Reading Evidence-Based Health Research. Lippincott Williams & Wilkins; 2013. [Google Scholar]
- 25.Yu GC, Wang LG, Han YY, He QY. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.The Gene Ontology Consortium Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Res. 2017;45:331–338. doi: 10.1093/nar/gkw1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Szklarczyk D, et al. STRING v10: Protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43:447–452. doi: 10.1093/nar/gku1003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Shannon P, et al. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Hu CJ, et al. Identification of novel biomarkers for behcet disease diagnosis using human proteome microarray approach. Nucleic Acids Res. 2017;43:147–156. doi: 10.1074/mcp.M116.061002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Yang L, et al. Identification of serum biomarkers for gastric cancer diagnosis using a human proteome microarray. Mol. Cell Proteomics. 2016;15:614–623. doi: 10.1074/mcp.M115.051250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Majlessi L, Prados-Rosales R, Casadevall A, Brosch R. Release of mycobacterial antigens. Immunol. Rev. 2015;264:25–45. doi: 10.1111/imr.12251. [DOI] [PubMed] [Google Scholar]
- 32.Målen H, Berven FS, Fladmark KE, Wiker HG. Comprehensive analysis of exported proteins from Mycobacterium tuberculosis H37Rv. Proteomics. 2007;7:1702–1718. doi: 10.1002/pmic.200600853. [DOI] [PubMed] [Google Scholar]
- 33.Deng JY, et al. Mycobacterium tuberculosis proteome microarray for global studies of protein function and immunogenicity. Cell Rep. 2014;9:2317–2329. doi: 10.1016/j.celrep.2014.11.023. [DOI] [PubMed] [Google Scholar]
- 34.Pan JB, et al. Identification of serological biomarkers for early diagnosis of lung cancer using a protein array-based approach. Mol. Cell. Proteomics. 2017;16:2069–2078. doi: 10.1074/mcp.RA117.000212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Zhang CQ, et al. Mycobacterium tuberculosis secreted proteins as potential biomarkers for the diagnosis of active tuberculosis and latent tuberculosis infection. J. Clin. Lab. Anal. 2015;29:375–382. doi: 10.1002/jcla.21782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bai XJ, Yan L, Yang YR, Feng JD, Wu XQ. Potential novel markers to discriminate between active and latent tuberculosis infection in Chinese individuals. Comp. Immunol. Microbiol. Infect. Dis. 2015;44:8–13. doi: 10.1016/j.cimid.2015.11.002. [DOI] [PubMed] [Google Scholar]
- 37.Meier NR, Jacobsen M, Ottenhoff THM, Ritz N. A systematic review on novel Mycobacterium tuberculosis antigens and their discriminatory potential for the diagnosis of latent and active tuberculosis. Front. Immunol. 2018;9:2476. doi: 10.3389/fimmu.2018.02476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Janvilisri T, Suzuki H, Scaria J, Chen JW, Charoensawan V. High-throughput screening for biomarker discovery. Dis. Mark. 2015;2015:108064. doi: 10.1155/2015/108064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Budzik JM, et al. Dynamic post-translational modification profiling of Mycobacterium tuberculosis-infected primary macrophages. Elife. 2020;9:e51461. doi: 10.7554/eLife.51461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Mehaffy C, Belisle JT, Dobos KM. Mycobacteria and their sweet proteins: An overview of protein glycosylation and lipoglycosylation in M. tuberculosis. Tuberculosis. 2019;115:1–13. doi: 10.1016/j.tube.2019.01.001. [DOI] [PubMed] [Google Scholar]
- 41.Sartain MJ, Slayden RA, Singh KK, Laal S, Belisle JT. Disease state differentiation and identification of tuberculosis biomarkers via native antigen array profiling. Mol. Cell Proteomics. 2006;5:2102–2113. doi: 10.1074/mcp.M600089-MCP200. [DOI] [PubMed] [Google Scholar]
- 42.Cole ST. Learning from the genome sequence of Mycobacterium tuberculosis H37Rv. FEBS Lett. 1999;452:7–10. doi: 10.1016/S0014-5793(99)00536-0. [DOI] [PubMed] [Google Scholar]
- 43.Rieck B, et al. PknG senses amino acid availability to control metabolism and virulence of Mycobacterium tuberculosis. PLoS Pathog. 2017;13:e1006399. doi: 10.1371/journal.ppat.1006399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Satchidanandam V, et al. The secreted protein Rv1860 of Mycobacterium tuberculosis stimulates human polyfunctional CD8+ T cells. Clin. Vaccine Immunol. 2016;23:282–293. doi: 10.1128/CVI.00554-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Korotkova N, et al. Structure of EspB, a secreted substrate of the ESX-1 secretion system of Mycobacterium tuberculosis. J. Struct. Biol. 2015;191:236–244. doi: 10.1016/j.jsb.2015.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Simar J, Belkhir L, Tombal B, André E. Mycobacterium tuberculosis EspB protein suppresses interferon-γ-induced autophagy in murine macrophages. BMC Res. Notes. 2017;10:64. doi: 10.1186/s13104-017-2382-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.WHO . High-Priority Target Product Profiles for New Tuberculosis Diagnostics: Report of a Consensus Meeting. World Health Organisation; 2014. [Google Scholar]
- 48.Perumal P, et al. Validation of differentially expressed immune biomarkers in latent and active tuberculosis by real-time PCR. Front. Immunol. 2020;11:612564. doi: 10.3389/fimmu.2020.612564. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Shao M, et al. Screening of potential biomarkers for distinguishing between latent and active tuberculosis in children using bioinformatics analysis. Medicine (Baltimore) 2021;100:e23207. doi: 10.1097/MD.0000000000023207. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.