ABSTRACT
Diagnosis of latent tuberculosis infection (LTBI) using biomarkers in order to identify the risk of progressing to active TB and therefore predicting a preventive therapy has been the main bottleneck in eradication of tuberculosis. We compared two assays for the diagnosis of LTBI: transcript signatures and interferon gamma release assay (IGRA), among household contacts (HHCs) in a high tuberculosis-burden population. HHCs of active TB cases were recruited for our study; these were confirmed to be clinically negative for active TB disease. Eighty HHCs were screened by IGRA using QuantiFERON-TB Gold Plus (QFT-Plus) to identify LTBI and uninfected cohorts; further, quantitative levels of transcript for selected six genes (TNFRSF10C, ASUN, NEMF, FCGR1B, GBP1, and GBP5) were determined. Machine learning (ML) was used to construct models of different gene combinations, with a view to identify hidden but significant underlying patterns of their transcript levels. Forty-three HHCs were found to be IGRA positive (LTBI) and thirty-seven were IGRA negative (uninfected). FCGR1B, GBP1, and GBP5 transcripts differentiated LTBI from uninfected among HHCs using Livak method. ML and ROC (Receiver Operator Characteristic) analysis validated this transcript signature to have a specificity of 72.7%. In this study, we compared a quantitative transcript signature with IGRA to assess the diagnostic ability of the two, for detection of LTBI cases among HHCs of a high-TB burden population; we concluded that a three gene (FCGR1B, GBP1, and GBP5) transcript signature can be used as a biomarker for rapid screening.
IMPORTANCE The study compares potential of transcript signature and IGRA to diagnose LTBI. It is first of its kind study to screen household contacts (HHCs) in high TB burden area of India. A transcript signature (FCGR1B, GBP1, & GBP5) is identified as potential biomarker for LTBI. These results can lead to development of point-of-care (POC) like device for LTBI screening in a high TB burdened area.
KEYWORDS: latent tuberculosis infection, transcript signature, biomarker, interferon gamma release assay, household contacts, machine learning
INTRODUCTION
Latent tuberculosis infection (LTBI) is a state of persistent immune response to stimulation by Mycobacterium tuberculosis antigens without any clinical manifestation of active tuberculosis (TB) (1). The global prevalence of LTBI is estimated to be at ∼33% (2); though it is a non-communicable asymptomatic condition, but LTBI individuals stand at a significant risk of progression to active TB due to various factors such as low immunity, co-morbidities etc.
Identification of novel biomarkers for rapid diagnosis of LTBI is a prerequisite to eradicate TB since the current diagnostic methods for detection of LTBI are either subjective or expensive. Several studies have shown variations in transcription levels between healthy controls versus active TB and LTBI versus active TB populations (3–5). However, not many studies have identified transcript signatures that can distinguish LTBI individuals from uninfected, within the household contacts (HHCs) of active TB population. Such signatures can be useful for rapid screening of the already at risk-HHCs and thus could be an important tool in achieving the goal of End-TB.
Machine learning techniques are being used as a tool to identify hidden but significant underlying patterns among complex data sets. One such machine learning unsupervised approach is cluster analysis that is a collection of methods for defining subgroups of individuals with high heterogeneity (6–8). It is a widely used exploratory approach in biological studies discovering substructures inherent in a given data set. It is a hypothesis generating approach that assigns patients to clusters based on certain characteristics, so that homogeneity is high within the cluster and at the same time low between-group (9).
In the present study, for segregating cohorts of LTBI and uninfected individuals, an interferon gamma release assay (IGRA) based screening of HHCs - parent, sibling, spouse, child and others, of index TB cases was carried out. The transcript levels of six genes, that previously showed an association with TB-infection, namely, TNFRSF10C (Tumor Necrosis Factor Receptor Super Family Member 10c) (10), ASUN (Asunder spermatogenesis regulator), NEMF (Nuclear Export Mediator Factor) (11), FCGR1B (Fc gamma receptor 1B), GBP1 (Guanylate binding protein 1) and GBP5 (Guanylate binding protein 5) (12), were quantified by qRT-PCR.
For comprehensive analysis, two approaches of machine learning (ML) were applied. The outcomes of the ML approaches were further validated using various statistical analyses. This study is first of its kind to assess a transcript signature of three genes and compares it to IGRA for efficacy to diagnose LTBI among HHCs of high TB burdened Indian population, which paves a way for a broader trial and validation of this marker. A three-gene (FCGR1B, GBP1, and GBP5) transcript signature is identified which can be used as a biomarker for rapid screening of HHCs to diagnose LTBI.
RESULTS
Selection of LTBI and uninfected cohorts within household contacts of index TB populations.
Eighty HHCs of confirmed pulmonary TB Indian patients were screened by IGRA to segregate LTBI and uninfected individuals. IGRA assay was carried out twice using the same sample (supernatant from 4 tubes- Nil, TB1, TB2 and Mitogen) for each of the participant. Only those samples which showed repeated results in this assay, were considered in the study (Fig. 1). In case of any discrepancies between the technical repeats, the participant was eliminated from the study. Among 80 HHCs, 54% (n = 43) tested IGRA positive, and 46% (n = 37) tested IGRA negative; these cohorts were identified as LTBI and uninfected respectively (Table 1).
TABLE 1.
Household Contacts |
|||
---|---|---|---|
Characteristic | IGRA +ve | IGRA -ve | Total |
N (%) | 43 | 37 | 80 |
Age median (IQR) | 30 (14–58) | 27 (15–63) | 29 (14–63) |
Gender | |||
Male N (%) | 22 (53) | 20 (47) | 42 |
Female N (%) | 21 (55) | 17 (45) | 38 |
Index Cases | |||
Cat I contacts (%) | 36 (56) | 28 (44) | 64 |
Cat II contacts (%) | 2 (22) | 7 (78) | 9 |
Cat IV contacts (%) | 5 (71) | 2 (29) | 7 |
Diabetes | |||
Yes | 3 | 1 | |
No | 40 | 36 | |
Smoking | |||
Yes | 2 | 3 | |
No | 41 | 34 | |
Alcoholic | |||
Yes | 3 | 4 | |
No | 40 | 33 | |
Diet | |||
Veg | 14 | 17 | |
Non-veg | 25 | 15 | |
Ventilation | |||
Good | 4 | 4 | |
Avg | 19 | 17 | |
Poor | 20 | 16 |
IGRA: interferon gamma release assay; IQR: interquartile range. Veg.: vegetarian. Cat I: Drug Susceptible TB; Cat II: Drug Susceptible-Relapse; Cat IV: Drug Resistant TB. Ventilation: No. of rooms/No. of members (<0.5: Poor; 0.5–0.75: Average; >0.75 Good).
No statistical significance could be seen for the various demographic characteristics between the two cohorts. We further analyzed the relationship status of the HHCs with respect to their IGRA results and gender-wise distribution to assess any correlation; percentage wise distribution among different relationship categories for males: females in the LTBI and uninfected cohort (Fig. 2a). No statistical significance was found for age and gender between latent and uninfected cohorts (Table S2 [File 01] in the supplemental material).
To evaluate the level of IFN-γ production in IGRAs, levels of antigen-specific CD4+ and CD8+ T cells in LTBI cohort and uninfected cohort (Fig. 2b) were compared. To assess the risk of M. tuberculosis infection, difference of TB1 from TB2 was analyzed. A higher TB2 antigen response (TB2-TB1 > 0.6 IU/mL) was observed in six (∼14%) of LTBI subjects. Samples with values for TB1 or TB2 > 10.0 IU/mL were excluded since they were outside the linear range of the assay (Fig. 2c).
Variation in transcription levels of selected genes between LTBI and uninfected cohorts.
Next, the relative transcription levels of six prioritized genes, namely, TNFRSF10C, ASUN, NEMF, FCGR1B, GBP1, and GBP5, that previously showed an association with M. tuberculosis infection in different geographical locations, was assessed (10–12). qRT-PCR based analysis was carried out to estimate transcript levels of these genes among 80 HHCs, with a view to evaluate the potential of these genes to differentiate LTBI and uninfected cohorts among HHCs. FCGR1B gene presented the most significant differential expression, as the relative fold expression in the LTBI cohort was significantly higher compared to uninfected cohort (p0.018) followed by GBP1 gene (p0.039) and GBP5 gene (p0.592) (Fig. 3). Relatively low level of differential expression could be seen for ASUN and NEMF genes, which was not significant. However, no significant difference was observed for TNFRSF10C gene.
Unsupervised cluster analysis.
Unsupervised clustering technique was implemented to find any hidden pattern across the recruited HHCs of index TB cases, in order to group the participants in separate clusters based on their intra-cluster’s similarities and inter-cluster’s differences. A best cluster model was identified that had four groups that were found to be different from each other based on IGRA status (positive/negative) and expression levels of the three genes namely, FCGR1B, GBP1, and GBP5. The cluster model has an acceptable ratio of 1.87 and the silhouette measure of cohesion and separation for our cluster analysis was found to be more than 0.5 indicating it to be an acceptable robust cluster (Fig. 4) and suggesting that the within-cluster distance and the between cluster distance was significant.
Classification model for IGRA class-prediction.
The distribution of other available parameters, which were not used in the formation of the final cluster model, were assessed (Table 2).
TABLE 2.
Features in clusters | Cluster-1 | Cluster-2 | Cluster-3 | Cluster-4 |
---|---|---|---|---|
Sample no., n (%) | 18 (22) | 19 (23) | 15 (18) | 28 (35) |
IGRA | Negative | Negative | Positive | Positive |
Expression level of FCGR1B (avg ΔCt) | 5.95 (Lowest exp) | 4.12 | 2.90 (Highest exp) | 4.93 |
Expression level of GBP1 (avg ΔCt) | 8.71 (Lowest exp) | 3.39 | 1.66 (Highest exp) | 5.77 |
Expression level of GBP5 (avg ΔCt) | 10.49 (Lowest exp) | 3.95 | 2.29 (Highest exp) | 8.23 |
Female:Male | 7:11 (0.63) | 10:9 (1.11) | 10:5 (2.0) | 11:17 (0.64) |
TB2-TB1 | –0.17 to 0.1 | –0.17 to 0.51 | –7.25 to 7.44 | –4.65 to 2.48 |
Age (range) | 15-63 | 15-50 | 16-53 | 14-58 |
Age (mode) | 15 | 25 | 35 | 40 |
Age (avg) | 33.44 | 32.6 | 31.4 | 32.14 |
Age (median) | 32.5 | 27 | 33 | 29.5 |
The classification models were developed using expression data of the six prioritized genes, age and gender as input parameters. Out of the several generated models, based on all the algorithms, present in the WEKA package, the best three models were selected. Based on the feature selection technique, we concluded that the expression values of the three genes: FCGR1B, GBP1, and GBP5 could differentiate and predict the outcome of IGRA analysis of the subject with an accuracy of 67.9% (Table 3). The association of the three genes was further validated through statistical analysis.
TABLE 3.
Inputs | Algorithm | Sensitivity | Specificity | Accuracy |
---|---|---|---|---|
FCGR1B, GBP1, GBP5 | Rules.OneR | 74.4 | 60.55 | 67.9 |
GBP1, GBP5 | Rules.OneR | 74.4 | 60.5 | 67.9 |
NEMF, GBP1, GBP5, FCGR1B | Rules.OneR | 74.4 | 60 | 67.9 |
ROC analysis.
ROC analysis was performed on Ct difference to GAPDH for all the genes (Fig. 5a). Further, to validate the association of the three genes as a combination among these six genes, binomial logistic regression and ROC was performed to ascertain the gene combinations to correctly classify the study subjects. The discrimination of LTBI and uninfected cohorts using the combination of FCGR1B, GBP1, and GBP5 genes achieved the highest area under the curve (AUC) of 0.68 (95% CI = 0.56-0.80). The threshold for this 3-gene combination that discriminated the IGRA positive groups from IGRA negative groups with the greatest accuracy was identified using the Youden index. This threshold achieved a sensitivity of 0.64 (95% CI 0.48–0.78) and a specificity of 0.72 (95% CI 0.55–0.87), giving a positive likelihood ratio of 2.36 (Fig. 5b). ROC graphs for other gene combinations have been shown in Fig. S1 in the supplemental material.
DISCUSSION
Transcript signatures can play an important role in differentiating the healthy population from latent individuals as well as in predicting the progression from LTBI to active TB disease. These signatures have emerged ahead of proteomics and metabolomics biomarkers for diagnosis of TB (13–16); this strategy takes advantage of well-established sample-processing pathways as well as rapid multiplex PCR platforms (17, 18). Though, IGRA can be used for the screening of latent infected individuals, however, technical difficulties including collection of larger volume of blood, limited reproducibility of the assay, as well as the tedious sample processing limits its usage in large numbers. Therefore, in the present study, our goal was to assess the levels of transcript signatures to distinguish LTBI and uninfected cohorts among the HHCs of index TB cases within an Indian geographical and ethnic population.
A number of transcript signature markers have been reported to differentiate active TB cases across different geographical areas around the world. Maertzdorf et al. earlier reported that transcript levels for FCGR1B, GBP1, and GBP5 could discriminate between active TB versus LTBI groups and active TB versus healthy control groups using South African population as study group (12). Similarly, Lee et al. reported ASUN and NEMF transcript levels could differentiate LTBI from healthy individuals in a cohort from Taiwan (11). Lastly, Wang et al. showed in a study group from Fudan that TNFRSF10C could distinguish between individuals in the active TB, LTBI and healthy control groups using transcriptional profiling study (10). Roe et al. identified a transcript signature of three genes with a high predictive value to identify progression of TB disease in HIV patients in 3 months which shows the importance of a transcript signature-based biomarker (19).
In the present study, we evaluated the potential of these six transcript signatures for defining LTBI cases in a high TB burden Indian population by qRT-PCR since the identification and treatment of LTBI cases form an essential component of the TB eradication programs. The expression of FCGR1B and GBP1 was found to be significantly higher in the LTBI cohort compared to the uninfected cohort, followed by GBP5, however GBP5 differential expression was not statistically significant. The present results were consistent with the previous study by Maertzdorf et al. on the usefulness of expression profile of FCGR1B, GBP1, and GBP5. No significant difference in the transcript levels for the genes TNFRSF10C, ASUN, and NEMF was seen in the present study.
The genes FCGR1B, GBP1, and GBP5 play a role in the host immune response during mycobacterial infection. Satproedprai et al. reported that upon bacterial infection, FCGR1B upregulation induced humoral immune response and played a role in lung inflammation (20). It was also reported as one of the most differentially expressed genes in individuals with TB and LTBI than uninfected individuals (12). Likewise, GBP1and GBP5 belong to a family of IFN-γ-inducible Guanylate binding proteins (GBPs) and play a key role in host response to intracellular infection (21–23).
ML based cluster analysis was utilized with a view to decipher any correlation among all possible combinations of parameters in LTBI and uninfected cohorts of HHCs. Based upon this analysis, four significant distinguishable clusters were identified; the LTBI cases were distributed into two distinct clusters (cluster 3 and 4) which showed variation for the parameter: IFN-γ. In the LTBI cohort, a higher TB2 antigen response (TB2-TB1 > 0.6 IU/mL) was observed in six subjects (∼14%), suggesting that these subjects are at a higher risk of progression to active TB. The IFN-γ response from CD8+ T cells elicited by MTB complex-specific antigens ESAT-6 and CFP-10 are more frequently detected in subjects with active TB than LTBI (24). Therefore, a higher TB2 antigen response elicited by CD8+ T cells is said to be associated with severe MTB infection and, consequently, with increased risk of progression to active TB (25). In an earlier study by Petruccioli et al. and Barcellini et al., a higher TB2 antigen response was reported in a subgroup of LTBI contacts with a higher TB burden (26). The cluster 4 showed higher IFN-γ values in TB1 and TB2 compared to cluster 3, which can therefore be said to have a higher TB burden. Therefore, such clustering could help in targeting individuals with LTBI who are at a higher risk of progression to active TB and could also help in solving the dilemma in clinical practice of targeting preventive treatments to groups which are at a higher risk of progression to active TB. The WHO recommends 6–9 months of daily isoniazid preventive therapy (IPT) or a 3-month of weekly rifapentine plus isoniazid (3HP) or a 3-month regimen of daily isoniazid plus rifampicin for the preventive treatment of LTBI (27). The clinical implications of identifying a transcript signature could be towards mass screening of contacts in endemic populations to identify LTBI for targeted preventive therapy. CORTIS trial by Scriba et al. showed the utility of RISK11 signature to predict progression of incident tuberculosis in exposed populations; however, the 3HP based therapy in RISK11 positive group was not successful in preventing the disease based on population (28). In contrast to RISK11, the gene signature screened in this study are for identifying LTBI, and thus the implication could be preventive therapy to contacts having LTBI rather than specifically targeting high risk individuals.
Model classification was used to assess the diagnostic performance of the exact combination of genes and indicated the combination of FCGR1B, GBP1, and GBP5 genes to have maximum predictive power in differentiating the LTBI cohort from the uninfected cohort. Further, ROC analysis of expression levels of these combinations confirmed the potential of the 3 gene expression signature. Few limitations of the study are low sample size, which may explain the low accuracy of the transcript signature and absence of an independent, external validation data set. This was not a longitudinal study and could not develop any correlate with transcriptomic biomarker and disease progression. Regardless, we showed the discriminative power of the transcript signature in our population, which was specific for Indian population and hence a different geographical and ethnic cohort when compared with other biomarker studies performed in the past. Therefore, further studies utilizing a larger sample size are required to identify differentially expressing genes to discriminate LTBI from uninfected individuals. Additional studies are needed to be carried out to examine the exposed but uninfected group of HHCs that are immunologically and genetically unique.
In conclusion, this study is not an unbiased analysis of gene expression but rather the validation of previously published genes associated with TB infection in other populations. This was performed using the published signatures to screen a highly exposed Indian population with a view to assess the potential of the targeted transcriptomic markers and their correlation with the IGRA results. Since these gene signatures may not necessarily be generalizable to geographically, epidemiologically and ethnically diverse populations, therefore identifying a distinct transcript signature to diagnose both LTBI and predict disease progression, which can be used globally, remains challenging.
MATERIALS AND METHODS
Ethics statement.
The study protocols were approved by institutional ethics committee and informed consents were obtained from all the study participants.
Study participants and sample collection.
A total of 80 HHCs of bacteriologically confirmed active pulmonary TB patients (diagnosed either as drug susceptible TB [Cat I], drug susceptible-relapse [Cat II] or Drug resistant TB [Cat IV] cases as per National Guidelines at Lok Nayak Chest Clinic, National TB Elimination Program-New Delhi, Government of India), who were in close contact of more than or equal to 8 h/day for at least 3 months, with the respective index TB patient after onset of the infection, were recruited for the study (Fig. 1). Sample size was estimated based on statistical power of 0.8, medium effect size of 0.6 and significance criterion α = 0.05 using the R package pwr (29, 30). The medium effect size of 0.6 was considered in the case of HHCs as they were at a higher probability of having LTBI than the general population (31). Further, all the HHCs were screened for TB-disease using chest radiography (CXR) and GeneXpert MTB Rif testing (Cepheid, USA), and only those found to be negative for TB-disease, and having no previous TB-history, were recruited for the study. The exclusion criteria for the recruited HHCs was that they should not have any history of diabetes, smoking and alcohol use. 4 mL of venous blood was collected from the subjects in Lithium Heparin tubes (367886, BD, UK) for IGRA analysis and 500 μL was collected in K2E tubes for RNA extraction (367836, BD, UK).
Interferon gamma release assay (IGRA) (QuantiFERON-TB Gold Plus [QFT-Plus]).
IGRA was performed using QFT-Plus kit (Qiagen, Germany) for all the HHCs as per manufacturer’s protocol (622180, Qiagen, Germany) and the test results were interpreted using QuantiFERON-TB Gold Plus Analysis Software.
RNA extraction and cDNA synthesis.
Total RNA was extracted from 200 μL whole blood using a RNeasy mini spin column (74104, Qiagen, Germany) following manufacturer’s instructions. An amount of 100 ng of RNA was used for each cDNA synthesis reaction using reverse High-Capacity cDNA RT-kit (4368814, ThermoFisher SCIENTIFIC, USA) following manufacturer’s instructions.
Quantitative Real time-PCR (qRT-PCR).
Relative transcript levels of 6-genes (TNFRSF10C, ASUN, NEMF, FCGR1B, GBP1, and GBP5) were measured by qRT-PCR, which was performed in triplicates. Primer sequences for each gene are listed in Table S1 (File 01) in the supplemental material. The relative amount of expressed RNA was calculated by comparing it with the expression of the housekeeping gene GAPDH (Glyceraldehyde-3-Phosphate Dehydrogenase) using the 2−ΔΔCT (Livak) method.
Machine Learning based approaches.
(i) Cluster analysis. Classifying the study cohort into distinct clusters was done based on 10 parameters including expression data of the six selected genes, immunological parameters from IGRA assays (TB1-TB2 values, IGRA positive/negative) and epidemiological data like age and gender. Two-step cluster analysis (SPSS 26.0, IBM Corporation, USA) was performed for clustering where both categorical and continuous variables were used as input parameters. The analysis involved two stages which resulted in a large number of clusters; filtered down to the best numbers using Schwarz's Bayesian Information Criterion. Briefly, in the first stage a cluster feature tree was constructed and the cases were grouped into these preclusters followed by implementation of the hierarchical clustering algorithm on the preclusters.
In a preliminary evaluation using cluster analysis, three main approaches were used for defining the study cohort. The first approach involved grouping the participants based on demographic details and sought to describe age and gender differences among the groups. In the second approach, expression data of the six genes along with immunological data, were used to cluster the participants in different groups followed by the third approach where all the data (demographic, expression and immunological) were used. Of the total 73 clusters, only those clusters in which the input parameters indicated significant differences, were included.
(ii) Classification model for IGRA class-prediction. WEKA (32) was used to perform a class prediction study to predict whether the participant was IGRA positive/IGRA negative based on the expression data of the six prioritized genes, age and gender. The training set was used to evaluate the performance of 120 algorithms from eight main classifiers (bayes, functions, lazy, meta, mi, misc, rules, trees) available in WEKA (v3.8.4). Various combinations of different input parameters were evaluated and the algorithm with the best performance in the leave-one-out cross-validation (LOOCV) was selected and a classification model was built to differentiate between the IGRA positive (43) and IGRA negative (37) individuals. Due to limited number of participants, it was not possible to prepare separate training and validation data sets. Therefore, the LOOCV technique was used to utilize the available information optimally. In the LOOCV technique, the models are trained and validated so that each record is used for training and internal testing. The LOOCV technique has widely been used to solve several classification problems where data is scarce (33, 34). Further, all the feature selection techniques (WEKA) were used to identify the most significant and discriminatory input parameters leading to the best classification model.
Statistical validation.
Differences between the IGRA positive and IGRA negative cohorts were evaluated by Mann-Whitney U-test in SPSS 26.0 (IBM Corporation, USA). The diagnostic ability of the selected genes to discriminate between the study groups was evaluated by performing receiver operating characteristic (ROC) analysis for the genes individually and the overall accuracy was assessed by area under curve (AUC) values in easyROC web-tool (35). Binary logistic regression analysis was then used to examine combinations of genes followed by ROC analysis to validate the model.
ACKNOWLEDGMENTS
S.K., V.N., S.B.: methodology, investigation, data curation, writing- original draft of manuscript, visualization; S.D., S.R., S.A., S.M., K.R., P.M., D.G.: formal analysis of data, writing- review and editing of final manuscript; V.K., S.L., A.K.: clinical investigation, sample collection and resources acquisition; A.M.: conceptualization, methodology, data curation or formal analysis, resources, manuscript writing, supervision, project administration and funding acquisition. All authors reviewed and approved final manuscript.
The research work is supported by project grant GCE-India/RI/2017/001 under Grand Challenges Explorations from B&M Gates foundation and BIRAC (Biotechnology Industry Research Assistance Council, Department of Biotechnology, Govt. of India) and the Flagship Project grant (BT/IC-06/003/91) from Department of Biotechnology, Govt. of India.
Footnotes
Supplemental material is available online only.
Contributor Information
Asif Mohmmed, Email: amohd@icgeb.res.in.
Gyanu Lamichhane, Johns Hopkins University School of Medicine.
REFERENCES
- 1.Getahun H, Matteelli A, Abubakar I, Aziz MA, Baddeley A, Barreira D, Den Boon S, Borroto Gutierrez SM, Bruchfeld J, Burhan E, Cavalcante S, Cedillos R, Chaisson R, Chee CB-E, Chesire L, Corbett E, Dara M, Denholm J, de Vries G, Falzon D, Ford N, Gale-Rowe M, Gilpin C, Girardi E, Go U-Y, Govindasamy D, D Grant A, Grzemska M, Harris R, Horsburgh jr CR, Ismayilov A, Jaramillo E, Kik S, Kranzer K, Lienhardt C, LoBue P, Lönnroth K, Marks G, Menzies D, Migliori GB, Mosca D, Mukadi YD, Mwinga A, Nelson L, Nishikiori N, Oordt-Speets A, Rangaka MX, Reis A, Rotz L, Sandgren A, et al. 2015. Management of latent Mycobacterium tuberculosis infection: WHO guidelines for low tuberculosis burden countries. Eur Respir J 46:1563–1576. doi: 10.1183/13993003.01245-2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Cohen A, Mathiasen VD, Schön T, Wejse C. 2019. The global prevalence of latent tuberculosis: a systematic review and meta-analysis. Eur Respir J 54:1900655. doi: 10.1183/13993003.00655-2019. [DOI] [PubMed] [Google Scholar]
- 3.Jacobsen M, Repsilber D, Gutschmidt A, Neher A, Feldmann K, Mollenkopf HJ, Ziegler A, Kaufmann SHE. 2007. Candidate biomarkers for discrimination between infection and disease caused by Mycobacterium tuberculosis. J Mol Med (Berl) 85:613–621. doi: 10.1007/s00109-007-0157-6. [DOI] [PubMed] [Google Scholar]
- 4.Berry MPR, Graham CM, McNab FW, Xu Z, Bloch SAA, Oni T, Wilkinson KA, Banchereau R, Skinner J, Wilkinson RJ, Quinn C, Blankenship D, Dhawan R, Cush JJ, Mejias A, Ramilo O, Kon OM, Pascual V, Banchereau J, Chaussabel D, O'Garra A. 2010. An interferon-inducible neutrophil-driven blood transcriptional signature in human tuberculosis. Nature 466:973–977. doi: 10.1038/nature09247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Constantoulakis P, Filiou E, Rovina N, Chras G, Hamhougia A, Karabela S, Sotiriou A, Roussos C, Poulakis N. 2010. In vivo expression of innate immunity markers in patients with Mycobacterium tuberculosis infection. BMC Infect Dis 10:243. doi: 10.1186/1471-2334-10-243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Aldenderfer MS, Blashfield RK. 1984. Cluster analysis. SAGE Publications, Inc., Thousand Oaks, CA. doi: 10.4135/9781412983648. [DOI] [Google Scholar]
- 7.Eisen MB, Spellman PT, Brown PO, Botstein D. 1998. Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA 95:14863–14868. doi: 10.1073/pnas.95.25.14863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Everitt BS. 2005. An R and S-PLUS companion to multivariate analysis. Springer, NY. [Google Scholar]
- 9.Clatworthy J, Buick D, Hankins M, Weinman J, Horne R. 2005. The use and reporting of cluster analysis in health psychology: a review. Br J Health Psychol 10:329–358. doi: 10.1348/135910705X25697. [DOI] [PubMed] [Google Scholar]
- 10.Wang S, He L, Wu J, Zhou Z, Gao Y, Chen J, Shao L, Zhang Y, Zhang W. 2019. Transcriptional profiling of human peripheral blood mononuclear cells identifies diagnostic biomarkers that distinguish active and latent tuberculosis. Front Immunol 10:2948. doi: 10.3389/fimmu.2019.02948. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lee SW, Wu LS, Huang GM, Huang KY, Lee TY, Weng JT. 2016. Gene expression profiling identifies candidate biomarkers for active and latent tuberculosis. BMC Bioinformatics 17(Suppl 1):3. doi: 10.1186/s12859-015-0848-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Maertzdorf J, Repsilber D, Parida SK, Stanley K, Roberts T, Black G, Walzl G, Kaufmann SHE. 2011. Human gene expression profiles of susceptibility and resistance in tuberculosis. Genes Immun 12:15–22. doi: 10.1038/gene.2010.51. [DOI] [PubMed] [Google Scholar]
- 13.Zak DE, Penn-Nicholson A, Scriba TJ, Thompson E, Suliman S, Amon LM, Mahomed H, Erasmus M, Whatney W, Hussey GD, Abrahams D, Kafaar F, Hawkridge T, Verver S, Hughes EJ, Ota M, Sutherland J, Howe R, Dockrell HM, Boom WH, Thiel B, Ottenhoff THM, Mayanja-Kizza H, Crampin AC, Downing K, Hatherill M, Valvo J, Shankar S, Parida SK, Kaufmann SHE, Walzl G, Aderem A, Hanekom WA. 2016. A blood RNA signature for tuberculosis disease risk: a prospective cohort study. Lancet 387:2312–2322. doi: 10.1016/S0140-6736(15)01316-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Suliman S, Thompson EG, Sutherland J, Weiner J, Ota MOC, Shankar S, Penn-Nicholson A, Thiel B, Erasmus M, Maertzdorf J, Duffy FJ, Hill PC, Hughes EJ, Stanley K, Downing K, Fisher ML, Valvo J, Parida SK, van der Spuy G, Tromp G, Adetifa IMO, Donkor S, Howe R, Mayanja-Kizza H, Boom WH, Dockrell HM, Ottenhoff THM, Hatherill M, Aderem A, Hanekom WA, Scriba TJ, Kaufmann SHE, Zak DE, Walzl G, GC6-74 Cohort Study Team, The ACS cohort study team. 2018. Four-gene pan-African blood signature predicts progression to tuberculosis. Am J Respir Crit Care Med 197:1198–1208. doi: 10.1164/rccm.201711-2340OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Drain PK, Bajema KL, Dowdy D, Dheda K, Naidoo K, Schumacher SG, Ma S, Meermeier E, Lewinsohn DM, Sherman DR. 2018. Incipient and subclinical tuberculosis: A clinical review of early stages and progression of infection. Clin Microbiol Rev 31. doi: 10.1128/CMR.00021-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Penn-Nicholson A, Mbandi SK, Thompson E, Mendelsohn SC, Suliman S, Chegou NN, Malherbe ST, Darboe F, Erasmus M, Hanekom WA, Bilek N, Fisher M, Kaufmann SHE, Winter J, Murphy M, Wood R, Morrow C, Van Rhijn I, Moody B, Murray M, Andrade BB, Sterling TR, Sutherland J, Naidoo K, Padayatchi N, Walzl G, Hatherill M, Zak D, Scriba TJ, CAPRISA IMPRESS team. 2020. RISK6, a 6-gene transcriptomic signature of TB disease risk, diagnosis and treatment response. Sci Rep 10:8629. doi: 10.1038/s41598-020-65043-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Laux da Costa L, Delcroix M, Dalla Costa ER, Prestes IV, Milano M, Francis SS, Unis G, Silva DR, Riley LW, Rossetti MLR. 2015. A real-time PCR signature to discriminate between tuberculosis and other pulmonary diseases. Tuberculosis (Edinb) 95:421–425. doi: 10.1016/j.tube.2015.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Gliddon HD, Kaforou M, Alikian M, Habgood-Coote D, Zhou C, Oni T, et al. 2019. Identification of reduced host transcriptomic signatures for tuberculosis and digital PCR-based validation and quantification. bioRxiv 583674. doi: 10.1101/583674. [DOI] [PMC free article] [PubMed]
- 19.Roe J, Venturini C, Gupta RK, Gurry C, Chain BM, Sun Y, Southern J, Jackson C, Lipman MC, Miller RF, Martineau AR, Abubakar I, Noursadeghi M. 2020. Blood transcriptomic stratification of short-term risk in contacts of tuberculosis. Clin Infect Dis 70:731–737. doi: 10.1093/cid/ciz252. [DOI] [PubMed] [Google Scholar]
- 20.Satproedprai N, Wichukchinda N, Suphankong S, Inunchot W, Kuntima T, Kumpeerasart S, Wattanapokayakit S, Nedsuwan S, Yanai H, Higuchi K, Harada N, Mahasirimongkol S. 2015. Diagnostic value of blood gene expression signatures in active tuberculosis in Thais: a pilot study. Genes Immun 16:253–260. doi: 10.1038/gene.2015.4. [DOI] [PubMed] [Google Scholar]
- 21.Kim BH, Shenoy AR, Kumar P, Bradfield CJ, MacMicking JD. 2012. IFN-inducible GTPases in host cell defense. Cell Host Microbe 12:432–444. doi: 10.1016/j.chom.2012.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Huber A, Killy B, Grummel N, Bodendorfer B, Paul S, Wiesmann V, Naschberger E, Zimmer J, Wirtz S, Schleicher U, Vera J, Ekici AB, Dalpke A, Lang R. 2020. Mycobacterial cord factor reprograms the macrophage response to IFN-γ towards enhanced inflammation yet impaired antigen presentation and expression of GBP1. J Immunol 205:1580–1592. doi: 10.4049/jimmunol.2000337. [DOI] [PubMed] [Google Scholar]
- 23.Olive AJ, Smith CM, Baer CE, Coers J, Sassetti CM. 2020. Mycobacterium tuberculosis evasion of Guanylate Binding Protein-mediated host defense in mice requires the ESX1 secretion system. bioRxiv. doi: 10.1101/2020.07.27.223362. [DOI] [PMC free article] [PubMed]
- 24.Moon H-W, Gaur RL, Tien SS-H, Spangler M, Pai M, Banaei N. 2017. Evaluation of QuantiFERON-TB gold-plus in health care workers in a low-incidence setting. J Clin Microbiol 55:1650–1657. doi: 10.1128/JCM.02498-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Petruccioli E, Chiacchio T, Vanini V, Cuzzi G, Codecasa LR, Ferrarese M, Schininà V, Palmieri F, Ippolito G, Goletti D. 2018. Effect of therapy on Quantiferon-plus response in patients with active and latent tuberculosis infection. Sci Rep 8:15626. doi: 10.1038/s41598-018-33825-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Barcellini L, Borroni E, Brown J, Brunetti E, Campisi D, Castellotti PF, Codecasa LR, Cugnata F, Di Serio C, Ferrarese M, Goletti D, Lipman M, Rancoita PMV, Russo G, Tadolini M, Vanino E, Cirillo DM. 2016. First evaluation of QuantiFERON-TB Gold Plus performance in contact screening. Eur Respir J 48:1411–1419. doi: 10.1183/13993003.00510-2016. [DOI] [PubMed] [Google Scholar]
- 27.WHO Guidelines Approved by the Guidelines Review Committee. 2020. WHO consolidated guidelines on tuberculosis: tuberculosis preventive treatment: Module 1: prevention. World Health Organization, Geneva, Switzerland. [PubMed] [Google Scholar]
- 28.Scriba TJ, Fiore-Gartland A, Penn-Nicholson A, Mulenga H, Kimbung Mbandi S, Borate B, Mendelsohn SC, Hadley K, Hikuam C, Kaskar M, Musvosvi M, Bilek N, Self S, Sumner T, White RG, Erasmus M, Jaxa L, Raphela R, Innes C, Brumskine W, Hiemstra A, Malherbe ST, Hassan-Moosa R, Tameris M, Walzl G, Naidoo K, Churchyard G, Hatherill M, Baepanye K, Baepanye T, Clarke K, Collignon M, Dlamini A, Eyre C, Feni T, Fikizolo M, Galane P, Goliath T, Gangat A, Malefo-Grootboom S, Janse van Rensburg E, Janse van Rensburg B, Kekana S, Zietsman M, Kock A, Kunene I, Lakhi A, Langa N, Ledwaba H, Luphoko M, et al. 2021. Biomarker-guided tuberculosis preventive therapy (CORTIS): A randomised controlled trial. Lancet Infectious Diseases 21:354–365. doi: 10.1016/S1473-3099(20)30914-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Cohen J. 1988. Statistical Power Analysis for the Behavioral Sciences. 2nd ed Routledge, NY. [Google Scholar]
- 30.Champely S, Ekstrom C, Dalgaard P, Gill J, Weibelzahl S, Anandkumar A, Ford C, Volcic R, De Rosario H. 2017. pwr: basic functions for power analysis. https://cran.r-project.org/web/packages/pwr/index.html.
- 31.Saha S, Kumar A, Saurabh K, Shankar SH, Kashyap A, Nischal N, Biswas A, Wig N. 2020. Current status of treatment of latent tuberculosis infection in India. Int J Mol Sci 71:54–59. doi: 10.25259/IJMS_18_2019. [DOI] [Google Scholar]
- 32.Frank E, Hall MA, Witten IH. 2016. The WEKA workbench. Morgan Kaufmann, Burlington, MA, USA. [Google Scholar]
- 33.Mete M, Sakoglu U, Spence JS, Devous MD, Sr, Harris TS, Adinoff B. 2016. Successful classification of cocaine dependence using brain imaging: a generalizable machine learning approach. BMC Bioinformatics 17:357. doi: 10.1186/s12859-016-1218-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Jiang M, Mieronkoski R, Syrjälä E, Anzanpour A, Terävä V, Rahmani AM, Salanterä S, Aantaa R, Hagelberg N, Liljeberg P. 2019. Acute pain intensity monitoring with the classification of multiple physiological parameters. J Clin Monit Comput 33:493–507. doi: 10.1007/s10877-018-0174-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Goksuluk D, Korkmaz S, Zararsiz G, Karaagaoglu AE. 2016. easyROC: An interactive web-tool for ROC curve analysis using R language environment. R J 8:213. doi: 10.32614/RJ-2016-042. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.