Abstract
Background
Approximately 5-10% of persons infected with M. tuberculosis develop tuberculosis, but the factors associated with disease progression are incompletely understood. Both linkage and association studies have identified human genetic variants associated with susceptibility to pulmonary tuberculosis, but few genetic studies have evaluated extrapulmonary disease. Because extrapulmonary and pulmonary tuberculosis likely have different underlying pathophysiology, identification of genetic mutations associated with extrapulmonary disease is important.
Findings
We performed a pilot genome-wide association study among 24 persons with previous extrapulmonary tuberculosis and well-characterized immune defects; 24 pulmonary tuberculosis patients and 57 patients with M. tuberculosis infection served as controls. The Affymetrix GeneChip Human Mapping Xba Array was used for genotyping; after careful quality control, genotypes at 44,175 single nucleotide polymorphisms (SNPs) were available for analysis. Eigenstrat quantified population stratification within our sample; logistic regression, using results of the Eigenstrat analysis as a covariate, identified significant associations between groups. Permutation testing controlled the family-wise error rate for each comparison between groups. Four SNPs were significantly associated with extrapulmonary tuberculosis compared to controls with M. tuberculosis infection; one (rs4893980) in the gene PDE11A, one (rs10488286) in KCND2, and one (rs2026414) in PCDH15; one was in chromosome 7 but not associated with a known gene. Two additional variants were significantly associated with extrapulmonary tuberculosis compared with pulmonary tuberculosis; one (rs340708) in the gene FAM135B and one in chromosome 13 but not associated with a known gene. The function of all four genes affects cell signaling and activity, including in the brain.
Conclusions
In this pilot study, we identified 6 novel variants not previously known to be associated with extrapulmonary tuberculosis, including two SNPs more common in persons with extrapulmonary than pulmonary tuberculosis. This provides some support for the hypothesis that the pathogenesis and genetic predisposition to extrapulmonary tuberculosis differs from pulmonary tuberculosis. Further study of these novel SNPs, and more well-powered genome-wide studies of extrapulmonary tuberculosis, is warranted.
Introduction
Although approximately one-third of the world's population is infected with M. tuberculosis, [1] only 5-10% of infected individuals develop tuberculosis [2]. The primary site of disease is usually the lungs, but extrapulmonary sites may also be affected. Population-based surveys have found that 20% of tuberculosis cases are extrapulmonary [3]. With a global tuberculosis prevalence of 11.1 million cases,[4] this suggests a global prevalence of extrapulmonary tuberculosis of 2.2 million cases. Several factors are known to be associated with disease progression, including HIV infection, diabetes mellitus and malnutrition [5]. There is also evidence that some persons may have a genetic predisposition to tuberculosis [6-12]. Both linkage and association studies have identified candidate genetic variants and regions of the genome associated with tuberculosis risk [13-16], but they have focused primarily on pulmonary disease. Because the pathophysiology of pulmonary and extrapulmonary disease appears to differ [17,18], and because extrapulmonary disease in particular may be associated with an underlying immune defect [19-21], it is important to assess for genetic variants specifically associated with extrapulmonary tuberculosis.
Mutations in P2X7, [22] vitamin D receptor, [23] interleukin (IL) 1 - β, [24] LTA,[25] IL-10 [25], and IL-10 together with IFN-γ,[26-29] have been associated with extrapulmonary disease in previous candidate gene studies, and support the hypothesis of a genetic predisposition to extrapulmonary tuberculosis. We have previously confirmed the association between polymorphisms in the genes for vitamin D receptor and IL 1 - β and extrapulmonary disease, and reported a novel association between a polymorphism in the Toll-like receptor 2 gene with extrapulmonary tuberculosis, among black patients from the United States [30]. In this previous study, a candidate gene approach was utilized in which single nucleotide polymorphisms (SNPs) previously associated with tuberculosis as well as SNPs in candidate genes involved in tuberculosis pathogenesis were assessed. As a consequence, that study had limited potential to uncover novel genomic regions that play a role in the etiology of extrapulmonary tuberculosis. Rapid and affordable SNP genotyping across the entire genome is now possible, allowing for genome mapping to uncover novel associations not possible with a candidate gene approach. We are unaware of previous genome-wide association studies of extrapulmonary tuberculosis. We have performed a pilot genome-wide association study among HIV-seronegative persons in which the immunologic defects associated with extrapulmonary tuberculosis have been well-characterized, including low unstimulated cytokine production and low CD4+ lymphocyte levels [20,21,30].
Methods
Study participants
The study population was a pooled sample from two previous immunologic studies and the inclusion criteria for both studies have been described in detail previously [20,21,30]. Patients were selected through the Nashville Metropolitan Health Department Tuberculosis Clinic and the Baltimore City Health Department Eastern Chest Clinic, and all participants provided written informed consent. Extrapulmonary disease was defined as disease at any site outside of the pulmonary parenchyma. Eligibility criteria for extrapulmonary tuberculosis cases included: a history of completely treated culture-confirmed extrapulmonary tuberculosis, age ≥ 18 years, and HIV-seronegative status. Exclusion criteria for extrapulmonary tuberculosis cases included serum creatinine > 2 mg/dL, use of corticosteroids or other immunosuppressive agents at the time of diagnosis or time of study entry, malignancy, or diabetes mellitus. The inclusion criteria for pulmonary tuberculosis controls included a history of completely treated culture-confirmed pulmonary tuberculosis, age ≥ 18 years, no evidence of extrapulmonary tuberculosis, and HIV-seronegative status. Controls with latent M. tuberculosis infection were at least 18 years old, HIV-seronegative, and had a positive tuberculin skin test (purified protein derivative positive (PPD +), defined as ≥ 10 mm induration after intradermal placement of 5 tuberculin units of PPD) and without evidence of active tuberculosis. Participants in this control group were U.S.-born (and therefore not vaccinated with BCG), and were mostly close contacts of tuberculosis cases. The controls were sampled from the same two clinic populations as the cases, and were not related to the cases. Exclusion criteria for both control groups were the same as for cases.
Laboratory Techniques
DNA was extracted from blood samples using the Puregene DNA Isolation kit (Gentra Systems, Minneapolis, MN) as per the manufacturer's protocol. Genomic DNA was stored at -70°C until genotyping. Laboratory personnel were blinded to the case-control status of the specimens.
Genome-Wide DNA genotyping
All DNA samples underwent SNP genotyping using the GeneChip® Human Mapping 50 K Xba Array (Affymetrix, Inc., Santa Clara, CA). The manufacturer's protocol was followed for all procedures. Briefly, a complexity reduction process was performed where genomic DNA (250 ng) was digested with XbaI, ligated to XbaI adaptor (Affymetrix), and amplified by polymerase chain reaction (PCR) using primers specific to the ligated adaptor. Cycling conditions were an initial denaturation of 94°C for 3 minutes followed by 30 cycles of 94°C for 30 seconds, 60°C for 45 seconds, and 68°C for 60 seconds. A final extension of 68°C for 7 minutes concluded the reactions. PCR products were assayed by gel electrophoresis, purified, fragmented to < 250 bp using dilute DNaseI (Affymetrix), biotin end-labeled with terminal deoxynucleotidyl transferase, and hybridized to the 50 K Xba Array at 48°C for 16 hours at 60 rpm. The hybridized arrays were washed and stained on Fluidics Station 450 and scanned with the GeneChip Scanner according to the manufacturer's settings (Affymetrix). The arrays were analyzed with software GDAS version 3.0.2 (Affymetrix), which provides rank scores for the probability of particular genotypes at SNP loci. The scores were AA or BB for homozygous alleles and AB for heterozygous alleles, and confidence scores showed the accuracy of the genotype call. Standard procedures and default analysis parameters for individual DNA samples were employed. An internal control run in parallel did not detect any DNA contamination. All procedures were performed using the same lots of reagents. Laboratory personnel were blinded to the case-control status of the specimen.
Quality control
To prevent inclusion of genotyping errors in our association analysis, a 3-step filtering process was followed, which included: 1) removal of SNPs with < 90% genotyping efficiency (such that less than 10% of data were missing across all individuals in the dataset); 2) removal of individuals with genotyping efficiency < 90% (such that less than 10% of data were missing across all SNPs for each individual; 3) removal of SNPs out of Hardy-Weinberg equilibrium in PPD + controls (according to the results of a Fisher's Exact test) with p < 0.05 after Bonferroni correction for the total number of SNPs evaluated. PLINK version 1.04 [31] was used for quality control analysis and data processing. Of the 58,494 SNPs genotyped on the chip, 13,946 SNPs were removed for low genotyping rate (less than 90% of the participants could be genotyped for those SNPs). Each SNP was tested for deviation from Hardy-Weinberg Equilibrium using Fisher's exact tests in PPD + controls only. SNPs with p < 0.05 (after a Bonferroni correction for the number of tests performed) were removed (a total of 373), leaving 44,175 SNPs for statistical analysis. One PPD + control had < 90% genotyping efficiency and was removed from the analysis, leaving 104 individuals from the original 105.
Statistical Analysis
The three patient groups (extrapulmonary tuberculosis, pulmonary tuberculosis, PPD +) underwent three comparisons for analysis: 1) extrapulmonary tuberculosis vs. PPD + controls to identify SNPs that are predictors of extrapulmonary disease, 2) extrapulmonary vs. pulmonary tuberculosis to identify differences between these two forms of disease, and 3) both extrapulmonary + pulmonary tuberculosis vs. PPD + controls to identify SNPs associated with active tuberculosis in general.
Clinical and demographic characteristics were compared among the three patient groups using the Kruskal-Wallis test for continuous variables and the chi-squared or Fisher exact tests for categorical variables, as appropriate based on distributional assumptions.
Population stratification, which refers to allele frequency differences between cases and controls due to ancestry, is an important concern in genetic association tests, as it can cause spurious associations [32-34]. To address this problem in our diverse patient population the EIGENSTRAT method, which uses principal component analysis (PCA), was used to quantify and correct for population sub-structure and adjust for population stratification in association analyses [35,36]. The EIGENSTRAT utility of the EIGENSOFT package version 2.0 was used for our analysis [36]. The method infers axes of variation to reduce the dimensionality of the data while describing as much variability as possible. We used EIGENSTRAT to evaluate the top ten Eigen vectors of the covariance matrix between the samples [36]. Figure 1 shows the first two axes of variation from the PCA analysis. The results revealed two major clusters within our population and a third small cluster defined by a third axis of variation. These three groups (in order) were consistent with the self-reported racial groups of our sample (black, white, and Asian). These three clusters were used as categorical covariates in the genetic association analysis.
PLINK version 1.04 [31] was used for association analyses. Logistic regression using the cluster definitions from the EIGENSTRAT analysis as a covariate in each regression model, was used to evaluate potential associations for each SNP, for each of the three comparisons described above (using a genotypic encoding for the SNP variables). Dummy encoding was used for each SNP variable, such that no genetic model assumptions were made in the analysis. WGA Viewer [37] was used for the graphical representation of our data as well as for the selection of the SNPs with the strongest evidence for association. Annotation of SNPs was then done using the Ensemble database release 59 [38].
A major concern in genetic association studies in general, and genome-wide association studies in particular, are false positive findings due to multiple comparisons [39,40]. When evaluating thousands of potential associations, at a nominal alpha rate of 0.05, 5% of the associations may be statistically significant by chance alone. To decrease the false positive rate of the current study, permutation testing of all 44,175 SNPs as a set [41] was used to derive empirical cut-offs that corresponded to a family-wise (experiment wise) error rate of 0.05 for each comparison. This less stringent empirical p-value was used instead of the more widely used genome-wide significance threshold of 5 × 10-8 due to the small sample size of our study. Permutation testing was implemented in PLINK, and SNPs in each comparison with raw p-values below the empirical p-value cut-off derived from permutation testing corresponding to a family-wise error rate of 0.05 for each of the three comparisons were considered statistically significant. To check whether the association signals that pass this significance threshold are due to linkage disequilibrium (LD), r2 scores for SNPs on the same chromosome and/or gene were also calculated. The overall level of heterozygosity (Fst) in our sample was also calculated using GENEPOP version 1.2 [42].
Results
There were 44,175 SNPs and 104 individuals for analysis after applying our quality control filter: 24 extrapulmonary tuberculosis cases, 24 pulmonary tuberculosis controls, and 56 PPD + controls. The sites of extrapulmonary disease have been reported previously [30]; only 1 had central nervous system (miliary/meningeal) disease. The clinical and demographic characteristics of the 104 patients in the study population are in Table 1. There were no statistically significant differences by age, sex, or race between cases and controls.
Table 1.
Characteristic | Extrapulmonary TB (n = 24) | Pulmonary TB (n = 24) | PPD + (n = 56) | P-valuea |
---|---|---|---|---|
Median age at study entry in years (IQR) |
48.4 (43.6-79.4) |
43.6 (40.3-53.9) |
46 (39.3-53.5) |
0.19 |
# Male Sex (%) | 16 (67) | 14 (58) | 35 (63) | 0.87 |
# Black Race (%) | 18 (75) | 19 (79) | 48 (86) | 0.16 |
# White Race (%) | 4 (17) | 4 (17) | 8 (14) | 0.27 |
# Asian Race (%) | 2 (9) | 1 (4) | 0 (0) | 0.22 |
IQR: inter-quartile range
Figures 2, 3 and 4 show the distribution of SNPs and the -log(p) values for each SNP from the logistic regression analysis for the three comparisons (extrapulmonary tuberculosis vs PPD +, extrapulmonary tuberculosis vs pulmonary tuberculosis and any tuberculosis vs PPD +, respectively), as well as the coverage achieved using the Affymetrix 50 k array. The empirical P-value cutoff from permutation testing was 0.0009. Figures 5, 6 and 7 show Q-Q plots demonstrating the expected versus observed -2log(p-values) for each comparison: extrapulmonary tuberculosis vs PPD +, extrapulmonary tuberculosis vs pulmonary tuberculosis and any tuberculosis vs PPD +, respectively. These Q-Q plots show the observed versus expected p-values under the null hypothesis. A large number of p-values above the diagonal line would indicate strong signal within the data (especially at the top right portion of the plot), whereas a large number of p-values below the diagonal line indicate low statistical power. Deviation of the observed p-values from the diagonal across the entirety of the plot (not just at the top right) indicate possible technical concerns with the study. Our results demonstrate that there are not major technical concerns, but statistical power is low. Figure 6 demonstrates slight deviation from the diagonal, but generally all comparisons show overall good quality. These findings should be kept in mind when interpreting the results of this pilot study. The level of genetic variation between the two types of active disease (pulmonary and extrapulmonary tuberculosis) and PPD + controls was calculated using EIGENSTRAT [35] and was found to be 0.001. This demonstrates a low level of variability between those with prior tuberculosis disease and the PPD + group, indicating little differentiation among the groups, after accounting for racial differences.
Extrapulmonary tuberculosis vs. PPD + controls
There were four statistically significant SNPs between these two patient groups, three of which are in known genes: PDE11A, KCND2 and PCDH15 (Table 2). Two of these SNPs (rs10487416 and rs10488286) were on the same chromosome (chromosome 7) and the LD scores for these two SNPs on chromosome 7 are in Table 3.
Table 2.
SNP | Rank | P-value | MAF in ETB | MAF in PPD + | Odds Ratio | Chromosome | Gene | Closest Gene | Distance to Gene |
---|---|---|---|---|---|---|---|---|---|
rs4893980 | 1 | 0.0007 | 0.1087 | 0.4902 | 0.1268 | 2 | PDE11A | --- | --- |
rs10487416 | 2 | 0.0007 | 0.25 | 0.0566 | 5.556 | 7 | unknown | unknown | unknown |
rs10488286 | 3 | 0.0009 | 0.2381 | 0.02727 | 11.15 | 7 | KCND2 | --- | --- |
rs2026414 | 4 | 0.0009 | 0.5208 | 0.2589 | 3.111 | 10 | PCDH15 | --- | --- |
Empirical P - value cutoff from permutation testing = 0.0009
The p - value reported is the raw value.
MAF = Minor Allele Frequency
ETB = Extrapulmonary tuberculosis
PPD + = Purified protein derivative positive
Table 3.
chromosome | Comparison | SNP1 | SNP2 | D' | LOD | r2 |
---|---|---|---|---|---|---|
7 | ETB vs PPD + | rs10487416 | rs10488286 | 0.406 | 3.08 | 0.148 |
4 | Any TB vs PPD + | rs2107005 | rs509423 | 0.345 | 0.95 | 0.043 |
PPD + = Purified protein derivative positive
ETB = Extrapulmonary tuberculosis
Any TB = Pulmonary or extrapulmonary tuberculosis
Extrapulmonary tuberculosis vs. pulmonary tuberculosis
There were two SNPs that were significantly different between these two groups (Table 4), and the most significant one (lowest p-value) was in the gene FAM135B. The two SNPs were on different chromosomes so no LD calculations were made for this subgroup.
Table 4.
SNP | Rank | P-value | MAF in ETB | MAF in PTB | Odds Ratio | Chromosome | Gene |
---|---|---|---|---|---|---|---|
rs340708 | 1 | 0.0008 | 0.4375 | 0.125 | 5.444 | 8 | FAM135B |
rs1886870 | 2 | 0.0009 | 0.6667 | 0.25 | 6 | 13 | RP11-141M1 |
Empirical P-value cutoff from permutation testing = 0.0009
The p-value reported is the raw value.
MAF = Minor Allele Frequency
ETB = Extrapulmonary tuberculosis
PTB = Pulmonary tuberculosis
Any tuberculosis vs. PPD + controls
Six SNPs were significantly different in this comparison (Table 5), of which five (rs1989565, rs10490266, rs2107005, rs509423, rs968079) were in known genes: MARK3, AC007317.1, RP11-556L14.2, RP11-440N4 respectively, and SNP rs1518350 was not in any know gene but was closest to the gene SLC35F1. Two SNPs (rs2107005 and rs509423) were on the same chromosome (chromosome 4), and their LD scores are in Table 3.
Table 5.
SNP | Rank | P-value | MAF in any TB | MAF in PPD + | Odds Ratio | Chromosome | Gene | Closest Gene | Distance to Gene |
---|---|---|---|---|---|---|---|---|---|
rs1989565 | 1 | 0.0004 | 0.1702 | 0.4074 | 0.2984 | 14 | MARK3 | --- | --- |
rs10490266 | 2 | 0.0005 | 0.125 | 0.3661 | 0.2474 | 2 | AC007317.1 | ||
rs1518350 | 3 | 0.0006 | 0.4688 | 0.2143 | 3.235 | 6 | --- | SLC35F1 | 29,253bp |
rs2107005 | 4 | 0.0008 | 0.2917 | 0.5804 | 0.2977 | 4 | RP11-556L14.2 | --- | --- |
rs509423 | 5 | 0.0008 | 0.1064 | 0.3039 | 0.2727 | 4 | RP11-440N4 | --- | --- |
rs968079 | 6 | 0.0009 | 0.587 | 0.3163 | 3.071 | 9 | PTPRD | --- | --- |
Empirical P - value cutoff from permutation testing = 0.0009
The p - value reported is the raw value
MAF = Minor Allele Frequency
TB = Pulmonary or extrapulmonary tuberculosis
PPD + = Purified protein derivative positive
Discussion
The results of this pilot study reveal 6 SNPs that were significantly associated with extrapulmonary tuberculosis: four SNPs distinguished between extrapulmonary tuberculosis and M. tuberculosis infection and two distinguished between extrapulmonary and pulmonary tuberculosis. There were an additional 6 SNPs that distinguished between active tuberculosis of any site (pulmonary or extrapulmonary) and M. tuberculosis infection. None of the SNPs identified have previously been reported to be associated with tuberculosis. This is to be expected since almost all of the SNPs included in the Affymetrix 50K Xba chip have not previously been reported to be associated with tuberculosis risk, nor in genes known to be important in tuberculosis pathogenesis. Most of the SNPs were found in regions with newly discovered genes of which very little is known, as such, this study should be viewed as exploratory and a way to uncover potentially novel biology. The SNPs identified should be evaluated in subsequent studies of tuberculosis genetics, and their immunologic correlate should be assessed.
The analysis of persons with extrapulmonary tuberculosis vs. PPD + controls (Table 2) identified 4 SNPs that were significantly associated with extrapulmonary disease. Three of the SNPs were in known genes (PDE11A, KCND2, PCDH15). While much is known about the function of these genes themselves, how they may potentially play a role in tuberculosis pathogenesis is unclear. None of these genes have previously been reported to be associated with tuberculosis. PDE11A encodes a member of the PDE superfamily, which plays a role in cAMP and cGMP function as second messengers in a wide variety of signal transduction pathways. Mutations in this gene are a cause of Cushing disease and adrenocortical hyperplasia [43]. The gene also appears to play a significant role in regulating brain function. KCND2 encodes a member of the family of voltage-gated potassium channels (shal-related sub-family) which is prominent in the repolarization phase of the action potential [44]. The diverse functions of these channels include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. PCDH15 is a member of the cadherin superfamily. Family members encode integral membrane proteins that mediate calcium-dependent cell-cell adhesion. It plays an essential role in maintenance of normal retinal and cochlear function. Mutations in this gene result in hearing loss and Usher Syndrome Type IF [45].
It has been previously suggested that the pathogenesis of pulmonary tuberculosis may differ from that of extrapulmonary tuberculosis. While this study was under-powered to fully evaluate differences in genetic predisposition, the results from the extrapulmonary tuberculosis vs. pulmonary tuberculosis (Table 4) analysis provides some evidence in support of this hypothesis.
Two SNPs were significantly associated with extrapulmonary vs. pulmonary tuberculosis, one of which was in a known gene FAM135B (family with sequence similarity 135, member B). The gene FAM135B located on chromosome 8 and is a protein coding gene that is expressed in several tissues/organs such as the brain and heart, but has not previously been linked to either form of tuberculosis. The other significant polymorphism (rs1886870) for this comparison is in a newly annotated gene RP11-141M1 of which not much is currently known.
The level of linkage disequilibrium between significant SNPs found on the same gene and those on the same chromosome was calculated: the low correlation coefficient scores (r2) suggest that the SNPs are independent of each other (Table 3).
The biological relevance of the SNPs identified is difficult to determine because there is very little known about these SNPs. Most were in unknown regions of the genome and those that were found within gene regions have not been associated with either extrapulmonary or pulmonary tuberculosis. Additional study is warranted to assess the possible biological relevance.
Our findings from the third comparison (both forms of tuberculosis vs PPD + controls) identified 6 SNPs associated with tuberculosis risk (Table 5). One of these SNPs (rs1989565) was in the MARK3 (microtubule affinity-regulating kinase 3) gene, which is a protein coding gene that might play a role in the regulation of cell polarity [46]. This gene has not previously been linked with tuberculosis risk. Another SNP (rs968079) was found to be in the gene PTPRD (protein-tyrosine phosphatase receptor-type delta) which has been associated with asthma[47], Restless leg syndrome[48] and some cancers as a tumor suppressor[49]. Three SNPs (rs2107005, rs509423, rs10490266) were found in three newly annotated genes and not much is known about them. The final significant SNP in this comparison was not found in any gene but was closest to the gene SLC35F1(Solute Carrier Family 35, Member F1), which is mainly expressed in the brain[50], but not much else is known about its function.
Though our sample size was small we controlled for population stratification to reduce the occurrence of spurious results; such an approach has also been shown to increase power [51]. Additionally, we used permutation testing to control for false positive findings. While the current findings would not hold up to a stringent Bonferroni correction for genome-wide significance, the permutation testing does control the family-wise error rate for these tests.
It is clear that the major limitation of this study was the sample size, which leaves us under-powered to detect small effects as shown by our Q-Q plots (Figures 5, 6 and 7), and therefore only very large effect sizes could be detected. Although we did not have the power to detect potentially important genetic predictors of tuberculosis risk, the SNPs identified were by necessity, strong predictors. They should also be interpreted within the context of the immunologic deficits noted in these patients with previous extrapulmonary tuberculosis [20,21]. Our results should be considered hypothesis generating, as opposed to hypothesis testing, and should be evaluated in subsequent studies of extrapulmonary tuberculosis with substantially increased sample sizes.
A second limitation was the limited SNP coverage of the Affymetrix 50 k array. In particular, large regions in chromosomes 1, 9 and 16 were not covered. Newer arrays, with a greater number of SNPs and more extensive coverage, would potentially identify even more novel SNP associations. However, the above issues of sample size, statistical power, and multiple comparisons would take on even greater importance.
Conclusions
Novel genetic factors were associated with susceptibility to extrapulmonary tuberculosis, and these factors may act through previously unknown pathophysiologic mechanisms. The genes identified play a role in cellular signaling, activity and polarity, with some appearing to play an important role in the brain. Only one of the patients with extrapulmonary disease had central nervous system involvement. Future large-scale studies are warranted in which persons with extrapulmonary tuberculosis and appropriate controls have both genetic and immunologic features characterized. Additional studies of the biologic function of the SNPs identified here would also provide additional insight into the pathophyisology of extrapulmonary tuberculosis.
Abbreviations
M. tuberculosis: Mycobacterium tuberculosis; TB: tuberculosis; ETB: extrapulmonary tuberculosis; PTB: pulmonary tuberculosis; PPD: purified protein derivative; SNP: single nucleotide polymorphism; MAF: minor allele frequency;
Competing interests
None of the authors have a commercial or other association that might pose a conflict of interest related to this work.
Authors' contributions
NOO led the analysis and writing of the manuscript. AAMR contributed to the analysis and writing of the manuscript, PRZA contributed to the design of the study, preparation of the samples to be tested for polymorphisms and also to the methods section of the manuscript. SL oversaw the performance of the genotyping. SMH contributed to the design of the study. TRS conceived and designed the study and made significant contributions to the writing of the manuscript. The final manuscript was read and approved by all authors.
Contributor Information
Noffisat O Oki, Email: onoki@ncsu.edu.
Alison A Motsinger-Reif, Email: alison.motsinger@gmail.com.
Paulo RZ Antas, Email: pzquim@ioc.fiocruz.br.
Shawn Levy, Email: shawn.levy@vanderbilt.edu.
Steven M Holland, Email: smh@nih.gov.
Timothy R Sterling, Email: timothy.sterling@vanderbilt.edu.
Acknowledgements
The authors thank the Vanderbilt Center for Human Genetics Research for advice regarding study design (Marylyn D. Ritchie, Ph.D.). All array-based genotyping was performed in the Vanderbilt Microarray Shared Resource (John Mote). The Vanderbilt Microarray Shared Resource was supported by the Vanderbilt Ingram Cancer Center (P30 CA68485), the Vanderbilt Digestive Disease Center (P30 DK58404), and the Vanderbilt Vision Center (P30 EY08126).
This work was supported by the National Institutes of Allergy and Infectious Diseases (K23 AI01654, K24 AI065298, and the intramural program), the CAPES Foundation (Brazil), the Johns Hopkins Hospital General Clinical Research Center (M01-RR00052 from the National Center for Research Resources, National Institutes of Health), and the Vanderbilt University Department of Medicine.
References
- Raviglione MC, Snider DE, Kochi A. Global Epidemiology of Tuberculosis Morbidity and Mortality of a worldwide epidemic. JAMA. 1995;273:220–226. doi: 10.1001/jama.273.3.220. [DOI] [PubMed] [Google Scholar]
- Horsburgh CR. Priorities for the treatment of latent tuberculosis in the United States. NEJM. 2004;350(3):2060–2067. doi: 10.1056/NEJMsa031667. [DOI] [PubMed] [Google Scholar]
- Centers for Disease Control and Prevention. Decrease in Reported Tuberculosis Cases - United States, 2009. MMWR. 2010;59:289–94. [PubMed] [Google Scholar]
- World Health Organisation. Global tuberculosis control: a short update to the 2009 report. WHO/HTM/TB/2009.426.
- American Thoracic Society. Targeted tuberculin testing and treatment of latent tuberculosis infection. Am J Respir Crit Care Med. 2000;161(4 Suppl):221–247. doi: 10.1164/ajrccm.161.supplement_3.ats600. [DOI] [PubMed] [Google Scholar]
- Stein CM, Guwatudde D, Nakakeeto M, Peters P, Elston RC, Tiwari HK, Mugerwa R, Whalen CC. Heritability analysis of cytokines as intermediate phenotypes of tuberculosis. J Infect Dis. 2003;187(11):1679–85. doi: 10.1086/375249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stein CM, Nshuti L, Chiunda AB, Boom WH, Elston RC, Mugerwa RD, Iyengar SK, Whalen CC. Evidence for a major gene influence on tumor necrosis factor-alpha expression in tuberculosis: path and segregation analysis. Hum Hered. 2005;60(2):109–18. doi: 10.1159/000088913. [DOI] [PubMed] [Google Scholar]
- Berrington WR, Hawn TR. Mycobacterium tuberculosis, macrophages, and the innate immune response: does common variation matter? Immunol Rev. 2007;219:1671–186. doi: 10.1111/j.1600-065X.2007.00545.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bellamy R, Beyers N, McAdam KP, Ruwende C, Gie R, Samaai P, Bester D, Meyer M, Corrah T, Collin M, Camidge DR, Wilkinson D, Helden EH, Whittle HC, Amos W, Helden P, Hill AVS. Genetic susceptibility to tuberculosis in Africans: a genome-wide scan. Proc Natl Acad Sci USA. 2000;97:8005–8009. doi: 10.1073/pnas.140201897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fitness J, Floyd S, Warndorff DK, Sichali L, Malema S, Crampin AC, Fine PE, Hill AV. Large-scale candidate gene study of tuberculosis susceptibility in the Karonga district of northern Malawi. Am J Trop Med Hyg. 2004;71(3):330–340. [PubMed] [Google Scholar]
- Miller EN, Jamieson SE, Joberty C, Fakiola M, Hudson D, Peacock C S, Cordell H J, Shaw M-A, Lins-Lainson Z, Shaw J J, Ramos F, Silveira F, Blackwell J M. Genome-wide scans for leprosy and tuberculosis susceptibility genes in Brazilians. Genes and Immunity. 2004;5:63–67. doi: 10.1038/sj.gene.6364031. [DOI] [PubMed] [Google Scholar]
- Comstock GW. Tuberculosis in twins: a re-analysis of the Prophit survey. Am Rev Resp Dis. 1978;117:621–624. doi: 10.1164/arrd.1978.117.4.621. [DOI] [PubMed] [Google Scholar]
- Bellamy R, Ruwende C, Corrah T, McAdam KP, Whittle HC, Hill AV. Assessment of the interleukin 1 gene cluster and other candidate gene polymorphisms in host susceptibility to tuberculosis. Tuber Lung Dis. 1998;79:83–89. doi: 10.1054/tuld.1998.0009. [DOI] [PubMed] [Google Scholar]
- Bellamy R, Ruwende C, Corrah T, McAdam KPWJ, Whittle HC, Hill AVS. Variations in the NRAMP1 gene and susceptibility to tuberculosis in West Africans. N Engl J Med. 1998;338:640–644. doi: 10.1056/NEJM199803053381002. [DOI] [PubMed] [Google Scholar]
- Awomoyi AA, Marchant A, Howson JM, McAdam KP, Blackwell JM, Newport MJ. Interleukin-10, polymorphism in SLC11A1 (formerly NRAMP1), and susceptibility to tuberculosis. J Infect Dis. 2002;186:1808–1814. doi: 10.1086/345920. [DOI] [PubMed] [Google Scholar]
- Bellamy R, Ruwende C, Corrah T, McAdam KP, Thursz M, Whittle HC, Hill AV. Tuberculosis and chronic hepatitis B virus infection in Africans and variation in the vitamin D receptor gene. J Infect Dis. 1999;179:721–724. doi: 10.1086/314614. [DOI] [PubMed] [Google Scholar]
- Hasan Z, Zaidi I, Jamil B, Khan MA, Kanji A, Hussain R. Elevated ex viv monocyte chemotactic protein-1 (CCL2) in pulmonary as compared to extra-pulmonary tuberculosis. BMC Immunol. 2005;6:14. doi: 10.1186/1471-2172-6-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang Z, Kong Y, Wilson F, Foxman B, Fowler AH, Marrs CF, Cave MD, Bates JH. Identification of risk factors for extrapulmonary tuberculosis. Clin Infect Dis. 2004;38(2):199–205. doi: 10.1086/380644. [DOI] [PubMed] [Google Scholar]
- Gonzalez OY, Adams G, Teeter LD, Bui TT, Musser JM, Graviss EA. Extra-pulmonary manifestations in a large metropolitan area with a low incidence of tuberculosis. Int J Tuberc Lung Dis. 2003;7:1178–1185. [PubMed] [Google Scholar]
- Antas PRZ, Ding L, Hackman J, Reeves-Hammock L, Shintani AK, Schiffer J, Holland SM, Sterling TR. Decreased CD4+ lymphocytes and innate immune responses in adults with previous extrapulmonary tuberculosis. J Allerg Clin Immunol. 2006;117:916–923. doi: 10.1016/j.jaci.2006.01.042. [DOI] [PubMed] [Google Scholar]
- Sterling TR, Dorman SE, Chaisson RE, Ding L, Hackman J, Moore K, Holland SM. HIV- seronegative adults with extrapulmonary tuberculosis have abnormal innate immune responses. Clin Infect Dis. 2001;33:976–982. doi: 10.1086/322670. [DOI] [PubMed] [Google Scholar]
- Fernando SL, Saunders BM, Sluyter R, Skarratt Kk, Goldberg H, Marks GB, Wiley JS, Britton WJ. A polymorphism in the P2X7 Gene Increases Susceptibility to extrapulmonary tuberculosis. Am J Respir Crit Care Med. 2006;175:360–366. doi: 10.1164/rccm.200607-970OC. [DOI] [PubMed] [Google Scholar]
- Wilkinson RJ, Llewelyn M, Toossi Z, Patel P, Pasvol G, Lalvani A, Wright D, Latif M, Davidson RN. Influence of vitamin D deficiency and vitamin D receptor polymorphisms on tuberculosis among Gujarati asians in west London: a case-control study. Lancet. 2000;355(9204):618–621. doi: 10.1016/S0140-6736(99)02301-6. [DOI] [PubMed] [Google Scholar]
- Wilkinson RJ, Patel P, Llewelyn M, Hirsch CS, Pasvol G, Snounou G, Davidson RN, Toossi Z. Influence of polymorphisim in the genes for the interleukin (IL)-1 Receptor antagonist an IL-1β on tuberculosis. J Exp Med. 1999;189(12):1863–1874. doi: 10.1084/jem.189.12.1863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Taype CA, Shamsuzzaman S, Accinelli RA, Espinoza JR, Shaw M. Genetic susceptibility to different clinical forms of tuberculosis in the peruvian population. Infection Genetics and Evolution. 2010;5(10(4)):495–504. doi: 10.1016/j.meegid.2010.02.011. [DOI] [PubMed] [Google Scholar]
- Tso HW, Ip WK, Chong WP, Tam CM, Chiang AK, Lau YL. Association of interferon gamma and interleukin 10 genes with tuberculosis in Hong Kong Chinese. Genes Immun. 2005;6(4):358–63. doi: 10.1038/sj.gene.6364189. [DOI] [PubMed] [Google Scholar]
- Mosaad OE, Soliman ZE, Tawhid D, Sherif DM. Interferon-gamma +874 T/A and Interleukin-10 -1082 A/G Single nucleotide Polymorphism in Egyptian Children with Tuberculosis. Scandinavian J of Immun. 2010;72(4):358–364. doi: 10.1111/j.1365-3083.2010.02426.x. [DOI] [PubMed] [Google Scholar]
- Ansari A, Talat N, Jamil B, Hasan Z, Razzaki T, Dawood G, Hussain R. Cytokine gene polymorphisms across tuberculosis clinical spectrum in Pakistani patients. PLoS One. 2009;4(3):e4778. doi: 10.1371/journal.pone.0004778. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henao MI, Montes C, París SC, García LF. Cytokine gene polymorphisms in Colombian patients with different clinical presentations of tuberculosis. Tuberculosis (Edinb) 2006;86(1):11–9. doi: 10.1016/j.tube.2005.03.001. [DOI] [PubMed] [Google Scholar]
- Motsinger-Reif AA, Antas PRZ, Oki NO, Levy S, Holland SM, Sterling TR. Polymorphisms in IL-1, Vitamin D Receptor Fok1, and Toll-like Receptor 2 are Associated with Extrapulmonary Tuberculosis. BMC Med Genet. 2010;2:11–37. doi: 10.1186/1471-2350-11-37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Purcell s, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, DeBakker PI, Daly MJ, Sham PC. PLINK, a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marchini J, Cardon LR, Phillips MS, Donnelly P. The effects of human population structure on large genetic association studies. Nat Genet. 2004;36(5):512–7. doi: 10.1038/ng1337. [DOI] [PubMed] [Google Scholar]
- Freedman M, Reich D, Penney K, J McDonald GJ, Mignault AA, Patterson N, Gabriel SB, Topol EJ, Smoller JW, Pato CN, Pato MT, Petryshen TL, Kolonel LN, Lander ES, Sklar P, Henderson B, Hirschhorn JN, Altshuler D. Assessing the impact of population stratification on genetic association studies. Nat Genet. 2004;36:388–393. doi: 10.1038/ng1333. [DOI] [PubMed] [Google Scholar]
- Helgason A, Yngvadóttir B, Hrafnkelsson B, Gulcher J, Stefánsson K. An Icelandic example of the impact of population structure on association studies. Nat Genet. 2005;37:90–95. doi: 10.1038/ng1492. [DOI] [PubMed] [Google Scholar]
- Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006;38:904–909. doi: 10.1038/ng1847. [DOI] [PubMed] [Google Scholar]
- Patterson N, Price AL, Reich D. Population Structure and Eigenanalysis. PLoS Genet. 2006;2(12):e190. doi: 10.1371/journal.pgen.0020190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ge D, Zhang K, Need AC, Martin O, Fellay J, Urban TJ, Telenti A, Golsteing DB. WGAViewer: software for genomic annotation of whole genome asociation studies. Genome Res. 2008;18:640–643. doi: 10.1101/gr.071571.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hubbard TJP, Aken BL, Ayling S, Ballester B, Beal K, Bragin E, Brent S, Chen Y, Clapham P, Clarke L, Coates G, Fairley S, Fitzgerald S, Fernandez-Banet J, Gordon L, Gräf S, Haider S, Hammond M, Holland R, Howe K, Jenkinson A, Johnson N, Kähäri A, Keefe D, Keenan S, Kinsella R, Kokocinski F, Kulesha E, Lawson D, Longden I, Megy K, Meidl P, Overduin B, Parker A, Pritchard B, Rios D, Schuster M, Slater G, Smedley D, Spooner W, Spudich G, Trevanion S, Vilella A, Vogel J, White S, Wilder S, Zadissa A, Birney E, Cunningham F, Curwen V, Durbin R, Fernandez-Suarez XM, Herrero J, Kasprzyk A, Proctor G, Smith J, Searle S, Flicek P. Ensembl 2009. Nucleic Acids Research. 2009. pp. D690–D697. [DOI] [PMC free article] [PubMed]
- Ioannidis JP, Ntzani EE, Trikalinos TA, Contopoulos-Ioannidis DG. Replication validity of genetic association studies. Nat Genet. 2002;29(3):306–309. doi: 10.1038/ng749. [DOI] [PubMed] [Google Scholar]
- Wang TH, Wang HS. A genome-wide association study primer for clinicians. Taiwan J Obstet Gynecol. 2009;48(2):89–95. doi: 10.1016/S1028-4559(09)60265-5. [DOI] [PubMed] [Google Scholar]
- Good P. Extensions of the concept of exchangeability and their applications. J Modern Appl Statist Methods. 2002;1:4. [Google Scholar]
- Raymond M, Rousset F. Genepop (version1.2): population genetics software for exact tests and ecumenicism. J Hered. 1995;86(3):248. [Google Scholar]
- Kelly MO, Logue SF, Brennan J, Day JP, Lakkaraju S, Jiang L, Zhong X, Tam M, Sukoff Rizzo SJ, Platt BJ, Dwyer JM, Neal S, Pulito VL, Agostino MJ, Grauer SM, Navarra RL, Kelley C, Comery TA, Murrills RJ, Houslay MD, Brandon NJ. Phosphodiesterase 11A in brain is enriched in ventral hippocampus and deletion causes psychiatric disease-related phenotypes. Proc Natl Acad Sci. 2010;107(18):8457–62. doi: 10.1073/pnas.1000730107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- National Center for Biotechnology Information (NCBI) Entrez Gene. http://www.ncbi.nlm.nih.gov/sites/entrez?db=gene
- Alagramam KN, Yuan H, Kuehn MH, Murcia CL, Wayne S, Srisailpathy CRS, Lowry RB, Knaus R, Van LL, Bernier FP, Schwartz S, Lee C, Morton CC, Mullins RF, Ramesh A, Van Camp G, Hagemen GS, Woychik RP, Smith RJH. Mutations in the novel protocadherin PCDH15 cause usher syndrome type 1F. Human Molecular Genetics. 2001 August 01;10(16):1709–18. doi: 10.1093/hmg/10.16.1709. [DOI] [PubMed] [Google Scholar]
- Tochio N, Koshiba S, Kobayashi N, Inoue M, Takashi MA, Seki E, Matsuda T, Tomo Y, Motoda Y, Kobayashi A, Tanaka A, Hayashizaki Y, Terada T, Shirouzu M, Kigawa T, Yokoyama S. Solution structure of the kinase-associated domain 1 of mouse microtubule- associated protein/microtubule affinity-regulating kinase 3. Protein Sci. 2006 November;15(11):2534–2543. doi: 10.1110/ps.062391106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shyur S, Wang J, Lin CG, Hsiao Y, Liou Y, Wu Y, Wu LS. The polymorphisms of protein-tyrosine phosphatase receptor-type delta gene and its association with pediatric asthma in the taiwanese population. Eur J Hum Genet. 2008;16(10):1283–8. doi: 10.1038/ejhg.2008.79. [DOI] [PubMed] [Google Scholar]
- Schormair B, Kemlink D, Roeske D, Eckstein G, Xiong L, Lichtner P, Ripke S, Trenkwalder C, Zimprich A, Stiasny-Kolster K, Oertel W, Bachmann CG, Paulus W, Hogl B, Frauscher B, Gschliesser V, Poewe W, Peglau I, Vodicka P, Vavrova J, Sonka K, Nevsimalova S, Montplaisir J, Turecki G, Rouleau G, Gieger C, Illig T, Wichmann H, Holsboer F, Muller-Myhsok B, Meitinger T, Winkelmann J. PTPRD (protein tyrosine phosphatase receptor type delta) is associated with restless legs syndrome. Nat Genet. 2008;40(8):946–8. doi: 10.1038/ng.190. [DOI] [PubMed] [Google Scholar]
- Veeriah S, Brennan C, Meng S, Singh B, Fagin JA, Solit DB, Paty PB, Rohle D, Vivanco I, Chmielecki J, Pao W, Ladanyi M, Gerald WL, Liau L, Cloughesy TC, Mischel PS, Sander C, Taylor B, Schultz N, Major J, Heguy A, Fang F, Mellinghoff IK, Chan TA. The tyrosine phosphatase PTPRD is a tumor suppressor that is frequently inactivated and mutated in glioblastoma and other human cancers. Proceedings of the National Academy of Sciences. 2009 June 09;106(23):9435–40. doi: 10.1073/pnas.0900571106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nishimura M, Suzuki S, Satoh T, Naito S. Tissue-Specific mRNA Expression Profiles of Human Solute Carrier 35 Transporters. Drug Metabolism and Pharmacokinetics. 2009;24(1):91–9. doi: 10.2133/dmpk.24.91. [DOI] [PubMed] [Google Scholar]
- Shi YY, Zhao XZ, Yu L, Tao R, Tang J, La Y, Duan Y, Gao B, Gu N, Xu Y, Feng G, Zhu S, Liu H, Salter H, He L. Genetic Structure Adds Power to Detect Schizophrenia Susceptibility at SLIT3 in the Chinese Han Population. Genome Res. 2004;14(7):1345–1349. doi: 10.1101/gr.1758204. [DOI] [PMC free article] [PubMed] [Google Scholar]