PLoS ONE. 2010 Apr 13;5(4):e10142. doi: 10.1371/journal.pone.0010142

The Effect of Single Nucleotide Polymorphisms from Genome Wide Association Studies in Multiple Sclerosis on Gene Expression

Adam E Handel, Lahiru Handunnetthi, Antonio J Berlanga, Corey T Watson, Julia M Morahan, Sreeram V Ramagopalan
Editor: Syed A Aziz
PMCID: PMC2854120  PMID: 20405052

Abstract

Background

Multiple sclerosis (MS) is a complex neurological disorder whose aetiology involves both environmental and genetic factors. Recent genome-wide association studies have identified a number of single nucleotide polymorphisms (SNPs) associated with susceptibility to MS. We investigated whether these genetic variants were associated with altered gene expression.

Methods/Principal Findings

We used a database of mRNA expression and genetic variation derived from immortalised peripheral lymphocytes to investigate polymorphisms associated with MS for correlation with gene expression. Several SNPs were found to be associated with changes in expression: in particular two with HLA-DQA1, HLA-DQA2, HLA-DQB1, HLA-DRB1, HLA-DRB4 and HLA-DRB5, one with ZFP57, one with CD58, two with IL7 and FAM164A, and one with FAM119B, TSFM and KUB3. We found minimal cross-over with a recent whole genome expression study in MS patients.

Discussion

We have shown that many susceptibility loci in MS are associated with changes in gene expression using an unbiased expression database. Several of these findings suggest novel gene candidates underlying the effects of MS-associated genetic variation.

Introduction

Multiple sclerosis (MS) is an inflammatory disease of the central nervous system characterised by demyelination and axonal loss.[1] Studies conducted in mono- and dizygotic twin pairs and siblings have shown that genetics plays a role in MS susceptibility.[2] Linkage was effective in identifying the locus exerting the single strongest genetic effect in MS, namely the human leukocyte antigen (HLA) class II region.[3] The risk associated with this region has since been shown to be determined by epistatic interactions between different HLA alleles,[4] and is thought to be responsible for approximately 50% of the genetic risk of MS.[5] Beyond this powerful determinant of MS genetic susceptibility, research has taken considerably longer to bear fruit. After the genotyping of hundreds of thousands of single nucleotide polymorphisms (SNPs) in many thousands of MS patients and controls, we are finally beginning to establish a network of loci outside the HLA region involved in determining MS susceptibility.[6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17] Even the most strongly MS-associated of these loci is still a considerably weaker determinant of MS susceptibility than the HLA alleles. For some of these loci, functional studies have been undertaken.[11], [18], [19] However, such studies are rarely carried out in an unbiased manner, since they generally correlate genetic variation with the expression of a candidate gene. A recent study of mRNA levels in MS patients and healthy controls showed a large number of differentially expressed genes; however, it is uncertain to what extent this reflects the aetiology of disease as opposed to the disease process or adaptive biological pathways.[20]

A recent investigation performed whole genome expression analysis in lymphoblastoid cell lines (LCLs) from healthy volunteers who were also genotyped for a large number of SNPs.[21] We used the data from this study to examine the effects of current MS susceptibility loci on gene expression.

Methods

Gene expression analysis

Gene expression analysis was carried out as described by Dixon et al.[21] Briefly, peripheral lymphocytes were transformed using Epstein-Barr virus before being cultured, pelleted and frozen for storage. cDNA templates were created using the One-Cycle cDNA Synthesis Kit (Affymetrix). In vitro transcription of cDNA was performed using the IVT Labeling Kit (Affymetrix) and, after hybridisation on U133 Plus 2.0 GeneChips (Affymetrix), the chips were scanned using a high-resolution scanner (Affymetrix). Whole-genome genotyping was carried out according to manufacturers' instructions using the Sentrix Human-1 Genotyping BeadChip and the HumanHap300 Genotyping BeadChip. The analysis of expression was carried out on the publicly available database of mRNA by SNP Browser 1.0 as described.[21]

mRNA by SNP analysis

We investigated the mRNAs whose expression was significantly altered by the SNPs reported in the literature to be at or close to genome-wide significance.[6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17] If the susceptibility SNP was not available on the database, we used the SNP in strongest linkage disequilibrium (LD) with the susceptibility SNP, based on r², as provided by SNP Browser 1.0. For SNPs where no proxy was provided, we searched all genotyped SNPs within 500 kb for a suitable proxy SNP in LD with r²≥0.4. We also assessed the degree of LD with potentially interesting SNPs within 500 kb of the original susceptibility SNP. Finally, we assessed the SNPs associated with expression of putative candidate genes to ensure that we did not miss any important associations with expression.
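The proxy-selection rule described above can be illustrated with a minimal Python sketch. It is not the procedure implemented in SNP Browser 1.0: the `Snp` record, the `pick_proxy` function, the `r2_lookup` dictionary and every identifier and position in the usage example are hypothetical names introduced here for illustration. The sketch assumes that pairwise r² values are already available, and it collapses the two steps (strongest-LD proxy, then the search within 500 kb at r²≥0.4) into a single pass.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, Iterable, Optional, Tuple


@dataclass(frozen=True)
class Snp:
    """Hypothetical SNP record: identifier, chromosome, base-pair position."""
    rsid: str
    chrom: str
    pos: int


def pick_proxy(
    target: Snp,
    genotyped: Iterable[Snp],
    r2_lookup: Dict[FrozenSet[str], float],
    window_bp: int = 500_000,   # 500 kb search window, as in the text
    min_r2: float = 0.4,        # minimum LD threshold, as in the text
) -> Optional[Tuple[str, float]]:
    """Return (rsid, r2) of the SNP to analyse, or None if no proxy qualifies."""
    genotyped = list(genotyped)
    # If the susceptibility SNP itself is on the array, no proxy is needed.
    if any(s.rsid == target.rsid for s in genotyped):
        return target.rsid, 1.0

    best: Optional[Tuple[str, float]] = None
    for cand in genotyped:
        # Only consider SNPs on the same chromosome within the window.
        if cand.chrom != target.chrom or abs(cand.pos - target.pos) > window_bp:
            continue
        r2 = r2_lookup.get(frozenset((target.rsid, cand.rsid)))
        if r2 is None or r2 < min_r2:
            continue
        # Keep the candidate in strongest LD with the target.
        if best is None or r2 > best[1]:
            best = (cand.rsid, r2)
    return best


# Usage with made-up identifiers, positions and r2 values (assumptions only).
target = Snp("rsTARGET", "1", 1_000_000)
array_snps = [
    Snp("rsA", "1", 1_200_000),
    Snp("rsB", "1", 1_400_000),
    Snp("rsC", "1", 1_600_000),  # outside the 500 kb window
    Snp("rsD", "2", 1_000_100),  # different chromosome
]
ld = {
    frozenset(("rsTARGET", "rsA")): 0.45,
    frozenset(("rsTARGET", "rsB")): 0.90,
    frozenset(("rsTARGET", "rsC")): 0.99,
    frozenset(("rsTARGET", "rsD")): 1.00,
}
print(pick_proxy(target, array_snps, ld))  # ('rsB', 0.9)
```

A target with no candidate reaching the threshold returns `None`, which corresponds to the "No proxy available" entries in Table 1.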

Results

SNP selection

We examined a set of 38 SNPs, the top loci to reach genome-wide significance in currently reported genome-wide association studies (GWAS), of which 14 had been independently replicated in two studies. Seventeen of these were not present in the genome-wide association mRNA expression library, so where possible proxy SNPs in strong-to-moderate LD were used instead. The SNPs and proxy SNPs used are detailed in Table 1.

Table 1. SNPs and proxy SNPs analysed.

SNP | Putative gene association | Proxy SNP | r²
rs1054283 | IL7 | - | -
rs10876994 | METTL1, CYP27B1, CDK4 | rs10083154 | 0.70067
rs1132200 | TMEM39A | - | -
rs11554159 | IFI30 | rs874628 | 0.949
rs11808092 | EVI5-RPL5 | - | -
rs11865121 | CLEC16A | rs2041670 | 1
rs12122721 | KIF21B | - | -
rs12368653 | METTL1, CYP27B1, CDK4 | - | -
rs12708716 | CLEC16A | rs725613 | 1
rs12722489 | IL2RA | rs12722561 | 0.9349
rs1335532 | CD58 | rs6677309 | 1
rs1569723 | CD40 | - | -
rs17445836 | IRF8 | - | -
rs17824933 | CD6 | rs2237997 | 0.43718
rs1800693 | TNFRSF1A | - | -
rs2051322 | CD226 | - | -
rs2104286 | IL2RA | rs12722561 | 0.548
rs2300747 | CD58 | rs6677309 | 1
rs2523393 | HLA class I | rs2394160 | 0.96552
rs2587156 | IL7 | - | -
rs3129860 | HLA class II | rs9271366 | 0.95557
rs3129934 | C6orf10, BTNL2, NOTCH4 | rs9267992 | 0.95604
rs3135388 | HLA class II | rs9271366 | 0.95699
rs34536443 | TYK2 | No proxy available | -
rs4149584 | TNFRSF1A | No proxy available | -
rs441349 | SOCS1 | rs1646042 | 1
rs6074022 | CD40 | - | -
rs6131010 | CD40 | rs6074022 | 0.70337
rs6604026 | EVI5-RPL5 | - | -
rs6860438 | C7 | - | -
rs6897932 | IL7R | - | -
rs703842 | METTL1, CYP27B1, CDK4 | - | -
rs7404554 | CLEC16A | - | -
rs744166 | STAT3 | - | -
rs763361 | CD226 | - | -
rs8118449 | TYK2 | No proxy available | -
rs9271366 | HLA class II | - | -
rs9523762 | GPC5 | - | -

mRNA expression

Thirteen of the MS susceptibility SNPs or proxy SNPs were associated with changes in mRNA expression (Table S1). Two SNPs in strong LD with multiple MS-associated SNPs in the HLA region were related to expression of various HLA alleles, including HLA-DQA1, HLA-DQA2, HLA-DQB1, HLA-DRB1, HLA-DRB4 and HLA-DRB5. One SNP in the HLA class I region was associated with altered expression of ZFP57. Both SNPs in CD58 were associated with expression of CD58. A SNP in the IL7 region was associated with expression of mRNA encoding IL7 and FAM164A. Three SNPs in the region of METTL1-CYP27B1-CDK4 altered the expression of several genes: FAM119B, TSFM and KUB3. The only gene whose expression was altered by all three SNPs was TSFM.

Overlap with previous mRNA expression studies

We used the supplemental data supplied by Gandhi and colleagues to examine cross-over between the results obtained in that study and the genes we identified as being altered in expression by susceptibility SNPs.[20] Only three genes were in common between the two sets: HLA-DQB1, HLA-DRB1 and STAT3. HLA-DRB1 was upregulated in MS, relapsing-remitting MS (RRMS) and secondary progressive MS compared with healthy controls. HLA-DQB1 expression was reduced in MS and RRMS compared with healthy controls. STAT3 was reduced in primary progressive MS compared with healthy controls.
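The comparison described above amounts to intersecting two gene lists. A minimal Python sketch follows, assuming both lists are available as collections of gene symbols; the function name `gene_overlap` and the placeholder symbols in the usage example are hypothetical and do not reproduce either study's data.

```python
from typing import Iterable, List


def gene_overlap(snp_linked: Iterable[str], study_genes: Iterable[str]) -> List[str]:
    """Return the sorted gene symbols present in both lists.

    Symbols are stripped and upper-cased so that differences in case or
    stray whitespace between the two sources do not hide a shared gene.
    """
    def normalise(genes: Iterable[str]) -> set:
        return {g.strip().upper() for g in genes if g.strip()}

    return sorted(normalise(snp_linked) & normalise(study_genes))


# Placeholder symbols only; these are not the genes reported in either study.
ours = ["GENE1", "gene2 ", "GENE3"]
theirs = ["GENE2", "GENE3", "GENE9"]
print(gene_overlap(ours, theirs))  # ['GENE2', 'GENE3']
```

Exact matching on symbols is the simplest choice; in practice, gene aliases would need to be reconciled before intersecting.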

Discussion

Our findings show that some, but by no means all, susceptibility SNPs in MS are associated with changes in gene expression. Some of these (CD58) had already been noted by previous investigators.[11] We were unable to find supporting evidence in this dataset for the previously reported allelic effect of the susceptibility SNP in IL7R on expression of the gene.[18] Similarly, SNPs in the IL2RA gene did not correlate with expression of IL2RA mRNA, despite a previous finding of altered levels of this in MS patients relative to controls.[19]

We also found several novel effects of susceptibility SNPs. Two SNPs in tight LD with susceptibility SNPs in the HLA region correlated with expression of several HLA class II mRNAs. However, measuring gene expression in the HLA region is a complex task. Some genes (HLA-DRB4 and HLA-DRB5) are haplotype-specific, so we cannot be sure whether differential expression of HLA genes measured by microarray reflects differing probe affinity for different HLA class II alleles; further work is needed to fully understand this association. Our finding that a SNP in the HLA class I region was associated with altered expression of ZFP57 is an interesting observation, as this gene has been linked with DNA methylation changes across the genome resulting in transient neonatal diabetes.[22] There is some epidemiological evidence that MS may be partly determined by epigenetic alterations, and ZFP57 would be an ideal candidate functionally linking MS to the epigenome.[23] A SNP in IL7 recently confirmed as associated with MS was shown to correlate with the expression of two genes: IL7 and FAM164A. Naturally the most compelling candidate of these is IL7 due to its probable role in autoimmunity. However, the advantage of an unbiased screen is that it raises the possibility of candidate genes that would otherwise not be considered. This is especially so since the SNP is far more strongly associated with FAM164A expression than with IL7. FAM164A is a hypothetical protein encoded in the reverse direction to IL7 and its functional importance is largely unknown.[24] The susceptibility region on chromosome 12 was previously linked with the expression of FAM119B.[20] We feel that the relationship of all three major susceptibility SNPs in this region with the expression of TSFM makes this gene a strong candidate. TSFM is also a plausible candidate in terms of function, as it is involved in the translation of mitochondrial proteins, providing a potential link with other susceptibility genes linked to mitochondrial function, such as KIF21B.[8], [25] Further functional work will be needed to better assess these candidates.

The limited cross-over between known and suspected susceptibility genes in the whole genome expression analysis of Gandhi and colleagues is likely due to a number of differences including the use of whole blood mRNA and individuals with established disease in the Gandhi study.[20] It is possible that future whole genome analyses of expression conducted using RNA-seq in cell-sorted samples of patients with very early disease may reveal alterations in the level of susceptibility gene mRNA.

The advantage of an unbiased approach to linking the expression of genes with genetic variation associated with disease susceptibility is that there is no a priori hypothesis to blind investigators to the presence of other genes. There are several limitations to the approach we used. The mRNA screen was conducted in transformed LCLs and so it would not be informative about tissue-specific gene expression.[21] Also, SNP coverage across the genome was not complete and so the functional effects of some SNPs for which no proxy was available will be concealed. Furthermore, despite using expression data from 400 LCLs, we may have been underpowered to detect relevant effects. However, our finding of several novel associations between MS SNPs and gene expression is worthy of further investigation and also raises the hypothesis that some disease associated SNPs may not exert their effects on MS susceptibility through simple effects on gene expression.

Supporting Information

Table S1

Changes in mRNA expression associated with susceptibility SNPs.

(0.15 MB DOC)

Footnotes

Competing Interests: The authors have declared that no competing interests exist.

Funding: This work was funded by the Wellcome Trust (Grant Number 075491/Z/04). SVR is a Goodger Scholar at the University of Oxford. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Noseworthy JH, Lucchinetti C, Rodriguez M, Weinshenker BG. Multiple sclerosis. N Engl J Med. 2000;343:938–952. doi: 10.1056/NEJM200009283431307.
2. Willer CJ, Dyment DA, Risch NJ, Sadovnick AD, Ebers GC. Twin concordance and sibling recurrence rates in multiple sclerosis. Proceedings of the National Academy of Sciences of the United States of America. 2003;100:12877–12882. doi: 10.1073/pnas.1932604100.
3. Ebers GC, Kukay K, Bulman DE, Sadovnick AD, Rice G, et al. A full genome search in multiple sclerosis. Nature Genetics. 1996;13:472–476. doi: 10.1038/ng0896-472.
4. Lincoln MR, Ramagopalan SV, Chao MJ, Herrera BM, Deluca GC, et al. Epistasis among HLA-DRB1, HLA-DQA1, and HLA-DQB1 loci determines multiple sclerosis susceptibility. Proc Natl Acad Sci U S A. 2009. doi: 10.1073/pnas.0812664106.
5. Peltonen L. Old suspects found guilty–the first genome profile of multiple sclerosis. The New England Journal of Medicine. 2007;357:927–929. doi: 10.1056/NEJMe078147.
6. Burton PR, Clayton DG, Cardon LR, Craddock N, Deloukas P, et al. Association scan of 14,500 nonsynonymous SNPs in four diseases identifies autoimmunity variants. Nature Genetics. 2007;39:1329–1337. doi: 10.1038/ng.2007.17.
7. Hafler DA, Compston A, Sawcer S, Lander ES, Daly MJ, et al. Risk alleles for multiple sclerosis identified by a genomewide study. The New England Journal of Medicine. 2007;357:851–862. doi: 10.1056/NEJMoa073493.
8. IMSGC. Comprehensive follow-up of the first genome-wide association study of multiple sclerosis identifies KIF21B and TMEM39A as susceptibility loci. Hum Mol Genet. 2010;19:953–962. doi: 10.1093/hmg/ddp542.
9. Ban M, Goris A, Lorentzen AR, Baker A, Mihalova T, et al. Replication analysis identifies TYK2 as a multiple sclerosis susceptibility factor. Eur J Hum Genet. 2009;17:1309–1313. doi: 10.1038/ejhg.2009.41.
10. De Jager PL, Jia X, Wang J, de Bakker PI, Ottoboni L, et al. Meta-analysis of genome scans and replication identify CD6, IRF8 and TNFRSF1A as new multiple sclerosis susceptibility loci. Nat Genet. 2009;41:776–782. doi: 10.1038/ng.401.
11. De Jager PL, Baecher-Allan C, Maier LM, Arthur AT, Ottoboni L, et al. The role of the CD58 locus in multiple sclerosis. Proceedings of the National Academy of Sciences of the United States of America. 2009. doi: 10.1073/pnas.0813310106.
12. Hafler JP, Maier LM, Cooper JD, Plagnol V, Hinks A, et al. CD226 Gly307Ser association with multiple autoimmune diseases. Genes Immun. 2009;10:5–10. doi: 10.1038/gene.2008.82.
13. Baranzini SE, Wang J, Gibson RA, Galwey N, Naegelin Y, et al. Genome-wide association analysis of susceptibility and clinical phenotype in multiple sclerosis. Hum Mol Genet. 2009;18:767–778. doi: 10.1093/hmg/ddn388.
14. Rubio JP, Stankovich J, Field J, Tubridy N, Marriott M, et al. Replication of KIAA0350, IL2RA, RPL5 and CD58 as multiple sclerosis susceptibility genes in Australians. Genes Immun. 2008;9:624–630. doi: 10.1038/gene.2008.59.
15. Australia and New Zealand Multiple Sclerosis Genetics Consortium (ANZgene). Genome-wide association study identifies new multiple sclerosis susceptibility loci on chromosomes 12 and 20. Nat Genet. 2009;41:824–828. doi: 10.1038/ng.396.
16. Kallio SP, Jakkula E, Purcell S, Suvela M, Koivisto K, et al. Use of a genetic isolate to identify rare disease variants: C7 on 5p associated with MS. Hum Mol Genet. 2009;18:1670–1683. doi: 10.1093/hmg/ddp073.
17. Jakkula E, Leppa V, Sulonen AM, Varilo T, Kallio S, et al. Genome-wide association study in a high-risk isolate for multiple sclerosis reveals associated variants in STAT3 gene. Am J Hum Genet. 2010;86:285–291. doi: 10.1016/j.ajhg.2010.01.017.
18. Gregory SG, Schmidt S, Seth P, Oksenberg JR, Hart J, et al. Interleukin 7 receptor alpha chain (IL7R) shows allelic and functional association with multiple sclerosis. Nature Genetics. 2007;39:1083–1091. doi: 10.1038/ng2103.
19. Maier LM, Lowe CE, Cooper J, Downes K, Anderson DE, et al. IL2RA genetic heterogeneity in multiple sclerosis and type 1 diabetes susceptibility and soluble interleukin-2 receptor production. PLoS Genet. 2009;5:e1000322. doi: 10.1371/journal.pgen.1000322.
20. Gandhi KS, McKay FC, Cox M, Riveros C, Armstrong N, et al. The multiple sclerosis whole blood mRNA transcriptome and genetic associations indicate dysregulation of specific T cell pathways in pathogenesis. Hum Mol Genet. 2010 (in press). doi: 10.1093/hmg/ddq090.
21. Dixon AL, Liang L, Moffatt MF, Chen W, Heath S, et al. A genome-wide association study of global gene expression. Nat Genet. 2007;39:1202–1207. doi: 10.1038/ng2109.
22. Mackay DJ, Callaway JL, Marks SM, White HE, Acerini CL, et al. Hypomethylation of multiple imprinted loci in individuals with transient neonatal diabetes is associated with mutations in ZFP57. Nat Genet. 2008;40:949–951. doi: 10.1038/ng.187.
23. Handel AE, Ebers GC, Ramagopalan SV. Epigenetics: molecular mechanisms and implications for disease. Trends Mol Med. 2009. doi: 10.1016/j.molmed.2009.11.003.
24. Kono T, Bird S, Sonoda K, Savan R, Secombes CJ, et al. Characterization and expression analysis of an interleukin-7 homologue in the Japanese pufferfish, Takifugu rubripes. FEBS J. 2008;275:1213–1226. doi: 10.1111/j.1742-4658.2008.06281.x.
25. Smeitink JA, Elpeleg O, Antonicka H, Diepstra H, Saada A, et al. Distinct clinical phenotypes associated with a mutation in the mitochondrial translation elongation factor EFTs. Am J Hum Genet. 2006;79:869–877. doi: 10.1086/508434.
