Skip to main content
Data in Brief logoLink to Data in Brief
. 2017 Feb 21;11:364–370. doi: 10.1016/j.dib.2017.02.040

Data characterizing the ZMIZ1 molecular phenotype of multiple sclerosis

N Fewings a,1, PN Gatt a,1, FC McKay a,1, GP Parnell a,g,1, SD Schibeci a, J Edwards a, MA Basuki a, A Goldinger b,h, MJ Fabis-Pedrini c,d, AG Kermode d, CP Manrique e, JL McCauley e, D Nickles f, SE Baranzini f, T Burke g, S Vucic a,g, GJ Stewart a,g, DR Booth a,g,
PMCID: PMC5329066  PMID: 28275670

Abstract

The data presented in this article are related to the research article entitled “The autoimmune risk gene ZMIZ1 is a vitamin D responsived marker of a molecular phenotype of multiple sclerosis” Fewings et al. (2017) [1]. Here we identify the set of genes correlated with ZMIZ1 in multiple cohorts, provide phenotypic details on those cohorts, and identify the genes negatively correlated with ZMIZ1 and the cells predominantly expressing those genes. We identify the metabolic pathways in which the molecular phenotype genes are over-represented. Finally, we present the flow cytometry gating strategy we have used to identify the immune cells from blood which are producing ZMIZ1 and RPS6.

Keywords: ZMIZ1, Gene expression, Multiple sclerosis, Molecular phenotype


Specifications Table

Subject area Biology
More specific subject area Autoimmune Genetics
Type of data Tables, Figures
How data was acquired Molecular phenotypes are determined from transcriptomic data (RNAseq, Microarray) derived from previously published sources[2], [3], [4]
Molecular pathways were analysed using GeneGo, Metacore.
Flow Cytometry data was acquired using Canto II (BD Biosciences) and Fortessa (BD Biosciences) instruments.
Data format Analyzed
Experimental factors Samples are from whole blood collected in to PAXgene tubes for RNA analysis. The source of samples were healthy controls and Multiple Sclerosis patients not on therapy from Australia and the United States.
Experimental features Gene expression in whole blood was interrogated by transcriptomic and RTPCR procedures to identify genes dysregulated in MS and correlated gene sets (mRNA level). Flow cytometry was used to identify immune cell subsets in which ZMIZ1 and RPS6 were most highly expressed (protein level).
Data source location Cohorts of MS patients and controls are from Sydney and Perth in Australia; and Miami and San Francisco in United States.
Data accessibility UCSF CIS Cohort Transcriptome Source Data is in RepositoryGSE41846(url:https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE41846)
ANZgene Transcriptome Source Data is in RepositoryGSE17048(url:https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE17048)

Value of the data

  • 1.

    The detailed list of genes defining the ZMIZ1 molecular phenotype in multiple cohorts is described (Supplementary Table 1).

  • 2.

    The cohorts used to identify the genes in the ZMIZ1 molecular phenotype (Table 1).

  • 3.

    The genes negatively correlated with ZMIZ1 expression in whole blood and the immune cells in which they are expressed are identified (Fig. 1).

  • 4.

    The gene pathways indicated as overrepresented with the genes of the ZMIZ1 molecular phenotype are identified (Fig. 2).

  • 5.

    The flow cytometry gating strategy used to identify the immune cells most highly expressing ZMIZ1 and RPS6 are diagrammed (Fig. 3).

  • 6.

    A list of Multiple Sclerosis (MS) risk SNPs tested for association with gene expression in whole blood in the study published in [1] (Table 2).

Table 1.

Usage and source of various cohorts in this work.

Cohort Name Used in this study in: Numbers of individuals Original Reference
Sydney PCR cohort Figures 2, 3, 4, 5, 6, 7, 9 n=39 untreated MS
n=40 healthy controls
[1]
Miami cohort Figures 3, 6, 7 n=73 untreated MS
n=32 healthy controls
[1]
Treated MS cohort Figure 10 Total, n=78:
n=10, glatiramer acetate
n=18, fingolimod
n=20, interferon beta
n=23, natalizumab
n=7, dimethyl fumarate
[8]
ANZgene (microarray) cohort Figures 1B, 6, 7 n=99 untreated MS;
n=45 healthy controls
[2]
Sydney RNASeq cohort Figures 1, 3, 6, 7, Supplementary Figure. 1 and 2 n=32 untreated MS;
n=40 healthy controls
[4]
CIS cohort Figure 1B n=42 CIS [3]

Fig. 1.

Fig. 1.

Relative expression in immune cell subsets of ZMIZ1 and the 50 genes whose expression is most highly negatively correlated with ZMIZ1 expression in PAXgene whole blood in multiple sclerosis and healthy controls. (These were determined in the RNASeq cohort: n=32 MS, n=40 healthy controls, [4]. These genes are mostly expressed in lymphocytes. Expression was by RNASeq and colour on heatmap indicates relative expression level: orange is high, blue is low. Cell subsets were ex vivo or in-vitro generated as previously described [7]. Pearson׳s correlation (R) of expression with ZFP36L2 and RPS6 is also shown for each module gene, red is positive correlation and green is negative correlation. ZFP36L2 correlations all less than r=-0.27 (p=0.02), RPS6 correlations all greater than r=0.54 (p=8.9E-07). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Fig. 2.

Fig. 2.

Gene pathways, diseases, process networks, processes, and molecular functions enriched for the ZMIZ1 Molecular Module genes (the 200 genes with expression most highly positively correlated with ZMIZ1 gene expression in whole blood). The 20 most over-represented items in the module are shown for A. Pathway maps; B. Pathway map folders (diseases); C. Process networks; D. Processes; E. Molecular functions. Enrichment analysis using Gene Ontology (GO) software (http://geneontology.org/); with items listed in order of significance; p value shows the probability that the module is not over-represented; FDR: false discovery rate; ratio is the number of genes from the ZMIZ1 module in the pathway, compared to the total number of genes in the pathway.

Fig. 3.

Fig. 3.

Flow cytometric gating of PBMC subsets for determination of ZMIZ1 and RPS6 protein expression. (A) ZMIZ1 panel: progressive gating of subsets of interest, and ZMIZ1 median fluorescence intensity of DC and monocyte subsets relative to isotype control; (B) RPS6 panel: gating of live PBMCs and their median fluorescence intensity of RPS6 relative to isotype control.

Table 2.

List of MS risk SNPs tested for association with gene expression in whole blood in this study.

Gene SNP Reference
RPS6KB1 rs180515 [9]
ZMIZ1 rs1782645 [10]
ZFP36L2 rs2163226 [9]
HLADRB1 rs2516049 [10]
CYP27B1 rs10877012 [11]
CYP24A1 rs2248359 [10]

1. Data

We have recently described a gene, ZMIZ1, whose expression is dysregulated in the blood of people with MS [1]. From transcriptomic data, the expression of this gene is tightly correlated with that of many others. The set of genes whose expression is most correlated with ZMIZ1 across cohorts from Australia and the United States is defined in Supplementary Table 1. The cohort details are described in Table 1. The genes whose expression is most positively correlated with that of ZMIZ1 is previously described [1]. The genes most negatively correlated with ZMIZ1, and the immune cell subsets in which they are predominantly produced, are shown in Fig. 1. Gene pathway analysis was used to identify the types of molecular pathways most overrepresented in the gene lists of the ZMIZ1 molecular phenotype (Fig. 2). Specifically, GeneGo Maps, GeneGo Map folders, GeneGo Networks, Gene Ontology processes, and Gene Ontology molecular functions over-represented are shown in Figs. 2A–E respectively. The flow cytometry gating strategy to identify the immune cell subsets expressing ZMIZ1 and RPS6 is described in Figs. 3A and B respectively. A list of MS risk single nucleotide polymorphisms (SNPs) tested for association with gene expression in whole blood in this study is presented in Table 2.

2. Experimental design, materials and methods

2.1. Cohorts

Untreated MS patients who had not been on immunomodulatory therapies for at least three months and age-matched healthy controls were recruited for the following cohorts. The molecular phenotypes were determined from previously described transcriptomic cohorts (all PAXgene whole blood samples): ANZgene (2010) [2]; clinically isolated syndrome (CIS; or first clinical diagnosis of central nervous system demyelination) [3] and RNAseq [4], [5]. All MS patients were diagnosed using the revised MacDonald criteria [6]. All blood was collected with informed consent after the nature and possible consequences of the studies were explained from people with MS and healthy controls with approval from Human Research Ethics Committees.

2.2. Molecular phenotypes

The genes most correlated with ZMIZ1, ZFP36L2 and RPS6 expression in PAXgene whole blood was determined by Pearson׳s correlation using an RNAseq dataset of MS patients and healthy controls [4]. To visualise relative expression levels across different immune cell populations for the most correlated genes, a heatmap was generated using an RNAseq dataset of ex-vivo and in-vitro differentiated immune cell subsets, as previously described [7]. These genes were assessed for immune cell transcription factor roles and involvement in molecular pathways using GeneGo Metacore. Genes associated with MS were also noted. We then tested correlations in previously described cohorts used for transcriptomic studies: the ANZgene microarray cohort [2], and the CIS cohort [3].

2.3. Flow cytometry

Venous blood was collected in EDTA and peripheral blood mononuclear cells (PBMCs) isolated on Ficoll-Paque Plus (VWR International), washed in phosphate-buffered saline (PBS) and cryopreserved in RPMI 1640 Medium (Life Technologies) containing 2 mM glutamine, 10% heat-inactivated fetal bovine serum (FBS, Fisher Biotec), 10% DMSO and 50 units/ml penicillin and 50 µg/ml streptomycin. PBMCs were thawed, washed in RPMI with 2% FBS, and incubated for 30 min in RPMI with 2% FBS, 10 mM HEPES, 1 mM magnesium chloride and 100 units/ml DNase I (Roche). Cells were stained with Live/Dead Aqua viability stain (Molecular Probes) in PBS on ice for 30 min, washed in PBS, and blocked with 33 µg/ml mouse IgG (Life technologies). Antibodies were as described in [1].

Acknowledgements

We would like to thank the people with MS and controls for donating blood to support this research. This work was funded by grants from the Australian National Health and Medical Research Council (NHMRC) (GNTs 1006759, 1049936, and 1050074), MS Research Australia and the Trish MS Research Foundation, the National Multiple Sclerosis Society (NMSS) research grant (RG4680A1/1) and the National Institutes of Health (NIH)/National Institute of Neurological Disorders and Stroke (NINDS) (R01NS088155). Flow cytometry was performed in the Flow Cytometry Core Facility that is supported by Westmead Research Hub, Cancer Institute New South Wales and the NHMRC. The BSGS cohort was built with support from NHMRC project grants (GNT1046880, GNT1083405). Funding bodies had no role in study design, in collection, analysis and interpretation of data, in the writing of the report, or in the decision to submit the article for publication.

Footnotes

Transparency document

Transparency data associated with this article can be found in the online version at 10.1016/j.dib.2017.02.040.

Appendix A

Supplementary data associated with this article can be found in the online version at 10.1016/j.dib.2017.02.040.

Transparency document. Supplementary material

Supplementary material

mmc1.docx (116.3KB, docx)

Appendix A. Supplementary material

Supplementary Table 1 legend. Expanded gene list of the ZMIZ1 module and comparison across multiple cohorts. A. Module of 1656 genes with expression correlated with ZMIZ1 (R=0.63 to 0.91, p<5×10–11 to 4×10–29, (5×10−5 to 8×10–25 corrected)) and 621 genes correlated with ZFP36L2 ((R=0.63 to 0.84, p<2×10−9 to 2×10–20, (5×10−5 to 5×10–16 corrected)) in the RNASeq cohort. Lists of 200 genes whose expression is most highly positively B. or negatively C. correlated with ZMIZ1 gene expression in whole blood in 3 separate cohorts. Pearson׳s correlation coefficient (R) for each gene is shown for each cohort, and overlap between the cohorts is shown (Venn diagram).

mmc2.zip (211.9KB, zip)

References

  • 1.N.L. Fewings, P.N. Gatt, F.C. McKay, G.P. Parnell, S.D. Schibeci, J. Edwards, et al., The autoimmune risk gene ZMIZ1 is a vitamin D responsive marker of a molecular phenotype of multiple sclerosis, J. Autoimmun. (2017), http://dx.doi.org/10.1016/j.jaut.2016.12.006. [DOI] [PubMed]
  • 2.Gandhi K.S., McKay F.C., Cox M., Riveros C., Armstrong N., Heard R.N. The multiple sclerosis whole blood mRNA transcriptome and genetic associations indicate dysregulation of specific T cell pathways in pathogenesis. Hum. Mol. Genet. 2010;19:2134–2143. doi: 10.1093/hmg/ddq090. [DOI] [PubMed] [Google Scholar]
  • 3.Nickles D., Chen H.P., Li M.M., Khankhanian P., Madireddy L., Caillier S.J. Blood RNA profiling in a large cohort of multiple sclerosis patients and healthy controls. Hum. Mol. Genet. 2013;22:4194–4205. doi: 10.1093/hmg/ddt267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Parnell G.P., Gatt P.N., McKay F.C., Schibeci S., Krupa M., Powell J.E. Ribosomal protein S6 mRNA is a biomarker upregulated in multiple sclerosis, downregulated by interferon treatment, and affected by season. Mult. Scler. 2014;20:675–685. doi: 10.1177/1352458513507819. [DOI] [PubMed] [Google Scholar]
  • 5.Parnell G.P., Gatt P.N., Krupa M., Nickles D., McKay F.C., Schibeci S.D. The autoimmune disease-associated transcription factors EOMES and TBX21 are dysregulated in multiple sclerosis and define a molecular subtype of disease. Clin. Immunol. 2014;151:16–24. doi: 10.1016/j.clim.2014.01.003. [DOI] [PubMed] [Google Scholar]
  • 6.Polman C.H., Reingold S.C., Banwell B., Clanet M., Cohen J.A., Filippi M. Diagnostic criteria for multiple sclerosis: 2010 revisions to the McDonald criteria. Ann. Neurol. 2011;69:292–302. doi: 10.1002/ana.22366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Shahijanian F., Parnell G.P., McKay F.C., Gatt P.N., Shojoei M., O׳Connor K.S. The CYP27B1 variant associated with an increased risk of autoimmune disease is underexpressed in tolerizing dendritic cells. Hum. Mol. Genet. 2014;23:1425–1434. doi: 10.1093/hmg/ddt529. [DOI] [PubMed] [Google Scholar]
  • 8.McKay F.C., Gatt P.N., Fewings N., Parnell G.P., Schibeci S.D., Basuki M.A. The low EOMES/TBX21 molecular phenotype in multiple sclerosis reflects CD56+ cell dysregulation and is affected by immunomodulatory therapies. Clin. Immunol. 2016;163:96–107. doi: 10.1016/j.clim.2015.12.015. [DOI] [PubMed] [Google Scholar]
  • 9.IMSGC, WTCCC2 Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature. 2011;476:214–219. doi: 10.1038/nature10251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Beecham A.H., IMSGC Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis. Nat. Genet. 2013;45:1353–1360. doi: 10.1038/ng.2770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Ramagopalan S.V., Dyment D.A., Cader M.Z., Morrison K.M., Disanto G., Morahan J.M. Rare variants in the CYP27B1 gene are associated with multiple sclerosis. Ann. Neurol. 2011;70:881–886. doi: 10.1002/ana.22678. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.docx (116.3KB, docx)

Supplementary Table 1 legend. Expanded gene list of the ZMIZ1 module and comparison across multiple cohorts. A. Module of 1656 genes with expression correlated with ZMIZ1 (R=0.63 to 0.91, p<5×10–11 to 4×10–29, (5×10−5 to 8×10–25 corrected)) and 621 genes correlated with ZFP36L2 ((R=0.63 to 0.84, p<2×10−9 to 2×10–20, (5×10−5 to 5×10–16 corrected)) in the RNASeq cohort. Lists of 200 genes whose expression is most highly positively B. or negatively C. correlated with ZMIZ1 gene expression in whole blood in 3 separate cohorts. Pearson׳s correlation coefficient (R) for each gene is shown for each cohort, and overlap between the cohorts is shown (Venn diagram).

mmc2.zip (211.9KB, zip)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES