Abstract
The proto-oncogene SKI is highly expressed in human myeloid leukemia and also in murine hematopoietic stem cells. However, its operative relevance in these cells remains elusive. We have over-expressed SKI to define its intrinsic role in hematopoiesis and myeloid neoplasms, which resulted in a robust competitive advantage upon transplantation, a complete dominance of the stem and progenitor compartments, and a marked enhancement of myeloid differentiation at the expense of other lineages. Accordingly, enforced expression of SKI induced gene signatures associated with hematopoietic stem cells and myeloid differentiation. Here we provide detailed experimental methods and analysis for the gene expression profiling described in our recently published study of Singbrant et al. (2014) in Haematologica. Our data sets (available at http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE39457) provide a resource for exploring the underlying molecular mechanisms of the involvement of the proto-oncogene SKI in hematopoietic stem cell function and development of myeloid neoplasms.
Keywords: Gene expression profiling, cDNA microarray, SKI, Myeloproliferative disease, Hematopoietic stem cell activity
| Specifications | |
|---|---|
| Organism | Mus musculus |
| Array type | Illumina MouseWG-6 v2.0 expression bead chip qRT-PCR |
| Data format | Raw data: TAR. Normalized data: SOFT, MINIML, TXT, XLS, ODF, RDA |
| Experimental factors | Murine stem- and progenitor cells overexpressing SKI vs. MIG empty control vector. |
| Experimental features | Human SKI was overexpressed using a retroviral MIG-vector (MCV-based vector containing IRES and the marker gene GFP) in sorted murine stem- and progenitor cells (lin−cKit+Sca-1+) from C57Bl6 animals. Cells were infected and cultured for two days prior to isolation of GFP+ cells, followed by RNA isolation and microarray. Cells transduced with empty control vector (MIG) or overexpressing SKI were compared. Each sample represents an independent infection. |
| Consent | n/a |
Direct link to deposited data
Raw and processed microarray data is available in GEO under accession GSE39457 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE39457).
Materials and methods
Isolation and transduction of primary murine stem- and progenitor cells
The SKI construct [2] was cloned into the Murine Stem Cell Vector (MSCV)-based retroviral MIG-vector containing IRES and the marker gene GFP, and virus was produced as previously described [1].
Hematopoietic stem- and progenitor cells (negative for the mature lineage markers Mac1, Gr1, CD3, CD4, CD8, B220, Ter119, and positive for the primitive surface markers c-Kit and Sca1; LKS +) were FACS-sorted from the bone marrow of C57Bl6 wild-type mice and pre-stimulated with 50 ng/ml murine Stem Cell Factor (mSCF; Amgen, Thousand Oaks, CA), 10 ng/ml murine Interleukin 3 (mIL-3; PeproTech), and 50 ng/ml human IL-6 (PeproTech) at 37 °C, 5% CO2 for 48 h to induce cycling. Pre-stimulated cells were subsequently cultured in the above described cytokines with the addition of 8 µg/ml Polybrene (Sigma) on retronectin-coated plates (Takara Bioscience, Clontech) pre-loaded with viral supernatant (1–2 hits), and GFP+ cells were isolated after an additional 48 h (n = 6 independent transductions).
Mice were obtained from Animal Resources Centre, Perth, Australia and kept at BioResources Centre, St Vincent's Hospital, Australia. All experiments were performed under ethical approval of St Vincent's Health Melbourne Animal Ethics Committee.
cDNA microarray
Data normalization and analysis
Total RNA was extracted using Trizol followed by RNeasy Micro kit (Qiagen). RNA was hybridized to the Illumina Mouse WG-6 v2.0 Expression BeadChip at the Ramaciotti Centre for Gene Function Analysis at the University of New South Wales, Australia. Illumina Data was processed in GenePattern (GP) [3] using IlluminaExpressionFileCreator (GP, v2.1) with background correction using manifest MouseWG-6_V2_0_R3_11278593_A.txt (GPL6887). Detection p-values were calculated as the proportion of negative control probes with expression greater than the regular probe in question using the calculateDetected function in the bioconductor bead array package (v2.8.1) [4]. The data was log transformed in GenePattern (logTransform, GP, v3.1) and quantile normalized using NormalizeColulms (GP, v4.2.1).
Filtering
Filtering of the data to eliminate “bad probes” as defined by Barbosa-Morais NL et al. [5] resulted in the removal of 6324 probes, leaving 38,958 probes. Furthermore, probes that were not detected in at least 5 out of 6 SKI or MIG samples were discarded (75%), leaving 9780 probes (7514 unique gene symbols). The filtered data is available as supplementary file “SKI_normalized_filtered.gct”.
Quality control
In order to assess microarray quality we ran ArrayQualityMetrics [6]. All microarrays passed quality control and the output is available as supplementary files “ArrayQualityMetrics”. There was a clear separation between the SKI and MIG samples, as depicted by an MDS plot in Fig. 1A. The dissimilarity matrix of the normalized arrays (Fig. 1B, blue: high similarity, yellow: low similarity) reflects the distance between each pair of arrays — the mean absolute difference between the array data.
Fig. 1.
Microarray data overview. A) MDS plot of samples demonstrating a clear separation between the SKI and MIG samples. B) A dissimilarity matrix of the normalized arrays (blue: high similarity, yellow: low similarity) reflecting the distance between each pair of arrays. C) Volcano plot showing the distribution of the gene expression changes. Genes with absolute logFC > 1 and Q value < 0.05 are indicated in red. D) Heat map of the top 50 genes deregulated by SKI overexpression.
Differential expression
Differentially expressed genes were determined using the LimmaGP module [7] with the most informative probe for each gene used (supplementary files SKI_limmaGP_SKIvsMIG.odf, SKI_limmaGP_SKIvsMIG.rnk, SKI_limmaGP_SKIvsMIG.xls, SKI_limmaGP_list.Rda). The distribution of the fold change and q-value is shown in Fig. 1C. Using a cutoff of absolute logFC > 1 and Q.value < 0.05, 113 genes were considered differentially expressed, 51 genes up-regulated and 62 genes down-regulated in response to SKI overexpression. A heat map of the top 50 genes is shown in Fig. 1D, generated using GENE-E [8].
Pathway analysis using GSEA
We performed Gene Set Enrichment Analysis (GSEA) [9] using GSEAPreRanked against the gene set database C2 (c2.all.v3.0.symbols.gmt, curated gene sets). We used the ranked gene list from LimmaGP as input on all “good” probes with the number of permutations = 1000. The Gene Pattern modules LimmaGP and GSEAPreRanked were developed by Mark Cowley (m.cowley@garvan.org.au) from Peter Wills Bioinformatics Centre, Garvan Institute of Medical Research, Sydney Australia.
Validation of gene expression with quantitative real time PCR
The expression of a cohort of genes that were up- or down-regulated by SKI in the microarray was validated using real time PCR on transduced FACS sorted LKS cells from 3 transductions separate from those used for the array.
Conclusion
Collectively our results demonstrate that enforced expression of SKI results in activation of hematopoietic stem cell transcriptional programs and a hematopoietic stem cell autonomous repopulating advantage that favors myeloid development from an early progenitor stage, ultimately resulting in a myeloproliferative disorder.
Disclosures
None of the authors declare any competing financial interests.
Acknowledgments
This research was supported by the Swedish Research Council (fellowship to SS) (524-2009-515), the Swedish Tegger Foundation (fellowship to SS), the Swedish Cancer Society (fellowship to SS): 2011/1055, the Leukemia Foundation Philip Desbrow Senior Research Fellowship (CRW) and National Health and Medical Research Council (1031481 and 559016) (NHMRC) Australia (CRW).
Footnotes
Supplementary data to this article can be found online at http://dx.doi.org/10.1016/j.gdata.2014.06.022.
Appendix A. Supplementary data
Supplementary material 1.
References
- 1.Singbrant S., Wall M., Moody J., Karlsson G., Chalk A.M., Liddicoat B. The SKI proto-oncogene enhances the in vivo repopulation of hematopoietic stem cells and causes myeloproliferative disease. Haematologica. 2014;99(4):647–655. doi: 10.3324/haematol.2013.093971. PMID: 24415629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Akiyoshi S., Inoue H., Hanai J., Kusanagi K., Nemoto N., Miyazono K. c-Ski acts as a transcriptional co-repressor in transforming growth factor-beta signaling through interaction with smads. J. Biol. Chem. 1999;274(49):35269–35277. doi: 10.1074/jbc.274.49.35269. [DOI] [PubMed] [Google Scholar]
- 3.Reich M., Liefeld T., Gould J., Lerner J., Tamayo P., Mesirov J.P. GenePattern 2.0. Nat. Genet. 2006;38(5):500–501. doi: 10.1038/ng0506-500. [DOI] [PubMed] [Google Scholar]
- 4.Dunning M.J., Smith M.L., Ritchie M.E., Tavare S. BeadArray: R classes and methods for Illumina bead-based data. Bioinformatics. 2007;23(16):2183–2184. doi: 10.1093/bioinformatics/btm311. [DOI] [PubMed] [Google Scholar]
- 5.Barbosa-Morais N.L., Dunning M.J., Samarajiwa S.A., Darot J.F., Ritchie M.E., Lynch A.G. A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data. Nucleic Acids Res. 2010;38(3):e17. doi: 10.1093/nar/gkp942. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kauffmann A., Gentleman R., Huber W. ArrayQualityMetrics—a bioconductor package for quality assessment of microarray data. Bioinformatics. 2009;25(3):415–416. doi: 10.1093/bioinformatics/btn647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wettenhall J.M., Smyth G.K. LimmaGUI: a graphical user interface for linear modeling of microarray data. Bioinformatics. 2004;20(18):3705–3706. doi: 10.1093/bioinformatics/bth449. [DOI] [PubMed] [Google Scholar]
- 8.GENE-E http://www.broadinstitute.org/cancer/software/GENE-E/index.html [Internet]. [cited 16th of June 2014]. Available from:
- 9.Subramanian A., Tamayo P., Mootha V.K., Mukherjee S., Ebert B.L., Gillette M.A. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U. S. A. 2005;102(43):15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary material 1.

