Skip to main content
Stem Cell Reports logoLink to Stem Cell Reports
. 2014 Sep 25;3(5):858–875. doi: 10.1016/j.stemcr.2014.08.012

Transcriptome-wide Profiling and Posttranscriptional Analysis of Hematopoietic Stem/Progenitor Cell Differentiation toward Myeloid Commitment

Daniel Klimmeck 1,2,3,4,5, Nina Cabezas-Wallscheid 1,2,4,5, Alejandro Reyes 3,5, Lisa von Paleske 1, Simon Renders 1,2, Jenny Hansson 3, Jeroen Krijgsveld 3, Wolfgang Huber 3,6,, Andreas Trumpp 1,2,4,6,∗∗
PMCID: PMC4235149  PMID: 25418729

Summary

Hematopoietic stem cells possess lifelong self-renewal activity and generate multipotent progenitors that differentiate into lineage-committed and subsequently mature cells. We present a comparative transcriptome analysis of ex vivo isolated mouse multipotent hematopoietic stem/progenitor cells (LinnegSCA-1+c-KIT+) and myeloid committed precursors (LinnegSCA-1negc-KIT+). Our data display dynamic transcriptional networks and identify a stem/progenitor gene expression pattern that is characterized by cell adhesion and immune response components including kallikrein-related proteases. We identify 498 expressed lncRNAs, which are potential regulators of multipotency or lineage commitment. By integrating these transcriptome with our recently reported proteome data, we found evidence for posttranscriptional regulation of processes including metabolism and response to oxidative stress. Finally, our study identifies a high number of genes with transcript isoform regulation upon lineage commitment. This in-depth molecular analysis outlines the enormous complexity of expressed coding and noncoding RNAs and posttranscriptional regulation during the early differentiation steps of hematopoietic stem cells toward the myeloid lineage.

Highlights

  • Transcriptional control and immune response pathways indicative of stem/progenitors

  • Identification of lncRNA species expressed during early adult hematopoiesis

  • Posttranscriptional regulation of oxidative stress response and metabolism

  • High degree of differential exon usage upon lineage commitment


In this article, Trumpp, Huber, and colleagues provide a comprehensive transcriptome and posttranscriptional analysis of hematopoietic stem/progenitor cell differentiation toward myeloid commitment. By integrating the proteome of these cells, they found evidence for posttranscriptional regulation in genes involved in metabolic stress response. They uncover lncRNA expression and transcript isoform changes upon commitment. All the data are available in an interactive website.

Introduction

In the adult hematopoietic system, short-lived mature cells are constantly lost and need to be replaced in order to maintain blood homeostasis (Murphy et al., 2005; Weissman and Shizuru, 2008). This essential task is fulfilled by hematopoietic stem cells (HSCs), which reside in the trabecular areas of the bone marrow (Purton and Scadden, 2007; Till and McCulloch, 1961; Wilson et al., 2009). HSCs possess the highest self-renewal capacity and produce multipotent progenitors (MPPs) with steadily decreasing self-renewal activity (Trumpp et al., 2010; Weissman and Shizuru, 2008). HSCs and MPPs (HSPC) are contained within a compartment immunophenotypically defined as negative for mature blood cell markers (Lin) and positive for stem cell markers SCA-1 and c-KIT (LS+K; Weissman and Shizuru, 2008). HSPCs eventually commit to more mature lymphoid or myeloid progenitors with increasingly restricted self-renewal and differentiation potential (Graf and Enver, 2009). The myeloid committed progenitor subset (Lin, SCA-1 and c-KIT+; [LSK]) comprises common myeloid progenitors (CMPs) as well as more specialized granulocyte-macrophage progenitors (GMPs) and megakaryocyte-erythroid progenitors (MEPs) (Akashi et al., 2000; Pronk et al., 2007), which differentiate toward mature effector cells.

Two crucial aspects of early hematopoiesis are multipotency and lineage commitment (Graf and Enver, 2009; Trumpp et al., 2010). Expression profiling of HSPCs by cDNA microarrays has elucidated important aspects of hematopoietic stem cell biology, including the relevance of the KIT and Wnt signaling pathways (Gazit et al., 2013; Kent et al., 2008; Luis et al., 2012; Seita and Weissman, 2010) for multipotency. Transcriptional control networks active in early hematopoiesis have been studied using single-gene expression analysis (Moignard et al., 2013), but their impact on protein levels and posttranscriptional gene expression regulation in HSPCs has not been described.

Recently, transcriptome profiling by next-generation sequencing (NGS; e.g., RNA sequencing [RNA-seq]) has significantly extended the possibilities to study gene expression (Ozsolak and Milos, 2011), which was also used to investigate young versus aged HSCs (Sun et al., 2014). It permits not only the analysis of differential mRNA expression of low abundant regulatory factors, but also the detection of alternative splicing events that can generate different protein isoforms and the identification of noncoding RNAs. Long noncoding RNAs (lncRNAs) (Mercer et al., 2009) are involved in the regulation of gene expression at various levels (Pauli et al., 2011; Yoon et al., 2013) and can function as oncogenes or tumor-suppressor genes (Gutschner and Diederichs, 2012). Although efforts have been made to identify and elucidate the roles of lncRNAs in stem cells (Qureshi and Mehler, 2012; Uchida et al., 2012), little is known about the expression of lncRNAs or their functions in hematopoietic stem/progenitors (Paralkar and Weiss, 2013). Further, the advent of improved proteome techniques has enabled in-depth comparative analysis of RNA and protein signatures in diverse systems (Cox and Mann, 2007; Schwanhäusser et al., 2011; Vogel and Marcotte, 2012). Although attempts were made to correlate transcriptome and proteome signatures of hematopoietic immature cells (Spooncer et al., 2008), a comprehensive comparison is still lacking.

We performed a genome-wide RNA-seq analysis of primary multipotent and self-renewing hematopoietic stem/progenitors and myeloid committed precursors. We report robust and reproducible transcriptome data with more than 19,000 quantified genes including more than 1,300 noncoding RNA species. To address how gene expression is regulated in multipotency and commitment, we integrated our RNA-seq data with the recently reported proteome data set of the identical cell populations (Klimmeck et al., 2012). These data sets outline the dynamic expression changes that occur during the transition of stem/progenitors toward myeloid commitment.

Results

Quantitative Transcriptomic Analysis of Hematopoietic Stem and Progenitor Cells

Whole-transcriptome analysis was performed to investigate differences in the gene expression profiles between multipotent hematopoietic stem progenitor cells (HSPCs; LS+K) and myeloid committed cells (LSK) of the mouse bone marrow (Figure 1A). We fluorescence-activated cell sorting (FACS)-sorted 50,000 primary cells of each population in three independent biological experiments (Figure 1B and Figure S1 available online) and enriched for polyadenylated RNA. We generated paired-end libraries and sequenced more than 2 × 108 reads per sample (Figures S1 and S2). Quality-control metrics indicated that the data were reproducible and of high quality (Figures 1C, S1, and S2). We identified the expression of 19,824 genes (Table S1). We classified the quantified genes by RNA categories. As expected after poly(A)-RNA enrichment, the majority of transcripts were categorized as protein-coding genes (78%; 15,474; Figure 1D). In addition, we classified hits to 23 other noncoding RNA categories including pseudogenes (1,783) and lncRNAs (498).

Figure 1.

Figure 1

Quantitative Transcriptomic Analysis of Hematopoietic Stem and Progenitor Cells

(A) Early hematopoiesis. Multipotent hematopoietic stem cells (HSCs) give rise to multipotent progenitors (MPPs), which commit either to myeloid specified progenitors (CMPs, common myeloid progenitors; GMP, granulocyte-macrophage progenitor; MEP, megakaryocyte/erythrocyte progenitor) or to lymphoid specified progenitors.

(B) Experimental design. Cell fractions were purified with fluorescence-activated cell sorting (FACS) using specific surface markers for Lineage (Lin), SCA-1, and c-KIT, which distinguish multipotent stem progenitor cells (LS+K) from myeloid committed (LSK) cells. For analysis of posttranscriptional regulation, the transcriptome was integrated with our previously published proteome (Klimmeck et al., 2012) (asterisk).

(C) Clustering of biological replicates. Heatmap represents similarity of samples from black (highest) to light gray (smallest).

(D) Classification of the quantified genes by RNA categories. The pie chart legend indicates percentage and absolute number of genes for each type of RNA. LncRNA, long noncoding RNA; snoRNA, small nucleolar RNA; miRNA, microRNA; IG_V gene, immunoglobin V gene; snRNA, small nuclear RNA; miscRNA, miscellaneousRNA; rRNA, ribosomal RNA; IG_J gene, immunoglobulin J gene; IG_C gene, immunoglobulin C gene; Mt_tRNA, mitochondrial transfer RNA; TR_V gene, V gene. Genes were called as quantified genes if they had at least 20 read counts per cell type.

(E) Differential gene expression between multipotent and myeloid committed progenitors. The plot shows for each gene (indicated by dots) a measure of its average expression (x axis) versus the logarithm of the ratio between expression levels in LS+K and LSK. Red and blue coloring of dots represents differentially expressed genes (FDR = 0.05). Red dots represent upregulated genes in LS+K (1,970 genes). Blue dots depict upregulated genes in LSK (1,266). The five most abundant genes in each fraction are shown in italics.

(F) Biological processes enriched in the annotation of differentially expressed genes. Within each gene ontology (GO) term, the number of genes with higher expression in LS+K (red) and LSK (blue) are indicated. Processes related to immune response; ∗∗cell adhesion; ∗∗∗transcriptional control, ∗∗∗∗metabolism, or ∗∗∗∗∗cell cycle are labeled. The odd ratios are calculated with respect to background. For complete list of differential GO terms, see Table S2.

Differential Gene Expression Analysis Reveals Significant Transcriptional Divergence between HSPCs and Myeloid Committed Precursors

We found 3,236 genes to be differentially expressed (false discovery rate [FDR] = 0.05) between multipotent and myeloid committed cells, which indicated a high divergence in transcript levels (Figure 1E; Table S1). Of these, 1,970 genes were highly expressed in HSPCs and 1,266 were higher in LSK cells. As expected, transcripts encoding SCA-1, which was used to sort this population, as well as FLT3 and THY1 surface markers (Adolfsson et al., 2005; Weissman and Shizuru, 2008) were preferentially expressed in LS+K cells (Table S1). In contrast, neutrophil serine proteases Ctsg and Elane were highly expressed in LSK cells (Korkmaz et al., 2008). We validated 24 differentially expressed genes by quantitative real-time PCR, which confirmed the robustness of the data for a wide range of gene expression transcript abundances (from 2 to 2,314,409 of the mean of sequenced fragments; Figure S2D).

To investigate the biological roles of our set of differentially expressed genes, we applied a gene ontology (GO) enrichment analysis, using a stratified approach that controls for expression strength biases (Figure 1F; Table S2). The GO terms overrepresented in LS+K cells fell into three main categories: immune response ( in Figure 1F; e.g., antigen processing and presentation, inflammatory response), cell adhesion (∗∗; chemotaxis), and transcriptional control (∗∗∗; negative regulation of transcription). In contrast, metabolism (∗∗∗∗; cell redox homeostasis, glycolysis) was significantly overrepresented in myeloid committed cells. Cell cycle (∗∗∗∗∗) showed a dual pattern with e.g., negative regulation of cell cycle enriched in LS+K and spindle assembly involved in mitosis enriched in myeloid cells, reflecting the higher proliferative state in the committed progenitors. We determined genes that were uniquely detected in HSPCs but not in LSK and vice versa (Figure S3). Although four genes were exclusively detected in LSK myeloid progenitors (Ivl, Gm14705, Gp6, Zfp819), 69 genes of diverse RNA categories were only detected in LS+K cells and categorized primarily to cell adhesion and immune system process (e.g., Kallikrein-related peptidases Klk1, Klk9, and Klk10; Figure S3), suggesting particular importance of these processes for the HSPC state. In conclusion, the differential expression pattern for key cellular processes demonstrates a distinct transcriptomic composition of multipotent and myeloid committed cells.

Cell-Cycle Activity Is Tightly Regulated in HSPCs and upon Myeloid Commitment

Next, we analyzed the protein-protein interaction network of the differentially expressed genes related to cell cycle (Figure 2A; ∗∗∗∗∗ in Figure 1F). We found negative regulators of mitosis to be highly expressed in LS+K cells (Cdkn1b/p27), in line with previous work demonstrating regulation of cell-cycle activity in HSPCs (Zou et al., 2011; Tesio and Trumpp, 2011). In contrast, a highly interconnected group of proliferation-related genes were enriched in LSK (Plk1, Rps6ka2). We validated these findings by real-time PCR and extended this expression analysis to more refined myeloid committed progenitors (CMP, MEP, GMP; Figures 2B, S4A, and S4B). Interestingly, MEPs and GMPs showed higher expression of cell-cycle activators but lower expression of inhibitors compared to HSPCs and CMPs. To test whether these gene expression patterns were reflected functionally, we assessed cell-cycle stages by flow cytometry (Figures 2C and 2D). In line, HSPCs showed significantly more cells in G0 (36%) compared to overall myeloid committed precursors (17%). Within the LSK compartment, CMPs were cycling less (23%; G0) compared to the more committed MEP (13%) and GMP (11%) populations. Together, these results suggest that the low cell-cycle activity in HSPCs becomes stepwise activated upon myeloid commitment.

Figure 2.

Figure 2

Cell-Cycle Stages of HSPCs and Myeloid Committed Precursors

(A) Protein-protein interaction analysis built based on GO terms related to cell cycle. Genes highly expressed in LS+K cells are shown in red, and genes higher in LSK cells are shown in blue. Log2 fold change is represented.

(B) Heatmap of cell-cycle regulators. mRNA expression levels in LS+K, LSK, CMPs, GMPs, and MEPs based on real-time PCR. Z scores were calculated per row to highlight expression differences per gene between populations. See Figures S4A and S4B.

(C and D) FACS analysis of cell-cycle stages of HSPCs and myeloid progenitors. (C) Shows representative gating scheme. (D) Quantification of three biological replicates. Error bars indicate ±SD of three independent biological replicates.

Global Analysis of Genes Involved in Transcriptional Control of Multipotency and Commitment

Next, we investigated the differentially expressed genes related to transcriptional control (∗∗∗ in Figure 1F). Notably, the majority of these were enriched in HSPCs (Figure 3; 146 out of 177; 82%) and only few in myeloid progenitors (31 out of 177). Classification of these 177 genes according to their molecular function (Figure 3A; Table S3) revealed various functional categories including ligand-receptors (31) and transcription factors (TFs; 62) (Figure 3A). We then assigned these differential regulators to signaling pathways (Table S3). We found the expression of the Bmp-Smad-TGF-beta (Bmp4, Smad1; Blank and Karlsson, 2011) and WntNotch (Wnt10b, Jag2; Bigas and Espinosa, 2012; Luis et al., 2012) signaling pathways to be highly enriched in HSPCs. Notably, six members of the Forkhead box protein family (e.g., Foxa3, Foxr1) were enriched in HSPCs (2- to 32-fold), whereas Foxh1 was lowly expressed (−8-fold). Next, we screened the expression of TF targets of Fox family members (Figure S4C). This analysis revealed genes possibly activated downstream of Foxo1 (Ccng2, Trib3) in LS+K and Foxh1 (Aldh1a1, Mixl1) in LSK cells; it also highlights factors potentially repressed by Foxh1 upon myeloid commitment (Aldh1a2, Pitx2).

Figure 3.

Figure 3

Differential Expression of Genes Involved in Transcriptional Control and Landscape of lncRNA in Multipotent and Myeloid Committed Progenitors

(A) Protein-protein interaction analysis built based on GO-terms related to transcriptional control. Each gene was functionally classified according to GeneGo MetaCore annotation and manual curation. Genes highly expressed in LS+K cells are shown in red, and genes higher expressed in LSK cells are shown in blue. The color bar indicates the color code for Log2 fold change. For the complete list of differentially expressed genes involved in transcriptional control, see Table S3.

(B–E) Differential landscape of lncRNAs in multipotent and myeloid committed progenitors.

(B) Differentially abundant lncRNAs. Of the 67 lncRNAs with significant change in abundance, the table shows ten examples with changes in each direction. For the complete list, see Table S1. Inf, infinite number, meaning read count of one population is zero.

(C) Top abundant lncRNAs. Ten examples for highly abundant lncRNAs sorted by mean counts.

(D) Confirmation of differentially expressed lncRNAs by real-time PCR. mRNA expression levels of lncRNAs Meg3 and Malat1 in LS+K (red) and LSK (blue) samples were quantified by real-time PCR. Numbers represent Log2 fold changes from real-time PCR (left, bold) and RNA-seq (right) data for LS+K compared to LSK. For all real-time PCR data, mean values means ± SD of three independent biological replicates with three technical replicates each are shown.

(E) Expression analysis of lncRNA Malat1 in HSCs and MPPs. mRNA expression levels of Malat1 in HSC, MPP1, MPP2, MPP3, MPP4 (Wilson et al., 2008), and LSK cells were analyzed by real-time PCR.

(F) Expression analysis of lncRNA 1700006J14Rik in myeloid committed precursors. RNA expression levels LS+K, LSK, CMPs, GMPs, and MEPs based on real-time PCR. p < 0.05; ∗∗p < 0.01; ∗∗∗p < 0.001; ∗∗∗∗p < 0.001 (two-sided Student’s t test).

Next, we investigated the expression of long noncoding RNAs (lncRNAs) in LS+K and LSK cells (Figures 3B–3F). From the 489 ENSEMBL-annotated lncRNAs for which we identified expression in our data, 67 were differentially expressed between HSPCs and LSK (FDR = 0.05; 55 up/ 12 down; Table S1). Strikingly, only two out of the 67 differentially expressed lncRNA set were functionally annotated (H19, Meg3; (Venkatraman et al., 2013; Yoshimizu et al., 2008; Zhou et al., 2012)). In contrast, other lncRNAs were also exclusive or strongly differentially expressed in HSPCs (A930001C03Rik, Gm12066) or LSKs (Gm12708), but they lacked characterization. One of the most abundant lncRNA was Malat1 (>800,000 counts in LSK cells; Figure 3C; Table S1), which showed more than 2-fold higher expression in HSPCs. Increased expression of Meg3 and Malat1 in HSPCs was confirmed by independent real-time PCR (Figure 3D). In order to examine the expression of Malat1 within the LS+K compartment, we performed real-time PCR in refined HSC and MPP populations (Wilson et al., 2008). We found Malat1 to be highly expressed in the most immature HSCs and MPP1 cells compared to later MPPs and LSKs (Figure 3E). On the contrary, the high expression of 1700006J14Rik in LSK was mostly due to its increased expression in MEPs, as shown by real-time PCR (Figure 3F). In addition to ENSEMBL-annotated lncRNAs, we predicted de novo transcripts using Cufflinks (Trapnell et al., 2010). In order to increase the confidence of the assemblies, we considered only multiexonic assemblies and we filtered these assemblies based on (1) their length (>200 bp), (2) proximity to annotated genes (>10 Mb), (3) coverage (>20 mean fragment counts across all the samples), and (4) lack of coding sequences (<100 consecutive codons). After these filters, we predicted 713 potential de novo lncRNAs, of which 149 were differentially expressed (FDR = 0.1; Table S4). This set represents a predictive starting point for further validation. Taken together, we provide a comprehensive expression landscape of transcriptional control factors and lncRNAs, which serves as a resource to investigate regulatory circuits controlling early adult hematopoiesis.

Regulation of Gene Expression in HSPCs and Myeloid Precursors

Next, we tested for differential usage of exons between LS+K and LSK cells using DEXseq (Figures 4A and 4B; Anders et al., 2012). In total, 4,096 genes showed evidence of differential exon usage (http://www-huber.embl.de/DEULSK/testForDEU.html), out of which 755 were differentially expressed genes. Because we identified a transcriptional control cluster (Table S3) to be enriched in HSPCs, we further investigated exon usage differences within this regulatory network. We found 48 out of 177 genes showing differential exon usage (Figure 4A). Of these, we exemplify the MyoD family inhibitor, Mdfi, a regulator of Wnt signaling (Kusano and Raab-Traub, 2002). Its fifth exon was almost exclusively expressed in HSPCs, suggesting specific roles for different Mdfi isoforms in stem/progenitors (Figure 4B). Complementarily, we used MISO (Katz et al., 2010) to quantify annotated alternative splicing events and used DEXSeq to test for differences in these annotated splicing events between the LS+K and LSK populations. Using this approach, we found 214 differentially spliced events (FDR = 0.1; Table S5). These findings suggest transcript isoform regulation as an unexpectedly abundant regulatory mechanism at the transition from HSPCs to LSK.

Figure 4.

Figure 4

Gene Expression Regulation in Multipotency and Commitment

(A and B) Analysis of transcript isoform regulation.

(A) Differential exon usage in genes involved in transcriptional control. Forty-eight out of 177 genes exhibited exon usage changes between LS+K and LSK cells in one to 12 exons (top 30 genes in alphabetical order are shown; for full annotation, see Table S3).

(B) Example: TF (Mdfi) shows differential exon usage. The data are consistent with higher levels of isoform #6 of Mdfi in LS+K (red) compared to LSK (blue).

(C–F) Analysis of posttranscriptional regulation. (C) Overview of integration of transcriptome and proteome data. Transcriptome RNA-seq data (19,824 genes quantified), and proteome data (5,027 proteins quantified) were mapped to each other based on Uniprot IDs using BioMart. Ninety-eight percent of quantified proteins could be mapped (4,919 out of 5,027 total). (D) Overall correlation. Correlation of RNA to protein expression ratios (Log2 fold change) for all 4,919 genes. Four hundred nineteen hits were significant on both RNA and protein level (FDR = 0.1), shown in red. R, Pearson correlation coefficient. (E) Correlation of 419 hits significant on RNA and protein level (FDR = 0.1). (F) GO enrichment analysis of 419 hits significant on RNA and protein level (FDR = 0.1). GO enrichment analysis was carried out on 342 hits, which were consistently up- or downregulated (yellow), and 77 hits, which were anticorrelated with increased protein but decreased RNA ratios (pink).

In order to explore potential posttranscriptional regulation, we integrated these RNA-seq data with our previously reported proteome data set generated from the identical cell populations (Klimmeck et al., 2012) (Figures 4C–4F; Table S6). Ninety-eight percent of all quantified proteins were assigned to the respective gene identifier (4,919 out of 5,027; Figure 4C). Of these, 419 were found to be differentially expressed both at the RNA and protein level (FDR = 0.1). Overall, the correlation coefficient of the RNA and protein fold changes was 0.39 (Figure 4D). However, when we restricted the genes to those detected as differentially expressed on RNA and protein levels, the correlation coefficient increased substantially (R = 0.81; Figure 4D). We found 82% of the differentially expressed genes with consistent sign of fold changes in mRNA and protein levels (Figure 4E, 342 out of 419). In contrast, a group of 77 hits showed anticorrelation in their fold changes, with increased protein but decreased transcript levels in HSPCs compared to LSKs (Figure 4E; Table 1). These anticorrelated genes were enriched for metabolic process and response to oxidative stress suggesting the involvement of posttranscriptional mechanisms in the regulation of these processes (Figure 4F). No significant anticorrelated hits with increased transcript but decreased protein level in HSPCs were found, suggesting less pronounced effects of paused translation or protein degradation during hematopoietic commitment.

Table 1.

List of 77 Anticorrelating Hits between RNA and Protein Level

Uniprot ID Ensembl Transcript ID Gene Name Protein Name RNA-seq Log2 Fold Change LS+K/LSK RNA-seq adj. p Value Proteome Log2 Fold Change LS+K/LSK Proteome adj. p Value Differential Exon Usage
Q8BHA3 ENSMUSG00000020956 6530401N04Rik D-tyrosyl-tRNA(Tyr) deacylase 2 −0.70 0.0692 0.65 0.0901 no
Q07417 ENSMUSG00000029545 Acads acyl-coenzyme A dehydrogenase, short chain, isoform CRA_a −0.79 0.0350 0.53 0.0035 no
P28271 ENSMUSG00000028405 Aco1 aconitase 1 −0.98 0.0041 0.39 0.0599 no
Q99KI0 ENSMUSG00000022477 Aco2 aconitate hydratase, mitochondrial −0.93 0.0058 0.26 0.0362 yes
Q9Z2N8 ENSMUSG00000027671 Actl6a actin-like protein 6A;23 kDa protein −0.75 0.0364 0.28 0.0888 no
P54822 ENSMUSG00000022407 Adsl adenylosuccinate lyase −0.68 0.0630 0.25 0.0350 no
Q9ESW4 ENSMUSG00000029916 Agk acylglycerol kinase, mitochondrial −0.71 0.0597 0.45 0.0270 yes
P50247 ENSMUSG00000048087 Ahcy adenosylhomocysteinase −1.20 0.0004 0.43 0.0050 no
Q9JII6 ENSMUSG00000028692 Akr1a1 alcohol dehydrogenase [NADP+] −0.88 0.0101 0.39 0.0423 yes
Q9JLJ2 ENSMUSG00000026687 Aldh9a1 aldehyde dehydrogenase 9A1 −1.05 0.0015 0.20 0.0548 no
P28352 ENSMUSG00000035960 Apex1 DNA-(apurinic or apyrimidinic site) lyase −0.64 0.0997 0.67 0.0290 no
Q564P4 ENSMUSG00000006589 Aprt adenine phosphoribosyltransferase −0.85 0.0137 0.55 0.0871 no
P84078 ENSMUSG00000048076 Arf1 ADP-ribosylation factor 1 −1.02 0.0020 0.28 0.0798 yes
Q99PT1 ENSMUSG00000025132 Arhgdia Rho GDP-dissociation inhibitor 1 −0.73 0.0391 0.18 0.0737 yes
P47754 ENSMUSG00000015733 Capza2 F-actin-capping protein subunit alpha-2 −0.69 0.0915 0.30 0.0708 yes
Q32P00 ENSMUSG00000057886 Cbx3 chromobox homolog 3 −0.81 0.0654 0.35 0.0082 no
Q91WS0 ENSMUSG00000037710 Cisd1 CDGSH iron sulfur domain-containing protein 1 −0.63 0.0942 0.43 0.0235 no
P61202 ENSMUSG00000027206 Cops2 COP9 signalosome complex subunit 2 −0.77 0.0316 0.28 0.0484 no
Q8C243 ENSMUSG00000007891 Ctsd cathepsin D −0.63 0.0903 0.27 0.0281 yes
Q91VR5 ENSMUSG00000037149 Ddx1 ATP-dependent RNA helicase DDX1 −0.87 0.0115 0.22 0.0858 yes
Q9ESX5 ENSMUSG00000031403 Dkc1 dyskeratosis congenita 1 −0.71 0.0492 0.27 0.0288 yes
P17182 ENSMUSG00000059040 Eno1 alpha-enolase −1.04 0.0106 0.18 0.0798 no
Q921G7 ENSMUSG00000027809 Etfdh electron transfer flavoprotein-ubiquinone oxidoreductase, mitochondrial −0.77 0.0325 0.39 0.0206 no
Q78JE5 ENSMUSG00000032309 Fbxo22 F-box only protein 22 −0.73 0.0494 0.46 0.0539 no
P23780 ENSMUSG00000045594 Glb1 β-galactosidase −0.69 0.0633 0.51 0.0447 no
Q9CPU0 ENSMUSG00000024026 Glo1 lactoylglutathione lyase −0.74 0.0424 0.80 0.0101 no
Q3UJH8 ENSMUSG00000025190 Got1 aspartate aminotransferase 1 −0.73 0.0524 0.52 0.0167 no
P11352 ENSMUSG00000063856 Gpx1 glutathione peroxidase 1 −1.09 0.0216 0.37 0.0167 no
P51859 ENSMUSG00000004897 Hdgf hepatoma-derived growth factor −0.67 0.0684 0.27 0.0334 yes
Q9ERZ0 ENSMUSG00000028332 Hemgn hemogen −1.06 0.0580 1.01 0.0582 no
Q3UE51 ENSMUSG00000037012 Hk1 hexokinase 1 −0.69 0.0520 1.03 0.0004 yes
P17095 ENSMUSG00000046711 Hmga1 high mobility group protein A1 −1.36 0.0000 1.12 0.0562 yes
O54879 ENSMUSG00000015217 Hmgb3 high mobility group protein B3 −0.83 0.0171 0.42 0.0202 no
B2M1R6 ENSMUSG00000021546 Hnrnpk heterogeneous nuclear ribonucleoprotein K −0.81 0.0185 0.23 0.0494 yes
Q9D0E1 ENSMUSG00000059208 Hnrnpm heterogeneous nuclear ribonucleoprotein M −0.79 0.0235 0.19 0.0809 yes
Q3TME6 ENSMUSG00000027531 Impa1 inositol (myo)-1(or 4)-monophosphatase 1 −0.86 0.0124 0.65 0.0059 no
P62482 ENSMUSG00000028931 Kcnab2 voltage-gated potassium channel subunit beta-2 −0.68 0.0602 0.36 0.0782 yes
Q9CPY7 ENSMUSG00000039682 Lap3 cytosol aminopeptidase −0.80 0.0225 0.24 0.0495 yes
P48678 ENSMUSG00000028063 Lmna ILamin-A/C −1.52 0.0051 1.19 0.0106 no
P97823 ENSMUSG00000025903 Lypla1 acyl-protein thioesterase 1 −0.84 0.0152 0.48 0.0220 yes
Q9CRB2 ENSMUSG00000001056 Nhp2 H/ACA ribonucleoprotein complex subunit 2 −0.79 0.0615 0.29 0.0862 no
Q9D0T1 ENSMUSG00000063543 Nhp2l1 NHP2-like protein 1 −1.01 0.0181 0.31 0.0283 no
Q64669 ENSMUSG00000003849 Nqo1 NAD(P)H dehydrogenase [quinone] 1 −1.05 0.0302 1.02 0.0430 no
Q63850 ENSMUSG00000043858 Nup62 nuclear pore glycoprotein p62 −0.88 0.0100 0.36 0.0560 no
Q8K183 ENSMUSG00000032788 Pdxk pyridoxal kinase −0.73 0.0432 0.50 0.0141 yes
P70296 ENSMUSG00000032959 Pebp1 phosphatidylethanolamine-binding protein 1 −0.76 0.0321 0.57 0.0027 no
Q5SUR0 ENSMUSG00000020899 Pfas phosphoribosylformylglycinamidine synthase −0.79 0.0240 0.19 0.0780 yes
Q9D0F9 ENSMUSG00000029171 Pgm1 phosphoglucomutase-1 −1.11 0.0007 0.89 0.0005 yes
Q7TSV4 ENSMUSG00000029171 Pgm2 phosphoglucomutase-2 −1.11 0.0007 0.64 0.0095 yes
P52480 ENSMUSG00000032294 Pkm isoform M2 of Pyruvate kinase isozymes M1/M2 −0.76 0.0340 0.30 0.0317 yes
P52480 ENSMUSG00000032294 Pkm isoform M1 of Pyruvate kinase isozymes M1/M2 −0.76 0.0340 1.27 0.0447 yes
P62137 ENSMUSG00000040385 Ppp1ca serine/threonine-protein phosphatase PP1-alpha −0.78 0.0263 0.18 0.0886 no
P58389 ENSMUSG00000039515 Ppp2r4 serine/threonine-protein phosphatase 2A regulatory subunit B −0.71 0.0482 0.36 0.0218 yes
Q6A0D0 ENSMUSG00000026701 Prdx6 peroxiredoxin-6 −1.05 0.0012 0.55 0.0050 yes
Q3U5I2 ENSMUSG00000006498 Ptbp1 polypyrimidine tract binding protein 1 −1.02 0.0019 0.26 0.0338 yes
Q8BVI4 ENSMUSG00000015806 Qdpr dihydropteridine reductase −0.63 0.0973 0.34 0.0087 no
Q8C2Q3 ENSMUSG00000006456 Rbm14 RNA-binding protein 14 −0.82 0.0264 0.64 0.0081 yes
Q9R1T2 ENSMUSG00000052833 Sae1 SUMO-activating enzyme subunit 1 −0.74 0.0381 0.18 0.0875 yes
Q9D154 ENSMUSG00000044734 Serpinb1a leukocyte elastase inhibitor A −0.69 0.0601 1.56 0.0029 no
Q9JJU8 ENSMUSG00000031246 Sh3bgrl SH3 domain-binding glutamic acid-rich-like protein −0.71 0.0494 0.27 0.0616 no
O55242 ENSMUSG00000036078 Sigmar1 sigma 1-type opioid receptor −0.68 0.0663 0.34 0.0421 no
Q8BH59 ENSMUSG00000027010 Slc25a12 calcium-binding mitochondrial carrier protein Aralar1 −1.01 0.0027 0.72 0.0082 yes
Q9QXX4 ENSMUSG00000015112 Slc25a13 calcium-binding mitochondrial carrier protein Aralar2 −0.79 0.0334 0.42 0.0360 no
P09671 ENSMUSG00000006818 Sod2 superoxide dismutase [Mn], mitochondrial −0.81 0.0203 0.60 0.0068 no
Q64442 ENSMUSG00000027227 Sord sorbitol dehydrogenase −0.68 0.0629 0.63 0.0108 yes
Q64105 ENSMUSG00000033735 Spr sepiapterin reductase −0.67 0.0762 0.87 0.0068 yes
Q9WUM5 ENSMUSG00000052738 Suclg1 succinyl-CoA ligase [GDP-forming] subunit alpha, mitochondrial −0.64 0.0855 0.33 0.0109 no
Q9Z2I8 ENSMUSG00000061838 Suclg2 succinyl-CoA ligase [GDP-forming] subunit beta, mitochondrial −0.69 0.0617 0.57 0.0140 no
Q93092 ENSMUSG00000025503 Taldo1 transaldolase −0.74 0.0386 0.20 0.0583 no
P10711 ENSMUSG00000033813 Tcea1 transcription elongation factor A protein 1 −0.74 0.0423 0.26 0.0562 yes
P17751 ENSMUSG00000023456 Tpi1 triosephosphate isomerase 1 −0.90 0.0078 0.86 0.0008 no
Q9QZE7 ENSMUSG00000056820 Tsnax translin-associated protein X −0.74 0.0408 0.76 0.0539 no
Q64727 ENSMUSG00000021823 Vcl vinculin −0.90 0.0178 0.28 0.0548 no
Q60932 ENSMUSG00000020402 Vdac1 voltage-dependent anion-selective channel protein 1 −1.00 0.0023 0.23 0.0698 no
P32921 ENSMUSG00000021266 Wars tryptophanyl-tRNA synthetase, cytoplasmic −0.66 0.0708 0.31 0.0430 yes
Q923D5 ENSMUSG00000030216 Wbp11 WW domain-binding protein 11 −0.81 0.0203 0.21 0.0895 no
P62259 ENSMUSG00000020849 Ywhae 14-3-3 protein epsilon −0.79 0.0238 0.34 0.0086 no

FDR = 0.1 for RNA and protein.

Extended Self-Protective Signature Highlights Posttranscriptional Regulation

To extend our findings to differentially expressed genes related to immune stress response, a protein-protein interaction analysis was carried out (Figures 5 and S5; Table S7), integrating the GO biological process annotation of the 3,236 differentially expressed genes ( in Figure 1F) and our previously described proteome immune signature (Klimmeck et al., 2012). The resulting network covered a broad range of cellular compartments and gene classes including e.g., secreted inflammatory cytokines (Cxcl2, Pf4), plasma membrane receptors (Cxcr1, Flt3), and transcription regulators (Mecom, Gata3). Enrichment of self-protective processes in HSPCs highlighted by our proteome analysis (e.g., viral dsRNA sensors [Ddx58, Oas3]) was confirmed by the combined analysis on RNA and protein level (Figure S5), indicating transcriptional shutdown of these programs upon commitment. Conversely, all detected genes involved in stress response to unfolded protein at the endoplasmic reticulum (Calr, Hsp90b) showed consistent suppression at RNA and protein level in HSPCs (Figures 5 and S5). In contrast, many proteins involved in mitochondrial response to oxidative stress (Nqo1, Sod2) showed strong anticorrelation with decreased RNA but increased protein levels, suggesting a coordinated regulation of translation in HSPCs and whereas Tfrc, a modulator of antibacterial cellular iron homeostasis was increased on both RNA and protein level in myeloid LSK cells, its two ligands Trf and Ltf as well as iron transporter Fth1 showed an anticorrelation. Finally, several factors involved in DNA repair (Xrcc6, Xpc) showed decreased RNA but increased protein expression in HSPCs, pointing to specific mechanisms leading to increased translation or protein stability (Figure 5). Taken together, by integrating transcriptome and proteome data, we achieved a global perspective on self-protective mechanisms in early hematopoiesis, which refines gene expression regulation to specific defense processes.

Figure 5.

Figure 5

Gene Expression Regulation of Self-Protective Immune Response in Multipotency and Commitment

Protein-protein interaction visualization of significantly overrepresented GO processes related to immune response (see in Figure 1F). For full list of involved differentially expressed genes, see Table S7. For high-resolution image of the interaction network, see Figure S5. Protein expression ratios were integrated for 138 out of 480 nodes total. Expression levels are displayed in color code (red, enriched in multipotent LS+K; blue, enriched in LSK).

Energy Metabolism in HSPCs

Next, we characterized the regulation of gene expression in metabolic genes (Figure 6; Table S8). Strikingly, more than half (seven out of 13) of the compared glycolytic enzymes showed anticorrelation between protein and RNA differences (Figures 6A and 6B). Whereas at the first step of glycolysis the mRNAs of all three hexokinases Hk1-3 were lowly expressed in HSPCs, only Hk1 showed higher expression at the protein level. At the final step of glycolysis, the pyruvate kinases Pkm1 and Pkm2 were highly expressed on protein levels but lowly expressed on RNA levels (Figures 6A and 6B). Furthermore, several enzymes involved in intracellular glycogen breakdown (Pgm1 and Pgm2) showed an anticorrelation between RNA and protein, which points to posttranscriptional regulation of this alternative entry into glycolysis. Although we did not detect significant changes for most enzymes involved in the TCA cycle, these transcripts showed higher mRNA levels in LSK. In addition, our analysis revealed anticorrelation in a cluster of 46 enzymes that are likely to contribute to the energy-creating infrastructure of HSPCs (e.g., galactose [Glb1/Bgal], superoxide/NADPH [Prdx6], and acetyl CoA fatty acid [Acads] metabolism). In summary, whereas parts of energy metabolism are consistently regulated both at mRNA and protein level, many enzymes show anticorrelation pointing to posttranscriptional regulation in HSPCs and LSK.

Figure 6.

Figure 6

Gene Expression Regulation of Energy Metabolism

Protein-protein interaction analysis of significantly overrepresented GO processes related to metabolism (∗∗∗∗ in Figure 1F). This network was imposed on a pathway map based on GeneGo pathway maps of glycolysis and gluconeogenesis. Expression levels are displayed in color code (red, enriched in multipotent LS+K; blue, enriched in LSK). For full list of involved differential hits, see Table S8.

(A) Protein expression levels.

(B) Gene expression levels. Genes with p adj <0.1 for both RNA and protein are shown with yellow (consistent changes) or pink boxes (anticorrelating changes). Gray squares indicate quantification in less than three replicates.

Discussion

In this study, we integrated transcriptome and proteome data and characterized the inventory of HSPCs and their immediate myeloid committed progeny (LSK) (Figure S6). With almost 20,000 identified expressed genes including more than 3,000 significantly regulated ones, these data represent a deep analysis comparing these two cell populations that can be accessed as an interactive online resource (http://vega.embl.de/LSK).

The specific cell adhesion repertoire exclusively detected in HSPCs strongly supports the concept of a distinct autonomous microenvironment of HSPCs in the bone marrow (Hanoun and Frenette, 2013; Wilson and Trumpp, 2006), which probably serves important protective functions, among others. In line with this notion, secreted kallikrein-related proteases exert pleiotropic roles in a broad range of physiological processes including inflammatory response (Sotiropoulou et al., 2009) and thus might contribute to HSPC specific defense signaling via proteolytic events e.g., via TGF-beta- or Protease-activated receptor (PAR)-GPCR signaling. The immune response network presented in this study significantly extends our knowledge of distinct self-protective mechanisms elevated in HSPCs already at steady state and is in agreement with the recently observed reversible activation of HSCs under conditions of stress (Essers et al., 2009; Trumpp et al., 2010).

Maintenance of HSC quiescence under homeostatic conditions is tightly linked to mitochondrial oxidative response (Takubo et al., 2013; Yu et al., 2013). In particular, response to oxidative stress is known to be regulated by a variety of mechanisms including targeting of RNA-binding proteins, RNA half-life, and translation efficiency (Vogel et al., 2011). The comparison of the transcriptome and proteome data sets demonstrates an anticorrelation for central superoxide enzymes like Gpx1 and Sod2 and provides evidence for posttranscriptional regulation of ROS response and other defense mechanisms like iron metabolism in early hematopoiesis. Notably, expression of iron-response proteins (IRPs) like FTH1 or ACO2 is tightly regulated via iron-responsive elements (IREs) (Hentze et al., 2010). In addition, the expression of regulatory hormone Hepcidin, which is crucial for cellular iron homeostasis, has been shown to be strongly dependent on Bmp—(Hemojuvelin [HJV])-Smad signaling (Andriopoulos et al., 2009). Thus, the BMP-Smad signaling axis, of which we found its components to be highly expressed in HSPCs, might as well be involved in the regulation of iron homeostasis in multipotent cells.

Although most differentially expressed genes showed consistent expression on RNA and protein level and therefore may not require posttranscriptional regulation, our correlation analysis uncovered isozyme specific regulation for essential glycolytic enzymes (hexokinases, pyruvate kinases) on either RNA or protein level, which argues for blocked translation or increased protein turnover. The higher transcript levels of TCA cycle enzymes potentially boost oxidative phosphorylation after commitment. Notably, our findings highlight additional regulation of carboxylic acid metabolism, which allows novel perspectives on the coordination of metabolism along early hematopoiesis and suggests that energy metabolism as well as the response to immune mediated stress are modulated by different gene-regulatory mechanisms affecting differential and process-specific transcription, translation, mRNA, or protein turnover. The immune response signature described here at steady state should be complemented by a global analysis of induced stress response of HSPCs.

Our study identifies TFs and lncRNAs as candidates controlling multipotency and/or commitment. Indeed, loss of TFs including Foxo3a, one of the differentially abundant genes in our study, has been demonstrated to severely affect HSPC integrity in functional KO mouse studies (Rossi et al., 2012). Because some lncRNAs are known to facilitate expression of transcriptional regulators during development (Pauli et al., 2011), the differentially expressed lncRNAs identified might exert similar roles in HSPCs, leading to stabilization of multipotency. Malat1, which is upregulated in several human tumors and plays an important role in extravasation in lung cancer metastasis (Gutschner et al., 2013), might be involved in promoting stem cell motility and HSC-niche interactions in the bone marrow. In addition, because the vast majority of differentially expressed lncRNAs lack molecular and biological characterization, they are now attractive candidates for further exploration in vivo. In summary, in this study we provide the global landscape gene expression of protein-coding and noncoding transcripts during early hematopoiesis. It represents a comprehensive resource for the stem cell field and will serve as a valuable resource for functional exploration of self-renewal, multipotency, and lineage determination.

Experimental Procedures

Animals

Eight- to 12-week-old female C57BL/6 mice purchased from Harlan Laboratories were used throughout the study. All mice were maintained in the animal facility at DKFZ, under specific pathogen-free (SPF) conditions and kept in individually ventilated cages (IVCs). Animal procedures were performed according to protocols approved by the German authorities, Regierungspräsidium Karlsruhe (Nr. Z110/02, DKFZ #261).

FACS Staining, Sorting, and Cell-Cycle Analysis

Bone marrow was isolated from hind legs (femur, tibia), hips (ilium), and backbone (vertebra). Muscle, connective tissue, and spinal cord were removed; bones were crushed in RPMI/2% FBS (GIBCO) using mortar and pestle. Single-cell suspensions were made by flushing through a 40 μm filter mesh. Cell numbers and viability were determined using a ViCell Counter (Beckman Coulter). To deplete lineage-positive cells, total bone marrow was stained 30 min with a combination of monoclonal rat antibodies directed against mature cell specific lineage markers (for detailed specifications, see Supplemental Experimental Procedures). Labeled cells were incubated for 20 min with polyclonal sheep anti-rat immunoglobulin-G-coated magnetic Dynabeads (Invitrogen) at a ratio 2:1 beads to cell and depleted using a magnet, enriching for the lineage-negative (Linneg) cell fraction. Beads were washed twice with RPMI/2% FBS to harvest residual cell fractions. Centrifugation steps were carried out at 1,500 rpm and 4°C for 5 min (5810r, Eppendorf). To specify multipotent and myeloid progenitor fractions, as well as refined HSC and MPP populations, the Linneg fraction was stained 30 min using rat monoclonal fluorochrome-coupled antibodies (see Supplemental Experimental Procedures). All antibodies were titrated prior to use. Cell sorting was performed on a FACS Aria I or II (Becton Dickinson) at the DKFZ Flow Cytometry Service Unit, using the following sort parameters: 70 μm nozzle; 15,000 evt/s; 70 psi. LS+K cells and LSK cells were obtained by sorting LinnegSCA-1+c-KIT+ and LinnegSCA-1+c-KIT, respectively, in biological triplicate (Figures S1A and S1B). Sorted cells were collected into ice-cold RNA lysis buffer (ARCTURUS PicoPure RNA Isolation Kit [Life Technologies, Invitrogen]) and stored at −80°C until further usage. Of note, to determine sample’s purity we resorted a fraction of each population (Figure S1B; purity greater than 95%). Cell-cycle analysis together with six-color surface staining to define myeloid subsets was performed as previously described (Wilson et al., 2004) using Ki67-FITC (BD Biosciences) and Hoechst 33342 (Molecular Probes).

Total RNA Isolation and RNA-Seq

Total RNA isolation was performed from the indicated populations using ARCTURUS PicoPure RNA Isolation Kit (Life Technologies, Invitrogen) according to the manufacturer’s instructions. DNase treatment was performed using RNase-free DNase Set (QIAGEN). Total RNA was used for quality controls and for normalization of starting material (Figure S2). cDNA-libraries were generated with 10 ng of total RNA using the SMARTer Ultra Low RNA Kit for Illumina Sequencing (Clontech) according to the manufacturer’s indications. Of note, 12 cycles were used for the amplification of cDNA, respectively. Paired-end adaptors were applied to each population. Sequencing was performed with the HiSeq2000 device (Illumina) and one sample per lane. Quality controls before and after sequencing and schematic overview of sampling workflow are shown in Figures S1 and S2.

Processing of RNA-Seq Data

RNA-seq reads were aligned to the reference genome of Mus musculus GRcm38 from ENSEMBL release 69 (Flicek et al., 2013) using GSNAP version 2012-07-20. The alignment quality statistics were computed using scripts based on the HTSeq Python library (Anders et al., 2014) and R/Bioconductor (Gentleman et al., 2004). The results from these statistics are available in Figure S2. Only uniquely aligned reads unambiguously assigned to annotated exons were considered for differential expression analysis. The differential expression analysis was done using DESeq (Anders and Huber, 2010), and the differential exon usage analysis (http://www-huber.embl.de/DEULSK/testForDEU.html) was done using DEXSeq (Anders et al., 2012). In RNA-seq count data, the power to detect differentially expressed genes varies widely through the dynamic range. Therefore, to avoid associated biases in gene set enrichment analysis, we generated background set of genes whose expression distribution approximated the expression distribution of our set of differentially expressed genes (Ho et al., 2007). Genes were called as quantified genes if they had at least 20 read counts per cell type. We tested differentially expressed genes from the RNA-seq data for overrepresented GO categories using Fisher’s exact test. The R/Bioconductor scripts used for this study are available as an online resource (http://www-huber.embl.de/DEULSK/supplementary_file.pdf).

Gene Expression Analysis by Real-Time PCR

For real-time PCR, total RNA was isolated as described above and reverse-transcribed using Superscript III reverse transcriptase (Life Technologies, Invitrogen). The PCR was performed using the Fast SYBR Green Master Mix (Applied Biosystems) on a ViiA 7 Real-Time PCR System (Applied Biosystems) according to the manufacturer’s instructions. Primers were designed using the Universal Probe Library Assay Design Center (Roche). For primers used, see Supplemental Experimental Procedures. The analysis of amplification curves was carried out using the ViAA 7 Software v1.1 (Applied Biosystems).

Bioinformatic Analysis

For Integration of transcriptomics and proteomics, the ENSEMBL identifiers were associated to their respective UniProt identifiers using BioMart (Durinck et al., 2005). For the purpose of this analysis, RNAs and proteins were considered differentially expressed if the adjusted p value <0.1. We tested for GO enriched categories using Fishers’ exact test, the functional annotation tool of MetaCore (GeneGo; Nikolsky et al., 2005), protein classification by Panther (Mi et al., 2007), and the BiNGO plugin (Maere et al., 2005) for Cytoscape (Shannon et al., 2003). To investigate interactions between differentially expressed proteins, networks were constructed using STRING (Szklarczyk et al., 2011). Interaction networks were visualized using Cytoscape; nodes were arranged after manual curation according to Gene Ontology (UniProt; (Jain et al., 2009), extensive literature search, and STRING interaction scores. To identify possible transcription factor targets, the Transcription Factor Encyclopedia (Yusuf et al., 2012) was used as a resource.

Author Contributions

D.K., N.C.-W., and A.T. designed and coordinated the study. A.T. and W.H. (bioinformatics) designed and supervised the experiments and interpreted the data. N.C.-W., A.R., and D.K performed the RNA-seq experiments, bioinformatic analysis, and data interpretation. A.R. built the interactive online data resource. D.K. and N.C.-W. coordinated the animal experiments and performed FACS. L.v.P., S.R., and D.K. performed real-time PCR. D.K., N.C.-W., A.R., and J.H. together with J.K., W.H., and A.T. wrote the manuscript.

Acknowledgments

We thank S. Wolf, S. Schmidt, U. Ernst, A. Hotz-Wagenblatt, and C. Previti from the Genomics Proteomics Core Facility and A. Atzberger and S. Schmitt from the Flow Cytometry Core Facility at DKFZ for their expert assistance. This work was supported by the BioRN Spitzencluster “Molecular and Cell Based Medicine” supported by the German Bundesministerium für Bildung und Forschung, the Sonderforschungsbereich SFB873 funded by the Deutsche Forschungsgemeinschaft, and the Dietmar Hopp Foundation (A.T.). W.H and A.R. acknowledge funding from the European Community’s Seventh Framework Programme (FP7/2007-2013), project RADIANT (grant agreement no. 305626).

Footnotes

This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/).

Contributor Information

Wolfgang Huber, Email: wolfgang.huber@embl.de.

Andreas Trumpp, Email: a.trumpp@dkfz.de.

Accession Numbers

The RNA-seq data sets from LS+K and LSK cells can be accessed through ArrayExpress (https://www.ebi.ac.uk/arrayexpress; accession number E-MTAB-1963). An interactive online resource that also integrates the previous proteome data (Klimmeck et al., 2012) can be found under the following link: http://vega.embl.de/LSK.

Supplemental Information

Supplemental Information includes Supplemental Experimental Procedures, six figures, and eight tables and can be found with this article online at http://dx.doi.org/10.1016/j.stemcr.2014.08.012.

Supplemental Information

Document S1. Supplemental Experimental Procedures and Figures S1–S6
mmc1.pdf (13.3MB, pdf)
Table S1. RNA-Seq Data of LS+K and LSK Cells

RNA-seq data for all 37,315 genes. These include 3,236 genes with differential expression between LS+K and LSK cells (FDR = 0.05).

mmc2.xls (11.3MB, xls)
Table S2. GO Term Enrichment in Differentially Expressed Genes

All GO terms with p adj. value <0.05 and GO term listed in Figure 1F are separated in different sheets.

mmc3.xls (2.6MB, xls)
Table S3. Differentially Expressed Genes Involved in Transcriptional Control

One hundred seventy-seven differentially expressed genes between LS+K and LSK involved in transcriptional control (FDR=0.05). Molecular functions according to GeneGo and Panther are annotated. See also Figure 3A.

mmc4.xls (81KB, xls)
Table S4. Annotation of De Novo/Putative Long Noncoding RNAs

Annotation file containing the filtered assemblies from Cufflinks in General Transfer Format (GTF) format. The initial assemblies from Cufflinks were filtered based on (1) their length (>200 bp), (2) proximity to annotated genes (>10 Mb), (3) coverage (>20 mean fragment counts across all the samples), and (4) lack of coding sequences (<100 consecutive codons). Additionally, we only considered multiexonic assemblies. A total of 713 potential de novo predicted long noncoding RNAs and 149 differentially expressed de novo predicted long noncoding RNAs (FDR = 0.05).

mmc5.zip (52.4KB, zip)
Table S5. Differential Splicing Analysis

Two hundred fourteen differentially spliced events (FDR = 0.1) found using MISO are listed.

mmc6.xls (66KB, xls)
Table S6. Correlation of RNA and Protein Expression Changes

RNA-seq and proteomics (Klimmeck et al., 2012) data for the 4,919 mapped quantified proteins. For this analysis, differentially expressed hits with FDR = 0.1 were considered.

mmc7.xls (2.3MB, xls)
Table S7. Differentially Expressed Genes Involved in Immune Response

Six hundred forty differentially expressed hits (FDR = 0.05) related to immune response. See also Figures 5 and S5.

mmc8.xls (130.5KB, xls)
Table S8. Differentially Expressed Genes Involved in Energy Metabolism

One hundred thirty-one differentially expressed genes and proteins (FDR = 0.05). See also Figure 6.

mmc9.xls (115.5KB, xls)
Document S2. Article plus Supplemental Information
mmc10.pdf (17.1MB, pdf)

References

  1. Adolfsson J., Månsson R., Buza-Vidas N., Hultquist A., Liuba K., Jensen C.T., Bryder D., Yang L., Borge O.J., Thoren L.A. Identification of Flt3+ lympho-myeloid stem cells lacking erythro-megakaryocytic potential a revised road map for adult blood lineage commitment. Cell. 2005;121:295–306. doi: 10.1016/j.cell.2005.02.013. [DOI] [PubMed] [Google Scholar]
  2. Akashi K., Traver D., Miyamoto T., Weissman I.L. A clonogenic common myeloid progenitor that gives rise to all myeloid lineages. Nature. 2000;404:193–197. doi: 10.1038/35004599. [DOI] [PubMed] [Google Scholar]
  3. Anders S., Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010;11:R106. doi: 10.1186/gb-2010-11-10-r106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Anders S., Reyes A., Huber W. Detecting differential usage of exons from RNA-seq data. Genome Res. 2012;22:2008–2017. doi: 10.1101/gr.133744.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Anders S., Pyl P.T., Huber W. HTSeq–a Python framework to work with high-throughput sequencing data. bioRxiv. 2014 doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Andriopoulos B., Jr., Corradini E., Xia Y., Faasse S.A., Chen S., Grgurevic L., Knutson M.D., Pietrangelo A., Vukicevic S., Lin H.Y., Babitt J.L. BMP6 is a key endogenous regulator of hepcidin expression and iron metabolism. Nat. Genet. 2009;41:482–487. doi: 10.1038/ng.335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bigas A., Espinosa L. Hematopoietic stem cells: to be or Notch to be. Blood. 2012;119:3226–3235. doi: 10.1182/blood-2011-10-355826. [DOI] [PubMed] [Google Scholar]
  8. Blank U., Karlsson S. The role of Smad signaling in hematopoiesis and translational hematology. Leukemia. 2011;25:1379–1388. doi: 10.1038/leu.2011.95. [DOI] [PubMed] [Google Scholar]
  9. Cox J., Mann M. Is proteomics the new genomics? Cell. 2007;130:395–398. doi: 10.1016/j.cell.2007.07.032. [DOI] [PubMed] [Google Scholar]
  10. Durinck S., Moreau Y., Kasprzyk A., Davis S., De Moor B., Brazma A., Huber W. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics. 2005;21:3439–3440. doi: 10.1093/bioinformatics/bti525. [DOI] [PubMed] [Google Scholar]
  11. Essers M.A., Offner S., Blanco-Bose W.E., Waibler Z., Kalinke U., Duchosal M.A., Trumpp A. IFNalpha activates dormant haematopoietic stem cells in vivo. Nature. 2009;458:904–908. doi: 10.1038/nature07815. [DOI] [PubMed] [Google Scholar]
  12. Flicek P., Ahmed I., Amode M.R., Barrell D., Beal K., Brent S., Carvalho-Silva D., Clapham P., Coates G., Fairley S. Ensembl 2013. Nucleic Acids Res. 2013;41:D48–D55. doi: 10.1093/nar/gks1236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gazit R., Garrison B.S., Rao T.N., Shay T., Costello J., Ericson J., Kim F., Collins J.J., Regev A., Wagers A.J., Rossi D.J., Immunological Genome Project Consortium Transcriptome analysis identifies regulators of hematopoietic stem and progenitor cells. Stem Cell Rev. 2013;1:266–280. doi: 10.1016/j.stemcr.2013.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Gentleman R.C., Carey V.J., Bates D.M., Bolstad B., Dettling M., Dudoit S., Ellis B., Gautier L., Ge Y., Gentry J. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004;5:R80. doi: 10.1186/gb-2004-5-10-r80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Graf T., Enver T. Forcing cells to change lineages. Nature. 2009;462:587–594. doi: 10.1038/nature08533. [DOI] [PubMed] [Google Scholar]
  16. Gutschner T., Diederichs S. The hallmarks of cancer: a long non-coding RNA point of view. RNA Biol. 2012;9:703–719. doi: 10.4161/rna.20481. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gutschner T., Hämmerle M., Eissmann M., Hsu J., Kim Y., Hung G., Revenko A., Arun G., Stentrup M., Gross M. The noncoding RNA MALAT1 is a critical regulator of the metastasis phenotype of lung cancer cells. Cancer Res. 2013;73:1180–1189. doi: 10.1158/0008-5472.CAN-12-2850. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hanoun M., Frenette P.S. This niche is a maze; an amazing niche. Cell Stem Cell. 2013;12:391–392. doi: 10.1016/j.stem.2013.03.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Hentze M.W., Muckenthaler M.U., Galy B., Camaschella C. Two to tango: regulation of Mammalian iron metabolism. Cell. 2010;142:24–38. doi: 10.1016/j.cell.2010.06.028. [DOI] [PubMed] [Google Scholar]
  20. Ho D.E., Imai K., King G., Stuart E.A. Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference. Polit. Anal. 2007;15:199. [Google Scholar]
  21. Jain E., Bairoch A., Duvaud S., Phan I., Redaschi N., Suzek B.E., Martin M.J., McGarvey P., Gasteiger E. Infrastructure for the life sciences: design and implementation of the UniProt website. BMC Bioinformatics. 2009;10:136. doi: 10.1186/1471-2105-10-136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Katz Y., Wang E.T., Airoldi E.M., Burge C.B. Analysis and design of RNA sequencing experiments for identifying isoform regulation. Nat. Methods. 2010;7:1009–1015. doi: 10.1038/nmeth.1528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kent D., Copley M., Benz C., Dykstra B., Bowie M., Eaves C. Regulation of hematopoietic stem cells by the steel factor/KIT signaling pathway. Clin. Cancer Res. 2008;14:1926–1930. doi: 10.1158/1078-0432.CCR-07-5134. [DOI] [PubMed] [Google Scholar]
  24. Klimmeck D., Hansson J., Raffel S., Vakhrushev S.Y., Trumpp A., Krijgsveld J. Proteomic cornerstones of hematopoietic stem cell differentiation: distinct signatures of multipotent progenitors and myeloid committed cells. Mol. Cell. Proteomics. 2012;11:286–302. doi: 10.1074/mcp.M111.016790. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Korkmaz B., Moreau T., Gauthier F. Neutrophil elastase, proteinase 3 and cathepsin G: physicochemical properties, activity and physiopathological functions. Biochimie. 2008;90:227–242. doi: 10.1016/j.biochi.2007.10.009. [DOI] [PubMed] [Google Scholar]
  26. Kusano S., Raab-Traub N. I-mfa domain proteins interact with Axin and affect its regulation of the Wnt and c-Jun N-terminal kinase signaling pathways. Mol. Cell. Biol. 2002;22:6393–6405. doi: 10.1128/MCB.22.18.6393-6405.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Luis T.C., Ichii M., Brugman M.H., Kincade P., Staal F.J. Wnt signaling strength regulates normal hematopoiesis and its deregulation is involved in leukemia development. Leukemia. 2012;26:414–421. doi: 10.1038/leu.2011.387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Maere S., Heymans K., Kuiper M. BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics. 2005;21:3448–3449. doi: 10.1093/bioinformatics/bti551. [DOI] [PubMed] [Google Scholar]
  29. Mercer T.R., Dinger M.E., Mattick J.S. Long non-coding RNAs: insights into functions. Nat. Rev. Genet. 2009;10:155–159. doi: 10.1038/nrg2521. [DOI] [PubMed] [Google Scholar]
  30. Mi H., Guo N., Kejariwal A., Thomas P.D. PANTHER version 6: protein sequence and function evolution data with expanded representation of biological pathways. Nucleic Acids Res. 2007;35:D247–D252. doi: 10.1093/nar/gkl869. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Moignard V., Macaulay I.C., Swiers G., Buettner F., Schütte J., Calero-Nieto F.J., Kinston S., Joshi A., Hannah R., Theis F.J. Characterization of transcriptional networks in blood stem and progenitor cells using high-throughput single-cell gene expression analysis. Nat. Cell Biol. 2013;15:363–372. doi: 10.1038/ncb2709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Murphy M.J., Wilson A., Trumpp A. More than just proliferation: Myc function in stem cells. Trends Cell Biol. 2005;15:128–137. doi: 10.1016/j.tcb.2005.01.008. [DOI] [PubMed] [Google Scholar]
  33. Nikolsky Y., Ekins S., Nikolskaya T., Bugrim A. A novel method for generation of signature networks as biomarkers from complex high throughput data. Toxicol. Lett. 2005;158:20–29. doi: 10.1016/j.toxlet.2005.02.004. [DOI] [PubMed] [Google Scholar]
  34. Ozsolak F., Milos P.M. RNA sequencing: advances, challenges and opportunities. Nat. Rev. Genet. 2011;12:87–98. doi: 10.1038/nrg2934. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Paralkar V.R., Weiss M.J. Long noncoding RNAs in biology and hematopoiesis. Blood. 2013;121:4842–4846. doi: 10.1182/blood-2013-03-456111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Pauli A., Rinn J.L., Schier A.F. Non-coding RNAs as regulators of embryogenesis. Nat. Rev. Genet. 2011;12:136–149. doi: 10.1038/nrg2904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Pronk C.J., Rossi D.J., Månsson R., Attema J.L., Norddahl G.L., Chan C.K., Sigvardsson M., Weissman I.L., Bryder D. Elucidation of the phenotypic, functional, and molecular topography of a myeloerythroid progenitor cell hierarchy. Cell Stem Cell. 2007;1:428–442. doi: 10.1016/j.stem.2007.07.005. [DOI] [PubMed] [Google Scholar]
  38. Purton L.E., Scadden D.T. Limiting factors in murine hematopoietic stem cell assays. Cell Stem Cell. 2007;1:263–270. doi: 10.1016/j.stem.2007.08.016. [DOI] [PubMed] [Google Scholar]
  39. Qureshi I.A., Mehler M.F. Emerging roles of non-coding RNAs in brain evolution, development, plasticity and disease. Nat. Rev. Neurosci. 2012;13:528–541. doi: 10.1038/nrn3234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Rossi L., Lin K.K., Boles N.C., Yang L., King K.Y., Jeong M., Mayle A., Goodell M.A. Less is more: unveiling the functional core of hematopoietic stem cells through knockout mice. Cell Stem Cell. 2012;11:302–317. doi: 10.1016/j.stem.2012.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Schwanhäusser B., Busse D., Li N., Dittmar G., Schuchhardt J., Wolf J., Chen W., Selbach M. Global quantification of mammalian gene expression control. Nature. 2011;473:337–342. doi: 10.1038/nature10098. [DOI] [PubMed] [Google Scholar]
  42. Seita J., Weissman I.L. Hematopoietic stem cell: self-renewal versus differentiation. Wiley Interdiscip. Rev. Syst. Biol. Med. 2010;2:640–653. doi: 10.1002/wsbm.86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Shannon P., Markiel A., Ozier O., Baliga N.S., Wang J.T., Ramage D., Amin N., Schwikowski B., Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Sotiropoulou G., Pampalakis G., Diamandis E.P. Functional roles of human kallikrein-related peptidases. J. Biol. Chem. 2009;284:32989–32994. doi: 10.1074/jbc.R109.027946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Spooncer E., Brouard N., Nilsson S.K., Williams B., Liu M.C., Unwin R.D., Blinco D., Jaworska E., Simmons P.J., Whetton A.D. Developmental fate determination and marker discovery in hematopoietic stem cell biology using proteomic fingerprinting. Mol. Cell. Proteomics. 2008;7:573–581. doi: 10.1074/mcp.M700292-MCP200. [DOI] [PubMed] [Google Scholar]
  46. Sun D., Luo M., Jeong M., Rodriguez B., Xia Z., Hannah R., Wang H., Le T., Faull K.F., Chen R. Epigenomic profiling of young and aged HSCs reveals concerted changes during aging that reinforce self-renewal. Cell Stem Cell. 2014;14:673–688. doi: 10.1016/j.stem.2014.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Szklarczyk D., Franceschini A., Kuhn M., Simonovic M., Roth A., Minguez P., Doerks T., Stark M., Muller J., Bork P. The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res. 2011;39:D561–D568. doi: 10.1093/nar/gkq973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Takubo K., Nagamatsu G., Kobayashi C.I., Nakamura-Ishizu A., Kobayashi H., Ikeda E., Goda N., Rahimi Y., Johnson R.S., Soga T. Regulation of glycolysis by Pdk functions as a metabolic checkpoint for cell cycle quiescence in hematopoietic stem cells. Cell Stem Cell. 2013;12:49–61. doi: 10.1016/j.stem.2012.10.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Tesio M., Trumpp A. Breaking the cell cycle of HSCs by p57 and friends. Cell Stem Cell. 2011;9:187–192. doi: 10.1016/j.stem.2011.08.005. [DOI] [PubMed] [Google Scholar]
  50. Till J.E., McCulloch E.A. A direct measurement of the radiation sensitivity of normal mouse bone marrow cells. Radiat. Res. 1961;14:213–222. [PubMed] [Google Scholar]
  51. Trapnell C., Williams B.A., Pertea G., Mortazavi A., Kwan G., van Baren M.J., Salzberg S.L., Wold B.J., Pachter L. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 2010;28:511–515. doi: 10.1038/nbt.1621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Trumpp A., Essers M., Wilson A. Awakening dormant haematopoietic stem cells. Nat. Rev. Immunol. 2010;10:201–209. doi: 10.1038/nri2726. [DOI] [PubMed] [Google Scholar]
  53. Uchida S., Gellert P., Braun T. Deeply dissecting stemness: making sense to non-coding RNAs in stem cells. Stem Cell Rev. 2012;8:78–86. doi: 10.1007/s12015-011-9294-y. [DOI] [PubMed] [Google Scholar]
  54. Venkatraman A., He X.C., Thorvaldsen J.L., Sugimura R., Perry J.M., Tao F., Zhao M., Christenson M.K., Sanchez R., Yu J.Y. Maternal imprinting at the H19-Igf2 locus maintains adult haematopoietic stem cell quiescence. Nature. 2013;500:345–349. doi: 10.1038/nature12303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Vogel C., Marcotte E.M. Insights into the regulation of protein abundance from proteomic and transcriptomic analyses. Nat. Rev. Genet. 2012;13:227–232. doi: 10.1038/nrg3185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Vogel C., Silva G.M., Marcotte E.M. Protein expression regulation under oxidative stress. Mol. Cell. Proteomics. 2011;10:009217. doi: 10.1074/mcp.M111.009217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Weissman I.L., Shizuru J.A. The origins of the identification and isolation of hematopoietic stem cells, and their capability to induce donor-specific transplantation tolerance and treat autoimmune diseases. Blood. 2008;112:3543–3553. doi: 10.1182/blood-2008-08-078220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Wilson A., Trumpp A. Bone-marrow haematopoietic-stem-cell niches. Nat. Rev. Immunol. 2006;6:93–106. doi: 10.1038/nri1779. [DOI] [PubMed] [Google Scholar]
  59. Wilson A., Murphy M.J., Oskarsson T., Kaloulis K., Bettess M.D., Oser G.M., Pasche A.C., Knabenhans C., Macdonald H.R., Trumpp A. c-Myc controls the balance between hematopoietic stem cell self-renewal and differentiation. Genes Dev. 2004;18:2747–2763. doi: 10.1101/gad.313104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Wilson A., Laurenti E., Oser G., van der Wath R.C., Blanco-Bose W., Jaworski M., Offner S., Dunant C.F., Eshkind L., Bockamp E. Hematopoietic stem cells reversibly switch from dormancy to self-renewal during homeostasis and repair. Cell. 2008;135:1118–1129. doi: 10.1016/j.cell.2008.10.048. [DOI] [PubMed] [Google Scholar]
  61. Wilson A., Laurenti E., Trumpp A. Balancing dormant and self-renewing hematopoietic stem cells. Curr. Opin. Genet. Dev. 2009;19:461–468. doi: 10.1016/j.gde.2009.08.005. [DOI] [PubMed] [Google Scholar]
  62. Yoon J.H., Abdelmohsen K., Gorospe M. Posttranscriptional gene regulation by long noncoding RNA. J. Mol. Biol. 2013;425:3723–3730. doi: 10.1016/j.jmb.2012.11.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Yoshimizu T., Miroglio A., Ripoche M.A., Gabory A., Vernucci M., Riccio A., Colnot S., Godard C., Terris B., Jammes H., Dandolo L. The H19 locus acts in vivo as a tumor suppressor. Proc. Natl. Acad. Sci. USA. 2008;105:12417–12422. doi: 10.1073/pnas.0801540105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Yu W.M., Liu X., Shen J., Jovanovic O., Pohl E.E., Gerson S.L., Finkel T., Broxmeyer H.E., Qu C.K. Metabolic regulation by the mitochondrial phosphatase PTPMT1 is required for hematopoietic stem cell differentiation. Cell Stem Cell. 2013;12:62–74. doi: 10.1016/j.stem.2012.11.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Yusuf D., Butland S.L., Swanson M.I., Bolotin E., Ticoll A., Cheung W.A., Zhang X.Y., Dickman C.T., Fulton D.L., Lim J.S. The transcription factor encyclopedia. Genome Biol. 2012;13:R24. doi: 10.1186/gb-2012-13-3-r24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Zhou Y., Zhang X., Klibanski A. MEG3 noncoding RNA: a tumor suppressor. J. Mol. Endocrinol. 2012;48:R45–R53. doi: 10.1530/JME-12-0008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Zou P., Yoshihara H., Hosokawa K., Tai I., Shinmyozu K., Tsukahara F., Maru Y., Nakayama K., Nakayama K.I., Suda T. p57(Kip2) and p27(Kip1) cooperate to maintain hematopoietic stem cell quiescence through interactions with Hsc70. Cell Stem Cell. 2011;9:247–261. doi: 10.1016/j.stem.2011.07.003. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Supplemental Experimental Procedures and Figures S1–S6
mmc1.pdf (13.3MB, pdf)
Table S1. RNA-Seq Data of LS+K and LSK Cells

RNA-seq data for all 37,315 genes. These include 3,236 genes with differential expression between LS+K and LSK cells (FDR = 0.05).

mmc2.xls (11.3MB, xls)
Table S2. GO Term Enrichment in Differentially Expressed Genes

All GO terms with p adj. value <0.05 and GO term listed in Figure 1F are separated in different sheets.

mmc3.xls (2.6MB, xls)
Table S3. Differentially Expressed Genes Involved in Transcriptional Control

One hundred seventy-seven differentially expressed genes between LS+K and LSK involved in transcriptional control (FDR=0.05). Molecular functions according to GeneGo and Panther are annotated. See also Figure 3A.

mmc4.xls (81KB, xls)
Table S4. Annotation of De Novo/Putative Long Noncoding RNAs

Annotation file containing the filtered assemblies from Cufflinks in General Transfer Format (GTF) format. The initial assemblies from Cufflinks were filtered based on (1) their length (>200 bp), (2) proximity to annotated genes (>10 Mb), (3) coverage (>20 mean fragment counts across all the samples), and (4) lack of coding sequences (<100 consecutive codons). Additionally, we only considered multiexonic assemblies. A total of 713 potential de novo predicted long noncoding RNAs and 149 differentially expressed de novo predicted long noncoding RNAs (FDR = 0.05).

mmc5.zip (52.4KB, zip)
Table S5. Differential Splicing Analysis

Two hundred fourteen differentially spliced events (FDR = 0.1) found using MISO are listed.

mmc6.xls (66KB, xls)
Table S6. Correlation of RNA and Protein Expression Changes

RNA-seq and proteomics (Klimmeck et al., 2012) data for the 4,919 mapped quantified proteins. For this analysis, differentially expressed hits with FDR = 0.1 were considered.

mmc7.xls (2.3MB, xls)
Table S7. Differentially Expressed Genes Involved in Immune Response

Six hundred forty differentially expressed hits (FDR = 0.05) related to immune response. See also Figures 5 and S5.

mmc8.xls (130.5KB, xls)
Table S8. Differentially Expressed Genes Involved in Energy Metabolism

One hundred thirty-one differentially expressed genes and proteins (FDR = 0.05). See also Figure 6.

mmc9.xls (115.5KB, xls)
Document S2. Article plus Supplemental Information
mmc10.pdf (17.1MB, pdf)

Articles from Stem Cell Reports are provided here courtesy of Elsevier

RESOURCES