Skip to main content
National Science Review logoLink to National Science Review
. 2020 Aug 24;8(3):nwaa180. doi: 10.1093/nsr/nwaa180

Single-cell transcriptomic landscape of human blood cells

Xiaowei Xie 1,2,d, Mengyao Liu 3,4,d, Yawen Zhang 5,6,d, Bingrui Wang 7,8, Caiying Zhu 9,10, Chenchen Wang 11,12, Qing Li 13,14, Yingying Huo 15,16, Jiaojiao Guo 17, Changlu Xu 18,19, Linping Hu 20,21, Aiming Pang 22,23, Shihui Ma 24,25, Lina Wang 26,27, Wenbin Cao 28,29, Shulian Chen 30,31, Qiuling Li 32,33, Sudong Zhang 34,35, Xueying Zhao 36,37, Wen Zhou 38, Hongbo Luo 39,40, Guoguang Zheng 41,42, Erlie Jiang 43,44, Sizhou Feng 45,46, Lixiang Chen 47, Lihong Shi 48,49, Hui Cheng 50,51, Sha Hao 52,53,, Ping Zhu 54,55,, Tao Cheng 56,57,
PMCID: PMC8288407  PMID: 34691592

Abstract

High throughput single-cell RNA-seq has been successfully implemented to dissect the cellular and molecular features underlying hematopoiesis. However, an elaborate and comprehensive transcriptome reference of the whole blood system is lacking. Here, we profiled the transcriptomes of 7551 human blood cells representing 32 immunophenotypic cell types, including hematopoietic stem cells, progenitors and mature blood cells derived from 21 healthy donors. With high sequencing depth and coverage, we constructed a single-cell transcriptional atlas of blood cells (ABC) on the basis of both protein-coding genes and long noncoding RNAs (lncRNAs), and showed a high consistence between them. Notably, putative lncRNAs and transcription factors regulating hematopoietic cell differentiation were identified. While common transcription factor regulatory networks were activated in neutrophils and monocytes, lymphoid cells dramatically changed their regulatory networks during differentiation. Furthermore, we showed a subset of nucleated erythrocytes actively expressing immune signals, suggesting the existence of erythroid precursors with immune functions. Finally, a web portal offering transcriptome browsing and blood cell type prediction has been established. Thus, our work provides a transcriptional map of human blood cells at single-cell resolution, thereby offering a comprehensive reference for the exploration of physiological and pathological hematopoiesis.

Keywords: human, hematopoietic cells, lncRNAs, transcription factor, single-cell RNA-seq

INTRODUCTION

The hematopoietic system, mainly derived from a pool of hematopoietic stem/progenitor cells (HSPCs), continuously generates erythrocytes/megakaryocytes, myeloid cells and lymphocytes and plays vital roles throughout the whole human lifespan. Dysregulation of hematopoiesis may give rise to various diseases such as immunodeficiency and blood cancers [1–3]. In recent years, single-cell sequencing techniques have facilitated the exploration of cellular and molecular heterogeneity during hematopoietic cell differentiation [4]. Research investigating hematopoietic progenitor cells has highlighted the differentiation hierarchy of early hematopoiesis [5–7]. In contrast, lineageCD34+CD38 cells have been shown to be a transcriptional continuum of low-primed undifferentiated hematopoietic stem and progenitor cells [8]. Although these studies have considerably advanced our perception of hematopoiesis, a systematic view of hematopoietic cell differentiation based on multiple individuals and blood cell types by deep single-cell RNA sequencing has been lacking.

LncRNAs function as crucial regulators during hematopoietic cell differentiation and development [9], including hematopoietic stem cell (HSC) differentiation, erythropoiesis and the development of B and T lymphocytes [10–13]. Recently, lncRNAs were comprehensively defined at the single-cell level and displayed highly lineage-specific and dynamic expression during human HSPC differentiation [14]. The single-cell lncRNA landscape of embryonic hematopoietic stem cells was also profiled and lncRNA (H19) was recognized to be pivotal for HSC emergence [15]. Nevertheless, the full repertoire of lncRNAs in human blood cells has not been elucidated.

In this study, we constructed a comprehensive transcriptome reference of human blood cells (that is atlas of blood cells, ABC), by performing deep sequencing of over 7000 single cells, representing 32 immunophenotypic cell types from 21 healthy donors. The hematopoietic hierarchy was dissected based on both protein-coding genes and lncRNAs. Of note, we elaborated the dynamic regulatory networks and differentiation trajectories for HSPCs and each lineage. This work contributes to a comprehensive understanding of the molecular dynamics during lineage differentiation, and provides valuable transcriptome references for hematopoiesis under both homeostasis and disease.

RESULTS

Human hematopoietic transcriptome reference

To establish a comprehensive transcriptome reference of the human blood system, we profiled the transcriptomes of bone marrow and peripheral blood derived hematopoietic cells from 21 healthy adult donors, by combining fluorescence activated cell sorting (FACS) of single cells for 32 well-defined cell types and a Single-Cell Tagged Reverse Transcription RNA Sequencing (STRT-seq) strategy (Methods). Specifically, we harvested bone marrow derived progenitors and differentiated cells (including CD34+ HSPCs, B cells, NK cells, T cells, monocytes, neutrophils and erythrocytes), together with peripheral blood derived differentiated cells including regulatory B, naive B, memory B, cytotoxic NK, cytokine NK and T cells (Supplementary Fig. 1). In total, 7551 single cells were profiled to construct the transcriptional atlas of human hematopoietic cells (Fig. 1A and Supplementary Fig. 2A). By limiting the number of single cells mixed in each library and increasing the sequencing depth, we were able to detect on average ∼3000 protein-coding genes per single cell. This high quality of transcriptome data ensured an accurate construction of the hematopoietic hierarchy with more gene expression details. HSPCs and monocytes expressed the highest number of genes, whereas NK cells, T cells and neutrophils were relatively transcriptionally quiescent (Fig. 1B). Notably, compared with the data generated using 10X Genomics sequencing of bone marrow derived nucleated cells [7], our data enabled detection of more genes and transcription factors for in-depth analysis (P values < 2.2e−16, Wilcoxon test, Fig. 1C), and were particularly superior in detecting low-abundance genes (Supplementary Fig. 2B).

Figure 1.

Figure 1.

Transcriptome reference of human blood cells. (A) Scheme of the experimental design. (B) Box plot shows the detected gene number for 32 immunophenotypic cell types. Colors indicate cell types. (C) Atlas of Blood Cells (ABC) detects more genes and transcription factors than the bone marrow samples from Human Cell Atlas (HCA) (P values < 2.2e−16, Wilcoxon test). (D) Transcriptional atlas of 7551 human hematopoietic cells by UMAP. Colors indicate cell populations and arrows show the differentiation trajectories of lymphoid, neutrophil/monocyte and erythroid lineages. (E) UMAP displays of transcription activities for hematopoiesis-related genes (AVP, CD79B, GZMH, CCR7, SPI1 and GATA1, respectively, for HSPCs, B cells, NK cells, T cells, neutrophil/monocytes and erythrocytes).

We first integrated the single-cell transcriptome profiles of all hematopoietic cells, followed by dimension reduction and visualization by uniform manifold approximation and projection (UMAP). The overall differentiation trajectory of hematopoiesis was revealed by starting from HSPCs and branching toward lymphocytes (B cells, NK cells and T cells), myeloid cells (monocytes and neutrophils) and erythrocytes. In contrast to B cells, monocytes and erythrocytes, we found that NK cells, T cells and neutrophils lacked a continuous transcriptional transition from progenitors to differentiated cells, suggesting that these cells might acquire dramatic shifts in gene expression during maturation, or that there existed transition populations not captured by the known surface markers (Fig. 1D and Supplementary Fig. 2C). We observed that AVP was specifically expressed in HSCs and MPPs. CD79B, GZMH and CCR7, respectively, exhibited high specificity and expression in B cells, NK cells and T cells, while SPI1 and GATA1 were highly expressed in neutrophil/monocytes and erythrocytes, respectively (Fig. 1E), verifying well-known marker genes related to hematopoiesis in each cell type. By deep sequencing of multiple classical hematopoietic populations from more than 20 healthy donors, our single-cell transcriptional atlas of human blood cells provides valuable transcriptome references for studies of human physiological and pathological hematopoiesis.

Regulatory networks underlie hematopoietic differentiation

To resolve the transcription factor regulatory networks (regulons) that underlie hematopoietic differentiation, regulon activity scores (RASs) were calculated for transcription factors in all single cells using SCENIC [16], which were then submitted to construct the regulatory atlas of human blood cells. We observed that the overall differentiation trajectory of hematopoiesis revealed by regulons coincided with that revealed by single-cell transcriptomes (Fig. 2A). In addition, SPRING [17] was used to visualize the hematopoietic hierarchy deduced from regulons and recapitulate the trajectory of lineage branches (Supplementary Fig. 3A). Single cells were distributed according to their cell types instead of according to the donors, suggesting no batch effects or individual diversities in terms of regulons related to hematopoiesis. Hematopoietic cells were grouped into 20 regulatory clusters by unsupervised clustering, termed as C1 to C20 hereafter, and each showed activation of highly specific sets of regulons (Fig. 2B). In particular, HOX genes were activated and co-regulated in C1/C2, which were mainly composed of HSPCs. TCF4, EBF1 and LEF1 showed high activities in C3 to C7 representing B cells, PRDM1 and XBP1 were activated in plasma cells, while GATA3 and TBX21 were activated in NK/T cells. CEBP and SPI1 exhibited high activities in the neutrophil/monocyte

Figure 2.

Figure 2.

Transcription factor regulatory networks underlie hematopoiesis. (A) UMAP displays the distributions of 32 immunophenotypic cell types based on regulons. Colors indicate cell types and arrows reflect the differentiation trajectories of lymphoid, neutrophil/monocyte and erythroid lineages. (B) 20 regulatory clusters determined by Seurat package. C1 to C20 indicates the regulatory clusters. (C) Heat map displays the on/off states of cluster-specific regulons. Black represents when the network is activated, and white represents when the network is inactivated. Red boxes emphasize the regulons for identical cell types or lineages. (D) Cytoscape shows the regulatory networks comprising transcription factors and their target genes underlying the lymphoid lineage. Edges connect transcription factor-target gene pairs while nodes represent genes. Transcription factors are displayed in larger font size and red color highlights the novel ones. Enriched biological processes and motif sequences (above: annotated motif from JASPAR; below: enriched motif) of target genes for novel transcription factors are shown. Shadows highlight the cell-type specific sub-networks.

lineage, while GATA1 and KLF1 were activated in the erythroid lineage (Fig. 2C and Supplementary Fig. 3B). Remarkably, monocytes and neutrophils shared the majority of regulons unique to myeloid lineage, while lymphocytes were preferentially regulated by cell type specific sub-networks highlighted by the regulon shifts during B cell maturation (Fig. 2C).

We next identified 23 novel regulons activated in each lineage by comparing with previously reported canonical transcription factors. To further confirm the regulatory functions of novel transcription factors, gene ontology (GO) enrichment and motif analysis of their target genes were performed accordingly. We found that enriched GO terms for BBX, REL, CREM, CREB3L2 and GLIS1 were associated with the activation/differentiation of

 

lymphocytes (Fig. 2D). RXRA, STAT2, STAT6, TFF3, TFEC, EGR3, OLIG2, VDR, VENTX and ZXDA were targeting neutrophil/granulocyte related genes, while FOXO3, NFIX, KLF3, MXI1, GMEB2, GTF3C2, SREBF1 and GADD45A targeted genes were highly related to biological processes such as erythrocyte/oxygen/iron ion (Supplementary Fig. 3C and D). Additionally, the enriched sequences on the promoter region of target genes for transcription factors like CREM, TFF3 and NFIX were consistent with their annotated motifs, suggesting that these newly identified transcription factors might regulate hematopoiesis in cooperating with canonical networks (Fig. 2D and Supplementary Fig. 3C and D).

Landscapes of long non-coding RNAs

lncRNAs play crucial roles in hematopoietic differentiation and development. However, the expression spectrum of lncRNAs in hematopoietic cells at single-cell resolution has not been reported. Therefore, we sought to construct a transcriptional atlas of 7192 human blood cells based on lncRNAs by taking advantage of our deep sequencing datasets. On average, over 1700 lncRNAs were detected for 32 immunophenotypic cell types referred to the genome annotation from the NONCODE database [18]. Coincident with protein-coding genes, HSPCs and monocytes expressed the highest number of lncRNAs, whereas NK cells, T cells and neutrophils expressed the lowest number of lncRNAs (Fig. 3A). We showed that the hematopoietic differentiation trajectory reconstructed by only using lncRNAs was highly consistent with that revealed by protein-coding genes (Fig. 3B and Supplementary Fig. 4A). To further dissect the transcriptional heterogeneity, differentially expressed genes (DEGs) were calculated between any two immunophenotypic cell types for both protein-coding genes and lncRNAs. The normalized number of DEGs showed high agreement between lncRNAs and protein-coding genes (Fig. 3C). These results implicated that the dynamics of lncRNAs were capable of depicting the global hematopoietic hierarchy.

Figure 3.

Figure 3.

Reconstruction of hematopoietic hierarchy by using lncRNAs. (A) Box plot shows the detected lncRNA number for 32 immunophenotypic cell types. Colors indicate cell types. (B) Transcriptional atlas of 7192 human hematopoietic cells based on lncRNAs by UMAP. (C) Heat map shows the scaled DEG numbers between 32 immunophenotypic cell types based on protein-coding genes (the lower triangle) and lncRNAs (the upper triangle). (D) Bar plot shows the proportion of signature lncRNAs adjacent to signature protein-coding genes for each immunophenotypic cell type. (E and F) Signature lncRNAs show higher PhastCons conservation scores (E) and higher cell specificity (F) (P values < 2.2e−16, Wilcoxon test). (G) UMAP displays of transcription activities for hematopoiesis-related lncRNAs such as NONHSAG031143.2, NONHSAG073805.1, NONHSAG069091.1, NONHSAG108638.1, NONHSAG008235.2 and NONHSAG103763.2 (respectively adjacent to AVP, CD79B, GZMH, CCR7, SPI1 and GATA1).

Next, we identified signature protein-coding genes and lncRNAs for each immunophenotypic cell type. We found that signature lncRNAs tended to associate with their adjacent signature protein-coding genes for more differentiated cell types, in contrast to progenitors (Fig. 3D). However, distal signature lncRNAs did not show significant change tendency during differentiation relative to adjacent signature lncRNAs (Supplementary Fig. 4B). Moreover, signature lncRNAs showed higher PhastCons conservation scores (from UCSC 100-way) and higher cell specificity (based on Jensen-Shannon Divergence, JSD) compared to background estimations (P values <2.2e−16, Wilcoxon test) (Fig. 3E and F), suggesting their potential functional roles. To further exploit the function of signature lncRNAs, their neighboring protein-coding genes with distances less than 5kb were scanned along the genome. Protein-coding genes near signature lncRNAs significantly overlapped with hematopoietic signatures [7,8] (Supplementary Fig. 4C). In particular, lncRNAs (NONHSAG031143.2, NONHSAG073805.1, NONHSAG069091.1, NONHSAG108638.1, NONHSAG008235.2 and NONHSAG103763.2) adjacent to hematopoietic signatures (AVP, CD79B, GZMH, CCR7, SPI1 and GATA1) were specifically highly expressed in HSPCs, B cells, NK cells, T cells, neutrophil/monocytes and erythrocytes, respectively (Fig. 3G). Altogether, lncRNAs specifically expressed in hematopoietic cells were identified at the single-cell level, and they tended to co-express with their adjacent signature protein-coding genes for more differentiated cells.

Elaborate atlas for each hematopoietic cell population

We next aimed to precisely dissect the differentiation trajectory and characteristics for each particular cell population such as HSPCs, B cells, NK cells, T cells, monocytes, neutrophils and erythrocytes [19]. The batch effects and individual diversities were corrected well for each cell population (Supplementary Fig. 5A). Considering that the immune role of CD71+ erythrocytes was reported in previous research [20], first, to illuminate the sub-clusters of continuously sorted erythrocytes, cells from different donors were integrated and followed by dimension reduction and visualization by UMAP. Notably, within the erythroid lineage, we found that Ery/Gra1 and Ery/Gra2 showed highly expressed genes associated with neutrophil and monocyte/dendritic (MD) signatures (Fig. 4A and Supplementary Fig. 5B and C). GO terms of specifically expressed genes in Ery/Gra1 and Ery/Gra2 were enriched on neutrophil-related, phagocytosis and inflammatory responses, along with antigen processing and presentation, further implying the innate and adaptive immune function of these two erythrocyte subsets (Fig. 4B). Additionally, pseudotime analysis showed that Ery/Gra2 was separated from other erythrocyte clusters because of its unique expression of immune signals, for instance VCAN and S100A9 (Fig. 4C and D). The differentiation trajectory also reflected that Ery/Gra1 and Ery/Gra2 were

Figure 4.

Figure 4.

Immune activation of CD74+ nucleated erythrocytes and elaborate atlas for other hematopoietic cell populations. (A) UMAP displays the transcriptional clusters for erythrocytes. Colors correspond to cell clusters. (B) Heat map shows the relative expression of the top 50 signature genes for each cluster. Significantly enriched biological processes for corresponding clusters are presented on the right. (C) The differentiation trajectory of erythrocytes by pseudotime analysis using Monocle3. (D) UMAP displays of transcription activities for surface markers and signature genes related to erythroid and granulocyte lineages. (E–K) UMAP displays of the transcriptional clusters (on the left by Seurat) and differentiation trajectory (on the right by Monocle3) for HSPCs (E), B cells (F), NK cells (G), CD4 T cells (H), CD8 T cells (I), monocytes (J) and neutrophils (K). Colors correspond to cell clusters. (L) Percentages of single cells in S and G2M phase for each cell population by line chart.

 

unipotent erythroid progenitors and did not differentiate into granulocytes (Supplementary Fig. 5D). Furthermore, we found that CD74+ nucleated erythrocytes mainly contributed to Ery/Gra1 and Ery/Gra2 clusters (Fig. 4D).

Subsequently, the differentiation trajectories and functional diversities for HSPCs, B cells, NK cells, T cells, monocytes and neutrophils were also elaborated. It was noted that intermediate progenitor clusters such as HSC/MPP2, Pre-B/Regulatory B, NKP and hMDP/cMoP were more heterogeneous by transcriptome analysis (Supplementary Fig. 6A). As expected, the differentiation trajectory of HSPCs was highlighted by the branching of progenitors including lymphoid, myeloid and erythroid/megakaryocyte lineages (Fig. 4E and Supplementary Fig. 5E). Memory B and plasma (cytotoxic NK and cytokine NK) were distributed on two different branches by pseudotime analysis, better reflecting the differentiation trajectory of B and NK cell populations (Fig. 4F and G and Supplementary Fig. 5G and H). Regarding T cell populations, both CD4 T and CD8 T cells could be classified into three clusters, respectively corresponding to naive T, memory T and effector T cells. By contrast, CD4 naive T and memory T cells ordered together while CD8 memory T and effector T cells were closely adjacent to each other (Fig. 4H and I and Supplementary Fig. 5I and J). Sub-clusters for monocytes and neutrophils were profiled and further revealed their differentiation trajectories (Fig. 4J and K and Supplementary Fig. 5K and L). In addition, pseudotime analysis based on lncRNAs displayed that lncRNAs could reflect the differentiation trajectories comparable to protein-coding genes (Supplementary Fig. 7). Next, signature genes of each transcriptional cluster were revealed to further support the annotation of blood cell types (Supplementary Fig. 6B, Supplementary Table 1). AVP and CRHBP were specifically expressed in HSC/MPPs. CD79A, CD79B and VPREB1/3 exhibited high specificity and expression in B cells and their progenitors. NKG7 and GZMK were highly expressed in both NK cells and T cells, while LYZ and S100A6/8/9 were highly expressed in both neutrophils and monocytes (Supplementary Fig. 6B). Finally, we found that the cell cycle was activated during HSPC differentiation, while inactivated during blood cell maturation (Fig. 4L). In conclusion, we elaborated the differentiation atlas of each cell population and observed the immune activation of nucleated erythrocytes.

A web portal for expression data browsing and blood cell type prediction

A user-friendly web interface for browsing and prediction features has been designed to provide access to the single-cell transcriptome data of human blood cells (http://scrna.sklehabc.com/). The browser page was intended for querying the expression of interested genes in 32 immunophenotypic cell types and 43 transcriptional clusters (Supplementary Fig. 8A). This study enabled users to predict the cell types of hematopoietic cells by implementing two approaches (Scmap [21] and Seurat [22]) and visualize them in the hematopoietic atlas (Supplementary Fig. 8B). We evaluated the prediction power using different single-cell datasets. First, 87% of single cells in our dataset could be correctly annotated. Notably, closely related cell clusters, such as HSPC populations, could also be precisely discriminated (accuracies of most clusters exceeded 90%) by our website (Supplementary Fig. 8C). Next, hematopoietic cells downloaded from the studies of Velten et al. [8] and Pellin et al. [6] were projected to our atlas of blood cells (ABC); most cell types could be correctly matched. Of note, we observed that immature myeloid progenitor 1 (Im1) and immature myeloid progenitor 2 (Im2) cells were annotated to HSC/MPP, LMPP and G2M, suggesting a higher resolution annotation of these cells (Supplementary Fig. 8D). Additionally, HSC, MPP and CMP from Pellin et al. were assigned to HSC/MPP2, in agreement with the fact that a low number of HSCs could be harvested by a wider and continuous gating of progenitor cells (Supplementary Fig. 8E). These results indicated that we could better define the cell types by implementing a comprehensive transcriptome reference of human blood cells.

DISCUSSION

In summary, we established a comprehensive transcriptome reference of the human blood system by combining FACS sorting of 32 immunophenotypically distinct cell types and deep single-cell RNA sequencing. Both gene expression signatures and transcription factor regulatory networks were defined in each cell type. We provided a new angle of lncRNAs for depicting the whole hematopoietic differentiation at single-cell resolution. The in-depth transcriptome data further enabled the discovery of immune activation of nucleated erythrocyte subsets. Finally, we offered an interactive web interface for accessing the atlas data and utilities for cell type prediction.

A growing body of studies has demonstrated that lncRNAs play crucial roles in physiological and pathological processes. Here, we showed that lncRNAs were competitive and consistent with protein-coding genes in distinguishing cell populations. lncRNAs specifically expressed in hematopoietic cells were identified, and their adjacent protein-coding genes were enriched with canonical hematopoietic signatures, implying their potential roles in hematopoietic differentiation. Additionally, signature lncRNAs preferentially orchestrated their adjacent signature protein-coding genes for more differentiated cell types, although this must be further validated by experiments. However, genes adjacent to signature lncRNAs were more general and stochastic for hematopoietic progenitors. This phenomenon supported the point that stem and progenitor cells exhibited more transcriptomic stochasticity [23]. These results provided novel clues for studies of lncRNA biology in human hematopoietic cells.

We thoroughly constructed the differentiation trajectories (with no functional validations) and further defined the signature genes associated with HSPCs, B cells, NK cells, T cells, monocytes, neutrophils and erythrocytes. Notably, we observed a small part of nucleated erythrocytes unexpectedly expressing signatures related to innate immune (neutrophil, phagocytosis and inflammatory response) and adaptive immune (antigen processing and presentation) reactions. It has been reported that nucleated erythrocytes of human cord blood suppressed the production of inflammatory cytokines from monocytes in a lipopolysaccharide (LPS)-mimicked system, to avoid a vigorous innate immune reaction [24]. Our results from adult bone marrow further confirmed the mediator roles of nucleated erythrocytes in immune responses. Additionally, immune roles of CD71+ erythrocytes were observed in previous research [20]. Furthermore, by means of single-cell RNA-seq technology, we found that only a small part of CD71+ erythrocytes (that is, CD74+ erythrocytes) expressed the signature related to immune response.

Our work paves the way for an in-depth understanding of hematopoiesis. The transcription atlas of human blood cells provides a valuable reference for guiding exploration of human physiological and pathological hematopoiesis.

METHODS

Sources of donors

Human blood cells including CD34+ HSPCs, B cells, NK cells, T cells, monocytes, neutrophils and erythrocytes, were derived from the bone marrow and peripheral blood (only regulatory B, naive B, memory B, cytotoxic NK, cytokine NK and T cells were derived from peripheral blood) of 21 healthy adult donors. All experiments were implemented in accordance with the protocols approved by the institutional ethics review boards from the Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College. Written informed consents were acquired before sample collection. Donors for CD34+ HSPCs were enrolled from Qilu Hospital of Shangdong University and the Institute of Hematology and Blood Disease Hospital. Donors for B cells, NK cells, T cells, monocytes, neutrophils and erythrocytes were obtained from the Institute of Hematology and Blood Disease Hospital.

Processing of single-cell RNA-seq data

Raw reads of STAT-seq were first split based on the cell-specific barcode sequences attached in read2, and UMI sequence in read2 was integrated into the paired read1 by Python scripts. Then, the template switch oligo (TSO) sequence, the polyA tail sequence and the low-quality reads (N > 10%) were discarded for each separated single cell using Cutadapt (version 1.18) [25]. Subsequently, we aligned the trimmed reads to human reference genome (hg38 from Ensembl) by Hisat2 (version 2.0.3-beta) [26]. Uniquely mapped reads were calculated through htseq-count (version 0.9.1) [27], based on the genome annotations from Ensembl (release 84, hg38) for protein-coding genes and from NONCODE (version v5.0, hg38) [18] for lncRNAs. Finally, the read count of a given gene was quantified by the total number of distinct UMIs, and the raw UMIs of protein-coding genes were normalized by log2(TPM/10 + 1) (TPM: transcripts per million) for downstream analysis. To filter low-quality single cells, mapping rates under 10% for HSPCs/monocytes, under 5% for B cells/NK cells/neutrophils/erythrocytes and under 2.5% for T cells were discarded. Simultaneously, we only retained the cells where more than 1000 protein-coding genes/500 lncRNAs were detected, resulting in 7551 cells for protein-coding genes and 7192 cells for lncRNAs [28–31].

DATA AVAILABILITY

The single-cell RNA sequencing data reported in this study are deposited in NCBI’s Gene Expression Omnibus (GEO) with the accession numbers GSE137864 and GSE149938. All other relevant data and custom code are available from the corresponding authors upon reasonable request.

Supplementary Material

nwaa180_Supplemental_Files

Acknowledgements

We want to thank Xiuli An for providing the valuable Band3 antibody.

Contributor Information

Xiaowei Xie, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Mengyao Liu, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Yawen Zhang, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Bingrui Wang, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Caiying Zhu, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Chenchen Wang, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Qing Li, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Yingying Huo, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Jiaojiao Guo, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha 410078, China.

Changlu Xu, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Linping Hu, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Aiming Pang, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Shihui Ma, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Lina Wang, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Wenbin Cao, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Shulian Chen, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Qiuling Li, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Sudong Zhang, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Xueying Zhao, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Wen Zhou, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha 410078, China.

Hongbo Luo, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Guoguang Zheng, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Erlie Jiang, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Sizhou Feng, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Lixiang Chen, School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China.

Lihong Shi, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Hui Cheng, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Sha Hao, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Ping Zhu, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

Tao Cheng, State Key Laboratory of Experimental Hematology and National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Center for Stem Cell Medicine and Department of Stem Cell & Regenerative Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China.

FUNDING

This work was supported by the Ministry of Science and Technology of China (2018YFA0107801, 2016YFA0100600, 2017YFA0103400), the National Key Research and Development Program of China (2018YFA0107804), the National Natural Science Foundation of China (81670106, 81421002, 81730006, 81890990, 81970104, 32000803) and the CAMS Initiative for Innovative Medicine (2017-I2M-1-015, 2019-I2M-1-006, 2016-I2M-1-017, 2019-I2M-2-001).

AUTHOR CONTRIBUTIONS

Tao Cheng, Ping Zhu and Sha Hao designed the project. Yawen Zhang, Bingrui Wang, Caiying Zhu, Chenchen Wang, Qing Li, Yingying Huo, Jiaojiao Guo, Changlu Xu, Linping Hu, Aiming Pang, Shihui Ma, Lina Wang, Wenbin Cao, Shulian Chen, Qiuling Li, Sudong Zhang, Xueying Zhao, Wen Zhou, Hongbo Luo, Guoguang Zheng, Erlie Jiang, Sizhou Feng, Lixiang Chen, Xiuli An, Lihong Shi and Hui Cheng provided the bone marrow samples and performed the experiments. Xiaowei Xie and Mengyao Liu performed bioinformatics analysis. Tao Cheng, Ping Zhu, Sha Hao and Xiaowei Xie wrote the manuscript. All authors discussed the results and commented on the manuscript.

Conflict of interest statement. None declared.

REFERENCES

  • 1. Orkin SH, Zon LI. Hematopoiesis: an evolving paradigm for stem cell biology. Cell 2008; 132: 631–44. 10.1016/j.cell.2008.01.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Laurenti E, Gottgens B. From haematopoietic stem cells to complex differentiation landscapes. Nature 2018; 553: 418–26. 10.1038/nature25022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Scala S, Aiuti A. In vivo dynamics of human hematopoietic stem cells: novel concepts and future directions. Blood Adv 2019; 3: 1916–24. 10.1182/bloodadvances.2019000039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Jacobsen SEW, Nerlov C. Haematopoiesis in the era of advanced single-cell technologies. Nat Cell Biol 2019; 21: 2–8. 10.1038/s41556-018-0227-8 [DOI] [PubMed] [Google Scholar]
  • 5. Karamitros D, Stoilova B, Aboukhalil Zet al. Single-cell analysis reveals the continuum of human lympho-myeloid progenitor cells. Nat Immunol 2018; 19: 85–97. 10.1038/s41590-017-0001-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Pellin D, Loperfido M, Baricordi Cet al. A comprehensive single cell transcriptional landscape of human hematopoietic progenitors. Nat Commun 2019; 10: 2395. 10.1038/s41467-019-10291-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Hay SB, Ferchen K, Chetal Ket al. The Human Cell Atlas bone marrow single-cell interactive web portal. Exp Hematol 2018; 68: 51–61. 10.1016/j.exphem.2018.09.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Velten L, Haas SF, Raffel Set al. Human haematopoietic stem cell lineage commitment is a continuous process. Nat Cell Biol 2017; 19: 271–81. 10.1038/ncb3493 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Satpathy AT, Chang HY. Long noncoding RNA in hematopoiesis and immunity. Immunity 2015; 42: 792–804. 10.1016/j.immuni.2015.05.004 [DOI] [PubMed] [Google Scholar]
  • 10. Luo M, Jeong M, Sun Det al. Long non-coding RNAs control hematopoietic stem cell function. Cell Stem Cell 2015; 16: 426–38. 10.1016/j.stem.2015.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Paralkar VR, Mishra T, Luan Jet al. Lineage and species-specific long noncoding RNAs during erythro-megakaryocytic development. Blood 2014; 123: 1927–37. 10.1182/blood-2013-12-544494 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Hu G, Tang Q, Sharma Set al. Expression and regulation of intergenic long noncoding RNAs during T cell development and differentiation. Nat Immunol 2013; 14: 1190–8. 10.1038/ni.2712 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Brazao TF, Johnson JS, Muller Jet al. Long noncoding RNAs in B-cell development and activation. Blood 2016; 128: e10–9. 10.1182/blood-2015-11-680843 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Wu Z, Gao S, Zhao Xet al. Long noncoding RNAs of single hematopoietic stem and progenitor cells in healthy and dysplastic human bone marrow. Haematologica 2019; 104: 894–906. 10.3324/haematol.2018.208926 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Zhou J, Xu J, Zhang Let al. Combined single-cell profiling of lncRNAs and functional screening reveals that H19 is pivotal for embryonic hematopoietic stem cell development. Cell Stem Cell 2019; 24: 285–98 e5. 10.1016/j.stem.2018.11.023 [DOI] [PubMed] [Google Scholar]
  • 16. Aibar S, Gonzalez-Blas CB, Moerman Tet al. SCENIC: single-cell regulatory network inference and clustering. Nat Methods 2017; 14: 1083–6. 10.1038/nmeth.4463 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Weinreb C, Wolock S, Klein AM. SPRING: a kinetic interface for visualizing high dimensional single-cell expression data. Bioinformatics 2018; 34: 1246–8. 10.1093/bioinformatics/btx792 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Fang S, Zhang L, Guo Jet al. NONCODEV5: a comprehensive annotation database for long non-coding RNAs. Nucleic Acids Res 2018; 46: D308–14. 10.1093/nar/gkx1107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Zeng Y, He J, Bai Zet al. Tracing the first hematopoietic stem cell generation in human embryo by single-cell RNA sequencing. Cell Res 2019; 29: 881–94. 10.1038/s41422-019-0228-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Elahi S. Neglected cells: immunomodulatory roles of CD71(+) erythroid cells. Trends Immunol 2019; 40: 181–5. 10.1016/j.it.2019.01.003 [DOI] [PubMed] [Google Scholar]
  • 21. Kiselev VY, Yiu A, Hemberg M. scmap: projection of single-cell RNA-seq data across data sets. Nat Methods 2018; 15: 359–62. 10.1038/nmeth.4644 [DOI] [PubMed] [Google Scholar]
  • 22. Stuart T, Butler A, Hoffman Pet al. Comprehensive integration of single-cell data. Cell 2019; 177: 1888–902 e21. 10.1016/j.cell.2019.05.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Han X, Zhou Z, Fei Let al. Construction of a human cell landscape at single-cell level. Nature 2020; 581: 303–9. 10.1038/s41586-020-2157-4 [DOI] [PubMed] [Google Scholar]
  • 24. Chico V, Nombela I, Puente-Marín Set al. Nucleated red blood cells contribute to the host immune response against pathogens. In: Tyagi RK, Bisen PS (eds.). Immune Response Activation and Immunomodulation. London: IntechOpen, 2018, 334–48. [Google Scholar]
  • 25. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J 2011; 17: 10–2. 10.14806/ej.17.1.200 [DOI] [Google Scholar]
  • 26. Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods 2015; 12: 357–60. 10.1038/nmeth.3317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Anders S, Pyl PT, Huber W. HTSeq-a Python framework to work with high-throughput sequencing data. Bioinformatics 2015; 31: 166–9. 10.1093/bioinformatics/btu638 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Li L, Dong J, Yan Let al. Single-Cell RNA-Seq analysis maps development of human germline cells and gonadal niche interactions. Cell Stem Cell 2017; 20: 891–2. 10.1016/j.stem.2017.05.009 [DOI] [PubMed] [Google Scholar]
  • 29. Zhong S, Zhang S, Fan Xet al. A single-cell RNA-seq survey of the developmental landscape of the human prefrontal cortex. Nature 2018; 555: 524–8. 10.1038/nature25980 [DOI] [PubMed] [Google Scholar]
  • 30. Zheng H, Brennan K, Hernaez Met al. Benchmark of long non-coding RNA quantification for RNA sequencing of cancer samples. Gigascience 2019; 8: giz145. 10.1093/gigascience/giz145 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Hou Y, Guo H, Cao Cet al. Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas. Cell Res 2016; 26: 304–19. 10.1038/cr.2016.23 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

nwaa180_Supplemental_Files

Data Availability Statement

The single-cell RNA sequencing data reported in this study are deposited in NCBI’s Gene Expression Omnibus (GEO) with the accession numbers GSE137864 and GSE149938. All other relevant data and custom code are available from the corresponding authors upon reasonable request.


Articles from National Science Review are provided here courtesy of Oxford University Press

RESOURCES