Abstract
The expression of human and microbial genes serves as biomarkers for disease and health. Blood RNA is an important biological resource for precision medicine and translational medicine. However, few studies have assessed the human transcriptome profiles and microbial communities composition and diversity of peripheral blood from different cell isolation methods, which could affect the reproducibility of researches. We collected peripheral blood from three healthy donors and processed it immediately. We used RNA sequencing to investigate the effect of three leukocyte isolation methods including buffy coat (BC) extraction, red blood cell (RBC) lysis and peripheral blood mononuclear cell (PBMC) isolation with the comparison with whole blood (WB), through analyzing the sensitivity of gene detection, the whole transcriptome profiling and microbial composition and diversity. Our data showed that BC extraction with high globin mRNA mapping rate had similar transcriptome profiles with WB, while RBC lysis and PBMC isolation depleted RBCs effectively. With the efficient depletion of RBC and distinct compositions of leukocyte subsets, RNA-seq of RBC lysis and PBMC isolation uniquely detected genes from specific cell types, like granulocytes and NK cells. In addition, we observed that the microbial composition and diversity were more affected by individuals than isolation methods. Our results showed that blood cell isolations could largely influence the sensitivity of detection of human genes and transcriptome profile.
Supplementary Information
The online version contains supplementary material available at 10.1007/s11033-021-06382-1.
Keywords: Peripheral blood, Cell isolation, Transcriptome profile, Microbial transcript, Biomarker
Introduction
Peripheral blood is a fluid connective tissue throughout the body connecting the entire biological system. Previous studies have shown that peripheral blood gene expression is an important source of information to profile individual uniqueness [1, 2]. Peripheral blood has distinct patterns of gene expression in different diseases including Parkinson’s disease [3], childhood asthma [4], rheumatoid arthritis [5, 6], infectious diseases [7] and early preeclampsia [8, 9]. A unique pattern of immune dysregulation is found in COVID-19 patients’ blood samples [10, 11]. In addition, studies have shown that composition and diversity of human blood microbial communities change with the onset of diseases including diabetes [12], cirrhotic [13], acute pancreatitis [14], cardiovascular diseases [15] schizophrenia [16], and cancer [17]. Peripheral blood is more available and less risky to collect than invasive organ biopsy, thus provides promising biomarkers for diseases in precision and translational medicine.
Peripheral whole blood (WB) contains white blood cells (WBCs), red blood cells (RBCs), platelets and plasma. WBCs include lymphocyte, granulocyte and monocyte and play critical roles in immunity, exhibiting the major complexity of blood. There are 3 popular methods to isolate WBCs from WB, including buffy coat (BC) extraction, RBC lysis and peripheral blood mononuclear cell (PBMC) isolation. BC extraction is achieved by extracting the white membrane layer of blood after centrifugation, remaining most WBCs and platelets [18, 19]. RBC lysis removes erythrocytes and leaves WBCs and platelets [20]. PBMC isolation uses density gradient centrifugation to separate mononuclear cells, such as lymphocytes and monocytes [21]. These cell isolation methods maintain different subpopulations of blood cells, leading to distinct expression patterns. Previous studies have observed a reduced number of detected genes, a high variability of gene expression profile and a low signal-to-noise ratio in WB compared with WBC, which was related to the RBC-derived globin transcripts [22, 23]. The gene expression pattern and cell subpopulations were also varied among WB and WBC isolations [22]. Most of these studies evaluated the difference on a group of selected genes by microarray. With the development of RNA sequencing (RNA-seq), it is possible to identify the changes of whole human transcriptome profile and microbial composition and diversity among WBC isolations, and the potential to affect biomarker discovery.
Here, we used RNA-seq to comprehensively investigate the influence of 3 popular WBC isolation methods on human transcriptome profile and microbial composition and diversity, including BC extraction, RBC lysis and PBMC isolation. With the assessment of the three cell isolation methods, we provide a reference to help researchers to choose suitable pretreatments for particular study purposes.
Materials and methods
Sample collection and leukocyte isolation
Three healthy volunteers (two males and one female) were enrolled, and 10 mL peripheral blood was collected each using EDTA anticoagulation tubes (BD, 0202992058) and processed immediately. Each tube of WB was divided into four parts, three of which underwent three different leukocyte isolation methods, and one portion was mixed with TRIzol LS Reagent (Thermo Fisher, 10296028). The methods of BC extraction, RBC lysis and PBMC isolation are described in Fig. 1. The study protocol was approved by the BGI Institutional Review Board (NO. BGI-IRB 17034). All donors signed consent forms for non-therapeutic use of their donated blood samples.
Fig. 1.
Schematic of sample processing. Whole blood was collected in EDTA anticoagulation tubes (n = 3). All samples were treated immediately after collection
RNA extraction, library preparation and sequencing
Twelve RNA samples were extracted by TRIzol Reagent (Thermo Fisher, 15596026) or TRIzol LS Reagent (Thermo Fisher, 10296028) according to the manufacturer’s manual. RNA concentration and integrity were measured by Agilent 2100 bioanalyzer (Agilent Technologies, G2939A). Samples with total RNA amount ≥ 200 ng and RIN score ≥ 6 were qualified to construct sequencing library. RNase H method was applied to deplete rRNA [24]. Sequencing was performed on BGISEQ-500RS (single-end 50 bp) developed by BGI.
Data filtering, alignment and expression quantification
Reads with adaptors and low quality were filtered by SOAPnuke [25], and rRNA was removed (hg19 rRNA ref) by SOAP2 [26] to obtain clean data. Metrics for clean data were calculated according to output files generated by SOAPnuke [25]. We used HISAT (version 2.1.0) [27] with default parameters to aligned all clean data on the human genome (hg19) and calculate metrics of alignment. We used RNA-SeQC (version 2.6.4) [28] to assessed the proportion of reads aligned to annotated CDS exons, 5′UTR exons. 3′UTR exons, introns, TSS up/down 10 kb and other regions.
We aligned all clean data on human transcript reference by Bowtie2 [29], the refMrna.fa.gz file from the UCSC database [30] removed NR_RNA as the mRNA reference, and NONCODEv5_human.fa.gz file from the NONCODE database [31] as the non-coding RNA reference. Saturation curves display the number of detected genes according to BAM files generated by Bowtie2. All transcripts estimated counts and Transcripts Per Million mapped reads (TPM) were obtained using Kallisto [32]. We performed principal component analysis (PCA) and calculated the coefficient of variation according to all expressed genes among isolation methods.
Blood cell subsets analysis
We estimated the content of globin mRNA (Supplementary Table S1) reads in clean data to present the residue of erythrocytes. CIBERSORT [33] was used to estimate the relative proportion of leukocyte subpopulations.
Identification of differentially expressed genes (DEGs) and co-expression genes
We used edgeR [34] to identify DEGs between isolation methods. The Benjamini-Hochberg (BH) method [35] was employed to correct multiple comparisons. DEGs were considered significant if they exhibited a BH-adjusted p-value ≤ 0.01 and fold change ≥ 2.
Then we identified isolation-associated co-expression genes by WGCNA [36] according to all DEGs’ TPM. We chose β = 9 as the soft threshold [37]. The Pearson correlation between module eigengenes (ME) [36] and isolation method was also calculated. Metascape [38] was used to analyze the disease enrichment according to DisGeNET database [39]. FunRich [40] was used to analyze the cell type enrichment of coding genes in different modules.
Human gene expression in different blood cell types
Gene expression levels summarized in 18 blood cell types is based on the Human Protein Atlas (version 20.0) [41] and Ensembl (version 92.38) [42]. TPM of uniquely detected genes in each group of cell was merged and observed in different cell types. Expression of coding genes of black and yellow modules in different blood cell types was also observed.
The blood microbiome analyses
Clean reads that failed to align to the human genome were further filtered with low-quality and low complexity reads, and the remaining reads were aligned to microbiome using Kraken (version 0.10.5) [43] with a database including viral, archaeal, bacterial, protozoa, fungi, and human. We excluded the human reads and calculated the microbiome reads in per million clean reads (microbiome-RPM) and the relative abundances of bacterial taxa at phylum level. The alpha diversity in each sample was determined using the Simpson index. To measure sample-to-sample dissimilarities between microbial communities, we used Bray–Curtis beta diversity index. Principal coordinates analysis (PCoA) was performed based on unweighted Bray–Curtis distances.
Statistical analysis
The coefficients of variation (CV) of commonly detected genes expression under each pretreatment were calculated. Paired two-sided t-test was used to compare the differences in CV, proportions of globin mRNA and leukocyte subsets between cell groups in this study, and p < 0.05 was considered to be significant. The relative proportions of leukocyte subsets were presented as mean ± SD.
Results
Quality control of RNA and RNA-seq
RNA was extracted from all processed samples with an average RIN value of 8.1 ± 1.0 (mean ± SD) and sequencing libraries were successfully constructed (Supplementary Table S2). For each sample of RNA-seq, about 100 M clean reads were obtained respectively (Supplementary Table S3). The proportion of rRNA reads and filtered reads in raw reads did not show significant difference among pretreatments (Supplementary Table S4). Q20 of all samples were larger than 98.75%, which inferred that the sequencing accuracy of most bases was up to 99% (Fig. S1A). The GC content in WB and BC extraction was higher than RBC lysis and PBMC isolation (Fig. S1B). All pretreatments have similar total aligned percentages, while WB and BC extraction with a higher multi-aligned and lower uniquely aligned percentages (Fig. S1C). We also observed that WB and BC extraction with higher proportion of CDS exons and lower proportion of introns (Fig. S1D).
Blood cell subsets analysis
The proportion of globin mRNA showed WB > BC extraction > RBC lysis > PBMC isolation (Fig. 2a, Supplementary Table S4). The proportion of leukocyte subsets varied with subjects (Fig. S2, Supplementary Table S5). As expected, neutrophils were rare in PBMC, with increasing proportions of lymphocyte and monocyte (Fig. 2b).
Fig. 2.
Blood cell subsets among WBCs isolations and WB. a Percentage of globin mRNA reads in clean reads. b The relative proportions of neutrophils, monocytes, CD8 cells, CD4 naïve cells, T regulatory cells, and NK cells resting. *Shows p-value < = 0.05
Detection of human expressed genes
Saturation curves displayed that the number of detected genes performed PBMC isolation ≥ RBC lysis > BC extraction > WB at a random number of clean reads (Fig. 3a). The number of high abundance genes (TPM > 1) and uniquely detected genes (only detected in one cell group) (Fig. 3b, c) showed similar trends.
Fig. 3.
Human gene detected sensitivity. a Sequencing saturation analysis of each pretreatment. b Distribution of all gene expression levels. c Venn diagram showed the overlap of the detected genes by different WBCs isolations and WB. A gene is considered “expressed” in one pretreatment if it has a TPM value of at least 0.3 in all three biology replicates. d Principal component analysis (PCA) according to commonly detected genes expression. e Coefficient of variation for commonly detected genes expression. f Distribution of commonly detected gene and uniquely detected gene expression and g Heatmap of gene expression in 18 cell types. TPM was transformed by log2 (TPM+1)
According to the commonly detected genes of the four cell groups, the PCA showed that the expression profiling of WB and BC extraction were similar and the other two groups were distinct (Fig. 3d), and the CV performed PBMC isolation < RBC lysis < BC extraction < WB (Fig. 3e). Compared with the expression of commonly detected genes, that of uniquely detected genes was lower (Fig. 3f). We further found that the uniquely detected genes in RBC lysis were highly expressed in granulocytes (neutrophils, basophils and eosinophils), and those in PBMC isolation were highly expressed in basophils and NK cells (Fig. 3g). Enrichment analysis for DisGeNET did not identify significant terms.
Characterization of DEGs among isolation methods
The fold change distribution of DEGs in any two groups was shown in Fig. 4a. No DEG was identified between BC and WB. Through WGCNA, we successfully identified two integrative gene modules, and labeled by black and yellow (Fig. 4b). Furthermore, we found that the black module showed positive correlations with WB and BC extraction, while the yellow module showed a positive correlation with RBC lysis and a negative correlation with PBMC isolation (Fig. 4c). Heatmap showed that genes in black module had relatively high expression in WB and BC extraction, while those in the yellow module were highly expressed in RBC lysis (Fig. 4d).
Fig. 4.
Human gene expression profile difference among WBCs isolations and WB. a The fold change distribution of DEGs. b Gene dendrogram. The color row underneath the dendrogram shows the module assignment determined by the Dynamic Tree Cut. c Pearson correlation between module eigengene and the pretreatment. The cor (up) and p-value (down) are shown in box. The numbers of coding (left) and non-coding (right) gene are shown in brackets. d Heatmap of genes expression in each sample. TPM was transformed by log2 (TPM + 1). e Distribution of gene expression levels. f Summary of enrichment analysis in DisGeNET. (Color figure online)
The enrichment analysis of cell type showed coding gens in black module were significantly enriched in erythrocytes and yellow module were enriched in neutrophils (Fig. S3A) which consistent with the principle of experiments. We further found that the genes in black and yellow modules were also expressed in other leukocyte subsets (Fig. S3B). The gene expression distributions of black and yellow modules were shown in Fig. 4e. In DisGeNET enrichment analysis terms, the genes in black module were mostly enriched in erythrocytes related disease such as erythroleukemia, anemia, beta thalassemia intermedia, acute erythroblastic leukemia and hereditary spherocytosis (Fig. 4f). Genes in yellow module were mostly enriched in inflammation-related diseases, such as pneumonitis, inflammation, infection, Juvenile psoriatic arthritis and inflammatory dermatosis (Fig. 4f).
The microbial composition and diversity among isolation methods
The microbiome-RPM was varied with subjects and did not show difference among isolation methods (Fig. 5a). We found that the microbial composition was stable for each individual in different pretreatments, and proteobacteria dominate all samples (Fig. 5b). PCoA demonstrated that the microbial communities at the genus level was mainly affected by individuals (Fig. 5c). Alpha and beta diversities at the genus level were not different among groups (Fig. 5d, e).
Fig. 5.
The microbial composition and diversity among WBCs isolations and WB. a The microbiome-RPM. b Relative abundances of microbial taxa at phylum level. c Principal coordinates analysis (PCoA) of microbial communities at the genus level based on unweighted Bray–Curtis distances. Alpha (Simpson index) (d) and beta (Bray–Curtis dissimilarity index) (e) diversity of per sample at the genus level of classification
Discussion
Peripheral blood is a valuable source for noninvasive diagnosis and prognosis of various diseases and biomarker discovery. Expression of genes and the microbial composition and diversity in the blood provide important information of diseases and health status. There are several methods to preprocess peripheral blood, and we comprehensively and systematically assessed the influence of three popular cell isolation methods on the performance of transcriptome and microbial composition and diversity profiling of peripheral blood.
RBCs make up around 45% of the WB volume, and the mapping rate of globin mRNA varies with the proportion of erythrocytes and affect the capability of RNA-seq [44, 45]. Among the three leukocyte isolation methods, BC extraction presented similar with WB in PCA analysis, the globin mRNA mapping rate and the composition of leukocyte subsets, which demonstrated that BC extraction could not deplete RBCs efficiently and had a comparable composition of blood cells with WB. BC layer probably contains some RBCs, or BC extraction inevitably includes erythrocyte layer [18, 46]. With the comparable residue of erythrocytes as WB, the RNA-seq data of BC extraction showed a smaller number of detected genes, a higher GC content and a higher proportion of CDS exon than RBC lysis and PBMC separation. More genes with low abundance (TPM < 1) and less uniquely detected genes were also identified in WB and BC extraction. In addition, genes in the black module positively associated with WB and BC extraction, were observed with high abundance in these two groups and enriched in diseases such as anemia and beta thalassemia. As a result, BC extraction could not remove erythrocytes effectively and affected the capability of RNA-seq.
WBCs are important components of the peripheral immune system and play an essential role in protecting the body against infection, illness and disease. They include granulocytes, monocytes and lymphocytes. In healthy peripheral blood, neutrophil ranges from 50 to 75%, monocytes range from 1 to 8%, and lymphocytes range from 20 to 40% [47]. The relative proportions and expression of leukocyte subsets change with diseases [48–50]. Neutrophil is usually maintained by RBC lysis but depleted by PBMC isolation. Though these two isolation methods removed most erythrocytes, they showed distinct transcriptome profiles due to maintaining neutrophils or not. Our results showed that the variability of gene expression was higher in RBC lysis compared with PBMC isolation, which might be caused by the variation of the proportion of neutrophils (51.19% ± 10.05%) in RBC lysis. Genes uniquely detected in RBC lysis showed low abundance (TPM < 1), and had relatively high expression in granulocytes (including neutrophil, basophil and eosinophil). Genes uniquely detected in PBMC isolation also presented low abundance (TPM < 1), and had relatively high expression in basophils and NK cells. The neutrophil genes expressed more highly in RBC lysis than PBMC isolation as we observed in the yellow module [22, 23], and these genes enriched in infection and inflammation. In conclusion, RBC lysis maintained most WBC subsets, and PBMC isolation kept lymphocyte and monocyte but not neutrophil, leading to different transcriptome profiles.
There are increasing evidences to prove that the microbiome exists in healthy human blood [51]. In our data, we observed that there were differences of the microbiome content, the relative abundance at the phylum level, and the microbiome diversity at the genus level among the three individuals but not WBCs isolation methods. Though more subjects are needed to draw a conclusion, similar phenomena were reported. A previous study set up a 16S rDNA quantitative polymerase chain reaction assay as well as a 16S targeted metagenomics sequencing pipeline specifically designed to analyze the blood microbiome, and demonstrated that it varied among healthy donors and blood fractions (BC, RBCs and plasma) [52]. Moreover, we found that the proteobacteria dominated the composition of microorganisms among different individuals and isolation methods, which was consistent with previous studies [14, 16, 52].
Different cell isolation methods obtained distinct blood cell subsets and affected transcriptome profiles. It is recommended to choose the cell isolation method according to the research purpose. WB has the complete information of blood cells including both WBCs and RBCs. Draw blood through PAXgene tubes and add the depletion of globin mRNA to RNA-seq could be an effective strategy. Due to the variable proportions of leukocyte subsets of individuals and distinct functions of these subsets, focusing on the whole WBCs or specific subpopulations like PBMC is according to the disease or health status. If necessary, commercial kits for isolating specific cell subtype could be used to enable a precise characterization. The cell isolation methods had less effects on microbial composition and diversity than human transcriptome profiles. Regarding to the relatively small number of subjects in this study, the findings need to be interpreted with caution.
Conclusions
We systematically assessed the effect of BC extraction, RBC lysis and PBMC isolation on human transcriptome profiles and microbial transcripts, and found that the composition of blood cell subpopulations varied with these methods. We provide a reference for researchers to develop proper sample processing strategies for their own study purposes.
Supplementary Information
Below is the link to the electronic supplementary material.
Supplementary file6 (PDF 392 kb)
Fig. S1: Metrics for clean data and sequence alignment. Q20 (A) and GC content (B) of clean data in four pretreatments. (C) Percentage of un-aligned reads, unique aligned reads and multi-aligned reads in clean data. (D) Proportion of reads mapping to CDS exons, 5′UTR exons, 3′UTR exons, introns, TSS up/down 10 kb and other regions.
Supplementary file7 (PDF 235 kb)
Fig. S2: The relative percentage of leukocyte subsets in each sample. The order of the legends of cell types were shown from bottom to up.
Supplementary file8 (PDF 1202 kb)
Fig. S3: (A) The cell type enrichment analysis for coding genes in black and yellow modules. (B) Heatmap of gene expression in 18 cell type. TPM was transformed by log2 (TPM+1).
Acknowledgements
We are grateful to the healthy volunteers who participated in the study, and the colleagues in BGI-Shenzhen who helped to conduct RNA sequencing. We also thank Dr. Qing Zhou, Dr. Zhongzhen Liu and Jinghua Sun for giving their comments on this study. We thank Liang Wu for sharing his analysis method on this study.
Funding
This project is supported by the National Natural Science Foundation of China (No.81300075), the Natural Science Foundation of Guangdong Province (No. 2014A030313795), the Shenzhen Municipal Government of China (No.JCYJ20170412152854656, JCYJ20180703093402288), and the Shenzhen Peacock Plan (No. KQTD20150330171505310).
Availability of data
The data reported in this study are also available in the CNGB Nucleotide Sequence Archive (CNSA: https://db.cngb.org/cnsa; accession number CNP0000266).
Declarations
Conflict of interest
The authors declare no conflicts of interest.
Informed consent
All volunteers signed informed consent, and the study protocol was approved by the BGI Institutional Review Board (NO. BGI-IRB 17166).
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Petrov PD, Bonet ML, Reynes B, Oliver P, Palou A, Ribot J. Whole blood RNA as a source of transcript-based nutrition- and metabolic health-related biomarkers. PLoS ONE. 2016;11(5):e0155361. doi: 10.1371/journal.pone.0155361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Chomczynski P, Wilfinger WW, Eghbalnia HR, Kennedy A, Rymaszewski M, Mackey K. Inter-individual differences in RNA levels in human peripheral blood. PLoS ONE. 2016;11(2):e0148260. doi: 10.1371/journal.pone.0148260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Soreq L, Salomonis N, Guffanti A, Bergman H, Israel Z, Soreq H. Whole transcriptome RNA sequencing data from blood leukocytes derived from Parkinson's disease patients prior to and following deep brain stimulation treatment. Genom Data. 2015;3:57–60. doi: 10.1016/j.gdata.2014.11.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Persson H, Kwon AT, Ramilowski JA, Silberberg G, Soderhall C, Orsmark-Pietras C, Nordlund B, Konradsen JR, de Hoon MJ, Melen E, Hayashizaki Y, Hedlin G, Kere J, Daub CO. Transcriptome analysis of controlled and therapy-resistant childhood asthma reveals distinct gene expression profiles. J Allergy Clin Immunol. 2015;136(3):638–648. doi: 10.1016/j.jaci.2015.02.026. [DOI] [PubMed] [Google Scholar]
- 5.Sumitomo S, Nagafuchi Y, Tsuchida Y, Tsuchiya H, Ota M, Ishigaki K, Suzuki A, Kochi Y, Fujio K, Yamamoto K. Transcriptome analysis of peripheral blood from patients with rheumatoid arthritis: a systematic review. Inflamm Regen. 2018;38:21. doi: 10.1186/s41232-018-0078-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lequerre T, Gauthier-Jauneau AC, Bansard C, Derambure C, Hiron M, Vittecoq O, Daveau M, Mejjad O, Daragon A, Tron F, Le Loet X, Salier JP. Gene profiling in white blood cells predicts infliximab responsiveness in rheumatoid arthritis. Arthritis Res Ther. 2006;8(4):R105. doi: 10.1186/ar1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Holcomb ZE, Tsalik EL, Woods CW, McClain MT. Host-based peripheral blood gene expression analysis for diagnosis of infectious diseases. J Clin Microbiol. 2017;55(2):360–368. doi: 10.1128/JCM.01057-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tarca AL, Romero R, Erez O, Gudicha DW, Than NG, Benshalom-Tirosh N, Pacora P, Hsu CD, Chaiworapongsa T, Hassan SS, Gomez-Lopez N. Maternal whole blood mRNA signatures identify women at risk of early preeclampsia: a longitudinal study. J Matern Fetal Neonatal Med. 2020 doi: 10.1080/14767058.2019.1685964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hong S, Banchereau R, Maslow BL, Guerra MM, Cardenas J, Baisch J, Branch DW, Porter TF, Sawitzke A, Laskin CA, Buyon JP, Merrill J, Sammaritano LR, Petri M, Gatewood E, Cepika AM, Ohouo M, Obermoser G, Anguiano E, Kim TW, Nulsen J, Nehar-Belaid D, Blankenship D, Turner J, Banchereau J, Salmon JE, Pascual V. Longitudinal profiling of human blood transcriptome in healthy and lupus pregnancy. J Exp Med. 2019;216(5):1154–1169. doi: 10.1084/jem.20190185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Giamarellos-Bourboulis EJ, Netea MG, Rovina N, Akinosoglou K, Antoniadou A, Antonakos N, Damoraki G, Gkavogianni T, Adami ME, Katsaounou P, Ntaganou M, Kyriakopoulou M, Dimopoulos G, Koutsodimitropoulos I, Velissaris D, Koufargyris P, Karageorgos A, Katrini K, Lekakis V, Lupse M, Kotsaki A, Renieris G, Theodoulou D, Panou V, Koukaki E, Koulouris N, Gogos C, Koutsoukou A. Complex immune dysregulation in COVID-19 patients with severe respiratory failure. Cell Host Microbe. 2020;27(6):992–1000 e1003. doi: 10.1016/j.chom.2020.04.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Zhang D, Guo R, Lei L, Liu H, Wang Y, Wang Y, Qian H, Dai T, Zhang T, Lai Y, Wang J, Liu Z, Chen T, He A, O'Dwyer M, Hu J. COVID-19 infection induces readily detectable morphologic and inflammation-related phenotypic changes in peripheral blood monocytes. J Leukoc Biol. 2020 doi: 10.1002/JLB.4HI0720-470R. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Qiu J, Zhou H, Jing Y, Dong C. Association between blood microbiome and type 2 diabetes mellitus: a nested case-control study. J Clin Lab Anal. 2019;33(4):e22842. doi: 10.1002/jcla.22842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Traykova D, Schneider B, Chojkier M, Buck M. Blood microbiome quantity and the hyperdynamic circulation in decompensated cirrhotic patients. PLoS ONE. 2017;12(2):e0169310. doi: 10.1371/journal.pone.0169310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Li Q, Wang C, Tang C, Zhao X, He Q, Li J. Identification and characterization of blood and neutrophil-associated microbiomes in patients with severe acute pancreatitis using next-generation sequencing. Front Cell Infect Microbiol. 2018;8:5. doi: 10.3389/fcimb.2018.00005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Amar J, Lange C, Payros G, Garret C, Chabo C, Lantieri O, Courtney M, Marre M, Charles MA, Balkau B, Burcelin R, Group DESIRS Blood microbiota dysbiosis is associated with the onset of cardiovascular events in a large general population: the D.E.S.I.R. study. PLoS ONE. 2013;8(1):e54461. doi: 10.1371/journal.pone.0054461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Olde Loohuis LM, Mangul S, Ori APS, Jospin G, Koslicki D, Yang HT, Wu T, Boks MP, Lomen-Hoerth C, Wiedau-Pazos M, Cantor RM, de Vos WM, Kahn RS, Eskin E, Ophoff RA. Transcriptome analysis in whole blood reveals increased microbial diversity in schizophrenia. Transl Psychiatry. 2018;8(1):96. doi: 10.1038/s41398-018-0107-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Poore GD, Kopylova E, Zhu Q, Carpenter C, Fraraccio S, Wandro S, Kosciolek T, Janssen S, Metcalf J, Song SJ, Kanbar J, Miller-Montgomery S, Heaton R, McKay R, Patel SP, Swafford AD, Knight R. Microbiome analyses of blood and tissues suggest cancer diagnostic approach. Nature. 2020;579(7800):567–574. doi: 10.1038/s41586-020-2095-1. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 18.Sutton DW, Chen PC, Schmid-Schonbein GW. Cell separation in the buffy coat. Biorheology. 1988;25(4):663–673. doi: 10.3233/bir-1988-25406. [DOI] [PubMed] [Google Scholar]
- 19.Teetson W, Cartwright C, Dreiling BJ, Steinberg MH. The leukocyte composition of peripheral blood buffy coat. Am J Clin Pathol. 1983;79(4):500–501. doi: 10.1093/ajcp/79.4.500. [DOI] [PubMed] [Google Scholar]
- 20.Brisco P, Sankbeil J, Kephart D. RNA purification: a rapid and versatile protocol for the isolation of total RNA. Promega Notes. 1997;64:07. [Google Scholar]
- 21.Ulmer AJ, Scholz W, Ernst M, Brandt E, Flad HD. Isolation and subfractionation of human peripheral blood mononuclear cells (PBMC) by density gradient centrifugation on Percoll. Immunobiology. 1984;166(3):238–250. doi: 10.1016/S0171-2985(84)80042-X. [DOI] [PubMed] [Google Scholar]
- 22.Debey S, Schoenbeck U, Hellmich M, Gathof BS, Pillai R, Zander T, Schultze JL. Comparison of different isolation techniques prior gene expression profiling of blood derived cells: impact on physiological responses, on overall expression and the role of different cell types. Pharmacogenomics J. 2004;4(3):193–207. doi: 10.1038/sj.tpj.6500240. [DOI] [PubMed] [Google Scholar]
- 23.Feezor RJ, Baker HV, Mindrinos M, Hayden D, Tannahill CL, Brownstein BH, Fay A, MacMillan S, Laramie J, Xiao W, Moldawer LL, Cobb JP, Laudanski K, Miller-Graziano CL, Maier RV, Schoenfeld D, Davis RW, Tompkins RG, Inflammation, Host Response to Injury L-SCRP Whole blood and leukocyte RNA isolation for gene expression analyses. Physiol Genomics. 2004;19(3):247–254. doi: 10.1152/physiolgenomics.00020.2004. [DOI] [PubMed] [Google Scholar]
- 24.Adiconis X, Borges-Rivera D, Satija R, DeLuca DS, Busby MA, Berlin AM, Sivachenko A, Thompson DA, Wysoker A, Fennell T, Gnirke A, Pochet N, Regev A, Levin JZ. Comparative analysis of RNA sequencing methods for degraded or low-input samples. Nat Methods. 2013;10(7):623–629. doi: 10.1038/nmeth.2483. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Chen Y, Chen Y, Shi C, Huang Z, Zhang Y, Li S, Li Y, Ye J, Yu C, Li Z, Zhang X, Wang J, Yang H, Fang L, Chen Q. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience. 2018;7(1):1–6. doi: 10.1093/gigascience/gix120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K, Wang J. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics. 2009;25(15):1966–1967. doi: 10.1093/bioinformatics/btp336. [DOI] [PubMed] [Google Scholar]
- 27.Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12(4):357–360. doi: 10.1038/nmeth.3317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.DeLuca DS, Levin JZ, Sivachenko A, Fennell T, Nazaire MD, Williams C, Reich M, Winckler W, Getz G. RNA-SeQC: RNA-seq metrics for quality control and process optimization. Bioinformatics. 2012;28(11):1530–1532. doi: 10.1093/bioinformatics/bts196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Rosenbloom KR, Armstrong J, Barber GP, Casper J, Clawson H, Diekhans M, Dreszer TR, Fujita PA, Guruvadoo L, Haeussler M, Harte RA, Heitner S, Hickey G, Hinrichs AS, Hubley R, Karolchik D, Learned K, Lee BT, Li CH, Miga KH, Nguyen N, Paten B, Raney BJ, Smit AF, Speir ML, Zweig AS, Haussler D, Kuhn RM, Kent WJ. The UCSC Genome Browser database: 2015 update. Nucleic Acids Res. 2015;43(Database issue):D670–D681. doi: 10.1093/nar/gku1177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Fang S, Zhang L, Guo J, Niu Y, Wu Y, Li H, Zhao L, Li X, Teng X, Sun X, Sun L, Zhang MQ, Chen R, Zhao Y. NONCODEV5: a comprehensive annotation database for long non-coding RNAs. Nucleic Acids Res. 2018;46(D1):D308–D314. doi: 10.1093/nar/gkx1107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Bray NL, Pimentel H, Melsted P, Pachter L. Erratum: near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016;34(8):888. doi: 10.1038/nbt0816-888d. [DOI] [PubMed] [Google Scholar]
- 33.Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, Hoang CD, Diehn M, Alizadeh AA. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. 2015;12(5):453–457. doi: 10.1038/nmeth.3337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B. 1995;57:289–300. [Google Scholar]
- 36.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 2008;9:559. doi: 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.https://horvath.genetics.ucla.edu/html/CoexpressionNetwork/Rpackages/WGCNA/faq.html
- 38.Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, Benner C, Chanda SK. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10(1):1523. doi: 10.1038/s41467-019-09234-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Pinero J, Bravo A, Queralt-Rosinach N, Gutierrez-Sacristan A, Deu-Pons J, Centeno E, Garcia-Garcia J, Sanz F, Furlong LI. DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants. Nucleic Acids Res. 2017;45(D1):D833–D839. doi: 10.1093/nar/gkw943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Pathan M, Keerthikumar S, Ang CS, Gangoda L, Quek CY, Williamson NA, Mouradov D, Sieber OM, Simpson RJ, Salim A, Bacic A, Hill AF, Stroud DA, Ryan MT, Agbinya JI, Mariadason JM, Burgess AW, Mathivanan S. FunRich: an open access standalone functional enrichment and interaction network analysis tool. Proteomics. 2015;15(15):2597–2601. doi: 10.1002/pmic.201400515. [DOI] [PubMed] [Google Scholar]
- 41.Ponten F, Jirstrom K, Uhlen M. The Human Protein Atlas—a tool for pathology. J Pathol. 2008;216(4):387–393. doi: 10.1002/path.2440. [DOI] [PubMed] [Google Scholar]
- 42.Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, Billis K, Cummins C, Gall A, Giron CG, Gil L, Gordon L, Haggerty L, Haskell E, Hourlier T, Izuogu OG, Janacek SH, Juettemann T, To JK, Laird MR, Lavidas I, Liu Z, Loveland JE, Maurel T, McLaren W, Moore B, Mudge J, Murphy DN, Newman V, Nuhn M, Ogeh D, Ong CK, Parker A, Patricio M, Riat HS, Schuilenburg H, Sheppard D, Sparrow H, Taylor K, Thormann A, Vullo A, Walts B, Zadissa A, Frankish A, Hunt SE, Kostadima M, Langridge N, Martin FJ, Muffato M, Perry E, Ruffier M, Staines DM, Trevanion SJ, Aken BL, Cunningham F, Yates A, Flicek P. Ensembl 2018. Nucleic Acids Res. 2018;46(D1):D754–D761. doi: 10.1093/nar/gkx1098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014;15(3):R46. doi: 10.1186/gb-2014-15-3-r46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Choi I, Bao H, Kommadath A, Hosseini A, Sun X, Meng Y, Stothard P, Plastow GS, Tuggle CK, Reecy JM, Fritz-Waters E, Abrams SM, Lunney JK, le Guan L. Increasing gene discovery and coverage using RNA-seq of globin RNA reduced porcine blood samples. BMC Genomics. 2014;15:954. doi: 10.1186/1471-2164-15-954. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Mastrokolias A, den Dunnen JT, van Ommen GB, t Hoen PA, van Roon-Mom WM Increased sensitivity of next generation sequencing-based expression profiling after globin reduction in human blood RNA. BMC Genomics. 2012;13:28. doi: 10.1186/1471-2164-13-28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Sorg N, Poppe C, Bunos M, Wingenfeld E, Hummer C, Kramer A, Stock B, Seifried E, Bonig H. Red blood cell depletion from bone marrow and peripheral blood buffy coat: a comparison of two new and three established technologies. Transfusion. 2015;55(6):1275–1282. doi: 10.1111/trf.13001. [DOI] [PubMed] [Google Scholar]
- 47.Alberts B. Molecular biology of the cell. 4. New York: Garland Science; 2002. [Google Scholar]
- 48.Bobdey S, Ganesh B, Mishra P, Jain A. Role of monocyte count and neutrophil-to-lymphocyte ratio in survival of oral cancer patients. Int Arch Otorhinolaryngol. 2017;21(1):21–27. doi: 10.1055/s-0036-1587318. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Liu G, Zhang S, Hu H, Liu T, Huang J. The role of neutrophil-lymphocyte ratio and lymphocyte-monocyte ratio in the prognosis of type 2 diabetics with COVID-19. Scott Med J. 2020;65(4):154–160. doi: 10.1177/0036933020953516. [DOI] [PubMed] [Google Scholar]
- 50.Kazancioglu S, Yilmaz FM, Bastug A, Sakalli A, Ozbay BO, Buyuktarakci C, Bodur H, Yilmaz G. Lymphocyte subset alteration and monocyte CD4 expression reduction in patients with severe COVID-19. Viral Immunol. 2020 doi: 10.1089/vim.2020.0166. [DOI] [PubMed] [Google Scholar]
- 51.Castillo DJ, Rifkin RF, Cowan DA, Potgieter M. The healthy human blood microbiome: fact or fiction? Front Cell Infect Microbiol. 2019;9:148. doi: 10.3389/fcimb.2019.00148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Paisse S, Valle C, Servant F, Courtney M, Burcelin R, Amar J, Lelouvier B. Comprehensive description of blood microbiome from healthy donors assessed by 16S targeted metagenomic sequencing. Transfusion. 2016;56(5):1138–1147. doi: 10.1111/trf.13477. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary file6 (PDF 392 kb)
Fig. S1: Metrics for clean data and sequence alignment. Q20 (A) and GC content (B) of clean data in four pretreatments. (C) Percentage of un-aligned reads, unique aligned reads and multi-aligned reads in clean data. (D) Proportion of reads mapping to CDS exons, 5′UTR exons, 3′UTR exons, introns, TSS up/down 10 kb and other regions.
Supplementary file7 (PDF 235 kb)
Fig. S2: The relative percentage of leukocyte subsets in each sample. The order of the legends of cell types were shown from bottom to up.
Supplementary file8 (PDF 1202 kb)
Fig. S3: (A) The cell type enrichment analysis for coding genes in black and yellow modules. (B) Heatmap of gene expression in 18 cell type. TPM was transformed by log2 (TPM+1).
Data Availability Statement
The data reported in this study are also available in the CNGB Nucleotide Sequence Archive (CNSA: https://db.cngb.org/cnsa; accession number CNP0000266).





