Skip to main content
Protein & Cell logoLink to Protein & Cell
. 2016 Nov 22;8(2):123–133. doi: 10.1007/s13238-016-0342-x

Mass spectrometry based proteomics profiling of human monocytes

Yong Zeng 1,2, Fei-Yan Deng 2,4, Wei Zhu 2,3, Lan Zhang 2, Hao He 2, Chao Xu 2, Qing Tian 2, Ji-Gang Zhang 2, Li-Shu Zhang 1, Hong-Gang Hu 1, Hong-Wen Deng 1,2,
PMCID: PMC5291777  PMID: 27878450

Abstract

Human monocyte is an important cell type which is involved in various complex human diseases. To better understand the biology of human monocytes and facilitate further studies, we developed the first comprehensive proteome knowledge base specifically for human monocytes by integrating both in vivo and in vitro datasets. The top 2000 expressed genes from in vitro datasets and 779 genes from in vivo experiments were integrated into this study. Altogether, a total of 2237 unique monocyte-expressed genes were cataloged. Biological functions of these monocyte-expressed genes were annotated and classified via Gene Ontology (GO) analysis. Furthermore, by extracting the overlapped genes from in vivo and in vitro datasets, a core gene list including 541 unique genes was generated. Based on the core gene list, further gene-disease associations, pathway and network analyses were performed. Data analyses based on multiple bioinformatics tools produced a large body of biologically meaningful information, and revealed a number of genes such as SAMHD1, G6PD, GPD2 and ENO1, which have been reported to be related to immune response, blood biology, bone remodeling, and cancer respectively. As a unique resource, this study can serve as a reference map for future in-depth research on monocytes biology and monocyte-involved human diseases.

Electronic supplementary material

The online version of this article (doi:10.1007/s13238-016-0342-x) contains supplementary material, which is available to authorized users.

Keywords: human monocytes, proteomics knowledgebase, gene ontology, gene-disease association, network analysis

INTRODUCTION

The mission of the international Human Proteome Project (HPP) is to help elucidate biological and molecular function and advance diagnosis and treatment of diseases by systematically cataloging all proteins manufactured in the human body, and by generating a map of the protein-based molecular architecture of the human body.

Monocytes are a type of white blood cell derived from hemopoietic stem cells in bone marrow. They can usually be distinguished in stained smears by their large kidney shaped or notched nuclei. Monocytes play fundamental roles in human physiology; for example, as part of the human immune system, monocytes play a central role in regulating host inflammatory processes through chemotaxis, phagocytosis, and cytokine production (Jin et al., 2006). As precursors of bone-resorbing osteoclasts, monocytes are important for bone remodeling (Deng et al., 2011b; Zhou et al., 2015). Monocytes are also involved in muscle system processes (Grunberg et al., 2010), nervous system development (Verkman et al., 2011), regulation of tumor necrosis factor production (Kakazu et al., 2011). Despite the significance of monocytes in human physiology and pathology, the molecular bases underlying their diverse functions are poorly understood.

The human monocyte proteome has not yet been tackled by the Human Proteome Organization Initiative, and a global proteome database specifically for human monocytes has yet to be established. Quantitative proteomics, which is based on Liquid Chromatography (LC) and Mass Spectrometry (MS) technologies and can systematically identify and quantify proteins at a proteome-wide scale, has emerged as a novel and powerful methodology to characterize functions of various cell types and tissues in humans (Deng et al., 2011a). The purpose of this study is to present a comprehensive proteomic draft map for human monocytes. Dissecting the human monocyte proteome using modern quantitative proteomics methodologies will contribute to a comprehensive understanding of monocyte biology and the significance of monocytes in human health and diseases.

Results

Protein isoelectric point and molecular weight information

Corresponding to 2060 genes out of the combined gene list, 5310 potential proteins went through the online tool for molecular weight (Mw) and isoelectric point (pI) calculation. The distributions of protein Mw and pI are shown in Fig. 1A and Fig. 1B respectively. It is well known that proteins are significant macromolecular substances in organisms. As expected, for most of the proteins, molecular weight values were larger than 10,000 Dalton. For protein pI distribution, as shown in Fig 1B), a large proportion of proteins were weakly acidic or weakly alkaline (6 < pI < 8). There were more acidic proteins than alkaline proteins (Table S3). This information may benefit future studies regarding human monocytes, especially for protein identification in electrophoresis experiments.

Figure 1.

Figure 1

Basic information and disease association of human monocyte proteins. (A) Distribution of protein molecular weight. X-axis lists the specific Mw ranges. Y-axis shows the number of proteins distributed in each range. (B) Distribution of Protein Isoelectric Point. X axis lists the specific pI ranges. Y axis shows the number of proteins distributed in each range. (C) Top 10 terms in cellular component category. X axis shows the top 10 terms in the Cellular Component category. Y axis shows the number of genes enriched in each specific GO term. (D) 20 main diseases related to human monocytes. X axis shows the names of 20 diseases. Y axis shows the number of monocyte-expressed genes involved in each specific disease

Gene enrichment and functional annotation results

As displayed in Table S4, all the monocyte-expressed genes were annotated into a total of 13 categories and 2383 terms, derived from various databases such as GO, BIOCARTA, KEGG and others. All of the genes enriched in specific terms were listed. Meanwhile, for each term, a series of statistical values such as count and percentage of enriched genes, P value, fold enrichment score, FDR and Bonferroni value were calculated for the purpose of enrichment evaluation.

We looked into the most representative GO terms under several categories. In the cellular component (CC) category, based on the counts of enriched genes, Fig. 1C shows the top 10 terms illustrating the subcellular distribution of monocyte-expressed genes. As expected, most of the genes are enriched in cytosol, membrane, cytoskeleton and nuclear-related terms. In the molecular function (MF) category, based on the counts of enriched genes, Table 1 shows the top 20 terms involved in multiple functions such as nucleoside binding, ATP binding, RNA binding, calcium ion binding, and enzyme binding. In the biological process (BP) category, Table 2 highlights 20 terms which involved a variety of biological processes including immune response, defense response, protein localization and regulation of cell death. These 20 terms mainly represent the biological functions related to human monocytes.

Table 1.

Top 20 terms in molecular function category

Terms Number of genes P value
GO:0000166~nucleotide binding 520 1.81E-44
GO:0017076~purine nucleotide binding 435 9.56E-34
GO:0032555~purine ribonucleotide binding 416 5.60E-32
GO:0032553~ribonucleotide binding 416 5.60E-32
GO:0001882~nucleoside binding 319 2.50E-14
GO:0001883~purine nucleoside binding 316 4.90E-14
GO:0030554~adenyl nucleotide binding 311 9.50E-14
GO:0032559~adenyl ribonucleotide binding 293 1.60E-12
GO:0005524~ATP binding 290 1.46E-12
GO:0005198~structural molecule activity 168 2.70E-19
GO:0003723~RNA binding 160 3.14E-11
GO:0005509~calcium ion binding 142 0.045
GO:0005525~GTP binding 136 6.68E-30
GO:0019001~guanyl nucleotide binding 136 1.43E-28
GO:0032561~guanyl ribonucleotide binding 136 1.43E-28
GO:0008092~cytoskeletal protein binding 136 3.09E-16
GO:0019899~enzyme binding 132 2.36E-13
GO:0042802~identical protein binding 131 4.65E-07
GO:0008233~peptidase activity 108 2.02E-04
GO:0070011~peptidase activity, acting on L-amino acid peptides 105 1.32E-04

Notes: Based on the number of enriched genes, the first column shows the names of the top 20 terms in the molecular function category. The second column shows the number of genes enriched in each specific term. The third column shows the p value of each term

Table 2.

20 main biological processes related to monocytes

Terms Number of genes P value
GO:0008104~protein localization 214 1.59E-20
GO:0045184~establishment of protein localization 200 6.49E-23
GO:0007242~intracellular signaling cascade 199 0.002737
GO:0015031~protein transport 198 1.23E-22
GO:0043933~macromolecular complex subunit organization 197 2.97E-26
GO:0006508~proteolysis 191 1.73E-06
GO:0065003~macromolecular complex assembly 187 1.14E-25
GO:0046907~intracellular transport 181 9.74E-24
GO:0006796~phosphate metabolic process 166 2.48E-04
GO:0006793~phosphorus metabolic process 166 2.48E-04
GO:0010941~regulation of cell death 162 3.28E-08
GO:0043067~regulation of programmed cell death 161 4.34E-08
GO:0042981~regulation of apoptosis 158 1.08E-07
GO:0006952~defense response 153 1.49E-15
GO:0016192~vesicle-mediated transport 151 9.85E-18
GO:0006955~immune response 151 1.10E-10
GO:0007049~cell cycle 151 4.01E-07
GO:0009611~response to wounding 150 6.20E-21
GO:0009057~macromolecule catabolic process 145 1.03E-05
GO:0010033~response to organic substance 143 2.71E-07

Notes: 20 terms which represent the main biological processes of monocytes are presented in the first column. The second column shows the number of genes enriched in each specific term. The third column shows the p value of each term

In order to gain a better understanding of monocyte-expressed proteins in human physiology and pathology, we intentionally looked into specific GO terms in the CC, MF and BP categories respectively to reveal genes which have been well recognized for their association with the immune system, blood, bone, cellular response, muscle, nervous system, and tumors, among others.

Results of gene-disease association analysis

A total of 3444 gene-disease associations were obtained by matching the core gene list to the entire gene-disease association database. Among the associations, many diseases such as HIV infections, osteoporosis, arthritis, carcinoma and leukemia were highly associated with the physiology and pathology of human monocytes. Fig. 1D shows the number of genes involved in 20 main diseases. A single gene may be involved in different diseases; for example, ANXA2 is involved in carcinoma, osteoporosis, and leukemia. Table S5 shows detailed information for all of these gene-disease associations, giving us a clear and comprehensive view of monocyte related diseases and providing a better understanding of the physiology and pathology of human monocytes.

Results of pathway and network analysis

The core gene list, including 541 unique genes, was put through the Reactome online tool for pathway analysis. Accordingly, a total of 972 pathways as well as a series of statistical values including “Entities pValue” and “Entities FDR” were generated (Table S6). Based on physiological similarity and hierarchy, pathways were classified into several groups including immune system, signal transduction, metabolism, disease, cell cycle and hemostasis (Fig. 2). Similarly, in network analysis based on Cytoscape plugins, all provided genes were first enriched in specific terms, then all the terms were classified into 30 functional modules (S7 Table). As shown in Figs. 3 and S1, the genes shown to be involved in biological processes including “defense response,” “cellular localization,” “hemostasis,” “response to stress,” “cellular component movement,” and “catabolic process” make up a large proportion of all provided monocyte-expressed genes.

Figure 2.

Figure 2

Main groups and hierarchical relationship of Reactome pathways. Reactome pathways are classified into different groups based on the similarity of their biological functions. The bottom color gradation provides the P value comparison for significant pathways. The dendritic structure represents the hierarchical relation of pathways

Figure 3.

Figure 3

Network reconstruction of functional terms. Each dot represents one functional term. The same color dots represent terms which have similar biological functions. Accordingly, all terms were classified into different functional modules including “defense response,” “response to stress,” “hemostasis” and “cellular localization”

DISCUSSION

Human monocytes play fundamental roles in human physiology, including immune system processes, blood coagulation and circulation, bone remodeling and resorption, muscle system processes and nervous system development (see details in Table S4). Dissecting global monocyte proteomics may offer a fresh perspective into understanding monocyte-related biological process and pathology in humans. In this study, we developed the first proteome knowledge base specifically for human monocytes. Such a knowledge base, integrating in vivo monocyte proteome datasets from human PBM samples and in vitro proteome datasets from cultured monocytes, may represent a comprehensive knowledge base for monocyte-expressed proteins in human population. We cataloged a total of 2237 unique monocyte-expressed genes, and conveyed serial information for these genes and their corresponding proteins including protein biochemical features (Mw and pI), protein functions (cellular component, molecular function, and biological processes), gene-disease association and pathway/network information.

Monocytes can act as precursors of macrophages and dendritic cells in the human immune system. Macrophages and dendritic cells have key roles in antigenic infections, linking innate antigen detection to antigen adaptive immune responses (Hrecka et al., 2011). In our study, GO analysis results indicated that 151 monocyte-expressed genes are enriched in the “immune response” term. At the same time, Reactome pathway analysis showed that 134 genes are enriched in the pathway “immune system.” More importantly, in the gene-disease association study, a large number of genes are involved in immunological diseases such as HIV infections, arthritis and other autoimmune diseases. Among these genes, SAMHD1, NCF1, NCF2, and CD14 have been demonstrated to be involved in many kinds of human immune related physiology pathways or diseases. SAMHD1 has a protective role in preventing self-activation of innate immunity by cell-intrinsic components (Hrecka et al., 2011). Evidence indicates that SAMHD1 directly participates in HIV-1 infection macrophage mediation (Hrecka et al., 2011). Meanwhile, as a restriction factor, SAMHD1 has been proposed to inhibit HIV-1 by blocking viruses at different steps of their life cycle (Merindol and Berthoux, 2015). Our gene-disease association study supports the theory that NCF1 and NCF2 are associated with some immunological diseases including arthritis and granulomatous disease. Previous evidence has demonstrated that mutations in NCF1 and NCF2 may cause granulomatous disease, and the copy number variation of NCF1 is associated with rheumatoid arthritis (de Boer et al., 2002; Koker, 2010; Olsson et al., 2012). Previous studies have also reported that CD14 deficiency increased clinical symptoms in active experimental autoimmune encephalomyelitis (Walter et al., 2006). Furthermore, as a marker of monocyte activation, soluble CD14 is associated with morbidity and mortality in HIV disease (Shive et al., 2015). Consistently, our gene-disease association study denoted that CD14 is involved in many immunological diseases such as HIV infections, inflammation and some autoimmune diseases. These examples demonstrate that monocytes are a fundamental part of the human immune system and play an irreplaceable role in immune-associated biological processes in humans.

Monocytes are also involved in different kinds of blood-associated biological processes including blood coagulation, blood circulation, and leukocyte migration. Accordingly, as listed in S5 Table, monocyte-expressed genes are involved in several blood-associated diseases such as leukemia, anemia, blood coagulation and blood platelet disorders. Herein, we highlighted several genes such as G6PD, ITGB2 and ANXA5 which were sorted out from blood related terms including “hemostasis,” “blood coagulation,” “leukocyte migration” and “hereditary hemolytic anemia.” G6PD deficiency is the most common metabolic disorder of red blood cells, affecting about 35 million people worldwide (Nabavizadeh and Anushiravani, 2007). An atomic force microscopy study suggested that G6PD plays a significant role in hemolytic anemia by affecting the membrane structure of red blood cells (Tang et al., 2015). The severity of clinical manifestations of myeloid leukemia is directly related to the degree of CD18 (ITGB2) deficiency (Vasconcelos Dde et al., 2014). As a biochemical marker of atherosclerosis, the concentration of ANXA5 serves as a standard for clinical diagnosis of cardiovascular complications in kidney disease (Emanuel et al., 2013). These examples highlight the close relationship between monocyte-expressed genes/proteins and blood physiology in humans.

Monocytes can serve as a precursor of bone-resorbing osteoclasts. Monocytes may access bone resorption surfaces to differentiate into osteoclasts to resorb bone, which is important for bone remodeling and fracture repair (Deng et al., 2011a). Many proteins related to bone formation or resorption mechanism thus involved in osteoporosis were identified. Herein, four genes, GPD2, ANXA2, TPM4 and GSN, have been selected for subsequent discussion according to functional annotation and gene-disease association analysis in our study. These four genes are sorted out from the term “calcium ion binding” and are all involved in osteoporosis in gene-disease association analysis. Previous studies from our group have supported the contribution of GPD2, TPM4 and GSN to osteoporosis risk (Deng et al., 2008; Deng et al., 2014). Several previous studies suggested that ANXA2 can influence bone formation in a variety of ways including regulation of osteoprogenitor proliferation, differentiation, and responsiveness to cytokines (Deng et al., 2011a; Pandey et al., 2012; Genetos et al., 2014). These examples suggest that monocytes are functionally relevant to bone. Extensive studies on monocyte-expressed proteins may provide novel insights into the interaction between monocytes and bone.

Monocytes are involved in a variety of human physiological functions and associated with many complex human diseases (Dichamp et al., 2007; Chimma et al., 2009; Kraft-Terry et al., 2010). Five diseases closely related to the physiology and pathology of monocytes are presented in Table 3, along with a list of genes involved in 2 or more diseases. In-depth studies of monocyte-expressed proteins will contribute to a better understanding of the molecular pathophysiological mechanisms underlying these diseases. The in vivo protein expression data based on LC-MS may serve as a reference map and use for validation study for functional research on individual monocyte proteins. Basic information including molecular weight and isoelectric point is important for electrophoretic experiment focus on specific monocyte protein. Results from multiple bioinformatics analysis allow us to better understand the interaction between monocyte proteins, which may help other researchers to develop new ideas on physiology and etiology studies regarding human monocyte. In summary, our study provided a reference map for future in-depth research on monocytes and monocyte-involved human diseases, which may facilitate the understanding of monocyte biology and monocyte-related human health and diseases for the research community.

Table 3.

Five representative diseases and the involved genes related to monocytes

Arthritis Carcinoma HIV infections Leukemia Osteoporosis
HLA-A 1 1 1 1
HLA-B 1 1 1 1
HLA-C 1 1 1 1
CAT 1 1 1
ENO1 1 1 1
GSTP1 1 1 1
SOD2 1 1 1
ACTG1 1 1
ANXA1 1 1
ANXA2 1 1
ANXA4 1 1
ATIC 1 1
CD14 1 1
CDC42 1 1
HLA-DRB5 1 1
HSPD1 1 1
IDH2 1 1
ITGB2 1 1
LGALS3 1 1
MPO 1 1
MVP 1 1
NME1 1 1
PLEK 1 1
PRTN3 1 1
PSMA4 1 1
PSMA5 1 1
PTPRC 1 1
RAN 1 1
S100A8 1 1
SERPINA1 1 1
TAP1 1 1
VIM 1 1

Notes: In this table, 5 diseases closely related to human monocytes are selected and shown in the first row. Among these 5 diseases, genes involved in 2 or more diseases are listed in the first column. The number “1” means that the corresponding gene is involved in the specified disease

MATERIAL AND METHODS

In vivo data acquisition

Human subjects

The study was approved by the Institutional Review Boards of University of Missouri Kansas City and Tulane University. A total of 76 unrelated female Caucasians (Age: 57.8 ± 9.0) were involved in this study. Signed informed consent documents were obtained from all study participants before they were enrolled.

Peripheral blood monocyte (PBM) isolation

For in vivo data, monocytes were extracted from human peripheral blood. About 60ml of peripheral blood was collected from each subject by certificated phlebotomists. Ethylenediaminetetraacetic acid (EDTA) was used as an anti-coagulant. The fresh blood samples were processed instantly for PBM isolation by experienced technicians. First, peripheral blood mononuclear cells (PBMC) were isolated from whole blood using density gradient centrifugation with Histopaque-1077 (Sigma, Catalog No. H1077-1). Then, PBM were isolated from PBMC using a monocyte negative isolation kit (Dynal Biotech Inc.) following the manufacturer’s recommendation. The kit contains a highly optimized antibody mix, blocking reagent, and Depletion Dynabeads to deplete T cells, B cells, and natural killer cells from PBMC, leaving monocytes untouched and free of surface-bound antibodies and beads. Per our experience, the purity of PBM isolated using this method was 86% ± 3% (Liu et al., 2005).

Protein extraction and sample preparation

PBM total proteins were extracted using a complete proteome extraction mammalian kit (Calbiochem Catalog No. 539779). Protein concentration was measured using the Bradford method. Up to 20 μg total protein was precipitated using a protein precipitation kit (Calbiochem Catalog No. 539180). Protein pellets were dissolved in 50 μl. 50 mmol/L ammonium bicarbonate with 0.1% RapiGest (Waters), reduced by 5.0 mmol/L dithiothreitol, alkylated by 15 mmol/L indole acetic acid, then digested by trypsin (Sigma, Catalog No.T6567). Protein digests were concentrated to approximately 20 μL, of which 15 μL was aspirated and brought back to 20 μL, with 0.5% formic acid (FA) and 100 fmol yeast ADH1 (alcohol dehydrogenase I) digestion standard (Waters, Catalog No. 186002328).

PBM proteome profiling through LC-nano-ESI-MS/MSE analyses

PBM proteomes were profiled using the method of LC-nano-ESI-MSE, through nanoAcquity Ultra Performance Liquid Chromatography coupled with Synapt High Definition Mass Spectrometry (HDMS) (Waters Corporation). Proteome data acquisition was controlled by MassLynx4.1 software (Waters). Briefly, the protein digests (~ 500 ng) were injected into a BEH C18 75 μm × 150 mm analytical column (Waters), and separated by solvent A (water with 0.1% FA) and solvent B (acetonitrile with 0.1% FA) at a flow rate of 0.3 μL/min using a 2-h gradient. The eluate was analyzed by HDMS under positive ion V-mode. The following parameters were set for data acquisition: collision energy: 5 volts for MS and ramp 15–40 volts for MSE; scan time: 0.6 second per scan. The HDMS machine was calibrated daily to ensure high accuracy (2.0 ppm for lock mass of m/z 785.8426).

For each PBM proteome digest sample, triplicate LC-nano-ESI-MSE datasets were acquired, The MSE data were then processed with ProteinLynx Global Server v2.3 (Waters) using default parameters. Based on the alternating low- and elevated-energy nature of MSE data, properties of each ion (mass-to-charge ratio, retention time, intensity, etc.) were determined, and a list of all precursor and product ions was produced. Specifically, the ion’s intensity was derived from the areas of both the chromatographic and mass spectrometric peaks. The precursor ion intensity threshold was set to above 1000 counts. The human protein database International Protein Index v3.83 (153,078 protein entries) was searched by using the following parameters: enzyme specificity: trypsin; number of missed cleavages permitted: 1; fixed modification: Carbamidomethyl C; variable modifications: Acetyl N-TERM, Deamidation N, Deamidation Q, and Oxidation M; mass tolerance for precursor ions: 15 ppm; mass tolerance for product ions: 30 ppm; minimum peptide matches per protein: 1; minimum fragment ion matches per protein: 7; minimum fragment ion matches per peptide: 3; false positive rate: limited to 4% per randomized database searching. For each PBM sample, only proteins identified at least twice in the triplicate LC-nano-ESI-MSE analyses were reported as truly present. Total ion counts of the three most intensely matched peptides were used to quantify each protein. With the standard ADH1 as references, protein quantification level was exported in femtomol and nanogram. Mean values from triplicate analyses were used to represent protein expression levels in each PBM sample.

In vitro dataset

The in vitro dataset was downloaded from the Human Proteome Map (HPM) project (http://www.humanproteomemap.org/), which was published in Nature in 2014 by Kim et al (Kim et al., 2014) with great effort and collaboration from 20 research institutions. The project was based on LC-MS/MS and utilized high resolution and high accuracy Fourier transform mass spectrometry (LTQ-Orbitrap Elite and LTQ-Orbitrap Velos). The authors obtained numerous comprehensive datasets from multiple organs, tissues and cell types including 17 adult tissues, 6 primary hematopoietic cells and 7 fetal tissues.

For the specific purpose of our study, we used only the dataset obtained from monocytes. The monocytes used in their study were harvested from cell culture; hence, they provided in vitro results and contributed a significant supplemental dataset for our study. The top 2000 monocyte-expressed genes from the in vitro results were selected for the integrated proteomics in our study.

Protein isoelectric point and molecular weight calculation

By integrating both in vivo and in vitro data, a combined gene list including 2237 unique genes was generated (Table S1). Based on the combined gene list, we first converted the gene IDs to protein IDs (UniProt Accession Number) to meet the requirement of the online tool, then calculated the protein isoelectric point (pI) and molecular weight (Mw) for all the corresponding proteins using the Compute pI/Mw tool provided by Expert Protein Analysis System (ExPASy, http://web.expasy.org/compute_pi/). It should be noted that one gene may correspond to multiple proteins due to the variety in translation and protein modification.

Functional annotation and gene enrichment analysis

The DAVID (Database for Annotation, Visualization and Integrated Discovery) Knowledgebase has agglomerated tens of millions of gene/protein identifiers from a variety of public genomic resources. These identifiers improve cross-referencing capability, particularly across NCBI and UniProt systems. More than 40 publicly available functional annotation sources have been comprehensively integrated and centralized by the DAVID gene clusters (Sherman et al., 2007; Huang et al., 2009). As mentioned above, the combined gene list including 2237 unique genes was imported into this online tool for functional annotation and enrichment analysis. Comprehensive functional information for all genes enriched in specific terms has been fully exposed. The results were classified as “functional annotation clustering,” “functional annotation chart” and “functional annotation table.”

Gene-disease association analysis

Although the in vivo and in vitro datasets were generated from different experimental materials as well as different instruments, they both stem from the same cell type: monocyte. By extracting the overlapped genes between in vivo and in vitro data, a core gene list including 541 unique genes was generated (Table S2). With high repeatability and reliability between the independent in vivo and in vitro experiments, the overlapped genes are most likely to reflect the biological functions related to human monocyte exactly and objectively. Therefore, we performed gene-disease association analysis on the basis of the core gene list. First, we downloaded the whole gene-disease association dataset for the DisGeNET database (http://www.disgenet.org/web/DisGeNET/menu/home) which contains about 50 thousand associations between different genes and diseases. The associations are all supported by experimental evidence or previous publications. Secondly, we mapped our core gene list to the entire knowledge base, so that all of the gene-disease associations corresponding to the provided genes could be extracted; one gene may be involved in several diseases, and one disease may involve numerous genes.

Pathway and network analysis

Network analysis can characterize the functional correlation and interaction topology between different proteins. Pathways/modules generated from network analysis may reflect biological processes more comprehensively and objectively than from a single protein or gene (Swa et al., 2014). Reactome (http://www.reactome.org/) is a free, open-source, curated and peer-reviewed pathway database which provides multiple bioinformatics tools for the visualization, interpretation and analysis of pathway knowledge. Because of these advantages, we imported the core gene list into Reactome online tools for pathway analysis. The core gene list was also imported into the plugin ClueGO in conjunction with CluePedia for network analysis and visualization (Bindea et al., 2009; Bindea et al., 2013). The topological information including interactions between genes and pathways are shown in tables and charts.

Data source

We uploaded all the result files regarding the data sets as supplemental files (Tables S1–S8 and Fig. S1). They include in vivo protein expression data (Table S8), the combined and overlapped gene list corresponding to both in vivo and in vitro proteomics data (Tables S1 and S2), basic characteristics including molecular weight and isoelectric point of human monocyte proteins (Table S3), results from multiple bioinformatics analyses (Tables S4–S7 and Fig. S1). Readers can review, download, and use the data from the online version paper directly.

Electronic supplementary material

Below is the link to the electronic supplementary material.

ACKNOWLEDGEMENTS

HW.D., Y.Z. and FY.D. conceived the idea and designed the study. Y.Z., W.Z. and FY.D. performed the experiments to obtain the in vivo data. Y.Z. and HW.D. wrote and revised the paper. All of the authors participated in data analysis. We thank our laboratory members for the collaboration and helpful discussion. We also thank Min-Sik Kim and his collaborators who published and released the in vitro dataset. This study was partially supported by Grants from NIH (Nos. P50AR055091, R01AR057049, R01AR059781, and R01MH104680), and Edward G. Schlieder Endowment. The study also benefited from the grant from the National Natural Science Foundation of China (No. 31371194).

ABBREVIATIONS

BP: Biological process; CC: Cellular component; DAVID: Database for annotation, visualization and integrated discovery; ESI: Electrospray ionization; GO: Gene ontology; HPM: Human proteome map; HPP: Human proteome project; LC: Liquid chromatograph; MF: Molecular function; MS: Mass spectrometry; Mw: Molecular weight; PBMC: Peripheral blood mononuclear cell; PBMs: Peripheral blood monocytes; pI: Isoelectric point.

COMPLIANCE WITH ETHICS GUIDELINES

Yong Zeng, Fei-Yan Deng, Wei Zhu, Lan Zhang, Hao He, Chao Xu, Qing Tian, Ji-Gang Zhang, Li-Shu Zhang, Hong-Gang Hu and Hong-Wen Deng declare that they have no conflict of interest. All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1975, as revised in 2000 (5). Informed consent was obtained from all patients for being included in the study.

REFERENCES

  1. Bindea G, Mlecnik B, Hackl H, Charoentong P, Tosolini M, Kirilovsky A, Fridman WH, Pages F, Trajanoski Z, Galon J. ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics. 2009;25:1091–1093. doi: 10.1093/bioinformatics/btp101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bindea G, Galon J, Mlecnik B. CluePedia Cytoscape plugin: pathway insights using integrated experimental and in silico data. Bioinformatics. 2013;29:661–663. doi: 10.1093/bioinformatics/btt019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Chimma P, Roussilhon C, Sratongno P, Ruangveerayuth R, Pattanapanyasat K, Perignon JL, Roberts DJ, Druilhe P. A distinct peripheral blood monocyte phenotype is associated with parasite inhibitory activity in acute uncomplicated Plasmodium falciparum malaria. PLoS Pathog. 2009;5:e1000631. doi: 10.1371/journal.ppat.1000631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. de Boer M, Singh V, Dekker J, Di Rocco M, Goldblatt D, Roos D. Prenatal diagnosis in two families with autosomal, p47(phox)-deficient chronic granulomatous disease due to a novel point mutation in NCF1. Prenat Diagn. 2002;22:235–240. doi: 10.1002/pd.296. [DOI] [PubMed] [Google Scholar]
  5. Deng FY, Liu YZ, Li LM, Jiang C, Wu S, Chen Y, Jiang H, Yang F, Xiong JX, Xiao P, et al. Proteomic analysis of circulating monocytes in Chinese premenopausal females with extremely discordant bone mineral density. Proteomics. 2008;8:4259–4272. doi: 10.1002/pmic.200700480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Deng FY, Lei SF, Zhang Y, Zhang YL, Zheng YP, Zhang LS, Pan R, Wang L, Tian Q, Shen H, et al. Peripheral blood monocyte-expressed ANXA2 gene is involved in pathogenesis of osteoporosis in humans. Mol Cell Proteomics. 2011;10(M111):011700. doi: 10.1074/mcp.M111.011700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Deng FY, Lei SF, Zhang Y, Zhang YL, Zheng YP, Zhang LS, Pan R, Wang L, Tian Q, Shen H, et al. Peripheral blood monocyte-expressed ANXA2 geneis involved in pathogenesis of osteoporosis in humans. Mol Cell Proteomics. 2011;10:M111-011700. doi: 10.1074/mcp.M111.011700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Deng FY, Zhu W, Zeng Y, Zhang JG, Yu N, Liu YZ, Liu YJ, Tian Q, Deng HW. Is GSN significant for hip BMD in female Caucasians? Bone. 2014;63:69–75. doi: 10.1016/j.bone.2014.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Dichamp I, Bourgeois A, Dirand C, Herbein G, Wendling D. Increased nuclear factor-kappaB activation in peripheral blood monocytes of patients with rheumatoid arthritis is mediated primarily by tumor necrosis factor-alpha. J Rheumatol. 2007;34:1976–1983. [PubMed] [Google Scholar]
  10. Emanuel VL, Mnuskina MM, Smirnov AV, Panina I, Rumiantsev A, Vasina E, Vasina LB (2013) [Annexin-5 as a biochemical marker of early vascular disorders under chronic disease of kidneys]. Klin Lab Diagn, 9–10 [PubMed]
  11. Genetos DC, Wong A, Weber TJ, Karin NJ, Yellowley CE. Impaired osteoblast differentiation in annexin A2- and -A5-deficient cells. PLoS ONE. 2014;9:e107482. doi: 10.1371/journal.pone.0107482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Grunberg W, Sacchetto R, Wijnberg I, Neijenhuis K, Mascarello F, Damiani E, Drogemuller C. Pseudomyotonia, a muscle function disorder associated with an inherited ATP2A1 (SERCA1) defect in a Dutch improved red and white cross-breed calf. Neuromuscul Disord. 2010;20:467–470. doi: 10.1016/j.nmd.2010.04.010. [DOI] [PubMed] [Google Scholar]
  13. Hrecka K, Hao C, Gierszewska M, Swanson SK, Kesik-Brodacka M, Srivastava S, Florens L, Washburn MP, Skowronski J. Vpx relieves inhibition of HIV-1 infection of macrophages mediated by the SAMHD1 protein. Nature. 2011;474:658–661. doi: 10.1038/nature10195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
  15. Jin M, Diaz PT, Bourgeois T, Eng C, Marsh CB, Wu HM. Two-dimensional gel proteome reference map of blood monocytes. Proteome Sci. 2006;4:16. doi: 10.1186/1477-5956-4-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Kakazu E, Ueno Y, Kondo Y, Inoue J, Ninomiya M, Kimura O, Wakui Y, Fukushima K, Tamai K, Shimosegawa T. Plasma L-cystine/L-glutamate imbalance increases tumor necrosis factor-alpha from CD14+ circulating monocytes in patients with advanced cirrhosis. PLoS ONE. 2011;6:e23402. doi: 10.1371/journal.pone.0023402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Kim MS, Pinto SM, Getnet D, Nirujogi RS, Manda SS, Chaerkady R, Madugundu AK, Kelkar DS, Isserlin R, Jain S, et al. A draft map of the human proteome. Nature. 2014;509:575–581. doi: 10.1038/nature13302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Koker MY. Novel human pathological mutations. Gene symbol: NCF2. Disease: chronic granulomatous disease. Hum Genet. 2010;127:113. [PubMed] [Google Scholar]
  19. Kraft-Terry S, Gerena Y, Wojna V, Plaud-Valentin M, Rodriguez Y, Ciborowski P, Mayo R, Skolasky R, Gendelman HE, Melendez LM. Proteomic analyses of monocytes obtained from Hispanic women with HIV-associated dementia show depressed antioxidants. Proteomics Clin Appl. 2010;4:706–714. doi: 10.1002/prca.201000010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Liu YZ, Dvornyk V, Lu Y, Shen H, Lappe JM, Recker RR, Deng HW. A novel pathophysiological mechanism for osteoporosis suggested by an in vivo gene expression study of circulating monocytes. J Biol Chem. 2005;280:29011–29016. doi: 10.1074/jbc.M501164200. [DOI] [PubMed] [Google Scholar]
  21. Merindol N, Berthoux L. Restriction factors in HIV-1 disease progression. Curr HIV Res. 2015;13:448–461. doi: 10.2174/1570162X13666150608104412. [DOI] [PubMed] [Google Scholar]
  22. Nabavizadeh SH, Anushiravani A. The prevalence of G6PD deficiency in blood transfusion recipients. Hematology. 2007;12:85–88. doi: 10.1080/10245330600938182. [DOI] [PubMed] [Google Scholar]
  23. Olsson LM, Nerstedt A, Lindqvist AK, Johansson ACM, Medstrand P, Olofsson P, Holmdahl R. Copy number variation of the gene NCF1 Is associated with rheumatoid arthritis. Antioxid Redox Signal. 2012;16:71–78. doi: 10.1089/ars.2011.4013. [DOI] [PubMed] [Google Scholar]
  24. Pandey S, Ranjan R, Pandey S, Mishra RM, Seth T, Saxena R. Effect of ANXA2 gene single nucleotide polymorphism (SNP) on the development of osteonecrosis in Indian sickle cell patient: a PCR-RFLP approach. Indian J Exp Biol. 2012;50:455–458. [PubMed] [Google Scholar]
  25. Sherman BT, da Huang W, Tan Q, Guo Y, Bour S, Liu D, Stephens R, Baseler MW, Lane HC, Lempicki RA. DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis. BMC Bioinformatics. 2007;8:426. doi: 10.1186/1471-2105-8-426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Shive CL, Jiang W, Anthony DD, Lederman MM. Soluble CD14 is a nonspecific marker of monocyte activation. AIDS. 2015;29:1263–1265. doi: 10.1097/QAD.0000000000000735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Swa HL, Shaik AA, Lim LH, Gunaratne J. Mass spectrometry based quantitative proteomics and integrative network analysis accentuates modulating roles of annexin-1 in mammary tumorigenesis. Proteomics. 2014;15:408–418. doi: 10.1002/pmic.201400175. [DOI] [PubMed] [Google Scholar]
  28. Tang J, Jiang C, Xiao X, Fang Z, Li L, Han L, Mei A, Feng Y, Guo Y, Li H, et al. Changes in red blood cell membrane structure in G6PD deficiency: an atomic force microscopy study. Clin Chim Acta. 2015;444:264–270. doi: 10.1016/j.cca.2015.02.042. [DOI] [PubMed] [Google Scholar]
  29. Vasconcelos Dde M, Beitler B, Martinez GA, Pereira J, Amigo Filho JU, Klautau GB, Lian YC, Della Negra M, Duarte AJ. CD18 deficiency evolving to megakaryocytic (M7) acute myeloid leukemia: case report. Blood Cells Mol Dis. 2014;53:180–184. doi: 10.1016/j.bcmd.2014.07.005. [DOI] [PubMed] [Google Scholar]
  30. Verkman AS, Ratelade J, Rossi A, Zhang H, Tradtrantip L. Aquaporin-4: orthogonal array assembly, CNS functions, and role in neuromyelitis optica. Acta Pharmacol Sin. 2011;32:702–710. doi: 10.1038/aps.2011.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Walter S, Doering A, Letiembre M, Liu Y, Hao W, Diem R, Bernreuther C, Glatzel M, Engelhardt B, Fassbender K. The LPS receptor, CD14, in experimental autoimmune encephalomyelitis and multiple sclerosis. Cell Physiol Biochem. 2006;17:167–172. doi: 10.1159/000092078. [DOI] [PubMed] [Google Scholar]
  32. Zhou Y, Deng HW, Shen H. Circulating monocytes: an appropriate model for bone-related study. Osteoporos Int. 2015;26:2561–2572. doi: 10.1007/s00198-015-3250-7. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Protein & Cell are provided here courtesy of Oxford University Press

RESOURCES